11/06/2021 21:16:12 - INFO - __main__ - Distributed environment: MULTI_GPU Backend: nccl Num processes: 16 Process index: 0 Local process index: 0 Device: cuda:0 Use FP16 precision: True /home/leandro/codeparrot-small/./ is already a clone of https://huggingface.co/lvwerra/codeparrot-small. Make sure you pull the latest changes with `repo.git_pull()`. 11/06/2021 21:16:13 - WARNING - huggingface_hub.repository - /home/leandro/codeparrot-small/./ is already a clone of https://huggingface.co/lvwerra/codeparrot-small. Make sure you pull the latest changes with `repo.git_pull()`. Revision `proud-haze-135` does not exist. Created and checked out branch `proud-haze-135`. 11/06/2021 21:16:13 - WARNING - huggingface_hub.repository - Revision `proud-haze-135` does not exist. Created and checked out branch `proud-haze-135`. 11/06/2021 21:16:13 - WARNING - huggingface_hub.repository - loading configuration file ./config.json Model config GPT2Config { "activation_function": "gelu_new", "architectures": [ "GPT2LMHeadModel" ], "attn_pdrop": 0.1, "bos_token_id": 50256, "embd_pdrop": 0.1, "eos_token_id": 50256, "initializer_range": 0.02, "layer_norm_epsilon": 1e-05, "model_type": "gpt2", "n_ctx": 1024, "n_embd": 768, "n_head": 12, "n_inner": null, "n_layer": 12, "n_positions": 1024, "reorder_and_upcast_attn": true, "resid_pdrop": 0.1, "scale_attn_by_inverse_layer_idx": true, "scale_attn_weights": true, "summary_activation": null, "summary_first_dropout": 0.1, "summary_proj_to_labels": true, "summary_type": "cls_index", "summary_use_proj": true, "task_specific_params": { "text-generation": { "do_sample": true, "max_length": 50 } }, "torch_dtype": "float32", "transformers_version": "4.12.2", "use_cache": true, "vocab_size": 32768 } loading weights file ./pytorch_model.bin All model checkpoint weights were used when initializing GPT2LMHeadModel. All the weights of GPT2LMHeadModel were initialized from the model checkpoint at ./. If your task is similar to the task the model of the checkpoint was trained on, you can already use GPT2LMHeadModel for predictions without further training. Didn't find file ./added_tokens.json. We won't load it. loading file ./vocab.json loading file ./merges.txt loading file ./tokenizer.json loading file None loading file ./special_tokens_map.json loading file ./tokenizer_config.json 11/06/2021 21:16:15 - INFO - datasets.data_files - Some files matched the pattern '*' at /home/leandro/codeparrot-clean-train but don't have valid data file extensions: [PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/0b/f3/0bf3cd1320065c163f47a112458dc107650e3e862094b703b76073bd0b68663d'), PosixPath('/home/leandro/codeparrot-clean-train/.git/hooks/pre-rebase.sample'), PosixPath('/home/leandro/codeparrot-clean-train/.git/hooks/fsmonitor-watchman.sample'), PosixPath('/home/leandro/codeparrot-clean-train/.git/hooks/pre-applypatch.sample'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/37/26/3726a0239b5cb7d0ef3ea36886c533d0becc7404217763015559edb546d53c94'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/e7/a9/e7a9ccbfe6bd92476f83eba205c47ed23732ace4c1bd7458d76d666ebbba3b1c'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/73/73/737327c2b47693e00050aa3410c5eb402c66211a79740ab57f1c763a1e557563'), PosixPath('/home/leandro/codeparrot-clean-train/.git/hooks/commit-msg.sample'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/2a/7e/2a7e50bbdb90d6c4cec534c3f1dc7ec0e6a0dada15c07cfd94615940c632ce02'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/5a/5f/5a5fbc19e0e76787f668ada7235203c10b0cbcdea0ecf8f873f8ec281cfe3494'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/74/31/7431977a8e3a6eb0348b821009495f85d9373c1f730f4a74b0db43326568f77d'), PosixPath('/home/leandro/codeparrot-clean-train/.git/logs/refs/heads/main'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/50/38/503872def2ac44733fbefc2602ab16224caca0896aa1eba045025ef2d60efcdc'), PosixPath('/home/leandro/codeparrot-clean-train/.git/refs/heads/main'), PosixPath('/home/leandro/codeparrot-clean-train/.git/hooks/update.sample'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/b6/ce/b6ce495492aedfc91b66efdfd214b2dfe44867c719d51590e1868e42f4e9b6dd'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/df/08/df0840d1657530c8fa9f82864be5999c515f54341d926c430a82528a6bb83740'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/2f/62/2f628d890bceee216f87edb3c45d2e384ee2501ce41a4c4169efaa3363bef1d2'), PosixPath('/home/leandro/codeparrot-clean-train/.git/HEAD'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/0f/7a/0f7a67cd83c1c069995f0f2510ebf818dcc71d9658f189de1231d2b7aac8883c'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/05/39/053944e1daead0b6de8e46ea2e0bc68b9247604c63a55d444ac3b9adb12e2cd2'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/dc/ac/dcacb03d8f43f7879c5eab4422644d7b3797b47dbb0c9c84d88cbc85822d8306'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/ac/e3/ace3ac440b380d604ab198cf8e838a2a375e7b0a6b5699ec74a8c79648f4bab8'), PosixPath('/home/leandro/codeparrot-clean-train/.git/description'), PosixPath('/home/leandro/codeparrot-clean-train/.git/refs/remotes/origin/HEAD'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/d4/9f/d49f1929644619c39cff677367ff2e18223a8046ec8f61e224954a10aa2ccf8f'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/2e/aa/2eaa21b832ed1496fb7f0b259666dbfc36ed483d81494d1e8705f9d601509c12'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/f1/a7/f1a7a250e1f6164a7fb602131ff54b69deb305258792f2358075403769d58fe5'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/d0/02/d0024828eece6d4d1c25cb4e539328be97fa28ce66a3b8d2374a117711cfd520'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/90/a5/90a573501de640c3e0e6f1b3508306febc96faf6061bb33c67894c168a1879c6'), PosixPath('/home/leandro/codeparrot-clean-train/.git/hooks/post-checkout'), PosixPath('/home/leandro/codeparrot-clean-train/.git/hooks/post-commit'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/5d/42/5d42ba9f195510757a3699005a7c43ddede4b598caf8a5f2f8c84d1125fa6324'), PosixPath('/home/leandro/codeparrot-clean-train/.git/logs/HEAD'), PosixPath('/home/leandro/codeparrot-clean-train/.git/packed-refs'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/5f/d1/5fd1bb56db810b65d1fd3866dc43d9c7b690c8f52b9ca8119b2a5f4c49d13eec'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/7c/0e/7c0ef87edb0e556939282c859c7c893a91b5b0f931394ca4cca4f4ec98a61951'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/ee/c1/eec1a9546aac0444a706c09f6aab67cd64403940657417e30212b7ff1e16665c'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/55/b6/55b6989a41ae296337356153e6081c61484d0b6734b6905683823e7317d01c42'), PosixPath('/home/leandro/codeparrot-clean-train/.git/hooks/pre-commit.sample'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/cc/58/cc58b22515c4fd7d891287ee717c2054290b20c17b1c34693fd8964ab730687b'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/b6/8a/b68a74f9784402dcb311f4db72a873035e47b98b185a1813ab2c1645cb7255a2'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/fb/84/fb84ca8000808f62718994e4b44e79d88a05b345e9638d9f6cf6c8a5472da01f'), PosixPath('/home/leandro/codeparrot-clean-train/.git/objects/pack/pack-12438cb8112d3b4104fefcb88d751872b5e0fd6e.pack'), PosixPath('/home/leandro/codeparrot-clean-train/.git/info/exclude'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/12/8d/128d56e09d9d741b2778d733e595838a50a5e82fdc9adbb0aa8645457716b97e'), PosixPath('/home/leandro/codeparrot-clean-train/.git/objects/pack/pack-12438cb8112d3b4104fefcb88d751872b5e0fd6e.idx'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/b4/83/b4836655e350f0796acd2b1a206e657c2808d9f136afae095e0b94a790c704e1'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/3e/f2/3ef240d0b394384803ae1bbe3b30974e11eb9b1b6ad4f49afc2ed0f7c9eae0d6'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/14/08/14089cad26037080ee900bede2fd42d5cac70738b2e77402b36681e1d2a521f6'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/86/0e/860eda34e90456533e9dd41a5c0fdb74c54dc8d9cf43d6c60b887b2c858be831'), PosixPath('/home/leandro/codeparrot-clean-train/.git/hooks/post-update.sample'), PosixPath('/home/leandro/codeparrot-clean-train/.git/hooks/pre-push'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/15/ac/15ac016e4cd702bb184457cbf5674d71b632fc34c29611ba4de549b85c67acfb'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/a4/6b/a46b5c08d39691524b46fadf78eab5efefa29978edfee799ec3587d928dc1302'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/fa/e6/fae6b44a24c1c35f15053a19a6b2b2af5cc9fb8bdaf0da409068a2a1f333f28e'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/17/5e/175e7375d6f65993071aa653bdd4e8b117cc02d1d2353cd7bcdbaaf7fe8b3c9c'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/ac/36/ac36d12d37c1dc8ee8d3b8f0eae93966ae73482ef725615bb1a715802ddd4dd4'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/17/96/1796f12729d0407cc57500c9c87959e0e7becd729f37374702868ed8765015f4'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/67/f1/67f1ff0d590fbf4aa9afa161c290fe9be17538d4b723278bb21fd6408b0e6a3e'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/55/c9/55c9c0b2f26de96e0311ee43e8eaa78ad1af387d0c59a26f22c5ebd507dda321'), PosixPath('/home/leandro/codeparrot-clean-train/.git/hooks/pre-push.sample'), PosixPath('/home/leandro/codeparrot-clean-train/.git/config'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/9f/7e/9f7e18a3980d4b3d5ed9469ab7a2d67b608e8aa6fff38d876f86719c8f2a7a82'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/e6/48/e6484a578778beccab26c8549608ec13970e6bcdb9541cdccad20f4d984e8181'), PosixPath('/home/leandro/codeparrot-clean-train/.git/index'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/54/60/5460223b92bb118814a7777a939f4005b7426a7e4a068c193c10d1b86eeb862b'), PosixPath('/home/leandro/codeparrot-clean-train/.git/hooks/prepare-commit-msg.sample'), PosixPath('/home/leandro/codeparrot-clean-train/.git/logs/refs/remotes/origin/HEAD'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/ae/45/ae45741df674456bc63bad91374d2ba5ef988d33d6e2a322ef0a5ac8af040371'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/60/41/604177fe5560efd99d93091fadab6293afe7cd7d12f81638c301de1c937c1583'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/ef/e1/efe1759837b74b5b5ed3df1a09d4c880f9ad20413d958f79d35bf1cb6a2a09d4'), PosixPath('/home/leandro/codeparrot-clean-train/.git/hooks/pre-receive.sample'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/32/be/32beb30e381ff02fb71854b5534306f395ef00f51f02b62da1f027c8c7fab26f'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/9b/1b/9b1b8e52b9262f03f1719d3950dc8dfa2b9719dc2e273603023f6f329c1b2068'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/56/80/56803c607a19ccb576c90bdb10a02cfa7b3affc67dd150fa41b00cc22213b174'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/4e/39/4e392fcaae564652d234d07b4f71eeed90efe51b1b714831e39d77f3e537d3df'), PosixPath('/home/leandro/codeparrot-clean-train/.git/hooks/applypatch-msg.sample'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/cd/33/cd339656799518495d23aedf1503459be6d3086e22672e80edab8403d12ded1c'), PosixPath('/home/leandro/codeparrot-clean-train/.git/lfs/objects/f1/62/f162b06b5dca01aa85ef9a675d396c0fbab1d009b5bee1c5b7ea6b415c6f12a4'), PosixPath('/home/leandro/codeparrot-clean-train/.git/hooks/post-merge')] Resolving data files: 100%|███████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 53/53 [00:00<00:00, 196030.08it/s] 11/06/2021 21:16:15 - WARNING - datasets.builder - Using custom data configuration codeparrot-clean-train-e839c6c1585da466 11/06/2021 21:16:15 - INFO - datasets.data_files - Some files matched the pattern '*' at /home/leandro/codeparrot-clean-valid but don't have valid data file extensions: [PosixPath('/home/leandro/codeparrot-clean-valid/.git/objects/60/0dc2964cf471fa4aac706659009777cf176497'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/hooks/commit-msg.sample'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/refs/remotes/origin/HEAD'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/objects/95/7b2579c6ef20995a09efd9a17f8fd90606f5ed'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/hooks/pre-push.sample'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/refs/heads/main'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/config'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/info/exclude'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/hooks/applypatch-msg.sample'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/hooks/pre-commit.sample'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/hooks/update.sample'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/logs/refs/heads/main'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/HEAD'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/index'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/objects/25/747fcf966f2b7b3a2f4149130bff69ebe83718'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/objects/15/4f5f07c68026fb069c4bdfe3966893737035f4'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/objects/6d/d1188965fcd7feab0efc3506668a615805e13f'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/description'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/hooks/post-merge'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/hooks/prepare-commit-msg.sample'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/objects/d9/cd7ad451bcd8a388471b341a961d0e6e6ff558'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/hooks/pre-rebase.sample'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/objects/55/36bbd68dd8f283092b22eb77a051175c1b727a'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/lfs/objects/7f/8c/7f8c20a737c9084779bcdb853325ad4774d0db52c74aa2a63fd658d6787eb35b'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/hooks/fsmonitor-watchman.sample'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/hooks/pre-applypatch.sample'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/hooks/post-update.sample'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/objects/09/e6a70d1aadc53ed29b9890332f184f89d0a39b'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/logs/refs/remotes/origin/HEAD'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/objects/5e/d5325308cb9a07b2c5807dad51120c9a75b6db'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/hooks/pre-push'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/hooks/post-checkout'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/hooks/post-commit'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/packed-refs'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/logs/HEAD'), PosixPath('/home/leandro/codeparrot-clean-valid/.git/hooks/pre-receive.sample')] 11/06/2021 21:16:15 - WARNING - datasets.builder - Using custom data configuration codeparrot-clean-valid-ced470bd23403144 Token indices sequence length is longer than the specified maximum sequence length for this model (1489 > 1024). Running this sequence through the model will result in indexing errors 11/06/2021 21:16:43 - INFO - __main__ - Step 1: {'lr': 0.0, 'samples': 192, 'steps': 0, 'loss/train': 10.55798625946045} 11/06/2021 21:16:43 - INFO - root - Reducer buckets have been rebuilt in this iteration. 11/06/2021 21:16:43 - INFO - __main__ - Step 2: {'lr': 2.5e-07, 'samples': 384, 'steps': 1, 'loss/train': 10.535750389099121} 11/06/2021 21:16:43 - INFO - __main__ - Step 3: {'lr': 5e-07, 'samples': 576, 'steps': 2, 'loss/train': 10.530282974243164} 11/06/2021 21:16:44 - INFO - __main__ - Step 4: {'lr': 7.5e-07, 'samples': 768, 'steps': 3, 'loss/train': 10.527787208557129} 11/06/2021 21:16:45 - INFO - __main__ - Step 5: {'lr': 1e-06, 'samples': 960, 'steps': 4, 'loss/train': 10.491048812866211} 11/06/2021 21:16:46 - INFO - __main__ - Step 6: {'lr': 1.25e-06, 'samples': 1152, 'steps': 5, 'loss/train': 10.409588813781738} 11/06/2021 21:16:46 - INFO - __main__ - Step 7: {'lr': 1.5e-06, 'samples': 1344, 'steps': 6, 'loss/train': 10.350728034973145} 11/06/2021 21:16:46 - INFO - __main__ - Step 8: {'lr': 1.75e-06, 'samples': 1536, 'steps': 7, 'loss/train': 10.252238273620605} 11/06/2021 21:16:47 - INFO - __main__ - Step 9: {'lr': 2e-06, 'samples': 1728, 'steps': 8, 'loss/train': 10.193534851074219} 11/06/2021 21:16:47 - INFO - __main__ - Step 10: {'lr': 2.25e-06, 'samples': 1920, 'steps': 9, 'loss/train': 9.953790664672852} 11/06/2021 21:16:48 - INFO - __main__ - Step 11: {'lr': 2.5e-06, 'samples': 2112, 'steps': 10, 'loss/train': 10.194929122924805} 11/06/2021 21:16:49 - INFO - __main__ - Step 12: {'lr': 2.75e-06, 'samples': 2304, 'steps': 11, 'loss/train': 9.89802074432373} 11/06/2021 21:16:49 - INFO - __main__ - Step 13: {'lr': 3e-06, 'samples': 2496, 'steps': 12, 'loss/train': 9.843729972839355} 11/06/2021 21:16:49 - INFO - __main__ - Step 14: {'lr': 3.25e-06, 'samples': 2688, 'steps': 13, 'loss/train': 9.845044136047363} 11/06/2021 21:16:50 - INFO - __main__ - Step 15: {'lr': 3.5e-06, 'samples': 2880, 'steps': 14, 'loss/train': 9.869210243225098} 11/06/2021 21:16:51 - INFO - __main__ - Step 16: {'lr': 3.75e-06, 'samples': 3072, 'steps': 15, 'loss/train': 9.587459564208984} 11/06/2021 21:16:51 - INFO - __main__ - Step 17: {'lr': 4e-06, 'samples': 3264, 'steps': 16, 'loss/train': 9.667202949523926} 11/06/2021 21:16:51 - INFO - __main__ - Step 18: {'lr': 4.250000000000001e-06, 'samples': 3456, 'steps': 17, 'loss/train': 9.495230674743652} 11/06/2021 21:16:52 - INFO - __main__ - Step 19: {'lr': 4.5e-06, 'samples': 3648, 'steps': 18, 'loss/train': 9.640376091003418} 11/06/2021 21:16:52 - INFO - __main__ - Step 20: {'lr': 4.75e-06, 'samples': 3840, 'steps': 19, 'loss/train': 9.428448677062988} 11/06/2021 21:16:52 - INFO - __main__ - Step 21: {'lr': 5e-06, 'samples': 4032, 'steps': 20, 'loss/train': 9.341026306152344} 11/06/2021 21:16:54 - INFO - __main__ - Step 22: {'lr': 5.2500000000000006e-06, 'samples': 4224, 'steps': 21, 'loss/train': 9.372577667236328} 11/06/2021 21:16:54 - INFO - __main__ - Step 23: {'lr': 5.5e-06, 'samples': 4416, 'steps': 22, 'loss/train': 8.967851638793945} 11/06/2021 21:16:54 - INFO - __main__ - Step 24: {'lr': 5.75e-06, 'samples': 4608, 'steps': 23, 'loss/train': 8.74506950378418} 11/06/2021 21:16:55 - INFO - __main__ - Step 25: {'lr': 6e-06, 'samples': 4800, 'steps': 24, 'loss/train': 9.786674499511719} 11/06/2021 21:16:55 - INFO - __main__ - Step 26: {'lr': 6.25e-06, 'samples': 4992, 'steps': 25, 'loss/train': 9.504456520080566} 11/06/2021 21:16:56 - INFO - __main__ - Step 27: {'lr': 6.5e-06, 'samples': 5184, 'steps': 26, 'loss/train': 9.166744232177734} 11/06/2021 21:16:56 - INFO - __main__ - Step 28: {'lr': 6.75e-06, 'samples': 5376, 'steps': 27, 'loss/train': 8.682860374450684} 11/06/2021 21:16:57 - INFO - __main__ - Step 29: {'lr': 7e-06, 'samples': 5568, 'steps': 28, 'loss/train': 8.596318244934082} 11/06/2021 21:16:57 - INFO - __main__ - Step 30: {'lr': 7.250000000000001e-06, 'samples': 5760, 'steps': 29, 'loss/train': 9.048979759216309} 11/06/2021 21:16:57 - INFO - __main__ - Step 31: {'lr': 7.5e-06, 'samples': 5952, 'steps': 30, 'loss/train': 9.320890426635742} 11/06/2021 21:16:58 - INFO - __main__ - Step 32: {'lr': 7.75e-06, 'samples': 6144, 'steps': 31, 'loss/train': 8.952228546142578} 11/06/2021 21:16:59 - INFO - __main__ - Step 33: {'lr': 8e-06, 'samples': 6336, 'steps': 32, 'loss/train': 8.751225471496582} 11/06/2021 21:16:59 - INFO - __main__ - Step 34: {'lr': 8.25e-06, 'samples': 6528, 'steps': 33, 'loss/train': 9.156981468200684} 11/06/2021 21:17:00 - INFO - __main__ - Step 35: {'lr': 8.500000000000002e-06, 'samples': 6720, 'steps': 34, 'loss/train': 8.837956428527832} 11/06/2021 21:17:00 - INFO - __main__ - Step 36: {'lr': 8.750000000000001e-06, 'samples': 6912, 'steps': 35, 'loss/train': 8.935142517089844} 11/06/2021 21:17:01 - INFO - __main__ - Step 37: {'lr': 9e-06, 'samples': 7104, 'steps': 36, 'loss/train': 9.019933700561523} 11/06/2021 21:17:02 - INFO - __main__ - Step 38: {'lr': 9.25e-06, 'samples': 7296, 'steps': 37, 'loss/train': 8.594483375549316} 11/06/2021 21:17:02 - INFO - __main__ - Step 39: {'lr': 9.5e-06, 'samples': 7488, 'steps': 38, 'loss/train': 9.565625190734863} 11/06/2021 21:17:02 - INFO - __main__ - Step 40: {'lr': 9.75e-06, 'samples': 7680, 'steps': 39, 'loss/train': 9.195219039916992} 11/06/2021 21:17:03 - INFO - __main__ - Step 41: {'lr': 1e-05, 'samples': 7872, 'steps': 40, 'loss/train': 9.008049011230469} 11/06/2021 21:17:04 - INFO - __main__ - Step 42: {'lr': 1.025e-05, 'samples': 8064, 'steps': 41, 'loss/train': 9.54212760925293} 11/06/2021 21:17:04 - INFO - __main__ - Step 43: {'lr': 1.0500000000000001e-05, 'samples': 8256, 'steps': 42, 'loss/train': 9.074606895446777} 11/06/2021 21:17:04 - INFO - __main__ - Step 44: {'lr': 1.0749999999999999e-05, 'samples': 8448, 'steps': 43, 'loss/train': 9.575305938720703} 11/06/2021 21:17:05 - INFO - __main__ - Step 45: {'lr': 1.1e-05, 'samples': 8640, 'steps': 44, 'loss/train': 9.862631797790527} 11/06/2021 21:17:05 - INFO - __main__ - Step 46: {'lr': 1.1249999999999999e-05, 'samples': 8832, 'steps': 45, 'loss/train': 8.833338737487793} 11/06/2021 21:17:06 - INFO - __main__ - Step 47: {'lr': 1.15e-05, 'samples': 9024, 'steps': 46, 'loss/train': 8.830769538879395} 11/06/2021 21:17:07 - INFO - __main__ - Step 48: {'lr': 1.1750000000000001e-05, 'samples': 9216, 'steps': 47, 'loss/train': 8.828520774841309} 11/06/2021 21:17:07 - INFO - __main__ - Step 49: {'lr': 1.2e-05, 'samples': 9408, 'steps': 48, 'loss/train': 8.692312240600586} 11/06/2021 21:17:07 - INFO - __main__ - Step 50: {'lr': 1.2250000000000001e-05, 'samples': 9600, 'steps': 49, 'loss/train': 8.698874473571777} 11/06/2021 21:17:08 - INFO - __main__ - Step 51: {'lr': 1.25e-05, 'samples': 9792, 'steps': 50, 'loss/train': 8.904641151428223} 11/06/2021 21:17:09 - INFO - __main__ - Step 52: {'lr': 1.275e-05, 'samples': 9984, 'steps': 51, 'loss/train': 8.66476821899414} 11/06/2021 21:17:09 - INFO - __main__ - Step 53: {'lr': 1.3e-05, 'samples': 10176, 'steps': 52, 'loss/train': 8.561541557312012} 11/06/2021 21:17:10 - INFO - __main__ - Step 54: {'lr': 1.325e-05, 'samples': 10368, 'steps': 53, 'loss/train': 8.71354866027832} 11/06/2021 21:17:10 - INFO - __main__ - Step 55: {'lr': 1.35e-05, 'samples': 10560, 'steps': 54, 'loss/train': 8.084650993347168} 11/06/2021 21:17:10 - INFO - __main__ - Step 56: {'lr': 1.375e-05, 'samples': 10752, 'steps': 55, 'loss/train': 8.701323509216309} 11/06/2021 21:17:11 - INFO - __main__ - Step 57: {'lr': 1.4e-05, 'samples': 10944, 'steps': 56, 'loss/train': 8.886054039001465} 11/06/2021 21:17:12 - INFO - __main__ - Step 58: {'lr': 1.425e-05, 'samples': 11136, 'steps': 57, 'loss/train': 8.962408065795898} 11/06/2021 21:17:12 - INFO - __main__ - Step 59: {'lr': 1.4500000000000002e-05, 'samples': 11328, 'steps': 58, 'loss/train': 8.731340408325195} 11/06/2021 21:17:13 - INFO - __main__ - Step 60: {'lr': 1.475e-05, 'samples': 11520, 'steps': 59, 'loss/train': 8.48225212097168} 11/06/2021 21:17:13 - INFO - __main__ - Step 61: {'lr': 1.5e-05, 'samples': 11712, 'steps': 60, 'loss/train': 8.860502243041992} 11/06/2021 21:17:14 - INFO - __main__ - Step 62: {'lr': 1.525e-05, 'samples': 11904, 'steps': 61, 'loss/train': 8.848859786987305} 11/06/2021 21:17:15 - INFO - __main__ - Step 63: {'lr': 1.55e-05, 'samples': 12096, 'steps': 62, 'loss/train': 8.20711612701416} 11/06/2021 21:17:15 - INFO - __main__ - Step 64: {'lr': 1.575e-05, 'samples': 12288, 'steps': 63, 'loss/train': 10.296394348144531} 11/06/2021 21:17:15 - INFO - __main__ - Step 65: {'lr': 1.6e-05, 'samples': 12480, 'steps': 64, 'loss/train': 7.71311092376709} 11/06/2021 21:17:16 - INFO - __main__ - Step 66: {'lr': 1.6250000000000002e-05, 'samples': 12672, 'steps': 65, 'loss/train': 8.466562271118164} 11/06/2021 21:17:16 - INFO - __main__ - Step 67: {'lr': 1.65e-05, 'samples': 12864, 'steps': 66, 'loss/train': 8.35257339477539} 11/06/2021 21:17:17 - INFO - __main__ - Step 68: {'lr': 1.675e-05, 'samples': 13056, 'steps': 67, 'loss/train': 8.386396408081055} 11/06/2021 21:17:18 - INFO - __main__ - Step 69: {'lr': 1.7000000000000003e-05, 'samples': 13248, 'steps': 68, 'loss/train': 8.12002944946289} 11/06/2021 21:17:18 - INFO - __main__ - Step 70: {'lr': 1.7250000000000003e-05, 'samples': 13440, 'steps': 69, 'loss/train': 8.70462417602539} 11/06/2021 21:17:18 - INFO - __main__ - Step 71: {'lr': 1.7500000000000002e-05, 'samples': 13632, 'steps': 70, 'loss/train': 8.239697456359863} 11/06/2021 21:17:19 - INFO - __main__ - Step 72: {'lr': 1.7749999999999998e-05, 'samples': 13824, 'steps': 71, 'loss/train': 7.610179424285889} 11/06/2021 21:17:20 - INFO - __main__ - Step 73: {'lr': 1.8e-05, 'samples': 14016, 'steps': 72, 'loss/train': 7.8869452476501465} 11/06/2021 21:17:20 - INFO - __main__ - Step 74: {'lr': 1.825e-05, 'samples': 14208, 'steps': 73, 'loss/train': 7.692283630371094} 11/06/2021 21:17:21 - INFO - __main__ - Step 75: {'lr': 1.85e-05, 'samples': 14400, 'steps': 74, 'loss/train': 8.208292007446289} 11/06/2021 21:17:21 - INFO - __main__ - Step 76: {'lr': 1.875e-05, 'samples': 14592, 'steps': 75, 'loss/train': 7.97852897644043} 11/06/2021 21:17:21 - INFO - __main__ - Step 77: {'lr': 1.9e-05, 'samples': 14784, 'steps': 76, 'loss/train': 8.777739524841309} 11/06/2021 21:17:22 - INFO - __main__ - Step 78: {'lr': 1.925e-05, 'samples': 14976, 'steps': 77, 'loss/train': 7.68981409072876} 11/06/2021 21:17:23 - INFO - __main__ - Step 79: {'lr': 1.95e-05, 'samples': 15168, 'steps': 78, 'loss/train': 7.656458854675293} 11/06/2021 21:17:23 - INFO - __main__ - Step 80: {'lr': 1.975e-05, 'samples': 15360, 'steps': 79, 'loss/train': 8.30695915222168} 11/06/2021 21:17:23 - INFO - __main__ - Step 81: {'lr': 2e-05, 'samples': 15552, 'steps': 80, 'loss/train': 7.897383689880371} 11/06/2021 21:17:25 - INFO - __main__ - Step 83: {'lr': 2.05e-05, 'samples': 15936, 'steps': 82, 'loss/train': 8.247127532958984} 11/06/2021 21:17:25 - INFO - __main__ - Step 84: {'lr': 2.0750000000000003e-05, 'samples': 16128, 'steps': 83, 'loss/train': 8.18776798248291} 11/06/2021 21:17:25 - INFO - __main__ - Step 85: {'lr': 2.1000000000000002e-05, 'samples': 16320, 'steps': 84, 'loss/train': 7.7213358879089355} 11/06/2021 21:17:26 - INFO - __main__ - Step 86: {'lr': 2.125e-05, 'samples': 16512, 'steps': 85, 'loss/train': 7.880347728729248} 11/06/2021 21:17:27 - INFO - __main__ - Step 88: {'lr': 2.175e-05, 'samples': 16896, 'steps': 87, 'loss/train': 7.1267242431640625} 11/06/2021 21:17:27 - INFO - __main__ - Step 89: {'lr': 2.2e-05, 'samples': 17088, 'steps': 88, 'loss/train': 7.992337703704834} 11/06/2021 21:17:29 - INFO - __main__ - Step 92: {'lr': 2.275e-05, 'samples': 17664, 'steps': 91, 'loss/train': 7.861474990844727}650268554688} 11/06/2021 21:17:29 - INFO - __main__ - Step 93: {'lr': 2.3e-05, 'samples': 17856, 'steps': 92, 'loss/train': 7.493942737579346} 11/06/2021 21:17:31 - INFO - __main__ - Step 98: {'lr': 2.425e-05, 'samples': 18816, 'steps': 97, 'loss/train': 7.735995292663574}650268554688} 11/06/2021 21:17:31 - INFO - __main__ - Step 98: {'lr': 2.425e-05, 'samples': 18816, 'steps': 97, 'loss/train': 7.735995292663574}650268554688} 11/06/2021 21:17:35 - INFO - __main__ - Step 104: {'lr': 2.575e-05, 'samples': 19968, 'steps': 103, 'loss/train': 7.325517654418945}0268554688} 11/06/2021 21:17:38 - INFO - __main__ - Step 110: {'lr': 2.725e-05, 'samples': 21120, 'steps': 109, 'loss/train': 8.008770942687988}0268554688} 11/06/2021 21:17:38 - INFO - __main__ - Step 110: {'lr': 2.725e-05, 'samples': 21120, 'steps': 109, 'loss/train': 8.008770942687988}0268554688} 11/06/2021 21:17:41 - INFO - __main__ - Step 116: {'lr': 2.875e-05, 'samples': 22272, 'steps': 115, 'loss/train': 7.741561412811279}0268554688} 11/06/2021 21:17:43 - INFO - __main__ - Step 122: {'lr': 3.025e-05, 'samples': 23424, 'steps': 121, 'loss/train': 6.9648213386535645}268554688} 11/06/2021 21:17:43 - INFO - __main__ - Step 122: {'lr': 3.025e-05, 'samples': 23424, 'steps': 121, 'loss/train': 6.9648213386535645}268554688} 11/06/2021 21:17:47 - INFO - __main__ - Step 128: {'lr': 3.175e-05, 'samples': 24576, 'steps': 127, 'loss/train': 7.172966003417969}}268554688} 11/06/2021 21:17:49 - INFO - __main__ - Step 134: {'lr': 3.325e-05, 'samples': 25728, 'steps': 133, 'loss/train': 7.035802841186523}}268554688} 11/06/2021 21:17:49 - INFO - __main__ - Step 134: {'lr': 3.325e-05, 'samples': 25728, 'steps': 133, 'loss/train': 7.035802841186523}}268554688} 11/06/2021 21:17:53 - INFO - __main__ - Step 140: {'lr': 3.4750000000000004e-05, 'samples': 26880, 'steps': 139, 'loss/train': 6.972182273864746} 11/06/2021 21:17:56 - INFO - __main__ - Step 146: {'lr': 3.625e-05, 'samples': 28032, 'steps': 145, 'loss/train': 7.03615140914917}2182273864746} 11/06/2021 21:17:56 - INFO - __main__ - Step 146: {'lr': 3.625e-05, 'samples': 28032, 'steps': 145, 'loss/train': 7.03615140914917}2182273864746} 11/06/2021 21:17:59 - INFO - __main__ - Step 152: {'lr': 3.775e-05, 'samples': 29184, 'steps': 151, 'loss/train': 6.523715019226074}182273864746} 11/06/2021 21:18:01 - INFO - __main__ - Step 158: {'lr': 3.925e-05, 'samples': 30336, 'steps': 157, 'loss/train': 6.675495624542236}182273864746} 11/06/2021 21:18:04 - INFO - __main__ - Step 164: {'lr': 4.075e-05, 'samples': 31488, 'steps': 163, 'loss/train': 6.836953163146973}182273864746} 11/06/2021 21:18:04 - INFO - __main__ - Step 164: {'lr': 4.075e-05, 'samples': 31488, 'steps': 163, 'loss/train': 6.836953163146973}182273864746} 11/06/2021 21:18:08 - INFO - __main__ - Step 170: {'lr': 4.2250000000000004e-05, 'samples': 32640, 'steps': 169, 'loss/train': 7.073266506195068} 11/06/2021 21:18:08 - INFO - __main__ - Step 170: {'lr': 4.2250000000000004e-05, 'samples': 32640, 'steps': 169, 'loss/train': 7.073266506195068} 11/06/2021 21:18:11 - INFO - __main__ - Step 176: {'lr': 4.375e-05, 'samples': 33792, 'steps': 175, 'loss/train': 7.043330192565918}266506195068} 11/06/2021 21:18:13 - INFO - __main__ - Step 182: {'lr': 4.525e-05, 'samples': 34944, 'steps': 181, 'loss/train': 6.373216152191162}266506195068} 11/06/2021 21:18:13 - INFO - __main__ - Step 182: {'lr': 4.525e-05, 'samples': 34944, 'steps': 181, 'loss/train': 6.373216152191162}266506195068} 11/06/2021 21:18:16 - INFO - __main__ - Step 188: {'lr': 4.675e-05, 'samples': 36096, 'steps': 187, 'loss/train': 5.526083469390869}266506195068} 11/06/2021 21:18:19 - INFO - __main__ - Step 194: {'lr': 4.825e-05, 'samples': 37248, 'steps': 193, 'loss/train': 6.355683326721191}266506195068} 11/06/2021 21:18:22 - INFO - __main__ - Step 200: {'lr': 4.975e-05, 'samples': 38400, 'steps': 199, 'loss/train': 6.113000392913818}266506195068} 11/06/2021 21:18:22 - INFO - __main__ - Step 200: {'lr': 4.975e-05, 'samples': 38400, 'steps': 199, 'loss/train': 6.113000392913818}266506195068} 11/06/2021 21:18:25 - INFO - __main__ - Step 206: {'lr': 5.125e-05, 'samples': 39552, 'steps': 205, 'loss/train': 5.817543029785156}266506195068} 11/06/2021 21:18:28 - INFO - __main__ - Step 212: {'lr': 5.275e-05, 'samples': 40704, 'steps': 211, 'loss/train': 5.677988529205322}266506195068} 11/06/2021 21:18:28 - INFO - __main__ - Step 212: {'lr': 5.275e-05, 'samples': 40704, 'steps': 211, 'loss/train': 5.677988529205322}266506195068} 11/06/2021 21:18:31 - INFO - __main__ - Step 218: {'lr': 5.4250000000000004e-05, 'samples': 41856, 'steps': 217, 'loss/train': 6.029814720153809} 11/06/2021 21:18:34 - INFO - __main__ - Step 224: {'lr': 5.575e-05, 'samples': 43008, 'steps': 223, 'loss/train': 6.467023849487305}814720153809} 11/06/2021 21:18:34 - INFO - __main__ - Step 224: {'lr': 5.575e-05, 'samples': 43008, 'steps': 223, 'loss/train': 6.467023849487305}814720153809} 11/06/2021 21:18:37 - INFO - __main__ - Step 230: {'lr': 5.725e-05, 'samples': 44160, 'steps': 229, 'loss/train': 5.728048324584961}814720153809} 11/06/2021 21:18:39 - INFO - __main__ - Step 236: {'lr': 5.875e-05, 'samples': 45312, 'steps': 235, 'loss/train': 6.2582688331604}1}814720153809} 11/06/2021 21:18:42 - INFO - __main__ - Step 242: {'lr': 6.025e-05, 'samples': 46464, 'steps': 241, 'loss/train': 6.203149318695068}814720153809} 11/06/2021 21:18:42 - INFO - __main__ - Step 242: {'lr': 6.025e-05, 'samples': 46464, 'steps': 241, 'loss/train': 6.203149318695068}814720153809} 11/06/2021 21:18:45 - INFO - __main__ - Step 248: {'lr': 6.175e-05, 'samples': 47616, 'steps': 247, 'loss/train': 5.754123210906982}814720153809} 11/06/2021 21:18:45 - INFO - __main__ - Step 248: {'lr': 6.175e-05, 'samples': 47616, 'steps': 247, 'loss/train': 5.754123210906982}814720153809} 11/06/2021 21:18:49 - INFO - __main__ - Step 254: {'lr': 6.325e-05, 'samples': 48768, 'steps': 253, 'loss/train': 5.7925004959106445}14720153809} 11/06/2021 21:18:52 - INFO - __main__ - Step 260: {'lr': 6.475e-05, 'samples': 49920, 'steps': 259, 'loss/train': 3.9729881286621094}14720153809} 11/06/2021 21:18:55 - INFO - __main__ - Step 266: {'lr': 6.625000000000001e-05, 'samples': 51072, 'steps': 265, 'loss/train': 6.112322807312012}} 11/06/2021 21:18:55 - INFO - __main__ - Step 266: {'lr': 6.625000000000001e-05, 'samples': 51072, 'steps': 265, 'loss/train': 6.112322807312012}} 11/06/2021 21:18:57 - INFO - __main__ - Step 272: {'lr': 6.775000000000001e-05, 'samples': 52224, 'steps': 271, 'loss/train': 5.43068790435791}}} 11/06/2021 21:19:00 - INFO - __main__ - Step 278: {'lr': 6.925e-05, 'samples': 53376, 'steps': 277, 'loss/train': 5.7302045822143555}790435791}}} 11/06/2021 21:19:00 - INFO - __main__ - Step 278: {'lr': 6.925e-05, 'samples': 53376, 'steps': 277, 'loss/train': 5.7302045822143555}790435791}}} 11/06/2021 21:19:03 - INFO - __main__ - Step 284: {'lr': 7.075e-05, 'samples': 54528, 'steps': 283, 'loss/train': 5.200248718261719}}790435791}}} 11/06/2021 21:19:06 - INFO - __main__ - Step 290: {'lr': 7.225e-05, 'samples': 55680, 'steps': 289, 'loss/train': 6.18521785736084}}}790435791}}} 11/06/2021 21:19:06 - INFO - __main__ - Step 290: {'lr': 7.225e-05, 'samples': 55680, 'steps': 289, 'loss/train': 6.18521785736084}}}790435791}}} 11/06/2021 21:19:09 - INFO - __main__ - Step 296: {'lr': 7.375e-05, 'samples': 56832, 'steps': 295, 'loss/train': 5.397673606872559}}790435791}}} 11/06/2021 21:19:12 - INFO - __main__ - Step 302: {'lr': 7.525e-05, 'samples': 57984, 'steps': 301, 'loss/train': 5.377347469329834}}790435791}}} 11/06/2021 21:19:15 - INFO - __main__ - Step 307: {'lr': 7.65e-05, 'samples': 58944, 'steps': 306, 'loss/train': 5.420361518859863}}}790435791}}} 11/06/2021 21:19:15 - INFO - __main__ - Step 307: {'lr': 7.65e-05, 'samples': 58944, 'steps': 306, 'loss/train': 5.420361518859863}}}790435791}}} 11/06/2021 21:19:18 - INFO - __main__ - Step 314: {'lr': 7.825e-05, 'samples': 60288, 'steps': 313, 'loss/train': 5.498117923736572}}790435791}}} 11/06/2021 21:19:21 - INFO - __main__ - Step 320: {'lr': 7.975e-05, 'samples': 61440, 'steps': 319, 'loss/train': 5.5839314460754395}790435791}}} 11/06/2021 21:19:21 - INFO - __main__ - Step 320: {'lr': 7.975e-05, 'samples': 61440, 'steps': 319, 'loss/train': 5.5839314460754395}790435791}}} 11/06/2021 21:19:24 - INFO - __main__ - Step 327: {'lr': 8.15e-05, 'samples': 62784, 'steps': 326, 'loss/train': 5.665132522583008}5}790435791}}} 11/06/2021 21:19:26 - INFO - __main__ - Step 331: {'lr': 8.25e-05, 'samples': 63552, 'steps': 330, 'loss/train': 5.495372295379639}5}790435791}}} 11/06/2021 21:19:28 - INFO - __main__ - Step 335: {'lr': 8.350000000000001e-05, 'samples': 64320, 'steps': 334, 'loss/train': 5.6893181800842285} 11/06/2021 21:19:30 - INFO - __main__ - Step 340: {'lr': 8.475000000000001e-05, 'samples': 65280, 'steps': 339, 'loss/train': 5.477786540985107}} 11/06/2021 21:19:33 - INFO - __main__ - Step 344: {'lr': 8.575000000000001e-05, 'samples': 66048, 'steps': 343, 'loss/train': 5.646244525909424}} 11/06/2021 21:19:35 - INFO - __main__ - Step 348: {'lr': 8.675e-05, 'samples': 66816, 'steps': 347, 'loss/train': 5.6107988357543945}4525909424}} 11/06/2021 21:19:36 - INFO - __main__ - Step 352: {'lr': 8.774999999999999e-05, 'samples': 67584, 'steps': 351, 'loss/train': 6.211921215057373}} 11/06/2021 21:19:38 - INFO - __main__ - Step 356: {'lr': 8.875e-05, 'samples': 68352, 'steps': 355, 'loss/train': 5.443009853363037}21215057373}} 11/06/2021 21:19:41 - INFO - __main__ - Step 361: {'lr': 8.999999999999999e-05, 'samples': 69312, 'steps': 360, 'loss/train': 7.289037227630615}} 11/06/2021 21:19:43 - INFO - __main__ - Step 365: {'lr': 9.1e-05, 'samples': 70080, 'steps': 364, 'loss/train': 5.4572978019714355}037227630615}} 11/06/2021 21:19:45 - INFO - __main__ - Step 369: {'lr': 9.2e-05, 'samples': 70848, 'steps': 368, 'loss/train': 5.474288463592529}}037227630615}} 11/06/2021 21:19:46 - INFO - __main__ - Step 373: {'lr': 9.3e-05, 'samples': 71616, 'steps': 372, 'loss/train': 5.273648262023926}}037227630615}} 11/06/2021 21:19:48 - INFO - __main__ - Step 377: {'lr': 9.400000000000001e-05, 'samples': 72384, 'steps': 376, 'loss/train': 5.5180864334106445} 11/06/2021 21:19:51 - INFO - __main__ - Step 382: {'lr': 9.525e-05, 'samples': 73344, 'steps': 381, 'loss/train': 5.458894729614258}864334106445} 11/06/2021 21:19:51 - INFO - __main__ - Step 382: {'lr': 9.525e-05, 'samples': 73344, 'steps': 381, 'loss/train': 5.458894729614258}864334106445} 11/06/2021 21:19:55 - INFO - __main__ - Step 389: {'lr': 9.7e-05, 'samples': 74688, 'steps': 388, 'loss/train': 5.781015396118164}8}864334106445} 11/06/2021 21:19:56 - INFO - __main__ - Step 393: {'lr': 9.800000000000001e-05, 'samples': 75456, 'steps': 392, 'loss/train': 5.350080966949463}} 11/06/2021 21:19:58 - INFO - __main__ - Step 397: {'lr': 9.900000000000001e-05, 'samples': 76224, 'steps': 396, 'loss/train': 5.724830150604248}} 11/06/2021 21:20:01 - INFO - __main__ - Step 402: {'lr': 0.00010025000000000001, 'samples': 77184, 'steps': 401, 'loss/train': 5.318662166595459} 11/06/2021 21:20:03 - INFO - __main__ - Step 406: {'lr': 0.00010125000000000001, 'samples': 77952, 'steps': 405, 'loss/train': 5.227279186248779} 11/06/2021 21:20:05 - INFO - __main__ - Step 410: {'lr': 0.00010224999999999999, 'samples': 78720, 'steps': 409, 'loss/train': 5.491655349731445} 11/06/2021 21:20:06 - INFO - __main__ - Step 414: {'lr': 0.00010325, 'samples': 79488, 'steps': 413, 'loss/train': 5.201007843017578}55349731445} 11/06/2021 21:20:08 - INFO - __main__ - Step 418: {'lr': 0.00010425, 'samples': 80256, 'steps': 417, 'loss/train': 5.490600109100342}55349731445} 11/06/2021 21:20:11 - INFO - __main__ - Step 423: {'lr': 0.0001055, 'samples': 81216, 'steps': 422, 'loss/train': 5.291647434234619}}55349731445} 11/06/2021 21:20:13 - INFO - __main__ - Step 427: {'lr': 0.0001065, 'samples': 81984, 'steps': 426, 'loss/train': 5.437173366546631}}55349731445} 11/06/2021 21:20:15 - INFO - __main__ - Step 431: {'lr': 0.0001075, 'samples': 82752, 'steps': 430, 'loss/train': 4.334190845489502}}55349731445} 11/06/2021 21:20:16 - INFO - __main__ - Step 435: {'lr': 0.00010850000000000001, 'samples': 83520, 'steps': 434, 'loss/train': 5.3725810050964355} 11/06/2021 21:20:18 - INFO - __main__ - Step 439: {'lr': 0.0001095, 'samples': 84288, 'steps': 438, 'loss/train': 5.41857385635376}25810050964355} 11/06/2021 21:20:18 - INFO - __main__ - Step 439: {'lr': 0.0001095, 'samples': 84288, 'steps': 438, 'loss/train': 5.41857385635376}25810050964355} 11/06/2021 21:20:23 - INFO - __main__ - Step 447: {'lr': 0.0001115, 'samples': 85824, 'steps': 446, 'loss/train': 5.4635491371154785}810050964355} 11/06/2021 21:20:25 - INFO - __main__ - Step 451: {'lr': 0.00011250000000000001, 'samples': 86592, 'steps': 450, 'loss/train': 4.955085754394531}} 11/06/2021 21:20:26 - INFO - __main__ - Step 455: {'lr': 0.00011350000000000001, 'samples': 87360, 'steps': 454, 'loss/train': 4.927133083343506}} 11/06/2021 21:20:29 - INFO - __main__ - Step 459: {'lr': 0.0001145, 'samples': 88128, 'steps': 458, 'loss/train': 6.301289081573486}133083343506}} 11/06/2021 21:20:31 - INFO - __main__ - Step 464: {'lr': 0.00011575000000000001, 'samples': 89088, 'steps': 463, 'loss/train': 4.844412326812744}} 11/06/2021 21:20:33 - INFO - __main__ - Step 468: {'lr': 0.00011675, 'samples': 89856, 'steps': 467, 'loss/train': 5.613701820373535}12326812744}} 11/06/2021 21:20:33 - INFO - __main__ - Step 468: {'lr': 0.00011675, 'samples': 89856, 'steps': 467, 'loss/train': 5.613701820373535}12326812744}} 11/06/2021 21:20:36 - INFO - __main__ - Step 475: {'lr': 0.0001185, 'samples': 91200, 'steps': 474, 'loss/train': 5.4914021492004395}12326812744}} 11/06/2021 21:20:39 - INFO - __main__ - Step 480: {'lr': 0.00011975, 'samples': 92160, 'steps': 479, 'loss/train': 5.236664295196533}12326812744}} 11/06/2021 21:20:39 - INFO - __main__ - Step 480: {'lr': 0.00011975, 'samples': 92160, 'steps': 479, 'loss/train': 5.236664295196533}12326812744}} 11/06/2021 21:20:43 - INFO - __main__ - Step 488: {'lr': 0.00012175, 'samples': 93696, 'steps': 487, 'loss/train': 5.1655683517456055}2326812744}} 11/06/2021 21:20:45 - INFO - __main__ - Step 492: {'lr': 0.00012275, 'samples': 94464, 'steps': 491, 'loss/train': 5.145169734954834}}2326812744}} 11/06/2021 21:20:47 - INFO - __main__ - Step 496: {'lr': 0.00012375, 'samples': 95232, 'steps': 495, 'loss/train': 4.821667194366455}}2326812744}} 11/06/2021 21:20:49 - INFO - __main__ - Step 501: {'lr': 0.000125, 'samples': 96192, 'steps': 500, 'loss/train': 5.50537109375}66455}}2326812744}} 11/06/2021 21:20:51 - INFO - __main__ - Step 505: {'lr': 0.000126, 'samples': 96960, 'steps': 504, 'loss/train': 5.220231056213379}5}}2326812744}} 11/06/2021 21:20:51 - INFO - __main__ - Step 505: {'lr': 0.000126, 'samples': 96960, 'steps': 504, 'loss/train': 5.220231056213379}5}}2326812744}} 11/06/2021 21:20:55 - INFO - __main__ - Step 512: {'lr': 0.00012775000000000002, 'samples': 98304, 'steps': 511, 'loss/train': 5.125716686248779}} 11/06/2021 21:20:57 - INFO - __main__ - Step 516: {'lr': 0.00012875, 'samples': 99072, 'steps': 515, 'loss/train': 5.111538410186768}16686248779}} 11/06/2021 21:20:59 - INFO - __main__ - Step 520: {'lr': 0.00012975, 'samples': 99840, 'steps': 519, 'loss/train': 4.82402229309082}}16686248779}} 11/06/2021 21:20:59 - INFO - __main__ - Step 520: {'lr': 0.00012975, 'samples': 99840, 'steps': 519, 'loss/train': 4.82402229309082}}16686248779}} 11/06/2021 21:21:03 - INFO - __main__ - Step 527: {'lr': 0.0001315, 'samples': 101184, 'steps': 526, 'loss/train': 5.979576587677002}16686248779}} 11/06/2021 21:21:05 - INFO - __main__ - Step 531: {'lr': 0.00013250000000000002, 'samples': 101952, 'steps': 530, 'loss/train': 4.945707321166992} 11/06/2021 21:21:05 - INFO - __main__ - Step 531: {'lr': 0.00013250000000000002, 'samples': 101952, 'steps': 530, 'loss/train': 4.945707321166992} 11/06/2021 21:21:05 - INFO - __main__ - Step 531: {'lr': 0.00013250000000000002, 'samples': 101952, 'steps': 530, 'loss/train': 4.945707321166992} 11/06/2021 21:21:10 - INFO - __main__ - Step 542: {'lr': 0.00013525, 'samples': 104064, 'steps': 541, 'loss/train': 6.563281059265137}07321166992} 11/06/2021 21:21:13 - INFO - __main__ - Step 547: {'lr': 0.0001365, 'samples': 105024, 'steps': 546, 'loss/train': 5.063644886016846}}07321166992} 11/06/2021 21:21:15 - INFO - __main__ - Step 551: {'lr': 0.0001375, 'samples': 105792, 'steps': 550, 'loss/train': 5.3217949867248535}07321166992} 11/06/2021 21:21:17 - INFO - __main__ - Step 555: {'lr': 0.0001385, 'samples': 106560, 'steps': 554, 'loss/train': 1.4391024112701416}07321166992} 11/06/2021 21:21:18 - INFO - __main__ - Step 559: {'lr': 0.0001395, 'samples': 107328, 'steps': 558, 'loss/train': 5.34543514251709}6}07321166992} 11/06/2021 21:21:20 - INFO - __main__ - Step 563: {'lr': 0.00014050000000000003, 'samples': 108096, 'steps': 562, 'loss/train': 5.075902462005615} 11/06/2021 21:21:23 - INFO - __main__ - Step 568: {'lr': 0.00014175, 'samples': 109056, 'steps': 567, 'loss/train': 5.004296779632568}02462005615} 11/06/2021 21:21:23 - INFO - __main__ - Step 568: {'lr': 0.00014175, 'samples': 109056, 'steps': 567, 'loss/train': 5.004296779632568}02462005615} 11/06/2021 21:21:23 - INFO - __main__ - Step 568: {'lr': 0.00014175, 'samples': 109056, 'steps': 567, 'loss/train': 5.004296779632568}02462005615} 11/06/2021 21:21:28 - INFO - __main__ - Step 578: {'lr': 0.00014424999999999998, 'samples': 110976, 'steps': 577, 'loss/train': 4.281363487243652} 11/06/2021 21:21:31 - INFO - __main__ - Step 583: {'lr': 0.00014549999999999999, 'samples': 111936, 'steps': 582, 'loss/train': 5.447309970855713} 11/06/2021 21:21:31 - INFO - __main__ - Step 583: {'lr': 0.00014549999999999999, 'samples': 111936, 'steps': 582, 'loss/train': 5.447309970855713} 11/06/2021 21:21:35 - INFO - __main__ - Step 591: {'lr': 0.0001475, 'samples': 113472, 'steps': 590, 'loss/train': 5.016930103302002}309970855713} 11/06/2021 21:21:36 - INFO - __main__ - Step 595: {'lr': 0.0001485, 'samples': 114240, 'steps': 594, 'loss/train': 4.915690898895264}309970855713} 11/06/2021 21:21:38 - INFO - __main__ - Step 599: {'lr': 0.0001495, 'samples': 115008, 'steps': 598, 'loss/train': 4.568467140197754}309970855713} 11/06/2021 21:21:38 - INFO - __main__ - Step 599: {'lr': 0.0001495, 'samples': 115008, 'steps': 598, 'loss/train': 4.568467140197754}309970855713} 11/06/2021 21:21:38 - INFO - __main__ - Step 599: {'lr': 0.0001495, 'samples': 115008, 'steps': 598, 'loss/train': 4.568467140197754}309970855713} 11/06/2021 21:21:44 - INFO - __main__ - Step 610: {'lr': 0.00015225, 'samples': 117120, 'steps': 609, 'loss/train': 4.78218412399292}309970855713} 11/06/2021 21:21:47 - INFO - __main__ - Step 615: {'lr': 0.0001535, 'samples': 118080, 'steps': 614, 'loss/train': 5.002647876739502}309970855713} 11/06/2021 21:21:49 - INFO - __main__ - Step 619: {'lr': 0.00015450000000000001, 'samples': 118848, 'steps': 618, 'loss/train': 4.601840019226074} 11/06/2021 21:21:51 - INFO - __main__ - Step 623: {'lr': 0.0001555, 'samples': 119616, 'steps': 622, 'loss/train': 4.604945182800293}840019226074} 11/06/2021 21:21:52 - INFO - __main__ - Step 627: {'lr': 0.0001565, 'samples': 120384, 'steps': 626, 'loss/train': 4.22480583190918}}840019226074} 11/06/2021 21:21:55 - INFO - __main__ - Step 631: {'lr': 0.0001575, 'samples': 121152, 'steps': 630, 'loss/train': 4.3496575355529785}40019226074} 11/06/2021 21:21:57 - INFO - __main__ - Step 636: {'lr': 0.00015875, 'samples': 122112, 'steps': 635, 'loss/train': 5.596816062927246}40019226074} 11/06/2021 21:21:59 - INFO - __main__ - Step 640: {'lr': 0.00015975, 'samples': 122880, 'steps': 639, 'loss/train': 5.002859115600586}40019226074} 11/06/2021 21:21:59 - INFO - __main__ - Step 640: {'lr': 0.00015975, 'samples': 122880, 'steps': 639, 'loss/train': 5.002859115600586}40019226074} 11/06/2021 21:22:02 - INFO - __main__ - Step 647: {'lr': 0.0001615, 'samples': 124224, 'steps': 646, 'loss/train': 4.445610046386719}}40019226074} 11/06/2021 21:22:04 - INFO - __main__ - Step 651: {'lr': 0.00016250000000000002, 'samples': 124992, 'steps': 650, 'loss/train': 4.449684143066406} 11/06/2021 21:22:07 - INFO - __main__ - Step 656: {'lr': 0.00016375000000000002, 'samples': 125952, 'steps': 655, 'loss/train': 4.627468109130859} 11/06/2021 21:22:09 - INFO - __main__ - Step 661: {'lr': 0.000165, 'samples': 126912, 'steps': 660, 'loss/train': 4.710036277770996}7468109130859} 11/06/2021 21:22:09 - INFO - __main__ - Step 661: {'lr': 0.000165, 'samples': 126912, 'steps': 660, 'loss/train': 4.710036277770996}7468109130859} 11/06/2021 21:22:12 - INFO - __main__ - Step 668: {'lr': 0.00016675000000000001, 'samples': 128256, 'steps': 667, 'loss/train': 4.959212303161621} 11/06/2021 21:22:15 - INFO - __main__ - Step 672: {'lr': 0.00016775, 'samples': 129024, 'steps': 671, 'loss/train': 4.435437202453613}12303161621} 11/06/2021 21:22:17 - INFO - __main__ - Step 677: {'lr': 0.00016900000000000002, 'samples': 129984, 'steps': 676, 'loss/train': 4.679195880889893} 11/06/2021 21:22:19 - INFO - __main__ - Step 681: {'lr': 0.00017, 'samples': 130752, 'steps': 680, 'loss/train': 4.423855781555176}79195880889893} 11/06/2021 21:22:21 - INFO - __main__ - Step 685: {'lr': 0.000171, 'samples': 131520, 'steps': 684, 'loss/train': 4.730793476104736}9195880889893} 11/06/2021 21:22:21 - INFO - __main__ - Step 685: {'lr': 0.000171, 'samples': 131520, 'steps': 684, 'loss/train': 4.730793476104736}9195880889893} 11/06/2021 21:22:25 - INFO - __main__ - Step 692: {'lr': 0.00017275, 'samples': 132864, 'steps': 691, 'loss/train': 4.4996209144592285}5880889893} 11/06/2021 21:22:25 - INFO - __main__ - Step 692: {'lr': 0.00017275, 'samples': 132864, 'steps': 691, 'loss/train': 4.4996209144592285}5880889893} 11/06/2021 21:22:25 - INFO - __main__ - Step 692: {'lr': 0.00017275, 'samples': 132864, 'steps': 691, 'loss/train': 4.4996209144592285}5880889893} 11/06/2021 21:22:31 - INFO - __main__ - Step 703: {'lr': 0.00017549999999999998, 'samples': 134976, 'steps': 702, 'loss/train': 4.290457248687744} 11/06/2021 21:22:33 - INFO - __main__ - Step 709: {'lr': 0.000177, 'samples': 136128, 'steps': 708, 'loss/train': 4.986819267272949}0457248687744} 11/06/2021 21:22:33 - INFO - __main__ - Step 709: {'lr': 0.000177, 'samples': 136128, 'steps': 708, 'loss/train': 4.986819267272949}0457248687744} 11/06/2021 21:22:37 - INFO - __main__ - Step 716: {'lr': 0.00017875, 'samples': 137472, 'steps': 715, 'loss/train': 4.557127475738525}57248687744} 11/06/2021 21:22:39 - INFO - __main__ - Step 720: {'lr': 0.00017975, 'samples': 138240, 'steps': 719, 'loss/train': 4.539857864379883}57248687744} 11/06/2021 21:22:39 - INFO - __main__ - Step 720: {'lr': 0.00017975, 'samples': 138240, 'steps': 719, 'loss/train': 4.539857864379883}57248687744} 11/06/2021 21:22:43 - INFO - __main__ - Step 728: {'lr': 0.00018175, 'samples': 139776, 'steps': 727, 'loss/train': 4.543506145477295}57248687744} 11/06/2021 21:22:45 - INFO - __main__ - Step 732: {'lr': 0.00018275, 'samples': 140544, 'steps': 731, 'loss/train': 4.329798221588135}57248687744} 11/06/2021 21:22:47 - INFO - __main__ - Step 736: {'lr': 0.00018375, 'samples': 141312, 'steps': 735, 'loss/train': 5.126845359802246}57248687744} 11/06/2021 21:22:49 - INFO - __main__ - Step 741: {'lr': 0.000185, 'samples': 142272, 'steps': 740, 'loss/train': 4.524784564971924}6}57248687744} 11/06/2021 21:22:51 - INFO - __main__ - Step 745: {'lr': 0.000186, 'samples': 143040, 'steps': 744, 'loss/train': 4.4243059158325195}}57248687744} 11/06/2021 21:22:51 - INFO - __main__ - Step 745: {'lr': 0.000186, 'samples': 143040, 'steps': 744, 'loss/train': 4.4243059158325195}}57248687744} 11/06/2021 21:22:55 - INFO - __main__ - Step 752: {'lr': 0.00018775, 'samples': 144384, 'steps': 751, 'loss/train': 4.103482723236084}57248687744} 11/06/2021 21:22:57 - INFO - __main__ - Step 757: {'lr': 0.000189, 'samples': 145344, 'steps': 756, 'loss/train': 4.462242126464844}4}57248687744} 11/06/2021 21:22:57 - INFO - __main__ - Step 757: {'lr': 0.000189, 'samples': 145344, 'steps': 756, 'loss/train': 4.462242126464844}4}57248687744} 11/06/2021 21:23:02 - INFO - __main__ - Step 765: {'lr': 0.000191, 'samples': 146880, 'steps': 764, 'loss/train': 4.935421466827393}4}57248687744} 11/06/2021 21:23:04 - INFO - __main__ - Step 769: {'lr': 0.000192, 'samples': 147648, 'steps': 768, 'loss/train': 5.1701979637146}3}4}57248687744} 11/06/2021 21:23:05 - INFO - __main__ - Step 773: {'lr': 0.000193, 'samples': 148416, 'steps': 772, 'loss/train': 4.7581634521484375}}57248687744} 11/06/2021 21:23:07 - INFO - __main__ - Step 777: {'lr': 0.000194, 'samples': 149184, 'steps': 776, 'loss/train': 4.947287559509277}}}57248687744} 11/06/2021 21:23:10 - INFO - __main__ - Step 782: {'lr': 0.00019525, 'samples': 150144, 'steps': 781, 'loss/train': 4.558263301849365}57248687744} 11/06/2021 21:23:12 - INFO - __main__ - Step 786: {'lr': 0.00019625, 'samples': 150912, 'steps': 785, 'loss/train': 4.140824317932129}57248687744} 11/06/2021 21:23:12 - INFO - __main__ - Step 786: {'lr': 0.00019625, 'samples': 150912, 'steps': 785, 'loss/train': 4.140824317932129}57248687744} 11/06/2021 21:23:15 - INFO - __main__ - Step 793: {'lr': 0.00019800000000000002, 'samples': 152256, 'steps': 792, 'loss/train': 4.764225482940674} 11/06/2021 21:23:17 - INFO - __main__ - Step 797: {'lr': 0.000199, 'samples': 153024, 'steps': 796, 'loss/train': 4.558513641357422}4225482940674} 11/06/2021 21:23:17 - INFO - __main__ - Step 797: {'lr': 0.000199, 'samples': 153024, 'steps': 796, 'loss/train': 4.558513641357422}4225482940674} 11/06/2021 21:23:21 - INFO - __main__ - Step 804: {'lr': 0.00020075000000000003, 'samples': 154368, 'steps': 803, 'loss/train': 2.2776834964752197} 11/06/2021 21:23:21 - INFO - __main__ - Step 804: {'lr': 0.00020075000000000003, 'samples': 154368, 'steps': 803, 'loss/train': 2.2776834964752197} 11/06/2021 21:23:26 - INFO - __main__ - Step 812: {'lr': 0.00020275000000000002, 'samples': 155904, 'steps': 811, 'loss/train': 4.3080902099609375} 11/06/2021 21:23:27 - INFO - __main__ - Step 816: {'lr': 0.00020375, 'samples': 156672, 'steps': 815, 'loss/train': 4.675464153289795}902099609375} 11/06/2021 21:23:29 - INFO - __main__ - Step 820: {'lr': 0.00020475, 'samples': 157440, 'steps': 819, 'loss/train': 5.175144672393799}902099609375} 11/06/2021 21:23:32 - INFO - __main__ - Step 825: {'lr': 0.000206, 'samples': 158400, 'steps': 824, 'loss/train': 4.396790981292725}9}902099609375} 11/06/2021 21:23:34 - INFO - __main__ - Step 829: {'lr': 0.000207, 'samples': 159168, 'steps': 828, 'loss/train': 4.163397312164307}9}902099609375} 11/06/2021 21:23:36 - INFO - __main__ - Step 833: {'lr': 0.000208, 'samples': 159936, 'steps': 832, 'loss/train': 4.7883687019348145}}902099609375} 11/06/2021 21:23:37 - INFO - __main__ - Step 837: {'lr': 0.00020899999999999998, 'samples': 160704, 'steps': 836, 'loss/train': 4.873067378997803}} 11/06/2021 21:23:40 - INFO - __main__ - Step 841: {'lr': 0.00021, 'samples': 161472, 'steps': 840, 'loss/train': 6.019371509552002}73067378997803}} 11/06/2021 21:23:40 - INFO - __main__ - Step 841: {'lr': 0.00021, 'samples': 161472, 'steps': 840, 'loss/train': 6.019371509552002}73067378997803}} 11/06/2021 21:23:43 - INFO - __main__ - Step 848: {'lr': 0.00021175, 'samples': 162816, 'steps': 847, 'loss/train': 4.035452842712402}67378997803}} 11/06/2021 21:23:45 - INFO - __main__ - Step 852: {'lr': 0.00021275, 'samples': 163584, 'steps': 851, 'loss/train': 4.537678241729736}67378997803}} 11/06/2021 21:23:45 - INFO - __main__ - Step 852: {'lr': 0.00021275, 'samples': 163584, 'steps': 851, 'loss/train': 4.537678241729736}67378997803}} 11/06/2021 21:23:45 - INFO - __main__ - Step 852: {'lr': 0.00021275, 'samples': 163584, 'steps': 851, 'loss/train': 4.537678241729736}67378997803}} 11/06/2021 21:23:51 - INFO - __main__ - Step 863: {'lr': 0.0002155, 'samples': 165696, 'steps': 862, 'loss/train': 4.436905384063721}}67378997803}} 11/06/2021 21:23:53 - INFO - __main__ - Step 867: {'lr': 0.0002165, 'samples': 166464, 'steps': 866, 'loss/train': 4.653875827789307}}67378997803}} 11/06/2021 21:23:55 - INFO - __main__ - Step 872: {'lr': 0.00021775, 'samples': 167424, 'steps': 871, 'loss/train': 1.9217588901519775}7378997803}} 11/06/2021 21:23:58 - INFO - __main__ - Step 877: {'lr': 0.000219, 'samples': 168384, 'steps': 876, 'loss/train': 4.769442558288574}75}7378997803}} 11/06/2021 21:23:58 - INFO - __main__ - Step 877: {'lr': 0.000219, 'samples': 168384, 'steps': 876, 'loss/train': 4.769442558288574}75}7378997803}} 11/06/2021 21:24:02 - INFO - __main__ - Step 884: {'lr': 0.00022075, 'samples': 169728, 'steps': 883, 'loss/train': 4.347689151763916}}7378997803}} 11/06/2021 21:24:03 - INFO - __main__ - Step 888: {'lr': 0.00022175, 'samples': 170496, 'steps': 887, 'loss/train': 4.312256813049316}}7378997803}} 11/06/2021 21:24:05 - INFO - __main__ - Step 892: {'lr': 0.00022275000000000002, 'samples': 171264, 'steps': 891, 'loss/train': 4.234619617462158}} 11/06/2021 21:24:07 - INFO - __main__ - Step 897: {'lr': 0.000224, 'samples': 172224, 'steps': 896, 'loss/train': 3.291337251663208}4619617462158}} 11/06/2021 21:24:10 - INFO - __main__ - Step 903: {'lr': 0.0002255, 'samples': 173376, 'steps': 902, 'loss/train': 4.559982776641846}619617462158}} 11/06/2021 21:24:10 - INFO - __main__ - Step 903: {'lr': 0.0002255, 'samples': 173376, 'steps': 902, 'loss/train': 4.559982776641846}619617462158}} 11/06/2021 21:24:10 - INFO - __main__ - Step 903: {'lr': 0.0002255, 'samples': 173376, 'steps': 902, 'loss/train': 4.559982776641846}619617462158}} 11/06/2021 21:24:15 - INFO - __main__ - Step 913: {'lr': 0.000228, 'samples': 175296, 'steps': 912, 'loss/train': 4.5875163078308105}619617462158}} 11/06/2021 21:24:15 - INFO - __main__ - Step 913: {'lr': 0.000228, 'samples': 175296, 'steps': 912, 'loss/train': 4.5875163078308105}619617462158}} 11/06/2021 21:24:19 - INFO - __main__ - Step 921: {'lr': 0.00023, 'samples': 176832, 'steps': 920, 'loss/train': 4.508198261260986}5}619617462158}} 11/06/2021 21:24:21 - INFO - __main__ - Step 925: {'lr': 0.000231, 'samples': 177600, 'steps': 924, 'loss/train': 5.673306941986084}}619617462158}} 11/06/2021 21:24:23 - INFO - __main__ - Step 929: {'lr': 0.00023200000000000003, 'samples': 178368, 'steps': 928, 'loss/train': 3.650172233581543}} 11/06/2021 21:24:25 - INFO - __main__ - Step 934: {'lr': 0.00023325, 'samples': 179328, 'steps': 933, 'loss/train': 3.821877956390381}72233581543}} 11/06/2021 21:24:25 - INFO - __main__ - Step 934: {'lr': 0.00023325, 'samples': 179328, 'steps': 933, 'loss/train': 3.821877956390381}72233581543}} 11/06/2021 21:24:30 - INFO - __main__ - Step 942: {'lr': 0.00023525, 'samples': 180864, 'steps': 941, 'loss/train': 4.118391036987305}72233581543}} 11/06/2021 21:24:32 - INFO - __main__ - Step 946: {'lr': 0.00023625, 'samples': 181632, 'steps': 945, 'loss/train': 4.698647975921631}72233581543}} 11/06/2021 21:24:33 - INFO - __main__ - Step 950: {'lr': 0.00023725, 'samples': 182400, 'steps': 949, 'loss/train': 4.038322448730469}72233581543}} 11/06/2021 21:24:35 - INFO - __main__ - Step 954: {'lr': 0.00023825, 'samples': 183168, 'steps': 953, 'loss/train': 4.3065505027771}9}72233581543}} 11/06/2021 21:24:38 - INFO - __main__ - Step 959: {'lr': 0.0002395, 'samples': 184128, 'steps': 958, 'loss/train': 4.376034259796143}}72233581543}} 11/06/2021 21:24:38 - INFO - __main__ - Step 959: {'lr': 0.0002395, 'samples': 184128, 'steps': 958, 'loss/train': 4.376034259796143}}72233581543}} 11/06/2021 21:24:41 - INFO - __main__ - Step 966: {'lr': 0.00024125, 'samples': 185472, 'steps': 965, 'loss/train': 4.45121431350708}}72233581543}} 11/06/2021 21:24:43 - INFO - __main__ - Step 970: {'lr': 0.00024225, 'samples': 186240, 'steps': 969, 'loss/train': 4.633758544921875}72233581543}} 11/06/2021 21:24:46 - INFO - __main__ - Step 975: {'lr': 0.0002435, 'samples': 187200, 'steps': 974, 'loss/train': 4.149434566497803}}72233581543}} 11/06/2021 21:24:48 - INFO - __main__ - Step 979: {'lr': 0.0002445, 'samples': 187968, 'steps': 978, 'loss/train': 4.069120407104492}}72233581543}} 11/06/2021 21:24:50 - INFO - __main__ - Step 983: {'lr': 0.0002455, 'samples': 188736, 'steps': 982, 'loss/train': 4.437088966369629}}72233581543}} 11/06/2021 21:24:51 - INFO - __main__ - Step 987: {'lr': 0.00024650000000000003, 'samples': 189504, 'steps': 986, 'loss/train': 3.8113763332366943} 11/06/2021 21:24:53 - INFO - __main__ - Step 991: {'lr': 0.0002475, 'samples': 190272, 'steps': 990, 'loss/train': 4.3600897789001465}763332366943} 11/06/2021 21:24:56 - INFO - __main__ - Step 996: {'lr': 0.00024875, 'samples': 191232, 'steps': 995, 'loss/train': 4.652307510375977}763332366943} 11/06/2021 21:24:58 - INFO - __main__ - Step 1000: {'lr': 0.00024975, 'samples': 192000, 'steps': 999, 'loss/train': 4.688298225402832}63332366943} 11/06/2021 21:24:58 - INFO - __main__ - Step 1000: {'lr': 0.00024975, 'samples': 192000, 'steps': 999, 'loss/train': 4.688298225402832}63332366943} 11/06/2021 21:25:01 - INFO - __main__ - Step 1007: {'lr': 0.0002515, 'samples': 193344, 'steps': 1006, 'loss/train': 2.1154818534851074}3332366943} 11/06/2021 21:25:04 - INFO - __main__ - Step 1012: {'lr': 0.00025275, 'samples': 194304, 'steps': 1011, 'loss/train': 3.661561965942383}3332366943} 11/06/2021 21:25:06 - INFO - __main__ - Step 1017: {'lr': 0.000254, 'samples': 195264, 'steps': 1016, 'loss/train': 6.699620723724365}3}3332366943} 11/06/2021 21:25:08 - INFO - __main__ - Step 1021: {'lr': 0.000255, 'samples': 196032, 'steps': 1020, 'loss/train': 3.97232723236084}}3}3332366943} 11/06/2021 21:25:10 - INFO - __main__ - Step 1025: {'lr': 0.000256, 'samples': 196800, 'steps': 1024, 'loss/train': 4.093658924102783}3}3332366943} 11/06/2021 21:25:11 - INFO - __main__ - Step 1029: {'lr': 0.000257, 'samples': 197568, 'steps': 1028, 'loss/train': 3.6540799140930176}}3332366943} 11/06/2021 21:25:14 - INFO - __main__ - Step 1033: {'lr': 0.00025800000000000004, 'samples': 198336, 'steps': 1032, 'loss/train': 1.9851371049880981} 11/06/2021 21:25:16 - INFO - __main__ - Step 1038: {'lr': 0.00025925, 'samples': 199296, 'steps': 1037, 'loss/train': 5.716175556182861}371049880981} 11/06/2021 21:25:18 - INFO - __main__ - Step 1042: {'lr': 0.00026025, 'samples': 200064, 'steps': 1041, 'loss/train': 4.489924430847168}371049880981} 11/06/2021 21:25:18 - INFO - __main__ - Step 1042: {'lr': 0.00026025, 'samples': 200064, 'steps': 1041, 'loss/train': 4.489924430847168}371049880981} 11/06/2021 21:25:21 - INFO - __main__ - Step 1049: {'lr': 0.000262, 'samples': 201408, 'steps': 1048, 'loss/train': 3.938405990600586}8}371049880981} 11/06/2021 21:25:23 - INFO - __main__ - Step 1053: {'lr': 0.000263, 'samples': 202176, 'steps': 1052, 'loss/train': 4.3338303565979}6}8}371049880981} 11/06/2021 21:25:26 - INFO - __main__ - Step 1058: {'lr': 0.00026425, 'samples': 203136, 'steps': 1057, 'loss/train': 4.140725135803223}371049880981} 11/06/2021 21:25:26 - INFO - __main__ - Step 1058: {'lr': 0.00026425, 'samples': 203136, 'steps': 1057, 'loss/train': 4.140725135803223}371049880981} 11/06/2021 21:25:30 - INFO - __main__ - Step 1066: {'lr': 0.00026625, 'samples': 204672, 'steps': 1065, 'loss/train': 3.962754011154175}371049880981} 11/06/2021 21:25:31 - INFO - __main__ - Step 1070: {'lr': 0.00026725, 'samples': 205440, 'steps': 1069, 'loss/train': 2.362624168395996}371049880981} 11/06/2021 21:25:34 - INFO - __main__ - Step 1074: {'lr': 0.00026825, 'samples': 206208, 'steps': 1073, 'loss/train': 3.240086078643799}371049880981} 11/06/2021 21:25:36 - INFO - __main__ - Step 1079: {'lr': 0.00026950000000000005, 'samples': 207168, 'steps': 1078, 'loss/train': 4.370131015777588}} 11/06/2021 21:25:38 - INFO - __main__ - Step 1084: {'lr': 0.00027075, 'samples': 208128, 'steps': 1083, 'loss/train': 3.959185838699341}31015777588}} 11/06/2021 21:25:38 - INFO - __main__ - Step 1084: {'lr': 0.00027075, 'samples': 208128, 'steps': 1083, 'loss/train': 3.959185838699341}31015777588}} 11/06/2021 21:25:42 - INFO - __main__ - Step 1091: {'lr': 0.0002725, 'samples': 209472, 'steps': 1090, 'loss/train': 3.5656397342681885}31015777588}} 11/06/2021 21:25:44 - INFO - __main__ - Step 1095: {'lr': 0.00027350000000000003, 'samples': 210240, 'steps': 1094, 'loss/train': 4.242520332336426}} 11/06/2021 21:25:46 - INFO - __main__ - Step 1099: {'lr': 0.0002745, 'samples': 211008, 'steps': 1098, 'loss/train': 4.089423179626465}520332336426}} 11/06/2021 21:25:48 - INFO - __main__ - Step 1103: {'lr': 0.00027550000000000003, 'samples': 211776, 'steps': 1102, 'loss/train': 6.588270664215088}} 11/06/2021 21:25:48 - INFO - __main__ - Step 1103: {'lr': 0.00027550000000000003, 'samples': 211776, 'steps': 1102, 'loss/train': 6.588270664215088}} 11/06/2021 21:25:52 - INFO - __main__ - Step 1110: {'lr': 0.00027725, 'samples': 213120, 'steps': 1109, 'loss/train': 4.272808074951172}70664215088}} 11/06/2021 21:25:52 - INFO - __main__ - Step 1110: {'lr': 0.00027725, 'samples': 213120, 'steps': 1109, 'loss/train': 4.272808074951172}70664215088}} 11/06/2021 21:25:56 - INFO - __main__ - Step 1118: {'lr': 0.00027925, 'samples': 214656, 'steps': 1117, 'loss/train': 4.0414958000183105}0664215088}} 11/06/2021 21:25:57 - INFO - __main__ - Step 1122: {'lr': 0.00028025, 'samples': 215424, 'steps': 1121, 'loss/train': 4.281680583953857}}0664215088}} 11/06/2021 21:26:00 - INFO - __main__ - Step 1126: {'lr': 0.00028125000000000003, 'samples': 216192, 'steps': 1125, 'loss/train': 4.240171432495117}} 11/06/2021 21:26:00 - INFO - __main__ - Step 1126: {'lr': 0.00028125000000000003, 'samples': 216192, 'steps': 1125, 'loss/train': 4.240171432495117}} 11/06/2021 21:26:04 - INFO - __main__ - Step 1134: {'lr': 0.00028325000000000003, 'samples': 217728, 'steps': 1133, 'loss/train': 4.218957424163818}} 11/06/2021 21:26:06 - INFO - __main__ - Step 1138: {'lr': 0.00028425, 'samples': 218496, 'steps': 1137, 'loss/train': 4.085984706878662}57424163818}} 11/06/2021 21:26:07 - INFO - __main__ - Step 1142: {'lr': 0.00028525, 'samples': 219264, 'steps': 1141, 'loss/train': 3.885024070739746}57424163818}} 11/06/2021 21:26:10 - INFO - __main__ - Step 1146: {'lr': 0.00028625, 'samples': 220032, 'steps': 1145, 'loss/train': 3.5987935066223145}7424163818}} 11/06/2021 21:26:12 - INFO - __main__ - Step 1151: {'lr': 0.0002875, 'samples': 220992, 'steps': 1150, 'loss/train': 3.9367594718933105}}7424163818}} 11/06/2021 21:26:14 - INFO - __main__ - Step 1155: {'lr': 0.00028849999999999997, 'samples': 221760, 'steps': 1154, 'loss/train': 4.096200942993164}} 11/06/2021 21:26:16 - INFO - __main__ - Step 1159: {'lr': 0.0002895, 'samples': 222528, 'steps': 1158, 'loss/train': 2.7520174980163574}00942993164}} 11/06/2021 21:26:18 - INFO - __main__ - Step 1163: {'lr': 0.00029049999999999996, 'samples': 223296, 'steps': 1162, 'loss/train': 4.1517109870910645} 11/06/2021 21:26:20 - INFO - __main__ - Step 1167: {'lr': 0.0002915, 'samples': 224064, 'steps': 1166, 'loss/train': 3.959521770477295}7109870910645} 11/06/2021 21:26:22 - INFO - __main__ - Step 1172: {'lr': 0.00029275000000000004, 'samples': 225024, 'steps': 1171, 'loss/train': 4.092573642730713}} 11/06/2021 21:26:24 - INFO - __main__ - Step 1176: {'lr': 0.00029375, 'samples': 225792, 'steps': 1175, 'loss/train': 3.6532394886016846}3642730713}} 11/06/2021 21:26:24 - INFO - __main__ - Step 1176: {'lr': 0.00029375, 'samples': 225792, 'steps': 1175, 'loss/train': 3.6532394886016846}3642730713}} 11/06/2021 21:26:27 - INFO - __main__ - Step 1183: {'lr': 0.00029549999999999997, 'samples': 227136, 'steps': 1182, 'loss/train': 4.172230243682861}} 11/06/2021 21:26:30 - INFO - __main__ - Step 1188: {'lr': 0.00029675000000000003, 'samples': 228096, 'steps': 1187, 'loss/train': 3.7164466381073}1}} 11/06/2021 21:26:32 - INFO - __main__ - Step 1193: {'lr': 0.000298, 'samples': 229056, 'steps': 1192, 'loss/train': 5.458200454711914}64466381073}1}} 11/06/2021 21:26:32 - INFO - __main__ - Step 1193: {'lr': 0.000298, 'samples': 229056, 'steps': 1192, 'loss/train': 5.458200454711914}64466381073}1}} 11/06/2021 21:26:36 - INFO - __main__ - Step 1200: {'lr': 0.00029975000000000005, 'samples': 230400, 'steps': 1199, 'loss/train': 4.041784286499023}} 11/06/2021 21:26:37 - INFO - __main__ - Step 1204: {'lr': 0.00030075, 'samples': 231168, 'steps': 1203, 'loss/train': 4.123647689819336}84286499023}} 11/06/2021 21:26:40 - INFO - __main__ - Step 1208: {'lr': 0.00030175000000000004, 'samples': 231936, 'steps': 1207, 'loss/train': 4.022327423095703}} 11/06/2021 21:26:42 - INFO - __main__ - Step 1213: {'lr': 0.000303, 'samples': 232896, 'steps': 1212, 'loss/train': 3.7347517013549805}327423095703}} 11/06/2021 21:26:44 - INFO - __main__ - Step 1218: {'lr': 0.00030425000000000005, 'samples': 233856, 'steps': 1217, 'loss/train': 3.7530202865600586} 11/06/2021 21:26:44 - INFO - __main__ - Step 1218: {'lr': 0.00030425000000000005, 'samples': 233856, 'steps': 1217, 'loss/train': 3.7530202865600586} 11/06/2021 21:26:44 - INFO - __main__ - Step 1218: {'lr': 0.00030425000000000005, 'samples': 233856, 'steps': 1217, 'loss/train': 3.7530202865600586} 11/06/2021 21:26:50 - INFO - __main__ - Step 1229: {'lr': 0.000307, 'samples': 235968, 'steps': 1228, 'loss/train': 4.392697811126709}30202865600586} 11/06/2021 21:26:52 - INFO - __main__ - Step 1234: {'lr': 0.00030825000000000004, 'samples': 236928, 'steps': 1233, 'loss/train': 3.6062018871307373} 11/06/2021 21:26:54 - INFO - __main__ - Step 1238: {'lr': 0.00030925, 'samples': 237696, 'steps': 1237, 'loss/train': 5.033356666564941}018871307373} 11/06/2021 21:26:56 - INFO - __main__ - Step 1242: {'lr': 0.00031025000000000003, 'samples': 238464, 'steps': 1241, 'loss/train': 3.553842067718506}} 11/06/2021 21:26:58 - INFO - __main__ - Step 1246: {'lr': 0.00031125000000000006, 'samples': 239232, 'steps': 1245, 'loss/train': 3.7371480464935303} 11/06/2021 21:27:00 - INFO - __main__ - Step 1250: {'lr': 0.00031225000000000003, 'samples': 240000, 'steps': 1249, 'loss/train': 3.6074068546295166} 11/06/2021 21:27:02 - INFO - __main__ - Step 1255: {'lr': 0.00031350000000000003, 'samples': 240960, 'steps': 1254, 'loss/train': 3.425907850265503}} 11/06/2021 21:27:04 - INFO - __main__ - Step 1259: {'lr': 0.0003145, 'samples': 241728, 'steps': 1258, 'loss/train': 4.367678642272949}907850265503}} 11/06/2021 21:27:04 - INFO - __main__ - Step 1259: {'lr': 0.0003145, 'samples': 241728, 'steps': 1258, 'loss/train': 4.367678642272949}907850265503}} 11/06/2021 21:27:07 - INFO - __main__ - Step 1266: {'lr': 0.00031624999999999996, 'samples': 243072, 'steps': 1265, 'loss/train': 3.6147239208221436} 11/06/2021 21:27:10 - INFO - __main__ - Step 1271: {'lr': 0.0003175, 'samples': 244032, 'steps': 1270, 'loss/train': 3.718264102935791}7239208221436} 11/06/2021 21:27:10 - INFO - __main__ - Step 1271: {'lr': 0.0003175, 'samples': 244032, 'steps': 1270, 'loss/train': 3.718264102935791}7239208221436} 11/06/2021 21:27:14 - INFO - __main__ - Step 1278: {'lr': 0.00031925, 'samples': 245376, 'steps': 1277, 'loss/train': 2.880706548690796}239208221436} 11/06/2021 21:27:16 - INFO - __main__ - Step 1282: {'lr': 0.00032025, 'samples': 246144, 'steps': 1281, 'loss/train': 2.492274761199951}239208221436} 11/06/2021 21:27:18 - INFO - __main__ - Step 1286: {'lr': 0.00032125, 'samples': 246912, 'steps': 1285, 'loss/train': 4.252817153930664}239208221436} 11/06/2021 21:27:20 - INFO - __main__ - Step 1291: {'lr': 0.00032250000000000003, 'samples': 247872, 'steps': 1290, 'loss/train': 3.7204911708831787} 11/06/2021 21:27:23 - INFO - __main__ - Step 1296: {'lr': 0.00032375, 'samples': 248832, 'steps': 1295, 'loss/train': 3.61068058013916}4911708831787} 11/06/2021 21:27:25 - INFO - __main__ - Step 1300: {'lr': 0.00032475, 'samples': 249600, 'steps': 1299, 'loss/train': 3.8040266036987305}11708831787} 11/06/2021 21:27:27 - INFO - __main__ - Step 1304: {'lr': 0.00032575, 'samples': 250368, 'steps': 1303, 'loss/train': 3.846041440963745}}11708831787} 11/06/2021 21:27:27 - INFO - __main__ - Step 1304: {'lr': 0.00032575, 'samples': 250368, 'steps': 1303, 'loss/train': 3.846041440963745}}11708831787} 11/06/2021 21:27:30 - INFO - __main__ - Step 1311: {'lr': 0.00032750000000000005, 'samples': 251712, 'steps': 1310, 'loss/train': 3.8077216148376465} 11/06/2021 21:27:33 - INFO - __main__ - Step 1317: {'lr': 0.00032900000000000003, 'samples': 252864, 'steps': 1316, 'loss/train': 3.848276138305664}} 11/06/2021 21:27:35 - INFO - __main__ - Step 1321: {'lr': 0.00033, 'samples': 253632, 'steps': 1320, 'loss/train': 3.623622179031372}48276138305664}} 11/06/2021 21:27:35 - INFO - __main__ - Step 1321: {'lr': 0.00033, 'samples': 253632, 'steps': 1320, 'loss/train': 3.623622179031372}48276138305664}} 11/06/2021 21:27:38 - INFO - __main__ - Step 1328: {'lr': 0.00033175, 'samples': 254976, 'steps': 1327, 'loss/train': 3.815504550933838}76138305664}} 11/06/2021 21:27:40 - INFO - __main__ - Step 1332: {'lr': 0.00033275, 'samples': 255744, 'steps': 1331, 'loss/train': 3.404329299926758}76138305664}} 11/06/2021 21:27:40 - INFO - __main__ - Step 1332: {'lr': 0.00033275, 'samples': 255744, 'steps': 1331, 'loss/train': 3.404329299926758}76138305664}} 11/06/2021 21:27:40 - INFO - __main__ - Step 1332: {'lr': 0.00033275, 'samples': 255744, 'steps': 1331, 'loss/train': 3.404329299926758}76138305664}} 11/06/2021 21:27:46 - INFO - __main__ - Step 1343: {'lr': 0.0003355, 'samples': 257856, 'steps': 1342, 'loss/train': 3.234477996826172}}76138305664}} 11/06/2021 21:27:48 - INFO - __main__ - Step 1347: {'lr': 0.00033650000000000005, 'samples': 258624, 'steps': 1346, 'loss/train': 3.310671329498291}} 11/06/2021 21:27:50 - INFO - __main__ - Step 1352: {'lr': 0.00033775, 'samples': 259584, 'steps': 1351, 'loss/train': 3.3598763942718506}1329498291}} 11/06/2021 21:27:53 - INFO - __main__ - Step 1356: {'lr': 0.00033875, 'samples': 260352, 'steps': 1355, 'loss/train': 3.3796229362487793}1329498291}} 11/06/2021 21:27:55 - INFO - __main__ - Step 1360: {'lr': 0.00033975, 'samples': 261120, 'steps': 1359, 'loss/train': 3.659208059310913}}1329498291}} 11/06/2021 21:27:56 - INFO - __main__ - Step 1364: {'lr': 0.00034075, 'samples': 261888, 'steps': 1363, 'loss/train': 4.257739543914795}}1329498291}} 11/06/2021 21:27:58 - INFO - __main__ - Step 1368: {'lr': 0.00034175, 'samples': 262656, 'steps': 1367, 'loss/train': 2.7847213745117188}1329498291}} 11/06/2021 21:27:58 - INFO - __main__ - Step 1368: {'lr': 0.00034175, 'samples': 262656, 'steps': 1367, 'loss/train': 2.7847213745117188}1329498291}} 11/06/2021 21:28:03 - INFO - __main__ - Step 1376: {'lr': 0.00034375, 'samples': 264192, 'steps': 1375, 'loss/train': 3.6449296474456787}1329498291}} 11/06/2021 21:28:05 - INFO - __main__ - Step 1380: {'lr': 0.00034475, 'samples': 264960, 'steps': 1379, 'loss/train': 3.7418301105499268}1329498291}} 11/06/2021 21:28:06 - INFO - __main__ - Step 1384: {'lr': 0.00034575000000000003, 'samples': 265728, 'steps': 1383, 'loss/train': 3.5448079109191895} 11/06/2021 21:28:08 - INFO - __main__ - Step 1388: {'lr': 0.00034675, 'samples': 266496, 'steps': 1387, 'loss/train': 5.252135276794434}079109191895} 11/06/2021 21:28:11 - INFO - __main__ - Step 1393: {'lr': 0.000348, 'samples': 267456, 'steps': 1392, 'loss/train': 3.698169231414795}4}079109191895} 11/06/2021 21:28:11 - INFO - __main__ - Step 1393: {'lr': 0.000348, 'samples': 267456, 'steps': 1392, 'loss/train': 3.698169231414795}4}079109191895} 11/06/2021 21:28:15 - INFO - __main__ - Step 1401: {'lr': 0.00035, 'samples': 268992, 'steps': 1400, 'loss/train': 3.4211158752441406}4}079109191895} 11/06/2021 21:28:16 - INFO - __main__ - Step 1405: {'lr': 0.00035099999999999997, 'samples': 269760, 'steps': 1404, 'loss/train': 3.948554277420044}} 11/06/2021 21:28:18 - INFO - __main__ - Step 1409: {'lr': 0.000352, 'samples': 270528, 'steps': 1408, 'loss/train': 3.544276237487793}8554277420044}} 11/06/2021 21:28:18 - INFO - __main__ - Step 1409: {'lr': 0.000352, 'samples': 270528, 'steps': 1408, 'loss/train': 3.544276237487793}8554277420044}} 11/06/2021 21:28:23 - INFO - __main__ - Step 1417: {'lr': 0.000354, 'samples': 272064, 'steps': 1416, 'loss/train': 3.9992828369140625}554277420044}} 11/06/2021 21:28:23 - INFO - __main__ - Step 1417: {'lr': 0.000354, 'samples': 272064, 'steps': 1416, 'loss/train': 3.9992828369140625}554277420044}} 11/06/2021 21:28:26 - INFO - __main__ - Step 1424: {'lr': 0.00035575, 'samples': 273408, 'steps': 1423, 'loss/train': 4.185265064239502}54277420044}} 11/06/2021 21:28:29 - INFO - __main__ - Step 1430: {'lr': 0.00035725000000000004, 'samples': 274560, 'steps': 1429, 'loss/train': 3.588322162628174}} 11/06/2021 21:28:31 - INFO - __main__ - Step 1434: {'lr': 0.00035825, 'samples': 275328, 'steps': 1433, 'loss/train': 3.3060061931610107}2162628174}} 11/06/2021 21:28:33 - INFO - __main__ - Step 1438: {'lr': 0.00035925000000000003, 'samples': 276096, 'steps': 1437, 'loss/train': 3.045379877090454}} 11/06/2021 21:28:33 - INFO - __main__ - Step 1438: {'lr': 0.00035925000000000003, 'samples': 276096, 'steps': 1437, 'loss/train': 3.045379877090454}} 11/06/2021 21:28:36 - INFO - __main__ - Step 1445: {'lr': 0.000361, 'samples': 277440, 'steps': 1444, 'loss/train': 3.245635747909546}5379877090454}} 11/06/2021 21:28:39 - INFO - __main__ - Step 1450: {'lr': 0.00036225000000000005, 'samples': 278400, 'steps': 1449, 'loss/train': 3.1428816318511963} 11/06/2021 21:28:41 - INFO - __main__ - Step 1455: {'lr': 0.0003635, 'samples': 279360, 'steps': 1454, 'loss/train': 3.257840871810913}8816318511963} 11/06/2021 21:28:41 - INFO - __main__ - Step 1455: {'lr': 0.0003635, 'samples': 279360, 'steps': 1454, 'loss/train': 3.257840871810913}8816318511963} 11/06/2021 21:28:44 - INFO - __main__ - Step 1462: {'lr': 0.00036525, 'samples': 280704, 'steps': 1461, 'loss/train': 3.5692012310028076}16318511963} 11/06/2021 21:28:46 - INFO - __main__ - Step 1466: {'lr': 0.00036625000000000004, 'samples': 281472, 'steps': 1465, 'loss/train': 3.158153772354126}} 11/06/2021 21:28:49 - INFO - __main__ - Step 1471: {'lr': 0.0003675, 'samples': 282432, 'steps': 1470, 'loss/train': 3.3369407653808594}53772354126}} 11/06/2021 21:28:51 - INFO - __main__ - Step 1475: {'lr': 0.0003685, 'samples': 283200, 'steps': 1474, 'loss/train': 2.7034976482391357}53772354126}} 11/06/2021 21:28:51 - INFO - __main__ - Step 1475: {'lr': 0.0003685, 'samples': 283200, 'steps': 1474, 'loss/train': 2.7034976482391357}53772354126}} 11/06/2021 21:28:54 - INFO - __main__ - Step 1481: {'lr': 0.00037, 'samples': 284352, 'steps': 1480, 'loss/train': 3.1665492057800293}7}53772354126}} 11/06/2021 21:28:54 - INFO - __main__ - Step 1481: {'lr': 0.00037, 'samples': 284352, 'steps': 1480, 'loss/train': 3.1665492057800293}7}53772354126}} 11/06/2021 21:28:58 - INFO - __main__ - Step 1489: {'lr': 0.000372, 'samples': 285888, 'steps': 1488, 'loss/train': 3.5077946186065674}}53772354126}} 11/06/2021 21:29:00 - INFO - __main__ - Step 1493: {'lr': 0.000373, 'samples': 286656, 'steps': 1492, 'loss/train': 3.127577066421509}}}53772354126}} 11/06/2021 21:29:00 - INFO - __main__ - Step 1493: {'lr': 0.000373, 'samples': 286656, 'steps': 1492, 'loss/train': 3.127577066421509}}}53772354126}} 11/06/2021 21:29:04 - INFO - __main__ - Step 1500: {'lr': 0.00037475000000000003, 'samples': 288000, 'steps': 1499, 'loss/train': 3.4952762126922607} 11/06/2021 21:29:07 - INFO - __main__ - Step 1504: {'lr': 0.00037575, 'samples': 288768, 'steps': 1503, 'loss/train': 4.3909077644348145}62126922607} 11/06/2021 21:29:09 - INFO - __main__ - Step 1509: {'lr': 0.000377, 'samples': 289728, 'steps': 1508, 'loss/train': 3.3940863609313965}5}62126922607} 11/06/2021 21:29:11 - INFO - __main__ - Step 1513: {'lr': 0.000378, 'samples': 290496, 'steps': 1512, 'loss/train': 3.0633351802825928}5}62126922607} 11/06/2021 21:29:11 - INFO - __main__ - Step 1513: {'lr': 0.000378, 'samples': 290496, 'steps': 1512, 'loss/train': 3.0633351802825928}5}62126922607} 11/06/2021 21:29:11 - INFO - __main__ - Step 1513: {'lr': 0.000378, 'samples': 290496, 'steps': 1512, 'loss/train': 3.0633351802825928}5}62126922607} 11/06/2021 21:29:17 - INFO - __main__ - Step 1523: {'lr': 0.00038050000000000003, 'samples': 292416, 'steps': 1522, 'loss/train': 1.1130292415618896} 11/06/2021 21:29:19 - INFO - __main__ - Step 1527: {'lr': 0.0003815, 'samples': 293184, 'steps': 1526, 'loss/train': 2.968132972717285}0292415618896} 11/06/2021 21:29:19 - INFO - __main__ - Step 1527: {'lr': 0.0003815, 'samples': 293184, 'steps': 1526, 'loss/train': 2.968132972717285}0292415618896} 11/06/2021 21:29:23 - INFO - __main__ - Step 1535: {'lr': 0.0003835, 'samples': 294720, 'steps': 1534, 'loss/train': 3.5091545581817627}292415618896} 11/06/2021 21:29:25 - INFO - __main__ - Step 1539: {'lr': 0.0003845, 'samples': 295488, 'steps': 1538, 'loss/train': 3.5664308071136475}292415618896} 11/06/2021 21:29:27 - INFO - __main__ - Step 1543: {'lr': 0.0003855, 'samples': 296256, 'steps': 1542, 'loss/train': 3.427462100982666}}292415618896} 11/06/2021 21:29:29 - INFO - __main__ - Step 1548: {'lr': 0.00038675, 'samples': 297216, 'steps': 1547, 'loss/train': 3.2712979316711426}92415618896} 11/06/2021 21:29:29 - INFO - __main__ - Step 1548: {'lr': 0.00038675, 'samples': 297216, 'steps': 1547, 'loss/train': 3.2712979316711426}92415618896} 11/06/2021 21:29:29 - INFO - __main__ - Step 1548: {'lr': 0.00038675, 'samples': 297216, 'steps': 1547, 'loss/train': 3.2712979316711426}92415618896} 11/06/2021 21:29:35 - INFO - __main__ - Step 1558: {'lr': 0.00038925, 'samples': 299136, 'steps': 1557, 'loss/train': 3.1180825233459473}92415618896} 11/06/2021 21:29:37 - INFO - __main__ - Step 1564: {'lr': 0.00039075, 'samples': 300288, 'steps': 1563, 'loss/train': 3.6809487342834473}92415618896} 11/06/2021 21:29:37 - INFO - __main__ - Step 1564: {'lr': 0.00039075, 'samples': 300288, 'steps': 1563, 'loss/train': 3.6809487342834473}92415618896} 11/06/2021 21:29:41 - INFO - __main__ - Step 1571: {'lr': 0.0003925, 'samples': 301632, 'steps': 1570, 'loss/train': 2.498239040374756}3}92415618896} 11/06/2021 21:29:43 - INFO - __main__ - Step 1575: {'lr': 0.0003935, 'samples': 302400, 'steps': 1574, 'loss/train': 2.607938289642334}3}92415618896} 11/06/2021 21:29:45 - INFO - __main__ - Step 1580: {'lr': 0.00039474999999999997, 'samples': 303360, 'steps': 1579, 'loss/train': 3.356081008911133}} 11/06/2021 21:29:47 - INFO - __main__ - Step 1584: {'lr': 0.00039575, 'samples': 304128, 'steps': 1583, 'loss/train': 3.310321807861328}81008911133}} 11/06/2021 21:29:49 - INFO - __main__ - Step 1588: {'lr': 0.00039675, 'samples': 304896, 'steps': 1587, 'loss/train': 3.2099974155426025}1008911133}} 11/06/2021 21:29:51 - INFO - __main__ - Step 1592: {'lr': 0.00039775, 'samples': 305664, 'steps': 1591, 'loss/train': 2.313783884048462}}1008911133}} 11/06/2021 21:29:53 - INFO - __main__ - Step 1596: {'lr': 0.00039875, 'samples': 306432, 'steps': 1595, 'loss/train': 2.543515205383301}}1008911133}} 11/06/2021 21:29:55 - INFO - __main__ - Step 1600: {'lr': 0.00039975, 'samples': 307200, 'steps': 1599, 'loss/train': 3.1416678428649902}1008911133}} 11/06/2021 21:29:58 - INFO - __main__ - Step 1605: {'lr': 0.00040100000000000004, 'samples': 308160, 'steps': 1604, 'loss/train': 3.134627103805542}} 11/06/2021 21:29:58 - INFO - __main__ - Step 1605: {'lr': 0.00040100000000000004, 'samples': 308160, 'steps': 1604, 'loss/train': 3.134627103805542}} 11/06/2021 21:30:01 - INFO - __main__ - Step 1612: {'lr': 0.00040275, 'samples': 309504, 'steps': 1611, 'loss/train': 3.279484987258911}27103805542}} 11/06/2021 21:30:03 - INFO - __main__ - Step 1616: {'lr': 0.00040375000000000003, 'samples': 310272, 'steps': 1615, 'loss/train': 2.6593739986419678} 11/06/2021 21:30:05 - INFO - __main__ - Step 1621: {'lr': 0.00040500000000000003, 'samples': 311232, 'steps': 1620, 'loss/train': 3.027345657348633}} 11/06/2021 21:30:05 - INFO - __main__ - Step 1621: {'lr': 0.00040500000000000003, 'samples': 311232, 'steps': 1620, 'loss/train': 3.027345657348633}} 11/06/2021 21:30:09 - INFO - __main__ - Step 1629: {'lr': 0.00040699999999999997, 'samples': 312768, 'steps': 1628, 'loss/train': 3.6013989448547363} 11/06/2021 21:30:11 - INFO - __main__ - Step 1633: {'lr': 0.000408, 'samples': 313536, 'steps': 1632, 'loss/train': 2.9736287593841553}3989448547363} 11/06/2021 21:30:13 - INFO - __main__ - Step 1637: {'lr': 0.00040899999999999997, 'samples': 314304, 'steps': 1636, 'loss/train': 3.0355074405670166} 11/06/2021 21:30:15 - INFO - __main__ - Step 1642: {'lr': 0.00041025, 'samples': 315264, 'steps': 1641, 'loss/train': 2.085822105407715}074405670166} 11/06/2021 21:30:15 - INFO - __main__ - Step 1642: {'lr': 0.00041025, 'samples': 315264, 'steps': 1641, 'loss/train': 2.085822105407715}074405670166} 11/06/2021 21:30:19 - INFO - __main__ - Step 1649: {'lr': 0.000412, 'samples': 316608, 'steps': 1648, 'loss/train': 3.3871734142303467}}074405670166} 11/06/2021 21:30:21 - INFO - __main__ - Step 1653: {'lr': 0.000413, 'samples': 317376, 'steps': 1652, 'loss/train': 3.269364595413208}}}074405670166} 11/06/2021 21:30:24 - INFO - __main__ - Step 1659: {'lr': 0.0004145, 'samples': 318528, 'steps': 1658, 'loss/train': 3.018845558166504}}074405670166} 11/06/2021 21:30:24 - INFO - __main__ - Step 1659: {'lr': 0.0004145, 'samples': 318528, 'steps': 1658, 'loss/train': 3.018845558166504}}074405670166} 11/06/2021 21:30:27 - INFO - __main__ - Step 1665: {'lr': 0.000416, 'samples': 319680, 'steps': 1664, 'loss/train': 2.756814956665039}}}074405670166} 11/06/2021 21:30:29 - INFO - __main__ - Step 1669: {'lr': 0.000417, 'samples': 320448, 'steps': 1668, 'loss/train': 2.675729274749756}}}074405670166} 11/06/2021 21:30:31 - INFO - __main__ - Step 1674: {'lr': 0.00041825, 'samples': 321408, 'steps': 1673, 'loss/train': 3.209273338317871}074405670166} 11/06/2021 21:30:34 - INFO - __main__ - Step 1678: {'lr': 0.00041925, 'samples': 322176, 'steps': 1677, 'loss/train': 2.869393825531006}074405670166} 11/06/2021 21:30:36 - INFO - __main__ - Step 1682: {'lr': 0.00042025, 'samples': 322944, 'steps': 1681, 'loss/train': 2.867621421813965}074405670166} 11/06/2021 21:30:37 - INFO - __main__ - Step 1686: {'lr': 0.00042125, 'samples': 323712, 'steps': 1685, 'loss/train': 3.0616962909698486}74405670166} 11/06/2021 21:30:39 - INFO - __main__ - Step 1690: {'lr': 0.00042225000000000005, 'samples': 324480, 'steps': 1689, 'loss/train': 2.9707889556884766} 11/06/2021 21:30:39 - INFO - __main__ - Step 1690: {'lr': 0.00042225000000000005, 'samples': 324480, 'steps': 1689, 'loss/train': 2.9707889556884766} 11/06/2021 21:30:39 - INFO - __main__ - Step 1690: {'lr': 0.00042225000000000005, 'samples': 324480, 'steps': 1689, 'loss/train': 2.9707889556884766} 11/06/2021 21:30:45 - INFO - __main__ - Step 1702: {'lr': 0.00042525, 'samples': 326784, 'steps': 1701, 'loss/train': 2.888021469116211}889556884766} 11/06/2021 21:30:47 - INFO - __main__ - Step 1706: {'lr': 0.00042625000000000003, 'samples': 327552, 'steps': 1705, 'loss/train': 2.8724474906921387} 11/06/2021 21:30:50 - INFO - __main__ - Step 1711: {'lr': 0.0004275, 'samples': 328512, 'steps': 1710, 'loss/train': 3.4125115871429443}474906921387} 11/06/2021 21:30:52 - INFO - __main__ - Step 1715: {'lr': 0.0004285, 'samples': 329280, 'steps': 1714, 'loss/train': 3.073770046234131}}474906921387} 11/06/2021 21:30:54 - INFO - __main__ - Step 1719: {'lr': 0.0004295, 'samples': 330048, 'steps': 1718, 'loss/train': 3.0501821041107178}474906921387} 11/06/2021 21:30:55 - INFO - __main__ - Step 1723: {'lr': 0.0004305, 'samples': 330816, 'steps': 1722, 'loss/train': 2.9833667278289795}474906921387} 11/06/2021 21:30:57 - INFO - __main__ - Step 1727: {'lr': 0.0004315, 'samples': 331584, 'steps': 1726, 'loss/train': 2.6767044067382812}474906921387} 11/06/2021 21:31:00 - INFO - __main__ - Step 1732: {'lr': 0.00043275000000000003, 'samples': 332544, 'steps': 1731, 'loss/train': 3.1521098613739014} 11/06/2021 21:31:02 - INFO - __main__ - Step 1736: {'lr': 0.00043375000000000005, 'samples': 333312, 'steps': 1735, 'loss/train': 2.645113945007324}} 11/06/2021 21:31:04 - INFO - __main__ - Step 1740: {'lr': 0.00043475, 'samples': 334080, 'steps': 1739, 'loss/train': 2.9333367347717285}3945007324}} 11/06/2021 21:31:05 - INFO - __main__ - Step 1744: {'lr': 0.00043575000000000005, 'samples': 334848, 'steps': 1743, 'loss/train': 2.652301788330078}} 11/06/2021 21:31:07 - INFO - __main__ - Step 1748: {'lr': 0.00043675, 'samples': 335616, 'steps': 1747, 'loss/train': 2.979066848754883}01788330078}} 11/06/2021 21:31:10 - INFO - __main__ - Step 1753: {'lr': 0.000438, 'samples': 336576, 'steps': 1752, 'loss/train': 2.3090498447418213}}01788330078}} 11/06/2021 21:31:12 - INFO - __main__ - Step 1757: {'lr': 0.000439, 'samples': 337344, 'steps': 1756, 'loss/train': 3.583254337310791}}}01788330078}} 11/06/2021 21:31:14 - INFO - __main__ - Step 1761: {'lr': 0.00044, 'samples': 338112, 'steps': 1760, 'loss/train': 2.958988904953003}}}}01788330078}} 11/06/2021 21:31:16 - INFO - __main__ - Step 1765: {'lr': 0.000441, 'samples': 338880, 'steps': 1764, 'loss/train': 2.26016902923584}}}}01788330078}} 11/06/2021 21:31:17 - INFO - __main__ - Step 1769: {'lr': 0.000442, 'samples': 339648, 'steps': 1768, 'loss/train': 3.196798086166382}}}01788330078}} 11/06/2021 21:31:19 - INFO - __main__ - Step 1773: {'lr': 0.00044300000000000003, 'samples': 340416, 'steps': 1772, 'loss/train': 2.7474842071533203} 11/06/2021 21:31:22 - INFO - __main__ - Step 1778: {'lr': 0.00044425, 'samples': 341376, 'steps': 1777, 'loss/train': 1.8993228673934937}42071533203} 11/06/2021 21:31:24 - INFO - __main__ - Step 1782: {'lr': 0.00044525, 'samples': 342144, 'steps': 1781, 'loss/train': 3.134248971939087}}42071533203} 11/06/2021 21:31:26 - INFO - __main__ - Step 1786: {'lr': 0.00044625, 'samples': 342912, 'steps': 1785, 'loss/train': 1.7093374729156494}42071533203} 11/06/2021 21:31:27 - INFO - __main__ - Step 1790: {'lr': 0.00044725, 'samples': 343680, 'steps': 1789, 'loss/train': 2.685868263244629}}42071533203} 11/06/2021 21:31:29 - INFO - __main__ - Step 1794: {'lr': 0.00044824999999999997, 'samples': 344448, 'steps': 1793, 'loss/train': 2.5539019107818604} 11/06/2021 21:31:32 - INFO - __main__ - Step 1799: {'lr': 0.00044950000000000003, 'samples': 345408, 'steps': 1798, 'loss/train': 2.5512850284576416} 11/06/2021 21:31:34 - INFO - __main__ - Step 1803: {'lr': 0.0004505, 'samples': 346176, 'steps': 1802, 'loss/train': 3.1000237464904785}850284576416} 11/06/2021 21:31:36 - INFO - __main__ - Step 1807: {'lr': 0.0004515, 'samples': 346944, 'steps': 1806, 'loss/train': 2.2962563037872314}850284576416} 11/06/2021 21:31:36 - INFO - __main__ - Step 1807: {'lr': 0.0004515, 'samples': 346944, 'steps': 1806, 'loss/train': 2.2962563037872314}850284576416} 11/06/2021 21:31:39 - INFO - __main__ - Step 1814: {'lr': 0.00045325, 'samples': 348288, 'steps': 1813, 'loss/train': 3.2586300373077393}50284576416} 11/06/2021 21:31:42 - INFO - __main__ - Step 1820: {'lr': 0.00045475, 'samples': 349440, 'steps': 1819, 'loss/train': 2.775869846343994}}50284576416} 11/06/2021 21:31:42 - INFO - __main__ - Step 1820: {'lr': 0.00045475, 'samples': 349440, 'steps': 1819, 'loss/train': 2.775869846343994}}50284576416} 11/06/2021 21:31:45 - INFO - __main__ - Step 1827: {'lr': 0.00045650000000000004, 'samples': 350784, 'steps': 1826, 'loss/train': 2.706651210784912}} 11/06/2021 21:31:48 - INFO - __main__ - Step 1831: {'lr': 0.0004575, 'samples': 351552, 'steps': 1830, 'loss/train': 3.1444313526153564}51210784912}} 11/06/2021 21:31:48 - INFO - __main__ - Step 1831: {'lr': 0.0004575, 'samples': 351552, 'steps': 1830, 'loss/train': 3.1444313526153564}51210784912}} 11/06/2021 21:31:52 - INFO - __main__ - Step 1839: {'lr': 0.00045950000000000006, 'samples': 353088, 'steps': 1838, 'loss/train': 3.515568971633911}} 11/06/2021 21:31:53 - INFO - __main__ - Step 1843: {'lr': 0.0004605, 'samples': 353856, 'steps': 1842, 'loss/train': 2.9142487049102783}68971633911}} 11/06/2021 21:31:55 - INFO - __main__ - Step 1847: {'lr': 0.00046150000000000005, 'samples': 354624, 'steps': 1846, 'loss/train': 3.0182647705078125} 11/06/2021 21:31:58 - INFO - __main__ - Step 1852: {'lr': 0.00046275, 'samples': 355584, 'steps': 1851, 'loss/train': 2.291944980621338}647705078125} 11/06/2021 21:31:58 - INFO - __main__ - Step 1852: {'lr': 0.00046275, 'samples': 355584, 'steps': 1851, 'loss/train': 2.291944980621338}647705078125} 11/06/2021 21:31:58 - INFO - __main__ - Step 1852: {'lr': 0.00046275, 'samples': 355584, 'steps': 1851, 'loss/train': 2.291944980621338}647705078125} 11/06/2021 21:32:03 - INFO - __main__ - Step 1863: {'lr': 0.00046550000000000004, 'samples': 357696, 'steps': 1862, 'loss/train': 2.8213651180267334} 11/06/2021 21:32:05 - INFO - __main__ - Step 1868: {'lr': 0.00046675, 'samples': 358656, 'steps': 1867, 'loss/train': 2.391618013381958}651180267334} 11/06/2021 21:32:05 - INFO - __main__ - Step 1868: {'lr': 0.00046675, 'samples': 358656, 'steps': 1867, 'loss/train': 2.391618013381958}651180267334} 11/06/2021 21:32:05 - INFO - __main__ - Step 1868: {'lr': 0.00046675, 'samples': 358656, 'steps': 1867, 'loss/train': 2.391618013381958}651180267334} 11/06/2021 21:32:11 - INFO - __main__ - Step 1879: {'lr': 0.0004695, 'samples': 360768, 'steps': 1878, 'loss/train': 2.863210916519165}}651180267334} 11/06/2021 21:32:13 - INFO - __main__ - Step 1884: {'lr': 0.00047075000000000003, 'samples': 361728, 'steps': 1883, 'loss/train': 2.5549018383026123} 11/06/2021 21:32:13 - INFO - __main__ - Step 1884: {'lr': 0.00047075000000000003, 'samples': 361728, 'steps': 1883, 'loss/train': 2.5549018383026123} 11/06/2021 21:32:18 - INFO - __main__ - Step 1892: {'lr': 0.00047275, 'samples': 363264, 'steps': 1891, 'loss/train': 2.8816325664520264}18383026123} 11/06/2021 21:32:19 - INFO - __main__ - Step 1896: {'lr': 0.00047375, 'samples': 364032, 'steps': 1895, 'loss/train': 2.8144490718841553}18383026123} 11/06/2021 21:32:21 - INFO - __main__ - Step 1900: {'lr': 0.00047475, 'samples': 364800, 'steps': 1899, 'loss/train': 3.1232104301452637}18383026123} 11/06/2021 21:32:24 - INFO - __main__ - Step 1905: {'lr': 0.00047599999999999997, 'samples': 365760, 'steps': 1904, 'loss/train': 2.754312515258789}} 11/06/2021 21:32:26 - INFO - __main__ - Step 1909: {'lr': 0.000477, 'samples': 366528, 'steps': 1908, 'loss/train': 3.0879056453704834}312515258789}} 11/06/2021 21:32:26 - INFO - __main__ - Step 1909: {'lr': 0.000477, 'samples': 366528, 'steps': 1908, 'loss/train': 3.0879056453704834}312515258789}} 11/06/2021 21:32:30 - INFO - __main__ - Step 1917: {'lr': 0.000479, 'samples': 368064, 'steps': 1916, 'loss/train': 2.7934629917144775}312515258789}} 11/06/2021 21:32:31 - INFO - __main__ - Step 1921: {'lr': 0.00048, 'samples': 368832, 'steps': 1920, 'loss/train': 4.627929210662842}5}312515258789}} 11/06/2021 21:32:33 - INFO - __main__ - Step 1925: {'lr': 0.000481, 'samples': 369600, 'steps': 1924, 'loss/train': 2.7547266483306885}312515258789}} 11/06/2021 21:32:36 - INFO - __main__ - Step 1930: {'lr': 0.00048225000000000004, 'samples': 370560, 'steps': 1929, 'loss/train': 3.3702750205993652} 11/06/2021 21:32:38 - INFO - __main__ - Step 1934: {'lr': 0.00048325, 'samples': 371328, 'steps': 1933, 'loss/train': 1.854628324508667}750205993652} 11/06/2021 21:32:40 - INFO - __main__ - Step 1938: {'lr': 0.00048425000000000003, 'samples': 372096, 'steps': 1937, 'loss/train': 2.217445135116577}} 11/06/2021 21:32:41 - INFO - __main__ - Step 1942: {'lr': 0.00048525, 'samples': 372864, 'steps': 1941, 'loss/train': 2.725562334060669}45135116577}} 11/06/2021 21:32:43 - INFO - __main__ - Step 1946: {'lr': 0.00048625000000000003, 'samples': 373632, 'steps': 1945, 'loss/train': 2.831516981124878}} 11/06/2021 21:32:46 - INFO - __main__ - Step 1951: {'lr': 0.0004875, 'samples': 374592, 'steps': 1950, 'loss/train': 3.081878900527954}516981124878}} 11/06/2021 21:32:48 - INFO - __main__ - Step 1956: {'lr': 0.00048875, 'samples': 375552, 'steps': 1955, 'loss/train': 2.585529327392578}16981124878}} 11/06/2021 21:32:50 - INFO - __main__ - Step 1960: {'lr': 0.0004897500000000001, 'samples': 376320, 'steps': 1959, 'loss/train': 2.583451986312866}}} 11/06/2021 21:32:50 - INFO - __main__ - Step 1960: {'lr': 0.0004897500000000001, 'samples': 376320, 'steps': 1959, 'loss/train': 2.583451986312866}}} 11/06/2021 21:32:54 - INFO - __main__ - Step 1967: {'lr': 0.0004915, 'samples': 377664, 'steps': 1966, 'loss/train': 3.0170748233795166}1986312866}}} 11/06/2021 21:32:56 - INFO - __main__ - Step 1972: {'lr': 0.00049275, 'samples': 378624, 'steps': 1971, 'loss/train': 2.834716796875}66}1986312866}}} 11/06/2021 21:32:58 - INFO - __main__ - Step 1977: {'lr': 0.000494, 'samples': 379584, 'steps': 1976, 'loss/train': 2.7673490047454834}}1986312866}}} 11/06/2021 21:33:00 - INFO - __main__ - Step 1981: {'lr': 0.000495, 'samples': 380352, 'steps': 1980, 'loss/train': 2.695145845413208}}}1986312866}}} 11/06/2021 21:33:03 - INFO - __main__ - Step 1985: {'lr': 0.000496, 'samples': 381120, 'steps': 1984, 'loss/train': 2.6449930667877197}}1986312866}}} 11/06/2021 21:33:04 - INFO - __main__ - Step 1989: {'lr': 0.000497, 'samples': 381888, 'steps': 1988, 'loss/train': 2.3569090366363525}}1986312866}}} 11/06/2021 21:33:06 - INFO - __main__ - Step 1993: {'lr': 0.000498, 'samples': 382656, 'steps': 1992, 'loss/train': 2.463730573654175}}}1986312866}}} 11/06/2021 21:33:08 - INFO - __main__ - Step 1998: {'lr': 0.00049925, 'samples': 383616, 'steps': 1997, 'loss/train': 2.460160970687866}1986312866}}} 11/06/2021 21:33:10 - INFO - __main__ - Step 2002: {'lr': 0.0004999999999436769, 'samples': 384384, 'steps': 2001, 'loss/train': 2.369316816329956}}} 11/06/2021 21:33:13 - INFO - __main__ - Step 2006: {'lr': 0.0004999999985919232, 'samples': 385152, 'steps': 2005, 'loss/train': 2.7397634983062744}} 11/06/2021 21:33:14 - INFO - __main__ - Step 2010: {'lr': 0.0004999999954378312, 'samples': 385920, 'steps': 2009, 'loss/train': 2.108156681060791}}} 11/06/2021 21:33:16 - INFO - __main__ - Step 2014: {'lr': 0.000499999990481401, 'samples': 386688, 'steps': 2013, 'loss/train': 2.6608567237854004}}} 11/06/2021 21:33:18 - INFO - __main__ - Step 2018: {'lr': 0.0004999999837226326, 'samples': 387456, 'steps': 2017, 'loss/train': 2.4174294471740723}} 11/06/2021 21:33:20 - INFO - __main__ - Step 2022: {'lr': 0.0004999999751615261, 'samples': 388224, 'steps': 2021, 'loss/train': 2.4763200283050537}} 11/06/2021 21:33:22 - INFO - __main__ - Step 2026: {'lr': 0.0004999999647980814, 'samples': 388992, 'steps': 2025, 'loss/train': 2.2224512100219727}} 11/06/2021 21:33:24 - INFO - __main__ - Step 2030: {'lr': 0.0004999999526322987, 'samples': 389760, 'steps': 2029, 'loss/train': 2.9428741931915283}} 11/06/2021 21:33:26 - INFO - __main__ - Step 2035: {'lr': 0.0004999999348905326, 'samples': 390720, 'steps': 2034, 'loss/train': 2.2460341453552246}} 11/06/2021 21:33:28 - INFO - __main__ - Step 2039: {'lr': 0.0004999999186694897, 'samples': 391488, 'steps': 2038, 'loss/train': 2.7766733169555664}} 11/06/2021 21:33:30 - INFO - __main__ - Step 2043: {'lr': 0.0004999999006461091, 'samples': 392256, 'steps': 2042, 'loss/train': 2.7250239849090576}} 11/06/2021 21:33:30 - INFO - __main__ - Step 2043: {'lr': 0.0004999999006461091, 'samples': 392256, 'steps': 2042, 'loss/train': 2.7250239849090576}} 11/06/2021 21:33:34 - INFO - __main__ - Step 2050: {'lr': 0.0004999998647683184, 'samples': 393600, 'steps': 2049, 'loss/train': 2.4513065814971924}} 11/06/2021 21:33:36 - INFO - __main__ - Step 2056: {'lr': 0.0004999998296227291, 'samples': 394752, 'steps': 2055, 'loss/train': 2.863607168197632}}} 11/06/2021 21:33:36 - INFO - __main__ - Step 2056: {'lr': 0.0004999998296227291, 'samples': 394752, 'steps': 2055, 'loss/train': 2.863607168197632}}} 11/06/2021 21:33:40 - INFO - __main__ - Step 2063: {'lr': 0.0004999997834941459, 'samples': 396096, 'steps': 2062, 'loss/train': 3.102501153945923}}} 11/06/2021 21:33:42 - INFO - __main__ - Step 2067: {'lr': 0.0004999997546567423, 'samples': 396864, 'steps': 2066, 'loss/train': 2.925659418106079}}} 11/06/2021 21:33:44 - INFO - __main__ - Step 2072: {'lr': 0.0004999997160754522, 'samples': 397824, 'steps': 2071, 'loss/train': 2.0112996101379395}} 11/06/2021 21:33:46 - INFO - __main__ - Step 2076: {'lr': 0.0004999996831827918, 'samples': 398592, 'steps': 2075, 'loss/train': 2.0490965843200684}} 11/06/2021 21:33:48 - INFO - __main__ - Step 2080: {'lr': 0.0004999996484877955, 'samples': 399360, 'steps': 2079, 'loss/train': 2.834831953048706}}} 11/06/2021 21:33:50 - INFO - __main__ - Step 2084: {'lr': 0.0004999996119904633, 'samples': 400128, 'steps': 2083, 'loss/train': 2.5453062057495117}} 11/06/2021 21:33:52 - INFO - __main__ - Step 2088: {'lr': 0.0004999995736907957, 'samples': 400896, 'steps': 2087, 'loss/train': 2.4093174934387207}} 11/06/2021 21:33:54 - INFO - __main__ - Step 2093: {'lr': 0.0004999995232816774, 'samples': 401856, 'steps': 2092, 'loss/train': 2.0404558181762695}} 11/06/2021 21:33:57 - INFO - __main__ - Step 2098: {'lr': 0.0004999994700564109, 'samples': 402816, 'steps': 2097, 'loss/train': 2.6748390197753906}} 11/06/2021 21:33:57 - INFO - __main__ - Step 2098: {'lr': 0.0004999994700564109, 'samples': 402816, 'steps': 2097, 'loss/train': 2.6748390197753906}} 11/06/2021 21:34:00 - INFO - __main__ - Step 2105: {'lr': 0.00049999939080991, 'samples': 404160, 'steps': 2104, 'loss/train': 1.7636381387710571}6}} 11/06/2021 21:34:02 - INFO - __main__ - Step 2109: {'lr': 0.000499999343047986, 'samples': 404928, 'steps': 2108, 'loss/train': 2.693075180053711}6}} 11/06/2021 21:34:04 - INFO - __main__ - Step 2114: {'lr': 0.0004999992808110495, 'samples': 405888, 'steps': 2113, 'loss/train': 2.4178295135498047}} 11/06/2021 21:34:04 - INFO - __main__ - Step 2114: {'lr': 0.0004999992808110495, 'samples': 405888, 'steps': 2113, 'loss/train': 2.4178295135498047}} 11/06/2021 21:34:08 - INFO - __main__ - Step 2122: {'lr': 0.0004999991753743689, 'samples': 407424, 'steps': 2121, 'loss/train': 2.5452983379364014}} 11/06/2021 21:34:10 - INFO - __main__ - Step 2126: {'lr': 0.0004999991199525299, 'samples': 408192, 'steps': 2125, 'loss/train': 2.357633113861084}}} 11/06/2021 21:34:12 - INFO - __main__ - Step 2130: {'lr': 0.000499999062728359, 'samples': 408960, 'steps': 2129, 'loss/train': 2.727692127227783}}}} 11/06/2021 21:34:14 - INFO - __main__ - Step 2135: {'lr': 0.0004999989886636166, 'samples': 409920, 'steps': 2134, 'loss/train': 2.996830463409424}}} 11/06/2021 21:34:14 - INFO - __main__ - Step 2135: {'lr': 0.0004999989886636166, 'samples': 409920, 'steps': 2134, 'loss/train': 2.996830463409424}}} 11/06/2021 21:34:18 - INFO - __main__ - Step 2142: {'lr': 0.0004999988802418587, 'samples': 411264, 'steps': 2141, 'loss/train': 2.5951883792877197}} 11/06/2021 21:34:20 - INFO - __main__ - Step 2146: {'lr': 0.0004999988158083643, 'samples': 412032, 'steps': 2145, 'loss/train': 2.607226848602295}}} 11/06/2021 21:34:22 - INFO - __main__ - Step 2151: {'lr': 0.0004999987327319701, 'samples': 412992, 'steps': 2150, 'loss/train': 2.9837472438812256}} 11/06/2021 21:34:25 - INFO - __main__ - Step 2156: {'lr': 0.0004999986468394367, 'samples': 413952, 'steps': 2155, 'loss/train': 2.50052547454834}6}} 11/06/2021 21:34:27 - INFO - __main__ - Step 2160: {'lr': 0.0004999985760977903, 'samples': 414720, 'steps': 2159, 'loss/train': 2.2128641605377197}} 11/06/2021 21:34:27 - INFO - __main__ - Step 2160: {'lr': 0.0004999985760977903, 'samples': 414720, 'steps': 2159, 'loss/train': 2.2128641605377197}} 11/06/2021 21:34:30 - INFO - __main__ - Step 2167: {'lr': 0.0004999984479630577, 'samples': 416064, 'steps': 2166, 'loss/train': 2.6970345973968506}} 11/06/2021 21:34:32 - INFO - __main__ - Step 2172: {'lr': 0.0004999983530588853, 'samples': 417024, 'steps': 2171, 'loss/train': 2.061142921447754}}} 11/06/2021 21:34:35 - INFO - __main__ - Step 2177: {'lr': 0.0004999982553385778, 'samples': 417984, 'steps': 2176, 'loss/train': 2.5779266357421875}} 11/06/2021 21:34:35 - INFO - __main__ - Step 2177: {'lr': 0.0004999982553385778, 'samples': 417984, 'steps': 2176, 'loss/train': 2.5779266357421875}} 11/06/2021 21:34:38 - INFO - __main__ - Step 2184: {'lr': 0.0004999981137990425, 'samples': 419328, 'steps': 2183, 'loss/train': 3.1171112060546875}} 11/06/2021 21:34:40 - INFO - __main__ - Step 2188: {'lr': 0.0004999980304411116, 'samples': 420096, 'steps': 2187, 'loss/train': 2.344801902770996}}} 11/06/2021 21:34:43 - INFO - __main__ - Step 2194: {'lr': 0.0004999979020248577, 'samples': 421248, 'steps': 2193, 'loss/train': 2.4983091354370117}} 11/06/2021 21:34:43 - INFO - __main__ - Step 2194: {'lr': 0.0004999979020248577, 'samples': 421248, 'steps': 2193, 'loss/train': 2.4983091354370117}} 11/06/2021 21:34:47 - INFO - __main__ - Step 2201: {'lr': 0.0004999977470805383, 'samples': 422592, 'steps': 2200, 'loss/train': 2.3798577785491943}} 11/06/2021 21:34:48 - INFO - __main__ - Step 2205: {'lr': 0.0004999976560627344, 'samples': 423360, 'steps': 2204, 'loss/train': 2.8710994720458984}} 11/06/2021 21:34:50 - INFO - __main__ - Step 2209: {'lr': 0.000499997563242609, 'samples': 424128, 'steps': 2208, 'loss/train': 2.5492539405822754}}} 11/06/2021 21:34:53 - INFO - __main__ - Step 2214: {'lr': 0.0004999974446829389, 'samples': 425088, 'steps': 2213, 'loss/train': 2.4633007049560547}} 11/06/2021 21:34:55 - INFO - __main__ - Step 2218: {'lr': 0.0004999973478075928, 'samples': 425856, 'steps': 2217, 'loss/train': 2.6798555850982666}} 11/06/2021 21:34:57 - INFO - __main__ - Step 2222: {'lr': 0.0004999972491299276, 'samples': 426624, 'steps': 2221, 'loss/train': 2.5645909309387207}} 11/06/2021 21:34:58 - INFO - __main__ - Step 2226: {'lr': 0.000499997148649944, 'samples': 427392, 'steps': 2225, 'loss/train': 2.6324121952056885}}} 11/06/2021 21:35:00 - INFO - __main__ - Step 2230: {'lr': 0.0004999970463676427, 'samples': 428160, 'steps': 2229, 'loss/train': 2.492368221282959}}} 11/06/2021 21:35:03 - INFO - __main__ - Step 2235: {'lr': 0.0004999969159802577, 'samples': 429120, 'steps': 2234, 'loss/train': 2.6638810634613037}} 11/06/2021 21:35:05 - INFO - __main__ - Step 2239: {'lr': 0.0004999968096427443, 'samples': 429888, 'steps': 2238, 'loss/train': 1.956204891204834}}} 11/06/2021 21:35:07 - INFO - __main__ - Step 2243: {'lr': 0.0004999967015029155, 'samples': 430656, 'steps': 2242, 'loss/train': 2.4464919567108154}} 11/06/2021 21:35:08 - INFO - __main__ - Step 2247: {'lr': 0.0004999965915607722, 'samples': 431424, 'steps': 2246, 'loss/train': 1.3028373718261719}} 11/06/2021 21:35:10 - INFO - __main__ - Step 2251: {'lr': 0.0004999964798163152, 'samples': 432192, 'steps': 2250, 'loss/train': 2.384847640991211}}} 11/06/2021 21:35:13 - INFO - __main__ - Step 2256: {'lr': 0.0004999963376012416, 'samples': 433152, 'steps': 2255, 'loss/train': 2.2551157474517822}} 11/06/2021 21:35:13 - INFO - __main__ - Step 2256: {'lr': 0.0004999963376012416, 'samples': 433152, 'steps': 2255, 'loss/train': 2.2551157474517822}} 11/06/2021 21:35:17 - INFO - __main__ - Step 2264: {'lr': 0.0004999961041996109, 'samples': 434688, 'steps': 2263, 'loss/train': 2.3833937644958496}} 11/06/2021 21:35:18 - INFO - __main__ - Step 2268: {'lr': 0.0004999959847953299, 'samples': 435456, 'steps': 2267, 'loss/train': 1.9739242792129517}} 11/06/2021 21:35:20 - INFO - __main__ - Step 2272: {'lr': 0.0004999958635887394, 'samples': 436224, 'steps': 2271, 'loss/train': 2.7278921604156494}} 11/06/2021 21:35:23 - INFO - __main__ - Step 2278: {'lr': 0.0004999956783995257, 'samples': 437376, 'steps': 2277, 'loss/train': 2.585498809814453}}} 11/06/2021 21:35:25 - INFO - __main__ - Step 2282: {'lr': 0.0004999955526871659, 'samples': 438144, 'steps': 2281, 'loss/train': 1.7193819284439087}} 11/06/2021 21:35:27 - INFO - __main__ - Step 2286: {'lr': 0.0004999954251724999, 'samples': 438912, 'steps': 2285, 'loss/train': 2.635653495788574}}} 11/06/2021 21:35:27 - INFO - __main__ - Step 2286: {'lr': 0.0004999954251724999, 'samples': 438912, 'steps': 2285, 'loss/train': 2.635653495788574}}} 11/06/2021 21:35:31 - INFO - __main__ - Step 2293: {'lr': 0.0004999951976850377, 'samples': 440256, 'steps': 2292, 'loss/train': 1.871019721031189}}} 11/06/2021 21:35:33 - INFO - __main__ - Step 2298: {'lr': 0.0004999950318146737, 'samples': 441216, 'steps': 2297, 'loss/train': 2.5910117626190186}} 11/06/2021 21:35:35 - INFO - __main__ - Step 2302: {'lr': 0.0004999948970907921, 'samples': 441984, 'steps': 2301, 'loss/train': 2.345691442489624}}} 11/06/2021 21:35:35 - INFO - __main__ - Step 2302: {'lr': 0.0004999948970907921, 'samples': 441984, 'steps': 2301, 'loss/train': 2.345691442489624}}} 11/06/2021 21:35:38 - INFO - __main__ - Step 2309: {'lr': 0.0004999946569872118, 'samples': 443328, 'steps': 2308, 'loss/train': 2.174234628677368}}} 11/06/2021 21:35:41 - INFO - __main__ - Step 2314: {'lr': 0.0004999944821053422, 'samples': 444288, 'steps': 2313, 'loss/train': 2.653313636779785}}} 11/06/2021 21:35:43 - INFO - __main__ - Step 2319: {'lr': 0.0004999943044073813, 'samples': 445248, 'steps': 2318, 'loss/train': 2.925724983215332}}} 11/06/2021 21:35:45 - INFO - __main__ - Step 2323: {'lr': 0.000499994160221428, 'samples': 446016, 'steps': 2322, 'loss/train': 2.5466785430908203}}} 11/06/2021 21:35:45 - INFO - __main__ - Step 2323: {'lr': 0.000499994160221428, 'samples': 446016, 'steps': 2322, 'loss/train': 2.5466785430908203}}} 11/06/2021 21:35:49 - INFO - __main__ - Step 2330: {'lr': 0.0004999939035592351, 'samples': 447360, 'steps': 2329, 'loss/train': 1.2856340408325195}} 11/06/2021 21:35:51 - INFO - __main__ - Step 2335: {'lr': 0.0004999937168497954, 'samples': 448320, 'steps': 2334, 'loss/train': 2.982269763946533}}} 11/06/2021 21:35:53 - INFO - __main__ - Step 2339: {'lr': 0.0004999935654546638, 'samples': 449088, 'steps': 2338, 'loss/train': 2.4741744995117188}} 11/06/2021 21:35:53 - INFO - __main__ - Step 2339: {'lr': 0.0004999935654546638, 'samples': 449088, 'steps': 2338, 'loss/train': 2.4741744995117188}} 11/06/2021 21:35:57 - INFO - __main__ - Step 2346: {'lr': 0.0004999932961764192, 'samples': 450432, 'steps': 2345, 'loss/train': 2.2040252685546875}} 11/06/2021 21:35:59 - INFO - __main__ - Step 2350: {'lr': 0.0004999931398249876, 'samples': 451200, 'steps': 2349, 'loss/train': 1.3503772020339966}} 11/06/2021 21:36:01 - INFO - __main__ - Step 2355: {'lr': 0.0004999929418512296, 'samples': 452160, 'steps': 2354, 'loss/train': 2.645233392715454}}} 11/06/2021 21:36:03 - INFO - __main__ - Step 2359: {'lr': 0.0004999927814446498, 'samples': 452928, 'steps': 2358, 'loss/train': 2.382434844970703}}} 11/06/2021 21:36:05 - INFO - __main__ - Step 2363: {'lr': 0.0004999926192357836, 'samples': 453696, 'steps': 2362, 'loss/train': 2.667649269104004}}} 11/06/2021 21:36:07 - INFO - __main__ - Step 2367: {'lr': 0.0004999924552246324, 'samples': 454464, 'steps': 2366, 'loss/train': 2.5451650619506836}} 11/06/2021 21:36:09 - INFO - __main__ - Step 2371: {'lr': 0.0004999922894111975, 'samples': 455232, 'steps': 2370, 'loss/train': 2.5309088230133057}} 11/06/2021 21:36:11 - INFO - __main__ - Step 2376: {'lr': 0.0004999920796099437, 'samples': 456192, 'steps': 2375, 'loss/train': 2.113866090774536}}} 11/06/2021 21:36:13 - INFO - __main__ - Step 2380: {'lr': 0.0004999919097413743, 'samples': 456960, 'steps': 2379, 'loss/train': 2.0128252506256104}} 11/06/2021 21:36:16 - INFO - __main__ - Step 2384: {'lr': 0.000499991738070525, 'samples': 457728, 'steps': 2383, 'loss/train': 2.820688247680664}4}} 11/06/2021 21:36:17 - INFO - __main__ - Step 2388: {'lr': 0.000499991564597397, 'samples': 458496, 'steps': 2387, 'loss/train': 2.985884189605713}4}} 11/06/2021 21:36:19 - INFO - __main__ - Step 2392: {'lr': 0.0004999913893219915, 'samples': 459264, 'steps': 2391, 'loss/train': 2.4803824424743652}} 11/06/2021 21:36:22 - INFO - __main__ - Step 2397: {'lr': 0.0004999911676932838, 'samples': 460224, 'steps': 2396, 'loss/train': 2.7710964679718018}} 11/06/2021 21:36:24 - INFO - __main__ - Step 2401: {'lr': 0.0004999909883627587, 'samples': 460992, 'steps': 2400, 'loss/train': 2.3043596744537354}} 11/06/2021 21:36:26 - INFO - __main__ - Step 2405: {'lr': 0.0004999908072299602, 'samples': 461760, 'steps': 2404, 'loss/train': 2.4561195373535156}} 11/06/2021 21:36:26 - INFO - __main__ - Step 2405: {'lr': 0.0004999908072299602, 'samples': 461760, 'steps': 2404, 'loss/train': 2.4561195373535156}} 11/06/2021 21:36:29 - INFO - __main__ - Step 2412: {'lr': 0.0004999904859108467, 'samples': 463104, 'steps': 2411, 'loss/train': 2.3780782222747803}} 11/06/2021 21:36:31 - INFO - __main__ - Step 2417: {'lr': 0.000499990253017938, 'samples': 464064, 'steps': 2416, 'loss/train': 2.537015438079834}3}} 11/06/2021 21:36:31 - INFO - __main__ - Step 2417: {'lr': 0.000499990253017938, 'samples': 464064, 'steps': 2416, 'loss/train': 2.537015438079834}3}} 11/06/2021 21:36:35 - INFO - __main__ - Step 2425: {'lr': 0.0004999898745319145, 'samples': 465600, 'steps': 2424, 'loss/train': 2.419081449508667}}} 11/06/2021 21:36:37 - INFO - __main__ - Step 2429: {'lr': 0.000499989682585504, 'samples': 466368, 'steps': 2428, 'loss/train': 1.5299700498580933}}} 11/06/2021 21:36:39 - INFO - __main__ - Step 2434: {'lr': 0.0004999894401180576, 'samples': 467328, 'steps': 2433, 'loss/train': 2.631399631500244}}} 11/06/2021 21:36:39 - INFO - __main__ - Step 2434: {'lr': 0.0004999894401180576, 'samples': 467328, 'steps': 2433, 'loss/train': 2.631399631500244}}} 11/06/2021 21:36:44 - INFO - __main__ - Step 2442: {'lr': 0.0004999890463127924, 'samples': 468864, 'steps': 2441, 'loss/train': 2.5758795738220215}} 11/06/2021 21:36:45 - INFO - __main__ - Step 2446: {'lr': 0.0004999888467067702, 'samples': 469632, 'steps': 2445, 'loss/train': 2.458528757095337}}} 11/06/2021 21:36:47 - INFO - __main__ - Step 2450: {'lr': 0.00049998864529849, 'samples': 470400, 'steps': 2449, 'loss/train': 1.814773440361023}7}}} 11/06/2021 21:36:49 - INFO - __main__ - Step 2454: {'lr': 0.0004999884420879534, 'samples': 471168, 'steps': 2453, 'loss/train': 2.2643680572509766}} 11/06/2021 21:36:49 - INFO - __main__ - Step 2454: {'lr': 0.0004999884420879534, 'samples': 471168, 'steps': 2453, 'loss/train': 2.2643680572509766}} 11/06/2021 21:36:53 - INFO - __main__ - Step 2461: {'lr': 0.0004999880821328395, 'samples': 472512, 'steps': 2460, 'loss/train': 2.13460373878479}6}} 11/06/2021 21:36:55 - INFO - __main__ - Step 2466: {'lr': 0.0004999878216428201, 'samples': 473472, 'steps': 2465, 'loss/train': 3.099053144454956}}} 11/06/2021 21:36:57 - INFO - __main__ - Step 2470: {'lr': 0.0004999876112232726, 'samples': 474240, 'steps': 2469, 'loss/train': 2.2806897163391113}} 11/06/2021 21:36:59 - INFO - __main__ - Step 2474: {'lr': 0.0004999873990014763, 'samples': 475008, 'steps': 2473, 'loss/train': 2.296961784362793}}} 11/06/2021 21:36:59 - INFO - __main__ - Step 2474: {'lr': 0.0004999873990014763, 'samples': 475008, 'steps': 2473, 'loss/train': 2.296961784362793}}} 11/06/2021 21:37:03 - INFO - __main__ - Step 2481: {'lr': 0.0004999870232766756, 'samples': 476352, 'steps': 2480, 'loss/train': 2.6014020442962646}} 11/06/2021 21:37:05 - INFO - __main__ - Step 2486: {'lr': 0.0004999867515226088, 'samples': 477312, 'steps': 2485, 'loss/train': 2.565920829772949}}} 11/06/2021 21:37:05 - INFO - __main__ - Step 2486: {'lr': 0.0004999867515226088, 'samples': 477312, 'steps': 2485, 'loss/train': 2.565920829772949}}} 11/06/2021 21:37:09 - INFO - __main__ - Step 2494: {'lr': 0.000499986310858814, 'samples': 478848, 'steps': 2493, 'loss/train': 2.30643892288208}9}}} 11/06/2021 21:37:11 - INFO - __main__ - Step 2498: {'lr': 0.0004999860878235564, 'samples': 479616, 'steps': 2497, 'loss/train': 2.4118523597717285}} 11/06/2021 21:37:13 - INFO - __main__ - Step 2502: {'lr': 0.0004999858629860609, 'samples': 480384, 'steps': 2501, 'loss/train': 2.3400754928588867}} 11/06/2021 21:37:15 - INFO - __main__ - Step 2506: {'lr': 0.000499985636346329, 'samples': 481152, 'steps': 2505, 'loss/train': 2.2163236141204834}}} 11/06/2021 21:37:17 - INFO - __main__ - Step 2511: {'lr': 0.0004999853505122718, 'samples': 482112, 'steps': 2510, 'loss/train': 2.314603567123413}}} 11/06/2021 21:37:19 - INFO - __main__ - Step 2515: {'lr': 0.0004999851198175141, 'samples': 482880, 'steps': 2514, 'loss/train': 2.3666229248046875}} 11/06/2021 21:37:19 - INFO - __main__ - Step 2515: {'lr': 0.0004999851198175141, 'samples': 482880, 'steps': 2514, 'loss/train': 2.3666229248046875}} 11/06/2021 21:37:23 - INFO - __main__ - Step 2522: {'lr': 0.0004999847117650708, 'samples': 484224, 'steps': 2521, 'loss/train': 2.2497646808624268}} 11/06/2021 21:37:25 - INFO - __main__ - Step 2527: {'lr': 0.0004999844169198617, 'samples': 485184, 'steps': 2526, 'loss/train': 2.444234609603882}}} 11/06/2021 21:37:28 - INFO - __main__ - Step 2532: {'lr': 0.0004999841192586746, 'samples': 486144, 'steps': 2531, 'loss/train': 2.247392177581787}}} 11/06/2021 21:37:28 - INFO - __main__ - Step 2532: {'lr': 0.0004999841192586746, 'samples': 486144, 'steps': 2531, 'loss/train': 2.247392177581787}}} 11/06/2021 21:37:31 - INFO - __main__ - Step 2539: {'lr': 0.000499983697802176, 'samples': 487488, 'steps': 2538, 'loss/train': 2.763502836227417}}}} 11/06/2021 21:37:33 - INFO - __main__ - Step 2543: {'lr': 0.0004999834544918369, 'samples': 488256, 'steps': 2542, 'loss/train': 1.3541216850280762}} 11/06/2021 21:37:35 - INFO - __main__ - Step 2548: {'lr': 0.0004999831478195429, 'samples': 489216, 'steps': 2547, 'loss/train': 2.1510307788848877}} 11/06/2021 21:37:38 - INFO - __main__ - Step 2553: {'lr': 0.0004999828383312851, 'samples': 490176, 'steps': 2552, 'loss/train': 2.293485403060913}}} 11/06/2021 21:37:40 - INFO - __main__ - Step 2557: {'lr': 0.0004999825887131874, 'samples': 490944, 'steps': 2556, 'loss/train': 2.3221940994262695}} 11/06/2021 21:37:42 - INFO - __main__ - Step 2561: {'lr': 0.000499982337292877, 'samples': 491712, 'steps': 2560, 'loss/train': 2.518115758895874}5}} 11/06/2021 21:37:44 - INFO - __main__ - Step 2565: {'lr': 0.0004999820840703554, 'samples': 492480, 'steps': 2564, 'loss/train': 2.438570022583008}}} 11/06/2021 21:37:45 - INFO - __main__ - Step 2569: {'lr': 0.0004999818290456249, 'samples': 493248, 'steps': 2568, 'loss/train': 2.5977227687835693}} 11/06/2021 21:37:48 - INFO - __main__ - Step 2574: {'lr': 0.0004999815077303579, 'samples': 494208, 'steps': 2573, 'loss/train': 2.548187017440796}}} 11/06/2021 21:37:50 - INFO - __main__ - Step 2578: {'lr': 0.0004999812486506637, 'samples': 494976, 'steps': 2577, 'loss/train': 1.985874056816101}}} 11/06/2021 21:37:52 - INFO - __main__ - Step 2582: {'lr': 0.0004999809877687662, 'samples': 495744, 'steps': 2581, 'loss/train': 2.4952125549316406}} 11/06/2021 21:37:54 - INFO - __main__ - Step 2586: {'lr': 0.0004999807250846676, 'samples': 496512, 'steps': 2585, 'loss/train': 2.023077964782715}}} 11/06/2021 21:37:55 - INFO - __main__ - Step 2590: {'lr': 0.0004999804605983697, 'samples': 497280, 'steps': 2589, 'loss/train': 1.7954328060150146}} 11/06/2021 21:37:57 - INFO - __main__ - Step 2594: {'lr': 0.0004999801943098743, 'samples': 498048, 'steps': 2593, 'loss/train': 2.145956039428711}}} 11/06/2021 21:38:00 - INFO - __main__ - Step 2599: {'lr': 0.0004999798589149179, 'samples': 499008, 'steps': 2598, 'loss/train': 1.718825101852417}}} 11/06/2021 21:38:00 - INFO - __main__ - Step 2599: {'lr': 0.0004999798589149179, 'samples': 499008, 'steps': 2598, 'loss/train': 1.718825101852417}}} 11/06/2021 21:38:03 - INFO - __main__ - Step 2606: {'lr': 0.000499979384631223, 'samples': 500352, 'steps': 2605, 'loss/train': 2.426281690597534}}}} 11/06/2021 21:38:05 - INFO - __main__ - Step 2611: {'lr': 0.0004999790424780492, 'samples': 501312, 'steps': 2610, 'loss/train': 2.595461368560791}}} 11/06/2021 21:38:08 - INFO - __main__ - Step 2616: {'lr': 0.0004999786975089577, 'samples': 502272, 'steps': 2615, 'loss/train': 4.580399513244629}}} 11/06/2021 21:38:08 - INFO - __main__ - Step 2616: {'lr': 0.0004999786975089577, 'samples': 502272, 'steps': 2615, 'loss/train': 4.580399513244629}}} 11/06/2021 21:38:11 - INFO - __main__ - Step 2623: {'lr': 0.0004999782098214957, 'samples': 503616, 'steps': 2622, 'loss/train': 2.4918506145477295}} 11/06/2021 21:38:13 - INFO - __main__ - Step 2627: {'lr': 0.0004999779286649461, 'samples': 504384, 'steps': 2626, 'loss/train': 2.620913028717041}}} 11/06/2021 21:38:16 - INFO - __main__ - Step 2632: {'lr': 0.0004999775746849451, 'samples': 505344, 'steps': 2631, 'loss/train': 1.9219590425491333}} 11/06/2021 21:38:18 - INFO - __main__ - Step 2636: {'lr': 0.0004999772894734954, 'samples': 506112, 'steps': 2635, 'loss/train': 1.6440210342407227}} 11/06/2021 21:38:20 - INFO - __main__ - Step 2640: {'lr': 0.0004999770024598711, 'samples': 506880, 'steps': 2639, 'loss/train': 2.848144054412842}}} 11/06/2021 21:38:22 - INFO - __main__ - Step 2644: {'lr': 0.0004999767136440742, 'samples': 507648, 'steps': 2643, 'loss/train': 2.4299545288085938}} 11/06/2021 21:38:23 - INFO - __main__ - Step 2648: {'lr': 0.0004999764230261072, 'samples': 508416, 'steps': 2647, 'loss/train': 2.69739031791687}8}} 11/06/2021 21:38:26 - INFO - __main__ - Step 2653: {'lr': 0.0004999760572193492, 'samples': 509376, 'steps': 2652, 'loss/train': 2.0966432094573975}} 11/06/2021 21:38:28 - INFO - __main__ - Step 2657: {'lr': 0.0004999757625465063, 'samples': 510144, 'steps': 2656, 'loss/train': 1.6061630249023438}} 11/06/2021 21:38:30 - INFO - __main__ - Step 2661: {'lr': 0.0004999754660714999, 'samples': 510912, 'steps': 2660, 'loss/train': 2.1438426971435547}} 11/06/2021 21:38:32 - INFO - __main__ - Step 2665: {'lr': 0.0004999751677943322, 'samples': 511680, 'steps': 2664, 'loss/train': 1.7713021039962769}} 11/06/2021 21:38:33 - INFO - __main__ - Step 2669: {'lr': 0.0004999748677150051, 'samples': 512448, 'steps': 2668, 'loss/train': 2.6738812923431396}} 11/06/2021 21:38:35 - INFO - __main__ - Step 2673: {'lr': 0.0004999745658335209, 'samples': 513216, 'steps': 2672, 'loss/train': 2.6007511615753174}} 11/06/2021 21:38:38 - INFO - __main__ - Step 2678: {'lr': 0.0004999741859473857, 'samples': 514176, 'steps': 2677, 'loss/train': 2.3065757751464844}} 11/06/2021 21:38:38 - INFO - __main__ - Step 2678: {'lr': 0.0004999741859473857, 'samples': 514176, 'steps': 2677, 'loss/train': 2.3065757751464844}} 11/06/2021 21:38:41 - INFO - __main__ - Step 2685: {'lr': 0.0004999736493761477, 'samples': 515520, 'steps': 2684, 'loss/train': 2.4622879028320312}} 11/06/2021 21:38:43 - INFO - __main__ - Step 2689: {'lr': 0.0004999733402860572, 'samples': 516288, 'steps': 2688, 'loss/train': 2.6095635890960693}} 11/06/2021 21:38:46 - INFO - __main__ - Step 2694: {'lr': 0.0004999729513891762, 'samples': 517248, 'steps': 2693, 'loss/train': 2.7909016609191895}} 11/06/2021 21:38:48 - INFO - __main__ - Step 2698: {'lr': 0.0004999726382442601, 'samples': 518016, 'steps': 2697, 'loss/train': 1.9356613159179688}} 11/06/2021 21:38:48 - INFO - __main__ - Step 2698: {'lr': 0.0004999726382442601, 'samples': 518016, 'steps': 2697, 'loss/train': 1.9356613159179688}} 11/06/2021 21:38:52 - INFO - __main__ - Step 2705: {'lr': 0.0004999720859042565, 'samples': 519360, 'steps': 2704, 'loss/train': 2.2500193119049072}} 11/06/2021 21:38:54 - INFO - __main__ - Step 2710: {'lr': 0.0004999716879966747, 'samples': 520320, 'steps': 2709, 'loss/train': 2.6745805740356445}} 11/06/2021 21:38:56 - INFO - __main__ - Step 2714: {'lr': 0.0004999713676432082, 'samples': 521088, 'steps': 2713, 'loss/train': 2.1871511936187744}} 11/06/2021 21:38:57 - INFO - __main__ - Step 2718: {'lr': 0.0004999710454876099, 'samples': 521856, 'steps': 2717, 'loss/train': 2.10378360748291}4}} 11/06/2021 21:39:00 - INFO - __main__ - Step 2722: {'lr': 0.000499970721529882, 'samples': 522624, 'steps': 2721, 'loss/train': 3.079136848449707}4}} 11/06/2021 21:39:02 - INFO - __main__ - Step 2727: {'lr': 0.000499970314048481, 'samples': 523584, 'steps': 2726, 'loss/train': 2.2870664596557617}}} 11/06/2021 21:39:04 - INFO - __main__ - Step 2731: {'lr': 0.0004999699860359702, 'samples': 524352, 'steps': 2730, 'loss/train': 2.656719446182251}}} 11/06/2021 21:39:06 - INFO - __main__ - Step 2735: {'lr': 0.0004999696562213375, 'samples': 525120, 'steps': 2734, 'loss/train': 2.221712827682495}}} 11/06/2021 21:39:08 - INFO - __main__ - Step 2739: {'lr': 0.0004999693246045854, 'samples': 525888, 'steps': 2738, 'loss/train': 2.2682254314422607}} 11/06/2021 21:39:09 - INFO - __main__ - Step 2743: {'lr': 0.000499968991185716, 'samples': 526656, 'steps': 2742, 'loss/train': 2.7001304626464844}}} 11/06/2021 21:39:11 - INFO - __main__ - Step 2747: {'lr': 0.0004999686559647319, 'samples': 527424, 'steps': 2746, 'loss/train': 2.5279910564422607}} 11/06/2021 21:39:11 - INFO - __main__ - Step 2747: {'lr': 0.0004999686559647319, 'samples': 527424, 'steps': 2746, 'loss/train': 2.5279910564422607}} 11/06/2021 21:39:11 - INFO - __main__ - Step 2747: {'lr': 0.0004999686559647319, 'samples': 527424, 'steps': 2746, 'loss/train': 2.5279910564422607}} 11/06/2021 21:39:17 - INFO - __main__ - Step 2758: {'lr': 0.0004999677248148916, 'samples': 529536, 'steps': 2757, 'loss/train': 2.4732248783111572}} 11/06/2021 21:39:20 - INFO - __main__ - Step 2764: {'lr': 0.0004999672111707639, 'samples': 530688, 'steps': 2763, 'loss/train': 3.058380603790283}}} 11/06/2021 21:39:22 - INFO - __main__ - Step 2768: {'lr': 0.0004999668664887175, 'samples': 531456, 'steps': 2767, 'loss/train': 2.2657551765441895}} 11/06/2021 21:39:24 - INFO - __main__ - Step 2772: {'lr': 0.0004999665200045716, 'samples': 532224, 'steps': 2771, 'loss/train': 1.0660734176635742}} 11/06/2021 21:39:26 - INFO - __main__ - Step 2776: {'lr': 0.000499966171718329, 'samples': 532992, 'steps': 2775, 'loss/train': 2.4398868083953857}}} 11/06/2021 21:39:28 - INFO - __main__ - Step 2780: {'lr': 0.0004999658216299919, 'samples': 533760, 'steps': 2779, 'loss/train': 2.217278242111206}}} 11/06/2021 21:39:30 - INFO - __main__ - Step 2785: {'lr': 0.0004999653814853791, 'samples': 534720, 'steps': 2784, 'loss/train': 2.1000912189483643}} 11/06/2021 21:39:32 - INFO - __main__ - Step 2789: {'lr': 0.0004999650273423389, 'samples': 535488, 'steps': 2788, 'loss/train': 2.207897663116455}}} 11/06/2021 21:39:32 - INFO - __main__ - Step 2789: {'lr': 0.0004999650273423389, 'samples': 535488, 'steps': 2788, 'loss/train': 2.207897663116455}}} 11/06/2021 21:39:35 - INFO - __main__ - Step 2795: {'lr': 0.0004999644927488678, 'samples': 536640, 'steps': 2794, 'loss/train': 2.290945529937744}}} 11/06/2021 21:39:38 - INFO - __main__ - Step 2800: {'lr': 0.0004999640441569793, 'samples': 537600, 'steps': 2799, 'loss/train': 1.9303408861160278}} 11/06/2021 21:39:40 - INFO - __main__ - Step 2805: {'lr': 0.0004999635927493423, 'samples': 538560, 'steps': 2804, 'loss/train': 2.4244282245635986}} 11/06/2021 21:39:40 - INFO - __main__ - Step 2805: {'lr': 0.0004999635927493423, 'samples': 538560, 'steps': 2804, 'loss/train': 2.4244282245635986}} 11/06/2021 21:39:44 - INFO - __main__ - Step 2812: {'lr': 0.0004999629560482026, 'samples': 539904, 'steps': 2811, 'loss/train': 1.2682995796203613}} 11/06/2021 21:39:46 - INFO - __main__ - Step 2816: {'lr': 0.0004999625897411311, 'samples': 540672, 'steps': 2815, 'loss/train': 2.5656561851501465}} 11/06/2021 21:39:48 - INFO - __main__ - Step 2821: {'lr': 0.0004999621293231331, 'samples': 541632, 'steps': 2820, 'loss/train': 2.8650639057159424}} 11/06/2021 21:39:50 - INFO - __main__ - Step 2825: {'lr': 0.000499961758961411, 'samples': 542400, 'steps': 2824, 'loss/train': 2.4017333984375}424}} 11/06/2021 21:39:52 - INFO - __main__ - Step 2829: {'lr': 0.0004999613867976264, 'samples': 543168, 'steps': 2828, 'loss/train': 2.7821502685546875}} 11/06/2021 21:39:54 - INFO - __main__ - Step 2833: {'lr': 0.0004999610128317818, 'samples': 543936, 'steps': 2832, 'loss/train': 1.7340575456619263}} 11/06/2021 21:39:56 - INFO - __main__ - Step 2837: {'lr': 0.0004999606370638801, 'samples': 544704, 'steps': 2836, 'loss/train': 1.946204423904419}}} 11/06/2021 21:39:58 - INFO - __main__ - Step 2842: {'lr': 0.0004999601648198641, 'samples': 545664, 'steps': 2841, 'loss/train': 2.791858434677124}}} 11/06/2021 21:39:58 - INFO - __main__ - Step 2842: {'lr': 0.0004999601648198641, 'samples': 545664, 'steps': 2841, 'loss/train': 2.791858434677124}}} 11/06/2021 21:40:02 - INFO - __main__ - Step 2850: {'lr': 0.0004999594033727747, 'samples': 547200, 'steps': 2849, 'loss/train': 2.0793142318725586}} 11/06/2021 21:40:04 - INFO - __main__ - Step 2854: {'lr': 0.0004999590199461602, 'samples': 547968, 'steps': 2853, 'loss/train': 2.220766067504883}}} 11/06/2021 21:40:06 - INFO - __main__ - Step 2858: {'lr': 0.000499958634717503, 'samples': 548736, 'steps': 2857, 'loss/train': 1.4249017238616943}}} 11/06/2021 21:40:08 - INFO - __main__ - Step 2862: {'lr': 0.0004999582476868055, 'samples': 549504, 'steps': 2861, 'loss/train': 2.6905484199523926}} 11/06/2021 21:40:10 - INFO - __main__ - Step 2867: {'lr': 0.0004999577613643192, 'samples': 550464, 'steps': 2866, 'loss/train': 2.0363290309906006}} 11/06/2021 21:40:10 - INFO - __main__ - Step 2867: {'lr': 0.0004999577613643192, 'samples': 550464, 'steps': 2866, 'loss/train': 2.0363290309906006}} 11/06/2021 21:40:14 - INFO - __main__ - Step 2874: {'lr': 0.000499957075782501, 'samples': 551808, 'steps': 2873, 'loss/train': 1.8168832063674927}}} 11/06/2021 21:40:16 - INFO - __main__ - Step 2878: {'lr': 0.0004999566815436715, 'samples': 552576, 'steps': 2877, 'loss/train': 2.445081949234009}}} 11/06/2021 21:40:18 - INFO - __main__ - Step 2883: {'lr': 0.0004999561862110358, 'samples': 553536, 'steps': 2882, 'loss/train': 1.861746072769165}}} 11/06/2021 21:40:18 - INFO - __main__ - Step 2883: {'lr': 0.0004999561862110358, 'samples': 553536, 'steps': 2882, 'loss/train': 1.861746072769165}}} 11/06/2021 21:40:22 - INFO - __main__ - Step 2891: {'lr': 0.0004999553878222482, 'samples': 555072, 'steps': 2890, 'loss/train': 1.2182731628417969}} 11/06/2021 21:40:24 - INFO - __main__ - Step 2895: {'lr': 0.000499954985924828, 'samples': 555840, 'steps': 2894, 'loss/train': 1.992488980293274}9}} 11/06/2021 21:40:26 - INFO - __main__ - Step 2899: {'lr': 0.0004999545822253941, 'samples': 556608, 'steps': 2898, 'loss/train': 1.9909908771514893}} 11/06/2021 21:40:28 - INFO - __main__ - Step 2903: {'lr': 0.0004999541767239493, 'samples': 557376, 'steps': 2902, 'loss/train': 2.2086076736450195}} 11/06/2021 21:40:30 - INFO - __main__ - Step 2909: {'lr': 0.0004999535650930182, 'samples': 558528, 'steps': 2908, 'loss/train': 1.6833531856536865}} 11/06/2021 21:40:30 - INFO - __main__ - Step 2909: {'lr': 0.0004999535650930182, 'samples': 558528, 'steps': 2908, 'loss/train': 1.6833531856536865}} 11/06/2021 21:40:34 - INFO - __main__ - Step 2914: {'lr': 0.0004999530523033817, 'samples': 559488, 'steps': 2913, 'loss/train': 1.3060599565505981}} 11/06/2021 21:40:36 - INFO - __main__ - Step 2920: {'lr': 0.0004999524332391937, 'samples': 560640, 'steps': 2919, 'loss/train': 1.4589003324508667}} 11/06/2021 21:40:38 - INFO - __main__ - Step 2924: {'lr': 0.0004999520182772402, 'samples': 561408, 'steps': 2923, 'loss/train': 2.112090826034546}}} 11/06/2021 21:40:41 - INFO - __main__ - Step 2928: {'lr': 0.0004999516015132945, 'samples': 562176, 'steps': 2927, 'loss/train': 1.2997283935546875}} 11/06/2021 21:40:43 - INFO - __main__ - Step 2932: {'lr': 0.0004999511829473593, 'samples': 562944, 'steps': 2931, 'loss/train': 1.8353915214538574}} 11/06/2021 21:40:44 - INFO - __main__ - Step 2936: {'lr': 0.0004999507625794378, 'samples': 563712, 'steps': 2935, 'loss/train': 2.2181787490844727}} 11/06/2021 21:40:46 - INFO - __main__ - Step 2940: {'lr': 0.000499950340409533, 'samples': 564480, 'steps': 2939, 'loss/train': 2.1608986854553223}}} 11/06/2021 21:40:49 - INFO - __main__ - Step 2945: {'lr': 0.0004999498101631177, 'samples': 565440, 'steps': 2944, 'loss/train': 2.750171422958374}}} 11/06/2021 21:40:51 - INFO - __main__ - Step 2949: {'lr': 0.0004999493839387615, 'samples': 566208, 'steps': 2948, 'loss/train': 0.8493632674217224}} 11/06/2021 21:40:51 - INFO - __main__ - Step 2949: {'lr': 0.0004999493839387615, 'samples': 566208, 'steps': 2948, 'loss/train': 0.8493632674217224}} 11/06/2021 21:40:54 - INFO - __main__ - Step 2956: {'lr': 0.0004999486337101419, 'samples': 567552, 'steps': 2955, 'loss/train': 2.1768510341644287}} 11/06/2021 21:40:56 - INFO - __main__ - Step 2961: {'lr': 0.0004999480944538655, 'samples': 568512, 'steps': 2960, 'loss/train': 2.1335995197296143}} 11/06/2021 21:40:59 - INFO - __main__ - Step 2966: {'lr': 0.0004999475523820203, 'samples': 569472, 'steps': 2965, 'loss/train': 1.6673521995544434}} 11/06/2021 21:41:01 - INFO - __main__ - Step 2970: {'lr': 0.0004999471166973385, 'samples': 570240, 'steps': 2969, 'loss/train': 2.754880905151367}}} 11/06/2021 21:41:01 - INFO - __main__ - Step 2970: {'lr': 0.0004999471166973385, 'samples': 570240, 'steps': 2969, 'loss/train': 2.754880905151367}}} 11/06/2021 21:41:04 - INFO - __main__ - Step 2977: {'lr': 0.0004999463499131884, 'samples': 571584, 'steps': 2976, 'loss/train': 2.5181667804718018}} 11/06/2021 21:41:06 - INFO - __main__ - Step 2982: {'lr': 0.000499945798831564, 'samples': 572544, 'steps': 2981, 'loss/train': 2.780032157897949}8}} 11/06/2021 21:41:09 - INFO - __main__ - Step 2987: {'lr': 0.0004999452449343967, 'samples': 573504, 'steps': 2986, 'loss/train': 2.3532779216766357}} 11/06/2021 21:41:11 - INFO - __main__ - Step 2991: {'lr': 0.000499944799789476, 'samples': 574272, 'steps': 2990, 'loss/train': 2.203000545501709}7}} 11/06/2021 21:41:13 - INFO - __main__ - Step 2995: {'lr': 0.0004999443528426149, 'samples': 575040, 'steps': 2994, 'loss/train': 2.4999752044677734}} 11/06/2021 21:41:14 - INFO - __main__ - Step 2999: {'lr': 0.0004999439040938168, 'samples': 575808, 'steps': 2998, 'loss/train': 2.217271327972412}}} 11/06/2021 21:41:16 - INFO - __main__ - Step 3003: {'lr': 0.0004999434535430848, 'samples': 576576, 'steps': 3002, 'loss/train': 2.3395111560821533}} 11/06/2021 21:41:19 - INFO - __main__ - Step 3008: {'lr': 0.0004999428878207054, 'samples': 577536, 'steps': 3007, 'loss/train': 2.3040289878845215}} 11/06/2021 21:41:21 - INFO - __main__ - Step 3012: {'lr': 0.0004999424332156341, 'samples': 578304, 'steps': 3011, 'loss/train': 1.8335940837860107}} 11/06/2021 21:41:21 - INFO - __main__ - Step 3012: {'lr': 0.0004999424332156341, 'samples': 578304, 'steps': 3011, 'loss/train': 1.8335940837860107}} 11/06/2021 21:41:24 - INFO - __main__ - Step 3019: {'lr': 0.0004999416333208835, 'samples': 579648, 'steps': 3018, 'loss/train': 1.6759024858474731}} 11/06/2021 21:41:26 - INFO - __main__ - Step 3023: {'lr': 0.0004999411737605313, 'samples': 580416, 'steps': 3022, 'loss/train': 1.946350336074829}}} 11/06/2021 21:41:26 - INFO - __main__ - Step 3023: {'lr': 0.0004999411737605313, 'samples': 580416, 'steps': 3022, 'loss/train': 1.946350336074829}}} 11/06/2021 21:41:31 - INFO - __main__ - Step 3031: {'lr': 0.0004999402492340875, 'samples': 581952, 'steps': 3030, 'loss/train': 2.4372713565826416}} 11/06/2021 21:41:33 - INFO - __main__ - Step 3035: {'lr': 0.0004999397842680027, 'samples': 582720, 'steps': 3034, 'loss/train': 2.364650249481201}}} 11/06/2021 21:41:34 - INFO - __main__ - Step 3039: {'lr': 0.0004999393175000137, 'samples': 583488, 'steps': 3038, 'loss/train': 2.1946680545806885}} 11/06/2021 21:41:37 - INFO - __main__ - Step 3043: {'lr': 0.000499938848930124, 'samples': 584256, 'steps': 3042, 'loss/train': 1.9267772436141968}}} 11/06/2021 21:41:37 - INFO - __main__ - Step 3043: {'lr': 0.000499938848930124, 'samples': 584256, 'steps': 3042, 'loss/train': 1.9267772436141968}}} 11/06/2021 21:41:40 - INFO - __main__ - Step 3051: {'lr': 0.0004999379063846555, 'samples': 585792, 'steps': 3050, 'loss/train': 2.1658904552459717}} 11/06/2021 21:41:42 - INFO - __main__ - Step 3055: {'lr': 0.0004999374324090837, 'samples': 586560, 'steps': 3054, 'loss/train': 2.6458959579467773}} 11/06/2021 21:41:45 - INFO - __main__ - Step 3060: {'lr': 0.0004999368374057155, 'samples': 587520, 'steps': 3059, 'loss/train': 2.637629985809326}}} 11/06/2021 21:41:47 - INFO - __main__ - Step 3064: {'lr': 0.0004999363593759022, 'samples': 588288, 'steps': 3063, 'loss/train': 2.5826447010040283}} 11/06/2021 21:41:47 - INFO - __main__ - Step 3064: {'lr': 0.0004999363593759022, 'samples': 588288, 'steps': 3063, 'loss/train': 2.5826447010040283}} 11/06/2021 21:41:50 - INFO - __main__ - Step 3071: {'lr': 0.0004999355184879587, 'samples': 589632, 'steps': 3070, 'loss/train': 1.911078691482544}}} 11/06/2021 21:41:53 - INFO - __main__ - Step 3076: {'lr': 0.0004999349144751997, 'samples': 590592, 'steps': 3075, 'loss/train': 2.093614339828491}}} 11/06/2021 21:41:53 - INFO - __main__ - Step 3076: {'lr': 0.0004999349144751997, 'samples': 590592, 'steps': 3075, 'loss/train': 2.093614339828491}}} 11/06/2021 21:41:56 - INFO - __main__ - Step 3082: {'lr': 0.0004999341859435345, 'samples': 591744, 'steps': 3081, 'loss/train': 2.610215902328491}}} 11/06/2021 21:41:56 - INFO - __main__ - Step 3082: {'lr': 0.0004999341859435345, 'samples': 591744, 'steps': 3081, 'loss/train': 2.610215902328491}}} 11/06/2021 21:42:01 - INFO - __main__ - Step 3091: {'lr': 0.0004999330855444274, 'samples': 593472, 'steps': 3090, 'loss/train': 2.4830446243286133}} 11/06/2021 21:42:02 - INFO - __main__ - Step 3095: {'lr': 0.0004999325935501395, 'samples': 594240, 'steps': 3094, 'loss/train': 2.2615649700164795}} 11/06/2021 21:42:04 - INFO - __main__ - Step 3099: {'lr': 0.0004999320997539992, 'samples': 595008, 'steps': 3098, 'loss/train': 2.1478734016418457}} 11/06/2021 21:42:07 - INFO - __main__ - Step 3104: {'lr': 0.0004999314799749745, 'samples': 595968, 'steps': 3103, 'loss/train': 3.2617650032043457}} 11/06/2021 21:42:07 - INFO - __main__ - Step 3104: {'lr': 0.0004999314799749745, 'samples': 595968, 'steps': 3103, 'loss/train': 3.2617650032043457}} 11/06/2021 21:42:10 - INFO - __main__ - Step 3111: {'lr': 0.0004999306075545002, 'samples': 597312, 'steps': 3110, 'loss/train': 2.7295010089874268}} 11/06/2021 21:42:12 - INFO - __main__ - Step 3115: {'lr': 0.0004999301065509863, 'samples': 598080, 'steps': 3114, 'loss/train': 2.4557383060455322}} 11/06/2021 21:42:15 - INFO - __main__ - Step 3120: {'lr': 0.0004999294777627649, 'samples': 599040, 'steps': 3119, 'loss/train': 1.3478337526321411}} 11/06/2021 21:42:17 - INFO - __main__ - Step 3125: {'lr': 0.0004999288461591842, 'samples': 600000, 'steps': 3124, 'loss/train': 1.7113127708435059}} 11/06/2021 21:42:17 - INFO - __main__ - Step 3125: {'lr': 0.0004999288461591842, 'samples': 600000, 'steps': 3124, 'loss/train': 1.7113127708435059}} 11/06/2021 21:42:17 - INFO - __main__ - Step 3125: {'lr': 0.0004999288461591842, 'samples': 600000, 'steps': 3124, 'loss/train': 1.7113127708435059}} 11/06/2021 21:42:22 - INFO - __main__ - Step 3135: {'lr': 0.0004999275745059741, 'samples': 601920, 'steps': 3134, 'loss/train': 2.9237546920776367}} 11/06/2021 21:42:25 - INFO - __main__ - Step 3141: {'lr': 0.0004999268061085959, 'samples': 603072, 'steps': 3140, 'loss/train': 2.418550968170166}}} 11/06/2021 21:42:25 - INFO - __main__ - Step 3141: {'lr': 0.0004999268061085959, 'samples': 603072, 'steps': 3140, 'loss/train': 2.418550968170166}}} 11/06/2021 21:42:29 - INFO - __main__ - Step 3148: {'lr': 0.0004999259045210901, 'samples': 604416, 'steps': 3147, 'loss/train': 1.4385805130004883}} 11/06/2021 21:42:30 - INFO - __main__ - Step 3152: {'lr': 0.0004999253868507476, 'samples': 605184, 'steps': 3151, 'loss/train': 2.5812270641326904}} 11/06/2021 21:42:33 - INFO - __main__ - Step 3156: {'lr': 0.0004999248673786049, 'samples': 605952, 'steps': 3155, 'loss/train': 1.7660547494888306}} 11/06/2021 21:42:35 - INFO - __main__ - Step 3162: {'lr': 0.0004999240847920233, 'samples': 607104, 'steps': 3161, 'loss/train': 1.7738767862319946}} 11/06/2021 21:42:37 - INFO - __main__ - Step 3166: {'lr': 0.0004999235608153961, 'samples': 607872, 'steps': 3165, 'loss/train': 1.9972015619277954}} 11/06/2021 21:42:39 - INFO - __main__ - Step 3170: {'lr': 0.0004999230350369816, 'samples': 608640, 'steps': 3169, 'loss/train': 2.8902382850646973}} 11/06/2021 21:42:39 - INFO - __main__ - Step 3170: {'lr': 0.0004999230350369816, 'samples': 608640, 'steps': 3169, 'loss/train': 2.8902382850646973}} 11/06/2021 21:42:42 - INFO - __main__ - Step 3177: {'lr': 0.0004999221105892172, 'samples': 609984, 'steps': 3176, 'loss/train': 2.14980411529541}3}} 11/06/2021 21:42:45 - INFO - __main__ - Step 3182: {'lr': 0.000499921446891053, 'samples': 610944, 'steps': 3181, 'loss/train': 2.965468168258667}3}} 11/06/2021 21:42:47 - INFO - __main__ - Step 3187: {'lr': 0.0004999207803776201, 'samples': 611904, 'steps': 3186, 'loss/train': 2.031585693359375}}} 11/06/2021 21:42:49 - INFO - __main__ - Step 3191: {'lr': 0.0004999202451398853, 'samples': 612672, 'steps': 3190, 'loss/train': 2.0862905979156494}} 11/06/2021 21:42:49 - INFO - __main__ - Step 3191: {'lr': 0.0004999202451398853, 'samples': 612672, 'steps': 3190, 'loss/train': 2.0862905979156494}} 11/06/2021 21:42:53 - INFO - __main__ - Step 3198: {'lr': 0.0004999193041383588, 'samples': 614016, 'steps': 3197, 'loss/train': 2.2492079734802246}} 11/06/2021 21:42:55 - INFO - __main__ - Step 3203: {'lr': 0.0004999186286161169, 'samples': 614976, 'steps': 3202, 'loss/train': 2.909585952758789}}} 11/06/2021 21:42:55 - INFO - __main__ - Step 3203: {'lr': 0.0004999186286161169, 'samples': 614976, 'steps': 3202, 'loss/train': 2.909585952758789}}} 11/06/2021 21:42:55 - INFO - __main__ - Step 3203: {'lr': 0.0004999186286161169, 'samples': 614976, 'steps': 3202, 'loss/train': 2.909585952758789}}} 11/06/2021 21:43:00 - INFO - __main__ - Step 3213: {'lr': 0.0004999172691259293, 'samples': 616896, 'steps': 3212, 'loss/train': 1.8541127443313599}} 11/06/2021 21:43:00 - INFO - __main__ - Step 3213: {'lr': 0.0004999172691259293, 'samples': 616896, 'steps': 3212, 'loss/train': 1.8541127443313599}} 11/06/2021 21:43:05 - INFO - __main__ - Step 3222: {'lr': 0.0004999160359567011, 'samples': 618624, 'steps': 3221, 'loss/train': 1.9945106506347656}} 11/06/2021 21:43:07 - INFO - __main__ - Step 3226: {'lr': 0.0004999154849536698, 'samples': 619392, 'steps': 3225, 'loss/train': 2.3978424072265625}} 11/06/2021 21:43:08 - INFO - __main__ - Step 3230: {'lr': 0.0004999149321489095, 'samples': 620160, 'steps': 3229, 'loss/train': 3.806713581085205}}} 11/06/2021 21:43:10 - INFO - __main__ - Step 3234: {'lr': 0.0004999143775424241, 'samples': 620928, 'steps': 3233, 'loss/train': 2.1690218448638916}} 11/06/2021 21:43:13 - INFO - __main__ - Step 3239: {'lr': 0.0004999136817506478, 'samples': 621888, 'steps': 3238, 'loss/train': 2.375570297241211}}} 11/06/2021 21:43:15 - INFO - __main__ - Step 3243: {'lr': 0.000499913123090296, 'samples': 622656, 'steps': 3242, 'loss/train': 2.318620204925537}}}} 11/06/2021 21:43:17 - INFO - __main__ - Step 3247: {'lr': 0.0004999125626282322, 'samples': 623424, 'steps': 3246, 'loss/train': 2.054605484008789}}} 11/06/2021 21:43:17 - INFO - __main__ - Step 3247: {'lr': 0.0004999125626282322, 'samples': 623424, 'steps': 3246, 'loss/train': 2.054605484008789}}} 11/06/2021 21:43:20 - INFO - __main__ - Step 3255: {'lr': 0.0004999114362989849, 'samples': 624960, 'steps': 3254, 'loss/train': 2.2318007946014404}} 11/06/2021 21:43:23 - INFO - __main__ - Step 3260: {'lr': 0.0004999107286835006, 'samples': 625920, 'steps': 3259, 'loss/train': 0.5624179244041443}} 11/06/2021 21:43:25 - INFO - __main__ - Step 3264: {'lr': 0.0004999101605642061, 'samples': 626688, 'steps': 3263, 'loss/train': 2.3034634590148926}} 11/06/2021 21:43:25 - INFO - __main__ - Step 3264: {'lr': 0.0004999101605642061, 'samples': 626688, 'steps': 3263, 'loss/train': 2.3034634590148926}} 11/06/2021 21:43:28 - INFO - __main__ - Step 3271: {'lr': 0.0004999091620201255, 'samples': 628032, 'steps': 3270, 'loss/train': 4.105412006378174}}} 11/06/2021 21:43:31 - INFO - __main__ - Step 3277: {'lr': 0.0004999083017335951, 'samples': 629184, 'steps': 3276, 'loss/train': 2.3697509765625}4}}} 11/06/2021 21:43:33 - INFO - __main__ - Step 3281: {'lr': 0.0004999077259571442, 'samples': 629952, 'steps': 3280, 'loss/train': 2.139662504196167}}} 11/06/2021 21:43:33 - INFO - __main__ - Step 3281: {'lr': 0.0004999077259571442, 'samples': 629952, 'steps': 3280, 'loss/train': 2.139662504196167}}} 11/06/2021 21:43:36 - INFO - __main__ - Step 3288: {'lr': 0.0004999067140130819, 'samples': 631296, 'steps': 3287, 'loss/train': 2.0896875858306885}} 11/06/2021 21:43:39 - INFO - __main__ - Step 3292: {'lr': 0.0004999061332820401, 'samples': 632064, 'steps': 3291, 'loss/train': 1.7333624362945557}} 11/06/2021 21:43:41 - INFO - __main__ - Step 3297: {'lr': 0.0004999054048346517, 'samples': 633024, 'steps': 3296, 'loss/train': 2.7712512016296387}} 11/06/2021 21:43:43 - INFO - __main__ - Step 3302: {'lr': 0.0004999046735721755, 'samples': 633984, 'steps': 3301, 'loss/train': 2.4104175567626953}} 11/06/2021 21:43:43 - INFO - __main__ - Step 3302: {'lr': 0.0004999046735721755, 'samples': 633984, 'steps': 3301, 'loss/train': 2.4104175567626953}} 11/06/2021 21:43:47 - INFO - __main__ - Step 3309: {'lr': 0.0004999036450753767, 'samples': 635328, 'steps': 3308, 'loss/train': 2.1639621257781982}} 11/06/2021 21:43:48 - INFO - __main__ - Step 3313: {'lr': 0.0004999030548856586, 'samples': 636096, 'steps': 3312, 'loss/train': 1.3736777305603027}} 11/06/2021 21:43:51 - INFO - __main__ - Step 3318: {'lr': 0.0004999023146149565, 'samples': 637056, 'steps': 3317, 'loss/train': 1.93985116481781}7}} 11/06/2021 21:43:53 - INFO - __main__ - Step 3323: {'lr': 0.000499901571529201, 'samples': 638016, 'steps': 3322, 'loss/train': 1.9539903402328491}}} 11/06/2021 21:43:55 - INFO - __main__ - Step 3327: {'lr': 0.000499900975033764, 'samples': 638784, 'steps': 3326, 'loss/train': 2.444103240966797}}}} 11/06/2021 21:43:55 - INFO - __main__ - Step 3327: {'lr': 0.000499900975033764, 'samples': 638784, 'steps': 3326, 'loss/train': 2.444103240966797}}}} 11/06/2021 21:43:59 - INFO - __main__ - Step 3334: {'lr': 0.0004998999268315932, 'samples': 640128, 'steps': 3333, 'loss/train': 2.300365447998047}}} 11/06/2021 21:44:01 - INFO - __main__ - Step 3339: {'lr': 0.000499899174737724, 'samples': 641088, 'steps': 3338, 'loss/train': 2.363450765609741}}}} 11/06/2021 21:44:03 - INFO - __main__ - Step 3343: {'lr': 0.0004998985710358155, 'samples': 641856, 'steps': 3342, 'loss/train': 1.9851315021514893}} 11/06/2021 21:44:05 - INFO - __main__ - Step 3347: {'lr': 0.0004998979655323, 'samples': 642624, 'steps': 3346, 'loss/train': 1.8560844659805298}93}} 11/06/2021 21:44:05 - INFO - __main__ - Step 3347: {'lr': 0.0004998979655323, 'samples': 642624, 'steps': 3346, 'loss/train': 1.8560844659805298}93}} 11/06/2021 21:44:09 - INFO - __main__ - Step 3354: {'lr': 0.0004998969015660438, 'samples': 643968, 'steps': 3353, 'loss/train': 1.8647414445877075}} 11/06/2021 21:44:09 - INFO - __main__ - Step 3354: {'lr': 0.0004998969015660438, 'samples': 643968, 'steps': 3353, 'loss/train': 1.8647414445877075}} 11/06/2021 21:44:13 - INFO - __main__ - Step 3363: {'lr': 0.0004998955255022547, 'samples': 645696, 'steps': 3362, 'loss/train': 2.9360525608062744}} 11/06/2021 21:44:15 - INFO - __main__ - Step 3367: {'lr': 0.0004998949109907697, 'samples': 646464, 'steps': 3366, 'loss/train': 2.29533052444458}4}} 11/06/2021 21:44:17 - INFO - __main__ - Step 3371: {'lr': 0.000499894294677704, 'samples': 647232, 'steps': 3370, 'loss/train': 2.024826765060425}4}} 11/06/2021 21:44:19 - INFO - __main__ - Step 3375: {'lr': 0.000499893676563062, 'samples': 648000, 'steps': 3374, 'loss/train': 2.1932318210601807}}} 11/06/2021 21:44:21 - INFO - __main__ - Step 3380: {'lr': 0.0004998929013863, 'samples': 648960, 'steps': 3379, 'loss/train': 1.080916404724121}07}}} 11/06/2021 21:44:21 - INFO - __main__ - Step 3380: {'lr': 0.0004998929013863, 'samples': 648960, 'steps': 3379, 'loss/train': 1.080916404724121}07}}} 11/06/2021 21:44:25 - INFO - __main__ - Step 3387: {'lr': 0.0004998918114097237, 'samples': 650304, 'steps': 3386, 'loss/train': 2.346414566040039}}} 11/06/2021 21:44:27 - INFO - __main__ - Step 3391: {'lr': 0.0004998911860888217, 'samples': 651072, 'steps': 3390, 'loss/train': 1.96201491355896}}}} 11/06/2021 21:44:29 - INFO - __main__ - Step 3396: {'lr': 0.0004998904019042596, 'samples': 652032, 'steps': 3395, 'loss/train': 2.2461233139038086}} 11/06/2021 21:44:32 - INFO - __main__ - Step 3401: {'lr': 0.0004998896149047786, 'samples': 652992, 'steps': 3400, 'loss/train': 2.4505343437194824}} 11/06/2021 21:44:32 - INFO - __main__ - Step 3401: {'lr': 0.0004998896149047786, 'samples': 652992, 'steps': 3400, 'loss/train': 2.4505343437194824}} 11/06/2021 21:44:35 - INFO - __main__ - Step 3408: {'lr': 0.0004998885083764582, 'samples': 654336, 'steps': 3407, 'loss/train': 2.54028582572937}4}} 11/06/2021 21:44:37 - INFO - __main__ - Step 3412: {'lr': 0.0004998878735974493, 'samples': 655104, 'steps': 3411, 'loss/train': 2.2348644733428955}} 11/06/2021 21:44:39 - INFO - __main__ - Step 3417: {'lr': 0.0004998870775902872, 'samples': 656064, 'steps': 3416, 'loss/train': 2.356597900390625}}} 11/06/2021 21:44:39 - INFO - __main__ - Step 3417: {'lr': 0.0004998870775902872, 'samples': 656064, 'steps': 3416, 'loss/train': 2.356597900390625}}} 11/06/2021 21:44:39 - INFO - __main__ - Step 3417: {'lr': 0.0004998870775902872, 'samples': 656064, 'steps': 3416, 'loss/train': 2.356597900390625}}} 11/06/2021 21:44:45 - INFO - __main__ - Step 3428: {'lr': 0.0004998853164661606, 'samples': 658176, 'steps': 3427, 'loss/train': 1.7819459438323975}} 11/06/2021 21:44:47 - INFO - __main__ - Step 3433: {'lr': 0.0004998845114514095, 'samples': 659136, 'steps': 3432, 'loss/train': 1.8727471828460693}} 11/06/2021 21:44:47 - INFO - __main__ - Step 3433: {'lr': 0.0004998845114514095, 'samples': 659136, 'steps': 3432, 'loss/train': 1.8727471828460693}} 11/06/2021 21:44:52 - INFO - __main__ - Step 3440: {'lr': 0.0004998833797018074, 'samples': 660480, 'steps': 3439, 'loss/train': 1.823896050453186}}} 11/06/2021 21:44:53 - INFO - __main__ - Step 3444: {'lr': 0.0004998827305106884, 'samples': 661248, 'steps': 3443, 'loss/train': 1.7395120859146118}} 11/06/2021 21:44:55 - INFO - __main__ - Step 3448: {'lr': 0.0004998820795180766, 'samples': 662016, 'steps': 3447, 'loss/train': 2.5539913177490234}} 11/06/2021 21:44:58 - INFO - __main__ - Step 3453: {'lr': 0.0004998812632439697, 'samples': 662976, 'steps': 3452, 'loss/train': 1.7184646129608154}} 11/06/2021 21:45:00 - INFO - __main__ - Step 3457: {'lr': 0.0004998806081980162, 'samples': 663744, 'steps': 3456, 'loss/train': 2.589240074157715}}} 11/06/2021 21:45:02 - INFO - __main__ - Step 3461: {'lr': 0.0004998799513505851, 'samples': 664512, 'steps': 3460, 'loss/train': 2.196791648864746}}} 11/06/2021 21:45:03 - INFO - __main__ - Step 3465: {'lr': 0.0004998792927016812, 'samples': 665280, 'steps': 3464, 'loss/train': 2.557910203933716}}} 11/06/2021 21:45:05 - INFO - __main__ - Step 3469: {'lr': 0.0004998786322513093, 'samples': 666048, 'steps': 3468, 'loss/train': 2.309483528137207}}} 11/06/2021 21:45:08 - INFO - __main__ - Step 3475: {'lr': 0.0004998776381980092, 'samples': 667200, 'steps': 3474, 'loss/train': 2.124244213104248}}} 11/06/2021 21:45:10 - INFO - __main__ - Step 3479: {'lr': 0.000499876973243988, 'samples': 667968, 'steps': 3478, 'loss/train': 1.411661148071289}}}} 11/06/2021 21:45:10 - INFO - __main__ - Step 3479: {'lr': 0.000499876973243988, 'samples': 667968, 'steps': 3478, 'loss/train': 1.411661148071289}}}} 11/06/2021 21:45:13 - INFO - __main__ - Step 3486: {'lr': 0.0004998758052397115, 'samples': 669312, 'steps': 3485, 'loss/train': 1.9367340803146362}} 11/06/2021 21:45:15 - INFO - __main__ - Step 3491: {'lr': 0.0004998749675732357, 'samples': 670272, 'steps': 3490, 'loss/train': 2.008915662765503}}} 11/06/2021 21:45:18 - INFO - __main__ - Step 3495: {'lr': 0.000499874295413438, 'samples': 671040, 'steps': 3494, 'loss/train': 1.9268534183502197}}} 11/06/2021 21:45:20 - INFO - __main__ - Step 3499: {'lr': 0.0004998736214522084, 'samples': 671808, 'steps': 3498, 'loss/train': 3.6572864055633545}} 11/06/2021 21:45:21 - INFO - __main__ - Step 3503: {'lr': 0.0004998729456895516, 'samples': 672576, 'steps': 3502, 'loss/train': 2.645084857940674}}} 11/06/2021 21:45:24 - INFO - __main__ - Step 3507: {'lr': 0.0004998722681254725, 'samples': 673344, 'steps': 3506, 'loss/train': 1.835058569908142}}} 11/06/2021 21:45:26 - INFO - __main__ - Step 3512: {'lr': 0.000499871418637131, 'samples': 674304, 'steps': 3511, 'loss/train': 1.7968868017196655}}} 11/06/2021 21:45:28 - INFO - __main__ - Step 3516: {'lr': 0.0004998707370198695, 'samples': 675072, 'steps': 3515, 'loss/train': 2.0000085830688477}} 11/06/2021 21:45:28 - INFO - __main__ - Step 3516: {'lr': 0.0004998707370198695, 'samples': 675072, 'steps': 3515, 'loss/train': 2.0000085830688477}} 11/06/2021 21:45:31 - INFO - __main__ - Step 3523: {'lr': 0.0004998695398550309, 'samples': 676416, 'steps': 3522, 'loss/train': 2.8342642784118652}} 11/06/2021 21:45:34 - INFO - __main__ - Step 3528: {'lr': 0.0004998686813596668, 'samples': 677376, 'steps': 3527, 'loss/train': 2.326568126678467}}} 11/06/2021 21:45:36 - INFO - __main__ - Step 3532: {'lr': 0.0004998679925368094, 'samples': 678144, 'steps': 3531, 'loss/train': 1.9451133012771606}} 11/06/2021 21:45:38 - INFO - __main__ - Step 3537: {'lr': 0.0004998671289750386, 'samples': 679104, 'steps': 3536, 'loss/train': 2.357372283935547}}} 11/06/2021 21:45:38 - INFO - __main__ - Step 3537: {'lr': 0.0004998671289750386, 'samples': 679104, 'steps': 3536, 'loss/train': 2.357372283935547}}} 11/06/2021 21:45:41 - INFO - __main__ - Step 3544: {'lr': 0.0004998659152599381, 'samples': 680448, 'steps': 3543, 'loss/train': 2.9186902046203613}} 11/06/2021 21:45:44 - INFO - __main__ - Step 3548: {'lr': 0.0004998652192315644, 'samples': 681216, 'steps': 3547, 'loss/train': 1.8039276599884033}} 11/06/2021 21:45:46 - INFO - __main__ - Step 3552: {'lr': 0.000499864521401824, 'samples': 681984, 'steps': 3551, 'loss/train': 2.100595474243164}3}} 11/06/2021 21:45:48 - INFO - __main__ - Step 3556: {'lr': 0.0004998638217707222, 'samples': 682752, 'steps': 3555, 'loss/train': 2.264774799346924}}} 11/06/2021 21:45:50 - INFO - __main__ - Step 3560: {'lr': 0.0004998631203382639, 'samples': 683520, 'steps': 3559, 'loss/train': 2.074711561203003}}} 11/06/2021 21:45:51 - INFO - __main__ - Step 3564: {'lr': 0.0004998624171044541, 'samples': 684288, 'steps': 3563, 'loss/train': 1.9695250988006592}} 11/06/2021 21:45:53 - INFO - __main__ - Step 3568: {'lr': 0.0004998617120692977, 'samples': 685056, 'steps': 3567, 'loss/train': 1.8647109270095825}} 11/06/2021 21:45:56 - INFO - __main__ - Step 3573: {'lr': 0.0004998608282422169, 'samples': 686016, 'steps': 3572, 'loss/train': 2.195950746536255}}} 11/06/2021 21:45:56 - INFO - __main__ - Step 3573: {'lr': 0.0004998608282422169, 'samples': 686016, 'steps': 3572, 'loss/train': 2.195950746536255}}} 11/06/2021 21:45:59 - INFO - __main__ - Step 3580: {'lr': 0.0004998595861558016, 'samples': 687360, 'steps': 3579, 'loss/train': 2.119218111038208}}} 11/06/2021 21:46:01 - INFO - __main__ - Step 3584: {'lr': 0.0004998588739153108, 'samples': 688128, 'steps': 3583, 'loss/train': 1.7447315454483032}} 11/06/2021 21:46:04 - INFO - __main__ - Step 3589: {'lr': 0.0004998579810815905, 'samples': 689088, 'steps': 3588, 'loss/train': 2.701934576034546}}} 11/06/2021 21:46:04 - INFO - __main__ - Step 3589: {'lr': 0.0004998579810815905, 'samples': 689088, 'steps': 3588, 'loss/train': 2.701934576034546}}} 11/06/2021 21:46:08 - INFO - __main__ - Step 3597: {'lr': 0.0004998565466933702, 'samples': 690624, 'steps': 3596, 'loss/train': 2.6100289821624756}} 11/06/2021 21:46:10 - INFO - __main__ - Step 3601: {'lr': 0.0004998558267973013, 'samples': 691392, 'steps': 3600, 'loss/train': 2.1335763931274414}} 11/06/2021 21:46:11 - INFO - __main__ - Step 3605: {'lr': 0.0004998551050999336, 'samples': 692160, 'steps': 3604, 'loss/train': 1.8231139183044434}} 11/06/2021 21:46:13 - INFO - __main__ - Step 3609: {'lr': 0.0004998543816012723, 'samples': 692928, 'steps': 3608, 'loss/train': 2.1296401023864746}} 11/06/2021 21:46:16 - INFO - __main__ - Step 3614: {'lr': 0.0004998534746948843, 'samples': 693888, 'steps': 3613, 'loss/train': 1.9264954328536987}} 11/06/2021 21:46:16 - INFO - __main__ - Step 3614: {'lr': 0.0004998534746948843, 'samples': 693888, 'steps': 3613, 'loss/train': 1.9264954328536987}} 11/06/2021 21:46:19 - INFO - __main__ - Step 3621: {'lr': 0.0004998522002975783, 'samples': 695232, 'steps': 3620, 'loss/train': 2.4836816787719727}} 11/06/2021 21:46:21 - INFO - __main__ - Step 3625: {'lr': 0.0004998514695937945, 'samples': 696000, 'steps': 3624, 'loss/train': 2.3775951862335205}} 11/06/2021 21:46:21 - INFO - __main__ - Step 3625: {'lr': 0.0004998514695937945, 'samples': 696000, 'steps': 3624, 'loss/train': 2.3775951862335205}} 11/06/2021 21:46:25 - INFO - __main__ - Step 3633: {'lr': 0.0004998500027824298, 'samples': 697536, 'steps': 3632, 'loss/train': 1.352131724357605}}} 11/06/2021 21:46:27 - INFO - __main__ - Step 3637: {'lr': 0.0004998492666748594, 'samples': 698304, 'steps': 3636, 'loss/train': 2.1306369304656982}} 11/06/2021 21:46:30 - INFO - __main__ - Step 3642: {'lr': 0.0004998483440073871, 'samples': 699264, 'steps': 3641, 'loss/train': 2.3207335472106934}} 11/06/2021 21:46:32 - INFO - __main__ - Step 3646: {'lr': 0.0004998476038470082, 'samples': 700032, 'steps': 3645, 'loss/train': 2.1402628421783447}} 11/06/2021 21:46:34 - INFO - __main__ - Step 3650: {'lr': 0.0004998468618853896, 'samples': 700800, 'steps': 3649, 'loss/train': 2.1500542163848877}} 11/06/2021 21:46:34 - INFO - __main__ - Step 3650: {'lr': 0.0004998468618853896, 'samples': 700800, 'steps': 3649, 'loss/train': 2.1500542163848877}} 11/06/2021 21:46:37 - INFO - __main__ - Step 3657: {'lr': 0.0004998455591183406, 'samples': 702144, 'steps': 3656, 'loss/train': 2.267188310623169}}} 11/06/2021 21:46:40 - INFO - __main__ - Step 3662: {'lr': 0.00049984462519315, 'samples': 703104, 'steps': 3661, 'loss/train': 2.506985664367676}9}}} 11/06/2021 21:46:42 - INFO - __main__ - Step 3667: {'lr': 0.0004998436884535562, 'samples': 704064, 'steps': 3666, 'loss/train': 1.854463815689087}}} 11/06/2021 21:46:42 - INFO - __main__ - Step 3667: {'lr': 0.0004998436884535562, 'samples': 704064, 'steps': 3666, 'loss/train': 1.854463815689087}}} 11/06/2021 21:46:45 - INFO - __main__ - Step 3674: {'lr': 0.0004998423722899475, 'samples': 705408, 'steps': 3673, 'loss/train': 1.8246042728424072}} 11/06/2021 21:46:47 - INFO - __main__ - Step 3678: {'lr': 0.0004998416177198022, 'samples': 706176, 'steps': 3677, 'loss/train': 1.930174708366394}}} 11/06/2021 21:46:50 - INFO - __main__ - Step 3683: {'lr': 0.0004998406719741888, 'samples': 707136, 'steps': 3682, 'loss/train': 1.118072509765625}}} 11/06/2021 21:46:52 - INFO - __main__ - Step 3687: {'lr': 0.0004998399133513594, 'samples': 707904, 'steps': 3686, 'loss/train': 2.232158899307251}}} 11/06/2021 21:46:54 - INFO - __main__ - Step 3691: {'lr': 0.0004998391529273457, 'samples': 708672, 'steps': 3690, 'loss/train': 2.0140085220336914}} 11/06/2021 21:46:54 - INFO - __main__ - Step 3691: {'lr': 0.0004998391529273457, 'samples': 708672, 'steps': 3690, 'loss/train': 2.0140085220336914}} 11/06/2021 21:46:57 - INFO - __main__ - Step 3698: {'lr': 0.0004998378178512388, 'samples': 710016, 'steps': 3697, 'loss/train': 2.363736391067505}}} 11/06/2021 21:47:00 - INFO - __main__ - Step 3704: {'lr': 0.0004998366691099395, 'samples': 711168, 'steps': 3703, 'loss/train': 1.9696950912475586}} 11/06/2021 21:47:02 - INFO - __main__ - Step 3708: {'lr': 0.0004998359010309544, 'samples': 711936, 'steps': 3707, 'loss/train': 2.483283758163452}}} 11/06/2021 21:47:04 - INFO - __main__ - Step 3712: {'lr': 0.0004998351311508143, 'samples': 712704, 'steps': 3711, 'loss/train': 1.7308366298675537}} 11/06/2021 21:47:06 - INFO - __main__ - Step 3716: {'lr': 0.0004998343594695242, 'samples': 713472, 'steps': 3715, 'loss/train': 1.7858660221099854}} 11/06/2021 21:47:08 - INFO - __main__ - Step 3720: {'lr': 0.0004998335859870903, 'samples': 714240, 'steps': 3719, 'loss/train': 1.7998706102371216}} 11/06/2021 21:47:10 - INFO - __main__ - Step 3724: {'lr': 0.0004998328107035176, 'samples': 715008, 'steps': 3723, 'loss/train': 2.392589807510376}}} 11/06/2021 21:47:12 - INFO - __main__ - Step 3729: {'lr': 0.0004998318390662095, 'samples': 715968, 'steps': 3728, 'loss/train': 1.9741984605789185}} 11/06/2021 21:47:14 - INFO - __main__ - Step 3733: {'lr': 0.0004998310597300956, 'samples': 716736, 'steps': 3732, 'loss/train': 2.505441665649414}}} 11/06/2021 21:47:14 - INFO - __main__ - Step 3733: {'lr': 0.0004998310597300956, 'samples': 716736, 'steps': 3732, 'loss/train': 2.505441665649414}}} 11/06/2021 21:47:17 - INFO - __main__ - Step 3740: {'lr': 0.0004998296915579539, 'samples': 718080, 'steps': 3739, 'loss/train': 2.2116055488586426}} 11/06/2021 21:47:20 - INFO - __main__ - Step 3745: {'lr': 0.0004998287109150547, 'samples': 719040, 'steps': 3744, 'loss/train': 1.8798199892044067}} 11/06/2021 21:47:22 - INFO - __main__ - Step 3750: {'lr': 0.0004998277274579313, 'samples': 720000, 'steps': 3749, 'loss/train': 2.172743558883667}}} 11/06/2021 21:47:24 - INFO - __main__ - Step 3754: {'lr': 0.0004998269386659988, 'samples': 720768, 'steps': 3753, 'loss/train': 2.281588077545166}}} 11/06/2021 21:47:26 - INFO - __main__ - Step 3758: {'lr': 0.0004998261480729755, 'samples': 721536, 'steps': 3757, 'loss/train': 2.4779069423675537}} 11/06/2021 21:47:28 - INFO - __main__ - Step 3762: {'lr': 0.0004998253556788675, 'samples': 722304, 'steps': 3761, 'loss/train': 2.088002920150757}}} 11/06/2021 21:47:30 - INFO - __main__ - Step 3766: {'lr': 0.0004998245614836802, 'samples': 723072, 'steps': 3765, 'loss/train': 2.278775930404663}}} 11/06/2021 21:47:32 - INFO - __main__ - Step 3771: {'lr': 0.0004998235662069372, 'samples': 724032, 'steps': 3770, 'loss/train': 1.7649283409118652}} 11/06/2021 21:47:32 - INFO - __main__ - Step 3771: {'lr': 0.0004998235662069372, 'samples': 724032, 'steps': 3770, 'loss/train': 1.7649283409118652}} 11/06/2021 21:47:36 - INFO - __main__ - Step 3778: {'lr': 0.0004998221680917004, 'samples': 725376, 'steps': 3777, 'loss/train': 2.202800989151001}}} 11/06/2021 21:47:36 - INFO - __main__ - Step 3778: {'lr': 0.0004998221680917004, 'samples': 725376, 'steps': 3777, 'loss/train': 2.202800989151001}}} 11/06/2021 21:47:40 - INFO - __main__ - Step 3786: {'lr': 0.0004998205634917566, 'samples': 726912, 'steps': 3785, 'loss/train': 2.1383769512176514}} 11/06/2021 21:47:42 - INFO - __main__ - Step 3791: {'lr': 0.0004998195569584168, 'samples': 727872, 'steps': 3790, 'loss/train': 2.279456377029419}}} 11/06/2021 21:47:45 - INFO - __main__ - Step 3796: {'lr': 0.000499818547610956, 'samples': 728832, 'steps': 3795, 'loss/train': 1.8099079132080078}}} 11/06/2021 21:47:45 - INFO - __main__ - Step 3796: {'lr': 0.000499818547610956, 'samples': 728832, 'steps': 3795, 'loss/train': 1.8099079132080078}}} 11/06/2021 21:47:48 - INFO - __main__ - Step 3803: {'lr': 0.0004998171297968095, 'samples': 730176, 'steps': 3802, 'loss/train': 2.3628175258636475}} 11/06/2021 21:47:50 - INFO - __main__ - Step 3807: {'lr': 0.0004998163171408928, 'samples': 730944, 'steps': 3806, 'loss/train': 2.0892038345336914}} 11/06/2021 21:47:50 - INFO - __main__ - Step 3807: {'lr': 0.0004998163171408928, 'samples': 730944, 'steps': 3806, 'loss/train': 2.0892038345336914}} 11/06/2021 21:47:55 - INFO - __main__ - Step 3815: {'lr': 0.0004998146864260231, 'samples': 732480, 'steps': 3814, 'loss/train': 2.178349733352661}}} 11/06/2021 21:47:56 - INFO - __main__ - Step 3819: {'lr': 0.000499813868367082, 'samples': 733248, 'steps': 3818, 'loss/train': 1.899258017539978}}}} 11/06/2021 21:47:58 - INFO - __main__ - Step 3823: {'lr': 0.0004998130485071444, 'samples': 734016, 'steps': 3822, 'loss/train': 2.2223479747772217}} 11/06/2021 21:48:00 - INFO - __main__ - Step 3828: {'lr': 0.0004998120211495803, 'samples': 734976, 'steps': 3827, 'loss/train': 1.8643083572387695}} 11/06/2021 21:48:00 - INFO - __main__ - Step 3828: {'lr': 0.0004998120211495803, 'samples': 734976, 'steps': 3827, 'loss/train': 1.8643083572387695}} 11/06/2021 21:48:05 - INFO - __main__ - Step 3836: {'lr': 0.0004998103715242875, 'samples': 736512, 'steps': 3835, 'loss/train': 2.376988649368286}}} 11/06/2021 21:48:06 - INFO - __main__ - Step 3840: {'lr': 0.0004998095440101815, 'samples': 737280, 'steps': 3839, 'loss/train': 2.203437089920044}}} 11/06/2021 21:48:08 - INFO - __main__ - Step 3844: {'lr': 0.0004998087146951101, 'samples': 738048, 'steps': 3843, 'loss/train': 0.9520333409309387}} 11/06/2021 21:48:11 - INFO - __main__ - Step 3849: {'lr': 0.0004998076755186727, 'samples': 739008, 'steps': 3848, 'loss/train': 1.7136802673339844}} 11/06/2021 21:48:13 - INFO - __main__ - Step 3854: {'lr': 0.0004998066335282483, 'samples': 739968, 'steps': 3853, 'loss/train': 1.9457613229751587}} 11/06/2021 21:48:13 - INFO - __main__ - Step 3854: {'lr': 0.0004998066335282483, 'samples': 739968, 'steps': 3853, 'loss/train': 1.9457613229751587}} 11/06/2021 21:48:16 - INFO - __main__ - Step 3860: {'lr': 0.0004998053794252925, 'samples': 741120, 'steps': 3859, 'loss/train': 1.1343226432800293}} 11/06/2021 21:48:18 - INFO - __main__ - Step 3865: {'lr': 0.0004998043312441378, 'samples': 742080, 'steps': 3864, 'loss/train': 2.002265214920044}}} 11/06/2021 21:48:18 - INFO - __main__ - Step 3865: {'lr': 0.0004998043312441378, 'samples': 742080, 'steps': 3864, 'loss/train': 2.002265214920044}}} 11/06/2021 21:48:22 - INFO - __main__ - Step 3873: {'lr': 0.0004998026483012803, 'samples': 743616, 'steps': 3872, 'loss/train': 0.3874906599521637}} 11/06/2021 21:48:22 - INFO - __main__ - Step 3873: {'lr': 0.0004998026483012803, 'samples': 743616, 'steps': 3872, 'loss/train': 0.3874906599521637}} 11/06/2021 21:48:26 - INFO - __main__ - Step 3880: {'lr': 0.0004998011698170245, 'samples': 744960, 'steps': 3879, 'loss/train': 2.270320415496826}}} 11/06/2021 21:48:26 - INFO - __main__ - Step 3880: {'lr': 0.0004998011698170245, 'samples': 744960, 'steps': 3879, 'loss/train': 2.270320415496826}}} 11/06/2021 21:48:30 - INFO - __main__ - Step 3888: {'lr': 0.0004997994733673409, 'samples': 746496, 'steps': 3887, 'loss/train': 2.0216381549835205}} 11/06/2021 21:48:32 - INFO - __main__ - Step 3892: {'lr': 0.0004997986224411571, 'samples': 747264, 'steps': 3891, 'loss/train': 1.900480031967163}}} 11/06/2021 21:48:34 - INFO - __main__ - Step 3897: {'lr': 0.0004997975562509315, 'samples': 748224, 'steps': 3896, 'loss/train': 1.9882245063781738}} 11/06/2021 21:48:37 - INFO - __main__ - Step 3902: {'lr': 0.0004997964872468327, 'samples': 749184, 'steps': 3901, 'loss/train': 2.4843428134918213}} 11/06/2021 21:48:39 - INFO - __main__ - Step 3906: {'lr': 0.0004997956300175732, 'samples': 749952, 'steps': 3905, 'loss/train': 2.6430604457855225}} 11/06/2021 21:48:39 - INFO - __main__ - Step 3906: {'lr': 0.0004997956300175732, 'samples': 749952, 'steps': 3905, 'loss/train': 2.6430604457855225}} 11/06/2021 21:48:42 - INFO - __main__ - Step 3913: {'lr': 0.0004997941255330416, 'samples': 751296, 'steps': 3912, 'loss/train': 1.9108836650848389}} 11/06/2021 21:48:44 - INFO - __main__ - Step 3918: {'lr': 0.00049979304752463, 'samples': 752256, 'steps': 3917, 'loss/train': 1.5085346698760986}9}} 11/06/2021 21:48:47 - INFO - __main__ - Step 3923: {'lr': 0.0004997919667023962, 'samples': 753216, 'steps': 3922, 'loss/train': 6.834549903869629}}} 11/06/2021 21:48:47 - INFO - __main__ - Step 3923: {'lr': 0.0004997919667023962, 'samples': 753216, 'steps': 3922, 'loss/train': 6.834549903869629}}} 11/06/2021 21:48:50 - INFO - __main__ - Step 3930: {'lr': 0.0004997904488240704, 'samples': 754560, 'steps': 3929, 'loss/train': 1.9056246280670166}} 11/06/2021 21:48:52 - INFO - __main__ - Step 3934: {'lr': 0.000499789578988887, 'samples': 755328, 'steps': 3933, 'loss/train': 2.532392740249634}6}} 11/06/2021 21:48:54 - INFO - __main__ - Step 3939: {'lr': 0.0004997884891625037, 'samples': 756288, 'steps': 3938, 'loss/train': 2.200429916381836}}} 11/06/2021 21:48:57 - INFO - __main__ - Step 3944: {'lr': 0.0004997873965223495, 'samples': 757248, 'steps': 3943, 'loss/train': 2.6807804107666016}} 11/06/2021 21:48:57 - INFO - __main__ - Step 3944: {'lr': 0.0004997873965223495, 'samples': 757248, 'steps': 3943, 'loss/train': 2.6807804107666016}} 11/06/2021 21:49:00 - INFO - __main__ - Step 3951: {'lr': 0.0004997858620990217, 'samples': 758592, 'steps': 3950, 'loss/train': 1.9574391841888428}} 11/06/2021 21:49:02 - INFO - __main__ - Step 3955: {'lr': 0.0004997849828095969, 'samples': 759360, 'steps': 3954, 'loss/train': 2.0461413860321045}} 11/06/2021 21:49:05 - INFO - __main__ - Step 3960: {'lr': 0.0004997838811654584, 'samples': 760320, 'steps': 3959, 'loss/train': 2.6631009578704834}} 11/06/2021 21:49:07 - INFO - __main__ - Step 3964: {'lr': 0.0004997829978242693, 'samples': 761088, 'steps': 3963, 'loss/train': 1.9565014839172363}} 11/06/2021 21:49:09 - INFO - __main__ - Step 3968: {'lr': 0.0004997821126823062, 'samples': 761856, 'steps': 3967, 'loss/train': 2.048008680343628}}} 11/06/2021 21:49:10 - INFO - __main__ - Step 3972: {'lr': 0.0004997812257395758, 'samples': 762624, 'steps': 3971, 'loss/train': 0.8743030428886414}} 11/06/2021 21:49:12 - INFO - __main__ - Step 3976: {'lr': 0.0004997803369960844, 'samples': 763392, 'steps': 3975, 'loss/train': 1.4740993976593018}} 11/06/2021 21:49:15 - INFO - __main__ - Step 3981: {'lr': 0.0004997792235344096, 'samples': 764352, 'steps': 3980, 'loss/train': 2.1326520442962646}} 11/06/2021 21:49:17 - INFO - __main__ - Step 3985: {'lr': 0.0004997783307392292, 'samples': 765120, 'steps': 3984, 'loss/train': 1.8312244415283203}} 11/06/2021 21:49:19 - INFO - __main__ - Step 3989: {'lr': 0.0004997774361433086, 'samples': 765888, 'steps': 3988, 'loss/train': 2.3569905757904053}} 11/06/2021 21:49:19 - INFO - __main__ - Step 3989: {'lr': 0.0004997774361433086, 'samples': 765888, 'steps': 3988, 'loss/train': 2.3569905757904053}} 11/06/2021 21:49:22 - INFO - __main__ - Step 3996: {'lr': 0.000499775866267436, 'samples': 767232, 'steps': 3995, 'loss/train': 2.0774638652801514}}} 11/06/2021 21:49:25 - INFO - __main__ - Step 4001: {'lr': 0.0004997747415511704, 'samples': 768192, 'steps': 4000, 'loss/train': 2.3865301609039307}} 11/06/2021 21:49:27 - INFO - __main__ - Step 4005: {'lr': 0.0004997738397523537, 'samples': 768960, 'steps': 4004, 'loss/train': 2.0718185901641846}} 11/06/2021 21:49:29 - INFO - __main__ - Step 4009: {'lr': 0.0004997729361528292, 'samples': 769728, 'steps': 4008, 'loss/train': 2.3731486797332764}} 11/06/2021 21:49:30 - INFO - __main__ - Step 4013: {'lr': 0.0004997720307526034, 'samples': 770496, 'steps': 4012, 'loss/train': 2.1763927936553955}} 11/06/2021 21:49:32 - INFO - __main__ - Step 4017: {'lr': 0.0004997711235516829, 'samples': 771264, 'steps': 4016, 'loss/train': 1.86122727394104}5}} 11/06/2021 21:49:35 - INFO - __main__ - Step 4022: {'lr': 0.0004997699870183151, 'samples': 772224, 'steps': 4021, 'loss/train': 2.2087340354919434}} 11/06/2021 21:49:37 - INFO - __main__ - Step 4026: {'lr': 0.0004997690757658552, 'samples': 772992, 'steps': 4025, 'loss/train': 1.9826215505599976}} 11/06/2021 21:49:39 - INFO - __main__ - Step 4030: {'lr': 0.000499768162712722, 'samples': 773760, 'steps': 4029, 'loss/train': 2.1849615573883057}}} 11/06/2021 21:49:40 - INFO - __main__ - Step 4034: {'lr': 0.0004997672478589219, 'samples': 774528, 'steps': 4033, 'loss/train': 1.615881085395813}}} 11/06/2021 21:49:42 - INFO - __main__ - Step 4038: {'lr': 0.0004997663312044614, 'samples': 775296, 'steps': 4037, 'loss/train': 1.8073010444641113}} 11/06/2021 21:49:45 - INFO - __main__ - Step 4043: {'lr': 0.0004997651828542173, 'samples': 776256, 'steps': 4042, 'loss/train': 2.349440336227417}}} 11/06/2021 21:49:47 - INFO - __main__ - Step 4047: {'lr': 0.0004997642621482955, 'samples': 777024, 'steps': 4046, 'loss/train': 3.536803960800171}}} 11/06/2021 21:49:49 - INFO - __main__ - Step 4051: {'lr': 0.0004997633396417348, 'samples': 777792, 'steps': 4050, 'loss/train': 2.1511425971984863}} 11/06/2021 21:49:51 - INFO - __main__ - Step 4055: {'lr': 0.000499762415334542, 'samples': 778560, 'steps': 4054, 'loss/train': 1.871009111404419}3}} 11/06/2021 21:49:52 - INFO - __main__ - Step 4059: {'lr': 0.0004997614892267238, 'samples': 779328, 'steps': 4058, 'loss/train': 1.7731740474700928}} 11/06/2021 21:49:55 - INFO - __main__ - Step 4064: {'lr': 0.0004997603290598317, 'samples': 780288, 'steps': 4063, 'loss/train': 1.8180835247039795}} 11/06/2021 21:49:57 - INFO - __main__ - Step 4068: {'lr': 0.0004997593989006306, 'samples': 781056, 'steps': 4067, 'loss/train': 1.7563910484313965}} 11/06/2021 21:49:57 - INFO - __main__ - Step 4068: {'lr': 0.0004997593989006306, 'samples': 781056, 'steps': 4067, 'loss/train': 1.7563910484313965}} 11/06/2021 21:50:00 - INFO - __main__ - Step 4074: {'lr': 0.0004997580002856993, 'samples': 782208, 'steps': 4073, 'loss/train': 2.6347973346710205}} 11/06/2021 21:50:02 - INFO - __main__ - Step 4079: {'lr': 0.0004997568316784852, 'samples': 783168, 'steps': 4078, 'loss/train': 2.6030142307281494}} 11/06/2021 21:50:02 - INFO - __main__ - Step 4079: {'lr': 0.0004997568316784852, 'samples': 783168, 'steps': 4078, 'loss/train': 2.6030142307281494}} 11/06/2021 21:50:07 - INFO - __main__ - Step 4087: {'lr': 0.0004997549560550464, 'samples': 784704, 'steps': 4086, 'loss/train': 2.3296737670898438}} 11/06/2021 21:50:08 - INFO - __main__ - Step 4091: {'lr': 0.000499754015542466, 'samples': 785472, 'steps': 4090, 'loss/train': 2.175802707672119}8}} 11/06/2021 21:50:10 - INFO - __main__ - Step 4095: {'lr': 0.0004997530732293209, 'samples': 786240, 'steps': 4094, 'loss/train': 1.8951164484024048}} 11/06/2021 21:50:13 - INFO - __main__ - Step 4100: {'lr': 0.0004997518928058553, 'samples': 787200, 'steps': 4099, 'loss/train': 1.9039506912231445}} 11/06/2021 21:50:15 - INFO - __main__ - Step 4104: {'lr': 0.0004997509464414639, 'samples': 787968, 'steps': 4103, 'loss/train': 1.8288829326629639}} 11/06/2021 21:50:17 - INFO - __main__ - Step 4108: {'lr': 0.0004997499982765299, 'samples': 788736, 'steps': 4107, 'loss/train': 2.3028995990753174}} 11/06/2021 21:50:17 - INFO - __main__ - Step 4108: {'lr': 0.0004997499982765299, 'samples': 788736, 'steps': 4107, 'loss/train': 2.3028995990753174}} 11/06/2021 21:50:20 - INFO - __main__ - Step 4115: {'lr': 0.0004997483346553597, 'samples': 790080, 'steps': 4114, 'loss/train': 2.204193592071533}}} 11/06/2021 21:50:23 - INFO - __main__ - Step 4120: {'lr': 0.0004997471429785394, 'samples': 791040, 'steps': 4119, 'loss/train': 2.340329170227051}}} 11/06/2021 21:50:25 - INFO - __main__ - Step 4125: {'lr': 0.0004997459484884139, 'samples': 792000, 'steps': 4124, 'loss/train': 2.5181665420532227}} 11/06/2021 21:50:27 - INFO - __main__ - Step 4129: {'lr': 0.0004997449908707428, 'samples': 792768, 'steps': 4128, 'loss/train': 1.6142773628234863}} 11/06/2021 21:50:29 - INFO - __main__ - Step 4133: {'lr': 0.0004997440314525718, 'samples': 793536, 'steps': 4132, 'loss/train': 1.7447444200515747}} 11/06/2021 21:50:30 - INFO - __main__ - Step 4137: {'lr': 0.000499743070233908, 'samples': 794304, 'steps': 4136, 'loss/train': 2.2369892597198486}}} 11/06/2021 21:50:33 - INFO - __main__ - Step 4141: {'lr': 0.0004997421072147581, 'samples': 795072, 'steps': 4140, 'loss/train': 2.0105512142181396}} 11/06/2021 21:50:35 - INFO - __main__ - Step 4146: {'lr': 0.0004997409009088979, 'samples': 796032, 'steps': 4145, 'loss/train': 2.225456476211548}}} 11/06/2021 21:50:35 - INFO - __main__ - Step 4146: {'lr': 0.0004997409009088979, 'samples': 796032, 'steps': 4145, 'loss/train': 2.225456476211548}}} 11/06/2021 21:50:39 - INFO - __main__ - Step 4154: {'lr': 0.0004997389649679987, 'samples': 797568, 'steps': 4153, 'loss/train': 1.9823795557022095}} 11/06/2021 21:50:41 - INFO - __main__ - Step 4158: {'lr': 0.0004997379942968611, 'samples': 798336, 'steps': 4157, 'loss/train': 2.2602744102478027}} 11/06/2021 21:50:42 - INFO - __main__ - Step 4162: {'lr': 0.0004997370218252741, 'samples': 799104, 'steps': 4161, 'loss/train': 1.6607433557510376}} 11/06/2021 21:50:45 - INFO - __main__ - Step 4166: {'lr': 0.0004997360475532447, 'samples': 799872, 'steps': 4165, 'loss/train': 2.014327049255371}}} 11/06/2021 21:50:47 - INFO - __main__ - Step 4171: {'lr': 0.0004997348271813466, 'samples': 800832, 'steps': 4170, 'loss/train': 2.2488949298858643}} 11/06/2021 21:50:47 - INFO - __main__ - Step 4171: {'lr': 0.0004997348271813466, 'samples': 800832, 'steps': 4170, 'loss/train': 2.2488949298858643}} 11/06/2021 21:50:51 - INFO - __main__ - Step 4179: {'lr': 0.000499732868734929, 'samples': 802368, 'steps': 4178, 'loss/train': 1.4197605848312378}}} 11/06/2021 21:50:53 - INFO - __main__ - Step 4183: {'lr': 0.0004997318868110981, 'samples': 803136, 'steps': 4182, 'loss/train': 2.6250104904174805}} 11/06/2021 21:50:55 - INFO - __main__ - Step 4187: {'lr': 0.0004997309030868617, 'samples': 803904, 'steps': 4186, 'loss/train': 1.2969268560409546}} 11/06/2021 21:50:57 - INFO - __main__ - Step 4192: {'lr': 0.000499729670899757, 'samples': 804864, 'steps': 4191, 'loss/train': 2.5529255867004395}}} 11/06/2021 21:50:57 - INFO - __main__ - Step 4192: {'lr': 0.000499729670899757, 'samples': 804864, 'steps': 4191, 'loss/train': 2.5529255867004395}}} 11/06/2021 21:51:01 - INFO - __main__ - Step 4199: {'lr': 0.0004997279411117916, 'samples': 806208, 'steps': 4198, 'loss/train': 0.3961840569972992}} 11/06/2021 21:51:03 - INFO - __main__ - Step 4203: {'lr': 0.000499726950186005, 'samples': 806976, 'steps': 4202, 'loss/train': 2.567746639251709}2}} 11/06/2021 21:51:05 - INFO - __main__ - Step 4208: {'lr': 0.0004997257089970024, 'samples': 807936, 'steps': 4207, 'loss/train': 2.0918617248535156}} 11/06/2021 21:51:07 - INFO - __main__ - Step 4212: {'lr': 0.0004997247140203939, 'samples': 808704, 'steps': 4211, 'loss/train': 2.0272629261016846}} 11/06/2021 21:51:09 - INFO - __main__ - Step 4216: {'lr': 0.0004997237172434316, 'samples': 809472, 'steps': 4215, 'loss/train': 1.893647313117981}}} 11/06/2021 21:51:09 - INFO - __main__ - Step 4216: {'lr': 0.0004997237172434316, 'samples': 809472, 'steps': 4215, 'loss/train': 1.893647313117981}}} 11/06/2021 21:51:13 - INFO - __main__ - Step 4223: {'lr': 0.0004997219685516684, 'samples': 810816, 'steps': 4222, 'loss/train': 2.0050837993621826}} 11/06/2021 21:51:15 - INFO - __main__ - Step 4228: {'lr': 0.0004997207161104951, 'samples': 811776, 'steps': 4227, 'loss/train': 2.2613799571990967}} 11/06/2021 21:51:15 - INFO - __main__ - Step 4228: {'lr': 0.0004997207161104951, 'samples': 811776, 'steps': 4227, 'loss/train': 2.2613799571990967}} 11/06/2021 21:51:19 - INFO - __main__ - Step 4236: {'lr': 0.0004997187063535679, 'samples': 813312, 'steps': 4235, 'loss/train': 2.289562463760376}}} 11/06/2021 21:51:21 - INFO - __main__ - Step 4240: {'lr': 0.0004997176987746352, 'samples': 814080, 'steps': 4239, 'loss/train': 2.1802866458892822}} 11/06/2021 21:51:21 - INFO - __main__ - Step 4240: {'lr': 0.0004997176987746352, 'samples': 814080, 'steps': 4239, 'loss/train': 2.1802866458892822}} 11/06/2021 21:51:25 - INFO - __main__ - Step 4248: {'lr': 0.0004997156782158679, 'samples': 815616, 'steps': 4247, 'loss/train': 2.169619083404541}}} 11/06/2021 21:51:27 - INFO - __main__ - Step 4252: {'lr': 0.0004997146652360478, 'samples': 816384, 'steps': 4251, 'loss/train': 1.8407737016677856}} 11/06/2021 21:51:29 - INFO - __main__ - Step 4256: {'lr': 0.0004997136504559465, 'samples': 817152, 'steps': 4255, 'loss/train': 1.9230844974517822}} 11/06/2021 21:51:31 - INFO - __main__ - Step 4260: {'lr': 0.0004997126338755714, 'samples': 817920, 'steps': 4259, 'loss/train': 2.247880697250366}}} 11/06/2021 21:51:31 - INFO - __main__ - Step 4260: {'lr': 0.0004997126338755714, 'samples': 817920, 'steps': 4259, 'loss/train': 2.247880697250366}}} 11/06/2021 21:51:35 - INFO - __main__ - Step 4268: {'lr': 0.0004997105953140288, 'samples': 819456, 'steps': 4267, 'loss/train': 1.6954803466796875}} 11/06/2021 21:51:37 - INFO - __main__ - Step 4272: {'lr': 0.0004997095733328761, 'samples': 820224, 'steps': 4271, 'loss/train': 2.232881784439087}}} 11/06/2021 21:51:39 - INFO - __main__ - Step 4276: {'lr': 0.0004997085495514788, 'samples': 820992, 'steps': 4275, 'loss/train': 2.0807552337646484}} 11/06/2021 21:51:41 - INFO - __main__ - Step 4281: {'lr': 0.0004997072672931497, 'samples': 821952, 'steps': 4280, 'loss/train': 2.4467787742614746}} 11/06/2021 21:51:44 - INFO - __main__ - Step 4285: {'lr': 0.0004997062394612293, 'samples': 822720, 'steps': 4284, 'loss/train': 2.1867868900299072}} 11/06/2021 21:51:46 - INFO - __main__ - Step 4289: {'lr': 0.0004997052098290886, 'samples': 823488, 'steps': 4288, 'loss/train': 2.015610694885254}}} 11/06/2021 21:51:47 - INFO - __main__ - Step 4293: {'lr': 0.0004997041783967348, 'samples': 824256, 'steps': 4292, 'loss/train': 2.276956558227539}}} 11/06/2021 21:51:49 - INFO - __main__ - Step 4297: {'lr': 0.0004997031451641754, 'samples': 825024, 'steps': 4296, 'loss/train': 1.8525234460830688}} 11/06/2021 21:51:52 - INFO - __main__ - Step 4302: {'lr': 0.0004997018510919483, 'samples': 825984, 'steps': 4301, 'loss/train': 1.1239334344863892}} 11/06/2021 21:51:54 - INFO - __main__ - Step 4306: {'lr': 0.0004997008138089536, 'samples': 826752, 'steps': 4305, 'loss/train': 2.289822816848755}}} 11/06/2021 21:51:56 - INFO - __main__ - Step 4310: {'lr': 0.0004996997747257775, 'samples': 827520, 'steps': 4309, 'loss/train': 2.347663640975952}}} 11/06/2021 21:51:57 - INFO - __main__ - Step 4314: {'lr': 0.0004996987338424276, 'samples': 828288, 'steps': 4313, 'loss/train': 2.120884418487549}}} 11/06/2021 21:51:59 - INFO - __main__ - Step 4318: {'lr': 0.0004996976911589114, 'samples': 829056, 'steps': 4317, 'loss/train': 1.8161555528640747}} 11/06/2021 21:52:02 - INFO - __main__ - Step 4323: {'lr': 0.0004996963852730436, 'samples': 830016, 'steps': 4322, 'loss/train': 1.9152145385742188}} 11/06/2021 21:52:04 - INFO - __main__ - Step 4327: {'lr': 0.0004996953385391806, 'samples': 830784, 'steps': 4326, 'loss/train': 2.2413148880004883}} 11/06/2021 21:52:04 - INFO - __main__ - Step 4327: {'lr': 0.0004996953385391806, 'samples': 830784, 'steps': 4326, 'loss/train': 2.2413148880004883}} 11/06/2021 21:52:07 - INFO - __main__ - Step 4334: {'lr': 0.0004996935024233335, 'samples': 832128, 'steps': 4333, 'loss/train': 2.192603349685669}}} 11/06/2021 21:52:09 - INFO - __main__ - Step 4338: {'lr': 0.0004996924507390985, 'samples': 832896, 'steps': 4337, 'loss/train': 2.24872088432312}}}} 11/06/2021 21:52:12 - INFO - __main__ - Step 4344: {'lr': 0.0004996908698375216, 'samples': 834048, 'steps': 4343, 'loss/train': 1.9386653900146484}} 11/06/2021 21:52:14 - INFO - __main__ - Step 4348: {'lr': 0.0004996898136529982, 'samples': 834816, 'steps': 4347, 'loss/train': 2.482868194580078}}} 11/06/2021 21:52:16 - INFO - __main__ - Step 4352: {'lr': 0.0004996887556683729, 'samples': 835584, 'steps': 4351, 'loss/train': 2.027623414993286}}} 11/06/2021 21:52:16 - INFO - __main__ - Step 4352: {'lr': 0.0004996887556683729, 'samples': 835584, 'steps': 4351, 'loss/train': 2.027623414993286}}} 11/06/2021 21:52:19 - INFO - __main__ - Step 4359: {'lr': 0.0004996868998638059, 'samples': 836928, 'steps': 4358, 'loss/train': 2.3234333992004395}} 11/06/2021 21:52:22 - INFO - __main__ - Step 4364: {'lr': 0.000499685570913961, 'samples': 837888, 'steps': 4363, 'loss/train': 1.978306770324707}5}} 11/06/2021 21:52:24 - INFO - __main__ - Step 4368: {'lr': 0.0004996845057290039, 'samples': 838656, 'steps': 4367, 'loss/train': 2.1156513690948486}} 11/06/2021 21:52:25 - INFO - __main__ - Step 4372: {'lr': 0.0004996834387439831, 'samples': 839424, 'steps': 4371, 'loss/train': 2.4186854362487793}} 11/06/2021 21:52:27 - INFO - __main__ - Step 4376: {'lr': 0.0004996823699589062, 'samples': 840192, 'steps': 4375, 'loss/train': 2.005566120147705}}} 11/06/2021 21:52:30 - INFO - __main__ - Step 4381: {'lr': 0.0004996810314462429, 'samples': 841152, 'steps': 4380, 'loss/train': 1.7387698888778687}} 11/06/2021 21:52:32 - INFO - __main__ - Step 4385: {'lr': 0.0004996799586110681, 'samples': 841920, 'steps': 4384, 'loss/train': 2.161149501800537}}} 11/06/2021 21:52:34 - INFO - __main__ - Step 4389: {'lr': 0.0004996788839758622, 'samples': 842688, 'steps': 4388, 'loss/train': 2.4196763038635254}} 11/06/2021 21:52:35 - INFO - __main__ - Step 4393: {'lr': 0.0004996778075406331, 'samples': 843456, 'steps': 4392, 'loss/train': 1.8474143743515015}} 11/06/2021 21:52:37 - INFO - __main__ - Step 4397: {'lr': 0.0004996767293053885, 'samples': 844224, 'steps': 4396, 'loss/train': 1.8675569295883179}} 11/06/2021 21:52:40 - INFO - __main__ - Step 4402: {'lr': 0.0004996753789800729, 'samples': 845184, 'steps': 4401, 'loss/train': 2.3262462615966797}} 11/06/2021 21:52:42 - INFO - __main__ - Step 4406: {'lr': 0.0004996742966948219, 'samples': 845952, 'steps': 4405, 'loss/train': 1.8191969394683838}} 11/06/2021 21:52:44 - INFO - __main__ - Step 4410: {'lr': 0.0004996732126095807, 'samples': 846720, 'steps': 4409, 'loss/train': 1.7854758501052856}} 11/06/2021 21:52:44 - INFO - __main__ - Step 4410: {'lr': 0.0004996732126095807, 'samples': 846720, 'steps': 4409, 'loss/train': 1.7854758501052856}} 11/06/2021 21:52:47 - INFO - __main__ - Step 4417: {'lr': 0.000499671311129206, 'samples': 848064, 'steps': 4416, 'loss/train': 1.7873398065567017}}} 11/06/2021 21:52:50 - INFO - __main__ - Step 4422: {'lr': 0.0004996699495539947, 'samples': 849024, 'steps': 4421, 'loss/train': 2.1597673892974854}} 11/06/2021 21:52:52 - INFO - __main__ - Step 4427: {'lr': 0.0004996685851663477, 'samples': 849984, 'steps': 4426, 'loss/train': 2.1259970664978027}} 11/06/2021 21:52:54 - INFO - __main__ - Step 4431: {'lr': 0.0004996674916312867, 'samples': 850752, 'steps': 4430, 'loss/train': 2.205284595489502}}} 11/06/2021 21:52:54 - INFO - __main__ - Step 4431: {'lr': 0.0004996674916312867, 'samples': 850752, 'steps': 4430, 'loss/train': 2.205284595489502}}} 11/06/2021 21:52:57 - INFO - __main__ - Step 4438: {'lr': 0.0004996655736138265, 'samples': 852096, 'steps': 4437, 'loss/train': 2.149383783340454}}} 11/06/2021 21:52:57 - INFO - __main__ - Step 4438: {'lr': 0.0004996655736138265, 'samples': 852096, 'steps': 4437, 'loss/train': 2.149383783340454}}} 11/06/2021 21:53:01 - INFO - __main__ - Step 4446: {'lr': 0.0004996633748441472, 'samples': 853632, 'steps': 4445, 'loss/train': 2.116128444671631}}} 11/06/2021 21:53:04 - INFO - __main__ - Step 4450: {'lr': 0.0004996622727594363, 'samples': 854400, 'steps': 4449, 'loss/train': 1.287934422492981}}} 11/06/2021 21:53:06 - INFO - __main__ - Step 4455: {'lr': 0.0004996608926224345, 'samples': 855360, 'steps': 4454, 'loss/train': 2.0416908264160156}} 11/06/2021 21:53:08 - INFO - __main__ - Step 4459: {'lr': 0.0004996597864879521, 'samples': 856128, 'steps': 4458, 'loss/train': 2.2517218589782715}} 11/06/2021 21:53:10 - INFO - __main__ - Step 4463: {'lr': 0.0004996586785535841, 'samples': 856896, 'steps': 4462, 'loss/train': 2.0258359909057617}} 11/06/2021 21:53:12 - INFO - __main__ - Step 4467: {'lr': 0.0004996575688193386, 'samples': 857664, 'steps': 4466, 'loss/train': 1.737178087234497}}} 11/06/2021 21:53:14 - INFO - __main__ - Step 4471: {'lr': 0.0004996564572852235, 'samples': 858432, 'steps': 4470, 'loss/train': 2.0802853107452393}} 11/06/2021 21:53:14 - INFO - __main__ - Step 4471: {'lr': 0.0004996564572852235, 'samples': 858432, 'steps': 4470, 'loss/train': 2.0802853107452393}} 11/06/2021 21:53:18 - INFO - __main__ - Step 4479: {'lr': 0.0004996542288174166, 'samples': 859968, 'steps': 4478, 'loss/train': 2.4320411682128906}} 11/06/2021 21:53:19 - INFO - __main__ - Step 4483: {'lr': 0.000499653111883741, 'samples': 860736, 'steps': 4482, 'loss/train': 2.1700544357299805}}} 11/06/2021 21:53:21 - INFO - __main__ - Step 4487: {'lr': 0.0004996519931502279, 'samples': 861504, 'steps': 4486, 'loss/train': 2.3324050903320312}} 11/06/2021 21:53:24 - INFO - __main__ - Step 4492: {'lr': 0.0004996505922023274, 'samples': 862464, 'steps': 4491, 'loss/train': 2.6008143424987793}} 11/06/2021 21:53:26 - INFO - __main__ - Step 4497: {'lr': 0.0004996491884422092, 'samples': 863424, 'steps': 4496, 'loss/train': 2.276155948638916}}} 11/06/2021 21:53:28 - INFO - __main__ - Step 4501: {'lr': 0.0004996480634093287, 'samples': 864192, 'steps': 4500, 'loss/train': 1.751892328262329}}} 11/06/2021 21:53:30 - INFO - __main__ - Step 4505: {'lr': 0.0004996469365766471, 'samples': 864960, 'steps': 4504, 'loss/train': 1.7238401174545288}} 11/06/2021 21:53:32 - INFO - __main__ - Step 4509: {'lr': 0.0004996458079441727, 'samples': 865728, 'steps': 4508, 'loss/train': 1.8585339784622192}} 11/06/2021 21:53:34 - INFO - __main__ - Step 4513: {'lr': 0.0004996446775119134, 'samples': 866496, 'steps': 4512, 'loss/train': 1.8141558170318604}} 11/06/2021 21:53:36 - INFO - __main__ - Step 4517: {'lr': 0.0004996435452798775, 'samples': 867264, 'steps': 4516, 'loss/train': 1.8776859045028687}} 11/06/2021 21:53:38 - INFO - __main__ - Step 4522: {'lr': 0.0004996421274589091, 'samples': 868224, 'steps': 4521, 'loss/train': 2.311401128768921}}} 11/06/2021 21:53:40 - INFO - __main__ - Step 4526: {'lr': 0.0004996409911774056, 'samples': 868992, 'steps': 4525, 'loss/train': 1.9752427339553833}} 11/06/2021 21:53:40 - INFO - __main__ - Step 4526: {'lr': 0.0004996409911774056, 'samples': 868992, 'steps': 4525, 'loss/train': 1.9752427339553833}} 11/06/2021 21:53:44 - INFO - __main__ - Step 4533: {'lr': 0.000499638998354131, 'samples': 870336, 'steps': 4532, 'loss/train': 2.118048667907715}3}} 11/06/2021 21:53:46 - INFO - __main__ - Step 4538: {'lr': 0.0004996375715344278, 'samples': 871296, 'steps': 4537, 'loss/train': 2.1683661937713623}} 11/06/2021 21:53:46 - INFO - __main__ - Step 4538: {'lr': 0.0004996375715344278, 'samples': 871296, 'steps': 4537, 'loss/train': 2.1683661937713623}} 11/06/2021 21:53:49 - INFO - __main__ - Step 4545: {'lr': 0.0004996355692625678, 'samples': 872640, 'steps': 4544, 'loss/train': 2.3875391483306885}} 11/06/2021 21:53:51 - INFO - __main__ - Step 4549: {'lr': 0.0004996344226326137, 'samples': 873408, 'steps': 4548, 'loss/train': 2.1202685832977295}} 11/06/2021 21:53:54 - INFO - __main__ - Step 4554: {'lr': 0.0004996329868143404, 'samples': 874368, 'steps': 4553, 'loss/train': 2.062023639678955}}} 11/06/2021 21:53:54 - INFO - __main__ - Step 4554: {'lr': 0.0004996329868143404, 'samples': 874368, 'steps': 4553, 'loss/train': 2.062023639678955}}} 11/06/2021 21:53:58 - INFO - __main__ - Step 4562: {'lr': 0.0004996306836561094, 'samples': 875904, 'steps': 4561, 'loss/train': 2.1488685607910156}} 11/06/2021 21:53:59 - INFO - __main__ - Step 4566: {'lr': 0.0004996295293774762, 'samples': 876672, 'steps': 4565, 'loss/train': 2.0341947078704834}} 11/06/2021 21:54:01 - INFO - __main__ - Step 4570: {'lr': 0.0004996283732991755, 'samples': 877440, 'steps': 4569, 'loss/train': 2.2277791500091553}} 11/06/2021 21:54:04 - INFO - __main__ - Step 4575: {'lr': 0.0004996269256705301, 'samples': 878400, 'steps': 4574, 'loss/train': 2.103940486907959}}} 11/06/2021 21:54:06 - INFO - __main__ - Step 4580: {'lr': 0.0004996254752299337, 'samples': 879360, 'steps': 4579, 'loss/train': 1.8763165473937988}} 11/06/2021 21:54:06 - INFO - __main__ - Step 4580: {'lr': 0.0004996254752299337, 'samples': 879360, 'steps': 4579, 'loss/train': 1.8763165473937988}} 11/06/2021 21:54:10 - INFO - __main__ - Step 4587: {'lr': 0.0004996234398890521, 'samples': 880704, 'steps': 4586, 'loss/train': 1.3432775735855103}} 11/06/2021 21:54:10 - INFO - __main__ - Step 4587: {'lr': 0.0004996234398890521, 'samples': 880704, 'steps': 4586, 'loss/train': 1.3432775735855103}} 11/06/2021 21:54:13 - INFO - __main__ - Step 4595: {'lr': 0.0004996211070366018, 'samples': 882240, 'steps': 4594, 'loss/train': 2.191429376602173}}} 11/06/2021 21:54:13 - INFO - __main__ - Step 4595: {'lr': 0.0004996211070366018, 'samples': 882240, 'steps': 4594, 'loss/train': 2.191429376602173}}} 11/06/2021 21:54:18 - INFO - __main__ - Step 4604: {'lr': 0.0004996184739732291, 'samples': 883968, 'steps': 4603, 'loss/train': 2.1833322048187256}} 11/06/2021 21:54:18 - INFO - __main__ - Step 4604: {'lr': 0.0004996184739732291, 'samples': 883968, 'steps': 4603, 'loss/train': 2.1833322048187256}} 11/06/2021 21:54:21 - INFO - __main__ - Step 4612: {'lr': 0.0004996161258242025, 'samples': 885504, 'steps': 4611, 'loss/train': 2.7224349975585938}} 11/06/2021 21:54:24 - INFO - __main__ - Step 4617: {'lr': 0.0004996146545756786, 'samples': 886464, 'steps': 4616, 'loss/train': 1.9531595706939697}} 11/06/2021 21:54:26 - INFO - __main__ - Step 4621: {'lr': 0.0004996134755523532, 'samples': 887232, 'steps': 4620, 'loss/train': 1.572229027748108}}} 11/06/2021 21:54:28 - INFO - __main__ - Step 4626: {'lr': 0.0004996119992425782, 'samples': 888192, 'steps': 4625, 'loss/train': 2.1612305641174316}} 11/06/2021 21:54:30 - INFO - __main__ - Step 4630: {'lr': 0.0004996108161702736, 'samples': 888960, 'steps': 4629, 'loss/train': 1.8879029750823975}} 11/06/2021 21:54:30 - INFO - __main__ - Step 4630: {'lr': 0.0004996108161702736, 'samples': 888960, 'steps': 4629, 'loss/train': 1.8879029750823975}} 11/06/2021 21:54:33 - INFO - __main__ - Step 4637: {'lr': 0.0004996087414636207, 'samples': 890304, 'steps': 4636, 'loss/train': 1.917240858078003}}} 11/06/2021 21:54:36 - INFO - __main__ - Step 4642: {'lr': 0.000499607256156199, 'samples': 891264, 'steps': 4641, 'loss/train': 1.8053956031799316}}} 11/06/2021 21:54:38 - INFO - __main__ - Step 4647: {'lr': 0.000499605768037048, 'samples': 892224, 'steps': 4646, 'loss/train': 2.353457450866699}}}} 11/06/2021 21:54:38 - INFO - __main__ - Step 4647: {'lr': 0.000499605768037048, 'samples': 892224, 'steps': 4646, 'loss/train': 2.353457450866699}}}} 11/06/2021 21:54:42 - INFO - __main__ - Step 4654: {'lr': 0.000499603679946563, 'samples': 893568, 'steps': 4653, 'loss/train': 2.03910756111145}}}}} 11/06/2021 21:54:44 - INFO - __main__ - Step 4658: {'lr': 0.0004996024842777106, 'samples': 894336, 'steps': 4657, 'loss/train': 1.9979157447814941}} 11/06/2021 21:54:46 - INFO - __main__ - Step 4663: {'lr': 0.0004996009871611382, 'samples': 895296, 'steps': 4662, 'loss/train': 2.1121556758880615}} 11/06/2021 21:54:46 - INFO - __main__ - Step 4663: {'lr': 0.0004996009871611382, 'samples': 895296, 'steps': 4662, 'loss/train': 2.1121556758880615}} 11/06/2021 21:54:50 - INFO - __main__ - Step 4668: {'lr': 0.0004995994872329069, 'samples': 896256, 'steps': 4667, 'loss/train': 2.1744236946105957}} 11/06/2021 21:54:50 - INFO - __main__ - Step 4668: {'lr': 0.0004995994872329069, 'samples': 896256, 'steps': 4667, 'loss/train': 2.1744236946105957}} 11/06/2021 21:54:53 - INFO - __main__ - Step 4676: {'lr': 0.0004995970814995285, 'samples': 897792, 'steps': 4675, 'loss/train': 1.969705581665039}}} 11/06/2021 21:54:56 - INFO - __main__ - Step 4681: {'lr': 0.0004995955742610635, 'samples': 898752, 'steps': 4680, 'loss/train': 1.9930462837219238}} 11/06/2021 21:54:58 - INFO - __main__ - Step 4686: {'lr': 0.0004995940642110005, 'samples': 899712, 'steps': 4685, 'loss/train': 2.056427478790283}}} 11/06/2021 21:54:58 - INFO - __main__ - Step 4686: {'lr': 0.0004995940642110005, 'samples': 899712, 'steps': 4685, 'loss/train': 2.056427478790283}}} 11/06/2021 21:55:01 - INFO - __main__ - Step 4693: {'lr': 0.0004995919454174603, 'samples': 901056, 'steps': 4692, 'loss/train': 2.0533816814422607}} 11/06/2021 21:55:03 - INFO - __main__ - Step 4697: {'lr': 0.0004995907322041214, 'samples': 901824, 'steps': 4696, 'loss/train': 2.322920560836792}}} 11/06/2021 21:55:06 - INFO - __main__ - Step 4702: {'lr': 0.0004995892131570598, 'samples': 902784, 'steps': 4701, 'loss/train': 1.2235736846923828}} 11/06/2021 21:55:06 - INFO - __main__ - Step 4702: {'lr': 0.0004995892131570598, 'samples': 902784, 'steps': 4701, 'loss/train': 1.2235736846923828}} 11/06/2021 21:55:10 - INFO - __main__ - Step 4710: {'lr': 0.0004995867768337938, 'samples': 904320, 'steps': 4709, 'loss/train': 2.073693037033081}}} 11/06/2021 21:55:12 - INFO - __main__ - Step 4714: {'lr': 0.0004995855559731176, 'samples': 905088, 'steps': 4713, 'loss/train': 2.266838550567627}}} 11/06/2021 21:55:13 - INFO - __main__ - Step 4718: {'lr': 0.000499584333313091, 'samples': 905856, 'steps': 4717, 'loss/train': 1.7350718975067139}}} 11/06/2021 21:55:16 - INFO - __main__ - Step 4723: {'lr': 0.0004995828024577346, 'samples': 906816, 'steps': 4722, 'loss/train': 2.221000909805298}}} 11/06/2021 21:55:18 - INFO - __main__ - Step 4727: {'lr': 0.0004995815757492019, 'samples': 907584, 'steps': 4726, 'loss/train': 1.697227954864502}}} 11/06/2021 21:55:20 - INFO - __main__ - Step 4731: {'lr': 0.0004995803472413474, 'samples': 908352, 'steps': 4730, 'loss/train': 2.062716245651245}}} 11/06/2021 21:55:22 - INFO - __main__ - Step 4735: {'lr': 0.0004995791169341801, 'samples': 909120, 'steps': 4734, 'loss/train': 1.6100257635116577}} 11/06/2021 21:55:24 - INFO - __main__ - Step 4739: {'lr': 0.0004995778848277088, 'samples': 909888, 'steps': 4738, 'loss/train': 1.9893282651901245}} 11/06/2021 21:55:26 - INFO - __main__ - Step 4744: {'lr': 0.0004995763421643621, 'samples': 910848, 'steps': 4743, 'loss/train': 2.054396152496338}}} 11/06/2021 21:55:29 - INFO - __main__ - Step 4749: {'lr': 0.000499574796689634, 'samples': 911808, 'steps': 4748, 'loss/train': 1.5545167922973633}}} 11/06/2021 21:55:29 - INFO - __main__ - Step 4749: {'lr': 0.000499574796689634, 'samples': 911808, 'steps': 4748, 'loss/train': 1.5545167922973633}}} 11/06/2021 21:55:32 - INFO - __main__ - Step 4756: {'lr': 0.0004995726283019275, 'samples': 913152, 'steps': 4755, 'loss/train': 5.469394207000732}}} 11/06/2021 21:55:34 - INFO - __main__ - Step 4760: {'lr': 0.0004995713867492564, 'samples': 913920, 'steps': 4759, 'loss/train': 2.178823471069336}}} 11/06/2021 21:55:37 - INFO - __main__ - Step 4765: {'lr': 0.0004995698322782257, 'samples': 914880, 'steps': 4764, 'loss/train': 1.983769416809082}}} 11/06/2021 21:55:39 - INFO - __main__ - Step 4769: {'lr': 0.0004995685866772586, 'samples': 915648, 'steps': 4768, 'loss/train': 1.642903447151184}}} 11/06/2021 21:55:39 - INFO - __main__ - Step 4769: {'lr': 0.0004995685866772586, 'samples': 915648, 'steps': 4768, 'loss/train': 1.642903447151184}}} 11/06/2021 21:55:42 - INFO - __main__ - Step 4776: {'lr': 0.000499566402546179, 'samples': 916992, 'steps': 4775, 'loss/train': 1.7766259908676147}}} 11/06/2021 21:55:44 - INFO - __main__ - Step 4781: {'lr': 0.0004995648390790249, 'samples': 917952, 'steps': 4780, 'loss/train': 1.954245686531067}}} 11/06/2021 21:55:46 - INFO - __main__ - Step 4785: {'lr': 0.0004995635862811994, 'samples': 918720, 'steps': 4784, 'loss/train': 2.129288911819458}}} 11/06/2021 21:55:46 - INFO - __main__ - Step 4785: {'lr': 0.0004995635862811994, 'samples': 918720, 'steps': 4784, 'loss/train': 2.129288911819458}}} 11/06/2021 21:55:50 - INFO - __main__ - Step 4792: {'lr': 0.0004995613895557048, 'samples': 920064, 'steps': 4791, 'loss/train': 1.883157730102539}}} 11/06/2021 21:55:52 - INFO - __main__ - Step 4796: {'lr': 0.0004995601318101231, 'samples': 920832, 'steps': 4795, 'loss/train': 2.0214784145355225}} 11/06/2021 21:55:54 - INFO - __main__ - Step 4801: {'lr': 0.0004995585570980684, 'samples': 921792, 'steps': 4800, 'loss/train': 2.3950815200805664}} 11/06/2021 21:55:54 - INFO - __main__ - Step 4801: {'lr': 0.0004995585570980684, 'samples': 921792, 'steps': 4800, 'loss/train': 2.3950815200805664}} 11/06/2021 21:55:59 - INFO - __main__ - Step 4809: {'lr': 0.000499556031711532, 'samples': 923328, 'steps': 4808, 'loss/train': 2.4952619075775146}}} 11/06/2021 21:56:00 - INFO - __main__ - Step 4813: {'lr': 0.000499554766319553, 'samples': 924096, 'steps': 4812, 'loss/train': 1.8378645181655884}}} 11/06/2021 21:56:02 - INFO - __main__ - Step 4817: {'lr': 0.0004995534991284455, 'samples': 924864, 'steps': 4816, 'loss/train': 1.8735243082046509}} 11/06/2021 21:56:05 - INFO - __main__ - Step 4822: {'lr': 0.0004995519126095506, 'samples': 925824, 'steps': 4821, 'loss/train': 2.3707022666931152}} 11/06/2021 21:56:05 - INFO - __main__ - Step 4822: {'lr': 0.0004995519126095506, 'samples': 925824, 'steps': 4821, 'loss/train': 2.3707022666931152}} 11/06/2021 21:56:09 - INFO - __main__ - Step 4830: {'lr': 0.0004995493683322259, 'samples': 927360, 'steps': 4829, 'loss/train': 1.904067873954773}}} 11/06/2021 21:56:10 - INFO - __main__ - Step 4834: {'lr': 0.0004995480934949247, 'samples': 928128, 'steps': 4833, 'loss/train': 1.7859026193618774}} 11/06/2021 21:56:12 - INFO - __main__ - Step 4838: {'lr': 0.0004995468168585431, 'samples': 928896, 'steps': 4837, 'loss/train': 1.3020102977752686}} 11/06/2021 21:56:15 - INFO - __main__ - Step 4843: {'lr': 0.0004995452185331235, 'samples': 929856, 'steps': 4842, 'loss/train': 0.33405929803848267} 11/06/2021 21:56:17 - INFO - __main__ - Step 4847: {'lr': 0.0004995439378488449, 'samples': 930624, 'steps': 4846, 'loss/train': 1.7952702045440674}} 11/06/2021 21:56:19 - INFO - __main__ - Step 4851: {'lr': 0.0004995426553655159, 'samples': 931392, 'steps': 4850, 'loss/train': 1.6499428749084473}} 11/06/2021 21:56:20 - INFO - __main__ - Step 4855: {'lr': 0.0004995413710831458, 'samples': 932160, 'steps': 4854, 'loss/train': 2.0572216510772705}} 11/06/2021 21:56:22 - INFO - __main__ - Step 4859: {'lr': 0.0004995400850017438, 'samples': 932928, 'steps': 4858, 'loss/train': 1.930557131767273}}} 11/06/2021 21:56:25 - INFO - __main__ - Step 4864: {'lr': 0.000499538474870117, 'samples': 933888, 'steps': 4863, 'loss/train': 1.7647819519042969}}} 11/06/2021 21:56:27 - INFO - __main__ - Step 4868: {'lr': 0.0004995371847409273, 'samples': 934656, 'steps': 4867, 'loss/train': 0.9957026243209839}} 11/06/2021 21:56:29 - INFO - __main__ - Step 4872: {'lr': 0.0004995358928127359, 'samples': 935424, 'steps': 4871, 'loss/train': 1.3676517009735107}} 11/06/2021 21:56:30 - INFO - __main__ - Step 4876: {'lr': 0.0004995345990855522, 'samples': 936192, 'steps': 4875, 'loss/train': 2.013723850250244}}} 11/06/2021 21:56:32 - INFO - __main__ - Step 4880: {'lr': 0.0004995333035593853, 'samples': 936960, 'steps': 4879, 'loss/train': 1.7942310571670532}} 11/06/2021 21:56:35 - INFO - __main__ - Step 4885: {'lr': 0.0004995316816218712, 'samples': 937920, 'steps': 4884, 'loss/train': 2.102433681488037}}} 11/06/2021 21:56:37 - INFO - __main__ - Step 4890: {'lr': 0.000499530056873479, 'samples': 938880, 'steps': 4889, 'loss/train': 2.1150357723236084}}} 11/06/2021 21:56:37 - INFO - __main__ - Step 4890: {'lr': 0.000499530056873479, 'samples': 938880, 'steps': 4889, 'loss/train': 2.1150357723236084}}} 11/06/2021 21:56:41 - INFO - __main__ - Step 4897: {'lr': 0.0004995277775034894, 'samples': 940224, 'steps': 4896, 'loss/train': 1.8550939559936523}} 11/06/2021 21:56:42 - INFO - __main__ - Step 4901: {'lr': 0.0004995264725328151, 'samples': 940992, 'steps': 4900, 'loss/train': 1.1252635717391968}} 11/06/2021 21:56:44 - INFO - __main__ - Step 4905: {'lr': 0.0004995251657632165, 'samples': 941760, 'steps': 4904, 'loss/train': 1.6950825452804565}} 11/06/2021 21:56:47 - INFO - __main__ - Step 4910: {'lr': 0.0004995235297714951, 'samples': 942720, 'steps': 4909, 'loss/train': 1.769343614578247}}} 11/06/2021 21:56:49 - INFO - __main__ - Step 4914: {'lr': 0.0004995222189543509, 'samples': 943488, 'steps': 4913, 'loss/train': 6.304197311401367}}} 11/06/2021 21:56:51 - INFO - __main__ - Step 4918: {'lr': 0.0004995209063383129, 'samples': 944256, 'steps': 4917, 'loss/train': 1.607182264328003}}} 11/06/2021 21:56:53 - INFO - __main__ - Step 4922: {'lr': 0.0004995195919233906, 'samples': 945024, 'steps': 4921, 'loss/train': 2.1027209758758545}} 11/06/2021 21:56:55 - INFO - __main__ - Step 4926: {'lr': 0.0004995182757095935, 'samples': 945792, 'steps': 4925, 'loss/train': 2.3778295516967773}} 11/06/2021 21:56:55 - INFO - __main__ - Step 4926: {'lr': 0.0004995182757095935, 'samples': 945792, 'steps': 4925, 'loss/train': 2.3778295516967773}} 11/06/2021 21:56:58 - INFO - __main__ - Step 4933: {'lr': 0.0004995159680069346, 'samples': 947136, 'steps': 4932, 'loss/train': 1.8914155960083008}} 11/06/2021 21:57:01 - INFO - __main__ - Step 4937: {'lr': 0.0004995146468462806, 'samples': 947904, 'steps': 4936, 'loss/train': 1.576684832572937}}} 11/06/2021 21:57:03 - INFO - __main__ - Step 4942: {'lr': 0.0004995129928658466, 'samples': 948864, 'steps': 4941, 'loss/train': 1.9636731147766113}} 11/06/2021 21:57:03 - INFO - __main__ - Step 4942: {'lr': 0.0004995129928658466, 'samples': 948864, 'steps': 4941, 'loss/train': 1.9636731147766113}} 11/06/2021 21:57:07 - INFO - __main__ - Step 4950: {'lr': 0.0004995103406509713, 'samples': 950400, 'steps': 4949, 'loss/train': 1.9087032079696655}} 11/06/2021 21:57:09 - INFO - __main__ - Step 4954: {'lr': 0.0004995090118453167, 'samples': 951168, 'steps': 4953, 'loss/train': 1.9926605224609375}} 11/06/2021 21:57:11 - INFO - __main__ - Step 4958: {'lr': 0.0004995076812408636, 'samples': 951936, 'steps': 4957, 'loss/train': 1.9032562971115112}} 11/06/2021 21:57:13 - INFO - __main__ - Step 4963: {'lr': 0.0004995060154557513, 'samples': 952896, 'steps': 4962, 'loss/train': 2.1550307273864746}} 11/06/2021 21:57:13 - INFO - __main__ - Step 4963: {'lr': 0.0004995060154557513, 'samples': 952896, 'steps': 4962, 'loss/train': 2.1550307273864746}} 11/06/2021 21:57:17 - INFO - __main__ - Step 4971: {'lr': 0.0004995033443535541, 'samples': 954432, 'steps': 4970, 'loss/train': 1.7052550315856934}} 11/06/2021 21:57:17 - INFO - __main__ - Step 4971: {'lr': 0.0004995033443535541, 'samples': 954432, 'steps': 4970, 'loss/train': 1.7052550315856934}} 11/06/2021 21:57:21 - INFO - __main__ - Step 4979: {'lr': 0.0004995006660563262, 'samples': 955968, 'steps': 4978, 'loss/train': 2.2604053020477295}} 11/06/2021 21:57:23 - INFO - __main__ - Step 4984: {'lr': 0.0004994989884668665, 'samples': 956928, 'steps': 4983, 'loss/train': 1.7262250185012817}} 11/06/2021 21:57:23 - INFO - __main__ - Step 4984: {'lr': 0.0004994989884668665, 'samples': 956928, 'steps': 4983, 'loss/train': 1.7262250185012817}} 11/06/2021 21:57:27 - INFO - __main__ - Step 4992: {'lr': 0.0004994962984778784, 'samples': 958464, 'steps': 4991, 'loss/train': 0.9767778515815735}} 11/06/2021 21:57:29 - INFO - __main__ - Step 4996: {'lr': 0.000499494950785319, 'samples': 959232, 'steps': 4995, 'loss/train': 0.9503514766693115}}} 11/06/2021 21:57:31 - INFO - __main__ - Step 5000: {'lr': 0.0004994936012940626, 'samples': 960000, 'steps': 4999, 'loss/train': 2.0416653156280518}} 11/06/2021 21:57:33 - INFO - __main__ - Step 5004: {'lr': 0.0004994922500041186, 'samples': 960768, 'steps': 5003, 'loss/train': 1.5910391807556152}} 11/06/2021 21:57:33 - INFO - __main__ - Step 5004: {'lr': 0.0004994922500041186, 'samples': 960768, 'steps': 5003, 'loss/train': 1.5910391807556152}} 11/06/2021 21:57:37 - INFO - __main__ - Step 5012: {'lr': 0.0004994895420282072, 'samples': 962304, 'steps': 5011, 'loss/train': 2.228909969329834}}} 11/06/2021 21:57:39 - INFO - __main__ - Step 5016: {'lr': 0.0004994881853422594, 'samples': 963072, 'steps': 5015, 'loss/train': 1.8377931118011475}} 11/06/2021 21:57:41 - INFO - __main__ - Step 5020: {'lr': 0.000499486826857663, 'samples': 963840, 'steps': 5019, 'loss/train': 1.8107075691223145}}} 11/06/2021 21:57:43 - INFO - __main__ - Step 5025: {'lr': 0.0004994851262225832, 'samples': 964800, 'steps': 5024, 'loss/train': 1.3919230699539185}} 11/06/2021 21:57:45 - INFO - __main__ - Step 5029: {'lr': 0.0004994837636910638, 'samples': 965568, 'steps': 5028, 'loss/train': 1.7136719226837158}} 11/06/2021 21:57:47 - INFO - __main__ - Step 5033: {'lr': 0.0004994823993609279, 'samples': 966336, 'steps': 5032, 'loss/train': 0.8590723872184753}} 11/06/2021 21:57:47 - INFO - __main__ - Step 5033: {'lr': 0.0004994823993609279, 'samples': 966336, 'steps': 5032, 'loss/train': 0.8590723872184753}} 11/06/2021 21:57:51 - INFO - __main__ - Step 5040: {'lr': 0.0004994800074552985, 'samples': 967680, 'steps': 5039, 'loss/train': 1.7678231000900269}} 11/06/2021 21:57:53 - INFO - __main__ - Step 5046: {'lr': 0.0004994779528664095, 'samples': 968832, 'steps': 5045, 'loss/train': 2.0043511390686035}} 11/06/2021 21:57:55 - INFO - __main__ - Step 5050: {'lr': 0.000499476580892263, 'samples': 969600, 'steps': 5049, 'loss/train': 1.4724314212799072}}} 11/06/2021 21:57:58 - INFO - __main__ - Step 5054: {'lr': 0.000499475207119552, 'samples': 970368, 'steps': 5053, 'loss/train': 2.118231773376465}}}} 11/06/2021 21:58:00 - INFO - __main__ - Step 5058: {'lr': 0.0004994738315482859, 'samples': 971136, 'steps': 5057, 'loss/train': 1.5080969333648682}} 11/06/2021 21:58:01 - INFO - __main__ - Step 5062: {'lr': 0.0004994724541784749, 'samples': 971904, 'steps': 5061, 'loss/train': 2.3541839122772217}} 11/06/2021 21:58:03 - INFO - __main__ - Step 5066: {'lr': 0.000499471075010129, 'samples': 972672, 'steps': 5065, 'loss/train': 6.243468761444092}7}} 11/06/2021 21:58:03 - INFO - __main__ - Step 5066: {'lr': 0.000499471075010129, 'samples': 972672, 'steps': 5065, 'loss/train': 6.243468761444092}7}} 11/06/2021 21:58:08 - INFO - __main__ - Step 5074: {'lr': 0.0004994683112778718, 'samples': 974208, 'steps': 5073, 'loss/train': 1.9211030006408691}} 11/06/2021 21:58:09 - INFO - __main__ - Step 5078: {'lr': 0.0004994669267139806, 'samples': 974976, 'steps': 5077, 'loss/train': 2.0437135696411133}} 11/06/2021 21:58:11 - INFO - __main__ - Step 5082: {'lr': 0.0004994655403515941, 'samples': 975744, 'steps': 5081, 'loss/train': 1.8559695482254028}} 11/06/2021 21:58:11 - INFO - __main__ - Step 5082: {'lr': 0.0004994655403515941, 'samples': 975744, 'steps': 5081, 'loss/train': 1.8559695482254028}} 11/06/2021 21:58:15 - INFO - __main__ - Step 5090: {'lr': 0.0004994627622313757, 'samples': 977280, 'steps': 5089, 'loss/train': 1.8420456647872925}} 11/06/2021 21:58:17 - INFO - __main__ - Step 5094: {'lr': 0.0004994613704735638, 'samples': 978048, 'steps': 5093, 'loss/train': 2.3359320163726807}} 11/06/2021 21:58:19 - INFO - __main__ - Step 5098: {'lr': 0.0004994599769172967, 'samples': 978816, 'steps': 5097, 'loss/train': 2.2054691314697266}} 11/06/2021 21:58:21 - INFO - __main__ - Step 5103: {'lr': 0.0004994582324429008, 'samples': 979776, 'steps': 5102, 'loss/train': 1.894245982170105}}} 11/06/2021 21:58:21 - INFO - __main__ - Step 5103: {'lr': 0.0004994582324429008, 'samples': 979776, 'steps': 5102, 'loss/train': 1.894245982170105}}} 11/06/2021 21:58:21 - INFO - __main__ - Step 5103: {'lr': 0.0004994582324429008, 'samples': 979776, 'steps': 5102, 'loss/train': 1.894245982170105}}} 11/06/2021 21:58:27 - INFO - __main__ - Step 5114: {'lr': 0.0004994543847078787, 'samples': 981888, 'steps': 5113, 'loss/train': 1.3129998445510864}} 11/06/2021 21:58:29 - INFO - __main__ - Step 5119: {'lr': 0.0004994526312413897, 'samples': 982848, 'steps': 5118, 'loss/train': 1.6690380573272705}} 11/06/2021 21:58:32 - INFO - __main__ - Step 5124: {'lr': 0.000499450874964913, 'samples': 983808, 'steps': 5123, 'loss/train': 2.1291959285736084}}} 11/06/2021 21:58:34 - INFO - __main__ - Step 5128: {'lr': 0.0004994494679205539, 'samples': 984576, 'steps': 5127, 'loss/train': 2.0760858058929443}} 11/06/2021 21:58:34 - INFO - __main__ - Step 5128: {'lr': 0.0004994494679205539, 'samples': 984576, 'steps': 5127, 'loss/train': 2.0760858058929443}} 11/06/2021 21:58:37 - INFO - __main__ - Step 5135: {'lr': 0.0004994470012656052, 'samples': 985920, 'steps': 5134, 'loss/train': 2.2697160243988037}} 11/06/2021 21:58:39 - INFO - __main__ - Step 5140: {'lr': 0.0004994452359973012, 'samples': 986880, 'steps': 5139, 'loss/train': 1.6053614616394043}} 11/06/2021 21:58:39 - INFO - __main__ - Step 5140: {'lr': 0.0004994452359973012, 'samples': 986880, 'steps': 5139, 'loss/train': 1.6053614616394043}} 11/06/2021 21:58:39 - INFO - __main__ - Step 5140: {'lr': 0.0004994452359973012, 'samples': 986880, 'steps': 5139, 'loss/train': 1.6053614616394043}} 11/06/2021 21:58:45 - INFO - __main__ - Step 5151: {'lr': 0.0004994413425161969, 'samples': 988992, 'steps': 5150, 'loss/train': 1.2515465021133423}} 11/06/2021 21:58:47 - INFO - __main__ - Step 5155: {'lr': 0.0004994399233330426, 'samples': 989760, 'steps': 5154, 'loss/train': 2.2096810340881348}} 11/06/2021 21:58:47 - INFO - __main__ - Step 5155: {'lr': 0.0004994399233330426, 'samples': 989760, 'steps': 5154, 'loss/train': 2.2096810340881348}} 11/06/2021 21:58:51 - INFO - __main__ - Step 5163: {'lr': 0.0004994370795718425, 'samples': 991296, 'steps': 5162, 'loss/train': 2.0610859394073486}} 11/06/2021 21:58:53 - INFO - __main__ - Step 5168: {'lr': 0.000499435298568331, 'samples': 992256, 'steps': 5167, 'loss/train': 2.4085593223571777}}} 11/06/2021 21:58:53 - INFO - __main__ - Step 5168: {'lr': 0.000499435298568331, 'samples': 992256, 'steps': 5167, 'loss/train': 2.4085593223571777}}} 11/06/2021 21:58:58 - INFO - __main__ - Step 5176: {'lr': 0.000499432443118353, 'samples': 993792, 'steps': 5175, 'loss/train': 2.3936564922332764}}} 11/06/2021 21:59:00 - INFO - __main__ - Step 5180: {'lr': 0.0004994310126959887, 'samples': 994560, 'steps': 5179, 'loss/train': 1.998255968093872}}} 11/06/2021 21:59:01 - INFO - __main__ - Step 5184: {'lr': 0.0004994295804753885, 'samples': 995328, 'steps': 5183, 'loss/train': 1.9894294738769531}} 11/06/2021 21:59:03 - INFO - __main__ - Step 5188: {'lr': 0.0004994281464565623, 'samples': 996096, 'steps': 5187, 'loss/train': 1.9112964868545532}} 11/06/2021 21:59:03 - INFO - __main__ - Step 5188: {'lr': 0.0004994281464565623, 'samples': 996096, 'steps': 5187, 'loss/train': 1.9112964868545532}} 11/06/2021 21:59:08 - INFO - __main__ - Step 5196: {'lr': 0.0004994252730242734, 'samples': 997632, 'steps': 5195, 'loss/train': 2.2125959396362305}} 11/06/2021 21:59:10 - INFO - __main__ - Step 5200: {'lr': 0.0004994238336108315, 'samples': 998400, 'steps': 5199, 'loss/train': 1.941856861114502}}} 11/06/2021 21:59:11 - INFO - __main__ - Step 5204: {'lr': 0.0004994223923992052, 'samples': 999168, 'steps': 5203, 'loss/train': 1.8816064596176147}} 11/06/2021 21:59:14 - INFO - __main__ - Step 5209: {'lr': 0.000499420588355991, 'samples': 1000128, 'steps': 5208, 'loss/train': 2.1907718181610107}} 11/06/2021 21:59:16 - INFO - __main__ - Step 5213: {'lr': 0.0004994191430984876, 'samples': 1000896, 'steps': 5212, 'loss/train': 1.5323981046676636} 11/06/2021 21:59:18 - INFO - __main__ - Step 5217: {'lr': 0.0004994176960428333, 'samples': 1001664, 'steps': 5216, 'loss/train': 2.4069254398345947} 11/06/2021 21:59:18 - INFO - __main__ - Step 5217: {'lr': 0.0004994176960428333, 'samples': 1001664, 'steps': 5216, 'loss/train': 2.4069254398345947} 11/06/2021 21:59:21 - INFO - __main__ - Step 5224: {'lr': 0.0004994151593686699, 'samples': 1003008, 'steps': 5223, 'loss/train': 0.5793285965919495} 11/06/2021 21:59:24 - INFO - __main__ - Step 5229: {'lr': 0.0004994133440870712, 'samples': 1003968, 'steps': 5228, 'loss/train': 2.3762245178222656} 11/06/2021 21:59:24 - INFO - __main__ - Step 5229: {'lr': 0.0004994133440870712, 'samples': 1003968, 'steps': 5228, 'loss/train': 2.3762245178222656} 11/06/2021 21:59:28 - INFO - __main__ - Step 5236: {'lr': 0.0004994107979728019, 'samples': 1005312, 'steps': 5235, 'loss/train': 1.0656429529190063} 11/06/2021 21:59:29 - INFO - __main__ - Step 5240: {'lr': 0.0004994093405779842, 'samples': 1006080, 'steps': 5239, 'loss/train': 1.9837573766708374} 11/06/2021 21:59:32 - INFO - __main__ - Step 5245: {'lr': 0.0004994075163059134, 'samples': 1007040, 'steps': 5244, 'loss/train': 1.456217646598816}} 11/06/2021 21:59:32 - INFO - __main__ - Step 5245: {'lr': 0.0004994075163059134, 'samples': 1007040, 'steps': 5244, 'loss/train': 1.456217646598816}} 11/06/2021 21:59:36 - INFO - __main__ - Step 5253: {'lr': 0.0004994045916268913, 'samples': 1008576, 'steps': 5252, 'loss/train': 1.8525824546813965} 11/06/2021 21:59:38 - INFO - __main__ - Step 5257: {'lr': 0.0004994031265903063, 'samples': 1009344, 'steps': 5256, 'loss/train': 2.477855920791626}} 11/06/2021 21:59:39 - INFO - __main__ - Step 5261: {'lr': 0.0004994016597556862, 'samples': 1010112, 'steps': 5260, 'loss/train': 2.4520223140716553} 11/06/2021 21:59:42 - INFO - __main__ - Step 5266: {'lr': 0.00049939982368394, 'samples': 1011072, 'steps': 5265, 'loss/train': 1.825718879699707}53} 11/06/2021 21:59:42 - INFO - __main__ - Step 5266: {'lr': 0.00049939982368394, 'samples': 1011072, 'steps': 5265, 'loss/train': 1.825718879699707}53} 11/06/2021 21:59:42 - INFO - __main__ - Step 5266: {'lr': 0.00049939982368394, 'samples': 1011072, 'steps': 5265, 'loss/train': 1.825718879699707}53} 11/06/2021 21:59:47 - INFO - __main__ - Step 5276: {'lr': 0.0004993961431122901, 'samples': 1012992, 'steps': 5275, 'loss/train': 2.171734094619751}} 11/06/2021 21:59:50 - INFO - __main__ - Step 5281: {'lr': 0.0004993942986124278, 'samples': 1013952, 'steps': 5280, 'loss/train': 1.8966882228851318} 11/06/2021 21:59:52 - INFO - __main__ - Step 5286: {'lr': 0.0004993924513032349, 'samples': 1014912, 'steps': 5285, 'loss/train': 2.124772071838379}} 11/06/2021 21:59:54 - INFO - __main__ - Step 5290: {'lr': 0.0004993909714331766, 'samples': 1015680, 'steps': 5289, 'loss/train': 2.3629143238067627} 11/06/2021 21:59:54 - INFO - __main__ - Step 5290: {'lr': 0.0004993909714331766, 'samples': 1015680, 'steps': 5289, 'loss/train': 2.3629143238067627} 11/06/2021 21:59:57 - INFO - __main__ - Step 5297: {'lr': 0.0004993883773342695, 'samples': 1017024, 'steps': 5296, 'loss/train': 1.9744642972946167} 11/06/2021 22:00:00 - INFO - __main__ - Step 5303: {'lr': 0.0004993861494384669, 'samples': 1018176, 'steps': 5302, 'loss/train': 1.8540432453155518} 11/06/2021 22:00:02 - INFO - __main__ - Step 5307: {'lr': 0.0004993846619272052, 'samples': 1018944, 'steps': 5306, 'loss/train': 2.057713508605957}} 11/06/2021 22:00:04 - INFO - __main__ - Step 5311: {'lr': 0.0004993831726180414, 'samples': 1019712, 'steps': 5310, 'loss/train': 0.6815013289451599} 11/06/2021 22:00:04 - INFO - __main__ - Step 5311: {'lr': 0.0004993831726180414, 'samples': 1019712, 'steps': 5310, 'loss/train': 0.6815013289451599} 11/06/2021 22:00:08 - INFO - __main__ - Step 5318: {'lr': 0.0004993805620008353, 'samples': 1021056, 'steps': 5317, 'loss/train': 2.118875026702881}} 11/06/2021 22:00:10 - INFO - __main__ - Step 5323: {'lr': 0.0004993786939032451, 'samples': 1022016, 'steps': 5322, 'loss/train': 1.7816057205200195} 11/06/2021 22:00:10 - INFO - __main__ - Step 5323: {'lr': 0.0004993786939032451, 'samples': 1022016, 'steps': 5322, 'loss/train': 1.7816057205200195} 11/06/2021 22:00:14 - INFO - __main__ - Step 5331: {'lr': 0.0004993756991040675, 'samples': 1023552, 'steps': 5330, 'loss/train': 1.8130137920379639} 11/06/2021 22:00:16 - INFO - __main__ - Step 5335: {'lr': 0.0004993741990077172, 'samples': 1024320, 'steps': 5334, 'loss/train': 2.3342111110687256} 11/06/2021 22:00:18 - INFO - __main__ - Step 5340: {'lr': 0.0004993723213590868, 'samples': 1025280, 'steps': 5339, 'loss/train': 2.2036027908325195} 11/06/2021 22:00:20 - INFO - __main__ - Step 5344: {'lr': 0.0004993708172176417, 'samples': 1026048, 'steps': 5343, 'loss/train': 1.9710315465927124} 11/06/2021 22:00:22 - INFO - __main__ - Step 5349: {'lr': 0.0004993689345126771, 'samples': 1027008, 'steps': 5348, 'loss/train': 2.151108503341675}} 11/06/2021 22:00:24 - INFO - __main__ - Step 5353: {'lr': 0.0004993674263261921, 'samples': 1027776, 'steps': 5352, 'loss/train': 1.639163851737976}} 11/06/2021 22:00:26 - INFO - __main__ - Step 5357: {'lr': 0.0004993659163419294, 'samples': 1028544, 'steps': 5356, 'loss/train': 2.263580322265625}} 11/06/2021 22:00:28 - INFO - __main__ - Step 5361: {'lr': 0.0004993644045598997, 'samples': 1029312, 'steps': 5360, 'loss/train': 2.4149515628814697} 11/06/2021 22:00:30 - INFO - __main__ - Step 5365: {'lr': 0.0004993628909801138, 'samples': 1030080, 'steps': 5364, 'loss/train': 1.8695933818817139} 11/06/2021 22:00:32 - INFO - __main__ - Step 5369: {'lr': 0.000499361375602583, 'samples': 1030848, 'steps': 5368, 'loss/train': 2.0226683616638184}} 11/06/2021 22:00:34 - INFO - __main__ - Step 5374: {'lr': 0.0004993594788526069, 'samples': 1031808, 'steps': 5373, 'loss/train': 2.1130635738372803} 11/06/2021 22:00:37 - INFO - __main__ - Step 5378: {'lr': 0.0004993579594301895, 'samples': 1032576, 'steps': 5377, 'loss/train': 1.8319172859191895} 11/06/2021 22:00:39 - INFO - __main__ - Step 5382: {'lr': 0.0004993564382100624, 'samples': 1033344, 'steps': 5381, 'loss/train': 1.3644241094589233} 11/06/2021 22:00:40 - INFO - __main__ - Step 5386: {'lr': 0.0004993549151922367, 'samples': 1034112, 'steps': 5385, 'loss/train': 2.5312018394470215} 11/06/2021 22:00:42 - INFO - __main__ - Step 5390: {'lr': 0.0004993533903767235, 'samples': 1034880, 'steps': 5389, 'loss/train': 1.5707823038101196} 11/06/2021 22:00:42 - INFO - __main__ - Step 5390: {'lr': 0.0004993533903767235, 'samples': 1034880, 'steps': 5389, 'loss/train': 1.5707823038101196} 11/06/2021 22:00:46 - INFO - __main__ - Step 5398: {'lr': 0.0004993503353526779, 'samples': 1036416, 'steps': 5397, 'loss/train': 1.566808819770813}} 11/06/2021 22:00:48 - INFO - __main__ - Step 5402: {'lr': 0.0004993488051441677, 'samples': 1037184, 'steps': 5401, 'loss/train': 1.2702760696411133} 11/06/2021 22:00:50 - INFO - __main__ - Step 5406: {'lr': 0.000499347273138014, 'samples': 1037952, 'steps': 5405, 'loss/train': 1.485642433166504}3} 11/06/2021 22:00:52 - INFO - __main__ - Step 5411: {'lr': 0.0004993453556024023, 'samples': 1038912, 'steps': 5410, 'loss/train': 1.6171423196792603} 11/06/2021 22:00:55 - INFO - __main__ - Step 5416: {'lr': 0.0004993434352580115, 'samples': 1039872, 'steps': 5415, 'loss/train': 1.5889499187469482} 11/06/2021 22:00:55 - INFO - __main__ - Step 5416: {'lr': 0.0004993434352580115, 'samples': 1039872, 'steps': 5415, 'loss/train': 1.5889499187469482} 11/06/2021 22:00:58 - INFO - __main__ - Step 5422: {'lr': 0.0004993411271371842, 'samples': 1041024, 'steps': 5421, 'loss/train': 1.9106769561767578} 11/06/2021 22:00:58 - INFO - __main__ - Step 5422: {'lr': 0.0004993411271371842, 'samples': 1041024, 'steps': 5421, 'loss/train': 1.9106769561767578} 11/06/2021 22:00:58 - INFO - __main__ - Step 5422: {'lr': 0.0004993411271371842, 'samples': 1041024, 'steps': 5421, 'loss/train': 1.9106769561767578} 11/06/2021 22:01:04 - INFO - __main__ - Step 5433: {'lr': 0.0004993368850777052, 'samples': 1043136, 'steps': 5432, 'loss/train': 2.2101669311523438} 11/06/2021 22:01:06 - INFO - __main__ - Step 5438: {'lr': 0.0004993349523749431, 'samples': 1044096, 'steps': 5437, 'loss/train': 1.8289518356323242} 11/06/2021 22:01:08 - INFO - __main__ - Step 5442: {'lr': 0.0004993334041904957, 'samples': 1044864, 'steps': 5441, 'loss/train': 0.3798553943634033} 11/06/2021 22:01:10 - INFO - __main__ - Step 5446: {'lr': 0.0004993318542085157, 'samples': 1045632, 'steps': 5445, 'loss/train': 2.161842107772827}} 11/06/2021 22:01:12 - INFO - __main__ - Step 5450: {'lr': 0.0004993303024290143, 'samples': 1046400, 'steps': 5449, 'loss/train': 2.0517914295196533} 11/06/2021 22:01:14 - INFO - __main__ - Step 5455: {'lr': 0.0004993283601768902, 'samples': 1047360, 'steps': 5454, 'loss/train': 2.6343281269073486} 11/06/2021 22:01:16 - INFO - __main__ - Step 5459: {'lr': 0.0004993268043530067, 'samples': 1048128, 'steps': 5458, 'loss/train': 1.2634351253509521} 11/06/2021 22:01:16 - INFO - __main__ - Step 5459: {'lr': 0.0004993268043530067, 'samples': 1048128, 'steps': 5458, 'loss/train': 1.2634351253509521} 11/06/2021 22:01:20 - INFO - __main__ - Step 5467: {'lr': 0.000499323687312796, 'samples': 1049664, 'steps': 5466, 'loss/train': 1.905300498008728}1} 11/06/2021 22:01:22 - INFO - __main__ - Step 5471: {'lr': 0.0004993221260964912, 'samples': 1050432, 'steps': 5470, 'loss/train': 2.155994415283203}} 11/06/2021 22:01:24 - INFO - __main__ - Step 5476: {'lr': 0.0004993201720484458, 'samples': 1051392, 'steps': 5475, 'loss/train': 1.8536326885223389} 11/06/2021 22:01:27 - INFO - __main__ - Step 5481: {'lr': 0.0004993182151919049, 'samples': 1052352, 'steps': 5480, 'loss/train': 2.0209407806396484} 11/06/2021 22:01:29 - INFO - __main__ - Step 5485: {'lr': 0.0004993166476845701, 'samples': 1053120, 'steps': 5484, 'loss/train': 2.2243385314941406} 11/06/2021 22:01:29 - INFO - __main__ - Step 5485: {'lr': 0.0004993166476845701, 'samples': 1053120, 'steps': 5484, 'loss/train': 2.2243385314941406} 11/06/2021 22:01:32 - INFO - __main__ - Step 5492: {'lr': 0.000499313900221719, 'samples': 1054464, 'steps': 5491, 'loss/train': 1.8107589483261108}} 11/06/2021 22:01:34 - INFO - __main__ - Step 5497: {'lr': 0.0004993119343781406, 'samples': 1055424, 'steps': 5496, 'loss/train': 1.697355031967163}} 11/06/2021 22:01:36 - INFO - __main__ - Step 5501: {'lr': 0.0004993103596812267, 'samples': 1056192, 'steps': 5500, 'loss/train': 1.8864690065383911} 11/06/2021 22:01:39 - INFO - __main__ - Step 5506: {'lr': 0.0004993083887825393, 'samples': 1057152, 'steps': 5505, 'loss/train': 2.18703556060791}1} 11/06/2021 22:01:41 - INFO - __main__ - Step 5510: {'lr': 0.0004993068100415671, 'samples': 1057920, 'steps': 5509, 'loss/train': 1.877155065536499}} 11/06/2021 22:01:41 - INFO - __main__ - Step 5510: {'lr': 0.0004993068100415671, 'samples': 1057920, 'steps': 5509, 'loss/train': 1.877155065536499}} 11/06/2021 22:01:44 - INFO - __main__ - Step 5517: {'lr': 0.0004993040429200211, 'samples': 1059264, 'steps': 5516, 'loss/train': 2.012702465057373}} 11/06/2021 22:01:46 - INFO - __main__ - Step 5522: {'lr': 0.0004993020630346509, 'samples': 1060224, 'steps': 5521, 'loss/train': 2.444692611694336}} 11/06/2021 22:01:49 - INFO - __main__ - Step 5527: {'lr': 0.0004993000803409891, 'samples': 1061184, 'steps': 5526, 'loss/train': 2.20894455909729}}} 11/06/2021 22:01:51 - INFO - __main__ - Step 5531: {'lr': 0.0004992984921641048, 'samples': 1061952, 'steps': 5530, 'loss/train': 2.0082509517669678} 11/06/2021 22:01:51 - INFO - __main__ - Step 5531: {'lr': 0.0004992984921641048, 'samples': 1061952, 'steps': 5530, 'loss/train': 2.0082509517669678} 11/06/2021 22:01:54 - INFO - __main__ - Step 5538: {'lr': 0.0004992957085298571, 'samples': 1063296, 'steps': 5537, 'loss/train': 1.3260631561279297} 11/06/2021 22:01:56 - INFO - __main__ - Step 5543: {'lr': 0.0004992937168498126, 'samples': 1064256, 'steps': 5542, 'loss/train': 1.7761310338974}97} 11/06/2021 22:01:59 - INFO - __main__ - Step 5548: {'lr': 0.0004992917223615706, 'samples': 1065216, 'steps': 5547, 'loss/train': 1.4636272192001343} 11/06/2021 22:01:59 - INFO - __main__ - Step 5548: {'lr': 0.0004992917223615706, 'samples': 1065216, 'steps': 5547, 'loss/train': 1.4636272192001343} 11/06/2021 22:02:02 - INFO - __main__ - Step 5554: {'lr': 0.000499289325268891, 'samples': 1066368, 'steps': 5553, 'loss/train': 1.8733552694320679}} 11/06/2021 22:02:04 - INFO - __main__ - Step 5558: {'lr': 0.0004992877249605838, 'samples': 1067136, 'steps': 5557, 'loss/train': 2.2034895420074463} 11/06/2021 22:02:06 - INFO - __main__ - Step 5563: {'lr': 0.0004992857220478841, 'samples': 1068096, 'steps': 5562, 'loss/train': 1.9436842203140259} 11/06/2021 22:02:08 - INFO - __main__ - Step 5567: {'lr': 0.0004992841176958858, 'samples': 1068864, 'steps': 5566, 'loss/train': 1.8892325162887573} 11/06/2021 22:02:08 - INFO - __main__ - Step 5567: {'lr': 0.0004992841176958858, 'samples': 1068864, 'steps': 5566, 'loss/train': 1.8892325162887573} 11/06/2021 22:02:12 - INFO - __main__ - Step 5574: {'lr': 0.000499281305755438, 'samples': 1070208, 'steps': 5573, 'loss/train': 1.4345474243164062}} 11/06/2021 22:02:14 - INFO - __main__ - Step 5578: {'lr': 0.0004992796964612302, 'samples': 1070976, 'steps': 5577, 'loss/train': 2.1370227336883545} 11/06/2021 22:02:16 - INFO - __main__ - Step 5583: {'lr': 0.0004992776823162362, 'samples': 1071936, 'steps': 5582, 'loss/train': 1.6253973245620728} 11/06/2021 22:02:19 - INFO - __main__ - Step 5588: {'lr': 0.0004992756653632252, 'samples': 1072896, 'steps': 5587, 'loss/train': 2.1524899005889893} 11/06/2021 22:02:19 - INFO - __main__ - Step 5588: {'lr': 0.0004992756653632252, 'samples': 1072896, 'steps': 5587, 'loss/train': 2.1524899005889893} 11/06/2021 22:02:22 - INFO - __main__ - Step 5595: {'lr': 0.0004992728369115848, 'samples': 1074240, 'steps': 5594, 'loss/train': 0.8529731631278992} 11/06/2021 22:02:24 - INFO - __main__ - Step 5600: {'lr': 0.0004992708132194259, 'samples': 1075200, 'steps': 5599, 'loss/train': 1.7190282344818115} 11/06/2021 22:02:24 - INFO - __main__ - Step 5600: {'lr': 0.0004992708132194259, 'samples': 1075200, 'steps': 5599, 'loss/train': 1.7190282344818115} 11/06/2021 22:02:28 - INFO - __main__ - Step 5608: {'lr': 0.0004992675694714671, 'samples': 1076736, 'steps': 5607, 'loss/train': 2.1556620597839355} 11/06/2021 22:02:28 - INFO - __main__ - Step 5608: {'lr': 0.0004992675694714671, 'samples': 1076736, 'steps': 5607, 'loss/train': 2.1556620597839355} 11/06/2021 22:02:32 - INFO - __main__ - Step 5616: {'lr': 0.0004992643185352765, 'samples': 1078272, 'steps': 5615, 'loss/train': 2.0697097778320312} 11/06/2021 22:02:32 - INFO - __main__ - Step 5616: {'lr': 0.0004992643185352765, 'samples': 1078272, 'steps': 5615, 'loss/train': 2.0697097778320312} 11/06/2021 22:02:37 - INFO - __main__ - Step 5624: {'lr': 0.0004992610604109481, 'samples': 1079808, 'steps': 5623, 'loss/train': 2.165574312210083}} 11/06/2021 22:02:38 - INFO - __main__ - Step 5628: {'lr': 0.0004992594286532615, 'samples': 1080576, 'steps': 5627, 'loss/train': 1.958828330039978}} 11/06/2021 22:02:40 - INFO - __main__ - Step 5632: {'lr': 0.0004992577950985757, 'samples': 1081344, 'steps': 5631, 'loss/train': 2.0259532928466797} 11/06/2021 22:02:42 - INFO - __main__ - Step 5637: {'lr': 0.0004992557506282061, 'samples': 1082304, 'steps': 5636, 'loss/train': 1.9845712184906006} 11/06/2021 22:02:45 - INFO - __main__ - Step 5641: {'lr': 0.000499254113030315, 'samples': 1083072, 'steps': 5640, 'loss/train': 1.8128447532653809}} 11/06/2021 22:02:47 - INFO - __main__ - Step 5645: {'lr': 0.0004992524736354631, 'samples': 1083840, 'steps': 5644, 'loss/train': 2.096315622329712}} 11/06/2021 22:02:49 - INFO - __main__ - Step 5649: {'lr': 0.000499250832443662, 'samples': 1084608, 'steps': 5648, 'loss/train': 2.2662413120269775}} 11/06/2021 22:02:50 - INFO - __main__ - Step 5653: {'lr': 0.0004992491894549236, 'samples': 1085376, 'steps': 5652, 'loss/train': 1.7326874732971191} 11/06/2021 22:02:52 - INFO - __main__ - Step 5657: {'lr': 0.00049924754466926, 'samples': 1086144, 'steps': 5656, 'loss/train': 2.187472105026245}91} 11/06/2021 22:02:55 - INFO - __main__ - Step 5662: {'lr': 0.000499245486160272, 'samples': 1087104, 'steps': 5661, 'loss/train': 1.9282915592193604}} 11/06/2021 22:02:57 - INFO - __main__ - Step 5666: {'lr': 0.0004992438373315694, 'samples': 1087872, 'steps': 5665, 'loss/train': 1.5366052389144897} 11/06/2021 22:02:59 - INFO - __main__ - Step 5670: {'lr': 0.0004992421867059801, 'samples': 1088640, 'steps': 5669, 'loss/train': 1.8856055736541748} 11/06/2021 22:03:00 - INFO - __main__ - Step 5674: {'lr': 0.0004992405342835158, 'samples': 1089408, 'steps': 5673, 'loss/train': 1.913179636001587}} 11/06/2021 22:03:02 - INFO - __main__ - Step 5678: {'lr': 0.0004992388800641885, 'samples': 1090176, 'steps': 5677, 'loss/train': 2.1851115226745605} 11/06/2021 22:03:05 - INFO - __main__ - Step 5683: {'lr': 0.0004992368097632089, 'samples': 1091136, 'steps': 5682, 'loss/train': 2.1074047088623047} 11/06/2021 22:03:07 - INFO - __main__ - Step 5687: {'lr': 0.0004992351515009833, 'samples': 1091904, 'steps': 5686, 'loss/train': 1.8593014478683472} 11/06/2021 22:03:07 - INFO - __main__ - Step 5687: {'lr': 0.0004992351515009833, 'samples': 1091904, 'steps': 5686, 'loss/train': 1.8593014478683472} 11/06/2021 22:03:10 - INFO - __main__ - Step 5694: {'lr': 0.0004992322452184876, 'samples': 1093248, 'steps': 5693, 'loss/train': 2.2784321308135986} 11/06/2021 22:03:12 - INFO - __main__ - Step 5699: {'lr': 0.0004992301659334095, 'samples': 1094208, 'steps': 5698, 'loss/train': 1.779240369796753}} 11/06/2021 22:03:15 - INFO - __main__ - Step 5704: {'lr': 0.0004992280838408496, 'samples': 1095168, 'steps': 5703, 'loss/train': 2.011932611465454}} 11/06/2021 22:03:17 - INFO - __main__ - Step 5708: {'lr': 0.0004992264161454306, 'samples': 1095936, 'steps': 5707, 'loss/train': 2.132072925567627}} 11/06/2021 22:03:17 - INFO - __main__ - Step 5708: {'lr': 0.0004992264161454306, 'samples': 1095936, 'steps': 5707, 'loss/train': 2.132072925567627}} 11/06/2021 22:03:20 - INFO - __main__ - Step 5715: {'lr': 0.000499223493354998, 'samples': 1097280, 'steps': 5714, 'loss/train': 1.668257713317871}}} 11/06/2021 22:03:23 - INFO - __main__ - Step 5720: {'lr': 0.0004992214022786546, 'samples': 1098240, 'steps': 5719, 'loss/train': 2.1646392345428467} 11/06/2021 22:03:25 - INFO - __main__ - Step 5724: {'lr': 0.000499219727396263, 'samples': 1099008, 'steps': 5723, 'loss/train': 1.9365618228912354}} 11/06/2021 22:03:25 - INFO - __main__ - Step 5724: {'lr': 0.000499219727396263, 'samples': 1099008, 'steps': 5723, 'loss/train': 1.9365618228912354}} 11/06/2021 22:03:28 - INFO - __main__ - Step 5731: {'lr': 0.0004992167920287443, 'samples': 1100352, 'steps': 5730, 'loss/train': 1.7039726972579956} 11/06/2021 22:03:31 - INFO - __main__ - Step 5736: {'lr': 0.0004992146919688584, 'samples': 1101312, 'steps': 5735, 'loss/train': 2.2007718086242676} 11/06/2021 22:03:31 - INFO - __main__ - Step 5736: {'lr': 0.0004992146919688584, 'samples': 1101312, 'steps': 5735, 'loss/train': 2.2007718086242676} 11/06/2021 22:03:35 - INFO - __main__ - Step 5744: {'lr': 0.0004992113260338517, 'samples': 1102848, 'steps': 5743, 'loss/train': 1.731091022491455}} 11/06/2021 22:03:37 - INFO - __main__ - Step 5748: {'lr': 0.0004992096403713635, 'samples': 1103616, 'steps': 5747, 'loss/train': 2.426661491394043}} 11/06/2021 22:03:39 - INFO - __main__ - Step 5752: {'lr': 0.0004992079529122351, 'samples': 1104384, 'steps': 5751, 'loss/train': 2.0824427604675293} 11/06/2021 22:03:41 - INFO - __main__ - Step 5757: {'lr': 0.0004992058410618177, 'samples': 1105344, 'steps': 5756, 'loss/train': 1.8771767616271973} 11/06/2021 22:03:43 - INFO - __main__ - Step 5761: {'lr': 0.0004992041495602931, 'samples': 1106112, 'steps': 5760, 'loss/train': 1.8871279954910278} 11/06/2021 22:03:45 - INFO - __main__ - Step 5765: {'lr': 0.0004992024562621678, 'samples': 1106880, 'steps': 5764, 'loss/train': 2.004040479660034}} 11/06/2021 22:03:47 - INFO - __main__ - Step 5769: {'lr': 0.000499200761167454, 'samples': 1107648, 'steps': 5768, 'loss/train': 1.7773163318634033}} 11/06/2021 22:03:49 - INFO - __main__ - Step 5773: {'lr': 0.000499199064276164, 'samples': 1108416, 'steps': 5772, 'loss/train': 2.8149049282073975}} 11/06/2021 22:03:51 - INFO - __main__ - Step 5777: {'lr': 0.0004991973655883099, 'samples': 1109184, 'steps': 5776, 'loss/train': 1.8211041688919067} 11/06/2021 22:03:53 - INFO - __main__ - Step 5782: {'lr': 0.0004991952397020927, 'samples': 1110144, 'steps': 5781, 'loss/train': 1.9769641160964966} 11/06/2021 22:03:55 - INFO - __main__ - Step 5786: {'lr': 0.0004991935369720143, 'samples': 1110912, 'steps': 5785, 'loss/train': 2.5987279415130615} 11/06/2021 22:03:57 - INFO - __main__ - Step 5790: {'lr': 0.0004991918324454117, 'samples': 1111680, 'steps': 5789, 'loss/train': 1.8697025775909424} 11/06/2021 22:03:59 - INFO - __main__ - Step 5794: {'lr': 0.0004991901261222971, 'samples': 1112448, 'steps': 5793, 'loss/train': 2.1112143993377686} 11/06/2021 22:04:01 - INFO - __main__ - Step 5798: {'lr': 0.000499188418002683, 'samples': 1113216, 'steps': 5797, 'loss/train': 2.0395941734313965}} 11/06/2021 22:04:03 - INFO - __main__ - Step 5803: {'lr': 0.0004991862803268564, 'samples': 1114176, 'steps': 5802, 'loss/train': 2.107743501663208}} 11/06/2021 22:04:03 - INFO - __main__ - Step 5803: {'lr': 0.0004991862803268564, 'samples': 1114176, 'steps': 5802, 'loss/train': 2.107743501663208}} 11/06/2021 22:04:07 - INFO - __main__ - Step 5810: {'lr': 0.0004991832828649661, 'samples': 1115520, 'steps': 5809, 'loss/train': 1.6817560195922852} 11/06/2021 22:04:09 - INFO - __main__ - Step 5814: {'lr': 0.0004991815675594768, 'samples': 1116288, 'steps': 5813, 'loss/train': 1.5646218061447144} 11/06/2021 22:04:11 - INFO - __main__ - Step 5819: {'lr': 0.0004991794209013758, 'samples': 1117248, 'steps': 5818, 'loss/train': 1.8572213649749756} 11/06/2021 22:04:13 - INFO - __main__ - Step 5824: {'lr': 0.0004991772714363649, 'samples': 1118208, 'steps': 5823, 'loss/train': 1.790226936340332}} 11/06/2021 22:04:13 - INFO - __main__ - Step 5824: {'lr': 0.0004991772714363649, 'samples': 1118208, 'steps': 5823, 'loss/train': 1.790226936340332}} 11/06/2021 22:04:17 - INFO - __main__ - Step 5831: {'lr': 0.0004991742574697866, 'samples': 1119552, 'steps': 5830, 'loss/train': 1.9625569581985474} 11/06/2021 22:04:19 - INFO - __main__ - Step 5835: {'lr': 0.0004991725327331366, 'samples': 1120320, 'steps': 5834, 'loss/train': 1.6535284519195557} 11/06/2021 22:04:22 - INFO - __main__ - Step 5840: {'lr': 0.0004991703742861762, 'samples': 1121280, 'steps': 5839, 'loss/train': 2.014793872833252}} 11/06/2021 22:04:24 - INFO - __main__ - Step 5844: {'lr': 0.0004991686455077049, 'samples': 1122048, 'steps': 5843, 'loss/train': 2.0172688961029053} 11/06/2021 22:04:25 - INFO - __main__ - Step 5848: {'lr': 0.0004991669149328889, 'samples': 1122816, 'steps': 5847, 'loss/train': 1.8865562677383423} 11/06/2021 22:04:27 - INFO - __main__ - Step 5852: {'lr': 0.0004991651825617406, 'samples': 1123584, 'steps': 5851, 'loss/train': 2.156768560409546}} 11/06/2021 22:04:29 - INFO - __main__ - Step 5856: {'lr': 0.0004991634483942725, 'samples': 1124352, 'steps': 5855, 'loss/train': 1.9906128644943237} 11/06/2021 22:04:29 - INFO - __main__ - Step 5856: {'lr': 0.0004991634483942725, 'samples': 1124352, 'steps': 5855, 'loss/train': 1.9906128644943237} 11/06/2021 22:04:33 - INFO - __main__ - Step 5863: {'lr': 0.0004991604092788465, 'samples': 1125696, 'steps': 5862, 'loss/train': 3.404467821121216}} 11/06/2021 22:04:36 - INFO - __main__ - Step 5868: {'lr': 0.0004991582351140747, 'samples': 1126656, 'steps': 5867, 'loss/train': 1.8088363409042358} 11/06/2021 22:04:38 - INFO - __main__ - Step 5872: {'lr': 0.0004991564937614526, 'samples': 1127424, 'steps': 5871, 'loss/train': 2.1050267219543457} 11/06/2021 22:04:38 - INFO - __main__ - Step 5872: {'lr': 0.0004991564937614526, 'samples': 1127424, 'steps': 5871, 'loss/train': 2.1050267219543457} 11/06/2021 22:04:41 - INFO - __main__ - Step 5879: {'lr': 0.0004991534420721278, 'samples': 1128768, 'steps': 5878, 'loss/train': 1.775754451751709}} 11/06/2021 22:04:43 - INFO - __main__ - Step 5884: {'lr': 0.0004991512589260939, 'samples': 1129728, 'steps': 5883, 'loss/train': 2.0570759773254395} 11/06/2021 22:04:46 - INFO - __main__ - Step 5889: {'lr': 0.0004991490729734672, 'samples': 1130688, 'steps': 5888, 'loss/train': 1.4462894201278687} 11/06/2021 22:04:46 - INFO - __main__ - Step 5889: {'lr': 0.0004991490729734672, 'samples': 1130688, 'steps': 5888, 'loss/train': 1.4462894201278687} 11/06/2021 22:04:48 - INFO - __main__ - Step 5895: {'lr': 0.0004991464461256472, 'samples': 1131840, 'steps': 5894, 'loss/train': 1.8709157705307007} 11/06/2021 22:04:51 - INFO - __main__ - Step 5900: {'lr': 0.0004991442539986029, 'samples': 1132800, 'steps': 5899, 'loss/train': 2.119931697845459}} 11/06/2021 22:04:53 - INFO - __main__ - Step 5905: {'lr': 0.0004991420590650448, 'samples': 1133760, 'steps': 5904, 'loss/train': 1.9303275346755981} 11/06/2021 22:04:53 - INFO - __main__ - Step 5905: {'lr': 0.0004991420590650448, 'samples': 1133760, 'steps': 5904, 'loss/train': 1.9303275346755981} 11/06/2021 22:04:57 - INFO - __main__ - Step 5912: {'lr': 0.0004991389814431672, 'samples': 1135104, 'steps': 5911, 'loss/train': 2.336418390274048}} 11/06/2021 22:04:58 - INFO - __main__ - Step 5916: {'lr': 0.0004991372203324098, 'samples': 1135872, 'steps': 5915, 'loss/train': 1.72451913356781}}} 11/06/2021 22:05:00 - INFO - __main__ - Step 5920: {'lr': 0.0004991354574255344, 'samples': 1136640, 'steps': 5919, 'loss/train': 1.7401177883148193} 11/06/2021 22:05:03 - INFO - __main__ - Step 5925: {'lr': 0.0004991332512661682, 'samples': 1137600, 'steps': 5924, 'loss/train': 1.9979768991470337} 11/06/2021 22:05:03 - INFO - __main__ - Step 5925: {'lr': 0.0004991332512661682, 'samples': 1137600, 'steps': 5924, 'loss/train': 1.9979768991470337} 11/06/2021 22:05:07 - INFO - __main__ - Step 5933: {'lr': 0.0004991297155739015, 'samples': 1139136, 'steps': 5932, 'loss/train': 2.3104774951934814} 11/06/2021 22:05:09 - INFO - __main__ - Step 5937: {'lr': 0.0004991279450336656, 'samples': 1139904, 'steps': 5936, 'loss/train': 2.049226999282837}} 11/06/2021 22:05:11 - INFO - __main__ - Step 5942: {'lr': 0.0004991257293326752, 'samples': 1140864, 'steps': 5941, 'loss/train': 1.340391993522644}} 11/06/2021 22:05:13 - INFO - __main__ - Step 5946: {'lr': 0.0004991239547513419, 'samples': 1141632, 'steps': 5945, 'loss/train': 1.9071139097213745} 11/06/2021 22:05:13 - INFO - __main__ - Step 5946: {'lr': 0.0004991239547513419, 'samples': 1141632, 'steps': 5945, 'loss/train': 1.9071139097213745} 11/06/2021 22:05:17 - INFO - __main__ - Step 5952: {'lr': 0.0004991212895118035, 'samples': 1142784, 'steps': 5951, 'loss/train': 2.292984962463379}} 11/06/2021 22:05:19 - INFO - __main__ - Step 5958: {'lr': 0.0004991186202312576, 'samples': 1143936, 'steps': 5957, 'loss/train': 2.46972393989563}}} 11/06/2021 22:05:22 - INFO - __main__ - Step 5962: {'lr': 0.000499116838465911, 'samples': 1144704, 'steps': 5961, 'loss/train': 2.2520675659179688}} 11/06/2021 22:05:23 - INFO - __main__ - Step 5966: {'lr': 0.0004991150549045931, 'samples': 1145472, 'steps': 5965, 'loss/train': 2.390394926071167}} 11/06/2021 22:05:25 - INFO - __main__ - Step 5970: {'lr': 0.0004991132695473167, 'samples': 1146240, 'steps': 5969, 'loss/train': 1.7997961044311523} 11/06/2021 22:05:27 - INFO - __main__ - Step 5975: {'lr': 0.0004991110353251744, 'samples': 1147200, 'steps': 5974, 'loss/train': 1.894894003868103}} 11/06/2021 22:05:30 - INFO - __main__ - Step 5979: {'lr': 0.0004991092459270388, 'samples': 1147968, 'steps': 5978, 'loss/train': 1.8954336643218994} 11/06/2021 22:05:32 - INFO - __main__ - Step 5983: {'lr': 0.0004991074547329867, 'samples': 1148736, 'steps': 5982, 'loss/train': 2.0363080501556396} 11/06/2021 22:05:33 - INFO - __main__ - Step 5987: {'lr': 0.0004991056617430308, 'samples': 1149504, 'steps': 5986, 'loss/train': 1.5863991975784302} 11/06/2021 22:05:35 - INFO - __main__ - Step 5991: {'lr': 0.0004991038669571844, 'samples': 1150272, 'steps': 5990, 'loss/train': 2.2613940238952637} 11/06/2021 22:05:38 - INFO - __main__ - Step 5996: {'lr': 0.0004991016209494249, 'samples': 1151232, 'steps': 5995, 'loss/train': 1.6935555934906006} 11/06/2021 22:05:40 - INFO - __main__ - Step 6000: {'lr': 0.0004990998221228718, 'samples': 1152000, 'steps': 5999, 'loss/train': 2.1965417861938477} 11/06/2021 22:05:40 - INFO - __main__ - Step 6000: {'lr': 0.0004990998221228718, 'samples': 1152000, 'steps': 5999, 'loss/train': 2.1965417861938477} 11/06/2021 22:05:43 - INFO - __main__ - Step 6007: {'lr': 0.000499096669855151, 'samples': 1153344, 'steps': 6006, 'loss/train': 1.9373234510421753}} 11/06/2021 22:05:45 - INFO - __main__ - Step 6011: {'lr': 0.0004990948660900455, 'samples': 1154112, 'steps': 6010, 'loss/train': 1.6468206644058228} 11/06/2021 22:05:47 - INFO - __main__ - Step 6015: {'lr': 0.0004990930605291272, 'samples': 1154880, 'steps': 6014, 'loss/train': 1.9965391159057617} 11/06/2021 22:05:47 - INFO - __main__ - Step 6015: {'lr': 0.0004990930605291272, 'samples': 1154880, 'steps': 6014, 'loss/train': 1.9965391159057617} 11/06/2021 22:05:51 - INFO - __main__ - Step 6023: {'lr': 0.0004990894440199042, 'samples': 1156416, 'steps': 6022, 'loss/train': 2.2078778743743896} 11/06/2021 22:05:53 - INFO - __main__ - Step 6027: {'lr': 0.0004990876330716256, 'samples': 1157184, 'steps': 6026, 'loss/train': 1.7819007635116577} 11/06/2021 22:05:55 - INFO - __main__ - Step 6032: {'lr': 0.0004990853668609902, 'samples': 1158144, 'steps': 6031, 'loss/train': 1.8239428997039795} 11/06/2021 22:05:57 - INFO - __main__ - Step 6036: {'lr': 0.0004990835518722683, 'samples': 1158912, 'steps': 6035, 'loss/train': 1.736742615699768}} 11/06/2021 22:05:59 - INFO - __main__ - Step 6040: {'lr': 0.0004990817350878152, 'samples': 1159680, 'steps': 6039, 'loss/train': 1.2319962978363037} 11/06/2021 22:06:01 - INFO - __main__ - Step 6044: {'lr': 0.0004990799165076438, 'samples': 1160448, 'steps': 6043, 'loss/train': 0.32099586725234985} 11/06/2021 22:06:03 - INFO - __main__ - Step 6048: {'lr': 0.0004990780961317674, 'samples': 1161216, 'steps': 6047, 'loss/train': 1.82656729221344}85} 11/06/2021 22:06:05 - INFO - __main__ - Step 6052: {'lr': 0.000499076273960199, 'samples': 1161984, 'steps': 6051, 'loss/train': 2.0586600303649902}5} 11/06/2021 22:06:07 - INFO - __main__ - Step 6057: {'lr': 0.0004990739937205668, 'samples': 1162944, 'steps': 6056, 'loss/train': 1.2735779285430908}} 11/06/2021 22:06:09 - INFO - __main__ - Step 6061: {'lr': 0.0004990721675087397, 'samples': 1163712, 'steps': 6060, 'loss/train': 2.1596553325653076}} 11/06/2021 22:06:11 - INFO - __main__ - Step 6065: {'lr': 0.0004990703395012634, 'samples': 1164480, 'steps': 6064, 'loss/train': 2.094088554382324}}} 11/06/2021 22:06:13 - INFO - __main__ - Step 6069: {'lr': 0.000499068509698151, 'samples': 1165248, 'steps': 6068, 'loss/train': 1.7530931234359741}}} 11/06/2021 22:06:15 - INFO - __main__ - Step 6073: {'lr': 0.0004990666780994156, 'samples': 1166016, 'steps': 6072, 'loss/train': 2.116274356842041}}} 11/06/2021 22:06:17 - INFO - __main__ - Step 6078: {'lr': 0.0004990643860759222, 'samples': 1166976, 'steps': 6077, 'loss/train': 1.642020583152771}}} 11/06/2021 22:06:17 - INFO - __main__ - Step 6078: {'lr': 0.0004990643860759222, 'samples': 1166976, 'steps': 6077, 'loss/train': 1.642020583152771}}} 11/06/2021 22:06:17 - INFO - __main__ - Step 6078: {'lr': 0.0004990643860759222, 'samples': 1166976, 'steps': 6077, 'loss/train': 1.642020583152771}}} 11/06/2021 22:06:23 - INFO - __main__ - Step 6090: {'lr': 0.0004990588737726809, 'samples': 1169280, 'steps': 6089, 'loss/train': 2.0593221187591553}} 11/06/2021 22:06:25 - INFO - __main__ - Step 6094: {'lr': 0.0004990570327471427, 'samples': 1170048, 'steps': 6093, 'loss/train': 1.9696969985961914}} 11/06/2021 22:06:27 - INFO - __main__ - Step 6099: {'lr': 0.0004990547289402433, 'samples': 1171008, 'steps': 6098, 'loss/train': 1.9792393445968628}} 11/06/2021 22:06:30 - INFO - __main__ - Step 6104: {'lr': 0.0004990524223278384, 'samples': 1171968, 'steps': 6103, 'loss/train': 1.6591429710388184}} 11/06/2021 22:06:32 - INFO - __main__ - Step 6108: {'lr': 0.0004990505750179682, 'samples': 1172736, 'steps': 6107, 'loss/train': 1.9363466501235962}} 11/06/2021 22:06:34 - INFO - __main__ - Step 6112: {'lr': 0.0004990487259126043, 'samples': 1173504, 'steps': 6111, 'loss/train': 1.6976191997528076}} 11/06/2021 22:06:34 - INFO - __main__ - Step 6112: {'lr': 0.0004990487259126043, 'samples': 1173504, 'steps': 6111, 'loss/train': 1.6976191997528076}} 11/06/2021 22:06:37 - INFO - __main__ - Step 6119: {'lr': 0.0004990454856578513, 'samples': 1174848, 'steps': 6118, 'loss/train': 1.8817105293273926}} 11/06/2021 22:06:40 - INFO - __main__ - Step 6124: {'lr': 0.0004990431678236849, 'samples': 1175808, 'steps': 6123, 'loss/train': 1.8044439554214478}} 11/06/2021 22:06:42 - INFO - __main__ - Step 6128: {'lr': 0.0004990413115364803, 'samples': 1176576, 'steps': 6127, 'loss/train': 1.5602920055389404}} 11/06/2021 22:06:44 - INFO - __main__ - Step 6132: {'lr': 0.000499039453453849, 'samples': 1177344, 'steps': 6131, 'loss/train': 1.8965604305267334}}} 11/06/2021 22:06:45 - INFO - __main__ - Step 6136: {'lr': 0.0004990375935758042, 'samples': 1178112, 'steps': 6135, 'loss/train': 1.5371315479278564}} 11/06/2021 22:06:47 - INFO - __main__ - Step 6140: {'lr': 0.0004990357319023597, 'samples': 1178880, 'steps': 6139, 'loss/train': 1.368839144706726}}} 11/06/2021 22:06:47 - INFO - __main__ - Step 6140: {'lr': 0.0004990357319023597, 'samples': 1178880, 'steps': 6139, 'loss/train': 1.368839144706726}}} 11/06/2021 22:06:52 - INFO - __main__ - Step 6148: {'lr': 0.0004990320031693242, 'samples': 1180416, 'steps': 6147, 'loss/train': 2.2118520736694336}} 11/06/2021 22:06:53 - INFO - __main__ - Step 6152: {'lr': 0.0004990301361097603, 'samples': 1181184, 'steps': 6151, 'loss/train': 1.1245484352111816}} 11/06/2021 22:06:55 - INFO - __main__ - Step 6156: {'lr': 0.0004990282672548503, 'samples': 1181952, 'steps': 6155, 'loss/train': 1.679478645324707}}} 11/06/2021 22:06:57 - INFO - __main__ - Step 6160: {'lr': 0.0004990263966046075, 'samples': 1182720, 'steps': 6159, 'loss/train': 1.8187702894210815}} 11/06/2021 22:07:00 - INFO - __main__ - Step 6166: {'lr': 0.000499023587263024, 'samples': 1183872, 'steps': 6165, 'loss/train': 1.6445512771606445}}} 11/06/2021 22:07:02 - INFO - __main__ - Step 6170: {'lr': 0.0004990217121245084, 'samples': 1184640, 'steps': 6169, 'loss/train': 1.9792548418045044}} 11/06/2021 22:07:02 - INFO - __main__ - Step 6170: {'lr': 0.0004990217121245084, 'samples': 1184640, 'steps': 6169, 'loss/train': 1.9792548418045044}} 11/06/2021 22:07:05 - INFO - __main__ - Step 6177: {'lr': 0.0004990184263122088, 'samples': 1185984, 'steps': 6176, 'loss/train': 1.861528754234314}}} 11/06/2021 22:07:08 - INFO - __main__ - Step 6182: {'lr': 0.0004990160759373033, 'samples': 1186944, 'steps': 6181, 'loss/train': 2.10902738571167}}}} 11/06/2021 22:07:10 - INFO - __main__ - Step 6187: {'lr': 0.0004990137227573278, 'samples': 1187904, 'steps': 6186, 'loss/train': 3.2400269508361816}} 11/06/2021 22:07:12 - INFO - __main__ - Step 6191: {'lr': 0.0004990118381937148, 'samples': 1188672, 'steps': 6190, 'loss/train': 1.4901134967803955}} 11/06/2021 22:07:12 - INFO - __main__ - Step 6191: {'lr': 0.0004990118381937148, 'samples': 1188672, 'steps': 6190, 'loss/train': 1.4901134967803955}} 11/06/2021 22:07:15 - INFO - __main__ - Step 6198: {'lr': 0.0004990085358876658, 'samples': 1190016, 'steps': 6197, 'loss/train': 1.6927975416183472}} 11/06/2021 22:07:17 - INFO - __main__ - Step 6202: {'lr': 0.0004990066463872462, 'samples': 1190784, 'steps': 6201, 'loss/train': 1.7335742712020874}} 11/06/2021 22:07:20 - INFO - __main__ - Step 6207: {'lr': 0.000499004281987256, 'samples': 1191744, 'steps': 6206, 'loss/train': 1.6635841131210327}}} 11/06/2021 22:07:20 - INFO - __main__ - Step 6207: {'lr': 0.000499004281987256, 'samples': 1191744, 'steps': 6206, 'loss/train': 1.6635841131210327}}} 11/06/2021 22:07:23 - INFO - __main__ - Step 6214: {'lr': 0.0004990009671149811, 'samples': 1193088, 'steps': 6213, 'loss/train': 1.9133497476577759}} 11/06/2021 22:07:26 - INFO - __main__ - Step 6218: {'lr': 0.0004989990704339361, 'samples': 1193856, 'steps': 6217, 'loss/train': 1.0167715549468994}} 11/06/2021 22:07:26 - INFO - __main__ - Step 6218: {'lr': 0.0004989990704339361, 'samples': 1193856, 'steps': 6217, 'loss/train': 1.0167715549468994}} 11/06/2021 22:07:30 - INFO - __main__ - Step 6226: {'lr': 0.0004989952716864931, 'samples': 1195392, 'steps': 6225, 'loss/train': 2.1702065467834473}} 11/06/2021 22:07:31 - INFO - __main__ - Step 6230: {'lr': 0.0004989933696201225, 'samples': 1196160, 'steps': 6229, 'loss/train': 1.936288595199585}}} 11/06/2021 22:07:34 - INFO - __main__ - Step 6234: {'lr': 0.0004989914657586707, 'samples': 1196928, 'steps': 6233, 'loss/train': 1.8493226766586304}} 11/06/2021 22:07:36 - INFO - __main__ - Step 6239: {'lr': 0.0004989890834075441, 'samples': 1197888, 'steps': 6238, 'loss/train': 1.5966987609863281}} 11/06/2021 22:07:38 - INFO - __main__ - Step 6243: {'lr': 0.0004989871755072101, 'samples': 1198656, 'steps': 6242, 'loss/train': 1.8593974113464355}} 11/06/2021 22:07:38 - INFO - __main__ - Step 6243: {'lr': 0.0004989871755072101, 'samples': 1198656, 'steps': 6242, 'loss/train': 1.8593974113464355}} 11/06/2021 22:07:41 - INFO - __main__ - Step 6250: {'lr': 0.0004989838323623272, 'samples': 1200000, 'steps': 6249, 'loss/train': 2.3803365230560303}} 11/06/2021 22:07:43 - INFO - __main__ - Step 6254: {'lr': 0.000498981919525676, 'samples': 1200768, 'steps': 6253, 'loss/train': 1.925179362297058}3}} 11/06/2021 22:07:46 - INFO - __main__ - Step 6259: {'lr': 0.0004989795259556469, 'samples': 1201728, 'steps': 6258, 'loss/train': 2.1959168910980225}} 11/06/2021 22:07:48 - INFO - __main__ - Step 6264: {'lr': 0.0004989771295809594, 'samples': 1202688, 'steps': 6263, 'loss/train': 1.5440324544906616}} 11/06/2021 22:07:50 - INFO - __main__ - Step 6268: {'lr': 0.0004989752104618736, 'samples': 1203456, 'steps': 6267, 'loss/train': 2.009376287460327}}} 11/06/2021 22:07:50 - INFO - __main__ - Step 6268: {'lr': 0.0004989752104618736, 'samples': 1203456, 'steps': 6267, 'loss/train': 2.009376287460327}}} 11/06/2021 22:07:53 - INFO - __main__ - Step 6275: {'lr': 0.0004989718476843828, 'samples': 1204800, 'steps': 6274, 'loss/train': 1.9315811395645142}} 11/06/2021 22:07:55 - INFO - __main__ - Step 6279: {'lr': 0.0004989699236292173, 'samples': 1205568, 'steps': 6278, 'loss/train': 1.9111852645874023}} 11/06/2021 22:07:58 - INFO - __main__ - Step 6284: {'lr': 0.0004989675160361669, 'samples': 1206528, 'steps': 6283, 'loss/train': 2.0830070972442627}} 11/06/2021 22:07:58 - INFO - __main__ - Step 6284: {'lr': 0.0004989675160361669, 'samples': 1206528, 'steps': 6283, 'loss/train': 2.0830070972442627}} 11/06/2021 22:08:02 - INFO - __main__ - Step 6292: {'lr': 0.0004989636580538896, 'samples': 1208064, 'steps': 6291, 'loss/train': 1.3750081062316895}} 11/06/2021 22:08:03 - INFO - __main__ - Step 6296: {'lr': 0.0004989617263704437, 'samples': 1208832, 'steps': 6295, 'loss/train': 1.7614363431930542}} 11/06/2021 22:08:06 - INFO - __main__ - Step 6300: {'lr': 0.0004989597928921447, 'samples': 1209600, 'steps': 6299, 'loss/train': 2.2829113006591797}} 11/06/2021 22:08:08 - INFO - __main__ - Step 6305: {'lr': 0.0004989573735202802, 'samples': 1210560, 'steps': 6304, 'loss/train': 1.910634994506836}}} 11/06/2021 22:08:10 - INFO - __main__ - Step 6309: {'lr': 0.000498955436003613, 'samples': 1211328, 'steps': 6308, 'loss/train': 1.7411454916000366}}} 11/06/2021 22:08:12 - INFO - __main__ - Step 6313: {'lr': 0.0004989534966921382, 'samples': 1212096, 'steps': 6312, 'loss/train': 1.8578460216522217}} 11/06/2021 22:08:14 - INFO - __main__ - Step 6317: {'lr': 0.0004989515555858697, 'samples': 1212864, 'steps': 6316, 'loss/train': 2.206059694290161}}} 11/06/2021 22:08:16 - INFO - __main__ - Step 6321: {'lr': 0.0004989496126848215, 'samples': 1213632, 'steps': 6320, 'loss/train': 1.8948540687561035}} 11/06/2021 22:08:18 - INFO - __main__ - Step 6326: {'lr': 0.0004989471815346237, 'samples': 1214592, 'steps': 6325, 'loss/train': 2.1662731170654297}} 11/06/2021 22:08:20 - INFO - __main__ - Step 6330: {'lr': 0.0004989452345953725, 'samples': 1215360, 'steps': 6329, 'loss/train': 1.8572430610656738}} 11/06/2021 22:08:20 - INFO - __main__ - Step 6330: {'lr': 0.0004989452345953725, 'samples': 1215360, 'steps': 6329, 'loss/train': 1.8572430610656738}} 11/06/2021 22:08:24 - INFO - __main__ - Step 6337: {'lr': 0.0004989418231331124, 'samples': 1216704, 'steps': 6336, 'loss/train': 2.056525230407715}}} 11/06/2021 22:08:26 - INFO - __main__ - Step 6342: {'lr': 0.0004989393830092705, 'samples': 1217664, 'steps': 6341, 'loss/train': 2.555983781814575}}} 11/06/2021 22:08:28 - INFO - __main__ - Step 6347: {'lr': 0.0004989369400812225, 'samples': 1218624, 'steps': 6346, 'loss/train': 1.2773443460464478}} 11/06/2021 22:08:31 - INFO - __main__ - Step 6351: {'lr': 0.0004989349837197742, 'samples': 1219392, 'steps': 6350, 'loss/train': 1.6861871480941772}} 11/06/2021 22:08:33 - INFO - __main__ - Step 6355: {'lr': 0.0004989330255636656, 'samples': 1220160, 'steps': 6354, 'loss/train': 1.5128992795944214}} 11/06/2021 22:08:34 - INFO - __main__ - Step 6359: {'lr': 0.000498931065612911, 'samples': 1220928, 'steps': 6358, 'loss/train': 1.3788177967071533}}} 11/06/2021 22:08:36 - INFO - __main__ - Step 6363: {'lr': 0.0004989291038675245, 'samples': 1221696, 'steps': 6362, 'loss/train': 2.109246015548706}}} 11/06/2021 22:08:39 - INFO - __main__ - Step 6368: {'lr': 0.0004989266491621117, 'samples': 1222656, 'steps': 6367, 'loss/train': 1.7916598320007324}} 11/06/2021 22:08:41 - INFO - __main__ - Step 6372: {'lr': 0.0004989246833788549, 'samples': 1223424, 'steps': 6371, 'loss/train': 1.6870710849761963}} 11/06/2021 22:08:41 - INFO - __main__ - Step 6372: {'lr': 0.0004989246833788549, 'samples': 1223424, 'steps': 6371, 'loss/train': 1.6870710849761963}} 11/06/2021 22:08:41 - INFO - __main__ - Step 6372: {'lr': 0.0004989246833788549, 'samples': 1223424, 'steps': 6371, 'loss/train': 1.6870710849761963}} 11/06/2021 22:08:46 - INFO - __main__ - Step 6381: {'lr': 0.0004989202538050939, 'samples': 1225152, 'steps': 6380, 'loss/train': 1.969157099723816}}} 11/06/2021 22:08:49 - INFO - __main__ - Step 6387: {'lr': 0.000498917295708727, 'samples': 1226304, 'steps': 6386, 'loss/train': 2.260986328125}816}}} 11/06/2021 22:08:51 - INFO - __main__ - Step 6391: {'lr': 0.0004989153214013135, 'samples': 1227072, 'steps': 6390, 'loss/train': 2.1878926753997803}} 11/06/2021 22:08:51 - INFO - __main__ - Step 6391: {'lr': 0.0004989153214013135, 'samples': 1227072, 'steps': 6390, 'loss/train': 2.1878926753997803}} 11/06/2021 22:08:54 - INFO - __main__ - Step 6398: {'lr': 0.0004989118620452884, 'samples': 1228416, 'steps': 6397, 'loss/train': 1.6816266775131226}} 11/06/2021 22:08:56 - INFO - __main__ - Step 6402: {'lr': 0.0004989098828029836, 'samples': 1229184, 'steps': 6401, 'loss/train': 2.1822431087493896}} 11/06/2021 22:08:59 - INFO - __main__ - Step 6407: {'lr': 0.0004989074062266177, 'samples': 1230144, 'steps': 6406, 'loss/train': 5.536162853240967}}} 11/06/2021 22:09:01 - INFO - __main__ - Step 6411: {'lr': 0.0004989054229467546, 'samples': 1230912, 'steps': 6410, 'loss/train': 1.9058226346969604}} 11/06/2021 22:09:03 - INFO - __main__ - Step 6415: {'lr': 0.0004989034378724443, 'samples': 1231680, 'steps': 6414, 'loss/train': 2.1043949127197266}} 11/06/2021 22:09:04 - INFO - __main__ - Step 6419: {'lr': 0.0004989014510037013, 'samples': 1232448, 'steps': 6418, 'loss/train': 1.0096818208694458}} 11/06/2021 22:09:06 - INFO - __main__ - Step 6423: {'lr': 0.00049889946234054, 'samples': 1233216, 'steps': 6422, 'loss/train': 1.466779351234436}58}} 11/06/2021 22:09:09 - INFO - __main__ - Step 6428: {'lr': 0.0004988969739882091, 'samples': 1234176, 'steps': 6427, 'loss/train': 1.8715691566467285}} 11/06/2021 22:09:09 - INFO - __main__ - Step 6428: {'lr': 0.0004988969739882091, 'samples': 1234176, 'steps': 6427, 'loss/train': 1.8715691566467285}} 11/06/2021 22:09:12 - INFO - __main__ - Step 6433: {'lr': 0.0004988944828321499, 'samples': 1235136, 'steps': 6432, 'loss/train': 1.8814183473587036}} 11/06/2021 22:09:12 - INFO - __main__ - Step 6433: {'lr': 0.0004988944828321499, 'samples': 1235136, 'steps': 6432, 'loss/train': 1.8814183473587036}} 11/06/2021 22:09:16 - INFO - __main__ - Step 6442: {'lr': 0.0004988899916859372, 'samples': 1236864, 'steps': 6441, 'loss/train': 2.750673294067383}}} 11/06/2021 22:09:18 - INFO - __main__ - Step 6446: {'lr': 0.0004988879927051484, 'samples': 1237632, 'steps': 6445, 'loss/train': 1.573569655418396}}} 11/06/2021 22:09:20 - INFO - __main__ - Step 6451: {'lr': 0.0004988854914558994, 'samples': 1238592, 'steps': 6450, 'loss/train': 1.7760518789291382}} 11/06/2021 22:09:20 - INFO - __main__ - Step 6451: {'lr': 0.0004988854914558994, 'samples': 1238592, 'steps': 6450, 'loss/train': 1.7760518789291382}} 11/06/2021 22:09:25 - INFO - __main__ - Step 6459: {'lr': 0.0004988814836256269, 'samples': 1240128, 'steps': 6458, 'loss/train': 1.4734045267105103}} 11/06/2021 22:09:26 - INFO - __main__ - Step 6463: {'lr': 0.0004988794770190717, 'samples': 1240896, 'steps': 6462, 'loss/train': 2.106597661972046}}} 11/06/2021 22:09:29 - INFO - __main__ - Step 6468: {'lr': 0.0004988769662377013, 'samples': 1241856, 'steps': 6467, 'loss/train': 2.2871172428131104}} 11/06/2021 22:09:31 - INFO - __main__ - Step 6472: {'lr': 0.0004988749555940814, 'samples': 1242624, 'steps': 6471, 'loss/train': 1.6348545551300049}} 11/06/2021 22:09:33 - INFO - __main__ - Step 6476: {'lr': 0.0004988729431562339, 'samples': 1243392, 'steps': 6475, 'loss/train': 2.184319496154785}}} 11/06/2021 22:09:35 - INFO - __main__ - Step 6480: {'lr': 0.0004988709289241736, 'samples': 1244160, 'steps': 6479, 'loss/train': 2.1878020763397217}} 11/06/2021 22:09:36 - INFO - __main__ - Step 6484: {'lr': 0.000498868912897915, 'samples': 1244928, 'steps': 6483, 'loss/train': 1.5354762077331543}}} 11/06/2021 22:09:39 - INFO - __main__ - Step 6489: {'lr': 0.0004988663903420222, 'samples': 1245888, 'steps': 6488, 'loss/train': 2.216024398803711}}} 11/06/2021 22:09:39 - INFO - __main__ - Step 6489: {'lr': 0.0004988663903420222, 'samples': 1245888, 'steps': 6488, 'loss/train': 2.216024398803711}}} 11/06/2021 22:09:43 - INFO - __main__ - Step 6497: {'lr': 0.0004988623484215673, 'samples': 1247424, 'steps': 6496, 'loss/train': 0.6462783217430115}} 11/06/2021 22:09:45 - INFO - __main__ - Step 6501: {'lr': 0.0004988603247701276, 'samples': 1248192, 'steps': 6500, 'loss/train': 1.803402066230774}}} 11/06/2021 22:09:46 - INFO - __main__ - Step 6505: {'lr': 0.0004988582993245661, 'samples': 1248960, 'steps': 6504, 'loss/train': 2.119647264480591}}} 11/06/2021 22:09:48 - INFO - __main__ - Step 6509: {'lr': 0.0004988562720848973, 'samples': 1249728, 'steps': 6508, 'loss/train': 2.310293674468994}}} 11/06/2021 22:09:51 - INFO - __main__ - Step 6514: {'lr': 0.0004988537355123699, 'samples': 1250688, 'steps': 6513, 'loss/train': 2.3833720684051514}} 11/06/2021 22:09:53 - INFO - __main__ - Step 6518: {'lr': 0.0004988517042360128, 'samples': 1251456, 'steps': 6517, 'loss/train': 1.9547860622406006}} 11/06/2021 22:09:55 - INFO - __main__ - Step 6522: {'lr': 0.0004988496711655961, 'samples': 1252224, 'steps': 6521, 'loss/train': 1.5848283767700195}} 11/06/2021 22:09:56 - INFO - __main__ - Step 6526: {'lr': 0.0004988476363011341, 'samples': 1252992, 'steps': 6525, 'loss/train': 2.017069101333618}}} 11/06/2021 22:09:58 - INFO - __main__ - Step 6530: {'lr': 0.0004988455996426418, 'samples': 1253760, 'steps': 6529, 'loss/train': 1.9897722005844116}} 11/06/2021 22:10:01 - INFO - __main__ - Step 6535: {'lr': 0.0004988430512966932, 'samples': 1254720, 'steps': 6534, 'loss/train': 2.3683085441589355}} 11/06/2021 22:10:03 - INFO - __main__ - Step 6539: {'lr': 0.000498841010601686, 'samples': 1255488, 'steps': 6538, 'loss/train': 2.037982225418091}5}} 11/06/2021 22:10:05 - INFO - __main__ - Step 6543: {'lr': 0.000498838968112696, 'samples': 1256256, 'steps': 6542, 'loss/train': 1.6090011596679688}}} 11/06/2021 22:10:07 - INFO - __main__ - Step 6547: {'lr': 0.000498836923829738, 'samples': 1257024, 'steps': 6546, 'loss/train': 1.8499354124069214}}} 11/06/2021 22:10:09 - INFO - __main__ - Step 6551: {'lr': 0.0004988348777528267, 'samples': 1257792, 'steps': 6550, 'loss/train': 2.171156883239746}}} 11/06/2021 22:10:11 - INFO - __main__ - Step 6556: {'lr': 0.0004988323176339633, 'samples': 1258752, 'steps': 6555, 'loss/train': 1.467564582824707}}} 11/06/2021 22:10:11 - INFO - __main__ - Step 6556: {'lr': 0.0004988323176339633, 'samples': 1258752, 'steps': 6555, 'loss/train': 1.467564582824707}}} 11/06/2021 22:10:15 - INFO - __main__ - Step 6564: {'lr': 0.0004988282156135539, 'samples': 1260288, 'steps': 6563, 'loss/train': 1.7303180694580078}} 11/06/2021 22:10:17 - INFO - __main__ - Step 6568: {'lr': 0.000498826161912506, 'samples': 1261056, 'steps': 6567, 'loss/train': 1.8563683032989502}}} 11/06/2021 22:10:18 - INFO - __main__ - Step 6572: {'lr': 0.0004988241064175826, 'samples': 1261824, 'steps': 6571, 'loss/train': 2.2202649116516113}} 11/06/2021 22:10:21 - INFO - __main__ - Step 6577: {'lr': 0.0004988215345263132, 'samples': 1262784, 'steps': 6576, 'loss/train': 1.774949550628662}}} 11/06/2021 22:10:21 - INFO - __main__ - Step 6577: {'lr': 0.0004988215345263132, 'samples': 1262784, 'steps': 6576, 'loss/train': 1.774949550628662}}} 11/06/2021 22:10:25 - INFO - __main__ - Step 6585: {'lr': 0.0004988174136703066, 'samples': 1264320, 'steps': 6584, 'loss/train': 1.922788381576538}}} 11/06/2021 22:10:27 - INFO - __main__ - Step 6589: {'lr': 0.0004988153505515771, 'samples': 1265088, 'steps': 6588, 'loss/train': 2.8506147861480713}} 11/06/2021 22:10:29 - INFO - __main__ - Step 6593: {'lr': 0.0004988132856390498, 'samples': 1265856, 'steps': 6592, 'loss/train': 1.3303931951522827}} 11/06/2021 22:10:31 - INFO - __main__ - Step 6597: {'lr': 0.0004988112189327397, 'samples': 1266624, 'steps': 6596, 'loss/train': 2.0449981689453125}} 11/06/2021 22:10:31 - INFO - __main__ - Step 6597: {'lr': 0.0004988112189327397, 'samples': 1266624, 'steps': 6596, 'loss/train': 2.0449981689453125}} 11/06/2021 22:10:35 - INFO - __main__ - Step 6606: {'lr': 0.0004988065622851006, 'samples': 1268352, 'steps': 6605, 'loss/train': 1.55886709690094}5}} 11/06/2021 22:10:37 - INFO - __main__ - Step 6610: {'lr': 0.0004988044897490993, 'samples': 1269120, 'steps': 6609, 'loss/train': 1.8489357233047485}} 11/06/2021 22:10:39 - INFO - __main__ - Step 6614: {'lr': 0.0004988024154193785, 'samples': 1269888, 'steps': 6613, 'loss/train': 2.244598627090454}}} 11/06/2021 22:10:41 - INFO - __main__ - Step 6618: {'lr': 0.0004988003392959533, 'samples': 1270656, 'steps': 6617, 'loss/train': 2.182772397994995}}} 11/06/2021 22:10:43 - INFO - __main__ - Step 6622: {'lr': 0.0004987982613788384, 'samples': 1271424, 'steps': 6621, 'loss/train': 1.808817744255066}}} 11/06/2021 22:10:45 - INFO - __main__ - Step 6626: {'lr': 0.0004987961816680492, 'samples': 1272192, 'steps': 6625, 'loss/train': 1.5684220790863037}} 11/06/2021 22:10:47 - INFO - __main__ - Step 6630: {'lr': 0.0004987941001636004, 'samples': 1272960, 'steps': 6629, 'loss/train': 2.308150291442871}}} 11/06/2021 22:10:48 - INFO - __main__ - Step 6634: {'lr': 0.0004987920168655071, 'samples': 1273728, 'steps': 6633, 'loss/train': 1.4046695232391357}} 11/06/2021 22:10:51 - INFO - __main__ - Step 6639: {'lr': 0.0004987894102206008, 'samples': 1274688, 'steps': 6638, 'loss/train': 1.5049811601638794}} 11/06/2021 22:10:53 - INFO - __main__ - Step 6644: {'lr': 0.0004987868007731778, 'samples': 1275648, 'steps': 6643, 'loss/train': 1.0979185104370117}} 11/06/2021 22:10:53 - INFO - __main__ - Step 6644: {'lr': 0.0004987868007731778, 'samples': 1275648, 'steps': 6643, 'loss/train': 1.0979185104370117}} 11/06/2021 22:10:53 - INFO - __main__ - Step 6644: {'lr': 0.0004987868007731778, 'samples': 1275648, 'steps': 6643, 'loss/train': 1.0979185104370117}} 11/06/2021 22:10:58 - INFO - __main__ - Step 6654: {'lr': 0.000498781573470899, 'samples': 1277568, 'steps': 6653, 'loss/train': 2.209815502166748}7}} 11/06/2021 22:11:01 - INFO - __main__ - Step 6659: {'lr': 0.0004987789556161022, 'samples': 1278528, 'steps': 6658, 'loss/train': 0.2782423198223114}} 11/06/2021 22:11:03 - INFO - __main__ - Step 6663: {'lr': 0.0004987768593145362, 'samples': 1279296, 'steps': 6662, 'loss/train': 1.7183184623718262}} 11/06/2021 22:11:05 - INFO - __main__ - Step 6667: {'lr': 0.0004987747612194499, 'samples': 1280064, 'steps': 6666, 'loss/train': 1.2572715282440186}} 11/06/2021 22:11:07 - INFO - __main__ - Step 6671: {'lr': 0.0004987726613308584, 'samples': 1280832, 'steps': 6670, 'loss/train': 1.7459203004837036}} 11/06/2021 22:11:08 - INFO - __main__ - Step 6675: {'lr': 0.0004987705596487771, 'samples': 1281600, 'steps': 6674, 'loss/train': 1.3550761938095093}} 11/06/2021 22:11:10 - INFO - __main__ - Step 6679: {'lr': 0.000498768456173221, 'samples': 1282368, 'steps': 6678, 'loss/train': 1.7885398864746094}}} 11/06/2021 22:11:13 - INFO - __main__ - Step 6685: {'lr': 0.0004987652975971546, 'samples': 1283520, 'steps': 6684, 'loss/train': 2.027494192123413}}} 11/06/2021 22:11:15 - INFO - __main__ - Step 6689: {'lr': 0.0004987631896379779, 'samples': 1284288, 'steps': 6688, 'loss/train': 1.4638168811798096}} 11/06/2021 22:11:15 - INFO - __main__ - Step 6689: {'lr': 0.0004987631896379779, 'samples': 1284288, 'steps': 6688, 'loss/train': 1.4638168811798096}} 11/06/2021 22:11:19 - INFO - __main__ - Step 6696: {'lr': 0.0004987594963940066, 'samples': 1285632, 'steps': 6695, 'loss/train': 1.7367832660675049}} 11/06/2021 22:11:21 - INFO - __main__ - Step 6700: {'lr': 0.0004987573835029569, 'samples': 1286400, 'steps': 6699, 'loss/train': 2.1391384601593018}} 11/06/2021 22:11:23 - INFO - __main__ - Step 6705: {'lr': 0.0004987547398672061, 'samples': 1287360, 'steps': 6704, 'loss/train': 1.8196769952774048}} 11/06/2021 22:11:25 - INFO - __main__ - Step 6710: {'lr': 0.000498752093429329, 'samples': 1288320, 'steps': 6709, 'loss/train': 1.8113231658935547}}} 11/06/2021 22:11:27 - INFO - __main__ - Step 6714: {'lr': 0.0004987499742615167, 'samples': 1289088, 'steps': 6713, 'loss/train': 1.8710038661956787}} 11/06/2021 22:11:27 - INFO - __main__ - Step 6714: {'lr': 0.0004987499742615167, 'samples': 1289088, 'steps': 6713, 'loss/train': 1.8710038661956787}} 11/06/2021 22:11:31 - INFO - __main__ - Step 6721: {'lr': 0.0004987462614026624, 'samples': 1290432, 'steps': 6720, 'loss/train': 1.4481638669967651}} 11/06/2021 22:11:33 - INFO - __main__ - Step 6725: {'lr': 0.0004987441373032393, 'samples': 1291200, 'steps': 6724, 'loss/train': 1.264276385307312}}} 11/06/2021 22:11:35 - INFO - __main__ - Step 6730: {'lr': 0.000498741479657156, 'samples': 1292160, 'steps': 6729, 'loss/train': 2.122650146484375}}}} 11/06/2021 22:11:38 - INFO - __main__ - Step 6735: {'lr': 0.0004987388192090959, 'samples': 1293120, 'steps': 6734, 'loss/train': 1.5254181623458862}} 11/06/2021 22:11:38 - INFO - __main__ - Step 6735: {'lr': 0.0004987388192090959, 'samples': 1293120, 'steps': 6734, 'loss/train': 1.5254181623458862}} 11/06/2021 22:11:41 - INFO - __main__ - Step 6742: {'lr': 0.0004987350898745477, 'samples': 1294464, 'steps': 6741, 'loss/train': 1.4959521293640137}} 11/06/2021 22:11:43 - INFO - __main__ - Step 6747: {'lr': 0.0004987324227018657, 'samples': 1295424, 'steps': 6746, 'loss/train': 2.1900782585144043}} 11/06/2021 22:11:45 - INFO - __main__ - Step 6751: {'lr': 0.0004987302869463686, 'samples': 1296192, 'steps': 6750, 'loss/train': 1.5255533456802368}} 11/06/2021 22:11:48 - INFO - __main__ - Step 6756: {'lr': 0.0004987276147303337, 'samples': 1297152, 'steps': 6755, 'loss/train': 1.5767613649368286}} 11/06/2021 22:11:48 - INFO - __main__ - Step 6756: {'lr': 0.0004987276147303337, 'samples': 1297152, 'steps': 6755, 'loss/train': 1.5767613649368286}} 11/06/2021 22:11:51 - INFO - __main__ - Step 6763: {'lr': 0.0004987238689208327, 'samples': 1298496, 'steps': 6762, 'loss/train': 1.8828072547912598}} 11/06/2021 22:11:53 - INFO - __main__ - Step 6767: {'lr': 0.0004987217259926904, 'samples': 1299264, 'steps': 6766, 'loss/train': 0.23178565502166748} 11/06/2021 22:11:56 - INFO - __main__ - Step 6772: {'lr': 0.0004987190448109354, 'samples': 1300224, 'steps': 6771, 'loss/train': 2.2244198322296143}} 11/06/2021 22:11:56 - INFO - __main__ - Step 6772: {'lr': 0.0004987190448109354, 'samples': 1300224, 'steps': 6771, 'loss/train': 2.2244198322296143}} 11/06/2021 22:11:59 - INFO - __main__ - Step 6779: {'lr': 0.0004987152864495887, 'samples': 1301568, 'steps': 6778, 'loss/train': 2.1859946250915527}} 11/06/2021 22:12:01 - INFO - __main__ - Step 6784: {'lr': 0.0004987125985437468, 'samples': 1302528, 'steps': 6783, 'loss/train': 2.15081524848938}7}} 11/06/2021 22:12:03 - INFO - __main__ - Step 6788: {'lr': 0.0004987104462018828, 'samples': 1303296, 'steps': 6787, 'loss/train': 1.6238676309585571}} 11/06/2021 22:12:06 - INFO - __main__ - Step 6793: {'lr': 0.0004987077532530899, 'samples': 1304256, 'steps': 6792, 'loss/train': 1.743523120880127}}} 11/06/2021 22:12:08 - INFO - __main__ - Step 6797: {'lr': 0.0004987055968769045, 'samples': 1305024, 'steps': 6796, 'loss/train': 1.779247522354126}}} 11/06/2021 22:12:08 - INFO - __main__ - Step 6797: {'lr': 0.0004987055968769045, 'samples': 1305024, 'steps': 6796, 'loss/train': 1.779247522354126}}} 11/06/2021 22:12:11 - INFO - __main__ - Step 6804: {'lr': 0.0004987018189041675, 'samples': 1306368, 'steps': 6803, 'loss/train': 1.7351499795913696}} 11/06/2021 22:12:13 - INFO - __main__ - Step 6808: {'lr': 0.0004986996575972517, 'samples': 1307136, 'steps': 6807, 'loss/train': 0.7521131038665771}} 11/06/2021 22:12:13 - INFO - __main__ - Step 6808: {'lr': 0.0004986996575972517, 'samples': 1307136, 'steps': 6807, 'loss/train': 0.7521131038665771}} 11/06/2021 22:12:13 - INFO - __main__ - Step 6808: {'lr': 0.0004986996575972517, 'samples': 1307136, 'steps': 6807, 'loss/train': 0.7521131038665771}} 11/06/2021 22:12:19 - INFO - __main__ - Step 6820: {'lr': 0.0004986931629187848, 'samples': 1309440, 'steps': 6819, 'loss/train': 2.2714736461639404}} 11/06/2021 22:12:21 - INFO - __main__ - Step 6824: {'lr': 0.0004986909944401082, 'samples': 1310208, 'steps': 6823, 'loss/train': 1.4807871580123901}} 11/06/2021 22:12:24 - INFO - __main__ - Step 6829: {'lr': 0.0004986882813204967, 'samples': 1311168, 'steps': 6828, 'loss/train': 2.6850411891937256}} 11/06/2021 22:12:24 - INFO - __main__ - Step 6829: {'lr': 0.0004986882813204967, 'samples': 1311168, 'steps': 6828, 'loss/train': 2.6850411891937256}} 11/06/2021 22:12:24 - INFO - __main__ - Step 6829: {'lr': 0.0004986882813204967, 'samples': 1311168, 'steps': 6828, 'loss/train': 2.6850411891937256}} 11/06/2021 22:12:29 - INFO - __main__ - Step 6839: {'lr': 0.0004986828466771718, 'samples': 1313088, 'steps': 6838, 'loss/train': 2.1545591354370117}} 11/06/2021 22:12:32 - INFO - __main__ - Step 6845: {'lr': 0.0004986795805126339, 'samples': 1314240, 'steps': 6844, 'loss/train': 1.878450632095337}}} 11/06/2021 22:12:34 - INFO - __main__ - Step 6849: {'lr': 0.0004986774008285816, 'samples': 1315008, 'steps': 6848, 'loss/train': 1.9744795560836792}} 11/06/2021 22:12:34 - INFO - __main__ - Step 6849: {'lr': 0.0004986774008285816, 'samples': 1315008, 'steps': 6848, 'loss/train': 1.9744795560836792}} 11/06/2021 22:12:37 - INFO - __main__ - Step 6856: {'lr': 0.000498673582067567, 'samples': 1316352, 'steps': 6855, 'loss/train': 1.685018539428711}2}} 11/06/2021 22:12:40 - INFO - __main__ - Step 6861: {'lr': 0.0004986708510196688, 'samples': 1317312, 'steps': 6860, 'loss/train': 1.852432131767273}}} 11/06/2021 22:12:42 - INFO - __main__ - Step 6866: {'lr': 0.0004986681171705893, 'samples': 1318272, 'steps': 6865, 'loss/train': 1.9213021993637085}} 11/06/2021 22:12:44 - INFO - __main__ - Step 6870: {'lr': 0.000498665928074496, 'samples': 1319040, 'steps': 6869, 'loss/train': 1.9043452739715576}}} 11/06/2021 22:12:44 - INFO - __main__ - Step 6870: {'lr': 0.000498665928074496, 'samples': 1319040, 'steps': 6869, 'loss/train': 1.9043452739715576}}} 11/06/2021 22:12:47 - INFO - __main__ - Step 6876: {'lr': 0.0004986626410690099, 'samples': 1320192, 'steps': 6875, 'loss/train': 1.3328086137771606}} 11/06/2021 22:12:49 - INFO - __main__ - Step 6881: {'lr': 0.0004986598988165718, 'samples': 1321152, 'steps': 6880, 'loss/train': 2.5729262828826904}} 11/06/2021 22:12:49 - INFO - __main__ - Step 6881: {'lr': 0.0004986598988165718, 'samples': 1321152, 'steps': 6880, 'loss/train': 2.5729262828826904}} 11/06/2021 22:12:53 - INFO - __main__ - Step 6889: {'lr': 0.0004986555053864833, 'samples': 1322688, 'steps': 6888, 'loss/train': 1.7372727394104004}} 11/06/2021 22:12:56 - INFO - __main__ - Step 6893: {'lr': 0.000498653305982463, 'samples': 1323456, 'steps': 6892, 'loss/train': 1.350880742073059}4}} 11/06/2021 22:12:57 - INFO - __main__ - Step 6897: {'lr': 0.0004986511047858134, 'samples': 1324224, 'steps': 6896, 'loss/train': 2.2133257389068604}} 11/06/2021 22:12:59 - INFO - __main__ - Step 6902: {'lr': 0.0004986483507691403, 'samples': 1325184, 'steps': 6901, 'loss/train': 1.9309344291687012}} 11/06/2021 22:13:02 - INFO - __main__ - Step 6907: {'lr': 0.0004986455939515395, 'samples': 1326144, 'steps': 6906, 'loss/train': 1.7874330282211304}} 11/06/2021 22:13:02 - INFO - __main__ - Step 6907: {'lr': 0.0004986455939515395, 'samples': 1326144, 'steps': 6906, 'loss/train': 1.7874330282211304}} 11/06/2021 22:13:05 - INFO - __main__ - Step 6914: {'lr': 0.0004986417297013987, 'samples': 1327488, 'steps': 6913, 'loss/train': 1.6202863454818726}} 11/06/2021 22:13:07 - INFO - __main__ - Step 6918: {'lr': 0.0004986395190937048, 'samples': 1328256, 'steps': 6917, 'loss/train': 1.7865066528320312}} 11/06/2021 22:13:09 - INFO - __main__ - Step 6922: {'lr': 0.000498637306693481, 'samples': 1329024, 'steps': 6921, 'loss/train': 1.9036977291107178}}} 11/06/2021 22:13:11 - INFO - __main__ - Step 6927: {'lr': 0.00049863453867248, 'samples': 1329984, 'steps': 6926, 'loss/train': 1.5846327543258667}}}} 11/06/2021 22:13:14 - INFO - __main__ - Step 6932: {'lr': 0.0004986317678507069, 'samples': 1330944, 'steps': 6931, 'loss/train': 1.773486852645874}}} 11/06/2021 22:13:14 - INFO - __main__ - Step 6932: {'lr': 0.0004986317678507069, 'samples': 1330944, 'steps': 6931, 'loss/train': 1.773486852645874}}} 11/06/2021 22:13:18 - INFO - __main__ - Step 6939: {'lr': 0.0004986278839949866, 'samples': 1332288, 'steps': 6938, 'loss/train': 2.0607035160064697}} 11/06/2021 22:13:19 - INFO - __main__ - Step 6943: {'lr': 0.0004986256621842417, 'samples': 1333056, 'steps': 6942, 'loss/train': 1.3075493574142456}} 11/06/2021 22:13:21 - INFO - __main__ - Step 6947: {'lr': 0.0004986234385810668, 'samples': 1333824, 'steps': 6946, 'loss/train': 1.9208937883377075}} 11/06/2021 22:13:24 - INFO - __main__ - Step 6952: {'lr': 0.0004986206565565173, 'samples': 1334784, 'steps': 6951, 'loss/train': 1.6181141138076782}} 11/06/2021 22:13:26 - INFO - __main__ - Step 6956: {'lr': 0.000498618428920433, 'samples': 1335552, 'steps': 6955, 'loss/train': 1.4519563913345337}}} 11/06/2021 22:13:28 - INFO - __main__ - Step 6960: {'lr': 0.0004986161994919706, 'samples': 1336320, 'steps': 6959, 'loss/train': 1.547726035118103}}} 11/06/2021 22:13:29 - INFO - __main__ - Step 6964: {'lr': 0.0004986139682711463, 'samples': 1337088, 'steps': 6963, 'loss/train': 2.060143232345581}}} 11/06/2021 22:13:32 - INFO - __main__ - Step 6968: {'lr': 0.000498611735257976, 'samples': 1337856, 'steps': 6967, 'loss/train': 1.8004080057144165}}} 11/06/2021 22:13:34 - INFO - __main__ - Step 6972: {'lr': 0.000498609500452476, 'samples': 1338624, 'steps': 6971, 'loss/train': 1.8543205261230469}}} 11/06/2021 22:13:36 - INFO - __main__ - Step 6976: {'lr': 0.0004986072638546623, 'samples': 1339392, 'steps': 6975, 'loss/train': 1.7745708227157593}} 11/06/2021 22:13:36 - INFO - __main__ - Step 6976: {'lr': 0.0004986072638546623, 'samples': 1339392, 'steps': 6975, 'loss/train': 1.7745708227157593}} 11/06/2021 22:13:36 - INFO - __main__ - Step 6976: {'lr': 0.0004986072638546623, 'samples': 1339392, 'steps': 6975, 'loss/train': 1.7745708227157593}} 11/06/2021 22:13:41 - INFO - __main__ - Step 6986: {'lr': 0.0004986016645188615, 'samples': 1341312, 'steps': 6985, 'loss/train': 1.4657028913497925}} 11/06/2021 22:13:44 - INFO - __main__ - Step 6991: {'lr': 0.0004985988606503426, 'samples': 1342272, 'steps': 6990, 'loss/train': 1.556723713874817}}} 11/06/2021 22:13:46 - INFO - __main__ - Step 6996: {'lr': 0.0004985960539814534, 'samples': 1343232, 'steps': 6995, 'loss/train': 1.901774287223816}}} 11/06/2021 22:13:46 - INFO - __main__ - Step 6996: {'lr': 0.0004985960539814534, 'samples': 1343232, 'steps': 6995, 'loss/train': 1.901774287223816}}} 11/06/2021 22:13:49 - INFO - __main__ - Step 7003: {'lr': 0.0004985921199404467, 'samples': 1344576, 'steps': 7002, 'loss/train': 1.5738434791564941}} 11/06/2021 22:13:51 - INFO - __main__ - Step 7007: {'lr': 0.0004985898694527498, 'samples': 1345344, 'steps': 7006, 'loss/train': 1.9892717599868774}} 11/06/2021 22:13:54 - INFO - __main__ - Step 7012: {'lr': 0.0004985870538228884, 'samples': 1346304, 'steps': 7011, 'loss/train': 1.831001877784729}}} 11/06/2021 22:13:54 - INFO - __main__ - Step 7012: {'lr': 0.0004985870538228884, 'samples': 1346304, 'steps': 7011, 'loss/train': 1.831001877784729}}} 11/06/2021 22:13:58 - INFO - __main__ - Step 7020: {'lr': 0.0004985825429906299, 'samples': 1347840, 'steps': 7019, 'loss/train': 1.8477237224578857}} 11/06/2021 22:13:59 - INFO - __main__ - Step 7024: {'lr': 0.0004985802848863135, 'samples': 1348608, 'steps': 7023, 'loss/train': 2.0270166397094727}} 11/06/2021 22:14:02 - INFO - __main__ - Step 7028: {'lr': 0.0004985780249898941, 'samples': 1349376, 'steps': 7027, 'loss/train': 1.8955899477005005}} 11/06/2021 22:14:04 - INFO - __main__ - Step 7033: {'lr': 0.0004985751975992497, 'samples': 1350336, 'steps': 7032, 'loss/train': 2.039046287536621}}} 11/06/2021 22:14:06 - INFO - __main__ - Step 7037: {'lr': 0.000498572933670658, 'samples': 1351104, 'steps': 7036, 'loss/train': 1.8642250299453735}}} 11/06/2021 22:14:08 - INFO - __main__ - Step 7041: {'lr': 0.0004985706679500163, 'samples': 1351872, 'steps': 7040, 'loss/train': 2.2261483669281006}} 11/06/2021 22:14:08 - INFO - __main__ - Step 7041: {'lr': 0.0004985706679500163, 'samples': 1351872, 'steps': 7040, 'loss/train': 2.2261483669281006}} 11/06/2021 22:14:11 - INFO - __main__ - Step 7048: {'lr': 0.000498566698626822, 'samples': 1353216, 'steps': 7047, 'loss/train': 2.0857248306274414}}} 11/06/2021 22:14:14 - INFO - __main__ - Step 7053: {'lr': 0.0004985638600359542, 'samples': 1354176, 'steps': 7052, 'loss/train': 2.2024543285369873}} 11/06/2021 22:14:16 - INFO - __main__ - Step 7058: {'lr': 0.0004985610186451104, 'samples': 1355136, 'steps': 7057, 'loss/train': 2.400442123413086}}} 11/06/2021 22:14:18 - INFO - __main__ - Step 7062: {'lr': 0.0004985587435164742, 'samples': 1355904, 'steps': 7061, 'loss/train': 2.2818222045898438}} 11/06/2021 22:14:18 - INFO - __main__ - Step 7062: {'lr': 0.0004985587435164742, 'samples': 1355904, 'steps': 7061, 'loss/train': 2.2818222045898438}} 11/06/2021 22:14:21 - INFO - __main__ - Step 7069: {'lr': 0.0004985547577294963, 'samples': 1357248, 'steps': 7068, 'loss/train': 1.4728955030441284}} 11/06/2021 22:14:24 - INFO - __main__ - Step 7074: {'lr': 0.0004985519073789447, 'samples': 1358208, 'steps': 7073, 'loss/train': 2.0647716522216797}} 11/06/2021 22:14:24 - INFO - __main__ - Step 7074: {'lr': 0.0004985519073789447, 'samples': 1358208, 'steps': 7073, 'loss/train': 2.0647716522216797}} 11/06/2021 22:14:27 - INFO - __main__ - Step 7081: {'lr': 0.000498547912184446, 'samples': 1359552, 'steps': 7080, 'loss/train': 2.0644752979278564}}} 11/06/2021 22:14:30 - INFO - __main__ - Step 7085: {'lr': 0.0004985456267523346, 'samples': 1360320, 'steps': 7084, 'loss/train': 1.65984308719635}}}} 11/06/2021 22:14:32 - INFO - __main__ - Step 7090: {'lr': 0.0004985427674424038, 'samples': 1361280, 'steps': 7089, 'loss/train': 1.8997712135314941}} 11/06/2021 22:14:32 - INFO - __main__ - Step 7090: {'lr': 0.0004985427674424038, 'samples': 1361280, 'steps': 7089, 'loss/train': 1.8997712135314941}} 11/06/2021 22:14:32 - INFO - __main__ - Step 7090: {'lr': 0.0004985427674424038, 'samples': 1361280, 'steps': 7089, 'loss/train': 1.8997712135314941}} 11/06/2021 22:14:38 - INFO - __main__ - Step 7101: {'lr': 0.0004985364671055223, 'samples': 1363392, 'steps': 7100, 'loss/train': 2.0017964839935303}} 11/06/2021 22:14:40 - INFO - __main__ - Step 7106: {'lr': 0.000498533598836542, 'samples': 1364352, 'steps': 7105, 'loss/train': 1.2057009935379028}}} 11/06/2021 22:14:42 - INFO - __main__ - Step 7110: {'lr': 0.0004985313022056191, 'samples': 1365120, 'steps': 7109, 'loss/train': 1.8806291818618774}} 11/06/2021 22:14:44 - INFO - __main__ - Step 7114: {'lr': 0.0004985290037829462, 'samples': 1365888, 'steps': 7113, 'loss/train': 3.7923812866210938}} 11/06/2021 22:14:46 - INFO - __main__ - Step 7118: {'lr': 0.00049852670356854, 'samples': 1366656, 'steps': 7117, 'loss/train': 1.7581599950790405}8}} 11/06/2021 22:14:48 - INFO - __main__ - Step 7122: {'lr': 0.000498524401562417, 'samples': 1367424, 'steps': 7121, 'loss/train': 1.7350369691848755}}} 11/06/2021 22:14:50 - INFO - __main__ - Step 7127: {'lr': 0.0004985215215351869, 'samples': 1368384, 'steps': 7126, 'loss/train': 1.9587048292160034}} 11/06/2021 22:14:50 - INFO - __main__ - Step 7127: {'lr': 0.0004985215215351869, 'samples': 1368384, 'steps': 7126, 'loss/train': 1.9587048292160034}} 11/06/2021 22:14:53 - INFO - __main__ - Step 7134: {'lr': 0.0004985174847939135, 'samples': 1369728, 'steps': 7133, 'loss/train': 2.039775848388672}}} 11/06/2021 22:14:56 - INFO - __main__ - Step 7138: {'lr': 0.0004985151756210897, 'samples': 1370496, 'steps': 7137, 'loss/train': 1.8486833572387695}} 11/06/2021 22:14:58 - INFO - __main__ - Step 7143: {'lr': 0.0004985122866355768, 'samples': 1371456, 'steps': 7142, 'loss/train': 2.1116602420806885}} 11/06/2021 22:15:00 - INFO - __main__ - Step 7148: {'lr': 0.0004985093948506689, 'samples': 1372416, 'steps': 7147, 'loss/train': 1.5859280824661255}} 11/06/2021 22:15:03 - INFO - __main__ - Step 7152: {'lr': 0.0004985070794072002, 'samples': 1373184, 'steps': 7151, 'loss/train': 2.0557656288146973}} 11/06/2021 22:15:05 - INFO - __main__ - Step 7156: {'lr': 0.0004985047621721561, 'samples': 1373952, 'steps': 7155, 'loss/train': 0.8238686323165894}} 11/06/2021 22:15:06 - INFO - __main__ - Step 7160: {'lr': 0.0004985024431455534, 'samples': 1374720, 'steps': 7159, 'loss/train': 1.8155590295791626}} 11/06/2021 22:15:08 - INFO - __main__ - Step 7164: {'lr': 0.0004985001223274089, 'samples': 1375488, 'steps': 7163, 'loss/train': 2.357848644256592}}} 11/06/2021 22:15:10 - INFO - __main__ - Step 7169: {'lr': 0.000498497218785398, 'samples': 1376448, 'steps': 7168, 'loss/train': 1.955859899520874}}}} 11/06/2021 22:15:10 - INFO - __main__ - Step 7169: {'lr': 0.000498497218785398, 'samples': 1376448, 'steps': 7168, 'loss/train': 1.955859899520874}}}} 11/06/2021 22:15:10 - INFO - __main__ - Step 7169: {'lr': 0.000498497218785398, 'samples': 1376448, 'steps': 7168, 'loss/train': 1.955859899520874}}}} 11/06/2021 22:15:16 - INFO - __main__ - Step 7179: {'lr': 0.000498491403303733, 'samples': 1378368, 'steps': 7178, 'loss/train': 2.4547572135925293}}} 11/06/2021 22:15:18 - INFO - __main__ - Step 7185: {'lr': 0.0004984879086403304, 'samples': 1379520, 'steps': 7184, 'loss/train': 2.017399311065674}}} 11/06/2021 22:15:21 - INFO - __main__ - Step 7189: {'lr': 0.000498485576625429, 'samples': 1380288, 'steps': 7188, 'loss/train': 1.9039487838745117}}} 11/06/2021 22:15:21 - INFO - __main__ - Step 7189: {'lr': 0.000498485576625429, 'samples': 1380288, 'steps': 7188, 'loss/train': 1.9039487838745117}}} 11/06/2021 22:15:24 - INFO - __main__ - Step 7196: {'lr': 0.0004984814912887563, 'samples': 1381632, 'steps': 7195, 'loss/train': 2.1821253299713135}} 11/06/2021 22:15:26 - INFO - __main__ - Step 7201: {'lr': 0.0004984785698322699, 'samples': 1382592, 'steps': 7200, 'loss/train': 1.3614236116409302}} 11/06/2021 22:15:29 - INFO - __main__ - Step 7206: {'lr': 0.0004984756455767684, 'samples': 1383552, 'steps': 7205, 'loss/train': 1.8581416606903076}} 11/06/2021 22:15:31 - INFO - __main__ - Step 7210: {'lr': 0.0004984733041570983, 'samples': 1384320, 'steps': 7209, 'loss/train': 1.8552770614624023}} 11/06/2021 22:15:31 - INFO - __main__ - Step 7210: {'lr': 0.0004984733041570983, 'samples': 1384320, 'steps': 7209, 'loss/train': 1.8552770614624023}} 11/06/2021 22:15:34 - INFO - __main__ - Step 7217: {'lr': 0.0004984692023622938, 'samples': 1385664, 'steps': 7216, 'loss/train': 2.0930893421173096}} 11/06/2021 22:15:36 - INFO - __main__ - Step 7222: {'lr': 0.000498466269150165, 'samples': 1386624, 'steps': 7221, 'loss/train': 1.4030883312225342}}} 11/06/2021 22:15:39 - INFO - __main__ - Step 7227: {'lr': 0.0004984633331391596, 'samples': 1387584, 'steps': 7226, 'loss/train': 1.6768375635147095}} 11/06/2021 22:15:39 - INFO - __main__ - Step 7227: {'lr': 0.0004984633331391596, 'samples': 1387584, 'steps': 7226, 'loss/train': 1.6768375635147095}} 11/06/2021 22:15:42 - INFO - __main__ - Step 7234: {'lr': 0.0004984592180217022, 'samples': 1388928, 'steps': 7233, 'loss/train': 1.602423906326294}}} 11/06/2021 22:15:44 - INFO - __main__ - Step 7238: {'lr': 0.0004984568640630648, 'samples': 1389696, 'steps': 7237, 'loss/train': 1.3969769477844238}} 11/06/2021 22:15:46 - INFO - __main__ - Step 7243: {'lr': 0.0004984539190958765, 'samples': 1390656, 'steps': 7242, 'loss/train': 2.117086887359619}}} 11/06/2021 22:15:46 - INFO - __main__ - Step 7243: {'lr': 0.0004984539190958765, 'samples': 1390656, 'steps': 7242, 'loss/train': 2.117086887359619}}} 11/06/2021 22:15:50 - INFO - __main__ - Step 7251: {'lr': 0.0004984492013270147, 'samples': 1392192, 'steps': 7250, 'loss/train': 1.9997104406356812}} 11/06/2021 22:15:52 - INFO - __main__ - Step 7255: {'lr': 0.0004984468397558384, 'samples': 1392960, 'steps': 7254, 'loss/train': 1.3238756656646729}} 11/06/2021 22:15:54 - INFO - __main__ - Step 7259: {'lr': 0.000498444476393521, 'samples': 1393728, 'steps': 7258, 'loss/train': 2.349273920059204}9}} 11/06/2021 22:15:56 - INFO - __main__ - Step 7264: {'lr': 0.0004984415196718582, 'samples': 1394688, 'steps': 7263, 'loss/train': 1.5477303266525269}} 11/06/2021 22:15:59 - INFO - __main__ - Step 7268: {'lr': 0.0004984391522795359, 'samples': 1395456, 'steps': 7267, 'loss/train': 1.868264079093933}}} 11/06/2021 22:16:01 - INFO - __main__ - Step 7272: {'lr': 0.0004984367830961281, 'samples': 1396224, 'steps': 7271, 'loss/train': 2.167809247970581}}} 11/06/2021 22:16:01 - INFO - __main__ - Step 7272: {'lr': 0.0004984367830961281, 'samples': 1396224, 'steps': 7271, 'loss/train': 2.167809247970581}}} 11/06/2021 22:16:04 - INFO - __main__ - Step 7279: {'lr': 0.000498432632715416, 'samples': 1397568, 'steps': 7278, 'loss/train': 0.6851865649223328}}} 11/06/2021 22:16:07 - INFO - __main__ - Step 7285: {'lr': 0.0004984290708805743, 'samples': 1398720, 'steps': 7284, 'loss/train': 1.7015665769577026}} 11/06/2021 22:16:09 - INFO - __main__ - Step 7289: {'lr': 0.0004984266940852434, 'samples': 1399488, 'steps': 7288, 'loss/train': 2.0083210468292236}} 11/06/2021 22:16:11 - INFO - __main__ - Step 7293: {'lr': 0.0004984243154989168, 'samples': 1400256, 'steps': 7292, 'loss/train': 1.5568764209747314}} 11/06/2021 22:16:12 - INFO - __main__ - Step 7297: {'lr': 0.0004984219351216116, 'samples': 1401024, 'steps': 7296, 'loss/train': 2.4018189907073975}} 11/06/2021 22:16:14 - INFO - __main__ - Step 7301: {'lr': 0.0004984195529533451, 'samples': 1401792, 'steps': 7300, 'loss/train': 1.912482500076294}}} 11/06/2021 22:16:17 - INFO - __main__ - Step 7306: {'lr': 0.0004984165727244984, 'samples': 1402752, 'steps': 7305, 'loss/train': 1.6583056449890137}} 11/06/2021 22:16:19 - INFO - __main__ - Step 7310: {'lr': 0.0004984141865266312, 'samples': 1403520, 'steps': 7309, 'loss/train': 0.43410855531692505} 11/06/2021 22:16:21 - INFO - __main__ - Step 7314: {'lr': 0.0004984117985378586, 'samples': 1404288, 'steps': 7313, 'loss/train': 2.5833494663238525}} 11/06/2021 22:16:22 - INFO - __main__ - Step 7318: {'lr': 0.0004984094087581975, 'samples': 1405056, 'steps': 7317, 'loss/train': 1.7780122756958008}} 11/06/2021 22:16:24 - INFO - __main__ - Step 7322: {'lr': 0.0004984070171876653, 'samples': 1405824, 'steps': 7321, 'loss/train': 2.0687177181243896}} 11/06/2021 22:16:27 - INFO - __main__ - Step 7327: {'lr': 0.0004984040252061137, 'samples': 1406784, 'steps': 7326, 'loss/train': 2.036705255508423}}} 11/06/2021 22:16:29 - INFO - __main__ - Step 7331: {'lr': 0.0004984016296061846, 'samples': 1407552, 'steps': 7330, 'loss/train': 1.4878357648849487}} 11/06/2021 22:16:29 - INFO - __main__ - Step 7331: {'lr': 0.0004984016296061846, 'samples': 1407552, 'steps': 7330, 'loss/train': 1.4878357648849487}} 11/06/2021 22:16:32 - INFO - __main__ - Step 7338: {'lr': 0.0004983974329971702, 'samples': 1408896, 'steps': 7337, 'loss/train': 1.8037465810775757}} 11/06/2021 22:16:34 - INFO - __main__ - Step 7343: {'lr': 0.0004983944320615757, 'samples': 1409856, 'steps': 7342, 'loss/train': 2.154550552368164}}} 11/06/2021 22:16:36 - INFO - __main__ - Step 7347: {'lr': 0.00049839202929849, 'samples': 1410624, 'steps': 7346, 'loss/train': 2.132795572280884}4}}} 11/06/2021 22:16:39 - INFO - __main__ - Step 7352: {'lr': 0.0004983890233263986, 'samples': 1411584, 'steps': 7351, 'loss/train': 1.8521647453308105}} 11/06/2021 22:16:41 - INFO - __main__ - Step 7356: {'lr': 0.0004983866165341592, 'samples': 1412352, 'steps': 7355, 'loss/train': 2.184023380279541}}} 11/06/2021 22:16:43 - INFO - __main__ - Step 7360: {'lr': 0.0004983842079512128, 'samples': 1413120, 'steps': 7359, 'loss/train': 1.5754534006118774}} 11/06/2021 22:16:44 - INFO - __main__ - Step 7364: {'lr': 0.0004983817975775771, 'samples': 1413888, 'steps': 7363, 'loss/train': 1.666754126548767}}} 11/06/2021 22:16:47 - INFO - __main__ - Step 7368: {'lr': 0.0004983793854132693, 'samples': 1414656, 'steps': 7367, 'loss/train': 1.803916335105896}}} 11/06/2021 22:16:49 - INFO - __main__ - Step 7373: {'lr': 0.0004983763676897784, 'samples': 1415616, 'steps': 7372, 'loss/train': 1.749665379524231}}} 11/06/2021 22:16:51 - INFO - __main__ - Step 7377: {'lr': 0.000498373951496522, 'samples': 1416384, 'steps': 7376, 'loss/train': 2.2691292762756348}}} 11/06/2021 22:16:53 - INFO - __main__ - Step 7381: {'lr': 0.00049837153351265, 'samples': 1417152, 'steps': 7380, 'loss/train': 1.6483396291732788}}}} 11/06/2021 22:16:54 - INFO - __main__ - Step 7385: {'lr': 0.00049836911373818, 'samples': 1417920, 'steps': 7384, 'loss/train': 1.690798044204712}}}}} 11/06/2021 22:16:56 - INFO - __main__ - Step 7389: {'lr': 0.0004983666921731293, 'samples': 1418688, 'steps': 7388, 'loss/train': 1.6616744995117188}} 11/06/2021 22:16:59 - INFO - __main__ - Step 7394: {'lr': 0.0004983636626988386, 'samples': 1419648, 'steps': 7393, 'loss/train': 2.0506041049957275}} 11/06/2021 22:17:01 - INFO - __main__ - Step 7398: {'lr': 0.0004983612371050453, 'samples': 1420416, 'steps': 7397, 'loss/train': 1.7991613149642944}} 11/06/2021 22:17:03 - INFO - __main__ - Step 7402: {'lr': 0.0004983588097207283, 'samples': 1421184, 'steps': 7401, 'loss/train': 1.678063988685608}}} 11/06/2021 22:17:05 - INFO - __main__ - Step 7406: {'lr': 0.0004983563805459048, 'samples': 1421952, 'steps': 7405, 'loss/train': 1.7848727703094482}} 11/06/2021 22:17:07 - INFO - __main__ - Step 7410: {'lr': 0.0004983539495805925, 'samples': 1422720, 'steps': 7409, 'loss/train': 0.8858946561813354}} 11/06/2021 22:17:09 - INFO - __main__ - Step 7415: {'lr': 0.0004983509083561038, 'samples': 1423680, 'steps': 7414, 'loss/train': 2.069981813430786}}} 11/06/2021 22:17:09 - INFO - __main__ - Step 7415: {'lr': 0.0004983509083561038, 'samples': 1423680, 'steps': 7414, 'loss/train': 2.069981813430786}}} 11/06/2021 22:17:13 - INFO - __main__ - Step 7422: {'lr': 0.0004983466459418978, 'samples': 1425024, 'steps': 7421, 'loss/train': 2.4349045753479004}} 11/06/2021 22:17:14 - INFO - __main__ - Step 7426: {'lr': 0.0004983442078148056, 'samples': 1425792, 'steps': 7425, 'loss/train': 1.7043726444244385}} 11/06/2021 22:17:16 - INFO - __main__ - Step 7430: {'lr': 0.0004983417678973123, 'samples': 1426560, 'steps': 7429, 'loss/train': 2.2200920581817627}} 11/06/2021 22:17:19 - INFO - __main__ - Step 7435: {'lr': 0.0004983387154827208, 'samples': 1427520, 'steps': 7434, 'loss/train': 1.69056236743927}7}} 11/06/2021 22:17:19 - INFO - __main__ - Step 7435: {'lr': 0.0004983387154827208, 'samples': 1427520, 'steps': 7434, 'loss/train': 1.69056236743927}7}} 11/06/2021 22:17:23 - INFO - __main__ - Step 7443: {'lr': 0.0004983338258007139, 'samples': 1429056, 'steps': 7442, 'loss/train': 1.3787864446640015}} 11/06/2021 22:17:25 - INFO - __main__ - Step 7447: {'lr': 0.0004983313782742124, 'samples': 1429824, 'steps': 7446, 'loss/train': 1.3778132200241089}} 11/06/2021 22:17:27 - INFO - __main__ - Step 7451: {'lr': 0.0004983289289574022, 'samples': 1430592, 'steps': 7450, 'loss/train': 1.7557963132858276}} 11/06/2021 22:17:29 - INFO - __main__ - Step 7456: {'lr': 0.0004983258647937949, 'samples': 1431552, 'steps': 7455, 'loss/train': 2.1020658016204834}} 11/06/2021 22:17:29 - INFO - __main__ - Step 7456: {'lr': 0.0004983258647937949, 'samples': 1431552, 'steps': 7455, 'loss/train': 2.1020658016204834}} 11/06/2021 22:17:34 - INFO - __main__ - Step 7464: {'lr': 0.0004983209563136639, 'samples': 1433088, 'steps': 7463, 'loss/train': 1.939214825630188}}} 11/06/2021 22:17:35 - INFO - __main__ - Step 7468: {'lr': 0.0004983184993882394, 'samples': 1433856, 'steps': 7467, 'loss/train': 1.6667938232421875}} 11/06/2021 22:17:37 - INFO - __main__ - Step 7472: {'lr': 0.000498316040672599, 'samples': 1434624, 'steps': 7471, 'loss/train': 1.767512559890747}5}} 11/06/2021 22:17:39 - INFO - __main__ - Step 7476: {'lr': 0.0004983135801667608, 'samples': 1435392, 'steps': 7475, 'loss/train': 2.2196826934814453}} 11/06/2021 22:17:41 - INFO - __main__ - Step 7480: {'lr': 0.0004983111178707422, 'samples': 1436160, 'steps': 7479, 'loss/train': 1.5433342456817627}} 11/06/2021 22:17:43 - INFO - __main__ - Step 7484: {'lr': 0.0004983086537845611, 'samples': 1436928, 'steps': 7483, 'loss/train': 1.9418973922729492}} 11/06/2021 22:17:45 - INFO - __main__ - Step 7488: {'lr': 0.0004983061879082352, 'samples': 1437696, 'steps': 7487, 'loss/train': 2.341383218765259}}} 11/06/2021 22:17:47 - INFO - __main__ - Step 7492: {'lr': 0.0004983037202417824, 'samples': 1438464, 'steps': 7491, 'loss/train': 1.6653988361358643}} 11/06/2021 22:17:49 - INFO - __main__ - Step 7497: {'lr': 0.0004983006331413773, 'samples': 1439424, 'steps': 7496, 'loss/train': 1.6898235082626343}} 11/06/2021 22:17:51 - INFO - __main__ - Step 7501: {'lr': 0.0004982981614472039, 'samples': 1440192, 'steps': 7500, 'loss/train': 2.0123913288116455}} 11/06/2021 22:17:54 - INFO - __main__ - Step 7505: {'lr': 0.0004982956879629612, 'samples': 1440960, 'steps': 7504, 'loss/train': 2.184455633163452}}} 11/06/2021 22:17:55 - INFO - __main__ - Step 7509: {'lr': 0.0004982932126886674, 'samples': 1441728, 'steps': 7508, 'loss/train': 1.670443058013916}}} 11/06/2021 22:17:57 - INFO - __main__ - Step 7513: {'lr': 0.00049829073562434, 'samples': 1442496, 'steps': 7512, 'loss/train': 1.9669575691223145}}}} 11/06/2021 22:17:59 - INFO - __main__ - Step 7518: {'lr': 0.0004982876367767234, 'samples': 1443456, 'steps': 7517, 'loss/train': 2.108997106552124}}} 11/06/2021 22:18:01 - INFO - __main__ - Step 7522: {'lr': 0.0004982851556848861, 'samples': 1444224, 'steps': 7521, 'loss/train': 1.771378993988037}}} 11/06/2021 22:18:04 - INFO - __main__ - Step 7526: {'lr': 0.0004982826728030735, 'samples': 1444992, 'steps': 7525, 'loss/train': 1.9992296695709229}} 11/06/2021 22:18:05 - INFO - __main__ - Step 7530: {'lr': 0.0004982801881313034, 'samples': 1445760, 'steps': 7529, 'loss/train': 1.9027869701385498}} 11/06/2021 22:18:07 - INFO - __main__ - Step 7534: {'lr': 0.0004982777016695937, 'samples': 1446528, 'steps': 7533, 'loss/train': 1.1510889530181885}} 11/06/2021 22:18:07 - INFO - __main__ - Step 7534: {'lr': 0.0004982777016695937, 'samples': 1446528, 'steps': 7533, 'loss/train': 1.1510889530181885}} 11/06/2021 22:18:11 - INFO - __main__ - Step 7542: {'lr': 0.0004982727233764276, 'samples': 1448064, 'steps': 7541, 'loss/train': 1.264660120010376}}} 11/06/2021 22:18:13 - INFO - __main__ - Step 7546: {'lr': 0.0004982702315450068, 'samples': 1448832, 'steps': 7545, 'loss/train': 2.1761927604675293}} 11/06/2021 22:18:15 - INFO - __main__ - Step 7550: {'lr': 0.0004982677379237185, 'samples': 1449600, 'steps': 7549, 'loss/train': 1.834945797920227}}} 11/06/2021 22:18:18 - INFO - __main__ - Step 7555: {'lr': 0.0004982646183801337, 'samples': 1450560, 'steps': 7554, 'loss/train': 2.560479164123535}}} 11/06/2021 22:18:20 - INFO - __main__ - Step 7559: {'lr': 0.0004982621207317086, 'samples': 1451328, 'steps': 7558, 'loss/train': 1.7856800556182861}} 11/06/2021 22:18:20 - INFO - __main__ - Step 7559: {'lr': 0.0004982621207317086, 'samples': 1451328, 'steps': 7558, 'loss/train': 1.7856800556182861}} 11/06/2021 22:18:23 - INFO - __main__ - Step 7566: {'lr': 0.0004982577455402467, 'samples': 1452672, 'steps': 7565, 'loss/train': 1.7357767820358276}} 11/06/2021 22:18:25 - INFO - __main__ - Step 7571: {'lr': 0.0004982546170476494, 'samples': 1453632, 'steps': 7570, 'loss/train': 1.5686355829238892}} 11/06/2021 22:18:28 - INFO - __main__ - Step 7576: {'lr': 0.0004982514857585596, 'samples': 1454592, 'steps': 7575, 'loss/train': 1.8690491914749146}} 11/06/2021 22:18:28 - INFO - __main__ - Step 7576: {'lr': 0.0004982514857585596, 'samples': 1454592, 'steps': 7575, 'loss/train': 1.8690491914749146}} 11/06/2021 22:18:31 - INFO - __main__ - Step 7583: {'lr': 0.0004982470972557936, 'samples': 1455936, 'steps': 7582, 'loss/train': 2.4827017784118652}} 11/06/2021 22:18:33 - INFO - __main__ - Step 7587: {'lr': 0.0004982445870790823, 'samples': 1456704, 'steps': 7586, 'loss/train': 1.7887145280838013}} 11/06/2021 22:18:35 - INFO - __main__ - Step 7591: {'lr': 0.0004982420751126882, 'samples': 1457472, 'steps': 7590, 'loss/train': 2.2515952587127686}} 11/06/2021 22:18:38 - INFO - __main__ - Step 7596: {'lr': 0.0004982389326379814, 'samples': 1458432, 'steps': 7595, 'loss/train': 1.887166142463684}}} 11/06/2021 22:18:40 - INFO - __main__ - Step 7600: {'lr': 0.0004982364166448669, 'samples': 1459200, 'steps': 7599, 'loss/train': 1.9225597381591797}} 11/06/2021 22:18:41 - INFO - __main__ - Step 7604: {'lr': 0.0004982338988621284, 'samples': 1459968, 'steps': 7603, 'loss/train': 1.420168161392212}}} 11/06/2021 22:18:43 - INFO - __main__ - Step 7608: {'lr': 0.0004982313792897843, 'samples': 1460736, 'steps': 7607, 'loss/train': 1.2424761056900024}} 11/06/2021 22:18:46 - INFO - __main__ - Step 7613: {'lr': 0.0004982282273077483, 'samples': 1461696, 'steps': 7612, 'loss/train': 1.8897991180419922}} 11/06/2021 22:18:48 - INFO - __main__ - Step 7617: {'lr': 0.0004982257037088574, 'samples': 1462464, 'steps': 7616, 'loss/train': 1.82854425907135}2}} 11/06/2021 22:18:50 - INFO - __main__ - Step 7621: {'lr': 0.0004982231783204196, 'samples': 1463232, 'steps': 7620, 'loss/train': 1.768385887145996}}} 11/06/2021 22:18:51 - INFO - __main__ - Step 7625: {'lr': 0.0004982206511424534, 'samples': 1464000, 'steps': 7624, 'loss/train': 1.2371141910552979}} 11/06/2021 22:18:53 - INFO - __main__ - Step 7629: {'lr': 0.0004982181221749769, 'samples': 1464768, 'steps': 7628, 'loss/train': 2.405860424041748}}} 11/06/2021 22:18:56 - INFO - __main__ - Step 7634: {'lr': 0.0004982149584491601, 'samples': 1465728, 'steps': 7633, 'loss/train': 2.227440357208252}}} 11/06/2021 22:18:58 - INFO - __main__ - Step 7638: {'lr': 0.000498212425455352, 'samples': 1466496, 'steps': 7637, 'loss/train': 1.6778465509414673}}} 11/06/2021 22:19:00 - INFO - __main__ - Step 7642: {'lr': 0.0004982098906720928, 'samples': 1467264, 'steps': 7641, 'loss/train': 1.8286490440368652}} 11/06/2021 22:19:00 - INFO - __main__ - Step 7642: {'lr': 0.0004982098906720928, 'samples': 1467264, 'steps': 7641, 'loss/train': 1.8286490440368652}} 11/06/2021 22:19:03 - INFO - __main__ - Step 7649: {'lr': 0.0004982054504955778, 'samples': 1468608, 'steps': 7648, 'loss/train': 1.6242306232452393}} 11/06/2021 22:19:06 - INFO - __main__ - Step 7655: {'lr': 0.0004982016402683255, 'samples': 1469760, 'steps': 7654, 'loss/train': 2.361668348312378}}} 11/06/2021 22:19:08 - INFO - __main__ - Step 7659: {'lr': 0.0004981990978801035, 'samples': 1470528, 'steps': 7658, 'loss/train': 2.046013593673706}}} 11/06/2021 22:19:10 - INFO - __main__ - Step 7663: {'lr': 0.0004981965537025267, 'samples': 1471296, 'steps': 7662, 'loss/train': 2.0140273571014404}} 11/06/2021 22:19:10 - INFO - __main__ - Step 7663: {'lr': 0.0004981965537025267, 'samples': 1471296, 'steps': 7662, 'loss/train': 2.0140273571014404}} 11/06/2021 22:19:13 - INFO - __main__ - Step 7670: {'lr': 0.000498192097086187, 'samples': 1472640, 'steps': 7669, 'loss/train': 1.8466298580169678}}} 11/06/2021 22:19:15 - INFO - __main__ - Step 7675: {'lr': 0.0004981889104338499, 'samples': 1473600, 'steps': 7674, 'loss/train': 1.673049807548523}}} 11/06/2021 22:19:18 - INFO - __main__ - Step 7680: {'lr': 0.0004981857209857605, 'samples': 1474560, 'steps': 7679, 'loss/train': 2.1773593425750732}} 11/06/2021 22:19:18 - INFO - __main__ - Step 7680: {'lr': 0.0004981857209857605, 'samples': 1474560, 'steps': 7679, 'loss/train': 2.1773593425750732}} 11/06/2021 22:19:21 - INFO - __main__ - Step 7687: {'lr': 0.0004981812510616399, 'samples': 1475904, 'steps': 7686, 'loss/train': 1.9942034482955933}} 11/06/2021 22:19:23 - INFO - __main__ - Step 7691: {'lr': 0.0004981786943590928, 'samples': 1476672, 'steps': 7690, 'loss/train': 1.96816885471344}3}} 11/06/2021 22:19:26 - INFO - __main__ - Step 7696: {'lr': 0.0004981754959648376, 'samples': 1477632, 'steps': 7695, 'loss/train': 1.89278244972229}3}} 11/06/2021 22:19:26 - INFO - __main__ - Step 7696: {'lr': 0.0004981754959648376, 'samples': 1477632, 'steps': 7695, 'loss/train': 1.89278244972229}3}} 11/06/2021 22:19:30 - INFO - __main__ - Step 7704: {'lr': 0.0004981703727191935, 'samples': 1479168, 'steps': 7703, 'loss/train': 1.7264535427093506}} 11/06/2021 22:19:31 - INFO - __main__ - Step 7708: {'lr': 0.0004981678084126405, 'samples': 1479936, 'steps': 7707, 'loss/train': 1.5540006160736084}} 11/06/2021 22:19:33 - INFO - __main__ - Step 7712: {'lr': 0.0004981652423169582, 'samples': 1480704, 'steps': 7711, 'loss/train': 1.3098105192184448}} 11/06/2021 22:19:36 - INFO - __main__ - Step 7717: {'lr': 0.0004981620321814203, 'samples': 1481664, 'steps': 7716, 'loss/train': 1.9144853353500366}} 11/06/2021 22:19:38 - INFO - __main__ - Step 7722: {'lr': 0.0004981588192504329, 'samples': 1482624, 'steps': 7721, 'loss/train': 0.8639780879020691}} 11/06/2021 22:19:38 - INFO - __main__ - Step 7722: {'lr': 0.0004981588192504329, 'samples': 1482624, 'steps': 7721, 'loss/train': 0.8639780879020691}} 11/06/2021 22:19:38 - INFO - __main__ - Step 7722: {'lr': 0.0004981588192504329, 'samples': 1482624, 'steps': 7721, 'loss/train': 0.8639780879020691}} 11/06/2021 22:19:43 - INFO - __main__ - Step 7732: {'lr': 0.000498152385002254, 'samples': 1484544, 'steps': 7731, 'loss/train': 2.0949268341064453}}} 11/06/2021 22:19:46 - INFO - __main__ - Step 7738: {'lr': 0.0004981485190862737, 'samples': 1485696, 'steps': 7737, 'loss/train': 1.6075444221496582}} 11/06/2021 22:19:48 - INFO - __main__ - Step 7742: {'lr': 0.0004981459395727117, 'samples': 1486464, 'steps': 7741, 'loss/train': 2.1248600482940674}} 11/06/2021 22:19:48 - INFO - __main__ - Step 7742: {'lr': 0.0004981459395727117, 'samples': 1486464, 'steps': 7741, 'loss/train': 2.1248600482940674}} 11/06/2021 22:19:51 - INFO - __main__ - Step 7749: {'lr': 0.0004981414211192763, 'samples': 1487808, 'steps': 7748, 'loss/train': 2.3146755695343018}} 11/06/2021 22:19:53 - INFO - __main__ - Step 7753: {'lr': 0.0004981388366860869, 'samples': 1488576, 'steps': 7752, 'loss/train': 1.484642744064331}}} 11/06/2021 22:19:56 - INFO - __main__ - Step 7759: {'lr': 0.0004981349566820828, 'samples': 1489728, 'steps': 7758, 'loss/train': 1.9951647520065308}} 11/06/2021 22:19:58 - INFO - __main__ - Step 7763: {'lr': 0.0004981323677766273, 'samples': 1490496, 'steps': 7762, 'loss/train': 2.313772678375244}}} 11/06/2021 22:20:00 - INFO - __main__ - Step 7767: {'lr': 0.0004981297770822977, 'samples': 1491264, 'steps': 7766, 'loss/train': 2.1629531383514404}} 11/06/2021 22:20:00 - INFO - __main__ - Step 7767: {'lr': 0.0004981297770822977, 'samples': 1491264, 'steps': 7766, 'loss/train': 2.1629531383514404}} 11/06/2021 22:20:03 - INFO - __main__ - Step 7774: {'lr': 0.0004981252390627997, 'samples': 1492608, 'steps': 7773, 'loss/train': 2.382383108139038}}} 11/06/2021 22:20:06 - INFO - __main__ - Step 7779: {'lr': 0.000498121994266253, 'samples': 1493568, 'steps': 7778, 'loss/train': 1.9787876605987549}}} 11/06/2021 22:20:08 - INFO - __main__ - Step 7783: {'lr': 0.0004981193964166151, 'samples': 1494336, 'steps': 7782, 'loss/train': 2.476020097732544}}} 11/06/2021 22:20:10 - INFO - __main__ - Step 7787: {'lr': 0.0004981167967781968, 'samples': 1495104, 'steps': 7786, 'loss/train': 1.5972424745559692}} 11/06/2021 22:20:12 - INFO - __main__ - Step 7791: {'lr': 0.0004981141953510169, 'samples': 1495872, 'steps': 7790, 'loss/train': 1.9454317092895508}} 11/06/2021 22:20:14 - INFO - __main__ - Step 7795: {'lr': 0.0004981115921350941, 'samples': 1496640, 'steps': 7794, 'loss/train': 1.3753095865249634}} 11/06/2021 22:20:16 - INFO - __main__ - Step 7800: {'lr': 0.0004981083355997995, 'samples': 1497600, 'steps': 7799, 'loss/train': 2.574296236038208}}} 11/06/2021 22:20:16 - INFO - __main__ - Step 7800: {'lr': 0.0004981083355997995, 'samples': 1497600, 'steps': 7799, 'loss/train': 2.574296236038208}}} 11/06/2021 22:20:20 - INFO - __main__ - Step 7808: {'lr': 0.0004981031193300667, 'samples': 1499136, 'steps': 7807, 'loss/train': 1.0529712438583374}} 11/06/2021 22:20:22 - INFO - __main__ - Step 7812: {'lr': 0.0004981005085121963, 'samples': 1499904, 'steps': 7811, 'loss/train': 2.1915555000305176}} 11/06/2021 22:20:24 - INFO - __main__ - Step 7816: {'lr': 0.0004980978959056819, 'samples': 1500672, 'steps': 7815, 'loss/train': 2.804755926132202}}} 11/06/2021 22:20:24 - INFO - __main__ - Step 7816: {'lr': 0.0004980978959056819, 'samples': 1500672, 'steps': 7815, 'loss/train': 2.804755926132202}}} 11/06/2021 22:20:24 - INFO - __main__ - Step 7816: {'lr': 0.0004980978959056819, 'samples': 1500672, 'steps': 7815, 'loss/train': 2.804755926132202}}} 11/06/2021 22:20:30 - INFO - __main__ - Step 7827: {'lr': 0.0004980907020152242, 'samples': 1502784, 'steps': 7826, 'loss/train': 2.0572545528411865}} 11/06/2021 22:20:32 - INFO - __main__ - Step 7832: {'lr': 0.0004980874275935591, 'samples': 1503744, 'steps': 7831, 'loss/train': 0.8481667041778564}} 11/06/2021 22:20:35 - INFO - __main__ - Step 7837: {'lr': 0.0004980841503772846, 'samples': 1504704, 'steps': 7836, 'loss/train': 1.764209270477295}}} 11/06/2021 22:20:35 - INFO - __main__ - Step 7837: {'lr': 0.0004980841503772846, 'samples': 1504704, 'steps': 7836, 'loss/train': 1.764209270477295}}} 11/06/2021 22:20:38 - INFO - __main__ - Step 7843: {'lr': 0.0004980802140289232, 'samples': 1505856, 'steps': 7842, 'loss/train': 1.2913402318954468}} 11/06/2021 22:20:40 - INFO - __main__ - Step 7847: {'lr': 0.000498077587561056, 'samples': 1506624, 'steps': 7846, 'loss/train': 1.6562167406082153}}} 11/06/2021 22:20:40 - INFO - __main__ - Step 7847: {'lr': 0.000498077587561056, 'samples': 1506624, 'steps': 7846, 'loss/train': 1.6562167406082153}}} 11/06/2021 22:20:43 - INFO - __main__ - Step 7854: {'lr': 0.0004980729869387724, 'samples': 1507968, 'steps': 7853, 'loss/train': 1.2803887128829956}} 11/06/2021 22:20:45 - INFO - __main__ - Step 7858: {'lr': 0.0004980703555526338, 'samples': 1508736, 'steps': 7857, 'loss/train': 1.8307812213897705}} 11/06/2021 22:20:48 - INFO - __main__ - Step 7863: {'lr': 0.0004980670638049875, 'samples': 1509696, 'steps': 7862, 'loss/train': 2.1986732482910156}} 11/06/2021 22:20:50 - INFO - __main__ - Step 7867: {'lr': 0.0004980644283949152, 'samples': 1510464, 'steps': 7866, 'loss/train': 1.8365942239761353}} 11/06/2021 22:20:50 - INFO - __main__ - Step 7867: {'lr': 0.0004980644283949152, 'samples': 1510464, 'steps': 7866, 'loss/train': 1.8365942239761353}} 11/06/2021 22:20:53 - INFO - __main__ - Step 7873: {'lr': 0.0004980604719265928, 'samples': 1511616, 'steps': 7872, 'loss/train': 1.9281331300735474}} 11/06/2021 22:20:56 - INFO - __main__ - Step 7879: {'lr': 0.0004980565114344704, 'samples': 1512768, 'steps': 7878, 'loss/train': 2.003199577331543}}} 11/06/2021 22:20:58 - INFO - __main__ - Step 7883: {'lr': 0.0004980538688709761, 'samples': 1513536, 'steps': 7882, 'loss/train': 1.9707108736038208}} 11/06/2021 22:21:00 - INFO - __main__ - Step 7887: {'lr': 0.0004980512245191738, 'samples': 1514304, 'steps': 7886, 'loss/train': 2.2796289920806885}} 11/06/2021 22:21:02 - INFO - __main__ - Step 7891: {'lr': 0.0004980485783790827, 'samples': 1515072, 'steps': 7890, 'loss/train': 1.8750466108322144}} 11/06/2021 22:21:04 - INFO - __main__ - Step 7895: {'lr': 0.0004980459304507218, 'samples': 1515840, 'steps': 7894, 'loss/train': 2.0702314376831055}} 11/06/2021 22:21:06 - INFO - __main__ - Step 7899: {'lr': 0.0004980432807341102, 'samples': 1516608, 'steps': 7898, 'loss/train': 1.881008267402649}}} 11/06/2021 22:21:06 - INFO - __main__ - Step 7899: {'lr': 0.0004980432807341102, 'samples': 1516608, 'steps': 7898, 'loss/train': 1.881008267402649}}} 11/06/2021 22:21:10 - INFO - __main__ - Step 7906: {'lr': 0.0004980386394271191, 'samples': 1517952, 'steps': 7905, 'loss/train': 1.9592478275299072}} 11/06/2021 22:21:12 - INFO - __main__ - Step 7911: {'lr': 0.0004980353208549623, 'samples': 1518912, 'steps': 7910, 'loss/train': 2.016561508178711}}} 11/06/2021 22:21:12 - INFO - __main__ - Step 7911: {'lr': 0.0004980353208549623, 'samples': 1518912, 'steps': 7910, 'loss/train': 2.016561508178711}}} 11/06/2021 22:21:12 - INFO - __main__ - Step 7911: {'lr': 0.0004980353208549623, 'samples': 1518912, 'steps': 7910, 'loss/train': 2.016561508178711}}} 11/06/2021 22:21:18 - INFO - __main__ - Step 7922: {'lr': 0.0004980280101613119, 'samples': 1521024, 'steps': 7921, 'loss/train': 2.067695379257202}}} 11/06/2021 22:21:20 - INFO - __main__ - Step 7927: {'lr': 0.0004980246826484157, 'samples': 1521984, 'steps': 7926, 'loss/train': 1.8057146072387695}} 11/06/2021 22:21:20 - INFO - __main__ - Step 7927: {'lr': 0.0004980246826484157, 'samples': 1521984, 'steps': 7926, 'loss/train': 1.8057146072387695}} 11/06/2021 22:21:24 - INFO - __main__ - Step 7934: {'lr': 0.0004980200194366136, 'samples': 1523328, 'steps': 7933, 'loss/train': 1.6870510578155518}} 11/06/2021 22:21:26 - INFO - __main__ - Step 7938: {'lr': 0.0004980173522855608, 'samples': 1524096, 'steps': 7937, 'loss/train': 1.793459415435791}}} 11/06/2021 22:21:28 - INFO - __main__ - Step 7943: {'lr': 0.0004980140158323092, 'samples': 1525056, 'steps': 7942, 'loss/train': 1.4568382501602173}} 11/06/2021 22:21:28 - INFO - __main__ - Step 7943: {'lr': 0.0004980140158323092, 'samples': 1525056, 'steps': 7942, 'loss/train': 1.4568382501602173}} 11/06/2021 22:21:32 - INFO - __main__ - Step 7951: {'lr': 0.0004980086716960552, 'samples': 1526592, 'steps': 7950, 'loss/train': 2.027440071105957}}} 11/06/2021 22:21:34 - INFO - __main__ - Step 7955: {'lr': 0.0004980059969459455, 'samples': 1527360, 'steps': 7954, 'loss/train': 1.7081891298294067}} 11/06/2021 22:21:36 - INFO - __main__ - Step 7959: {'lr': 0.000498003320407873, 'samples': 1528128, 'steps': 7958, 'loss/train': 2.172380208969116}7}} 11/06/2021 22:21:38 - INFO - __main__ - Step 7964: {'lr': 0.0004979999722209891, 'samples': 1529088, 'steps': 7963, 'loss/train': 1.9147931337356567}} 11/06/2021 22:21:38 - INFO - __main__ - Step 7964: {'lr': 0.0004979999722209891, 'samples': 1529088, 'steps': 7963, 'loss/train': 1.9147931337356567}} 11/06/2021 22:21:38 - INFO - __main__ - Step 7964: {'lr': 0.0004979999722209891, 'samples': 1529088, 'steps': 7963, 'loss/train': 1.9147931337356567}} 11/06/2021 22:21:44 - INFO - __main__ - Step 7975: {'lr': 0.000497992596376341, 'samples': 1531200, 'steps': 7974, 'loss/train': 1.6081430912017822}}} 11/06/2021 22:21:47 - INFO - __main__ - Step 7980: {'lr': 0.0004979892392499932, 'samples': 1532160, 'steps': 7979, 'loss/train': 1.864031434059143}}} 11/06/2021 22:21:47 - INFO - __main__ - Step 7980: {'lr': 0.0004979892392499932, 'samples': 1532160, 'steps': 7979, 'loss/train': 1.864031434059143}}} 11/06/2021 22:21:50 - INFO - __main__ - Step 7987: {'lr': 0.0004979845345800294, 'samples': 1533504, 'steps': 7986, 'loss/train': 1.5527995824813843}} 11/06/2021 22:21:52 - INFO - __main__ - Step 7991: {'lr': 0.0004979818437389502, 'samples': 1534272, 'steps': 7990, 'loss/train': 1.586555004119873}}} 11/06/2021 22:21:55 - INFO - __main__ - Step 7996: {'lr': 0.0004979784776735257, 'samples': 1535232, 'steps': 7995, 'loss/train': 2.107032060623169}}} 11/06/2021 22:21:55 - INFO - __main__ - Step 7996: {'lr': 0.0004979784776735257, 'samples': 1535232, 'steps': 7995, 'loss/train': 2.107032060623169}}} 11/06/2021 22:21:58 - INFO - __main__ - Step 8003: {'lr': 0.0004979737604890582, 'samples': 1536576, 'steps': 8002, 'loss/train': 2.0916402339935303}} 11/06/2021 22:22:01 - INFO - __main__ - Step 8007: {'lr': 0.0004979710624969408, 'samples': 1537344, 'steps': 8006, 'loss/train': 1.6920371055603027}} 11/06/2021 22:22:02 - INFO - __main__ - Step 8011: {'lr': 0.0004979683627171125, 'samples': 1538112, 'steps': 8010, 'loss/train': 2.038038730621338}}} 11/06/2021 22:22:04 - INFO - __main__ - Step 8015: {'lr': 0.0004979656611495927, 'samples': 1538880, 'steps': 8014, 'loss/train': 2.2237470149993896}} 11/06/2021 22:22:07 - INFO - __main__ - Step 8020: {'lr': 0.0004979622816762815, 'samples': 1539840, 'steps': 8019, 'loss/train': 2.0212595462799072}} 11/06/2021 22:22:09 - INFO - __main__ - Step 8024: {'lr': 0.0004979595760865271, 'samples': 1540608, 'steps': 8023, 'loss/train': 1.602588176727295}}} 11/06/2021 22:22:09 - INFO - __main__ - Step 8024: {'lr': 0.0004979595760865271, 'samples': 1540608, 'steps': 8023, 'loss/train': 1.602588176727295}}} 11/06/2021 22:22:12 - INFO - __main__ - Step 8031: {'lr': 0.0004979548370029884, 'samples': 1541952, 'steps': 8030, 'loss/train': 1.7080976963043213}} 11/06/2021 22:22:14 - INFO - __main__ - Step 8036: {'lr': 0.0004979514485915731, 'samples': 1542912, 'steps': 8035, 'loss/train': 1.4310439825057983}} 11/06/2021 22:22:17 - INFO - __main__ - Step 8041: {'lr': 0.0004979480573870803, 'samples': 1543872, 'steps': 8040, 'loss/train': 1.2592707872390747}} 11/06/2021 22:22:19 - INFO - __main__ - Step 8045: {'lr': 0.0004979453424124961, 'samples': 1544640, 'steps': 8044, 'loss/train': 1.2424389123916626}} 11/06/2021 22:22:21 - INFO - __main__ - Step 8049: {'lr': 0.0004979426256503863, 'samples': 1545408, 'steps': 8048, 'loss/train': 2.2753522396087646}} 11/06/2021 22:22:21 - INFO - __main__ - Step 8049: {'lr': 0.0004979426256503863, 'samples': 1545408, 'steps': 8048, 'loss/train': 2.2753522396087646}} 11/06/2021 22:22:24 - INFO - __main__ - Step 8056: {'lr': 0.00049793786701552, 'samples': 1546752, 'steps': 8055, 'loss/train': 1.7011387348175049}6}} 11/06/2021 22:22:24 - INFO - __main__ - Step 8056: {'lr': 0.00049793786701552, 'samples': 1546752, 'steps': 8055, 'loss/train': 1.7011387348175049}6}} 11/06/2021 22:22:24 - INFO - __main__ - Step 8056: {'lr': 0.00049793786701552, 'samples': 1546752, 'steps': 8055, 'loss/train': 1.7011387348175049}6}} 11/06/2021 22:22:30 - INFO - __main__ - Step 8068: {'lr': 0.0004979296966200718, 'samples': 1549056, 'steps': 8067, 'loss/train': 1.7086400985717773}} 11/06/2021 22:22:32 - INFO - __main__ - Step 8073: {'lr': 0.0004979262875407896, 'samples': 1550016, 'steps': 8072, 'loss/train': 1.3283685445785522}} 11/06/2021 22:22:32 - INFO - __main__ - Step 8073: {'lr': 0.0004979262875407896, 'samples': 1550016, 'steps': 8072, 'loss/train': 1.3283685445785522}} 11/06/2021 22:22:36 - INFO - __main__ - Step 8079: {'lr': 0.0004979221929591663, 'samples': 1551168, 'steps': 8078, 'loss/train': 2.025535821914673}}} 11/06/2021 22:22:39 - INFO - __main__ - Step 8085: {'lr': 0.000497918094355986, 'samples': 1552320, 'steps': 8084, 'loss/train': 1.904146671295166}}}} 11/06/2021 22:22:41 - INFO - __main__ - Step 8089: {'lr': 0.0004979153597197003, 'samples': 1553088, 'steps': 8088, 'loss/train': 1.035143256187439}}} 11/06/2021 22:22:41 - INFO - __main__ - Step 8089: {'lr': 0.0004979153597197003, 'samples': 1553088, 'steps': 8088, 'loss/train': 1.035143256187439}}} 11/06/2021 22:22:44 - INFO - __main__ - Step 8096: {'lr': 0.0004979105698054992, 'samples': 1554432, 'steps': 8095, 'loss/train': 2.0879011154174805}} 11/06/2021 22:22:47 - INFO - __main__ - Step 8101: {'lr': 0.0004979071450870662, 'samples': 1555392, 'steps': 8100, 'loss/train': 1.7830842733383179}} 11/06/2021 22:22:49 - INFO - __main__ - Step 8106: {'lr': 0.0004979037175760548, 'samples': 1556352, 'steps': 8105, 'loss/train': 2.045214891433716}}} 11/06/2021 22:22:49 - INFO - __main__ - Step 8106: {'lr': 0.0004979037175760548, 'samples': 1556352, 'steps': 8105, 'loss/train': 2.045214891433716}}} 11/06/2021 22:22:49 - INFO - __main__ - Step 8106: {'lr': 0.0004979037175760548, 'samples': 1556352, 'steps': 8105, 'loss/train': 2.045214891433716}}} 11/06/2021 22:22:54 - INFO - __main__ - Step 8116: {'lr': 0.0004978968541764515, 'samples': 1558272, 'steps': 8115, 'loss/train': 2.9898436069488525}} 11/06/2021 22:22:57 - INFO - __main__ - Step 8121: {'lr': 0.0004978934182879369, 'samples': 1559232, 'steps': 8120, 'loss/train': 1.8195523023605347}} 11/06/2021 22:22:59 - INFO - __main__ - Step 8125: {'lr': 0.0004978906675665782, 'samples': 1560000, 'steps': 8124, 'loss/train': 2.085545063018799}}} 11/06/2021 22:23:00 - INFO - __main__ - Step 8129: {'lr': 0.0004978879150580882, 'samples': 1560768, 'steps': 8128, 'loss/train': 1.8509951829910278}} 11/06/2021 22:23:00 - INFO - __main__ - Step 8129: {'lr': 0.0004978879150580882, 'samples': 1560768, 'steps': 8128, 'loss/train': 1.8509951829910278}} 11/06/2021 22:23:05 - INFO - __main__ - Step 8137: {'lr': 0.0004978824046797935, 'samples': 1562304, 'steps': 8136, 'loss/train': 2.647218942642212}}} 11/06/2021 22:23:06 - INFO - __main__ - Step 8141: {'lr': 0.0004978796468100286, 'samples': 1563072, 'steps': 8140, 'loss/train': 1.7271705865859985}} 11/06/2021 22:23:09 - INFO - __main__ - Step 8146: {'lr': 0.0004978761969597831, 'samples': 1564032, 'steps': 8145, 'loss/train': 2.357698917388916}}} 11/06/2021 22:23:11 - INFO - __main__ - Step 8150: {'lr': 0.0004978734350691793, 'samples': 1564800, 'steps': 8149, 'loss/train': 1.4162994623184204}} 11/06/2021 22:23:13 - INFO - __main__ - Step 8154: {'lr': 0.0004978706713915684, 'samples': 1565568, 'steps': 8153, 'loss/train': 1.7779927253723145}} 11/06/2021 22:23:13 - INFO - __main__ - Step 8154: {'lr': 0.0004978706713915684, 'samples': 1565568, 'steps': 8153, 'loss/train': 1.7779927253723145}} 11/06/2021 22:23:16 - INFO - __main__ - Step 8161: {'lr': 0.0004978658306558234, 'samples': 1566912, 'steps': 8160, 'loss/train': 1.9604636430740356}} 11/06/2021 22:23:19 - INFO - __main__ - Step 8166: {'lr': 0.0004978623696368924, 'samples': 1567872, 'steps': 8165, 'loss/train': 1.8691977262496948}} 11/06/2021 22:23:19 - INFO - __main__ - Step 8166: {'lr': 0.0004978623696368924, 'samples': 1567872, 'steps': 8165, 'loss/train': 1.8691977262496948}} 11/06/2021 22:23:23 - INFO - __main__ - Step 8174: {'lr': 0.0004978568261991051, 'samples': 1569408, 'steps': 8173, 'loss/train': 1.6820038557052612}} 11/06/2021 22:23:24 - INFO - __main__ - Step 8178: {'lr': 0.0004978540517998704, 'samples': 1570176, 'steps': 8177, 'loss/train': 1.511248230934143}}} 11/06/2021 22:23:26 - INFO - __main__ - Step 8182: {'lr': 0.0004978512756137684, 'samples': 1570944, 'steps': 8181, 'loss/train': 2.5265679359436035}} 11/06/2021 22:23:29 - INFO - __main__ - Step 8187: {'lr': 0.000497847802868389, 'samples': 1571904, 'steps': 8186, 'loss/train': 1.9279881715774536}}} 11/06/2021 22:23:29 - INFO - __main__ - Step 8187: {'lr': 0.000497847802868389, 'samples': 1571904, 'steps': 8186, 'loss/train': 1.9279881715774536}}} 11/06/2021 22:23:33 - INFO - __main__ - Step 8195: {'lr': 0.0004978422406686257, 'samples': 1573440, 'steps': 8194, 'loss/train': 1.9039665460586548}} 11/06/2021 22:23:35 - INFO - __main__ - Step 8199: {'lr': 0.0004978394568885608, 'samples': 1574208, 'steps': 8198, 'loss/train': 1.7852051258087158}} 11/06/2021 22:23:37 - INFO - __main__ - Step 8203: {'lr': 0.0004978366713217336, 'samples': 1574976, 'steps': 8202, 'loss/train': 1.418635368347168}}} 11/06/2021 22:23:39 - INFO - __main__ - Step 8208: {'lr': 0.000497833186850596, 'samples': 1575936, 'steps': 8207, 'loss/train': 1.9825032949447632}}} 11/06/2021 22:23:41 - INFO - __main__ - Step 8212: {'lr': 0.0004978303972636275, 'samples': 1576704, 'steps': 8211, 'loss/train': 1.6517690420150757}} 11/06/2021 22:23:43 - INFO - __main__ - Step 8216: {'lr': 0.000497827605889962, 'samples': 1577472, 'steps': 8215, 'loss/train': 1.8105554580688477}}} 11/06/2021 22:23:45 - INFO - __main__ - Step 8220: {'lr': 0.0004978248127296198, 'samples': 1578240, 'steps': 8219, 'loss/train': 2.2303905487060547}} 11/06/2021 22:23:47 - INFO - __main__ - Step 8224: {'lr': 0.0004978220177826212, 'samples': 1579008, 'steps': 8223, 'loss/train': 1.977313756942749}}} 11/06/2021 22:23:49 - INFO - __main__ - Step 8229: {'lr': 0.0004978185215864177, 'samples': 1579968, 'steps': 8228, 'loss/train': 1.8177608251571655}} 11/06/2021 22:23:51 - INFO - __main__ - Step 8233: {'lr': 0.0004978157226195153, 'samples': 1580736, 'steps': 8232, 'loss/train': 1.4781603813171387}} 11/06/2021 22:23:53 - INFO - __main__ - Step 8237: {'lr': 0.000497812921866022, 'samples': 1581504, 'steps': 8236, 'loss/train': 2.0580477714538574}}} 11/06/2021 22:23:55 - INFO - __main__ - Step 8241: {'lr': 0.0004978101193259578, 'samples': 1582272, 'steps': 8240, 'loss/train': 0.3795441687107086}} 11/06/2021 22:23:57 - INFO - __main__ - Step 8245: {'lr': 0.000497807314999343, 'samples': 1583040, 'steps': 8244, 'loss/train': 1.9589273929595947}}} 11/06/2021 22:23:59 - INFO - __main__ - Step 8250: {'lr': 0.0004978038070787683, 'samples': 1584000, 'steps': 8249, 'loss/train': 2.009343147277832}}} 11/06/2021 22:24:01 - INFO - __main__ - Step 8254: {'lr': 0.0004978009987324884, 'samples': 1584768, 'steps': 8253, 'loss/train': 1.9142787456512451}} 11/06/2021 22:24:03 - INFO - __main__ - Step 8258: {'lr': 0.0004977981885997235, 'samples': 1585536, 'steps': 8257, 'loss/train': 1.3908742666244507}} 11/06/2021 22:24:05 - INFO - __main__ - Step 8262: {'lr': 0.0004977953766804941, 'samples': 1586304, 'steps': 8261, 'loss/train': 1.7750216722488403}} 11/06/2021 22:24:07 - INFO - __main__ - Step 8266: {'lr': 0.0004977925629748203, 'samples': 1587072, 'steps': 8265, 'loss/train': 1.8348907232284546}} 11/06/2021 22:24:09 - INFO - __main__ - Step 8271: {'lr': 0.0004977890433305716, 'samples': 1588032, 'steps': 8270, 'loss/train': 2.1743407249450684}} 11/06/2021 22:24:12 - INFO - __main__ - Step 8275: {'lr': 0.0004977862256054721, 'samples': 1588800, 'steps': 8274, 'loss/train': 1.8264741897583008}} 11/06/2021 22:24:14 - INFO - __main__ - Step 8279: {'lr': 0.0004977834060939943, 'samples': 1589568, 'steps': 8278, 'loss/train': 1.6243547201156616}} 11/06/2021 22:24:15 - INFO - __main__ - Step 8283: {'lr': 0.0004977805847961584, 'samples': 1590336, 'steps': 8282, 'loss/train': 1.9494577646255493}} 11/06/2021 22:24:17 - INFO - __main__ - Step 8287: {'lr': 0.0004977777617119847, 'samples': 1591104, 'steps': 8286, 'loss/train': 1.8675819635391235}} 11/06/2021 22:24:20 - INFO - __main__ - Step 8292: {'lr': 0.0004977742303447613, 'samples': 1592064, 'steps': 8291, 'loss/train': 2.4035115242004395}} 11/06/2021 22:24:22 - INFO - __main__ - Step 8296: {'lr': 0.0004977714032414021, 'samples': 1592832, 'steps': 8295, 'loss/train': 1.709094762802124}}} 11/06/2021 22:24:22 - INFO - __main__ - Step 8296: {'lr': 0.0004977714032414021, 'samples': 1592832, 'steps': 8295, 'loss/train': 1.709094762802124}}} 11/06/2021 22:24:25 - INFO - __main__ - Step 8303: {'lr': 0.0004977664515123201, 'samples': 1594176, 'steps': 8302, 'loss/train': 1.8622348308563232}} 11/06/2021 22:24:27 - INFO - __main__ - Step 8307: {'lr': 0.0004977636194967634, 'samples': 1594944, 'steps': 8306, 'loss/train': 2.114375591278076}}} 11/06/2021 22:24:29 - INFO - __main__ - Step 8312: {'lr': 0.0004977600769654545, 'samples': 1595904, 'steps': 8311, 'loss/train': 1.5743494033813477}} 11/06/2021 22:24:31 - INFO - __main__ - Step 8316: {'lr': 0.0004977572409309418, 'samples': 1596672, 'steps': 8315, 'loss/train': 1.7749756574630737}} 11/06/2021 22:24:34 - INFO - __main__ - Step 8320: {'lr': 0.0004977544031102597, 'samples': 1597440, 'steps': 8319, 'loss/train': 1.7613978385925293}} 11/06/2021 22:24:35 - INFO - __main__ - Step 8324: {'lr': 0.0004977515635034285, 'samples': 1598208, 'steps': 8323, 'loss/train': 1.7190172672271729}} 11/06/2021 22:24:37 - INFO - __main__ - Step 8328: {'lr': 0.000497748722110469, 'samples': 1598976, 'steps': 8327, 'loss/train': 3.0373826026916504}}} 11/06/2021 22:24:40 - INFO - __main__ - Step 8333: {'lr': 0.0004977451678575575, 'samples': 1599936, 'steps': 8332, 'loss/train': 1.207032561302185}}} 11/06/2021 22:24:40 - INFO - __main__ - Step 8333: {'lr': 0.0004977451678575575, 'samples': 1599936, 'steps': 8332, 'loss/train': 1.207032561302185}}} 11/06/2021 22:24:43 - INFO - __main__ - Step 8340: {'lr': 0.0004977401872150241, 'samples': 1601280, 'steps': 8339, 'loss/train': 1.857647180557251}}} 11/06/2021 22:24:45 - INFO - __main__ - Step 8344: {'lr': 0.0004977373386777554, 'samples': 1602048, 'steps': 8343, 'loss/train': 2.0518319606781006}} 11/06/2021 22:24:47 - INFO - __main__ - Step 8349: {'lr': 0.0004977337754945731, 'samples': 1603008, 'steps': 8348, 'loss/train': 1.9418188333511353}} 11/06/2021 22:24:47 - INFO - __main__ - Step 8349: {'lr': 0.0004977337754945731, 'samples': 1603008, 'steps': 8348, 'loss/train': 1.9418188333511353}} 11/06/2021 22:24:52 - INFO - __main__ - Step 8357: {'lr': 0.0004977280685969971, 'samples': 1604544, 'steps': 8356, 'loss/train': 2.2838523387908936}} 11/06/2021 22:24:54 - INFO - __main__ - Step 8361: {'lr': 0.0004977252124692601, 'samples': 1605312, 'steps': 8360, 'loss/train': 1.8619705438613892}} 11/06/2021 22:24:55 - INFO - __main__ - Step 8365: {'lr': 0.0004977223545555847, 'samples': 1606080, 'steps': 8364, 'loss/train': 1.9046440124511719}} 11/06/2021 22:24:57 - INFO - __main__ - Step 8369: {'lr': 0.0004977194948559913, 'samples': 1606848, 'steps': 8368, 'loss/train': 1.6611480712890625}} 11/06/2021 22:24:59 - INFO - __main__ - Step 8373: {'lr': 0.0004977166333705005, 'samples': 1607616, 'steps': 8372, 'loss/train': 1.0230144262313843}} 11/06/2021 22:25:01 - INFO - __main__ - Step 8377: {'lr': 0.0004977137700991332, 'samples': 1608384, 'steps': 8376, 'loss/train': 1.4530218839645386}} 11/06/2021 22:25:03 - INFO - __main__ - Step 8381: {'lr': 0.0004977109050419097, 'samples': 1609152, 'steps': 8380, 'loss/train': 2.402639150619507}}} 11/06/2021 22:25:05 - INFO - __main__ - Step 8385: {'lr': 0.000497708038198851, 'samples': 1609920, 'steps': 8384, 'loss/train': 1.6063017845153809}}} 11/06/2021 22:25:07 - INFO - __main__ - Step 8390: {'lr': 0.000497704452133728, 'samples': 1610880, 'steps': 8389, 'loss/train': 2.0732336044311523}}} 11/06/2021 22:25:07 - INFO - __main__ - Step 8390: {'lr': 0.000497704452133728, 'samples': 1610880, 'steps': 8389, 'loss/train': 2.0732336044311523}}} 11/06/2021 22:25:11 - INFO - __main__ - Step 8398: {'lr': 0.0004976987086257342, 'samples': 1612416, 'steps': 8397, 'loss/train': 1.0962588787078857}} 11/06/2021 22:25:13 - INFO - __main__ - Step 8402: {'lr': 0.0004976958341931057, 'samples': 1613184, 'steps': 8401, 'loss/train': 1.0811164379119873}} 11/06/2021 22:25:15 - INFO - __main__ - Step 8406: {'lr': 0.0004976929579747505, 'samples': 1613952, 'steps': 8405, 'loss/train': 1.7756294012069702}} 11/06/2021 22:25:17 - INFO - __main__ - Step 8411: {'lr': 0.00049768936019066, 'samples': 1614912, 'steps': 8410, 'loss/train': 1.824702262878418}02}} 11/06/2021 22:25:17 - INFO - __main__ - Step 8411: {'lr': 0.00049768936019066, 'samples': 1614912, 'steps': 8410, 'loss/train': 1.824702262878418}02}} 11/06/2021 22:25:21 - INFO - __main__ - Step 8419: {'lr': 0.0004976835979326718, 'samples': 1616448, 'steps': 8418, 'loss/train': 1.5903434753417969}} 11/06/2021 22:25:21 - INFO - __main__ - Step 8419: {'lr': 0.0004976835979326718, 'samples': 1616448, 'steps': 8418, 'loss/train': 1.5903434753417969}} 11/06/2021 22:25:25 - INFO - __main__ - Step 8426: {'lr': 0.0004976785500978, 'samples': 1617792, 'steps': 8425, 'loss/train': 1.5716438293457031}69}} 11/06/2021 22:25:27 - INFO - __main__ - Step 8431: {'lr': 0.0004976749411534525, 'samples': 1618752, 'steps': 8430, 'loss/train': 1.566307783126831}}} 11/06/2021 22:25:29 - INFO - __main__ - Step 8435: {'lr': 0.0004976720519891994, 'samples': 1619520, 'steps': 8434, 'loss/train': 1.7268136739730835}} 11/06/2021 22:25:31 - INFO - __main__ - Step 8439: {'lr': 0.0004976691610393911, 'samples': 1620288, 'steps': 8438, 'loss/train': 2.1499216556549072}} 11/06/2021 22:25:33 - INFO - __main__ - Step 8443: {'lr': 0.0004976662683040484, 'samples': 1621056, 'steps': 8442, 'loss/train': 1.7835140228271484}} 11/06/2021 22:25:35 - INFO - __main__ - Step 8448: {'lr': 0.000497662649873994, 'samples': 1622016, 'steps': 8447, 'loss/train': 1.2889354228973389}}} 11/06/2021 22:25:37 - INFO - __main__ - Step 8452: {'lr': 0.000497659753121275, 'samples': 1622784, 'steps': 8451, 'loss/train': 2.0676944255828857}}} 11/06/2021 22:25:37 - INFO - __main__ - Step 8452: {'lr': 0.000497659753121275, 'samples': 1622784, 'steps': 8451, 'loss/train': 2.0676944255828857}}} 11/06/2021 22:25:41 - INFO - __main__ - Step 8459: {'lr': 0.0004976546795077503, 'samples': 1624128, 'steps': 8458, 'loss/train': 2.1664505004882812}} 11/06/2021 22:25:43 - INFO - __main__ - Step 8464: {'lr': 0.000497651052150402, 'samples': 1625088, 'steps': 8463, 'loss/train': 1.3184614181518555}}} 11/06/2021 22:25:45 - INFO - __main__ - Step 8468: {'lr': 0.0004976481482559421, 'samples': 1625856, 'steps': 8467, 'loss/train': 1.5242974758148193}} 11/06/2021 22:25:48 - INFO - __main__ - Step 8473: {'lr': 0.0004976445158771748, 'samples': 1626816, 'steps': 8472, 'loss/train': 1.7219116687774658}} 11/06/2021 22:25:50 - INFO - __main__ - Step 8477: {'lr': 0.0004976416079656328, 'samples': 1627584, 'steps': 8476, 'loss/train': 2.3073718547821045}} 11/06/2021 22:25:52 - INFO - __main__ - Step 8481: {'lr': 0.0004976386982687549, 'samples': 1628352, 'steps': 8480, 'loss/train': 1.8894435167312622}} 11/06/2021 22:25:53 - INFO - __main__ - Step 8485: {'lr': 0.0004976357867865621, 'samples': 1629120, 'steps': 8484, 'loss/train': 1.8762072324752808}} 11/06/2021 22:25:56 - INFO - __main__ - Step 8490: {'lr': 0.0004976321449232542, 'samples': 1630080, 'steps': 8489, 'loss/train': 1.4703295230865479}} 11/06/2021 22:25:58 - INFO - __main__ - Step 8494: {'lr': 0.0004976292294241798, 'samples': 1630848, 'steps': 8493, 'loss/train': 1.3954660892486572}} 11/06/2021 22:26:00 - INFO - __main__ - Step 8498: {'lr': 0.000497626312139859, 'samples': 1631616, 'steps': 8497, 'loss/train': 1.7845796346664429}}} 11/06/2021 22:26:02 - INFO - __main__ - Step 8502: {'lr': 0.0004976233930703126, 'samples': 1632384, 'steps': 8501, 'loss/train': 1.7581290006637573}} 11/06/2021 22:26:03 - INFO - __main__ - Step 8506: {'lr': 0.0004976204722155617, 'samples': 1633152, 'steps': 8505, 'loss/train': 1.7833354473114014}} 11/06/2021 22:26:06 - INFO - __main__ - Step 8510: {'lr': 0.0004976175495756274, 'samples': 1633920, 'steps': 8509, 'loss/train': 1.9994100332260132}} 11/06/2021 22:26:06 - INFO - __main__ - Step 8510: {'lr': 0.0004976175495756274, 'samples': 1633920, 'steps': 8509, 'loss/train': 1.9994100332260132}} 11/06/2021 22:26:10 - INFO - __main__ - Step 8518: {'lr': 0.0004976116989402929, 'samples': 1635456, 'steps': 8517, 'loss/train': 1.7013543844223022}} 11/06/2021 22:26:11 - INFO - __main__ - Step 8522: {'lr': 0.0004976087709449348, 'samples': 1636224, 'steps': 8521, 'loss/train': 1.6207133531570435}} 11/06/2021 22:26:13 - INFO - __main__ - Step 8526: {'lr': 0.0004976058411644777, 'samples': 1636992, 'steps': 8525, 'loss/train': 1.7340941429138184}} 11/06/2021 22:26:16 - INFO - __main__ - Step 8531: {'lr': 0.000497602176428643, 'samples': 1637952, 'steps': 8530, 'loss/train': 2.355168342590332}4}} 11/06/2021 22:26:18 - INFO - __main__ - Step 8536: {'lr': 0.0004975985089036652, 'samples': 1638912, 'steps': 8535, 'loss/train': 1.7440499067306519}} 11/06/2021 22:26:18 - INFO - __main__ - Step 8536: {'lr': 0.0004975985089036652, 'samples': 1638912, 'steps': 8535, 'loss/train': 1.7440499067306519}} 11/06/2021 22:26:22 - INFO - __main__ - Step 8543: {'lr': 0.0004975933696830147, 'samples': 1640256, 'steps': 8542, 'loss/train': 1.9213240146636963}} 11/06/2021 22:26:23 - INFO - __main__ - Step 8547: {'lr': 0.0004975904305311344, 'samples': 1641024, 'steps': 8546, 'loss/train': 2.413429021835327}}} 11/06/2021 22:26:26 - INFO - __main__ - Step 8551: {'lr': 0.0004975874895942872, 'samples': 1641792, 'steps': 8550, 'loss/train': 1.2816332578659058}} 11/06/2021 22:26:28 - INFO - __main__ - Step 8555: {'lr': 0.0004975845468724944, 'samples': 1642560, 'steps': 8554, 'loss/train': 1.236737847328186}}} 11/06/2021 22:26:30 - INFO - __main__ - Step 8559: {'lr': 0.000497581602365777, 'samples': 1643328, 'steps': 8558, 'loss/train': 2.347743511199951}}}} 11/06/2021 22:26:32 - INFO - __main__ - Step 8564: {'lr': 0.0004975779192223629, 'samples': 1644288, 'steps': 8563, 'loss/train': 1.7488797903060913}} 11/06/2021 22:26:34 - INFO - __main__ - Step 8568: {'lr': 0.0004975749706996433, 'samples': 1645056, 'steps': 8567, 'loss/train': 1.9086993932724}13}} 11/06/2021 22:26:36 - INFO - __main__ - Step 8572: {'lr': 0.0004975720203920683, 'samples': 1645824, 'steps': 8571, 'loss/train': 2.3485770225524902}} 11/06/2021 22:26:38 - INFO - __main__ - Step 8576: {'lr': 0.0004975690682996592, 'samples': 1646592, 'steps': 8575, 'loss/train': 1.9467846155166626}} 11/06/2021 22:26:40 - INFO - __main__ - Step 8580: {'lr': 0.0004975661144224374, 'samples': 1647360, 'steps': 8579, 'loss/train': 2.0112924575805664}} 11/06/2021 22:26:40 - INFO - __main__ - Step 8580: {'lr': 0.0004975661144224374, 'samples': 1647360, 'steps': 8579, 'loss/train': 2.0112924575805664}} 11/06/2021 22:26:44 - INFO - __main__ - Step 8587: {'lr': 0.0004975609408426572, 'samples': 1648704, 'steps': 8586, 'loss/train': 1.84238600730896}4}} 11/06/2021 22:26:46 - INFO - __main__ - Step 8591: {'lr': 0.0004975579820573099, 'samples': 1649472, 'steps': 8590, 'loss/train': 1.5335558652877808}} 11/06/2021 22:26:48 - INFO - __main__ - Step 8596: {'lr': 0.0004975542810658476, 'samples': 1650432, 'steps': 8595, 'loss/train': 1.850310206413269}}} 11/06/2021 22:26:48 - INFO - __main__ - Step 8596: {'lr': 0.0004975542810658476, 'samples': 1650432, 'steps': 8595, 'loss/train': 1.850310206413269}}} 11/06/2021 22:26:52 - INFO - __main__ - Step 8603: {'lr': 0.0004975490949929558, 'samples': 1651776, 'steps': 8602, 'loss/train': 1.782822608947754}}} 11/06/2021 22:26:54 - INFO - __main__ - Step 8607: {'lr': 0.000497546129068805, 'samples': 1652544, 'steps': 8606, 'loss/train': 1.4670497179031372}}} 11/06/2021 22:26:56 - INFO - __main__ - Step 8612: {'lr': 0.0004975424191539585, 'samples': 1653504, 'steps': 8611, 'loss/train': 1.8894764184951782}} 11/06/2021 22:26:58 - INFO - __main__ - Step 8616: {'lr': 0.0004975394492143808, 'samples': 1654272, 'steps': 8615, 'loss/train': 2.525263547897339}}} 11/06/2021 22:26:58 - INFO - __main__ - Step 8616: {'lr': 0.0004975394492143808, 'samples': 1654272, 'steps': 8615, 'loss/train': 2.525263547897339}}} 11/06/2021 22:27:02 - INFO - __main__ - Step 8623: {'lr': 0.000497534247525941, 'samples': 1655616, 'steps': 8622, 'loss/train': 1.932421326637268}}}} 11/06/2021 22:27:04 - INFO - __main__ - Step 8628: {'lr': 0.0004975305286881383, 'samples': 1656576, 'steps': 8627, 'loss/train': 1.7895426750183105}} 11/06/2021 22:27:06 - INFO - __main__ - Step 8632: {'lr': 0.0004975275516102922, 'samples': 1657344, 'steps': 8631, 'loss/train': 1.7859746217727661}} 11/06/2021 22:27:08 - INFO - __main__ - Step 8636: {'lr': 0.0004975245727479325, 'samples': 1658112, 'steps': 8635, 'loss/train': 1.8529094457626343}} 11/06/2021 22:27:10 - INFO - __main__ - Step 8640: {'lr': 0.0004975215921010808, 'samples': 1658880, 'steps': 8639, 'loss/train': 1.9541411399841309}} 11/06/2021 22:27:10 - INFO - __main__ - Step 8640: {'lr': 0.0004975215921010808, 'samples': 1658880, 'steps': 8639, 'loss/train': 1.9541411399841309}} 11/06/2021 22:27:14 - INFO - __main__ - Step 8647: {'lr': 0.000497516371675221, 'samples': 1660224, 'steps': 8646, 'loss/train': 2.2316641807556152}}} 11/06/2021 22:27:16 - INFO - __main__ - Step 8651: {'lr': 0.000497513386121127, 'samples': 1660992, 'steps': 8650, 'loss/train': 1.510509729385376}}}} 11/06/2021 22:27:18 - INFO - __main__ - Step 8655: {'lr': 0.0004975103987826217, 'samples': 1661760, 'steps': 8654, 'loss/train': 1.5138027667999268}} 11/06/2021 22:27:20 - INFO - __main__ - Step 8660: {'lr': 0.0004975066621001943, 'samples': 1662720, 'steps': 8659, 'loss/train': 1.1997767686843872}} 11/06/2021 22:27:20 - INFO - __main__ - Step 8660: {'lr': 0.0004975066621001943, 'samples': 1662720, 'steps': 8659, 'loss/train': 1.1997767686843872}} 11/06/2021 22:27:24 - INFO - __main__ - Step 8668: {'lr': 0.0004975006776091484, 'samples': 1664256, 'steps': 8667, 'loss/train': 1.9532525539398193}} 11/06/2021 22:27:26 - INFO - __main__ - Step 8672: {'lr': 0.000497497682687135, 'samples': 1665024, 'steps': 8671, 'loss/train': 1.5913190841674805}}} 11/06/2021 22:27:28 - INFO - __main__ - Step 8676: {'lr': 0.0004974946859808235, 'samples': 1665792, 'steps': 8675, 'loss/train': 1.5062406063079834}} 11/06/2021 22:27:30 - INFO - __main__ - Step 8681: {'lr': 0.0004974909375887976, 'samples': 1666752, 'steps': 8680, 'loss/train': 1.7356830835342407}} 11/06/2021 22:27:32 - INFO - __main__ - Step 8686: {'lr': 0.0004974871864088818, 'samples': 1667712, 'steps': 8685, 'loss/train': 1.7284247875213623}} 11/06/2021 22:27:32 - INFO - __main__ - Step 8686: {'lr': 0.0004974871864088818, 'samples': 1667712, 'steps': 8685, 'loss/train': 1.7284247875213623}} 11/06/2021 22:27:36 - INFO - __main__ - Step 8693: {'lr': 0.000497481930073425, 'samples': 1669056, 'steps': 8692, 'loss/train': 1.9291731119155884}}} 11/06/2021 22:27:38 - INFO - __main__ - Step 8697: {'lr': 0.0004974789239999027, 'samples': 1669824, 'steps': 8696, 'loss/train': 1.7184191942214966}} 11/06/2021 22:27:40 - INFO - __main__ - Step 8702: {'lr': 0.0004974751638990233, 'samples': 1670784, 'steps': 8701, 'loss/train': 1.9571999311447144}} 11/06/2021 22:27:43 - INFO - __main__ - Step 8706: {'lr': 0.0004974721538111649, 'samples': 1671552, 'steps': 8705, 'loss/train': 1.7644202709197998}} 11/06/2021 22:27:45 - INFO - __main__ - Step 8710: {'lr': 0.0004974691419391922, 'samples': 1672320, 'steps': 8709, 'loss/train': 1.8900412321090698}} 11/06/2021 22:27:46 - INFO - __main__ - Step 8714: {'lr': 0.0004974661282831272, 'samples': 1673088, 'steps': 8713, 'loss/train': 1.890992522239685}}} 11/06/2021 22:27:48 - INFO - __main__ - Step 8718: {'lr': 0.0004974631128429915, 'samples': 1673856, 'steps': 8717, 'loss/train': 2.640268564224243}}} 11/06/2021 22:27:48 - INFO - __main__ - Step 8718: {'lr': 0.0004974631128429915, 'samples': 1673856, 'steps': 8717, 'loss/train': 2.640268564224243}}} 11/06/2021 22:27:52 - INFO - __main__ - Step 8726: {'lr': 0.000497457076610595, 'samples': 1675392, 'steps': 8725, 'loss/train': 1.7877483367919922}}} 11/06/2021 22:27:52 - INFO - __main__ - Step 8726: {'lr': 0.000497457076610595, 'samples': 1675392, 'steps': 8725, 'loss/train': 1.7877483367919922}}} 11/06/2021 22:27:55 - INFO - __main__ - Step 8733: {'lr': 0.0004974517890534742, 'samples': 1676736, 'steps': 8732, 'loss/train': 2.029585361480713}}} 11/06/2021 22:27:58 - INFO - __main__ - Step 8738: {'lr': 0.0004974480088820139, 'samples': 1677696, 'steps': 8737, 'loss/train': 1.93135666847229}}}} 11/06/2021 22:27:58 - INFO - __main__ - Step 8738: {'lr': 0.0004974480088820139, 'samples': 1677696, 'steps': 8737, 'loss/train': 1.93135666847229}}}} 11/06/2021 22:28:01 - INFO - __main__ - Step 8745: {'lr': 0.0004974427119591361, 'samples': 1679040, 'steps': 8744, 'loss/train': 2.352595090866089}}} 11/06/2021 22:28:03 - INFO - __main__ - Step 8749: {'lr': 0.0004974396826931906, 'samples': 1679808, 'steps': 8748, 'loss/train': 2.0725631713867188}} 11/06/2021 22:28:06 - INFO - __main__ - Step 8754: {'lr': 0.00049743589360218, 'samples': 1680768, 'steps': 8753, 'loss/train': 2.1746933460235596}8}} 11/06/2021 22:28:08 - INFO - __main__ - Step 8759: {'lr': 0.0004974321017238994, 'samples': 1681728, 'steps': 8758, 'loss/train': 1.5569329261779785}} 11/06/2021 22:28:10 - INFO - __main__ - Step 8763: {'lr': 0.0004974290662144694, 'samples': 1682496, 'steps': 8762, 'loss/train': 1.675337791442871}}} 11/06/2021 22:28:10 - INFO - __main__ - Step 8763: {'lr': 0.0004974290662144694, 'samples': 1682496, 'steps': 8762, 'loss/train': 1.675337791442871}}} 11/06/2021 22:28:13 - INFO - __main__ - Step 8770: {'lr': 0.0004974237497807027, 'samples': 1683840, 'steps': 8769, 'loss/train': 0.3073934018611908}} 11/06/2021 22:28:16 - INFO - __main__ - Step 8775: {'lr': 0.0004974199489834457, 'samples': 1684800, 'steps': 8774, 'loss/train': 1.757877230644226}}} 11/06/2021 22:28:18 - INFO - __main__ - Step 8779: {'lr': 0.000497416906338933, 'samples': 1685568, 'steps': 8778, 'loss/train': 2.1888182163238525}}} 11/06/2021 22:28:20 - INFO - __main__ - Step 8784: {'lr': 0.0004974131005249444, 'samples': 1686528, 'steps': 8783, 'loss/train': 1.3217498064041138}} 11/06/2021 22:28:20 - INFO - __main__ - Step 8784: {'lr': 0.0004974131005249444, 'samples': 1686528, 'steps': 8783, 'loss/train': 1.3217498064041138}} 11/06/2021 22:28:24 - INFO - __main__ - Step 8791: {'lr': 0.0004974077677031879, 'samples': 1687872, 'steps': 8790, 'loss/train': 1.7749842405319214}} 11/06/2021 22:28:26 - INFO - __main__ - Step 8795: {'lr': 0.0004974047179239436, 'samples': 1688640, 'steps': 8794, 'loss/train': 1.9513581991195679}} 11/06/2021 22:28:28 - INFO - __main__ - Step 8800: {'lr': 0.000497400903191664, 'samples': 1689600, 'steps': 8799, 'loss/train': 1.5480501651763916}}} 11/06/2021 22:28:30 - INFO - __main__ - Step 8805: {'lr': 0.0004973970856725086, 'samples': 1690560, 'steps': 8804, 'loss/train': 1.862740159034729}}} 11/06/2021 22:28:30 - INFO - __main__ - Step 8805: {'lr': 0.0004973970856725086, 'samples': 1690560, 'steps': 8804, 'loss/train': 1.862740159034729}}} 11/06/2021 22:28:34 - INFO - __main__ - Step 8812: {'lr': 0.0004973917364638218, 'samples': 1691904, 'steps': 8811, 'loss/train': 1.7712737321853638}} 11/06/2021 22:28:36 - INFO - __main__ - Step 8816: {'lr': 0.0004973886773207763, 'samples': 1692672, 'steps': 8815, 'loss/train': 1.6713685989379883}} 11/06/2021 22:28:38 - INFO - __main__ - Step 8820: {'lr': 0.0004973856163942185, 'samples': 1693440, 'steps': 8819, 'loss/train': 1.8286590576171875}} 11/06/2021 22:28:41 - INFO - __main__ - Step 8826: {'lr': 0.0004973810216603443, 'samples': 1694592, 'steps': 8825, 'loss/train': 0.3064444959163666}} 11/06/2021 22:28:43 - INFO - __main__ - Step 8830: {'lr': 0.0004973779562751022, 'samples': 1695360, 'steps': 8829, 'loss/train': 1.7933107614517212}} 11/06/2021 22:28:43 - INFO - __main__ - Step 8830: {'lr': 0.0004973779562751022, 'samples': 1695360, 'steps': 8829, 'loss/train': 1.7933107614517212}} 11/06/2021 22:28:46 - INFO - __main__ - Step 8837: {'lr': 0.0004973725875595513, 'samples': 1696704, 'steps': 8836, 'loss/train': 2.336061716079712}}} 11/06/2021 22:28:48 - INFO - __main__ - Step 8841: {'lr': 0.000497369517269916, 'samples': 1697472, 'steps': 8840, 'loss/train': 2.1014244556427}12}}} 11/06/2021 22:28:50 - INFO - __main__ - Step 8846: {'lr': 0.0004973656769000046, 'samples': 1698432, 'steps': 8845, 'loss/train': 1.0025054216384888}} 11/06/2021 22:28:53 - INFO - __main__ - Step 8850: {'lr': 0.0004973626025978086, 'samples': 1699200, 'steps': 8849, 'loss/train': 1.551592469215393}}} 11/06/2021 22:28:55 - INFO - __main__ - Step 8854: {'lr': 0.0004973595265122883, 'samples': 1699968, 'steps': 8853, 'loss/train': 2.4081969261169434}} 11/06/2021 22:28:56 - INFO - __main__ - Step 8858: {'lr': 0.0004973564486434656, 'samples': 1700736, 'steps': 8857, 'loss/train': 1.6520261764526367}} 11/06/2021 22:28:58 - INFO - __main__ - Step 8862: {'lr': 0.0004973533689913631, 'samples': 1701504, 'steps': 8861, 'loss/train': 1.570049524307251}}} 11/06/2021 22:29:01 - INFO - __main__ - Step 8867: {'lr': 0.0004973495169185313, 'samples': 1702464, 'steps': 8866, 'loss/train': 1.2922062873840332}} 11/06/2021 22:29:03 - INFO - __main__ - Step 8871: {'lr': 0.00049734643325413, 'samples': 1703232, 'steps': 8870, 'loss/train': 1.7162529230117798}2}} 11/06/2021 22:29:05 - INFO - __main__ - Step 8875: {'lr': 0.0004973433478065209, 'samples': 1704000, 'steps': 8874, 'loss/train': 1.6638219356536865}} 11/06/2021 22:29:05 - INFO - __main__ - Step 8875: {'lr': 0.0004973433478065209, 'samples': 1704000, 'steps': 8874, 'loss/train': 1.6638219356536865}} 11/06/2021 22:29:08 - INFO - __main__ - Step 8882: {'lr': 0.0004973379439824283, 'samples': 1705344, 'steps': 8881, 'loss/train': 2.2020576000213623}} 11/06/2021 22:29:11 - INFO - __main__ - Step 8887: {'lr': 0.0004973340807646696, 'samples': 1706304, 'steps': 8886, 'loss/train': 2.3790831565856934}} 11/06/2021 22:29:13 - INFO - __main__ - Step 8891: {'lr': 0.000497330988184452, 'samples': 1707072, 'steps': 8890, 'loss/train': 1.9604440927505493}}} 11/06/2021 22:29:13 - INFO - __main__ - Step 8891: {'lr': 0.000497330988184452, 'samples': 1707072, 'steps': 8890, 'loss/train': 1.9604440927505493}}} 11/06/2021 22:29:16 - INFO - __main__ - Step 8898: {'lr': 0.0004973255718785088, 'samples': 1708416, 'steps': 8897, 'loss/train': 1.7817871570587158}} 11/06/2021 22:29:18 - INFO - __main__ - Step 8903: {'lr': 0.00049732169974531, 'samples': 1709376, 'steps': 8902, 'loss/train': 1.8295646905899048}8}} 11/06/2021 22:29:21 - INFO - __main__ - Step 8908: {'lr': 0.0004973178248261274, 'samples': 1710336, 'steps': 8907, 'loss/train': 2.1482808589935303}} 11/06/2021 22:29:23 - INFO - __main__ - Step 8912: {'lr': 0.0004973147228849027, 'samples': 1711104, 'steps': 8911, 'loss/train': 1.924820065498352}}} 11/06/2021 22:29:23 - INFO - __main__ - Step 8912: {'lr': 0.0004973147228849027, 'samples': 1711104, 'steps': 8911, 'loss/train': 1.924820065498352}}} 11/06/2021 22:29:26 - INFO - __main__ - Step 8919: {'lr': 0.000497309290197479, 'samples': 1712448, 'steps': 8918, 'loss/train': 1.6200121641159058}}} 11/06/2021 22:29:28 - INFO - __main__ - Step 8924: {'lr': 0.0004973054063634428, 'samples': 1713408, 'steps': 8923, 'loss/train': 1.9554498195648193}} 11/06/2021 22:29:31 - INFO - __main__ - Step 8929: {'lr': 0.0004973015197436063, 'samples': 1714368, 'steps': 8928, 'loss/train': 3.589097023010254}}} 11/06/2021 22:29:31 - INFO - __main__ - Step 8929: {'lr': 0.0004973015197436063, 'samples': 1714368, 'steps': 8928, 'loss/train': 3.589097023010254}}} 11/06/2021 22:29:34 - INFO - __main__ - Step 8936: {'lr': 0.0004972960737957749, 'samples': 1715712, 'steps': 8935, 'loss/train': 1.9268077611923218}} 11/06/2021 22:29:36 - INFO - __main__ - Step 8940: {'lr': 0.0004972929593741662, 'samples': 1716480, 'steps': 8939, 'loss/train': 1.9634605646133423}} 11/06/2021 22:29:36 - INFO - __main__ - Step 8940: {'lr': 0.0004972929593741662, 'samples': 1716480, 'steps': 8939, 'loss/train': 1.9634605646133423}} 11/06/2021 22:29:41 - INFO - __main__ - Step 8948: {'lr': 0.0004972867251825048, 'samples': 1718016, 'steps': 8947, 'loss/train': 2.001603841781616}}} 11/06/2021 22:29:42 - INFO - __main__ - Step 8952: {'lr': 0.0004972836054124968, 'samples': 1718784, 'steps': 8951, 'loss/train': 1.9462846517562866}} 11/06/2021 22:29:44 - INFO - __main__ - Step 8956: {'lr': 0.000497280483859734, 'samples': 1719552, 'steps': 8955, 'loss/train': 1.3424328565597534}}} 11/06/2021 22:29:47 - INFO - __main__ - Step 8961: {'lr': 0.0004972765794118158, 'samples': 1720512, 'steps': 8960, 'loss/train': 1.1508708000183105}} 11/06/2021 22:29:49 - INFO - __main__ - Step 8965: {'lr': 0.0004972734538479369, 'samples': 1721280, 'steps': 8964, 'loss/train': 2.106870174407959}}} 11/06/2021 22:29:51 - INFO - __main__ - Step 8969: {'lr': 0.0004972703265013764, 'samples': 1722048, 'steps': 8968, 'loss/train': 1.7631484270095825}} 11/06/2021 22:29:51 - INFO - __main__ - Step 8969: {'lr': 0.0004972703265013764, 'samples': 1722048, 'steps': 8968, 'loss/train': 1.7631484270095825}} 11/06/2021 22:29:54 - INFO - __main__ - Step 8976: {'lr': 0.0004972648493553856, 'samples': 1723392, 'steps': 8975, 'loss/train': 1.3855030536651611}} 11/06/2021 22:29:57 - INFO - __main__ - Step 8982: {'lr': 0.0004972601503136822, 'samples': 1724544, 'steps': 8981, 'loss/train': 1.6659023761749268}} 11/06/2021 22:29:57 - INFO - __main__ - Step 8982: {'lr': 0.0004972601503136822, 'samples': 1724544, 'steps': 8981, 'loss/train': 1.6659023761749268}} 11/06/2021 22:30:01 - INFO - __main__ - Step 8989: {'lr': 0.0004972546630291387, 'samples': 1725888, 'steps': 8988, 'loss/train': 1.9944013357162476}} 11/06/2021 22:30:02 - INFO - __main__ - Step 8993: {'lr': 0.0004972515249869622, 'samples': 1726656, 'steps': 8992, 'loss/train': 1.706042766571045}}} 11/06/2021 22:30:04 - INFO - __main__ - Step 8997: {'lr': 0.0004972483851622623, 'samples': 1727424, 'steps': 8996, 'loss/train': 2.627495527267456}}} 11/06/2021 22:30:07 - INFO - __main__ - Step 9002: {'lr': 0.000497244457874748, 'samples': 1728384, 'steps': 9001, 'loss/train': 1.736275315284729}}}} 11/06/2021 22:30:09 - INFO - __main__ - Step 9006: {'lr': 0.0004972413140394528, 'samples': 1729152, 'steps': 9005, 'loss/train': 2.118058204650879}}} 11/06/2021 22:30:11 - INFO - __main__ - Step 9010: {'lr': 0.0004972381684217077, 'samples': 1729920, 'steps': 9009, 'loss/train': 1.8316234350204468}} 11/06/2021 22:30:12 - INFO - __main__ - Step 9014: {'lr': 0.0004972350210215353, 'samples': 1730688, 'steps': 9013, 'loss/train': 1.3896691799163818}} 11/06/2021 22:30:14 - INFO - __main__ - Step 9018: {'lr': 0.0004972318718389583, 'samples': 1731456, 'steps': 9017, 'loss/train': 2.1753854751586914}} 11/06/2021 22:30:17 - INFO - __main__ - Step 9023: {'lr': 0.0004972279328542652, 'samples': 1732416, 'steps': 9022, 'loss/train': 2.2019128799438477}} 11/06/2021 22:30:19 - INFO - __main__ - Step 9027: {'lr': 0.0004972247796613611, 'samples': 1733184, 'steps': 9026, 'loss/train': 1.2223052978515625}} 11/06/2021 22:30:21 - INFO - __main__ - Step 9031: {'lr': 0.0004972216246861262, 'samples': 1733952, 'steps': 9030, 'loss/train': 2.1160690784454346}} 11/06/2021 22:30:22 - INFO - __main__ - Step 9035: {'lr': 0.0004972184679285833, 'samples': 1734720, 'steps': 9034, 'loss/train': 1.6166201829910278}} 11/06/2021 22:30:24 - INFO - __main__ - Step 9039: {'lr': 0.0004972153093887551, 'samples': 1735488, 'steps': 9038, 'loss/train': 1.671932578086853}}} 11/06/2021 22:30:27 - INFO - __main__ - Step 9044: {'lr': 0.000497211358707666, 'samples': 1736448, 'steps': 9043, 'loss/train': 1.862133502960205}}}} 11/06/2021 22:30:27 - INFO - __main__ - Step 9044: {'lr': 0.000497211358707666, 'samples': 1736448, 'steps': 9043, 'loss/train': 1.862133502960205}}}} 11/06/2021 22:30:30 - INFO - __main__ - Step 9050: {'lr': 0.0004972066142145055, 'samples': 1737600, 'steps': 9049, 'loss/train': 1.216286540031433}}} 11/06/2021 22:30:32 - INFO - __main__ - Step 9055: {'lr': 0.0004972026574070459, 'samples': 1738560, 'steps': 9054, 'loss/train': 2.233604907989502}}} 11/06/2021 22:30:35 - INFO - __main__ - Step 9060: {'lr': 0.0004971986978149437, 'samples': 1739520, 'steps': 9059, 'loss/train': 1.568691372871399}}} 11/06/2021 22:30:37 - INFO - __main__ - Step 9064: {'lr': 0.0004971955281363493, 'samples': 1740288, 'steps': 9063, 'loss/train': 1.9272085428237915}} 11/06/2021 22:30:37 - INFO - __main__ - Step 9064: {'lr': 0.0004971955281363493, 'samples': 1740288, 'steps': 9063, 'loss/train': 1.9272085428237915}} 11/06/2021 22:30:40 - INFO - __main__ - Step 9071: {'lr': 0.000497189976910597, 'samples': 1741632, 'steps': 9070, 'loss/train': 2.2380783557891846}}} 11/06/2021 22:30:42 - INFO - __main__ - Step 9075: {'lr': 0.000497186802331228, 'samples': 1742400, 'steps': 9074, 'loss/train': 2.291787624359131}}}} 11/06/2021 22:30:42 - INFO - __main__ - Step 9075: {'lr': 0.000497186802331228, 'samples': 1742400, 'steps': 9074, 'loss/train': 2.291787624359131}}}} 11/06/2021 22:30:47 - INFO - __main__ - Step 9084: {'lr': 0.0004971796530120371, 'samples': 1744128, 'steps': 9083, 'loss/train': 1.885785460472107}}} 11/06/2021 22:30:49 - INFO - __main__ - Step 9088: {'lr': 0.0004971764726410668, 'samples': 1744896, 'steps': 9087, 'loss/train': 1.8160526752471924}} 11/06/2021 22:30:50 - INFO - __main__ - Step 9092: {'lr': 0.000497173290488114, 'samples': 1745664, 'steps': 9091, 'loss/train': 1.6177334785461426}}} 11/06/2021 22:30:53 - INFO - __main__ - Step 9096: {'lr': 0.0004971701065532017, 'samples': 1746432, 'steps': 9095, 'loss/train': 1.4940237998962402}} 11/06/2021 22:30:53 - INFO - __main__ - Step 9096: {'lr': 0.0004971701065532017, 'samples': 1746432, 'steps': 9095, 'loss/train': 1.4940237998962402}} 11/06/2021 22:30:56 - INFO - __main__ - Step 9104: {'lr': 0.0004971637333375904, 'samples': 1747968, 'steps': 9103, 'loss/train': 1.4513171911239624}} 11/06/2021 22:30:58 - INFO - __main__ - Step 9108: {'lr': 0.0004971605440569374, 'samples': 1748736, 'steps': 9107, 'loss/train': 1.783698558807373}}} 11/06/2021 22:31:01 - INFO - __main__ - Step 9113: {'lr': 0.0004971565549503723, 'samples': 1749696, 'steps': 9112, 'loss/train': 2.295679807662964}}} 11/06/2021 22:31:03 - INFO - __main__ - Step 9117: {'lr': 0.0004971533616605495, 'samples': 1750464, 'steps': 9116, 'loss/train': 1.4062687158584595}} 11/06/2021 22:31:05 - INFO - __main__ - Step 9121: {'lr': 0.0004971501665889107, 'samples': 1751232, 'steps': 9120, 'loss/train': 1.5485329627990723}} 11/06/2021 22:31:06 - INFO - __main__ - Step 9125: {'lr': 0.0004971469697354792, 'samples': 1752000, 'steps': 9124, 'loss/train': 2.1170740127563477}} 11/06/2021 22:31:09 - INFO - __main__ - Step 9129: {'lr': 0.0004971437711002777, 'samples': 1752768, 'steps': 9128, 'loss/train': 1.880787968635559}}} 11/06/2021 22:31:11 - INFO - __main__ - Step 9134: {'lr': 0.0004971397703006974, 'samples': 1753728, 'steps': 9133, 'loss/train': 2.073517322540283}}} 11/06/2021 22:31:13 - INFO - __main__ - Step 9138: {'lr': 0.0004971365676565984, 'samples': 1754496, 'steps': 9137, 'loss/train': 1.9141130447387695}} 11/06/2021 22:31:15 - INFO - __main__ - Step 9142: {'lr': 0.0004971333632308047, 'samples': 1755264, 'steps': 9141, 'loss/train': 2.0353739261627197}} 11/06/2021 22:31:17 - INFO - __main__ - Step 9146: {'lr': 0.0004971301570233392, 'samples': 1756032, 'steps': 9145, 'loss/train': 2.155123472213745}}} 11/06/2021 22:31:19 - INFO - __main__ - Step 9150: {'lr': 0.0004971269490342252, 'samples': 1756800, 'steps': 9149, 'loss/train': 1.6433255672454834}} 11/06/2021 22:31:21 - INFO - __main__ - Step 9155: {'lr': 0.0004971229365424246, 'samples': 1757760, 'steps': 9154, 'loss/train': 1.973191738128662}}} 11/06/2021 22:31:23 - INFO - __main__ - Step 9159: {'lr': 0.0004971197245446859, 'samples': 1758528, 'steps': 9158, 'loss/train': 2.273866891860962}}} 11/06/2021 22:31:23 - INFO - __main__ - Step 9159: {'lr': 0.0004971197245446859, 'samples': 1758528, 'steps': 9158, 'loss/train': 2.273866891860962}}} 11/06/2021 22:31:26 - INFO - __main__ - Step 9166: {'lr': 0.0004971140992617462, 'samples': 1759872, 'steps': 9165, 'loss/train': 1.977513074874878}}} 11/06/2021 22:31:29 - INFO - __main__ - Step 9171: {'lr': 0.0004971100778621223, 'samples': 1760832, 'steps': 9170, 'loss/train': 1.477123498916626}}} 11/06/2021 22:31:31 - INFO - __main__ - Step 9176: {'lr': 0.0004971060536788988, 'samples': 1761792, 'steps': 9175, 'loss/train': 1.9381426572799683}} 11/06/2021 22:31:33 - INFO - __main__ - Step 9180: {'lr': 0.0004971028323281586, 'samples': 1762560, 'steps': 9179, 'loss/train': 2.6767475605010986}} 11/06/2021 22:31:35 - INFO - __main__ - Step 9184: {'lr': 0.0004970996091959668, 'samples': 1763328, 'steps': 9183, 'loss/train': 1.487623691558838}}} 11/06/2021 22:31:37 - INFO - __main__ - Step 9188: {'lr': 0.0004970963842823468, 'samples': 1764096, 'steps': 9187, 'loss/train': 1.1920486688613892}} 11/06/2021 22:31:39 - INFO - __main__ - Step 9192: {'lr': 0.0004970931575873215, 'samples': 1764864, 'steps': 9191, 'loss/train': 1.533847451210022}}} 11/06/2021 22:31:39 - INFO - __main__ - Step 9192: {'lr': 0.0004970931575873215, 'samples': 1764864, 'steps': 9191, 'loss/train': 1.533847451210022}}} 11/06/2021 22:31:43 - INFO - __main__ - Step 9199: {'lr': 0.0004970875065845914, 'samples': 1766208, 'steps': 9198, 'loss/train': 2.0182600021362305}} 11/06/2021 22:31:45 - INFO - __main__ - Step 9203: {'lr': 0.0004970842749908223, 'samples': 1766976, 'steps': 9202, 'loss/train': 1.710228443145752}}} 11/06/2021 22:31:47 - INFO - __main__ - Step 9208: {'lr': 0.0004970802329936355, 'samples': 1767936, 'steps': 9207, 'loss/train': 1.110896110534668}}} 11/06/2021 22:31:47 - INFO - __main__ - Step 9208: {'lr': 0.0004970802329936355, 'samples': 1767936, 'steps': 9207, 'loss/train': 1.110896110534668}}} 11/06/2021 22:31:51 - INFO - __main__ - Step 9216: {'lr': 0.0004970737600089673, 'samples': 1769472, 'steps': 9215, 'loss/train': 2.018321990966797}}} 11/06/2021 22:31:53 - INFO - __main__ - Step 9220: {'lr': 0.0004970705208447587, 'samples': 1770240, 'steps': 9219, 'loss/train': 1.9058424234390259}} 11/06/2021 22:31:55 - INFO - __main__ - Step 9225: {'lr': 0.0004970664693846618, 'samples': 1771200, 'steps': 9224, 'loss/train': 1.7904118299484253}} 11/06/2021 22:31:55 - INFO - __main__ - Step 9225: {'lr': 0.0004970664693846618, 'samples': 1771200, 'steps': 9224, 'loss/train': 1.7904118299484253}} 11/06/2021 22:31:59 - INFO - __main__ - Step 9233: {'lr': 0.0004970599812596603, 'samples': 1772736, 'steps': 9232, 'loss/train': 1.2998876571655273}} 11/06/2021 22:32:02 - INFO - __main__ - Step 9237: {'lr': 0.0004970567345254339, 'samples': 1773504, 'steps': 9236, 'loss/train': 1.544152021408081}}} 11/06/2021 22:32:03 - INFO - __main__ - Step 9241: {'lr': 0.0004970534860100883, 'samples': 1774272, 'steps': 9240, 'loss/train': 1.7890807390213013}} 11/06/2021 22:32:05 - INFO - __main__ - Step 9245: {'lr': 0.0004970502357136468, 'samples': 1775040, 'steps': 9244, 'loss/train': 1.9643011093139648}} 11/06/2021 22:32:07 - INFO - __main__ - Step 9250: {'lr': 0.0004970461703384647, 'samples': 1776000, 'steps': 9249, 'loss/train': 1.9446792602539062}} 11/06/2021 22:32:07 - INFO - __main__ - Step 9250: {'lr': 0.0004970461703384647, 'samples': 1776000, 'steps': 9249, 'loss/train': 1.9446792602539062}} 11/06/2021 22:32:10 - INFO - __main__ - Step 9256: {'lr': 0.0004970412882148488, 'samples': 1777152, 'steps': 9255, 'loss/train': 1.1907613277435303}} 11/06/2021 22:32:13 - INFO - __main__ - Step 9261: {'lr': 0.0004970372167173915, 'samples': 1778112, 'steps': 9260, 'loss/train': 2.010669708251953}}} 11/06/2021 22:32:15 - INFO - __main__ - Step 9266: {'lr': 0.0004970331424371555, 'samples': 1779072, 'steps': 9265, 'loss/train': 1.760794997215271}}} 11/06/2021 22:32:15 - INFO - __main__ - Step 9266: {'lr': 0.0004970331424371555, 'samples': 1779072, 'steps': 9265, 'loss/train': 1.760794997215271}}} 11/06/2021 22:32:19 - INFO - __main__ - Step 9273: {'lr': 0.0004970274337698436, 'samples': 1780416, 'steps': 9272, 'loss/train': 1.789774775505066}}} 11/06/2021 22:32:21 - INFO - __main__ - Step 9278: {'lr': 0.0004970233528111253, 'samples': 1781376, 'steps': 9277, 'loss/train': 1.9275052547454834}} 11/06/2021 22:32:23 - INFO - __main__ - Step 9282: {'lr': 0.0004970200860406601, 'samples': 1782144, 'steps': 9281, 'loss/train': 1.4385563135147095}} 11/06/2021 22:32:25 - INFO - __main__ - Step 9287: {'lr': 0.0004970160000732539, 'samples': 1783104, 'steps': 9286, 'loss/train': 1.7004362344741821}} 11/06/2021 22:32:25 - INFO - __main__ - Step 9287: {'lr': 0.0004970160000732539, 'samples': 1783104, 'steps': 9286, 'loss/train': 1.7004362344741821}} 11/06/2021 22:32:29 - INFO - __main__ - Step 9294: {'lr': 0.0004970102750442285, 'samples': 1784448, 'steps': 9293, 'loss/train': 2.1833393573760986}} 11/06/2021 22:32:31 - INFO - __main__ - Step 9298: {'lr': 0.0004970070011504846, 'samples': 1785216, 'steps': 9297, 'loss/train': 1.8061836957931519}} 11/06/2021 22:32:33 - INFO - __main__ - Step 9303: {'lr': 0.0004970029062791128, 'samples': 1786176, 'steps': 9302, 'loss/train': 1.933624505996704}}} 11/06/2021 22:32:35 - INFO - __main__ - Step 9307: {'lr': 0.0004969996283786905, 'samples': 1786944, 'steps': 9306, 'loss/train': 1.8101203441619873}} 11/06/2021 22:32:37 - INFO - __main__ - Step 9311: {'lr': 0.0004969963486975607, 'samples': 1787712, 'steps': 9310, 'loss/train': 1.2041494846343994}} 11/06/2021 22:32:39 - INFO - __main__ - Step 9315: {'lr': 0.0004969930672357471, 'samples': 1788480, 'steps': 9314, 'loss/train': 2.183551788330078}}} 11/06/2021 22:32:41 - INFO - __main__ - Step 9319: {'lr': 0.0004969897839932732, 'samples': 1789248, 'steps': 9318, 'loss/train': 1.988546371459961}}} 11/06/2021 22:32:43 - INFO - __main__ - Step 9324: {'lr': 0.0004969856774361634, 'samples': 1790208, 'steps': 9323, 'loss/train': 1.9644792079925537}} 11/06/2021 22:32:46 - INFO - __main__ - Step 9328: {'lr': 0.0004969823901872906, 'samples': 1790976, 'steps': 9327, 'loss/train': 2.1059629917144775}} 11/06/2021 22:32:48 - INFO - __main__ - Step 9332: {'lr': 0.0004969791011578344, 'samples': 1791744, 'steps': 9331, 'loss/train': 1.9717392921447754}} 11/06/2021 22:32:49 - INFO - __main__ - Step 9336: {'lr': 0.0004969758103478187, 'samples': 1792512, 'steps': 9335, 'loss/train': 1.6776036024093628}} 11/06/2021 22:32:51 - INFO - __main__ - Step 9340: {'lr': 0.0004969725177572672, 'samples': 1793280, 'steps': 9339, 'loss/train': 1.5066814422607422}} 11/06/2021 22:32:53 - INFO - __main__ - Step 9345: {'lr': 0.0004969683995152355, 'samples': 1794240, 'steps': 9344, 'loss/train': 3.186241626739502}}} 11/06/2021 22:32:53 - INFO - __main__ - Step 9345: {'lr': 0.0004969683995152355, 'samples': 1794240, 'steps': 9344, 'loss/train': 3.186241626739502}}} 11/06/2021 22:32:57 - INFO - __main__ - Step 9352: {'lr': 0.0004969626293026353, 'samples': 1795584, 'steps': 9351, 'loss/train': 1.7228342294692993}} 11/06/2021 22:32:59 - INFO - __main__ - Step 9356: {'lr': 0.0004969593295901779, 'samples': 1796352, 'steps': 9355, 'loss/train': 1.998396873474121}}} 11/06/2021 22:33:01 - INFO - __main__ - Step 9361: {'lr': 0.0004969552024458976, 'samples': 1797312, 'steps': 9360, 'loss/train': 1.6986967325210571}} 11/06/2021 22:33:01 - INFO - __main__ - Step 9361: {'lr': 0.0004969552024458976, 'samples': 1797312, 'steps': 9360, 'loss/train': 1.6986967325210571}} 11/06/2021 22:33:05 - INFO - __main__ - Step 9369: {'lr': 0.00049694859322881, 'samples': 1798848, 'steps': 9368, 'loss/train': 1.7562119960784912}1}} 11/06/2021 22:33:08 - INFO - __main__ - Step 9373: {'lr': 0.0004969452859497449, 'samples': 1799616, 'steps': 9372, 'loss/train': 1.927634596824646}}} 11/06/2021 22:33:09 - INFO - __main__ - Step 9377: {'lr': 0.000496941976890364, 'samples': 1800384, 'steps': 9376, 'loss/train': 1.6627328395843506}}} 11/06/2021 22:33:09 - INFO - __main__ - Step 9377: {'lr': 0.000496941976890364, 'samples': 1800384, 'steps': 9376, 'loss/train': 1.6627328395843506}}} 11/06/2021 22:33:09 - INFO - __main__ - Step 9377: {'lr': 0.000496941976890364, 'samples': 1800384, 'steps': 9376, 'loss/train': 1.6627328395843506}}} 11/06/2021 22:33:15 - INFO - __main__ - Step 9388: {'lr': 0.0004969328677975083, 'samples': 1802496, 'steps': 9387, 'loss/train': 1.6319315433502197}} 11/06/2021 22:33:17 - INFO - __main__ - Step 9393: {'lr': 0.0004969287228501602, 'samples': 1803456, 'steps': 9392, 'loss/train': 1.8238893747329712}} 11/06/2021 22:33:19 - INFO - __main__ - Step 9397: {'lr': 0.0004969254048895585, 'samples': 1804224, 'steps': 9396, 'loss/train': 1.8003126382827759}} 11/06/2021 22:33:22 - INFO - __main__ - Step 9401: {'lr': 0.0004969220851487844, 'samples': 1804992, 'steps': 9400, 'loss/train': 0.40175554156303406} 11/06/2021 22:33:23 - INFO - __main__ - Step 9405: {'lr': 0.000496918763627862, 'samples': 1805760, 'steps': 9404, 'loss/train': 1.7720991373062134}6} 11/06/2021 22:33:25 - INFO - __main__ - Step 9409: {'lr': 0.0004969154403268148, 'samples': 1806528, 'steps': 9408, 'loss/train': 1.8082882165908813}} 11/06/2021 22:33:27 - INFO - __main__ - Step 9414: {'lr': 0.0004969112836972423, 'samples': 1807488, 'steps': 9413, 'loss/train': 1.678197979927063}}} 11/06/2021 22:33:27 - INFO - __main__ - Step 9414: {'lr': 0.0004969112836972423, 'samples': 1807488, 'steps': 9413, 'loss/train': 1.678197979927063}}} 11/06/2021 22:33:32 - INFO - __main__ - Step 9422: {'lr': 0.0004969046273047161, 'samples': 1809024, 'steps': 9421, 'loss/train': 1.61650812625885}}}} 11/06/2021 22:33:34 - INFO - __main__ - Step 9427: {'lr': 0.0004969004634437042, 'samples': 1809984, 'steps': 9426, 'loss/train': 1.5909390449523926}} 11/06/2021 22:33:35 - INFO - __main__ - Step 9431: {'lr': 0.0004968971303524007, 'samples': 1810752, 'steps': 9430, 'loss/train': 1.2849524021148682}} 11/06/2021 22:33:37 - INFO - __main__ - Step 9435: {'lr': 0.0004968937954811284, 'samples': 1811520, 'steps': 9434, 'loss/train': 1.701316237449646}}} 11/06/2021 22:33:40 - INFO - __main__ - Step 9440: {'lr': 0.0004968896243889941, 'samples': 1812480, 'steps': 9439, 'loss/train': 1.6848827600479126}} 11/06/2021 22:33:42 - INFO - __main__ - Step 9444: {'lr': 0.0004968862855128806, 'samples': 1813248, 'steps': 9443, 'loss/train': 1.394817590713501}}} 11/06/2021 22:33:44 - INFO - __main__ - Step 9448: {'lr': 0.0004968829448568766, 'samples': 1814016, 'steps': 9447, 'loss/train': 1.6947396993637085}} 11/06/2021 22:33:46 - INFO - __main__ - Step 9452: {'lr': 0.0004968796024210064, 'samples': 1814784, 'steps': 9451, 'loss/train': 1.934395670890808}}} 11/06/2021 22:33:47 - INFO - __main__ - Step 9456: {'lr': 0.0004968762582052938, 'samples': 1815552, 'steps': 9455, 'loss/train': 1.959632396697998}}} 11/06/2021 22:33:49 - INFO - __main__ - Step 9460: {'lr': 0.0004968729122097632, 'samples': 1816320, 'steps': 9459, 'loss/train': 1.9877064228057861}} 11/06/2021 22:33:52 - INFO - __main__ - Step 9465: {'lr': 0.0004968687272125174, 'samples': 1817280, 'steps': 9464, 'loss/train': 1.5592188835144043}} 11/06/2021 22:33:52 - INFO - __main__ - Step 9465: {'lr': 0.0004968687272125174, 'samples': 1817280, 'steps': 9464, 'loss/train': 1.5592188835144043}} 11/06/2021 22:33:55 - INFO - __main__ - Step 9472: {'lr': 0.000496862863544504, 'samples': 1818624, 'steps': 9471, 'loss/train': 1.4764689207077026}}} 11/06/2021 22:33:57 - INFO - __main__ - Step 9476: {'lr': 0.0004968595104299422, 'samples': 1819392, 'steps': 9475, 'loss/train': 2.2946574687957764}} 11/06/2021 22:33:59 - INFO - __main__ - Step 9481: {'lr': 0.0004968553165340435, 'samples': 1820352, 'steps': 9480, 'loss/train': 1.7705086469650269}} 11/06/2021 22:33:59 - INFO - __main__ - Step 9481: {'lr': 0.0004968553165340435, 'samples': 1820352, 'steps': 9480, 'loss/train': 1.7705086469650269}} 11/06/2021 22:34:04 - INFO - __main__ - Step 9489: {'lr': 0.0004968486005167069, 'samples': 1821888, 'steps': 9488, 'loss/train': 1.7480806112289429}} 11/06/2021 22:34:05 - INFO - __main__ - Step 9493: {'lr': 0.0004968452398385984, 'samples': 1822656, 'steps': 9492, 'loss/train': 1.734236240386963}}} 11/06/2021 22:34:07 - INFO - __main__ - Step 9497: {'lr': 0.0004968418773808954, 'samples': 1823424, 'steps': 9496, 'loss/train': 1.705300211906433}}} 11/06/2021 22:34:10 - INFO - __main__ - Step 9502: {'lr': 0.0004968376718062488, 'samples': 1824384, 'steps': 9501, 'loss/train': 1.9032224416732788}} 11/06/2021 22:34:12 - INFO - __main__ - Step 9506: {'lr': 0.0004968343053445469, 'samples': 1825152, 'steps': 9505, 'loss/train': 1.723289132118225}}} 11/06/2021 22:34:14 - INFO - __main__ - Step 9510: {'lr': 0.0004968309371033293, 'samples': 1825920, 'steps': 9509, 'loss/train': 1.9803547859191895}} 11/06/2021 22:34:15 - INFO - __main__ - Step 9514: {'lr': 0.0004968275670826204, 'samples': 1826688, 'steps': 9513, 'loss/train': 1.7052531242370605}} 11/06/2021 22:34:17 - INFO - __main__ - Step 9518: {'lr': 0.0004968241952824442, 'samples': 1827456, 'steps': 9517, 'loss/train': 1.6158838272094727}} 11/06/2021 22:34:20 - INFO - __main__ - Step 9523: {'lr': 0.0004968199780298855, 'samples': 1828416, 'steps': 9522, 'loss/train': 1.8859481811523438}} 11/06/2021 22:34:20 - INFO - __main__ - Step 9523: {'lr': 0.0004968199780298855, 'samples': 1828416, 'steps': 9522, 'loss/train': 1.8859481811523438}} 11/06/2021 22:34:24 - INFO - __main__ - Step 9531: {'lr': 0.0004968132246427212, 'samples': 1829952, 'steps': 9530, 'loss/train': 2.275637149810791}}} 11/06/2021 22:34:26 - INFO - __main__ - Step 9535: {'lr': 0.0004968098452800815, 'samples': 1830720, 'steps': 9534, 'loss/train': 1.9174106121063232}} 11/06/2021 22:34:26 - INFO - __main__ - Step 9535: {'lr': 0.0004968098452800815, 'samples': 1830720, 'steps': 9534, 'loss/train': 1.9174106121063232}} 11/06/2021 22:34:30 - INFO - __main__ - Step 9542: {'lr': 0.0004968039271139412, 'samples': 1832064, 'steps': 9541, 'loss/train': 2.009514331817627}}} 11/06/2021 22:34:31 - INFO - __main__ - Step 9546: {'lr': 0.0004968005428581767, 'samples': 1832832, 'steps': 9545, 'loss/train': 2.1240222454071045}} 11/06/2021 22:34:33 - INFO - __main__ - Step 9550: {'lr': 0.0004967971568231402, 'samples': 1833600, 'steps': 9549, 'loss/train': 1.5235668420791626}} 11/06/2021 22:34:36 - INFO - __main__ - Step 9555: {'lr': 0.0004967929217772801, 'samples': 1834560, 'steps': 9554, 'loss/train': 2.0895955562591553}} 11/06/2021 22:34:38 - INFO - __main__ - Step 9559: {'lr': 0.0004967895317389702, 'samples': 1835328, 'steps': 9558, 'loss/train': 1.6342477798461914}} 11/06/2021 22:34:40 - INFO - __main__ - Step 9563: {'lr': 0.0004967861399214674, 'samples': 1836096, 'steps': 9562, 'loss/train': 1.7751929759979248}} 11/06/2021 22:34:41 - INFO - __main__ - Step 9567: {'lr': 0.0004967827463247962, 'samples': 1836864, 'steps': 9566, 'loss/train': 1.7700427770614624}} 11/06/2021 22:34:43 - INFO - __main__ - Step 9571: {'lr': 0.0004967793509489811, 'samples': 1837632, 'steps': 9570, 'loss/train': 2.0516250133514404}} 11/06/2021 22:34:46 - INFO - __main__ - Step 9576: {'lr': 0.0004967751042273282, 'samples': 1838592, 'steps': 9575, 'loss/train': 2.606065273284912}}} 11/06/2021 22:34:48 - INFO - __main__ - Step 9580: {'lr': 0.0004967717048485287, 'samples': 1839360, 'steps': 9579, 'loss/train': 2.031545400619507}}} 11/06/2021 22:34:50 - INFO - __main__ - Step 9584: {'lr': 0.0004967683036906648, 'samples': 1840128, 'steps': 9583, 'loss/train': 2.204907178878784}}} 11/06/2021 22:34:51 - INFO - __main__ - Step 9588: {'lr': 0.0004967649007537611, 'samples': 1840896, 'steps': 9587, 'loss/train': 1.3074101209640503}} 11/06/2021 22:34:53 - INFO - __main__ - Step 9592: {'lr': 0.0004967614960378421, 'samples': 1841664, 'steps': 9591, 'loss/train': 2.0305135250091553}} 11/06/2021 22:34:56 - INFO - __main__ - Step 9597: {'lr': 0.0004967572376412405, 'samples': 1842624, 'steps': 9596, 'loss/train': 1.8701852560043335}} 11/06/2021 22:34:56 - INFO - __main__ - Step 9597: {'lr': 0.0004967572376412405, 'samples': 1842624, 'steps': 9596, 'loss/train': 1.8701852560043335}} 11/06/2021 22:34:59 - INFO - __main__ - Step 9604: {'lr': 0.0004967512712162387, 'samples': 1843968, 'steps': 9603, 'loss/train': 1.7263696193695068}} 11/06/2021 22:35:02 - INFO - __main__ - Step 9609: {'lr': 0.0004967470061486175, 'samples': 1844928, 'steps': 9608, 'loss/train': 1.621782660484314}}} 11/06/2021 22:35:04 - INFO - __main__ - Step 9613: {'lr': 0.0004967435920932711, 'samples': 1845696, 'steps': 9612, 'loss/train': 1.8796131610870361}} 11/06/2021 22:35:06 - INFO - __main__ - Step 9617: {'lr': 0.0004967401762590631, 'samples': 1846464, 'steps': 9616, 'loss/train': 1.674919605255127}}} 11/06/2021 22:35:06 - INFO - __main__ - Step 9617: {'lr': 0.0004967401762590631, 'samples': 1846464, 'steps': 9616, 'loss/train': 1.674919605255127}}} 11/06/2021 22:35:09 - INFO - __main__ - Step 9624: {'lr': 0.0004967341942688872, 'samples': 1847808, 'steps': 9623, 'loss/train': 1.75148344039917}}}} 11/06/2021 22:35:12 - INFO - __main__ - Step 9629: {'lr': 0.0004967299180835153, 'samples': 1848768, 'steps': 9628, 'loss/train': 1.4124494791030884}} 11/06/2021 22:35:14 - INFO - __main__ - Step 9634: {'lr': 0.0004967256391188258, 'samples': 1849728, 'steps': 9633, 'loss/train': 1.9075095653533936}} 11/06/2021 22:35:14 - INFO - __main__ - Step 9634: {'lr': 0.0004967256391188258, 'samples': 1849728, 'steps': 9633, 'loss/train': 1.9075095653533936}} 11/06/2021 22:35:18 - INFO - __main__ - Step 9641: {'lr': 0.0004967196438990995, 'samples': 1851072, 'steps': 9640, 'loss/train': 1.3965966701507568}} 11/06/2021 22:35:20 - INFO - __main__ - Step 9646: {'lr': 0.0004967153582642452, 'samples': 1852032, 'steps': 9645, 'loss/train': 5.934883117675781}}} 11/06/2021 22:35:22 - INFO - __main__ - Step 9650: {'lr': 0.0004967119277553692, 'samples': 1852800, 'steps': 9649, 'loss/train': 1.5039646625518799}} 11/06/2021 22:35:24 - INFO - __main__ - Step 9654: {'lr': 0.0004967084954678597, 'samples': 1853568, 'steps': 9653, 'loss/train': 2.3092923164367676}} 11/06/2021 22:35:26 - INFO - __main__ - Step 9658: {'lr': 0.0004967050614017415, 'samples': 1854336, 'steps': 9657, 'loss/train': 2.276421546936035}}} 11/06/2021 22:35:28 - INFO - __main__ - Step 9662: {'lr': 0.0004967016255570394, 'samples': 1855104, 'steps': 9661, 'loss/train': 1.8968331813812256}} 11/06/2021 22:35:30 - INFO - __main__ - Step 9667: {'lr': 0.0004966973282500661, 'samples': 1856064, 'steps': 9666, 'loss/train': 1.8805911540985107}} 11/06/2021 22:35:32 - INFO - __main__ - Step 9671: {'lr': 0.0004966938884036408, 'samples': 1856832, 'steps': 9670, 'loss/train': 1.8533520698547363}} 11/06/2021 22:35:35 - INFO - __main__ - Step 9675: {'lr': 0.0004966904467787123, 'samples': 1857600, 'steps': 9674, 'loss/train': 1.5391348600387573}} 11/06/2021 22:35:35 - INFO - __main__ - Step 9675: {'lr': 0.0004966904467787123, 'samples': 1857600, 'steps': 9674, 'loss/train': 1.5391348600387573}} 11/06/2021 22:35:38 - INFO - __main__ - Step 9682: {'lr': 0.0004966844196556382, 'samples': 1858944, 'steps': 9681, 'loss/train': 1.113672137260437}}} 11/06/2021 22:35:40 - INFO - __main__ - Step 9687: {'lr': 0.0004966801112331545, 'samples': 1859904, 'steps': 9686, 'loss/train': 1.8079742193222046}} 11/06/2021 22:35:42 - INFO - __main__ - Step 9691: {'lr': 0.0004966766624944607, 'samples': 1860672, 'steps': 9690, 'loss/train': 1.839411735534668}}} 11/06/2021 22:35:44 - INFO - __main__ - Step 9695: {'lr': 0.0004966732119773879, 'samples': 1861440, 'steps': 9694, 'loss/train': 1.997467279434204}}} 11/06/2021 22:35:46 - INFO - __main__ - Step 9699: {'lr': 0.0004966697596819607, 'samples': 1862208, 'steps': 9698, 'loss/train': 1.9859890937805176}} 11/06/2021 22:35:48 - INFO - __main__ - Step 9703: {'lr': 0.0004966663056082041, 'samples': 1862976, 'steps': 9702, 'loss/train': 2.0231058597564697}} 11/06/2021 22:35:50 - INFO - __main__ - Step 9708: {'lr': 0.0004966619855152706, 'samples': 1863936, 'steps': 9707, 'loss/train': 1.6858545541763306}} 11/06/2021 22:35:53 - INFO - __main__ - Step 9713: {'lr': 0.000496657662643785, 'samples': 1864896, 'steps': 9712, 'loss/train': 1.2711286544799805}}} 11/06/2021 22:35:53 - INFO - __main__ - Step 9713: {'lr': 0.000496657662643785, 'samples': 1864896, 'steps': 9712, 'loss/train': 1.2711286544799805}}} 11/06/2021 22:35:56 - INFO - __main__ - Step 9720: {'lr': 0.0004966516059558304, 'samples': 1866240, 'steps': 9719, 'loss/train': 1.9591999053955078}} 11/06/2021 22:35:58 - INFO - __main__ - Step 9724: {'lr': 0.0004966481425462533, 'samples': 1867008, 'steps': 9723, 'loss/train': 2.0228872299194336}} 11/06/2021 22:36:01 - INFO - __main__ - Step 9730: {'lr': 0.00049664294409782, 'samples': 1868160, 'steps': 9729, 'loss/train': 1.442859172821045}36}} 11/06/2021 22:36:01 - INFO - __main__ - Step 9730: {'lr': 0.00049664294409782, 'samples': 1868160, 'steps': 9729, 'loss/train': 1.442859172821045}36}} 11/06/2021 22:36:05 - INFO - __main__ - Step 9737: {'lr': 0.0004966368741847461, 'samples': 1869504, 'steps': 9736, 'loss/train': 1.5357730388641357}} 11/06/2021 22:36:06 - INFO - __main__ - Step 9741: {'lr': 0.000496633403218104, 'samples': 1870272, 'steps': 9740, 'loss/train': 1.855295181274414}7}} 11/06/2021 22:36:08 - INFO - __main__ - Step 9745: {'lr': 0.0004966299304733947, 'samples': 1871040, 'steps': 9744, 'loss/train': 1.8761732578277588}} 11/06/2021 22:36:10 - INFO - __main__ - Step 9750: {'lr': 0.000496625587042139, 'samples': 1872000, 'steps': 9749, 'loss/train': 1.5603324174880981}}} 11/06/2021 22:36:13 - INFO - __main__ - Step 9755: {'lr': 0.0004966212408327412, 'samples': 1872960, 'steps': 9754, 'loss/train': 1.7627856731414795}} 11/06/2021 22:36:13 - INFO - __main__ - Step 9755: {'lr': 0.0004966212408327412, 'samples': 1872960, 'steps': 9754, 'loss/train': 1.7627856731414795}} 11/06/2021 22:36:16 - INFO - __main__ - Step 9761: {'lr': 0.0004966160217143852, 'samples': 1874112, 'steps': 9760, 'loss/train': 1.4507635831832886}} 11/06/2021 22:36:18 - INFO - __main__ - Step 9766: {'lr': 0.0004966116693932472, 'samples': 1875072, 'steps': 9765, 'loss/train': 2.1143555641174316}} 11/06/2021 22:36:18 - INFO - __main__ - Step 9766: {'lr': 0.0004966116693932472, 'samples': 1875072, 'steps': 9765, 'loss/train': 2.1143555641174316}} 11/06/2021 22:36:22 - INFO - __main__ - Step 9774: {'lr': 0.0004966046999012373, 'samples': 1876608, 'steps': 9773, 'loss/train': 1.0785945653915405}} 11/06/2021 22:36:24 - INFO - __main__ - Step 9778: {'lr': 0.0004966012124884292, 'samples': 1877376, 'steps': 9777, 'loss/train': 1.7784291505813599}} 11/06/2021 22:36:26 - INFO - __main__ - Step 9782: {'lr': 0.0004965977232977861, 'samples': 1878144, 'steps': 9781, 'loss/train': 1.0429720878601074}} 11/06/2021 22:36:28 - INFO - __main__ - Step 9787: {'lr': 0.0004965933593094395, 'samples': 1879104, 'steps': 9786, 'loss/train': 1.8389208316802979}} 11/06/2021 22:36:30 - INFO - __main__ - Step 9792: {'lr': 0.000496588992543314, 'samples': 1880064, 'steps': 9791, 'loss/train': 1.8304022550582886}}} 11/06/2021 22:36:30 - INFO - __main__ - Step 9792: {'lr': 0.000496588992543314, 'samples': 1880064, 'steps': 9791, 'loss/train': 1.8304022550582886}}} 11/06/2021 22:36:35 - INFO - __main__ - Step 9799: {'lr': 0.000496582874404163, 'samples': 1881408, 'steps': 9798, 'loss/train': 0.2936389148235321}}} 11/06/2021 22:36:36 - INFO - __main__ - Step 9803: {'lr': 0.0004965793758802978, 'samples': 1882176, 'steps': 9802, 'loss/train': 1.732246994972229}}} 11/06/2021 22:36:38 - INFO - __main__ - Step 9807: {'lr': 0.000496575875578755, 'samples': 1882944, 'steps': 9806, 'loss/train': 2.212789535522461}}}} 11/06/2021 22:36:40 - INFO - __main__ - Step 9812: {'lr': 0.0004965714977020053, 'samples': 1883904, 'steps': 9811, 'loss/train': 1.6284079551696777}} 11/06/2021 22:36:42 - INFO - __main__ - Step 9816: {'lr': 0.0004965679934007797, 'samples': 1884672, 'steps': 9815, 'loss/train': 1.7675822973251343}} 11/06/2021 22:36:45 - INFO - __main__ - Step 9820: {'lr': 0.0004965644873219583, 'samples': 1885440, 'steps': 9819, 'loss/train': 1.5804340839385986}} 11/06/2021 22:36:46 - INFO - __main__ - Step 9824: {'lr': 0.0004965609794655664, 'samples': 1886208, 'steps': 9823, 'loss/train': 1.8378883600234985}} 11/06/2021 22:36:48 - INFO - __main__ - Step 9828: {'lr': 0.0004965574698316294, 'samples': 1886976, 'steps': 9827, 'loss/train': 1.9440776109695435}} 11/06/2021 22:36:50 - INFO - __main__ - Step 9833: {'lr': 0.0004965530802895738, 'samples': 1887936, 'steps': 9832, 'loss/train': 1.8684687614440918}} 11/06/2021 22:36:50 - INFO - __main__ - Step 9833: {'lr': 0.0004965530802895738, 'samples': 1887936, 'steps': 9832, 'loss/train': 1.8684687614440918}} 11/06/2021 22:36:54 - INFO - __main__ - Step 9840: {'lr': 0.0004965469302648005, 'samples': 1889280, 'steps': 9839, 'loss/train': 2.1997225284576416}} 11/06/2021 22:36:56 - INFO - __main__ - Step 9844: {'lr': 0.000496543413520936, 'samples': 1890048, 'steps': 9843, 'loss/train': 1.9805771112442017}}} 11/06/2021 22:36:59 - INFO - __main__ - Step 9849: {'lr': 0.0004965390150916136, 'samples': 1891008, 'steps': 9848, 'loss/train': 1.9253339767456055}} 11/06/2021 22:37:01 - INFO - __main__ - Step 9853: {'lr': 0.000496535494348593, 'samples': 1891776, 'steps': 9852, 'loss/train': 1.9677116870880127}}} 11/06/2021 22:37:01 - INFO - __main__ - Step 9853: {'lr': 0.000496535494348593, 'samples': 1891776, 'steps': 9852, 'loss/train': 1.9677116870880127}}} 11/06/2021 22:37:04 - INFO - __main__ - Step 9860: {'lr': 0.0004965293287715464, 'samples': 1893120, 'steps': 9859, 'loss/train': 1.8850369453430176}} 11/06/2021 22:37:04 - INFO - __main__ - Step 9860: {'lr': 0.0004965293287715464, 'samples': 1893120, 'steps': 9859, 'loss/train': 1.8850369453430176}} 11/06/2021 22:37:08 - INFO - __main__ - Step 9867: {'lr': 0.0004965231577514666, 'samples': 1894464, 'steps': 9866, 'loss/train': 2.018059253692627}}} 11/06/2021 22:37:11 - INFO - __main__ - Step 9873: {'lr': 0.0004965178639735772, 'samples': 1895616, 'steps': 9872, 'loss/train': 1.981247067451477}}} 11/06/2021 22:37:13 - INFO - __main__ - Step 9877: {'lr': 0.0004965143325667692, 'samples': 1896384, 'steps': 9876, 'loss/train': 2.1641268730163574}} 11/06/2021 22:37:15 - INFO - __main__ - Step 9881: {'lr': 0.0004965107993827524, 'samples': 1897152, 'steps': 9880, 'loss/train': 1.8044612407684326}} 11/06/2021 22:37:17 - INFO - __main__ - Step 9885: {'lr': 0.0004965072644215522, 'samples': 1897920, 'steps': 9884, 'loss/train': 1.9941020011901855}} 11/06/2021 22:37:18 - INFO - __main__ - Step 9889: {'lr': 0.0004965037276831942, 'samples': 1898688, 'steps': 9888, 'loss/train': 1.0114924907684326}} 11/06/2021 22:37:20 - INFO - __main__ - Step 9893: {'lr': 0.0004965001891677037, 'samples': 1899456, 'steps': 9892, 'loss/train': 1.4711591005325317}} 11/06/2021 22:37:20 - INFO - __main__ - Step 9893: {'lr': 0.0004965001891677037, 'samples': 1899456, 'steps': 9892, 'loss/train': 1.4711591005325317}} 11/06/2021 22:37:24 - INFO - __main__ - Step 9900: {'lr': 0.0004964939924894472, 'samples': 1900800, 'steps': 9899, 'loss/train': 1.5659464597702026}} 11/06/2021 22:37:26 - INFO - __main__ - Step 9905: {'lr': 0.0004964895629586928, 'samples': 1901760, 'steps': 9904, 'loss/train': 2.2174274921417236}} 11/06/2021 22:37:29 - INFO - __main__ - Step 9909: {'lr': 0.000496486017334928, 'samples': 1902528, 'steps': 9908, 'loss/train': 2.083883047103882}6}} 11/06/2021 22:37:31 - INFO - __main__ - Step 9913: {'lr': 0.0004964824699341582, 'samples': 1903296, 'steps': 9912, 'loss/train': 1.807940125465393}}} 11/06/2021 22:37:32 - INFO - __main__ - Step 9917: {'lr': 0.0004964789207564094, 'samples': 1904064, 'steps': 9916, 'loss/train': 1.8686342239379883}} 11/06/2021 22:37:34 - INFO - __main__ - Step 9921: {'lr': 0.0004964753698017071, 'samples': 1904832, 'steps': 9920, 'loss/train': 1.6933525800704956}} 11/06/2021 22:37:37 - INFO - __main__ - Step 9926: {'lr': 0.0004964709286095271, 'samples': 1905792, 'steps': 9925, 'loss/train': 1.6846635341644287}} 11/06/2021 22:37:38 - INFO - __main__ - Step 9930: {'lr': 0.0004964673736567728, 'samples': 1906560, 'steps': 9929, 'loss/train': 1.7367271184921265}} 11/06/2021 22:37:38 - INFO - __main__ - Step 9930: {'lr': 0.0004964673736567728, 'samples': 1906560, 'steps': 9929, 'loss/train': 1.7367271184921265}} 11/06/2021 22:37:42 - INFO - __main__ - Step 9937: {'lr': 0.000496461148213874, 'samples': 1907904, 'steps': 9936, 'loss/train': 2.6705877780914307}}} 11/06/2021 22:37:44 - INFO - __main__ - Step 9942: {'lr': 0.0004964566981373905, 'samples': 1908864, 'steps': 9941, 'loss/train': 1.5615383386611938}} 11/06/2021 22:37:47 - INFO - __main__ - Step 9947: {'lr': 0.0004964522452846675, 'samples': 1909824, 'steps': 9946, 'loss/train': 1.8149683475494385}} 11/06/2021 22:37:47 - INFO - __main__ - Step 9947: {'lr': 0.0004964522452846675, 'samples': 1909824, 'steps': 9946, 'loss/train': 1.8149683475494385}} 11/06/2021 22:37:50 - INFO - __main__ - Step 9954: {'lr': 0.0004964460066268681, 'samples': 1911168, 'steps': 9953, 'loss/train': 1.9778108596801758}} 11/06/2021 22:37:52 - INFO - __main__ - Step 9958: {'lr': 0.0004964424392365604, 'samples': 1911936, 'steps': 9957, 'loss/train': 2.3807883262634277}} 11/06/2021 22:37:55 - INFO - __main__ - Step 9963: {'lr': 0.0004964379775002078, 'samples': 1912896, 'steps': 9962, 'loss/train': 2.360861301422119}}} 11/06/2021 22:37:57 - INFO - __main__ - Step 9968: {'lr': 0.0004964335129878264, 'samples': 1913856, 'steps': 9967, 'loss/train': 1.1940803527832031}} 11/06/2021 22:37:57 - INFO - __main__ - Step 9968: {'lr': 0.0004964335129878264, 'samples': 1913856, 'steps': 9967, 'loss/train': 1.1940803527832031}} 11/06/2021 22:38:00 - INFO - __main__ - Step 9975: {'lr': 0.0004964272580068599, 'samples': 1915200, 'steps': 9974, 'loss/train': 1.6218897104263306}} 11/06/2021 22:38:02 - INFO - __main__ - Step 9979: {'lr': 0.000496423681289214, 'samples': 1915968, 'steps': 9978, 'loss/train': 1.7776697874069214}}} 11/06/2021 22:38:05 - INFO - __main__ - Step 9984: {'lr': 0.0004964192078938788, 'samples': 1916928, 'steps': 9983, 'loss/train': 2.1289093494415283}} 11/06/2021 22:38:05 - INFO - __main__ - Step 9984: {'lr': 0.0004964192078938788, 'samples': 1916928, 'steps': 9983, 'loss/train': 2.1289093494415283}} 11/06/2021 22:38:09 - INFO - __main__ - Step 9992: {'lr': 0.0004964120446876633, 'samples': 1918464, 'steps': 9991, 'loss/train': 1.8245000839233398}} 11/06/2021 22:38:11 - INFO - __main__ - Step 9996: {'lr': 0.0004964084604198354, 'samples': 1919232, 'steps': 9995, 'loss/train': 1.9137816429138184}} 11/06/2021 22:38:12 - INFO - __main__ - Step 10000: {'lr': 0.0004964048743755621, 'samples': 1920000, 'steps': 9999, 'loss/train': 1.2586203813552856} 11/06/2021 22:38:15 - INFO - __main__ - Step 10005: {'lr': 0.000496400389322133, 'samples': 1920960, 'steps': 10004, 'loss/train': 1.6377642154693604} 11/06/2021 22:38:17 - INFO - __main__ - Step 10009: {'lr': 0.0004963967992809516, 'samples': 1921728, 'steps': 10008, 'loss/train': 1.7431193590164185} 11/06/2021 22:38:19 - INFO - __main__ - Step 10013: {'lr': 0.0004963932074634087, 'samples': 1922496, 'steps': 10012, 'loss/train': 1.3404390811920166} 11/06/2021 22:38:21 - INFO - __main__ - Step 10017: {'lr': 0.00049638961386953, 'samples': 1923264, 'steps': 10016, 'loss/train': 1.4850966930389404}6} 11/06/2021 22:38:22 - INFO - __main__ - Step 10021: {'lr': 0.0004963860184993416, 'samples': 1924032, 'steps': 10020, 'loss/train': 1.717958688735962}} 11/06/2021 22:38:24 - INFO - __main__ - Step 10025: {'lr': 0.0004963824213528696, 'samples': 1924800, 'steps': 10024, 'loss/train': 1.9384859800338745} 11/06/2021 22:38:27 - INFO - __main__ - Step 10030: {'lr': 0.0004963779224219197, 'samples': 1925760, 'steps': 10029, 'loss/train': 1.7997504472732544} 11/06/2021 22:38:29 - INFO - __main__ - Step 10034: {'lr': 0.0004963743212789038, 'samples': 1926528, 'steps': 10033, 'loss/train': 2.0401864051818848} 11/06/2021 22:38:29 - INFO - __main__ - Step 10034: {'lr': 0.0004963743212789038, 'samples': 1926528, 'steps': 10033, 'loss/train': 2.0401864051818848} 11/06/2021 22:38:32 - INFO - __main__ - Step 10041: {'lr': 0.0004963680150046618, 'samples': 1927872, 'steps': 10040, 'loss/train': 1.8479679822921753} 11/06/2021 22:38:35 - INFO - __main__ - Step 10046: {'lr': 0.0004963635071927633, 'samples': 1928832, 'steps': 10045, 'loss/train': 1.261295199394226}} 11/06/2021 22:38:37 - INFO - __main__ - Step 10051: {'lr': 0.000496358996605675, 'samples': 1929792, 'steps': 10050, 'loss/train': 1.9643739461898804}} 11/06/2021 22:38:39 - INFO - __main__ - Step 10055: {'lr': 0.0004963553861379018, 'samples': 1930560, 'steps': 10054, 'loss/train': 2.023212194442749}} 11/06/2021 22:38:41 - INFO - __main__ - Step 10059: {'lr': 0.0004963517738940656, 'samples': 1931328, 'steps': 10058, 'loss/train': 1.7526215314865112} 11/06/2021 22:38:42 - INFO - __main__ - Step 10063: {'lr': 0.0004963481598741925, 'samples': 1932096, 'steps': 10062, 'loss/train': 1.8724141120910645} 11/06/2021 22:38:45 - INFO - __main__ - Step 10067: {'lr': 0.0004963445440783086, 'samples': 1932864, 'steps': 10066, 'loss/train': 1.8763664960861206} 11/06/2021 22:38:47 - INFO - __main__ - Step 10072: {'lr': 0.0004963400218359781, 'samples': 1933824, 'steps': 10071, 'loss/train': 1.9994356632232666} 11/06/2021 22:38:49 - INFO - __main__ - Step 10076: {'lr': 0.000496336402044165, 'samples': 1934592, 'steps': 10075, 'loss/train': 1.838132619857788}6} 11/06/2021 22:38:51 - INFO - __main__ - Step 10080: {'lr': 0.0004963327804764257, 'samples': 1935360, 'steps': 10079, 'loss/train': 1.9088850021362305} 11/06/2021 22:38:53 - INFO - __main__ - Step 10084: {'lr': 0.0004963291571327866, 'samples': 1936128, 'steps': 10083, 'loss/train': 1.8040343523025513} 11/06/2021 22:38:55 - INFO - __main__ - Step 10088: {'lr': 0.0004963255320132735, 'samples': 1936896, 'steps': 10087, 'loss/train': 2.1118416786193848} 11/06/2021 22:38:57 - INFO - __main__ - Step 10093: {'lr': 0.0004963209981165993, 'samples': 1937856, 'steps': 10092, 'loss/train': 2.122255802154541}} 11/06/2021 22:38:59 - INFO - __main__ - Step 10097: {'lr': 0.0004963173690014656, 'samples': 1938624, 'steps': 10096, 'loss/train': 2.1923022270202637} 11/06/2021 22:39:01 - INFO - __main__ - Step 10101: {'lr': 0.0004963137381105431, 'samples': 1939392, 'steps': 10100, 'loss/train': 2.0459229946136475} 11/06/2021 22:39:01 - INFO - __main__ - Step 10101: {'lr': 0.0004963137381105431, 'samples': 1939392, 'steps': 10100, 'loss/train': 2.0459229946136475} 11/06/2021 22:39:05 - INFO - __main__ - Step 10108: {'lr': 0.0004963073797785153, 'samples': 1940736, 'steps': 10107, 'loss/train': 1.9007415771484375} 11/06/2021 22:39:05 - INFO - __main__ - Step 10108: {'lr': 0.0004963073797785153, 'samples': 1940736, 'steps': 10107, 'loss/train': 1.9007415771484375} 11/06/2021 22:39:08 - INFO - __main__ - Step 10115: {'lr': 0.0004963010160083546, 'samples': 1942080, 'steps': 10114, 'loss/train': 1.9651312828063965} 11/06/2021 22:39:11 - INFO - __main__ - Step 10120: {'lr': 0.0004962964671288484, 'samples': 1943040, 'steps': 10119, 'loss/train': 1.8889999389648438} 11/06/2021 22:39:11 - INFO - __main__ - Step 10120: {'lr': 0.0004962964671288484, 'samples': 1943040, 'steps': 10119, 'loss/train': 1.8889999389648438} 11/06/2021 22:39:15 - INFO - __main__ - Step 10128: {'lr': 0.0004962891831508359, 'samples': 1944576, 'steps': 10127, 'loss/train': 1.920640230178833}} 11/06/2021 22:39:17 - INFO - __main__ - Step 10132: {'lr': 0.000496285538498438, 'samples': 1945344, 'steps': 10131, 'loss/train': 1.649821162223816}}} 11/06/2021 22:39:19 - INFO - __main__ - Step 10136: {'lr': 0.0004962818920704805, 'samples': 1946112, 'steps': 10135, 'loss/train': 2.1492648124694824} 11/06/2021 22:39:21 - INFO - __main__ - Step 10141: {'lr': 0.0004962773315386935, 'samples': 1947072, 'steps': 10140, 'loss/train': 2.360720157623291}} 11/06/2021 22:39:23 - INFO - __main__ - Step 10145: {'lr': 0.0004962736811158236, 'samples': 1947840, 'steps': 10144, 'loss/train': 1.8755285739898682} 11/06/2021 22:39:23 - INFO - __main__ - Step 10145: {'lr': 0.0004962736811158236, 'samples': 1947840, 'steps': 10144, 'loss/train': 1.8755285739898682} 11/06/2021 22:39:27 - INFO - __main__ - Step 10153: {'lr': 0.0004962663749436883, 'samples': 1949376, 'steps': 10152, 'loss/train': 1.250434160232544}} 11/06/2021 22:39:29 - INFO - __main__ - Step 10157: {'lr': 0.0004962627191944756, 'samples': 1950144, 'steps': 10156, 'loss/train': 1.7389650344848633} 11/06/2021 22:39:31 - INFO - __main__ - Step 10162: {'lr': 0.0004962581470113138, 'samples': 1951104, 'steps': 10161, 'loss/train': 1.4758445024490356} 11/06/2021 22:39:31 - INFO - __main__ - Step 10162: {'lr': 0.0004962581470113138, 'samples': 1951104, 'steps': 10161, 'loss/train': 1.4758445024490356} 11/06/2021 22:39:35 - INFO - __main__ - Step 10170: {'lr': 0.00049625082574835, 'samples': 1952640, 'steps': 10169, 'loss/train': 1.8901619911193848}6} 11/06/2021 22:39:35 - INFO - __main__ - Step 10170: {'lr': 0.00049625082574835, 'samples': 1952640, 'steps': 10169, 'loss/train': 1.8901619911193848}6} 11/06/2021 22:39:39 - INFO - __main__ - Step 10177: {'lr': 0.0004962444138180164, 'samples': 1953984, 'steps': 10176, 'loss/train': 1.5094280242919922} 11/06/2021 22:39:41 - INFO - __main__ - Step 10183: {'lr': 0.0004962389135505217, 'samples': 1955136, 'steps': 10182, 'loss/train': 1.9149515628814697} 11/06/2021 22:39:43 - INFO - __main__ - Step 10187: {'lr': 0.0004962352444864904, 'samples': 1955904, 'steps': 10186, 'loss/train': 1.8243874311447144} 11/06/2021 22:39:43 - INFO - __main__ - Step 10187: {'lr': 0.0004962352444864904, 'samples': 1955904, 'steps': 10186, 'loss/train': 1.8243874311447144} 11/06/2021 22:39:47 - INFO - __main__ - Step 10194: {'lr': 0.0004962288193528846, 'samples': 1957248, 'steps': 10193, 'loss/train': 1.8837649822235107} 11/06/2021 22:39:49 - INFO - __main__ - Step 10198: {'lr': 0.0004962251454071259, 'samples': 1958016, 'steps': 10197, 'loss/train': 1.4461283683776855} 11/06/2021 22:39:49 - INFO - __main__ - Step 10198: {'lr': 0.0004962251454071259, 'samples': 1958016, 'steps': 10197, 'loss/train': 1.4461283683776855} 11/06/2021 22:39:53 - INFO - __main__ - Step 10205: {'lr': 0.000496218711730672, 'samples': 1959360, 'steps': 10204, 'loss/train': 1.5240142345428467}} 11/06/2021 22:39:56 - INFO - __main__ - Step 10211: {'lr': 0.0004962131928240972, 'samples': 1960512, 'steps': 10210, 'loss/train': 2.1297640800476074} 11/06/2021 22:39:58 - INFO - __main__ - Step 10215: {'lr': 0.0004962095113342445, 'samples': 1961280, 'steps': 10214, 'loss/train': 1.9678434133529663} 11/06/2021 22:40:00 - INFO - __main__ - Step 10219: {'lr': 0.0004962058280693805, 'samples': 1962048, 'steps': 10218, 'loss/train': 1.1909321546554565} 11/06/2021 22:40:00 - INFO - __main__ - Step 10219: {'lr': 0.0004962058280693805, 'samples': 1962048, 'steps': 10218, 'loss/train': 1.1909321546554565} 11/06/2021 22:40:03 - INFO - __main__ - Step 10226: {'lr': 0.0004961993780848276, 'samples': 1963392, 'steps': 10225, 'loss/train': 1.372536301612854}} 11/06/2021 22:40:05 - INFO - __main__ - Step 10231: {'lr': 0.0004961947676249864, 'samples': 1964352, 'steps': 10230, 'loss/train': 1.6913650035858154} 11/06/2021 22:40:08 - INFO - __main__ - Step 10236: {'lr': 0.0004961901543918563, 'samples': 1965312, 'steps': 10235, 'loss/train': 1.1824220418930054} 11/06/2021 22:40:10 - INFO - __main__ - Step 10240: {'lr': 0.0004961864618086188, 'samples': 1966080, 'steps': 10239, 'loss/train': 1.8443377017974854} 11/06/2021 22:40:10 - INFO - __main__ - Step 10240: {'lr': 0.0004961864618086188, 'samples': 1966080, 'steps': 10239, 'loss/train': 1.8443377017974854} 11/06/2021 22:40:13 - INFO - __main__ - Step 10247: {'lr': 0.0004961799955172483, 'samples': 1967424, 'steps': 10246, 'loss/train': 1.7437840700149536} 11/06/2021 22:40:15 - INFO - __main__ - Step 10252: {'lr': 0.0004961753734099425, 'samples': 1968384, 'steps': 10251, 'loss/train': 1.3765827417373657} 11/06/2021 22:40:18 - INFO - __main__ - Step 10257: {'lr': 0.0004961707485295659, 'samples': 1969344, 'steps': 10256, 'loss/train': 1.6465765237808228} 11/06/2021 22:40:18 - INFO - __main__ - Step 10257: {'lr': 0.0004961707485295659, 'samples': 1969344, 'steps': 10256, 'loss/train': 1.6465765237808228} 11/06/2021 22:40:21 - INFO - __main__ - Step 10263: {'lr': 0.0004961651950127343, 'samples': 1970496, 'steps': 10262, 'loss/train': 1.9414300918579102} 11/06/2021 22:40:23 - INFO - __main__ - Step 10268: {'lr': 0.0004961605640317858, 'samples': 1971456, 'steps': 10267, 'loss/train': 1.6062275171279907} 11/06/2021 22:40:23 - INFO - __main__ - Step 10268: {'lr': 0.0004961605640317858, 'samples': 1971456, 'steps': 10267, 'loss/train': 1.6062275171279907} 11/06/2021 22:40:23 - INFO - __main__ - Step 10268: {'lr': 0.0004961605640317858, 'samples': 1971456, 'steps': 10267, 'loss/train': 1.6062275171279907} 11/06/2021 22:40:29 - INFO - __main__ - Step 10279: {'lr': 0.0004961503661131515, 'samples': 1973568, 'steps': 10278, 'loss/train': 1.548420786857605}} 11/06/2021 22:40:31 - INFO - __main__ - Step 10283: {'lr': 0.0004961466544517267, 'samples': 1974336, 'steps': 10282, 'loss/train': 2.109795570373535}} 11/06/2021 22:40:31 - INFO - __main__ - Step 10283: {'lr': 0.0004961466544517267, 'samples': 1974336, 'steps': 10282, 'loss/train': 2.109795570373535}} 11/06/2021 22:40:35 - INFO - __main__ - Step 10292: {'lr': 0.0004961382967253335, 'samples': 1976064, 'steps': 10291, 'loss/train': 1.815101981163025}} 11/06/2021 22:40:37 - INFO - __main__ - Step 10296: {'lr': 0.0004961345792966926, 'samples': 1976832, 'steps': 10295, 'loss/train': 2.0797855854034424} 11/06/2021 22:40:39 - INFO - __main__ - Step 10300: {'lr': 0.0004961308600935807, 'samples': 1977600, 'steps': 10299, 'loss/train': 1.8009984493255615} 11/06/2021 22:40:41 - INFO - __main__ - Step 10305: {'lr': 0.0004961262085943815, 'samples': 1978560, 'steps': 10304, 'loss/train': 1.3772211074829102} 11/06/2021 22:40:41 - INFO - __main__ - Step 10305: {'lr': 0.0004961262085943815, 'samples': 1978560, 'steps': 10304, 'loss/train': 1.3772211074829102} 11/06/2021 22:40:45 - INFO - __main__ - Step 10312: {'lr': 0.0004961196918376864, 'samples': 1979904, 'steps': 10311, 'loss/train': 1.361377239227295}} 11/06/2021 22:40:47 - INFO - __main__ - Step 10316: {'lr': 0.0004961159655369582, 'samples': 1980672, 'steps': 10315, 'loss/train': 1.5698388814926147} 11/06/2021 22:40:49 - INFO - __main__ - Step 10320: {'lr': 0.0004961122374618933, 'samples': 1981440, 'steps': 10319, 'loss/train': 1.6604183912277222} 11/06/2021 22:40:51 - INFO - __main__ - Step 10326: {'lr': 0.0004961066420224729, 'samples': 1982592, 'steps': 10325, 'loss/train': 1.309130072593689}} 11/06/2021 22:40:54 - INFO - __main__ - Step 10330: {'lr': 0.0004961029095116833, 'samples': 1983360, 'steps': 10329, 'loss/train': 2.0712430477142334} 11/06/2021 22:40:54 - INFO - __main__ - Step 10330: {'lr': 0.0004961029095116833, 'samples': 1983360, 'steps': 10329, 'loss/train': 2.0712430477142334} 11/06/2021 22:40:57 - INFO - __main__ - Step 10337: {'lr': 0.000496096373348546, 'samples': 1984704, 'steps': 10336, 'loss/train': 1.792807936668396}4} 11/06/2021 22:40:59 - INFO - __main__ - Step 10341: {'lr': 0.0004960926359586535, 'samples': 1985472, 'steps': 10340, 'loss/train': 1.554911732673645}} 11/06/2021 22:41:01 - INFO - __main__ - Step 10346: {'lr': 0.0004960879617263664, 'samples': 1986432, 'steps': 10345, 'loss/train': 1.2233792543411255} 11/06/2021 22:41:01 - INFO - __main__ - Step 10346: {'lr': 0.0004960879617263664, 'samples': 1986432, 'steps': 10345, 'loss/train': 1.2233792543411255} 11/06/2021 22:41:01 - INFO - __main__ - Step 10346: {'lr': 0.0004960879617263664, 'samples': 1986432, 'steps': 10345, 'loss/train': 1.2233792543411255} 11/06/2021 22:41:07 - INFO - __main__ - Step 10357: {'lr': 0.0004960776686576663, 'samples': 1988544, 'steps': 10356, 'loss/train': 1.7053444385528564} 11/06/2021 22:41:09 - INFO - __main__ - Step 10362: {'lr': 0.0004960729855548895, 'samples': 1989504, 'steps': 10361, 'loss/train': 1.650147795677185}} 11/06/2021 22:41:12 - INFO - __main__ - Step 10367: {'lr': 0.0004960682996801956, 'samples': 1990464, 'steps': 10366, 'loss/train': 1.6244982481002808} 11/06/2021 22:41:12 - INFO - __main__ - Step 10367: {'lr': 0.0004960682996801956, 'samples': 1990464, 'steps': 10366, 'loss/train': 1.6244982481002808} 11/06/2021 22:41:15 - INFO - __main__ - Step 10374: {'lr': 0.0004960617347989036, 'samples': 1991808, 'steps': 10373, 'loss/train': 1.9511737823486328} 11/06/2021 22:41:15 - INFO - __main__ - Step 10374: {'lr': 0.0004960617347989036, 'samples': 1991808, 'steps': 10373, 'loss/train': 1.9511737823486328} 11/06/2021 22:41:19 - INFO - __main__ - Step 10380: {'lr': 0.0004960561034337975, 'samples': 1992960, 'steps': 10379, 'loss/train': 2.050475835800171}} 11/06/2021 22:41:21 - INFO - __main__ - Step 10385: {'lr': 0.0004960514075806387, 'samples': 1993920, 'steps': 10384, 'loss/train': 2.118435859680176}} 11/06/2021 22:41:23 - INFO - __main__ - Step 10389: {'lr': 0.0004960476489025037, 'samples': 1994688, 'steps': 10388, 'loss/train': 1.7955809831619263} 11/06/2021 22:41:23 - INFO - __main__ - Step 10389: {'lr': 0.0004960476489025037, 'samples': 1994688, 'steps': 10388, 'loss/train': 1.7955809831619263} 11/06/2021 22:41:27 - INFO - __main__ - Step 10396: {'lr': 0.0004960410669474708, 'samples': 1996032, 'steps': 10395, 'loss/train': 1.8390122652053833} 11/06/2021 22:41:29 - INFO - __main__ - Step 10400: {'lr': 0.0004960373033913289, 'samples': 1996800, 'steps': 10399, 'loss/train': 1.6192007064819336} 11/06/2021 22:41:31 - INFO - __main__ - Step 10405: {'lr': 0.0004960325964517912, 'samples': 1997760, 'steps': 10404, 'loss/train': 1.8894060850143433} 11/06/2021 22:41:31 - INFO - __main__ - Step 10405: {'lr': 0.0004960325964517912, 'samples': 1997760, 'steps': 10404, 'loss/train': 1.8894060850143433} 11/06/2021 22:41:36 - INFO - __main__ - Step 10413: {'lr': 0.0004960250595839111, 'samples': 1999296, 'steps': 10412, 'loss/train': 1.6633156538009644} 11/06/2021 22:41:37 - INFO - __main__ - Step 10417: {'lr': 0.0004960212884894353, 'samples': 2000064, 'steps': 10416, 'loss/train': 1.5149108171463013} 11/06/2021 22:41:39 - INFO - __main__ - Step 10421: {'lr': 0.0004960175156213051, 'samples': 2000832, 'steps': 10420, 'loss/train': 0.9687737226486206} 11/06/2021 22:41:42 - INFO - __main__ - Step 10426: {'lr': 0.0004960127970419822, 'samples': 2001792, 'steps': 10425, 'loss/train': 1.9231642484664917} 11/06/2021 22:41:44 - INFO - __main__ - Step 10430: {'lr': 0.0004960090201832293, 'samples': 2002560, 'steps': 10429, 'loss/train': 1.1784454584121704} 11/06/2021 22:41:46 - INFO - __main__ - Step 10434: {'lr': 0.0004960052415509103, 'samples': 2003328, 'steps': 10433, 'loss/train': 1.7110601663589478} 11/06/2021 22:41:47 - INFO - __main__ - Step 10438: {'lr': 0.0004960014611450527, 'samples': 2004096, 'steps': 10437, 'loss/train': 1.7718604803085327} 11/06/2021 22:41:49 - INFO - __main__ - Step 10442: {'lr': 0.0004959976789656838, 'samples': 2004864, 'steps': 10441, 'loss/train': 1.8617457151412964} 11/06/2021 22:41:52 - INFO - __main__ - Step 10447: {'lr': 0.0004959929487475138, 'samples': 2005824, 'steps': 10446, 'loss/train': 1.3970377445220947} 11/06/2021 22:41:54 - INFO - __main__ - Step 10451: {'lr': 0.0004959891625778438, 'samples': 2006592, 'steps': 10450, 'loss/train': 2.1753814220428467} 11/06/2021 22:41:54 - INFO - __main__ - Step 10451: {'lr': 0.0004959891625778438, 'samples': 2006592, 'steps': 10450, 'loss/train': 2.1753814220428467} 11/06/2021 22:41:58 - INFO - __main__ - Step 10458: {'lr': 0.0004959825325136396, 'samples': 2007936, 'steps': 10457, 'loss/train': 1.9572356939315796} 11/06/2021 22:42:00 - INFO - __main__ - Step 10463: {'lr': 0.000495977793428407, 'samples': 2008896, 'steps': 10462, 'loss/train': 1.9288679361343384}} 11/06/2021 22:42:02 - INFO - __main__ - Step 10467: {'lr': 0.0004959740001652102, 'samples': 2009664, 'steps': 10466, 'loss/train': 1.70595121383667}}} 11/06/2021 22:42:04 - INFO - __main__ - Step 10471: {'lr': 0.0004959702051286999, 'samples': 2010432, 'steps': 10470, 'loss/train': 2.0080511569976807} 11/06/2021 22:42:06 - INFO - __main__ - Step 10475: {'lr': 0.0004959664083189035, 'samples': 2011200, 'steps': 10474, 'loss/train': 1.8766359090805054} 11/06/2021 22:42:08 - INFO - __main__ - Step 10479: {'lr': 0.0004959626097358485, 'samples': 2011968, 'steps': 10478, 'loss/train': 1.7041106224060059} 11/06/2021 22:42:10 - INFO - __main__ - Step 10484: {'lr': 0.0004959578590134262, 'samples': 2012928, 'steps': 10483, 'loss/train': 1.7551143169403076} 11/06/2021 22:42:10 - INFO - __main__ - Step 10484: {'lr': 0.0004959578590134262, 'samples': 2012928, 'steps': 10483, 'loss/train': 1.7551143169403076} 11/06/2021 22:42:14 - INFO - __main__ - Step 10492: {'lr': 0.0004959502520946827, 'samples': 2014464, 'steps': 10491, 'loss/train': 2.1610023975372314} 11/06/2021 22:42:15 - INFO - __main__ - Step 10496: {'lr': 0.0004959464459755839, 'samples': 2015232, 'steps': 10495, 'loss/train': 1.2869349718093872} 11/06/2021 22:42:18 - INFO - __main__ - Step 10500: {'lr': 0.0004959426380833703, 'samples': 2016000, 'steps': 10499, 'loss/train': 1.6832598447799683} 11/06/2021 22:42:20 - INFO - __main__ - Step 10504: {'lr': 0.0004959388284180694, 'samples': 2016768, 'steps': 10503, 'loss/train': 2.1253671646118164} 11/06/2021 22:42:22 - INFO - __main__ - Step 10508: {'lr': 0.0004959350169797085, 'samples': 2017536, 'steps': 10507, 'loss/train': 1.786071538925171}} 11/06/2021 22:42:23 - INFO - __main__ - Step 10512: {'lr': 0.0004959312037683154, 'samples': 2018304, 'steps': 10511, 'loss/train': 1.6077988147735596} 11/06/2021 22:42:25 - INFO - __main__ - Step 10516: {'lr': 0.0004959273887839175, 'samples': 2019072, 'steps': 10515, 'loss/train': 1.4780120849609375} 11/06/2021 22:42:28 - INFO - __main__ - Step 10521: {'lr': 0.0004959226175601736, 'samples': 2020032, 'steps': 10520, 'loss/train': 1.7402362823486328} 11/06/2021 22:42:30 - INFO - __main__ - Step 10526: {'lr': 0.0004959178435662064, 'samples': 2020992, 'steps': 10525, 'loss/train': 3.172778606414795}} 11/06/2021 22:42:30 - INFO - __main__ - Step 10526: {'lr': 0.0004959178435662064, 'samples': 2020992, 'steps': 10525, 'loss/train': 3.172778606414795}} 11/06/2021 22:42:34 - INFO - __main__ - Step 10534: {'lr': 0.0004959101994139284, 'samples': 2022528, 'steps': 10533, 'loss/train': 1.9462462663650513} 11/06/2021 22:42:34 - INFO - __main__ - Step 10534: {'lr': 0.0004959101994139284, 'samples': 2022528, 'steps': 10533, 'loss/train': 1.9462462663650513} 11/06/2021 22:42:38 - INFO - __main__ - Step 10541: {'lr': 0.0004959035049635023, 'samples': 2023872, 'steps': 10540, 'loss/train': 0.9154389500617981} 11/06/2021 22:42:40 - INFO - __main__ - Step 10545: {'lr': 0.0004958996771256422, 'samples': 2024640, 'steps': 10544, 'loss/train': 1.765254259109497}} 11/06/2021 22:42:42 - INFO - __main__ - Step 10549: {'lr': 0.0004958958475150044, 'samples': 2025408, 'steps': 10548, 'loss/train': 1.9864680767059326} 11/06/2021 22:42:44 - INFO - __main__ - Step 10553: {'lr': 0.0004958920161316167, 'samples': 2026176, 'steps': 10552, 'loss/train': 1.855413556098938}} 11/06/2021 22:42:46 - INFO - __main__ - Step 10557: {'lr': 0.0004958881829755066, 'samples': 2026944, 'steps': 10556, 'loss/train': 2.26411771774292}}} 11/06/2021 22:42:46 - INFO - __main__ - Step 10557: {'lr': 0.0004958881829755066, 'samples': 2026944, 'steps': 10556, 'loss/train': 2.26411771774292}}} 11/06/2021 22:42:50 - INFO - __main__ - Step 10565: {'lr': 0.0004958805113452298, 'samples': 2028480, 'steps': 10564, 'loss/train': 1.6105812788009644} 11/06/2021 22:42:51 - INFO - __main__ - Step 10569: {'lr': 0.0004958766728711184, 'samples': 2029248, 'steps': 10568, 'loss/train': 1.380372166633606}} 11/06/2021 22:42:53 - INFO - __main__ - Step 10573: {'lr': 0.0004958728326243954, 'samples': 2030016, 'steps': 10572, 'loss/train': 1.5941991806030273} 11/06/2021 22:42:56 - INFO - __main__ - Step 10578: {'lr': 0.0004958680298232983, 'samples': 2030976, 'steps': 10577, 'loss/train': 1.4245156049728394} 11/06/2021 22:42:56 - INFO - __main__ - Step 10578: {'lr': 0.0004958680298232983, 'samples': 2030976, 'steps': 10577, 'loss/train': 1.4245156049728394} 11/06/2021 22:43:00 - INFO - __main__ - Step 10586: {'lr': 0.00049586033958078, 'samples': 2032512, 'steps': 10585, 'loss/train': 2.0318799018859863}4} 11/06/2021 22:43:01 - INFO - __main__ - Step 10590: {'lr': 0.0004958564918007659, 'samples': 2033280, 'steps': 10589, 'loss/train': 0.6293484568595886} 11/06/2021 22:43:04 - INFO - __main__ - Step 10594: {'lr': 0.0004958526422482857, 'samples': 2034048, 'steps': 10593, 'loss/train': 1.8704800605773926} 11/06/2021 22:43:06 - INFO - __main__ - Step 10599: {'lr': 0.0004958478278151969, 'samples': 2035008, 'steps': 10598, 'loss/train': 1.3910499811172485} 11/06/2021 22:43:08 - INFO - __main__ - Step 10603: {'lr': 0.000495843974274769, 'samples': 2035776, 'steps': 10602, 'loss/train': 1.4589587450027466}} 11/06/2021 22:43:08 - INFO - __main__ - Step 10603: {'lr': 0.000495843974274769, 'samples': 2035776, 'steps': 10602, 'loss/train': 1.4589587450027466}} 11/06/2021 22:43:11 - INFO - __main__ - Step 10610: {'lr': 0.0004958372263142571, 'samples': 2037120, 'steps': 10609, 'loss/train': 1.8919156789779663} 11/06/2021 22:43:11 - INFO - __main__ - Step 10610: {'lr': 0.0004958372263142571, 'samples': 2037120, 'steps': 10609, 'loss/train': 1.8919156789779663} 11/06/2021 22:43:15 - INFO - __main__ - Step 10616: {'lr': 0.0004958314380280504, 'samples': 2038272, 'steps': 10615, 'loss/train': 1.4865412712097168} 11/06/2021 22:43:18 - INFO - __main__ - Step 10622: {'lr': 0.0004958256457542011, 'samples': 2039424, 'steps': 10621, 'loss/train': 1.7138190269470215} 11/06/2021 22:43:18 - INFO - __main__ - Step 10622: {'lr': 0.0004958256457542011, 'samples': 2039424, 'steps': 10621, 'loss/train': 1.7138190269470215} 11/06/2021 22:43:22 - INFO - __main__ - Step 10629: {'lr': 0.0004958188830615649, 'samples': 2040768, 'steps': 10628, 'loss/train': 2.004051446914673}} 11/06/2021 22:43:24 - INFO - __main__ - Step 10634: {'lr': 0.0004958140492439502, 'samples': 2041728, 'steps': 10633, 'loss/train': 1.8917224407196045} 11/06/2021 22:43:26 - INFO - __main__ - Step 10639: {'lr': 0.0004958092126573352, 'samples': 2042688, 'steps': 10638, 'loss/train': 1.832000970840454}} 11/06/2021 22:43:26 - INFO - __main__ - Step 10639: {'lr': 0.0004958092126573352, 'samples': 2042688, 'steps': 10638, 'loss/train': 1.832000970840454}} 11/06/2021 22:43:30 - INFO - __main__ - Step 10646: {'lr': 0.0004958024367842569, 'samples': 2044032, 'steps': 10645, 'loss/train': 1.8402926921844482} 11/06/2021 22:43:32 - INFO - __main__ - Step 10650: {'lr': 0.0004957985624201688, 'samples': 2044800, 'steps': 10649, 'loss/train': 1.3269506692886353} 11/06/2021 22:43:34 - INFO - __main__ - Step 10655: {'lr': 0.0004957937169731186, 'samples': 2045760, 'steps': 10654, 'loss/train': 1.672598958015442}} 11/06/2021 22:43:36 - INFO - __main__ - Step 10659: {'lr': 0.0004957898386219603, 'samples': 2046528, 'steps': 10658, 'loss/train': 1.6076505184173584} 11/06/2021 22:43:36 - INFO - __main__ - Step 10659: {'lr': 0.0004957898386219603, 'samples': 2046528, 'steps': 10658, 'loss/train': 1.6076505184173584} 11/06/2021 22:43:40 - INFO - __main__ - Step 10666: {'lr': 0.0004957830472436097, 'samples': 2047872, 'steps': 10665, 'loss/train': 2.27144718170166}4} 11/06/2021 22:43:42 - INFO - __main__ - Step 10671: {'lr': 0.0004957781929366832, 'samples': 2048832, 'steps': 10670, 'loss/train': 2.503723382949829}} 11/06/2021 22:43:42 - INFO - __main__ - Step 10671: {'lr': 0.0004957781929366832, 'samples': 2048832, 'steps': 10670, 'loss/train': 2.503723382949829}} 11/06/2021 22:43:47 - INFO - __main__ - Step 10678: {'lr': 0.0004957713922557563, 'samples': 2050176, 'steps': 10677, 'loss/train': 1.6342339515686035} 11/06/2021 22:43:48 - INFO - __main__ - Step 10682: {'lr': 0.0004957675037160624, 'samples': 2050944, 'steps': 10681, 'loss/train': 1.979127049446106}} 11/06/2021 22:43:50 - INFO - __main__ - Step 10686: {'lr': 0.0004957636134045437, 'samples': 2051712, 'steps': 10685, 'loss/train': 1.5401474237442017} 11/06/2021 22:43:52 - INFO - __main__ - Step 10691: {'lr': 0.0004957587480235595, 'samples': 2052672, 'steps': 10690, 'loss/train': 1.7230714559555054} 11/06/2021 22:43:55 - INFO - __main__ - Step 10695: {'lr': 0.0004957548537255378, 'samples': 2053440, 'steps': 10694, 'loss/train': 2.194444417953491}} 11/06/2021 22:43:57 - INFO - __main__ - Step 10699: {'lr': 0.0004957509576557826, 'samples': 2054208, 'steps': 10698, 'loss/train': 1.7377288341522217} 11/06/2021 22:43:58 - INFO - __main__ - Step 10703: {'lr': 0.0004957470598143218, 'samples': 2054976, 'steps': 10702, 'loss/train': 1.4152930974960327} 11/06/2021 22:44:00 - INFO - __main__ - Step 10707: {'lr': 0.0004957431602011839, 'samples': 2055744, 'steps': 10706, 'loss/train': 1.5255722999572754} 11/06/2021 22:44:03 - INFO - __main__ - Step 10712: {'lr': 0.000495738283193383, 'samples': 2056704, 'steps': 10711, 'loss/train': 1.6758942604064941}} 11/06/2021 22:44:05 - INFO - __main__ - Step 10716: {'lr': 0.0004957343795940738, 'samples': 2057472, 'steps': 10715, 'loss/train': 2.222113847732544}} 11/06/2021 22:44:05 - INFO - __main__ - Step 10716: {'lr': 0.0004957343795940738, 'samples': 2057472, 'steps': 10715, 'loss/train': 2.222113847732544}} 11/06/2021 22:44:08 - INFO - __main__ - Step 10723: {'lr': 0.0004957275440324211, 'samples': 2058816, 'steps': 10722, 'loss/train': 1.7179454565048218} 11/06/2021 22:44:10 - INFO - __main__ - Step 10727: {'lr': 0.0004957236355613184, 'samples': 2059584, 'steps': 10726, 'loss/train': 1.7511495351791382} 11/06/2021 22:44:10 - INFO - __main__ - Step 10727: {'lr': 0.0004957236355613184, 'samples': 2059584, 'steps': 10726, 'loss/train': 1.7511495351791382} 11/06/2021 22:44:15 - INFO - __main__ - Step 10736: {'lr': 0.0004957148350243025, 'samples': 2061312, 'steps': 10735, 'loss/train': 2.2027082443237305} 11/06/2021 22:44:15 - INFO - __main__ - Step 10736: {'lr': 0.0004957148350243025, 'samples': 2061312, 'steps': 10735, 'loss/train': 2.2027082443237305} 11/06/2021 22:44:18 - INFO - __main__ - Step 10743: {'lr': 0.0004957079839621051, 'samples': 2062656, 'steps': 10742, 'loss/train': 2.051661968231201}} 11/06/2021 22:44:20 - INFO - __main__ - Step 10748: {'lr': 0.0004957030870248742, 'samples': 2063616, 'steps': 10747, 'loss/train': 1.2029649019241333} 11/06/2021 22:44:20 - INFO - __main__ - Step 10748: {'lr': 0.0004957030870248742, 'samples': 2063616, 'steps': 10747, 'loss/train': 1.2029649019241333} 11/06/2021 22:44:24 - INFO - __main__ - Step 10756: {'lr': 0.0004956952461684066, 'samples': 2065152, 'steps': 10755, 'loss/train': 2.1202139854431152} 11/06/2021 22:44:26 - INFO - __main__ - Step 10760: {'lr': 0.0004956913230832031, 'samples': 2065920, 'steps': 10759, 'loss/train': 2.134770154953003}} 11/06/2021 22:44:28 - INFO - __main__ - Step 10764: {'lr': 0.000495687398226724, 'samples': 2066688, 'steps': 10763, 'loss/train': 1.661841869354248}}} 11/06/2021 22:44:31 - INFO - __main__ - Step 10768: {'lr': 0.0004956834715989977, 'samples': 2067456, 'steps': 10767, 'loss/train': 1.8057781457901}}}} 11/06/2021 22:44:32 - INFO - __main__ - Step 10772: {'lr': 0.0004956795432000526, 'samples': 2068224, 'steps': 10771, 'loss/train': 1.9097800254821777} 11/06/2021 22:44:34 - INFO - __main__ - Step 10776: {'lr': 0.0004956756130299169, 'samples': 2068992, 'steps': 10775, 'loss/train': 1.20347261428833}7} 11/06/2021 22:44:36 - INFO - __main__ - Step 10781: {'lr': 0.0004956706978265536, 'samples': 2069952, 'steps': 10780, 'loss/train': 1.7361626625061035} 11/06/2021 22:44:38 - INFO - __main__ - Step 10785: {'lr': 0.0004956667636713427, 'samples': 2070720, 'steps': 10784, 'loss/train': 1.5868165493011475} 11/06/2021 22:44:40 - INFO - __main__ - Step 10789: {'lr': 0.0004956628277450333, 'samples': 2071488, 'steps': 10788, 'loss/train': 2.1454813480377197} 11/06/2021 22:44:43 - INFO - __main__ - Step 10793: {'lr': 0.0004956588900476538, 'samples': 2072256, 'steps': 10792, 'loss/train': 1.459761381149292}} 11/06/2021 22:44:44 - INFO - __main__ - Step 10797: {'lr': 0.0004956549505792327, 'samples': 2073024, 'steps': 10796, 'loss/train': 2.264521360397339}} 11/06/2021 22:44:46 - INFO - __main__ - Step 10801: {'lr': 0.0004956510093397983, 'samples': 2073792, 'steps': 10800, 'loss/train': 1.6848071813583374} 11/06/2021 22:44:46 - INFO - __main__ - Step 10801: {'lr': 0.0004956510093397983, 'samples': 2073792, 'steps': 10800, 'loss/train': 1.6848071813583374} 11/06/2021 22:44:51 - INFO - __main__ - Step 10810: {'lr': 0.0004956421350759508, 'samples': 2075520, 'steps': 10809, 'loss/train': 1.3117084503173828} 11/06/2021 22:44:53 - INFO - __main__ - Step 10814: {'lr': 0.0004956381880809195, 'samples': 2076288, 'steps': 10813, 'loss/train': 2.0894155502319336} 11/06/2021 22:44:54 - INFO - __main__ - Step 10818: {'lr': 0.0004956342393149959, 'samples': 2077056, 'steps': 10817, 'loss/train': 1.7274878025054932} 11/06/2021 22:44:57 - INFO - __main__ - Step 10822: {'lr': 0.0004956302887782082, 'samples': 2077824, 'steps': 10821, 'loss/train': 1.0446696281433105} 11/06/2021 22:44:59 - INFO - __main__ - Step 10826: {'lr': 0.0004956263364705851, 'samples': 2078592, 'steps': 10825, 'loss/train': 1.6332308053970337} 11/06/2021 22:45:01 - INFO - __main__ - Step 10830: {'lr': 0.000495622382392155, 'samples': 2079360, 'steps': 10829, 'loss/train': 1.06633460521698}37} 11/06/2021 22:45:02 - INFO - __main__ - Step 10834: {'lr': 0.0004956184265429463, 'samples': 2080128, 'steps': 10833, 'loss/train': 2.010787010192871}} 11/06/2021 22:45:04 - INFO - __main__ - Step 10838: {'lr': 0.0004956144689229877, 'samples': 2080896, 'steps': 10837, 'loss/train': 2.1746902465820312} 11/06/2021 22:45:07 - INFO - __main__ - Step 10843: {'lr': 0.0004956095194079658, 'samples': 2081856, 'steps': 10842, 'loss/train': 1.8103166818618774} 11/06/2021 22:45:09 - INFO - __main__ - Step 10847: {'lr': 0.0004956055578039241, 'samples': 2082624, 'steps': 10846, 'loss/train': 1.727042317390442}} 11/06/2021 22:45:11 - INFO - __main__ - Step 10851: {'lr': 0.0004956015944292253, 'samples': 2083392, 'steps': 10850, 'loss/train': 2.1259398460388184} 11/06/2021 22:45:12 - INFO - __main__ - Step 10855: {'lr': 0.0004955976292838979, 'samples': 2084160, 'steps': 10854, 'loss/train': 1.4349844455718994} 11/06/2021 22:45:15 - INFO - __main__ - Step 10859: {'lr': 0.0004955936623679703, 'samples': 2084928, 'steps': 10858, 'loss/train': 2.2991316318511963} 11/06/2021 22:45:17 - INFO - __main__ - Step 10864: {'lr': 0.0004955887012331982, 'samples': 2085888, 'steps': 10863, 'loss/train': 1.7115322351455688} 11/06/2021 22:45:17 - INFO - __main__ - Step 10864: {'lr': 0.0004955887012331982, 'samples': 2085888, 'steps': 10863, 'loss/train': 1.7115322351455688} 11/06/2021 22:45:20 - INFO - __main__ - Step 10871: {'lr': 0.0004955817509968737, 'samples': 2087232, 'steps': 10870, 'loss/train': 1.3590363264083862} 11/06/2021 22:45:22 - INFO - __main__ - Step 10875: {'lr': 0.0004955777769988322, 'samples': 2088000, 'steps': 10874, 'loss/train': 2.0556890964508057} 11/06/2021 22:45:25 - INFO - __main__ - Step 10880: {'lr': 0.0004955728070115787, 'samples': 2088960, 'steps': 10879, 'loss/train': 1.9695255756378174} 11/06/2021 22:45:25 - INFO - __main__ - Step 10880: {'lr': 0.0004955728070115787, 'samples': 2088960, 'steps': 10879, 'loss/train': 1.9695255756378174} 11/06/2021 22:45:28 - INFO - __main__ - Step 10887: {'lr': 0.0004955658443820809, 'samples': 2090304, 'steps': 10886, 'loss/train': 1.9681472778320312} 11/06/2021 22:45:31 - INFO - __main__ - Step 10892: {'lr': 0.0004955608677558424, 'samples': 2091264, 'steps': 10891, 'loss/train': 1.5222508907318115} 11/06/2021 22:45:33 - INFO - __main__ - Step 10897: {'lr': 0.0004955558883634555, 'samples': 2092224, 'steps': 10896, 'loss/train': 2.1378374099731445} 11/06/2021 22:45:35 - INFO - __main__ - Step 10901: {'lr': 0.0004955519028579568, 'samples': 2092992, 'steps': 10900, 'loss/train': 1.904826045036316}} 11/06/2021 22:45:37 - INFO - __main__ - Step 10905: {'lr': 0.0004955479155821877, 'samples': 2093760, 'steps': 10904, 'loss/train': 1.7908188104629517} 11/06/2021 22:45:38 - INFO - __main__ - Step 10909: {'lr': 0.000495543926536177, 'samples': 2094528, 'steps': 10908, 'loss/train': 1.9852023124694824}} 11/06/2021 22:45:40 - INFO - __main__ - Step 10913: {'lr': 0.0004955399357199534, 'samples': 2095296, 'steps': 10912, 'loss/train': 1.5614092350006104} 11/06/2021 22:45:43 - INFO - __main__ - Step 10918: {'lr': 0.0004955349447103559, 'samples': 2096256, 'steps': 10917, 'loss/train': 1.4804587364196777} 11/06/2021 22:45:45 - INFO - __main__ - Step 10922: {'lr': 0.0004955309499112586, 'samples': 2097024, 'steps': 10921, 'loss/train': 1.534606695175171}} 11/06/2021 22:45:47 - INFO - __main__ - Step 10926: {'lr': 0.0004955269533420419, 'samples': 2097792, 'steps': 10925, 'loss/train': 1.5095373392105103} 11/06/2021 22:45:49 - INFO - __main__ - Step 10930: {'lr': 0.0004955229550027347, 'samples': 2098560, 'steps': 10929, 'loss/train': 1.7207810878753662} 11/06/2021 22:45:51 - INFO - __main__ - Step 10934: {'lr': 0.000495518954893366, 'samples': 2099328, 'steps': 10933, 'loss/train': 1.7026230096817017}} 11/06/2021 22:45:53 - INFO - __main__ - Step 10939: {'lr': 0.0004955139522675496, 'samples': 2100288, 'steps': 10938, 'loss/train': 2.6687307357788086} 11/06/2021 22:45:55 - INFO - __main__ - Step 10943: {'lr': 0.0004955099481756475, 'samples': 2101056, 'steps': 10942, 'loss/train': 2.186129331588745}} 11/06/2021 22:45:57 - INFO - __main__ - Step 10947: {'lr': 0.0004955059423137774, 'samples': 2101824, 'steps': 10946, 'loss/train': 1.8952678442001343} 11/06/2021 22:45:59 - INFO - __main__ - Step 10951: {'lr': 0.0004955019346819684, 'samples': 2102592, 'steps': 10950, 'loss/train': 1.7060812711715698} 11/06/2021 22:46:01 - INFO - __main__ - Step 10955: {'lr': 0.0004954979252802491, 'samples': 2103360, 'steps': 10954, 'loss/train': 1.854745626449585}} 11/06/2021 22:46:03 - INFO - __main__ - Step 10959: {'lr': 0.0004954939141086488, 'samples': 2104128, 'steps': 10958, 'loss/train': 1.2875730991363525} 11/06/2021 22:46:03 - INFO - __main__ - Step 10959: {'lr': 0.0004954939141086488, 'samples': 2104128, 'steps': 10958, 'loss/train': 1.2875730991363525} 11/06/2021 22:46:07 - INFO - __main__ - Step 10967: {'lr': 0.0004954858864559199, 'samples': 2105664, 'steps': 10966, 'loss/train': 2.002821922302246}} 11/06/2021 22:46:09 - INFO - __main__ - Step 10971: {'lr': 0.0004954818699748493, 'samples': 2106432, 'steps': 10970, 'loss/train': 1.9312834739685059} 11/06/2021 22:46:10 - INFO - __main__ - Step 10975: {'lr': 0.0004954778517240133, 'samples': 2107200, 'steps': 10974, 'loss/train': 1.7459683418273926} 11/06/2021 22:46:13 - INFO - __main__ - Step 10980: {'lr': 0.0004954728264217796, 'samples': 2108160, 'steps': 10979, 'loss/train': 1.738264560699463}} 11/06/2021 22:46:15 - INFO - __main__ - Step 10985: {'lr': 0.0004954677983543893, 'samples': 2109120, 'steps': 10984, 'loss/train': 1.65086829662323}}} 11/06/2021 22:46:17 - INFO - __main__ - Step 10989: {'lr': 0.0004954637739096023, 'samples': 2109888, 'steps': 10988, 'loss/train': 2.1266255378723145} 11/06/2021 22:46:17 - INFO - __main__ - Step 10989: {'lr': 0.0004954637739096023, 'samples': 2109888, 'steps': 10988, 'loss/train': 2.1266255378723145} 11/06/2021 22:46:21 - INFO - __main__ - Step 10996: {'lr': 0.0004954567268730582, 'samples': 2111232, 'steps': 10995, 'loss/train': 1.7507246732711792} 11/06/2021 22:46:23 - INFO - __main__ - Step 11000: {'lr': 0.0004954526975618447, 'samples': 2112000, 'steps': 10999, 'loss/train': 1.7559417486190796} 11/06/2021 22:46:25 - INFO - __main__ - Step 11004: {'lr': 0.0004954486664810762, 'samples': 2112768, 'steps': 11003, 'loss/train': 1.973361849784851}} 11/06/2021 22:46:27 - INFO - __main__ - Step 11008: {'lr': 0.0004954446336307814, 'samples': 2113536, 'steps': 11007, 'loss/train': 1.5346612930297852} 11/06/2021 22:46:29 - INFO - __main__ - Step 11012: {'lr': 0.0004954405990109897, 'samples': 2114304, 'steps': 11011, 'loss/train': 1.1053050756454468} 11/06/2021 22:46:31 - INFO - __main__ - Step 11016: {'lr': 0.0004954365626217299, 'samples': 2115072, 'steps': 11015, 'loss/train': 1.8625273704528809} 11/06/2021 22:46:31 - INFO - __main__ - Step 11016: {'lr': 0.0004954365626217299, 'samples': 2115072, 'steps': 11015, 'loss/train': 1.8625273704528809} 11/06/2021 22:46:34 - INFO - __main__ - Step 11023: {'lr': 0.0004954294946828308, 'samples': 2116416, 'steps': 11022, 'loss/train': 2.187809944152832}} 11/06/2021 22:46:37 - INFO - __main__ - Step 11029: {'lr': 0.0004954234321365998, 'samples': 2117568, 'steps': 11028, 'loss/train': 2.104278087615967}} 11/06/2021 22:46:39 - INFO - __main__ - Step 11033: {'lr': 0.0004954193882274261, 'samples': 2118336, 'steps': 11032, 'loss/train': 1.8850313425064087} 11/06/2021 22:46:41 - INFO - __main__ - Step 11037: {'lr': 0.0004954153425489374, 'samples': 2119104, 'steps': 11036, 'loss/train': 1.7933595180511475} 11/06/2021 22:46:43 - INFO - __main__ - Step 11041: {'lr': 0.0004954112951011628, 'samples': 2119872, 'steps': 11040, 'loss/train': 1.5636467933654785} 11/06/2021 22:46:45 - INFO - __main__ - Step 11045: {'lr': 0.0004954072458841315, 'samples': 2120640, 'steps': 11044, 'loss/train': 1.7006868124008179} 11/06/2021 22:46:47 - INFO - __main__ - Step 11049: {'lr': 0.0004954031948978729, 'samples': 2121408, 'steps': 11048, 'loss/train': 1.3999689817428589} 11/06/2021 22:46:49 - INFO - __main__ - Step 11054: {'lr': 0.0004953981286771178, 'samples': 2122368, 'steps': 11053, 'loss/train': 0.8524397015571594} 11/06/2021 22:46:49 - INFO - __main__ - Step 11054: {'lr': 0.0004953981286771178, 'samples': 2122368, 'steps': 11053, 'loss/train': 0.8524397015571594} 11/06/2021 22:46:52 - INFO - __main__ - Step 11060: {'lr': 0.0004953920455633206, 'samples': 2123520, 'steps': 11059, 'loss/train': 1.7199558019638062} 11/06/2021 22:46:54 - INFO - __main__ - Step 11064: {'lr': 0.000495387987942719, 'samples': 2124288, 'steps': 11063, 'loss/train': 1.8901044130325317}} 11/06/2021 22:46:57 - INFO - __main__ - Step 11069: {'lr': 0.0004953829134291895, 'samples': 2125248, 'steps': 11068, 'loss/train': 2.0392954349517822} 11/06/2021 22:46:59 - INFO - __main__ - Step 11074: {'lr': 0.0004953778361515163, 'samples': 2126208, 'steps': 11073, 'loss/train': 1.7177420854568481} 11/06/2021 22:47:01 - INFO - __main__ - Step 11078: {'lr': 0.0004953737723392324, 'samples': 2126976, 'steps': 11077, 'loss/train': 2.2842390537261963} 11/06/2021 22:47:03 - INFO - __main__ - Step 11082: {'lr': 0.0004953697067579624, 'samples': 2127744, 'steps': 11081, 'loss/train': 1.5058166980743408} 11/06/2021 22:47:05 - INFO - __main__ - Step 11086: {'lr': 0.0004953656394077355, 'samples': 2128512, 'steps': 11085, 'loss/train': 1.1860450506210327} 11/06/2021 22:47:07 - INFO - __main__ - Step 11090: {'lr': 0.0004953615702885812, 'samples': 2129280, 'steps': 11089, 'loss/train': 1.5646089315414429} 11/06/2021 22:47:09 - INFO - __main__ - Step 11095: {'lr': 0.0004953564814021285, 'samples': 2130240, 'steps': 11094, 'loss/train': 1.9974864721298218} 11/06/2021 22:47:11 - INFO - __main__ - Step 11099: {'lr': 0.0004953524083029945, 'samples': 2131008, 'steps': 11098, 'loss/train': 2.143897533416748}} 11/06/2021 22:47:13 - INFO - __main__ - Step 11103: {'lr': 0.0004953483334350283, 'samples': 2131776, 'steps': 11102, 'loss/train': 1.5690078735351562} 11/06/2021 22:47:15 - INFO - __main__ - Step 11107: {'lr': 0.0004953442567982593, 'samples': 2132544, 'steps': 11106, 'loss/train': 1.606833815574646}} 11/06/2021 22:47:17 - INFO - __main__ - Step 11111: {'lr': 0.0004953401783927171, 'samples': 2133312, 'steps': 11110, 'loss/train': 1.9979164600372314} 11/06/2021 22:47:19 - INFO - __main__ - Step 11116: {'lr': 0.0004953350778984963, 'samples': 2134272, 'steps': 11115, 'loss/train': 1.8427363634109497} 11/06/2021 22:47:21 - INFO - __main__ - Step 11120: {'lr': 0.0004953309955133214, 'samples': 2135040, 'steps': 11119, 'loss/train': 1.8152966499328613} 11/06/2021 22:47:23 - INFO - __main__ - Step 11124: {'lr': 0.0004953269113594687, 'samples': 2135808, 'steps': 11123, 'loss/train': 1.684847116470337}} 11/06/2021 22:47:23 - INFO - __main__ - Step 11124: {'lr': 0.0004953269113594687, 'samples': 2135808, 'steps': 11123, 'loss/train': 1.684847116470337}} 11/06/2021 22:47:26 - INFO - __main__ - Step 11131: {'lr': 0.0004953197598344342, 'samples': 2137152, 'steps': 11130, 'loss/train': 2.116920232772827}} 11/06/2021 22:47:29 - INFO - __main__ - Step 11136: {'lr': 0.0004953146482861385, 'samples': 2138112, 'steps': 11135, 'loss/train': 1.7565672397613525} 11/06/2021 22:47:31 - INFO - __main__ - Step 11141: {'lr': 0.000495309533974468, 'samples': 2139072, 'steps': 11140, 'loss/train': 1.7597218751907349}} 11/06/2021 22:47:33 - INFO - __main__ - Step 11145: {'lr': 0.0004953054405355404, 'samples': 2139840, 'steps': 11144, 'loss/train': 2.103456735610962}} 11/06/2021 22:47:33 - INFO - __main__ - Step 11145: {'lr': 0.0004953054405355404, 'samples': 2139840, 'steps': 11144, 'loss/train': 2.103456735610962}} 11/06/2021 22:47:36 - INFO - __main__ - Step 11152: {'lr': 0.0004952982727619973, 'samples': 2141184, 'steps': 11151, 'loss/train': 1.672674298286438}} 11/06/2021 22:47:39 - INFO - __main__ - Step 11157: {'lr': 0.0004952931496079143, 'samples': 2142144, 'steps': 11156, 'loss/train': 1.9701721668243408} 11/06/2021 22:47:41 - INFO - __main__ - Step 11162: {'lr': 0.0004952880236906988, 'samples': 2143104, 'steps': 11161, 'loss/train': 1.2465680837631226} 11/06/2021 22:47:43 - INFO - __main__ - Step 11166: {'lr': 0.0004952839209675096, 'samples': 2143872, 'steps': 11165, 'loss/train': 1.383769154548645}} 11/06/2021 22:47:45 - INFO - __main__ - Step 11170: {'lr': 0.000495279816475982, 'samples': 2144640, 'steps': 11169, 'loss/train': 1.741720199584961}}} 11/06/2021 22:47:47 - INFO - __main__ - Step 11174: {'lr': 0.0004952757102161457, 'samples': 2145408, 'steps': 11173, 'loss/train': 1.3439342975616455} 11/06/2021 22:47:49 - INFO - __main__ - Step 11178: {'lr': 0.0004952716021880301, 'samples': 2146176, 'steps': 11177, 'loss/train': 2.207341194152832}} 11/06/2021 22:47:49 - INFO - __main__ - Step 11178: {'lr': 0.0004952716021880301, 'samples': 2146176, 'steps': 11177, 'loss/train': 2.207341194152832}} 11/06/2021 22:47:49 - INFO - __main__ - Step 11178: {'lr': 0.0004952716021880301, 'samples': 2146176, 'steps': 11177, 'loss/train': 2.207341194152832}} 11/06/2021 22:47:55 - INFO - __main__ - Step 11189: {'lr': 0.0004952602959932644, 'samples': 2148288, 'steps': 11188, 'loss/train': 0.2998165488243103} 11/06/2021 22:47:57 - INFO - __main__ - Step 11194: {'lr': 0.0004952551523933682, 'samples': 2149248, 'steps': 11193, 'loss/train': 2.243593692779541}} 11/06/2021 22:48:00 - INFO - __main__ - Step 11199: {'lr': 0.0004952500060307674, 'samples': 2150208, 'steps': 11198, 'loss/train': 1.9236924648284912} 11/06/2021 22:48:02 - INFO - __main__ - Step 11203: {'lr': 0.0004952458869515782, 'samples': 2150976, 'steps': 11202, 'loss/train': 1.9771145582199097} 11/06/2021 22:48:02 - INFO - __main__ - Step 11203: {'lr': 0.0004952458869515782, 'samples': 2150976, 'steps': 11202, 'loss/train': 1.9771145582199097} 11/06/2021 22:48:05 - INFO - __main__ - Step 11210: {'lr': 0.0004952386743086107, 'samples': 2152320, 'steps': 11209, 'loss/train': 1.5023480653762817} 11/06/2021 22:48:07 - INFO - __main__ - Step 11215: {'lr': 0.0004952335191057447, 'samples': 2153280, 'steps': 11214, 'loss/train': 1.707075834274292}} 11/06/2021 22:48:10 - INFO - __main__ - Step 11220: {'lr': 0.0004952283611404176, 'samples': 2154240, 'steps': 11219, 'loss/train': 1.5863221883773804} 11/06/2021 22:48:12 - INFO - __main__ - Step 11224: {'lr': 0.000495224232779223, 'samples': 2155008, 'steps': 11223, 'loss/train': 1.7595049142837524}} 11/06/2021 22:48:12 - INFO - __main__ - Step 11224: {'lr': 0.000495224232779223, 'samples': 2155008, 'steps': 11223, 'loss/train': 1.7595049142837524}} 11/06/2021 22:48:15 - INFO - __main__ - Step 11231: {'lr': 0.0004952170038931217, 'samples': 2156352, 'steps': 11230, 'loss/train': 1.376936912536621}} 11/06/2021 22:48:18 - INFO - __main__ - Step 11236: {'lr': 0.0004952118370883101, 'samples': 2157312, 'steps': 11235, 'loss/train': 2.1615216732025146} 11/06/2021 22:48:20 - INFO - __main__ - Step 11240: {'lr': 0.0004952077016556619, 'samples': 2158080, 'steps': 11239, 'loss/train': 1.766661524772644}} 11/06/2021 22:48:22 - INFO - __main__ - Step 11244: {'lr': 0.0004952035644552249, 'samples': 2158848, 'steps': 11243, 'loss/train': 1.4197368621826172} 11/06/2021 22:48:24 - INFO - __main__ - Step 11248: {'lr': 0.0004951994254870286, 'samples': 2159616, 'steps': 11247, 'loss/train': 1.3976362943649292} 11/06/2021 22:48:25 - INFO - __main__ - Step 11252: {'lr': 0.0004951952847511033, 'samples': 2160384, 'steps': 11251, 'loss/train': 1.3120644092559814} 11/06/2021 22:48:25 - INFO - __main__ - Step 11252: {'lr': 0.0004951952847511033, 'samples': 2160384, 'steps': 11251, 'loss/train': 1.3120644092559814} 11/06/2021 22:48:29 - INFO - __main__ - Step 11260: {'lr': 0.0004951869979761842, 'samples': 2161920, 'steps': 11259, 'loss/train': 1.692081332206726}} 11/06/2021 22:48:31 - INFO - __main__ - Step 11264: {'lr': 0.0004951828519372503, 'samples': 2162688, 'steps': 11263, 'loss/train': 1.470284104347229}} 11/06/2021 22:48:33 - INFO - __main__ - Step 11268: {'lr': 0.0004951787041307066, 'samples': 2163456, 'steps': 11267, 'loss/train': 1.7130874395370483} 11/06/2021 22:48:35 - INFO - __main__ - Step 11272: {'lr': 0.0004951745545565831, 'samples': 2164224, 'steps': 11271, 'loss/train': 1.5138862133026123} 11/06/2021 22:48:37 - INFO - __main__ - Step 11277: {'lr': 0.000495169365103315, 'samples': 2165184, 'steps': 11276, 'loss/train': 1.7609519958496094}} 11/06/2021 22:48:39 - INFO - __main__ - Step 11281: {'lr': 0.0004951652115522462, 'samples': 2165952, 'steps': 11280, 'loss/train': 1.7629950046539307} 11/06/2021 22:48:41 - INFO - __main__ - Step 11285: {'lr': 0.0004951610562336949, 'samples': 2166720, 'steps': 11284, 'loss/train': 1.6244908571243286} 11/06/2021 22:48:43 - INFO - __main__ - Step 11289: {'lr': 0.0004951568991476908, 'samples': 2167488, 'steps': 11288, 'loss/train': 1.3978345394134521} 11/06/2021 22:48:45 - INFO - __main__ - Step 11293: {'lr': 0.0004951527402942643, 'samples': 2168256, 'steps': 11292, 'loss/train': 1.4760860204696655} 11/06/2021 22:48:47 - INFO - __main__ - Step 11298: {'lr': 0.0004951475392420884, 'samples': 2169216, 'steps': 11297, 'loss/train': 2.0938913822174072} 11/06/2021 22:48:47 - INFO - __main__ - Step 11298: {'lr': 0.0004951475392420884, 'samples': 2169216, 'steps': 11297, 'loss/train': 2.0938913822174072} 11/06/2021 22:48:51 - INFO - __main__ - Step 11305: {'lr': 0.0004951402531297482, 'samples': 2170560, 'steps': 11304, 'loss/train': 2.0329270362854004} 11/06/2021 22:48:53 - INFO - __main__ - Step 11309: {'lr': 0.0004951360872069309, 'samples': 2171328, 'steps': 11308, 'loss/train': 1.3344480991363525} 11/06/2021 22:48:55 - INFO - __main__ - Step 11314: {'lr': 0.0004951308773181856, 'samples': 2172288, 'steps': 11313, 'loss/train': 2.1913249492645264} 11/06/2021 22:48:58 - INFO - __main__ - Step 11319: {'lr': 0.0004951256646681356, 'samples': 2173248, 'steps': 11318, 'loss/train': 1.662163496017456}} 11/06/2021 22:49:00 - INFO - __main__ - Step 11323: {'lr': 0.0004951214925599957, 'samples': 2174016, 'steps': 11322, 'loss/train': 1.6424283981323242} 11/06/2021 22:49:00 - INFO - __main__ - Step 11323: {'lr': 0.0004951214925599957, 'samples': 2174016, 'steps': 11322, 'loss/train': 1.6424283981323242} 11/06/2021 22:49:03 - INFO - __main__ - Step 11330: {'lr': 0.0004951141871185224, 'samples': 2175360, 'steps': 11329, 'loss/train': 2.280247449874878}} 11/06/2021 22:49:06 - INFO - __main__ - Step 11335: {'lr': 0.0004951089656326919, 'samples': 2176320, 'steps': 11334, 'loss/train': 0.3128710389137268} 11/06/2021 22:49:08 - INFO - __main__ - Step 11339: {'lr': 0.0004951047864560629, 'samples': 2177088, 'steps': 11338, 'loss/train': 1.64912748336792}8} 11/06/2021 22:49:10 - INFO - __main__ - Step 11343: {'lr': 0.000495100605512387, 'samples': 2177856, 'steps': 11342, 'loss/train': 1.8620246648788452}} 11/06/2021 22:49:11 - INFO - __main__ - Step 11347: {'lr': 0.0004950964228016944, 'samples': 2178624, 'steps': 11346, 'loss/train': 1.8836588859558105} 11/06/2021 22:49:13 - INFO - __main__ - Step 11351: {'lr': 0.000495092238324015, 'samples': 2179392, 'steps': 11350, 'loss/train': 1.1730953454971313}} 11/06/2021 22:49:16 - INFO - __main__ - Step 11356: {'lr': 0.0004950870052421368, 'samples': 2180352, 'steps': 11355, 'loss/train': 1.806678295135498}} 11/06/2021 22:49:18 - INFO - __main__ - Step 11360: {'lr': 0.0004950828167888478, 'samples': 2181120, 'steps': 11359, 'loss/train': 1.9310498237609863} 11/06/2021 22:49:20 - INFO - __main__ - Step 11364: {'lr': 0.0004950786265686702, 'samples': 2181888, 'steps': 11363, 'loss/train': 1.6461005210876465} 11/06/2021 22:49:21 - INFO - __main__ - Step 11368: {'lr': 0.0004950744345816342, 'samples': 2182656, 'steps': 11367, 'loss/train': 1.2006498575210571} 11/06/2021 22:49:23 - INFO - __main__ - Step 11372: {'lr': 0.0004950702408277702, 'samples': 2183424, 'steps': 11371, 'loss/train': 1.4938647747039795} 11/06/2021 22:49:25 - INFO - __main__ - Step 11377: {'lr': 0.0004950649961508841, 'samples': 2184384, 'steps': 11376, 'loss/train': 2.194822311401367}} 11/06/2021 22:49:28 - INFO - __main__ - Step 11381: {'lr': 0.0004950607984217674, 'samples': 2185152, 'steps': 11380, 'loss/train': 1.64052414894104}}} 11/06/2021 22:49:30 - INFO - __main__ - Step 11385: {'lr': 0.0004950565989259207, 'samples': 2185920, 'steps': 11384, 'loss/train': 1.4204002618789673} 11/06/2021 22:49:31 - INFO - __main__ - Step 11389: {'lr': 0.0004950523976633745, 'samples': 2186688, 'steps': 11388, 'loss/train': 1.9979164600372314} 11/06/2021 22:49:33 - INFO - __main__ - Step 11393: {'lr': 0.000495048194634159, 'samples': 2187456, 'steps': 11392, 'loss/train': 1.676889419555664}4} 11/06/2021 22:49:36 - INFO - __main__ - Step 11398: {'lr': 0.0004950429383633073, 'samples': 2188416, 'steps': 11397, 'loss/train': 1.5339622497558594} 11/06/2021 22:49:38 - INFO - __main__ - Step 11403: {'lr': 0.0004950376793321413, 'samples': 2189376, 'steps': 11402, 'loss/train': 1.7271697521209717} 11/06/2021 22:49:40 - INFO - __main__ - Step 11407: {'lr': 0.0004950334701198222, 'samples': 2190144, 'steps': 11406, 'loss/train': 1.9516605138778687} 11/06/2021 22:49:40 - INFO - __main__ - Step 11407: {'lr': 0.0004950334701198222, 'samples': 2190144, 'steps': 11406, 'loss/train': 1.9516605138778687} 11/06/2021 22:49:43 - INFO - __main__ - Step 11414: {'lr': 0.0004950260997475623, 'samples': 2191488, 'steps': 11413, 'loss/train': 1.7055529356002808} 11/06/2021 22:49:46 - INFO - __main__ - Step 11419: {'lr': 0.0004950208318837892, 'samples': 2192448, 'steps': 11418, 'loss/train': 1.5548012256622314} 11/06/2021 22:49:48 - INFO - __main__ - Step 11424: {'lr': 0.0004950155612599511, 'samples': 2193408, 'steps': 11423, 'loss/train': 1.7228354215621948} 11/06/2021 22:49:48 - INFO - __main__ - Step 11424: {'lr': 0.0004950155612599511, 'samples': 2193408, 'steps': 11423, 'loss/train': 1.7228354215621948} 11/06/2021 22:49:51 - INFO - __main__ - Step 11430: {'lr': 0.0004950092328681428, 'samples': 2194560, 'steps': 11429, 'loss/train': 1.3537622690200806} 11/06/2021 22:49:54 - INFO - __main__ - Step 11435: {'lr': 0.0004950039561723703, 'samples': 2195520, 'steps': 11434, 'loss/train': 2.0817348957061768} 11/06/2021 22:49:56 - INFO - __main__ - Step 11440: {'lr': 0.0004949986767167228, 'samples': 2196480, 'steps': 11439, 'loss/train': 1.7402944564819336} 11/06/2021 22:49:58 - INFO - __main__ - Step 11444: {'lr': 0.0004949944511651347, 'samples': 2197248, 'steps': 11443, 'loss/train': 1.7834969758987427} 11/06/2021 22:49:58 - INFO - __main__ - Step 11444: {'lr': 0.0004949944511651347, 'samples': 2197248, 'steps': 11443, 'loss/train': 1.7834969758987427} 11/06/2021 22:50:02 - INFO - __main__ - Step 11451: {'lr': 0.0004949870521998312, 'samples': 2198592, 'steps': 11450, 'loss/train': 1.8337377309799194} 11/06/2021 22:50:04 - INFO - __main__ - Step 11456: {'lr': 0.0004949817639129832, 'samples': 2199552, 'steps': 11455, 'loss/train': 1.85826575756073}4} 11/06/2021 22:50:06 - INFO - __main__ - Step 11460: {'lr': 0.0004949775312965721, 'samples': 2200320, 'steps': 11459, 'loss/train': 1.5449903011322021} 11/06/2021 22:50:08 - INFO - __main__ - Step 11464: {'lr': 0.0004949732969140313, 'samples': 2201088, 'steps': 11463, 'loss/train': 1.5884931087493896} 11/06/2021 22:50:10 - INFO - __main__ - Step 11468: {'lr': 0.0004949690607653916, 'samples': 2201856, 'steps': 11467, 'loss/train': 1.5645625591278076} 11/06/2021 22:50:12 - INFO - __main__ - Step 11472: {'lr': 0.0004949648228506834, 'samples': 2202624, 'steps': 11471, 'loss/train': 1.6954563856124878} 11/06/2021 22:50:14 - INFO - __main__ - Step 11477: {'lr': 0.000494959522973811, 'samples': 2203584, 'steps': 11476, 'loss/train': 1.5947993993759155}} 11/06/2021 22:50:16 - INFO - __main__ - Step 11481: {'lr': 0.0004949552810855605, 'samples': 2204352, 'steps': 11480, 'loss/train': 1.7512733936309814} 11/06/2021 22:50:18 - INFO - __main__ - Step 11485: {'lr': 0.0004949510374313409, 'samples': 2205120, 'steps': 11484, 'loss/train': 1.3778972625732422} 11/06/2021 22:50:20 - INFO - __main__ - Step 11489: {'lr': 0.0004949467920111827, 'samples': 2205888, 'steps': 11488, 'loss/train': 2.2677342891693115} 11/06/2021 22:50:22 - INFO - __main__ - Step 11493: {'lr': 0.0004949425448251166, 'samples': 2206656, 'steps': 11492, 'loss/train': 1.598381757736206}} 11/06/2021 22:50:22 - INFO - __main__ - Step 11493: {'lr': 0.0004949425448251166, 'samples': 2206656, 'steps': 11492, 'loss/train': 1.598381757736206}} 11/06/2021 22:50:26 - INFO - __main__ - Step 11501: {'lr': 0.0004949340451553833, 'samples': 2208192, 'steps': 11500, 'loss/train': 1.7358423471450806} 11/06/2021 22:50:26 - INFO - __main__ - Step 11501: {'lr': 0.0004949340451553833, 'samples': 2208192, 'steps': 11500, 'loss/train': 1.7358423471450806} 11/06/2021 22:50:29 - INFO - __main__ - Step 11508: {'lr': 0.0004949266021502744, 'samples': 2209536, 'steps': 11507, 'loss/train': 1.2840288877487183} 11/06/2021 22:50:32 - INFO - __main__ - Step 11514: {'lr': 0.0004949202181275577, 'samples': 2210688, 'steps': 11513, 'loss/train': 1.9893686771392822} 11/06/2021 22:50:32 - INFO - __main__ - Step 11514: {'lr': 0.0004949202181275577, 'samples': 2210688, 'steps': 11513, 'loss/train': 1.9893686771392822} 11/06/2021 22:50:35 - INFO - __main__ - Step 11521: {'lr': 0.0004949127650798063, 'samples': 2212032, 'steps': 11520, 'loss/train': 1.619840145111084}} 11/06/2021 22:50:37 - INFO - __main__ - Step 11525: {'lr': 0.0004949085037675803, 'samples': 2212800, 'steps': 11524, 'loss/train': 1.7247838973999023} 11/06/2021 22:50:40 - INFO - __main__ - Step 11530: {'lr': 0.0004949031746443816, 'samples': 2213760, 'steps': 11529, 'loss/train': 1.8068318367004395} 11/06/2021 22:50:40 - INFO - __main__ - Step 11530: {'lr': 0.0004949031746443816, 'samples': 2213760, 'steps': 11529, 'loss/train': 1.8068318367004395} 11/06/2021 22:50:44 - INFO - __main__ - Step 11538: {'lr': 0.0004948946423091099, 'samples': 2215296, 'steps': 11537, 'loss/train': 1.8267381191253662} 11/06/2021 22:50:46 - INFO - __main__ - Step 11542: {'lr': 0.0004948903734931608, 'samples': 2216064, 'steps': 11541, 'loss/train': 1.5871480703353882} 11/06/2021 22:50:47 - INFO - __main__ - Step 11546: {'lr': 0.0004948861029117104, 'samples': 2216832, 'steps': 11545, 'loss/train': 1.524583339691162}} 11/06/2021 22:50:50 - INFO - __main__ - Step 11551: {'lr': 0.0004948807622022083, 'samples': 2217792, 'steps': 11550, 'loss/train': 2.3621954917907715} 11/06/2021 22:50:52 - INFO - __main__ - Step 11555: {'lr': 0.000494876487648493, 'samples': 2218560, 'steps': 11554, 'loss/train': 1.359278917312622}5} 11/06/2021 22:50:54 - INFO - __main__ - Step 11559: {'lr': 0.0004948722113293766, 'samples': 2219328, 'steps': 11558, 'loss/train': 1.786012053489685}} 11/06/2021 22:50:54 - INFO - __main__ - Step 11559: {'lr': 0.0004948722113293766, 'samples': 2219328, 'steps': 11558, 'loss/train': 1.786012053489685}} 11/06/2021 22:50:57 - INFO - __main__ - Step 11566: {'lr': 0.0004948647235230192, 'samples': 2220672, 'steps': 11565, 'loss/train': 1.6016157865524292} 11/06/2021 22:51:00 - INFO - __main__ - Step 11571: {'lr': 0.0004948593717799292, 'samples': 2221632, 'steps': 11570, 'loss/train': 1.6391667127609253} 11/06/2021 22:51:02 - INFO - __main__ - Step 11576: {'lr': 0.0004948540172785927, 'samples': 2222592, 'steps': 11575, 'loss/train': 1.5622731447219849} 11/06/2021 22:51:04 - INFO - __main__ - Step 11580: {'lr': 0.0004948497316916267, 'samples': 2223360, 'steps': 11579, 'loss/train': 2.0067150592803955} 11/06/2021 22:51:04 - INFO - __main__ - Step 11580: {'lr': 0.0004948497316916267, 'samples': 2223360, 'steps': 11579, 'loss/train': 2.0067150592803955} 11/06/2021 22:51:07 - INFO - __main__ - Step 11587: {'lr': 0.0004948422276669228, 'samples': 2224704, 'steps': 11586, 'loss/train': 1.91586434841156}5} 11/06/2021 22:51:10 - INFO - __main__ - Step 11592: {'lr': 0.0004948368643396035, 'samples': 2225664, 'steps': 11591, 'loss/train': 1.732541799545288}} 11/06/2021 22:51:12 - INFO - __main__ - Step 11597: {'lr': 0.0004948314982542914, 'samples': 2226624, 'steps': 11596, 'loss/train': 1.7309365272521973} 11/06/2021 22:51:12 - INFO - __main__ - Step 11597: {'lr': 0.0004948314982542914, 'samples': 2226624, 'steps': 11596, 'loss/train': 1.7309365272521973} 11/06/2021 22:51:15 - INFO - __main__ - Step 11604: {'lr': 0.0004948239811015416, 'samples': 2227968, 'steps': 11603, 'loss/train': 1.6033754348754883} 11/06/2021 22:51:18 - INFO - __main__ - Step 11609: {'lr': 0.0004948186083972934, 'samples': 2228928, 'steps': 11608, 'loss/train': 1.552068829536438}} 11/06/2021 22:51:20 - INFO - __main__ - Step 11613: {'lr': 0.0004948143082482852, 'samples': 2229696, 'steps': 11612, 'loss/train': 1.9601466655731201} 11/06/2021 22:51:22 - INFO - __main__ - Step 11618: {'lr': 0.0004948089305800638, 'samples': 2230656, 'steps': 11617, 'loss/train': 1.9734327793121338} 11/06/2021 22:51:22 - INFO - __main__ - Step 11618: {'lr': 0.0004948089305800638, 'samples': 2230656, 'steps': 11617, 'loss/train': 1.9734327793121338} 11/06/2021 22:51:26 - INFO - __main__ - Step 11625: {'lr': 0.000494801397211668, 'samples': 2232000, 'steps': 11624, 'loss/train': 1.5584590435028076}} 11/06/2021 22:51:28 - INFO - __main__ - Step 11629: {'lr': 0.0004947970900030346, 'samples': 2232768, 'steps': 11628, 'loss/train': 1.6569414138793945} 11/06/2021 22:51:30 - INFO - __main__ - Step 11634: {'lr': 0.0004947917035104564, 'samples': 2233728, 'steps': 11633, 'loss/train': 1.9105935096740723} 11/06/2021 22:51:30 - INFO - __main__ - Step 11634: {'lr': 0.0004947917035104564, 'samples': 2233728, 'steps': 11633, 'loss/train': 1.9105935096740723} 11/06/2021 22:51:34 - INFO - __main__ - Step 11642: {'lr': 0.0004947830793867896, 'samples': 2235264, 'steps': 11641, 'loss/train': 2.2054293155670166} 11/06/2021 22:51:36 - INFO - __main__ - Step 11646: {'lr': 0.0004947787646778491, 'samples': 2236032, 'steps': 11645, 'loss/train': 1.7138854265213013} 11/06/2021 22:51:38 - INFO - __main__ - Step 11650: {'lr': 0.0004947744482042122, 'samples': 2236800, 'steps': 11649, 'loss/train': 1.3429278135299683} 11/06/2021 22:51:40 - INFO - __main__ - Step 11655: {'lr': 0.0004947690501306088, 'samples': 2237760, 'steps': 11654, 'loss/train': 2.2113709449768066} 11/06/2021 22:51:42 - INFO - __main__ - Step 11660: {'lr': 0.0004947636492997765, 'samples': 2238720, 'steps': 11659, 'loss/train': 1.6339030265808105} 11/06/2021 22:51:42 - INFO - __main__ - Step 11660: {'lr': 0.0004947636492997765, 'samples': 2238720, 'steps': 11659, 'loss/train': 1.6339030265808105} 11/06/2021 22:51:46 - INFO - __main__ - Step 11667: {'lr': 0.0004947560835045826, 'samples': 2240064, 'steps': 11666, 'loss/train': 1.7168904542922974} 11/06/2021 22:51:48 - INFO - __main__ - Step 11671: {'lr': 0.0004947517577667996, 'samples': 2240832, 'steps': 11670, 'loss/train': 1.9244534969329834} 11/06/2021 22:51:50 - INFO - __main__ - Step 11676: {'lr': 0.0004947463481132438, 'samples': 2241792, 'steps': 11675, 'loss/train': 1.5373907089233398} 11/06/2021 22:51:50 - INFO - __main__ - Step 11676: {'lr': 0.0004947463481132438, 'samples': 2241792, 'steps': 11675, 'loss/train': 1.5373907089233398} 11/06/2021 22:51:54 - INFO - __main__ - Step 11684: {'lr': 0.0004947376869330755, 'samples': 2243328, 'steps': 11683, 'loss/train': 2.4655892848968506} 11/06/2021 22:51:56 - INFO - __main__ - Step 11688: {'lr': 0.0004947333536963753, 'samples': 2244096, 'steps': 11687, 'loss/train': 1.7653391361236572} 11/06/2021 22:51:58 - INFO - __main__ - Step 11692: {'lr': 0.0004947290186953057, 'samples': 2244864, 'steps': 11691, 'loss/train': 1.3742139339447021} 11/06/2021 22:52:00 - INFO - __main__ - Step 11697: {'lr': 0.0004947235974628723, 'samples': 2245824, 'steps': 11696, 'loss/train': 1.703428864479065}} 11/06/2021 22:52:02 - INFO - __main__ - Step 11701: {'lr': 0.0004947192584920866, 'samples': 2246592, 'steps': 11700, 'loss/train': 1.931963324546814}} 11/06/2021 22:52:05 - INFO - __main__ - Step 11705: {'lr': 0.0004947149177570332, 'samples': 2247360, 'steps': 11704, 'loss/train': 1.6442228555679321} 11/06/2021 22:52:06 - INFO - __main__ - Step 11709: {'lr': 0.0004947105752577436, 'samples': 2248128, 'steps': 11708, 'loss/train': 1.3050994873046875} 11/06/2021 22:52:08 - INFO - __main__ - Step 11713: {'lr': 0.000494706230994249, 'samples': 2248896, 'steps': 11712, 'loss/train': 1.7337501049041748}} 11/06/2021 22:52:11 - INFO - __main__ - Step 11717: {'lr': 0.0004947018849665809, 'samples': 2249664, 'steps': 11716, 'loss/train': 1.7236446142196655} 11/06/2021 22:52:13 - INFO - __main__ - Step 11721: {'lr': 0.0004946975371747704, 'samples': 2250432, 'steps': 11720, 'loss/train': 2.156588315963745}} 11/06/2021 22:52:14 - INFO - __main__ - Step 11725: {'lr': 0.000494693187618849, 'samples': 2251200, 'steps': 11724, 'loss/train': 1.6982334852218628}} 11/06/2021 22:52:16 - INFO - __main__ - Step 11729: {'lr': 0.000494688836298848, 'samples': 2251968, 'steps': 11728, 'loss/train': 1.9926396608352661}} 11/06/2021 22:52:18 - INFO - __main__ - Step 11734: {'lr': 0.0004946833946681575, 'samples': 2252928, 'steps': 11733, 'loss/train': 1.9588221311569214} 11/06/2021 22:52:18 - INFO - __main__ - Step 11734: {'lr': 0.0004946833946681575, 'samples': 2252928, 'steps': 11733, 'loss/train': 1.9588221311569214} 11/06/2021 22:52:23 - INFO - __main__ - Step 11742: {'lr': 0.0004946746823260491, 'samples': 2254464, 'steps': 11741, 'loss/train': 0.8344317078590393} 11/06/2021 22:52:24 - INFO - __main__ - Step 11746: {'lr': 0.00049467032350906, 'samples': 2255232, 'steps': 11745, 'loss/train': 1.7040636539459229}3} 11/06/2021 22:52:26 - INFO - __main__ - Step 11750: {'lr': 0.0004946659629281561, 'samples': 2256000, 'steps': 11749, 'loss/train': 1.6414493322372437} 11/06/2021 22:52:29 - INFO - __main__ - Step 11755: {'lr': 0.0004946605097215691, 'samples': 2256960, 'steps': 11754, 'loss/train': 1.4197173118591309} 11/06/2021 22:52:29 - INFO - __main__ - Step 11755: {'lr': 0.0004946605097215691, 'samples': 2256960, 'steps': 11754, 'loss/train': 1.4197173118591309} 11/06/2021 22:52:32 - INFO - __main__ - Step 11762: {'lr': 0.0004946528706022703, 'samples': 2258304, 'steps': 11761, 'loss/train': 2.078275442123413}} 11/06/2021 22:52:34 - INFO - __main__ - Step 11766: {'lr': 0.0004946485029660219, 'samples': 2259072, 'steps': 11765, 'loss/train': 1.9496246576309204} 11/06/2021 22:52:37 - INFO - __main__ - Step 11771: {'lr': 0.0004946430409404311, 'samples': 2260032, 'steps': 11770, 'loss/train': 1.5809372663497925} 11/06/2021 22:52:37 - INFO - __main__ - Step 11771: {'lr': 0.0004946430409404311, 'samples': 2260032, 'steps': 11770, 'loss/train': 1.5809372663497925} 11/06/2021 22:52:41 - INFO - __main__ - Step 11779: {'lr': 0.0004946342959674278, 'samples': 2261568, 'steps': 11778, 'loss/train': 1.6700574159622192} 11/06/2021 22:52:42 - INFO - __main__ - Step 11783: {'lr': 0.0004946299208354279, 'samples': 2262336, 'steps': 11782, 'loss/train': 1.7613064050674438} 11/06/2021 22:52:45 - INFO - __main__ - Step 11788: {'lr': 0.0004946244494403361, 'samples': 2263296, 'steps': 11787, 'loss/train': 1.7674616575241089} 11/06/2021 22:52:47 - INFO - __main__ - Step 11793: {'lr': 0.0004946189752896443, 'samples': 2264256, 'steps': 11792, 'loss/train': 1.361173152923584}} 11/06/2021 22:52:47 - INFO - __main__ - Step 11793: {'lr': 0.0004946189752896443, 'samples': 2264256, 'steps': 11792, 'loss/train': 1.361173152923584}} 11/06/2021 22:52:47 - INFO - __main__ - Step 11793: {'lr': 0.0004946189752896443, 'samples': 2264256, 'steps': 11792, 'loss/train': 1.361173152923584}} 11/06/2021 22:52:52 - INFO - __main__ - Step 11803: {'lr': 0.0004946080187217072, 'samples': 2266176, 'steps': 11802, 'loss/train': 1.8065892457962036} 11/06/2021 22:52:54 - INFO - __main__ - Step 11808: {'lr': 0.0004946025363045854, 'samples': 2267136, 'steps': 11807, 'loss/train': 1.8952033519744873} 11/06/2021 22:52:54 - INFO - __main__ - Step 11808: {'lr': 0.0004946025363045854, 'samples': 2267136, 'steps': 11807, 'loss/train': 1.8952033519744873} 11/06/2021 22:52:54 - INFO - __main__ - Step 11808: {'lr': 0.0004946025363045854, 'samples': 2267136, 'steps': 11807, 'loss/train': 1.8952033519744873} 11/06/2021 22:53:00 - INFO - __main__ - Step 11819: {'lr': 0.0004945904652881611, 'samples': 2269248, 'steps': 11818, 'loss/train': 1.905003547668457}} 11/06/2021 22:53:00 - INFO - __main__ - Step 11819: {'lr': 0.0004945904652881611, 'samples': 2269248, 'steps': 11818, 'loss/train': 1.905003547668457}} 11/06/2021 22:53:04 - INFO - __main__ - Step 11827: {'lr': 0.0004945816779912272, 'samples': 2270784, 'steps': 11826, 'loss/train': 1.1215801239013672} 11/06/2021 22:53:06 - INFO - __main__ - Step 11831: {'lr': 0.0004945772816978309, 'samples': 2271552, 'steps': 11830, 'loss/train': 1.8250958919525146} 11/06/2021 22:53:08 - INFO - __main__ - Step 11836: {'lr': 0.0004945717838515275, 'samples': 2272512, 'steps': 11835, 'loss/train': 1.7910774946212769} 11/06/2021 22:53:11 - INFO - __main__ - Step 11841: {'lr': 0.0004945662832502171, 'samples': 2273472, 'steps': 11840, 'loss/train': 1.478073000907898}} 11/06/2021 22:53:13 - INFO - __main__ - Step 11845: {'lr': 0.0004945618807856056, 'samples': 2274240, 'steps': 11844, 'loss/train': 1.2191367149353027} 11/06/2021 22:53:15 - INFO - __main__ - Step 11849: {'lr': 0.0004945574765578612, 'samples': 2275008, 'steps': 11848, 'loss/train': 1.4972808361053467} 11/06/2021 22:53:16 - INFO - __main__ - Step 11853: {'lr': 0.0004945530705670156, 'samples': 2275776, 'steps': 11852, 'loss/train': 1.7779545783996582} 11/06/2021 22:53:19 - INFO - __main__ - Step 11857: {'lr': 0.0004945486628131006, 'samples': 2276544, 'steps': 11856, 'loss/train': 1.342666506767273}} 11/06/2021 22:53:21 - INFO - __main__ - Step 11862: {'lr': 0.0004945431506414386, 'samples': 2277504, 'steps': 11861, 'loss/train': 2.1039764881134033} 11/06/2021 22:53:21 - INFO - __main__ - Step 11862: {'lr': 0.0004945431506414386, 'samples': 2277504, 'steps': 11861, 'loss/train': 2.1039764881134033} 11/06/2021 22:53:24 - INFO - __main__ - Step 11869: {'lr': 0.0004945354289732565, 'samples': 2278848, 'steps': 11868, 'loss/train': 1.6801567077636719} 11/06/2021 22:53:26 - INFO - __main__ - Step 11873: {'lr': 0.0004945310141673816, 'samples': 2279616, 'steps': 11872, 'loss/train': 1.7067826986312866} 11/06/2021 22:53:29 - INFO - __main__ - Step 11878: {'lr': 0.0004945254931809489, 'samples': 2280576, 'steps': 11877, 'loss/train': 1.5857558250427246} 11/06/2021 22:53:31 - INFO - __main__ - Step 11882: {'lr': 0.0004945210744085702, 'samples': 2281344, 'steps': 11881, 'loss/train': 2.2412431240081787} 11/06/2021 22:53:31 - INFO - __main__ - Step 11882: {'lr': 0.0004945210744085702, 'samples': 2281344, 'steps': 11881, 'loss/train': 2.2412431240081787} 11/06/2021 22:53:34 - INFO - __main__ - Step 11889: {'lr': 0.000494513337315096, 'samples': 2282688, 'steps': 11888, 'loss/train': 1.7061456441879272}} 11/06/2021 22:53:37 - INFO - __main__ - Step 11894: {'lr': 0.0004945078075145292, 'samples': 2283648, 'steps': 11893, 'loss/train': 2.0001227855682373} 11/06/2021 22:53:39 - INFO - __main__ - Step 11899: {'lr': 0.0004945022749596764, 'samples': 2284608, 'steps': 11898, 'loss/train': 1.6990892887115479} 11/06/2021 22:53:41 - INFO - __main__ - Step 11903: {'lr': 0.0004944978469327499, 'samples': 2285376, 'steps': 11902, 'loss/train': 1.8978601694107056} 11/06/2021 22:53:43 - INFO - __main__ - Step 11907: {'lr': 0.0004944934171431522, 'samples': 2286144, 'steps': 11906, 'loss/train': 1.794429898262024}} 11/06/2021 22:53:43 - INFO - __main__ - Step 11907: {'lr': 0.0004944934171431522, 'samples': 2286144, 'steps': 11906, 'loss/train': 1.794429898262024}} 11/06/2021 22:53:46 - INFO - __main__ - Step 11914: {'lr': 0.0004944856607700243, 'samples': 2287488, 'steps': 11913, 'loss/train': 1.3514317274093628} 11/06/2021 22:53:49 - INFO - __main__ - Step 11920: {'lr': 0.0004944790081538969, 'samples': 2288640, 'steps': 11919, 'loss/train': 1.7780016660690308} 11/06/2021 22:53:51 - INFO - __main__ - Step 11924: {'lr': 0.0004944745708733025, 'samples': 2289408, 'steps': 11923, 'loss/train': 2.0285587310791016} 11/06/2021 22:53:53 - INFO - __main__ - Step 11928: {'lr': 0.0004944701318302046, 'samples': 2290176, 'steps': 11927, 'loss/train': 1.5972325801849365} 11/06/2021 22:53:55 - INFO - __main__ - Step 11932: {'lr': 0.0004944656910246352, 'samples': 2290944, 'steps': 11931, 'loss/train': 1.651598572731018}} 11/06/2021 22:53:57 - INFO - __main__ - Step 11936: {'lr': 0.0004944612484566263, 'samples': 2291712, 'steps': 11935, 'loss/train': 1.43559730052948}}} 11/06/2021 22:53:59 - INFO - __main__ - Step 11941: {'lr': 0.0004944556927682335, 'samples': 2292672, 'steps': 11940, 'loss/train': 1.7804417610168457} 11/06/2021 22:54:01 - INFO - __main__ - Step 11945: {'lr': 0.0004944512462348528, 'samples': 2293440, 'steps': 11944, 'loss/train': 1.9006822109222412} 11/06/2021 22:54:01 - INFO - __main__ - Step 11945: {'lr': 0.0004944512462348528, 'samples': 2293440, 'steps': 11944, 'loss/train': 1.9006822109222412} 11/06/2021 22:54:04 - INFO - __main__ - Step 11952: {'lr': 0.0004944434605608367, 'samples': 2294784, 'steps': 11951, 'loss/train': 1.2838270664215088} 11/06/2021 22:54:06 - INFO - __main__ - Step 11956: {'lr': 0.0004944390091811111, 'samples': 2295552, 'steps': 11955, 'loss/train': 1.7346197366714478} 11/06/2021 22:54:09 - INFO - __main__ - Step 11962: {'lr': 0.0004944323288073192, 'samples': 2296704, 'steps': 11961, 'loss/train': 1.4151784181594849} 11/06/2021 22:54:11 - INFO - __main__ - Step 11966: {'lr': 0.0004944278730220359, 'samples': 2297472, 'steps': 11965, 'loss/train': 1.4997888803482056} 11/06/2021 22:54:11 - INFO - __main__ - Step 11966: {'lr': 0.0004944278730220359, 'samples': 2297472, 'steps': 11965, 'loss/train': 1.4997888803482056} 11/06/2021 22:54:15 - INFO - __main__ - Step 11973: {'lr': 0.0004944200711575956, 'samples': 2298816, 'steps': 11972, 'loss/train': 1.9672355651855469} 11/06/2021 22:54:17 - INFO - __main__ - Step 11977: {'lr': 0.0004944156105264308, 'samples': 2299584, 'steps': 11976, 'loss/train': 0.3471572995185852} 11/06/2021 22:54:19 - INFO - __main__ - Step 11982: {'lr': 0.0004944100322595558, 'samples': 2300544, 'steps': 11981, 'loss/train': 1.5777186155319214} 11/06/2021 22:54:21 - INFO - __main__ - Step 11986: {'lr': 0.0004944055676637599, 'samples': 2301312, 'steps': 11985, 'loss/train': 1.4917117357254028} 11/06/2021 22:54:21 - INFO - __main__ - Step 11986: {'lr': 0.0004944055676637599, 'samples': 2301312, 'steps': 11985, 'loss/train': 1.4917117357254028} 11/06/2021 22:54:25 - INFO - __main__ - Step 11993: {'lr': 0.0004943977503813092, 'samples': 2302656, 'steps': 11992, 'loss/train': 1.3840059041976929} 11/06/2021 22:54:27 - INFO - __main__ - Step 11998: {'lr': 0.0004943921633044644, 'samples': 2303616, 'steps': 11997, 'loss/train': 1.752882480621338}} 11/06/2021 22:54:29 - INFO - __main__ - Step 12003: {'lr': 0.0004943865734746364, 'samples': 2304576, 'steps': 12002, 'loss/train': 1.8876904249191284} 11/06/2021 22:54:29 - INFO - __main__ - Step 12003: {'lr': 0.0004943865734746364, 'samples': 2304576, 'steps': 12002, 'loss/train': 1.8876904249191284} 11/06/2021 22:54:33 - INFO - __main__ - Step 12010: {'lr': 0.0004943787430879846, 'samples': 2305920, 'steps': 12009, 'loss/train': 1.99528169631958}4} 11/06/2021 22:54:35 - INFO - __main__ - Step 12014: {'lr': 0.000494374266158823, 'samples': 2306688, 'steps': 12013, 'loss/train': 1.9051744937896729}} 11/06/2021 22:54:37 - INFO - __main__ - Step 12018: {'lr': 0.000494369787467881, 'samples': 2307456, 'steps': 12017, 'loss/train': 1.328963041305542}}} 11/06/2021 22:54:37 - INFO - __main__ - Step 12018: {'lr': 0.000494369787467881, 'samples': 2307456, 'steps': 12017, 'loss/train': 1.328963041305542}}} 11/06/2021 22:54:37 - INFO - __main__ - Step 12018: {'lr': 0.000494369787467881, 'samples': 2307456, 'steps': 12017, 'loss/train': 1.328963041305542}}} 11/06/2021 22:54:42 - INFO - __main__ - Step 12029: {'lr': 0.0004943574619838741, 'samples': 2309568, 'steps': 12028, 'loss/train': 1.7877787351608276} 11/06/2021 22:54:45 - INFO - __main__ - Step 12034: {'lr': 0.0004943518550869552, 'samples': 2310528, 'steps': 12033, 'loss/train': 1.0278202295303345} 11/06/2021 22:54:47 - INFO - __main__ - Step 12039: {'lr': 0.0004943462454375069, 'samples': 2311488, 'steps': 12038, 'loss/train': 1.6484863758087158} 11/06/2021 22:54:50 - INFO - __main__ - Step 12044: {'lr': 0.0004943406330355925, 'samples': 2312448, 'steps': 12043, 'loss/train': 1.159117341041565}} 11/06/2021 22:54:50 - INFO - __main__ - Step 12044: {'lr': 0.0004943406330355925, 'samples': 2312448, 'steps': 12043, 'loss/train': 1.159117341041565}} 11/06/2021 22:54:50 - INFO - __main__ - Step 12044: {'lr': 0.0004943406330355925, 'samples': 2312448, 'steps': 12043, 'loss/train': 1.159117341041565}} 11/06/2021 22:54:54 - INFO - __main__ - Step 12054: {'lr': 0.0004943293999746179, 'samples': 2314368, 'steps': 12053, 'loss/train': 1.585003137588501}} 11/06/2021 22:54:57 - INFO - __main__ - Step 12059: {'lr': 0.0004943237793156844, 'samples': 2315328, 'steps': 12058, 'loss/train': 2.4570651054382324} 11/06/2021 22:54:59 - INFO - __main__ - Step 12063: {'lr': 0.0004943192808069411, 'samples': 2316096, 'steps': 12062, 'loss/train': 1.2292014360427856} 11/06/2021 22:54:59 - INFO - __main__ - Step 12063: {'lr': 0.0004943192808069411, 'samples': 2316096, 'steps': 12062, 'loss/train': 1.2292014360427856} 11/06/2021 22:55:03 - INFO - __main__ - Step 12070: {'lr': 0.0004943114041783296, 'samples': 2317440, 'steps': 12069, 'loss/train': 1.788710355758667}} 11/06/2021 22:55:05 - INFO - __main__ - Step 12075: {'lr': 0.0004943057747125371, 'samples': 2318400, 'steps': 12074, 'loss/train': 1.8841195106506348} 11/06/2021 22:55:07 - INFO - __main__ - Step 12079: {'lr': 0.0004943012691584526, 'samples': 2319168, 'steps': 12078, 'loss/train': 1.6468000411987305} 11/06/2021 22:55:07 - INFO - __main__ - Step 12079: {'lr': 0.0004943012691584526, 'samples': 2319168, 'steps': 12078, 'loss/train': 1.6468000411987305} 11/06/2021 22:55:11 - INFO - __main__ - Step 12086: {'lr': 0.0004942933802008066, 'samples': 2320512, 'steps': 12085, 'loss/train': 2.0663843154907227} 11/06/2021 22:55:13 - INFO - __main__ - Step 12090: {'lr': 0.0004942888698033515, 'samples': 2321280, 'steps': 12089, 'loss/train': 1.889116883277893}} 11/06/2021 22:55:15 - INFO - __main__ - Step 12094: {'lr': 0.0004942843576447316, 'samples': 2322048, 'steps': 12093, 'loss/train': 1.6451750993728638} 11/06/2021 22:55:17 - INFO - __main__ - Step 12099: {'lr': 0.0004942787149698687, 'samples': 2323008, 'steps': 12098, 'loss/train': 1.9871211051940918} 11/06/2021 22:55:17 - INFO - __main__ - Step 12099: {'lr': 0.0004942787149698687, 'samples': 2323008, 'steps': 12098, 'loss/train': 1.9871211051940918} 11/06/2021 22:55:21 - INFO - __main__ - Step 12107: {'lr': 0.0004942696809665668, 'samples': 2324544, 'steps': 12106, 'loss/train': 1.9903008937835693} 11/06/2021 22:55:23 - INFO - __main__ - Step 12111: {'lr': 0.0004942651613233599, 'samples': 2325312, 'steps': 12110, 'loss/train': 1.8227951526641846} 11/06/2021 22:55:25 - INFO - __main__ - Step 12115: {'lr': 0.0004942606399191593, 'samples': 2326080, 'steps': 12114, 'loss/train': 1.4733009338378906} 11/06/2021 22:55:25 - INFO - __main__ - Step 12115: {'lr': 0.0004942606399191593, 'samples': 2326080, 'steps': 12114, 'loss/train': 1.4733009338378906} 11/06/2021 22:55:25 - INFO - __main__ - Step 12115: {'lr': 0.0004942606399191593, 'samples': 2326080, 'steps': 12114, 'loss/train': 1.4733009338378906} 11/06/2021 22:55:31 - INFO - __main__ - Step 12126: {'lr': 0.0004942481969777495, 'samples': 2328192, 'steps': 12125, 'loss/train': 2.258380889892578}} 11/06/2021 22:55:33 - INFO - __main__ - Step 12131: {'lr': 0.000494242536693071, 'samples': 2329152, 'steps': 12130, 'loss/train': 1.718927264213562}}} 11/06/2021 22:55:35 - INFO - __main__ - Step 12135: {'lr': 0.0004942380064843906, 'samples': 2329920, 'steps': 12134, 'loss/train': 1.8256616592407227} 11/06/2021 22:55:38 - INFO - __main__ - Step 12139: {'lr': 0.0004942334745149122, 'samples': 2330688, 'steps': 12138, 'loss/train': 2.0332870483398438} 11/06/2021 22:55:40 - INFO - __main__ - Step 12143: {'lr': 0.0004942289407846684, 'samples': 2331456, 'steps': 12142, 'loss/train': 0.5399057865142822} 11/06/2021 22:55:41 - INFO - __main__ - Step 12147: {'lr': 0.000494224405293692, 'samples': 2332224, 'steps': 12146, 'loss/train': 1.870377779006958}2} 11/06/2021 22:55:43 - INFO - __main__ - Step 12152: {'lr': 0.0004942187334539908, 'samples': 2333184, 'steps': 12151, 'loss/train': 1.949316382408142}} 11/06/2021 22:55:45 - INFO - __main__ - Step 12156: {'lr': 0.0004942141940014854, 'samples': 2333952, 'steps': 12155, 'loss/train': 1.7409350872039795} 11/06/2021 22:55:47 - INFO - __main__ - Step 12160: {'lr': 0.0004942096527883538, 'samples': 2334720, 'steps': 12159, 'loss/train': 1.4643501043319702} 11/06/2021 22:55:50 - INFO - __main__ - Step 12164: {'lr': 0.0004942051098146284, 'samples': 2335488, 'steps': 12163, 'loss/train': 1.9286428689956665} 11/06/2021 22:55:51 - INFO - __main__ - Step 12168: {'lr': 0.0004942005650803421, 'samples': 2336256, 'steps': 12167, 'loss/train': 1.5838422775268555} 11/06/2021 22:55:53 - INFO - __main__ - Step 12173: {'lr': 0.0004941948816867455, 'samples': 2337216, 'steps': 12172, 'loss/train': 1.5987201929092407} 11/06/2021 22:55:56 - INFO - __main__ - Step 12178: {'lr': 0.0004941891955423878, 'samples': 2338176, 'steps': 12177, 'loss/train': 2.281742811203003}} 11/06/2021 22:55:56 - INFO - __main__ - Step 12178: {'lr': 0.0004941891955423878, 'samples': 2338176, 'steps': 12177, 'loss/train': 2.281742811203003}} 11/06/2021 22:56:00 - INFO - __main__ - Step 12185: {'lr': 0.0004941812303191302, 'samples': 2339520, 'steps': 12184, 'loss/train': 1.8058403730392456} 11/06/2021 22:56:01 - INFO - __main__ - Step 12189: {'lr': 0.0004941766763424373, 'samples': 2340288, 'steps': 12188, 'loss/train': 1.433286190032959}} 11/06/2021 22:56:03 - INFO - __main__ - Step 12193: {'lr': 0.0004941721206053885, 'samples': 2341056, 'steps': 12192, 'loss/train': 3.6256818771362305} 11/06/2021 22:56:06 - INFO - __main__ - Step 12198: {'lr': 0.000494166423458627, 'samples': 2342016, 'steps': 12197, 'loss/train': 2.043381929397583}5} 11/06/2021 22:56:06 - INFO - __main__ - Step 12198: {'lr': 0.000494166423458627, 'samples': 2342016, 'steps': 12197, 'loss/train': 2.043381929397583}5} 11/06/2021 22:56:10 - INFO - __main__ - Step 12206: {'lr': 0.000494157302302919, 'samples': 2343552, 'steps': 12205, 'loss/train': 1.4476592540740967}} 11/06/2021 22:56:11 - INFO - __main__ - Step 12210: {'lr': 0.0004941527390847243, 'samples': 2344320, 'steps': 12209, 'loss/train': 1.1334985494613647} 11/06/2021 22:56:11 - INFO - __main__ - Step 12210: {'lr': 0.0004941527390847243, 'samples': 2344320, 'steps': 12209, 'loss/train': 1.1334985494613647} 11/06/2021 22:56:15 - INFO - __main__ - Step 12218: {'lr': 0.0004941436073678179, 'samples': 2345856, 'steps': 12217, 'loss/train': 1.678484320640564}} 11/06/2021 22:56:17 - INFO - __main__ - Step 12222: {'lr': 0.0004941390388691719, 'samples': 2346624, 'steps': 12221, 'loss/train': 1.6822108030319214} 11/06/2021 22:56:19 - INFO - __main__ - Step 12226: {'lr': 0.0004941344686104414, 'samples': 2347392, 'steps': 12225, 'loss/train': 1.5134607553482056} 11/06/2021 22:56:21 - INFO - __main__ - Step 12231: {'lr': 0.0004941287533119597, 'samples': 2348352, 'steps': 12230, 'loss/train': 1.1296072006225586} 11/06/2021 22:56:21 - INFO - __main__ - Step 12231: {'lr': 0.0004941287533119597, 'samples': 2348352, 'steps': 12230, 'loss/train': 1.1296072006225586} 11/06/2021 22:56:25 - INFO - __main__ - Step 12238: {'lr': 0.0004941207472740724, 'samples': 2349696, 'steps': 12237, 'loss/train': 1.9084161520004272} 11/06/2021 22:56:27 - INFO - __main__ - Step 12242: {'lr': 0.0004941161699753335, 'samples': 2350464, 'steps': 12241, 'loss/train': 2.5496790409088135} 11/06/2021 22:56:30 - INFO - __main__ - Step 12248: {'lr': 0.0004941093007273859, 'samples': 2351616, 'steps': 12247, 'loss/train': 2.001997470855713}} 11/06/2021 22:56:30 - INFO - __main__ - Step 12248: {'lr': 0.0004941093007273859, 'samples': 2351616, 'steps': 12247, 'loss/train': 2.001997470855713}} 11/06/2021 22:56:34 - INFO - __main__ - Step 12255: {'lr': 0.0004941012816001575, 'samples': 2352960, 'steps': 12254, 'loss/train': 1.6902923583984375} 11/06/2021 22:56:35 - INFO - __main__ - Step 12259: {'lr': 0.0004940966968219881, 'samples': 2353728, 'steps': 12258, 'loss/train': 1.807405948638916}} 11/06/2021 22:56:37 - INFO - __main__ - Step 12264: {'lr': 0.0004940909633745905, 'samples': 2354688, 'steps': 12263, 'loss/train': 1.749647617340088}} 11/06/2021 22:56:37 - INFO - __main__ - Step 12264: {'lr': 0.0004940909633745905, 'samples': 2354688, 'steps': 12263, 'loss/train': 1.749647617340088}} 11/06/2021 22:56:41 - INFO - __main__ - Step 12271: {'lr': 0.0004940829319289361, 'samples': 2356032, 'steps': 12270, 'loss/train': 1.8408467769622803} 11/06/2021 22:56:43 - INFO - __main__ - Step 12275: {'lr': 0.000494078340111848, 'samples': 2356800, 'steps': 12274, 'loss/train': 1.8969393968582153}} 11/06/2021 22:56:45 - INFO - __main__ - Step 12280: {'lr': 0.0004940725978659881, 'samples': 2357760, 'steps': 12279, 'loss/train': 1.9200348854064941} 11/06/2021 22:56:48 - INFO - __main__ - Step 12285: {'lr': 0.0004940668528707446, 'samples': 2358720, 'steps': 12284, 'loss/train': 1.4615005254745483} 11/06/2021 22:56:48 - INFO - __main__ - Step 12285: {'lr': 0.0004940668528707446, 'samples': 2358720, 'steps': 12284, 'loss/train': 1.4615005254745483} 11/06/2021 22:56:51 - INFO - __main__ - Step 12292: {'lr': 0.0004940588052585624, 'samples': 2360064, 'steps': 12291, 'loss/train': 1.7617274522781372} 11/06/2021 22:56:53 - INFO - __main__ - Step 12296: {'lr': 0.0004940542042036974, 'samples': 2360832, 'steps': 12295, 'loss/train': 1.7313815355300903} 11/06/2021 22:56:55 - INFO - __main__ - Step 12301: {'lr': 0.0004940484504108612, 'samples': 2361792, 'steps': 12300, 'loss/train': 1.4371073246002197} 11/06/2021 22:56:55 - INFO - __main__ - Step 12301: {'lr': 0.0004940484504108612, 'samples': 2361792, 'steps': 12300, 'loss/train': 1.4371073246002197} 11/06/2021 22:57:00 - INFO - __main__ - Step 12309: {'lr': 0.0004940392386241981, 'samples': 2363328, 'steps': 12308, 'loss/train': 2.2593603134155273} 11/06/2021 22:57:01 - INFO - __main__ - Step 12313: {'lr': 0.0004940346300918024, 'samples': 2364096, 'steps': 12312, 'loss/train': 1.6373891830444336} 11/06/2021 22:57:03 - INFO - __main__ - Step 12317: {'lr': 0.0004940300198000748, 'samples': 2364864, 'steps': 12316, 'loss/train': 1.8684685230255127} 11/06/2021 22:57:06 - INFO - __main__ - Step 12322: {'lr': 0.0004940242544614056, 'samples': 2365824, 'steps': 12321, 'loss/train': 1.541722297668457}} 11/06/2021 22:57:08 - INFO - __main__ - Step 12326: {'lr': 0.0004940196402113031, 'samples': 2366592, 'steps': 12325, 'loss/train': 1.683254361152649}} 11/06/2021 22:57:08 - INFO - __main__ - Step 12326: {'lr': 0.0004940196402113031, 'samples': 2366592, 'steps': 12325, 'loss/train': 1.683254361152649}} 11/06/2021 22:57:11 - INFO - __main__ - Step 12333: {'lr': 0.0004940115610405114, 'samples': 2367936, 'steps': 12332, 'loss/train': 1.9940561056137085} 11/06/2021 22:57:14 - INFO - __main__ - Step 12339: {'lr': 0.000494004631749003, 'samples': 2369088, 'steps': 12338, 'loss/train': 1.7027802467346191}} 11/06/2021 22:57:16 - INFO - __main__ - Step 12343: {'lr': 0.0004940000100224295, 'samples': 2369856, 'steps': 12342, 'loss/train': 1.2512726783752441} 11/06/2021 22:57:18 - INFO - __main__ - Step 12347: {'lr': 0.0004939953865367735, 'samples': 2370624, 'steps': 12346, 'loss/train': 1.1613168716430664} 11/06/2021 22:57:18 - INFO - __main__ - Step 12347: {'lr': 0.0004939953865367735, 'samples': 2370624, 'steps': 12346, 'loss/train': 1.1613168716430664} 11/06/2021 22:57:21 - INFO - __main__ - Step 12354: {'lr': 0.0004939872912041844, 'samples': 2371968, 'steps': 12353, 'loss/train': 1.7622441053390503} 11/06/2021 22:57:24 - INFO - __main__ - Step 12360: {'lr': 0.0004939803480601333, 'samples': 2373120, 'steps': 12359, 'loss/train': 1.4958027601242065} 11/06/2021 22:57:24 - INFO - __main__ - Step 12360: {'lr': 0.0004939803480601333, 'samples': 2373120, 'steps': 12359, 'loss/train': 1.4958027601242065} 11/06/2021 22:57:27 - INFO - __main__ - Step 12366: {'lr': 0.0004939734009584661, 'samples': 2374272, 'steps': 12365, 'loss/train': 2.075040578842163}} 11/06/2021 22:57:29 - INFO - __main__ - Step 12370: {'lr': 0.0004939687673587346, 'samples': 2375040, 'steps': 12369, 'loss/train': 1.79799222946167}}} 11/06/2021 22:57:31 - INFO - __main__ - Step 12375: {'lr': 0.0004939629728856817, 'samples': 2376000, 'steps': 12374, 'loss/train': 1.2412973642349243} 11/06/2021 22:57:34 - INFO - __main__ - Step 12380: {'lr': 0.0004939571756644799, 'samples': 2376960, 'steps': 12379, 'loss/train': 1.6386349201202393} 11/06/2021 22:57:34 - INFO - __main__ - Step 12380: {'lr': 0.0004939571756644799, 'samples': 2376960, 'steps': 12379, 'loss/train': 1.6386349201202393} 11/06/2021 22:57:37 - INFO - __main__ - Step 12386: {'lr': 0.0004939502153715733, 'samples': 2378112, 'steps': 12385, 'loss/train': 1.881219744682312}} 11/06/2021 22:57:40 - INFO - __main__ - Step 12391: {'lr': 0.0004939444121046741, 'samples': 2379072, 'steps': 12390, 'loss/train': 1.171319603919983}} 11/06/2021 22:57:42 - INFO - __main__ - Step 12395: {'lr': 0.000493939767512635, 'samples': 2379840, 'steps': 12394, 'loss/train': 1.767325758934021}}} 11/06/2021 22:57:42 - INFO - __main__ - Step 12395: {'lr': 0.000493939767512635, 'samples': 2379840, 'steps': 12394, 'loss/train': 1.767325758934021}}} 11/06/2021 22:57:45 - INFO - __main__ - Step 12402: {'lr': 0.0004939316352448403, 'samples': 2381184, 'steps': 12401, 'loss/train': 1.0791970491409302} 11/06/2021 22:57:48 - INFO - __main__ - Step 12408: {'lr': 0.0004939246604430195, 'samples': 2382336, 'steps': 12407, 'loss/train': 1.8069547414779663} 11/06/2021 22:57:50 - INFO - __main__ - Step 12413: {'lr': 0.000493918845085675, 'samples': 2383296, 'steps': 12412, 'loss/train': 1.7577980756759644}} 11/06/2021 22:57:50 - INFO - __main__ - Step 12413: {'lr': 0.000493918845085675, 'samples': 2383296, 'steps': 12412, 'loss/train': 1.7577980756759644}} 11/06/2021 22:57:53 - INFO - __main__ - Step 12419: {'lr': 0.0004939118630299672, 'samples': 2384448, 'steps': 12418, 'loss/train': 1.799317479133606}} 11/06/2021 22:57:55 - INFO - __main__ - Step 12423: {'lr': 0.0004939072061280967, 'samples': 2385216, 'steps': 12422, 'loss/train': 1.4663667678833008} 11/06/2021 22:57:58 - INFO - __main__ - Step 12428: {'lr': 0.0004939013825279939, 'samples': 2386176, 'steps': 12427, 'loss/train': 1.5649765729904175} 11/06/2021 22:58:00 - INFO - __main__ - Step 12433: {'lr': 0.0004938955561804361, 'samples': 2387136, 'steps': 12432, 'loss/train': 1.6927978992462158} 11/06/2021 22:58:00 - INFO - __main__ - Step 12433: {'lr': 0.0004938955561804361, 'samples': 2387136, 'steps': 12432, 'loss/train': 1.6927978992462158} 11/06/2021 22:58:03 - INFO - __main__ - Step 12440: {'lr': 0.0004938873946782557, 'samples': 2388480, 'steps': 12439, 'loss/train': 1.3930256366729736} 11/06/2021 22:58:05 - INFO - __main__ - Step 12444: {'lr': 0.0004938827285450908, 'samples': 2389248, 'steps': 12443, 'loss/train': 1.6230531930923462} 11/06/2021 22:58:08 - INFO - __main__ - Step 12449: {'lr': 0.0004938768934061182, 'samples': 2390208, 'steps': 12448, 'loss/train': 1.9037350416183472} 11/06/2021 22:58:10 - INFO - __main__ - Step 12453: {'lr': 0.000493872223316968, 'samples': 2390976, 'steps': 12452, 'loss/train': 1.4199427366256714}} 11/06/2021 22:58:10 - INFO - __main__ - Step 12453: {'lr': 0.000493872223316968, 'samples': 2390976, 'steps': 12452, 'loss/train': 1.4199427366256714}} 11/06/2021 22:58:13 - INFO - __main__ - Step 12460: {'lr': 0.0004938640464304006, 'samples': 2392320, 'steps': 12459, 'loss/train': 1.7510279417037964} 11/06/2021 22:58:15 - INFO - __main__ - Step 12464: {'lr': 0.0004938593715063888, 'samples': 2393088, 'steps': 12463, 'loss/train': 1.430334448814392}} 11/06/2021 22:58:18 - INFO - __main__ - Step 12469: {'lr': 0.0004938535253790944, 'samples': 2394048, 'steps': 12468, 'loss/train': 1.569968342781067}} 11/06/2021 22:58:20 - INFO - __main__ - Step 12473: {'lr': 0.0004938488464994764, 'samples': 2394816, 'steps': 12472, 'loss/train': 1.2448487281799316} 11/06/2021 22:58:22 - INFO - __main__ - Step 12477: {'lr': 0.0004938441658618659, 'samples': 2395584, 'steps': 12476, 'loss/train': 1.6196297407150269} 11/06/2021 22:58:23 - INFO - __main__ - Step 12481: {'lr': 0.0004938394834662966, 'samples': 2396352, 'steps': 12480, 'loss/train': 0.9850826859474182} 11/06/2021 22:58:25 - INFO - __main__ - Step 12485: {'lr': 0.0004938347993128025, 'samples': 2397120, 'steps': 12484, 'loss/train': 1.588280439376831}} 11/06/2021 22:58:28 - INFO - __main__ - Step 12490: {'lr': 0.0004938289416489042, 'samples': 2398080, 'steps': 12489, 'loss/train': 2.0924293994903564} 11/06/2021 22:58:30 - INFO - __main__ - Step 12494: {'lr': 0.0004938242535402025, 'samples': 2398848, 'steps': 12493, 'loss/train': 1.4478893280029297} 11/06/2021 22:58:30 - INFO - __main__ - Step 12494: {'lr': 0.0004938242535402025, 'samples': 2398848, 'steps': 12493, 'loss/train': 1.4478893280029297} 11/06/2021 22:58:34 - INFO - __main__ - Step 12501: {'lr': 0.000493816045120252, 'samples': 2400192, 'steps': 12500, 'loss/train': 1.6998056173324585}} 11/06/2021 22:58:36 - INFO - __main__ - Step 12506: {'lr': 0.0004938101786673416, 'samples': 2401152, 'steps': 12505, 'loss/train': 1.861094355583191}} 11/06/2021 22:58:38 - INFO - __main__ - Step 12511: {'lr': 0.0004938043094680036, 'samples': 2402112, 'steps': 12510, 'loss/train': 1.6733269691467285} 11/06/2021 22:58:38 - INFO - __main__ - Step 12511: {'lr': 0.0004938043094680036, 'samples': 2402112, 'steps': 12510, 'loss/train': 1.6733269691467285} 11/06/2021 22:58:42 - INFO - __main__ - Step 12518: {'lr': 0.0004937960879750578, 'samples': 2403456, 'steps': 12517, 'loss/train': 1.5596864223480225} 11/06/2021 22:58:44 - INFO - __main__ - Step 12522: {'lr': 0.0004937913875623605, 'samples': 2404224, 'steps': 12521, 'loss/train': 1.12235426902771}5} 11/06/2021 22:58:46 - INFO - __main__ - Step 12527: {'lr': 0.0004937855095748985, 'samples': 2405184, 'steps': 12526, 'loss/train': 1.8808726072311401} 11/06/2021 22:58:46 - INFO - __main__ - Step 12527: {'lr': 0.0004937855095748985, 'samples': 2405184, 'steps': 12526, 'loss/train': 1.8808726072311401} 11/06/2021 22:58:50 - INFO - __main__ - Step 12534: {'lr': 0.0004937772757789352, 'samples': 2406528, 'steps': 12533, 'loss/train': 0.9707418084144592} 11/06/2021 22:58:52 - INFO - __main__ - Step 12538: {'lr': 0.0004937725683361286, 'samples': 2407296, 'steps': 12537, 'loss/train': 1.6391980648040771} 11/06/2021 22:58:54 - INFO - __main__ - Step 12543: {'lr': 0.0004937666815612207, 'samples': 2408256, 'steps': 12542, 'loss/train': 1.523587703704834}} 11/06/2021 22:58:56 - INFO - __main__ - Step 12547: {'lr': 0.0004937619701642162, 'samples': 2409024, 'steps': 12546, 'loss/train': 2.321528434753418}} 11/06/2021 22:58:58 - INFO - __main__ - Step 12551: {'lr': 0.0004937572570098455, 'samples': 2409792, 'steps': 12550, 'loss/train': 1.5153863430023193} 11/06/2021 22:59:00 - INFO - __main__ - Step 12555: {'lr': 0.0004937525420981428, 'samples': 2410560, 'steps': 12554, 'loss/train': 2.261537790298462}} 11/06/2021 22:59:02 - INFO - __main__ - Step 12559: {'lr': 0.0004937478254291418, 'samples': 2411328, 'steps': 12558, 'loss/train': 1.6904947757720947} 11/06/2021 22:59:04 - INFO - __main__ - Step 12564: {'lr': 0.0004937419271217419, 'samples': 2412288, 'steps': 12563, 'loss/train': 5.857589244842529}} 11/06/2021 22:59:06 - INFO - __main__ - Step 12568: {'lr': 0.0004937372064989445, 'samples': 2413056, 'steps': 12567, 'loss/train': 1.3068746328353882} 11/06/2021 22:59:08 - INFO - __main__ - Step 12572: {'lr': 0.0004937324841189595, 'samples': 2413824, 'steps': 12571, 'loss/train': 1.9527837038040161} 11/06/2021 22:59:10 - INFO - __main__ - Step 12576: {'lr': 0.000493727759981821, 'samples': 2414592, 'steps': 12575, 'loss/train': 1.5525745153427124}} 11/06/2021 22:59:12 - INFO - __main__ - Step 12580: {'lr': 0.000493723034087563, 'samples': 2415360, 'steps': 12579, 'loss/train': 0.8102706074714661}} 11/06/2021 22:59:14 - INFO - __main__ - Step 12584: {'lr': 0.0004937183064362196, 'samples': 2416128, 'steps': 12583, 'loss/train': 1.7389198541641235} 11/06/2021 22:59:16 - INFO - __main__ - Step 12588: {'lr': 0.0004937135770278248, 'samples': 2416896, 'steps': 12587, 'loss/train': 1.688107967376709}} 11/06/2021 22:59:18 - INFO - __main__ - Step 12592: {'lr': 0.0004937088458624128, 'samples': 2417664, 'steps': 12591, 'loss/train': 1.8203307390213013} 11/06/2021 22:59:20 - INFO - __main__ - Step 12596: {'lr': 0.0004937041129400177, 'samples': 2418432, 'steps': 12595, 'loss/train': 2.0309805870056152} 11/06/2021 22:59:22 - INFO - __main__ - Step 12601: {'lr': 0.0004936981943163182, 'samples': 2419392, 'steps': 12600, 'loss/train': 1.7093287706375122} 11/06/2021 22:59:24 - INFO - __main__ - Step 12605: {'lr': 0.000493693457440836, 'samples': 2420160, 'steps': 12604, 'loss/train': 2.0806899070739746}} 11/06/2021 22:59:26 - INFO - __main__ - Step 12609: {'lr': 0.0004936887188084813, 'samples': 2420928, 'steps': 12608, 'loss/train': 1.5434484481811523} 11/06/2021 22:59:28 - INFO - __main__ - Step 12613: {'lr': 0.0004936839784192888, 'samples': 2421696, 'steps': 12612, 'loss/train': 1.6676959991455078} 11/06/2021 22:59:30 - INFO - __main__ - Step 12617: {'lr': 0.0004936792362732924, 'samples': 2422464, 'steps': 12616, 'loss/train': 1.7673671245574951} 11/06/2021 22:59:32 - INFO - __main__ - Step 12622: {'lr': 0.0004936733061203435, 'samples': 2423424, 'steps': 12621, 'loss/train': 1.8702856302261353} 11/06/2021 22:59:34 - INFO - __main__ - Step 12626: {'lr': 0.0004936685600216635, 'samples': 2424192, 'steps': 12625, 'loss/train': 1.8777118921279907} 11/06/2021 22:59:34 - INFO - __main__ - Step 12626: {'lr': 0.0004936685600216635, 'samples': 2424192, 'steps': 12625, 'loss/train': 1.8777118921279907} 11/06/2021 22:59:38 - INFO - __main__ - Step 12633: {'lr': 0.0004936602501219522, 'samples': 2425536, 'steps': 12632, 'loss/train': 1.7763961553573608} 11/06/2021 22:59:40 - INFO - __main__ - Step 12638: {'lr': 0.0004936543111856041, 'samples': 2426496, 'steps': 12637, 'loss/train': 1.6684590578079224} 11/06/2021 22:59:40 - INFO - __main__ - Step 12638: {'lr': 0.0004936543111856041, 'samples': 2426496, 'steps': 12637, 'loss/train': 1.6684590578079224} 11/06/2021 22:59:44 - INFO - __main__ - Step 12646: {'lr': 0.0004936448031785576, 'samples': 2428032, 'steps': 12645, 'loss/train': 1.5351349115371704} 11/06/2021 22:59:47 - INFO - __main__ - Step 12650: {'lr': 0.0004936400465402351, 'samples': 2428800, 'steps': 12649, 'loss/train': 1.8610800504684448} 11/06/2021 22:59:48 - INFO - __main__ - Step 12654: {'lr': 0.0004936352881454256, 'samples': 2429568, 'steps': 12653, 'loss/train': 1.6259452104568481} 11/06/2021 22:59:50 - INFO - __main__ - Step 12658: {'lr': 0.000493630527994163, 'samples': 2430336, 'steps': 12657, 'loss/train': 1.9009116888046265}} 11/06/2021 22:59:53 - INFO - __main__ - Step 12663: {'lr': 0.0004936245753351256, 'samples': 2431296, 'steps': 12662, 'loss/train': 1.9647890329360962} 11/06/2021 22:59:55 - INFO - __main__ - Step 12667: {'lr': 0.0004936198112319698, 'samples': 2432064, 'steps': 12666, 'loss/train': 1.6497493982315063} 11/06/2021 22:59:55 - INFO - __main__ - Step 12667: {'lr': 0.0004936198112319698, 'samples': 2432064, 'steps': 12666, 'loss/train': 1.6497493982315063} 11/06/2021 22:59:58 - INFO - __main__ - Step 12674: {'lr': 0.0004936114698252717, 'samples': 2433408, 'steps': 12673, 'loss/train': 1.5145035982131958} 11/06/2021 23:00:00 - INFO - __main__ - Step 12679: {'lr': 0.0004936055083845924, 'samples': 2434368, 'steps': 12678, 'loss/train': 1.7321808338165283} 11/06/2021 23:00:03 - INFO - __main__ - Step 12683: {'lr': 0.0004936007372562778, 'samples': 2435136, 'steps': 12682, 'loss/train': 1.703395128250122}} 11/06/2021 23:00:05 - INFO - __main__ - Step 12687: {'lr': 0.0004935959643717595, 'samples': 2435904, 'steps': 12686, 'loss/train': 1.3232539892196655} 11/06/2021 23:00:07 - INFO - __main__ - Step 12691: {'lr': 0.0004935911897310719, 'samples': 2436672, 'steps': 12690, 'loss/train': 1.8013556003570557} 11/06/2021 23:00:08 - INFO - __main__ - Step 12695: {'lr': 0.0004935864133342495, 'samples': 2437440, 'steps': 12694, 'loss/train': 1.6431411504745483} 11/06/2021 23:00:10 - INFO - __main__ - Step 12699: {'lr': 0.0004935816351813265, 'samples': 2438208, 'steps': 12698, 'loss/train': 1.92881441116333}3} 11/06/2021 23:00:13 - INFO - __main__ - Step 12704: {'lr': 0.000493575660020709, 'samples': 2439168, 'steps': 12703, 'loss/train': 1.6965235471725464}} 11/06/2021 23:00:15 - INFO - __main__ - Step 12708: {'lr': 0.0004935708779166859, 'samples': 2439936, 'steps': 12707, 'loss/train': 1.2805671691894531} 11/06/2021 23:00:17 - INFO - __main__ - Step 12712: {'lr': 0.0004935660940566744, 'samples': 2440704, 'steps': 12711, 'loss/train': 1.640486240386963}} 11/06/2021 23:00:18 - INFO - __main__ - Step 12716: {'lr': 0.000493561308440709, 'samples': 2441472, 'steps': 12715, 'loss/train': 1.336901068687439}}} 11/06/2021 23:00:20 - INFO - __main__ - Step 12720: {'lr': 0.000493556521068824, 'samples': 2442240, 'steps': 12719, 'loss/train': 1.883273720741272}}} 11/06/2021 23:00:23 - INFO - __main__ - Step 12725: {'lr': 0.0004935505343847586, 'samples': 2443200, 'steps': 12724, 'loss/train': 1.665300965309143}} 11/06/2021 23:00:25 - INFO - __main__ - Step 12729: {'lr': 0.000493545743062181, 'samples': 2443968, 'steps': 12728, 'loss/train': 1.831040382385254}}} 11/06/2021 23:00:27 - INFO - __main__ - Step 12733: {'lr': 0.0004935409499837962, 'samples': 2444736, 'steps': 12732, 'loss/train': 2.449708938598633}} 11/06/2021 23:00:28 - INFO - __main__ - Step 12737: {'lr': 0.0004935361551496387, 'samples': 2445504, 'steps': 12736, 'loss/train': 1.5406252145767212} 11/06/2021 23:00:30 - INFO - __main__ - Step 12741: {'lr': 0.000493531358559743, 'samples': 2446272, 'steps': 12740, 'loss/train': 1.849948525428772}2} 11/06/2021 23:00:33 - INFO - __main__ - Step 12746: {'lr': 0.0004935253603534193, 'samples': 2447232, 'steps': 12745, 'loss/train': 1.745668649673462}} 11/06/2021 23:00:35 - INFO - __main__ - Step 12750: {'lr': 0.0004935205598132393, 'samples': 2448000, 'steps': 12749, 'loss/train': 1.637121319770813}} 11/06/2021 23:00:37 - INFO - __main__ - Step 12754: {'lr': 0.0004935157575174336, 'samples': 2448768, 'steps': 12753, 'loss/train': 1.7007369995117188} 11/06/2021 23:00:39 - INFO - __main__ - Step 12758: {'lr': 0.0004935109534660368, 'samples': 2449536, 'steps': 12757, 'loss/train': 1.8127813339233398} 11/06/2021 23:00:40 - INFO - __main__ - Step 12762: {'lr': 0.0004935061476590835, 'samples': 2450304, 'steps': 12761, 'loss/train': 0.9896982908248901} 11/06/2021 23:00:43 - INFO - __main__ - Step 12767: {'lr': 0.0004935001379316935, 'samples': 2451264, 'steps': 12766, 'loss/train': 1.5596141815185547} 11/06/2021 23:00:43 - INFO - __main__ - Step 12767: {'lr': 0.0004935001379316935, 'samples': 2451264, 'steps': 12766, 'loss/train': 1.5596141815185547} 11/06/2021 23:00:43 - INFO - __main__ - Step 12767: {'lr': 0.0004935001379316935, 'samples': 2451264, 'steps': 12766, 'loss/train': 1.5596141815185547} 11/06/2021 23:00:48 - INFO - __main__ - Step 12778: {'lr': 0.0004934869068763992, 'samples': 2453376, 'steps': 12777, 'loss/train': 1.8732150793075562} 11/06/2021 23:00:51 - INFO - __main__ - Step 12783: {'lr': 0.0004934808883718553, 'samples': 2454336, 'steps': 12782, 'loss/train': 1.3475416898727417} 11/06/2021 23:00:53 - INFO - __main__ - Step 12787: {'lr': 0.0004934760715934597, 'samples': 2455104, 'steps': 12786, 'loss/train': 1.5334937572479248} 11/06/2021 23:00:55 - INFO - __main__ - Step 12792: {'lr': 0.0004934700481520717, 'samples': 2456064, 'steps': 12791, 'loss/train': 1.9511923789978027} 11/06/2021 23:00:55 - INFO - __main__ - Step 12792: {'lr': 0.0004934700481520717, 'samples': 2456064, 'steps': 12791, 'loss/train': 1.9511923789978027} 11/06/2021 23:00:59 - INFO - __main__ - Step 12799: {'lr': 0.0004934616107265821, 'samples': 2457408, 'steps': 12798, 'loss/train': 2.198434591293335}} 11/06/2021 23:01:00 - INFO - __main__ - Step 12803: {'lr': 0.0004934567869271751, 'samples': 2458176, 'steps': 12802, 'loss/train': 1.9801455736160278} 11/06/2021 23:01:03 - INFO - __main__ - Step 12808: {'lr': 0.0004934507547097183, 'samples': 2459136, 'steps': 12807, 'loss/train': 1.6593072414398193} 11/06/2021 23:01:05 - INFO - __main__ - Step 12812: {'lr': 0.000493445926961237, 'samples': 2459904, 'steps': 12811, 'loss/train': 1.6957072019577026}} 11/06/2021 23:01:07 - INFO - __main__ - Step 12816: {'lr': 0.0004934410974576679, 'samples': 2460672, 'steps': 12815, 'loss/train': 1.9368259906768799} 11/06/2021 23:01:09 - INFO - __main__ - Step 12820: {'lr': 0.000493436266199046, 'samples': 2461440, 'steps': 12819, 'loss/train': 1.4729948043823242}} 11/06/2021 23:01:10 - INFO - __main__ - Step 12824: {'lr': 0.0004934314331854061, 'samples': 2462208, 'steps': 12823, 'loss/train': 1.826889157295227}} 11/06/2021 23:01:13 - INFO - __main__ - Step 12828: {'lr': 0.000493426598416783, 'samples': 2462976, 'steps': 12827, 'loss/train': 1.77826988697052}7}} 11/06/2021 23:01:15 - INFO - __main__ - Step 12833: {'lr': 0.0004934205524881123, 'samples': 2463936, 'steps': 12832, 'loss/train': 1.9681713581085205} 11/06/2021 23:01:15 - INFO - __main__ - Step 12833: {'lr': 0.0004934205524881123, 'samples': 2463936, 'steps': 12832, 'loss/train': 1.9681713581085205} 11/06/2021 23:01:18 - INFO - __main__ - Step 12838: {'lr': 0.0004934145038174028, 'samples': 2464896, 'steps': 12837, 'loss/train': 1.6767199039459229} 11/06/2021 23:01:21 - INFO - __main__ - Step 12844: {'lr': 0.0004934072417931564, 'samples': 2466048, 'steps': 12843, 'loss/train': 1.7585923671722412} 11/06/2021 23:01:23 - INFO - __main__ - Step 12848: {'lr': 0.0004934023982501406, 'samples': 2466816, 'steps': 12847, 'loss/train': 1.2807785272598267} 11/06/2021 23:01:25 - INFO - __main__ - Step 12852: {'lr': 0.0004933975529523511, 'samples': 2467584, 'steps': 12851, 'loss/train': 1.6571444272994995} 11/06/2021 23:01:27 - INFO - __main__ - Step 12856: {'lr': 0.0004933927058998226, 'samples': 2468352, 'steps': 12855, 'loss/train': 1.74991774559021}5} 11/06/2021 23:01:29 - INFO - __main__ - Step 12861: {'lr': 0.0004933866446166136, 'samples': 2469312, 'steps': 12860, 'loss/train': 1.84537935256958}5} 11/06/2021 23:01:29 - INFO - __main__ - Step 12861: {'lr': 0.0004933866446166136, 'samples': 2469312, 'steps': 12860, 'loss/train': 1.84537935256958}5} 11/06/2021 23:01:32 - INFO - __main__ - Step 12868: {'lr': 0.0004933781542141532, 'samples': 2470656, 'steps': 12867, 'loss/train': 1.985740303993225}} 11/06/2021 23:01:35 - INFO - __main__ - Step 12872: {'lr': 0.0004933733001430186, 'samples': 2471424, 'steps': 12871, 'loss/train': 1.7198106050491333} 11/06/2021 23:01:37 - INFO - __main__ - Step 12877: {'lr': 0.0004933672300867488, 'samples': 2472384, 'steps': 12876, 'loss/train': 1.5705578327178955} 11/06/2021 23:01:39 - INFO - __main__ - Step 12881: {'lr': 0.0004933623720678944, 'samples': 2473152, 'steps': 12880, 'loss/train': 1.816332459449768}} 11/06/2021 23:01:41 - INFO - __main__ - Step 12885: {'lr': 0.0004933575122945547, 'samples': 2473920, 'steps': 12884, 'loss/train': 1.8801758289337158} 11/06/2021 23:01:43 - INFO - __main__ - Step 12889: {'lr': 0.0004933526507667648, 'samples': 2474688, 'steps': 12888, 'loss/train': 2.223954200744629}} 11/06/2021 23:01:45 - INFO - __main__ - Step 12893: {'lr': 0.0004933477874845595, 'samples': 2475456, 'steps': 12892, 'loss/train': 1.9039117097854614} 11/06/2021 23:01:47 - INFO - __main__ - Step 12897: {'lr': 0.0004933429224479743, 'samples': 2476224, 'steps': 12896, 'loss/train': 1.6877235174179077} 11/06/2021 23:01:47 - INFO - __main__ - Step 12897: {'lr': 0.0004933429224479743, 'samples': 2476224, 'steps': 12896, 'loss/train': 1.6877235174179077} 11/06/2021 23:01:51 - INFO - __main__ - Step 12903: {'lr': 0.0004933356216037104, 'samples': 2477376, 'steps': 12902, 'loss/train': 1.398972749710083}} 11/06/2021 23:01:53 - INFO - __main__ - Step 12908: {'lr': 0.0004933295345516287, 'samples': 2478336, 'steps': 12907, 'loss/train': 1.5094048976898193} 11/06/2021 23:01:55 - INFO - __main__ - Step 12913: {'lr': 0.0004933234447585337, 'samples': 2479296, 'steps': 12912, 'loss/train': 1.6900960206985474} 11/06/2021 23:01:55 - INFO - __main__ - Step 12913: {'lr': 0.0004933234447585337, 'samples': 2479296, 'steps': 12912, 'loss/train': 1.6900960206985474} 11/06/2021 23:01:55 - INFO - __main__ - Step 12913: {'lr': 0.0004933234447585337, 'samples': 2479296, 'steps': 12912, 'loss/train': 1.6900960206985474} 11/06/2021 23:02:01 - INFO - __main__ - Step 12923: {'lr': 0.000493311256949578, 'samples': 2481216, 'steps': 12922, 'loss/train': 1.5766397714614868}} 11/06/2021 23:02:03 - INFO - __main__ - Step 12928: {'lr': 0.0004933051589338547, 'samples': 2482176, 'steps': 12927, 'loss/train': 1.815047025680542}} 11/06/2021 23:02:03 - INFO - __main__ - Step 12928: {'lr': 0.0004933051589338547, 'samples': 2482176, 'steps': 12927, 'loss/train': 1.815047025680542}} 11/06/2021 23:02:07 - INFO - __main__ - Step 12936: {'lr': 0.0004932953964079893, 'samples': 2483712, 'steps': 12935, 'loss/train': 1.569036841392517}} 11/06/2021 23:02:09 - INFO - __main__ - Step 12940: {'lr': 0.0004932905125140354, 'samples': 2484480, 'steps': 12939, 'loss/train': 1.5175234079360962} 11/06/2021 23:02:11 - INFO - __main__ - Step 12944: {'lr': 0.0004932856268661143, 'samples': 2485248, 'steps': 12943, 'loss/train': 1.5122387409210205} 11/06/2021 23:02:13 - INFO - __main__ - Step 12949: {'lr': 0.0004932795173397501, 'samples': 2486208, 'steps': 12948, 'loss/train': 2.574796199798584}} 11/06/2021 23:02:16 - INFO - __main__ - Step 12954: {'lr': 0.0004932734050729362, 'samples': 2487168, 'steps': 12953, 'loss/train': 1.2582000494003296} 11/06/2021 23:02:18 - INFO - __main__ - Step 12958: {'lr': 0.0004932685132864072, 'samples': 2487936, 'steps': 12957, 'loss/train': 1.3571699857711792} 11/06/2021 23:02:18 - INFO - __main__ - Step 12958: {'lr': 0.0004932685132864072, 'samples': 2487936, 'steps': 12957, 'loss/train': 1.3571699857711792} 11/06/2021 23:02:21 - INFO - __main__ - Step 12965: {'lr': 0.000493259948439901, 'samples': 2489280, 'steps': 12964, 'loss/train': 1.8154479265213013}} 11/06/2021 23:02:23 - INFO - __main__ - Step 12970: {'lr': 0.0004932538274041101, 'samples': 2490240, 'steps': 12969, 'loss/train': 1.741576910018921}} 11/06/2021 23:02:23 - INFO - __main__ - Step 12970: {'lr': 0.0004932538274041101, 'samples': 2490240, 'steps': 12969, 'loss/train': 1.741576910018921}} 11/06/2021 23:02:27 - INFO - __main__ - Step 12977: {'lr': 0.0004932452533505486, 'samples': 2491584, 'steps': 12976, 'loss/train': 1.471701979637146}} 11/06/2021 23:02:29 - INFO - __main__ - Step 12982: {'lr': 0.0004932391257384883, 'samples': 2492544, 'steps': 12981, 'loss/train': 1.9861183166503906} 11/06/2021 23:02:32 - INFO - __main__ - Step 12987: {'lr': 0.0004932329953864331, 'samples': 2493504, 'steps': 12986, 'loss/train': 1.9840530157089233} 11/06/2021 23:02:32 - INFO - __main__ - Step 12987: {'lr': 0.0004932329953864331, 'samples': 2493504, 'steps': 12986, 'loss/train': 1.9840530157089233} 11/06/2021 23:02:36 - INFO - __main__ - Step 12994: {'lr': 0.0004932244082904959, 'samples': 2494848, 'steps': 12993, 'loss/train': 1.4867180585861206} 11/06/2021 23:02:37 - INFO - __main__ - Step 12998: {'lr': 0.00049321949896747, 'samples': 2495616, 'steps': 12997, 'loss/train': 1.4822603464126587}6} 11/06/2021 23:02:40 - INFO - __main__ - Step 13002: {'lr': 0.0004932145878909889, 'samples': 2496384, 'steps': 13001, 'loss/train': 1.9297839403152466} 11/06/2021 23:02:42 - INFO - __main__ - Step 13006: {'lr': 0.0004932096750610879, 'samples': 2497152, 'steps': 13005, 'loss/train': 1.8479753732681274} 11/06/2021 23:02:42 - INFO - __main__ - Step 13006: {'lr': 0.0004932096750610879, 'samples': 2497152, 'steps': 13005, 'loss/train': 1.8479753732681274} 11/06/2021 23:02:45 - INFO - __main__ - Step 13013: {'lr': 0.0004932010733897012, 'samples': 2498496, 'steps': 13012, 'loss/train': 1.5579228401184082} 11/06/2021 23:02:48 - INFO - __main__ - Step 13018: {'lr': 0.00049319492605122, 'samples': 2499456, 'steps': 13017, 'loss/train': 1.8722537755966187}2} 11/06/2021 23:02:50 - INFO - __main__ - Step 13022: {'lr': 0.000493190006207994, 'samples': 2500224, 'steps': 13021, 'loss/train': 1.3083375692367554}} 11/06/2021 23:02:52 - INFO - __main__ - Step 13026: {'lr': 0.0004931850846115253, 'samples': 2500992, 'steps': 13025, 'loss/train': 1.7009488344192505} 11/06/2021 23:02:53 - INFO - __main__ - Step 13030: {'lr': 0.0004931801612618494, 'samples': 2501760, 'steps': 13029, 'loss/train': 0.9826458692550659} 11/06/2021 23:02:56 - INFO - __main__ - Step 13034: {'lr': 0.000493175236159002, 'samples': 2502528, 'steps': 13033, 'loss/train': 2.120985269546509}9} 11/06/2021 23:02:58 - INFO - __main__ - Step 13039: {'lr': 0.0004931690773150991, 'samples': 2503488, 'steps': 13038, 'loss/train': 1.3863145112991333} 11/06/2021 23:02:58 - INFO - __main__ - Step 13039: {'lr': 0.0004931690773150991, 'samples': 2503488, 'steps': 13038, 'loss/train': 1.3863145112991333} 11/06/2021 23:03:02 - INFO - __main__ - Step 13046: {'lr': 0.0004931604503317846, 'samples': 2504832, 'steps': 13045, 'loss/train': 1.6171785593032837} 11/06/2021 23:03:04 - INFO - __main__ - Step 13051: {'lr': 0.0004931542849139044, 'samples': 2505792, 'steps': 13050, 'loss/train': 2.8212499618530273} 11/06/2021 23:03:06 - INFO - __main__ - Step 13055: {'lr': 0.0004931493506074886, 'samples': 2506560, 'steps': 13054, 'loss/train': 1.271149754524231}} 11/06/2021 23:03:08 - INFO - __main__ - Step 13059: {'lr': 0.0004931444145481233, 'samples': 2507328, 'steps': 13058, 'loss/train': 1.3444815874099731} 11/06/2021 23:03:10 - INFO - __main__ - Step 13063: {'lr': 0.000493139476735844, 'samples': 2508096, 'steps': 13062, 'loss/train': 2.6808207035064697}} 11/06/2021 23:03:10 - INFO - __main__ - Step 13063: {'lr': 0.000493139476735844, 'samples': 2508096, 'steps': 13062, 'loss/train': 2.6808207035064697}} 11/06/2021 23:03:13 - INFO - __main__ - Step 13070: {'lr': 0.0004931308313465132, 'samples': 2509440, 'steps': 13069, 'loss/train': 1.9141942262649536} 11/06/2021 23:03:16 - INFO - __main__ - Step 13075: {'lr': 0.0004931246527818785, 'samples': 2510400, 'steps': 13074, 'loss/train': 1.26536226272583}6} 11/06/2021 23:03:18 - INFO - __main__ - Step 13080: {'lr': 0.0004931184714785385, 'samples': 2511360, 'steps': 13079, 'loss/train': 1.9777021408081055} 11/06/2021 23:03:18 - INFO - __main__ - Step 13080: {'lr': 0.0004931184714785385, 'samples': 2511360, 'steps': 13079, 'loss/train': 1.9777021408081055} 11/06/2021 23:03:22 - INFO - __main__ - Step 13087: {'lr': 0.0004931098130529699, 'samples': 2512704, 'steps': 13086, 'loss/train': 2.025705099105835}} 11/06/2021 23:03:23 - INFO - __main__ - Step 13091: {'lr': 0.0004931048629712905, 'samples': 2513472, 'steps': 13090, 'loss/train': 1.8698866367340088} 11/06/2021 23:03:25 - INFO - __main__ - Step 13095: {'lr': 0.0004930999111369824, 'samples': 2514240, 'steps': 13094, 'loss/train': 1.6571738719940186} 11/06/2021 23:03:25 - INFO - __main__ - Step 13095: {'lr': 0.0004930999111369824, 'samples': 2514240, 'steps': 13094, 'loss/train': 1.6571738719940186} 11/06/2021 23:03:30 - INFO - __main__ - Step 13104: {'lr': 0.0004930887631019248, 'samples': 2515968, 'steps': 13103, 'loss/train': 1.3024754524230957} 11/06/2021 23:03:32 - INFO - __main__ - Step 13109: {'lr': 0.0004930825659154674, 'samples': 2516928, 'steps': 13108, 'loss/train': 0.959062397480011}} 11/06/2021 23:03:32 - INFO - __main__ - Step 13109: {'lr': 0.0004930825659154674, 'samples': 2516928, 'steps': 13108, 'loss/train': 0.959062397480011}} 11/06/2021 23:03:35 - INFO - __main__ - Step 13116: {'lr': 0.0004930738852542141, 'samples': 2518272, 'steps': 13115, 'loss/train': 1.4143397808074951} 11/06/2021 23:03:38 - INFO - __main__ - Step 13121: {'lr': 0.0004930676814961189, 'samples': 2519232, 'steps': 13120, 'loss/train': 1.5763609409332275} 11/06/2021 23:03:38 - INFO - __main__ - Step 13121: {'lr': 0.0004930676814961189, 'samples': 2519232, 'steps': 13120, 'loss/train': 1.5763609409332275} 11/06/2021 23:03:42 - INFO - __main__ - Step 13127: {'lr': 0.0004930602333721667, 'samples': 2520384, 'steps': 13126, 'loss/train': 1.6020070314407349} 11/06/2021 23:03:44 - INFO - __main__ - Step 13133: {'lr': 0.0004930527813055237, 'samples': 2521536, 'steps': 13132, 'loss/train': 2.1043529510498047} 11/06/2021 23:03:44 - INFO - __main__ - Step 13133: {'lr': 0.0004930527813055237, 'samples': 2521536, 'steps': 13132, 'loss/train': 2.1043529510498047} 11/06/2021 23:03:48 - INFO - __main__ - Step 13140: {'lr': 0.0004930440822448115, 'samples': 2522880, 'steps': 13139, 'loss/train': 1.4115676879882812} 11/06/2021 23:03:50 - INFO - __main__ - Step 13144: {'lr': 0.0004930391089437017, 'samples': 2523648, 'steps': 13143, 'loss/train': 1.7475541830062866} 11/06/2021 23:03:52 - INFO - __main__ - Step 13148: {'lr': 0.0004930341338904371, 'samples': 2524416, 'steps': 13147, 'loss/train': 1.889960765838623}} 11/06/2021 23:03:52 - INFO - __main__ - Step 13148: {'lr': 0.0004930341338904371, 'samples': 2524416, 'steps': 13147, 'loss/train': 1.889960765838623}} 11/06/2021 23:03:56 - INFO - __main__ - Step 13156: {'lr': 0.000493024178527587, 'samples': 2525952, 'steps': 13155, 'loss/train': 2.2869997024536133}} 11/06/2021 23:03:57 - INFO - __main__ - Step 13160: {'lr': 0.0004930191982180734, 'samples': 2526720, 'steps': 13159, 'loss/train': 1.5737640857696533} 11/06/2021 23:03:59 - INFO - __main__ - Step 13164: {'lr': 0.0004930142161565486, 'samples': 2527488, 'steps': 13163, 'loss/train': 2.0381977558135986} 11/06/2021 23:04:02 - INFO - __main__ - Step 13169: {'lr': 0.0004930079861159315, 'samples': 2528448, 'steps': 13168, 'loss/train': 1.3868030309677124} 11/06/2021 23:04:04 - INFO - __main__ - Step 13174: {'lr': 0.000493001753337923, 'samples': 2529408, 'steps': 13173, 'loss/train': 1.542048692703247}4} 11/06/2021 23:04:04 - INFO - __main__ - Step 13174: {'lr': 0.000493001753337923, 'samples': 2529408, 'steps': 13173, 'loss/train': 1.542048692703247}4} 11/06/2021 23:04:07 - INFO - __main__ - Step 13181: {'lr': 0.0004929930228500279, 'samples': 2530752, 'steps': 13180, 'loss/train': 1.5887360572814941} 11/06/2021 23:04:10 - INFO - __main__ - Step 13185: {'lr': 0.0004929880315910338, 'samples': 2531520, 'steps': 13184, 'loss/train': 0.39340120553970337} 11/06/2021 23:04:12 - INFO - __main__ - Step 13190: {'lr': 0.0004929817900538455, 'samples': 2532480, 'steps': 13189, 'loss/train': 1.4178115129470825}} 11/06/2021 23:04:12 - INFO - __main__ - Step 13190: {'lr': 0.0004929817900538455, 'samples': 2532480, 'steps': 13189, 'loss/train': 1.4178115129470825}} 11/06/2021 23:04:15 - INFO - __main__ - Step 13197: {'lr': 0.0004929730473034763, 'samples': 2533824, 'steps': 13196, 'loss/train': 1.561155915260315}}} 11/06/2021 23:04:17 - INFO - __main__ - Step 13201: {'lr': 0.000492968049037552, 'samples': 2534592, 'steps': 13200, 'loss/train': 1.2632533311843872}}} 11/06/2021 23:04:20 - INFO - __main__ - Step 13206: {'lr': 0.0004929617987419039, 'samples': 2535552, 'steps': 13205, 'loss/train': 1.9509419202804565}} 11/06/2021 23:04:22 - INFO - __main__ - Step 13210: {'lr': 0.0004929567965348347, 'samples': 2536320, 'steps': 13209, 'loss/train': 1.7661024332046509}} 11/06/2021 23:04:24 - INFO - __main__ - Step 13214: {'lr': 0.0004929517925762045, 'samples': 2537088, 'steps': 13213, 'loss/train': 1.7152559757232666}} 11/06/2021 23:04:25 - INFO - __main__ - Step 13218: {'lr': 0.0004929467868660487, 'samples': 2537856, 'steps': 13217, 'loss/train': 1.3065961599349976}} 11/06/2021 23:04:28 - INFO - __main__ - Step 13222: {'lr': 0.000492941779404404, 'samples': 2538624, 'steps': 13221, 'loss/train': 1.3217058181762695}}} 11/06/2021 23:04:30 - INFO - __main__ - Step 13227: {'lr': 0.0004929355176143714, 'samples': 2539584, 'steps': 13226, 'loss/train': 1.6296266317367554}} 11/06/2021 23:04:32 - INFO - __main__ - Step 13232: {'lr': 0.0004929292530877638, 'samples': 2540544, 'steps': 13231, 'loss/train': 1.5122207403182983}} 11/06/2021 23:04:32 - INFO - __main__ - Step 13232: {'lr': 0.0004929292530877638, 'samples': 2540544, 'steps': 13231, 'loss/train': 1.5122207403182983}} 11/06/2021 23:04:36 - INFO - __main__ - Step 13239: {'lr': 0.0004929204781532018, 'samples': 2541888, 'steps': 13238, 'loss/train': 1.915743112564087}}} 11/06/2021 23:04:38 - INFO - __main__ - Step 13243: {'lr': 0.0004929154614968315, 'samples': 2542656, 'steps': 13242, 'loss/train': 0.5492236614227295}} 11/06/2021 23:04:40 - INFO - __main__ - Step 13247: {'lr': 0.0004929104430891978, 'samples': 2543424, 'steps': 13246, 'loss/train': 1.5061917304992676}} 11/06/2021 23:04:42 - INFO - __main__ - Step 13252: {'lr': 0.0004929041676169967, 'samples': 2544384, 'steps': 13251, 'loss/train': 1.4510022401809692}} 11/06/2021 23:04:44 - INFO - __main__ - Step 13256: {'lr': 0.0004928991452691528, 'samples': 2545152, 'steps': 13255, 'loss/train': 1.8692861795425415}} 11/06/2021 23:04:44 - INFO - __main__ - Step 13256: {'lr': 0.0004928991452691528, 'samples': 2545152, 'steps': 13255, 'loss/train': 1.8692861795425415}} 11/06/2021 23:04:48 - INFO - __main__ - Step 13263: {'lr': 0.0004928903519467534, 'samples': 2546496, 'steps': 13262, 'loss/train': 2.4009814262390137}} 11/06/2021 23:04:50 - INFO - __main__ - Step 13268: {'lr': 0.0004928840677188918, 'samples': 2547456, 'steps': 13267, 'loss/train': 1.5503804683685303}} 11/06/2021 23:04:50 - INFO - __main__ - Step 13268: {'lr': 0.0004928840677188918, 'samples': 2547456, 'steps': 13267, 'loss/train': 1.5503804683685303}} 11/06/2021 23:04:54 - INFO - __main__ - Step 13276: {'lr': 0.0004928740072634722, 'samples': 2548992, 'steps': 13275, 'loss/train': 2.0685250759124756}} 11/06/2021 23:04:56 - INFO - __main__ - Step 13280: {'lr': 0.0004928689744092976, 'samples': 2549760, 'steps': 13279, 'loss/train': 1.8017091751098633}} 11/06/2021 23:04:58 - INFO - __main__ - Step 13284: {'lr': 0.0004928639398041948, 'samples': 2550528, 'steps': 13283, 'loss/train': 1.974739670753479}}} 11/06/2021 23:05:00 - INFO - __main__ - Step 13288: {'lr': 0.0004928589034482001, 'samples': 2551296, 'steps': 13287, 'loss/train': 1.8625463247299194}} 11/06/2021 23:05:03 - INFO - __main__ - Step 13294: {'lr': 0.0004928513456313653, 'samples': 2552448, 'steps': 13293, 'loss/train': 1.6685292720794678}} 11/06/2021 23:05:03 - INFO - __main__ - Step 13294: {'lr': 0.0004928513456313653, 'samples': 2552448, 'steps': 13293, 'loss/train': 1.6685292720794678}} 11/06/2021 23:05:07 - INFO - __main__ - Step 13301: {'lr': 0.0004928425231995593, 'samples': 2553792, 'steps': 13300, 'loss/train': 1.7554380893707275}} 11/06/2021 23:05:08 - INFO - __main__ - Step 13305: {'lr': 0.0004928374794026792, 'samples': 2554560, 'steps': 13304, 'loss/train': 1.3994184732437134}} 11/06/2021 23:05:10 - INFO - __main__ - Step 13309: {'lr': 0.000492832433855098, 'samples': 2555328, 'steps': 13308, 'loss/train': 1.3893240690231323}}} 11/06/2021 23:05:13 - INFO - __main__ - Step 13314: {'lr': 0.0004928261244587536, 'samples': 2556288, 'steps': 13313, 'loss/train': 1.8624576330184937}} 11/06/2021 23:05:15 - INFO - __main__ - Step 13319: {'lr': 0.0004928198123270664, 'samples': 2557248, 'steps': 13318, 'loss/train': 1.5688247680664062}} 11/06/2021 23:05:15 - INFO - __main__ - Step 13319: {'lr': 0.0004928198123270664, 'samples': 2557248, 'steps': 13318, 'loss/train': 1.5688247680664062}} 11/06/2021 23:05:18 - INFO - __main__ - Step 13326: {'lr': 0.0004928109707474643, 'samples': 2558592, 'steps': 13325, 'loss/train': 1.6393741369247437}} 11/06/2021 23:05:20 - INFO - __main__ - Step 13330: {'lr': 0.0004928059160092993, 'samples': 2559360, 'steps': 13329, 'loss/train': 1.7646552324295044}} 11/06/2021 23:05:23 - INFO - __main__ - Step 13335: {'lr': 0.0004927995951249937, 'samples': 2560320, 'steps': 13334, 'loss/train': 1.3417226076126099}} 11/06/2021 23:05:23 - INFO - __main__ - Step 13335: {'lr': 0.0004927995951249937, 'samples': 2560320, 'steps': 13334, 'loss/train': 1.3417226076126099}} 11/06/2021 23:05:26 - INFO - __main__ - Step 13341: {'lr': 0.0004927920064535756, 'samples': 2561472, 'steps': 13340, 'loss/train': 1.808425784111023}}} 11/06/2021 23:05:28 - INFO - __main__ - Step 13345: {'lr': 0.0004927869451513226, 'samples': 2562240, 'steps': 13344, 'loss/train': 1.5792934894561768}} 11/06/2021 23:05:30 - INFO - __main__ - Step 13350: {'lr': 0.0004927806160620995, 'samples': 2563200, 'steps': 13349, 'loss/train': 2.182821750640869}}} 11/06/2021 23:05:33 - INFO - __main__ - Step 13355: {'lr': 0.0004927742842380465, 'samples': 2564160, 'steps': 13354, 'loss/train': 2.1269948482513428}} 11/06/2021 23:05:33 - INFO - __main__ - Step 13355: {'lr': 0.0004927742842380465, 'samples': 2564160, 'steps': 13354, 'loss/train': 2.1269948482513428}} 11/06/2021 23:05:36 - INFO - __main__ - Step 13362: {'lr': 0.0004927654150899937, 'samples': 2565504, 'steps': 13361, 'loss/train': 2.3916163444519043}} 11/06/2021 23:05:38 - INFO - __main__ - Step 13366: {'lr': 0.0004927603445988797, 'samples': 2566272, 'steps': 13365, 'loss/train': 1.918675422668457}}} 11/06/2021 23:05:40 - INFO - __main__ - Step 13370: {'lr': 0.0004927552723576207, 'samples': 2567040, 'steps': 13369, 'loss/train': 1.939740777015686}}} 11/06/2021 23:05:42 - INFO - __main__ - Step 13375: {'lr': 0.0004927489295949613, 'samples': 2568000, 'steps': 13374, 'loss/train': 1.403855562210083}}} 11/06/2021 23:05:42 - INFO - __main__ - Step 13375: {'lr': 0.0004927489295949613, 'samples': 2568000, 'steps': 13374, 'loss/train': 1.403855562210083}}} 11/06/2021 23:05:46 - INFO - __main__ - Step 13383: {'lr': 0.0004927387754870321, 'samples': 2569536, 'steps': 13382, 'loss/train': 0.35177692770957947} 11/06/2021 23:05:48 - INFO - __main__ - Step 13387: {'lr': 0.0004927336958080648, 'samples': 2570304, 'steps': 13386, 'loss/train': 1.88877272605896}47} 11/06/2021 23:05:50 - INFO - __main__ - Step 13391: {'lr': 0.0004927286143791447, 'samples': 2571072, 'steps': 13390, 'loss/train': 2.0183682441711426}} 11/06/2021 23:05:52 - INFO - __main__ - Step 13396: {'lr': 0.0004927222601321789, 'samples': 2572032, 'steps': 13395, 'loss/train': 2.6536405086517334}} 11/06/2021 23:05:52 - INFO - __main__ - Step 13396: {'lr': 0.0004927222601321789, 'samples': 2572032, 'steps': 13395, 'loss/train': 2.6536405086517334}} 11/06/2021 23:05:57 - INFO - __main__ - Step 13403: {'lr': 0.000492713359593033, 'samples': 2573376, 'steps': 13402, 'loss/train': 1.8210397958755493}}} 11/06/2021 23:05:58 - INFO - __main__ - Step 13407: {'lr': 0.0004927082711646676, 'samples': 2574144, 'steps': 13406, 'loss/train': 1.7488776445388794}} 11/06/2021 23:06:00 - INFO - __main__ - Step 13412: {'lr': 0.0004927019081686015, 'samples': 2575104, 'steps': 13411, 'loss/train': 1.4143866300582886}} 11/06/2021 23:06:02 - INFO - __main__ - Step 13416: {'lr': 0.000492696815803306, 'samples': 2575872, 'steps': 13415, 'loss/train': 1.7381833791732788}}} 11/06/2021 23:06:05 - INFO - __main__ - Step 13420: {'lr': 0.0004926917216883235, 'samples': 2576640, 'steps': 13419, 'loss/train': 1.7219598293304443}} 11/06/2021 23:06:07 - INFO - __main__ - Step 13424: {'lr': 0.0004926866258236907, 'samples': 2577408, 'steps': 13423, 'loss/train': 2.304779529571533}}} 11/06/2021 23:06:08 - INFO - __main__ - Step 13428: {'lr': 0.0004926815282094443, 'samples': 2578176, 'steps': 13427, 'loss/train': 2.2351608276367188}} 11/06/2021 23:06:10 - INFO - __main__ - Step 13432: {'lr': 0.0004926764288456212, 'samples': 2578944, 'steps': 13431, 'loss/train': 1.6883317232131958}} 11/06/2021 23:06:13 - INFO - __main__ - Step 13437: {'lr': 0.0004926700521805557, 'samples': 2579904, 'steps': 13436, 'loss/train': 1.5972094535827637}} 11/06/2021 23:06:15 - INFO - __main__ - Step 13441: {'lr': 0.000492664948880319, 'samples': 2580672, 'steps': 13440, 'loss/train': 1.513269305229187}7}} 11/06/2021 23:06:17 - INFO - __main__ - Step 13445: {'lr': 0.0004926598438306252, 'samples': 2581440, 'steps': 13444, 'loss/train': 1.1700359582901}}7}} 11/06/2021 23:06:17 - INFO - __main__ - Step 13445: {'lr': 0.0004926598438306252, 'samples': 2581440, 'steps': 13444, 'loss/train': 1.1700359582901}}7}} 11/06/2021 23:06:20 - INFO - __main__ - Step 13452: {'lr': 0.0004926509057841397, 'samples': 2582784, 'steps': 13451, 'loss/train': 2.232959032058716}}} 11/06/2021 23:06:23 - INFO - __main__ - Step 13458: {'lr': 0.0004926432403373752, 'samples': 2583936, 'steps': 13457, 'loss/train': 2.219507932662964}}} 11/06/2021 23:06:23 - INFO - __main__ - Step 13458: {'lr': 0.0004926432403373752, 'samples': 2583936, 'steps': 13457, 'loss/train': 2.219507932662964}}} 11/06/2021 23:06:26 - INFO - __main__ - Step 13465: {'lr': 0.0004926342923415844, 'samples': 2585280, 'steps': 13464, 'loss/train': 1.8750228881835938}} 11/06/2021 23:06:28 - INFO - __main__ - Step 13469: {'lr': 0.0004926291767959199, 'samples': 2586048, 'steps': 13468, 'loss/train': 3.767948865890503}}} 11/06/2021 23:06:31 - INFO - __main__ - Step 13474: {'lr': 0.000492622779904032, 'samples': 2587008, 'steps': 13473, 'loss/train': 1.531381368637085}}}} 11/06/2021 23:06:33 - INFO - __main__ - Step 13478: {'lr': 0.0004926176604227208, 'samples': 2587776, 'steps': 13477, 'loss/train': 1.7047059535980225}} 11/06/2021 23:06:33 - INFO - __main__ - Step 13478: {'lr': 0.0004926176604227208, 'samples': 2587776, 'steps': 13477, 'loss/train': 1.7047059535980225}} 11/06/2021 23:06:36 - INFO - __main__ - Step 13485: {'lr': 0.0004926086971216371, 'samples': 2589120, 'steps': 13484, 'loss/train': 1.7945960760116577}} 11/06/2021 23:06:36 - INFO - __main__ - Step 13485: {'lr': 0.0004926086971216371, 'samples': 2589120, 'steps': 13484, 'loss/train': 1.7945960760116577}} 11/06/2021 23:06:36 - INFO - __main__ - Step 13485: {'lr': 0.0004926086971216371, 'samples': 2589120, 'steps': 13484, 'loss/train': 1.7945960760116577}} 11/06/2021 23:06:42 - INFO - __main__ - Step 13496: {'lr': 0.0004925946011120382, 'samples': 2591232, 'steps': 13495, 'loss/train': 2.3239166736602783}} 11/06/2021 23:06:44 - INFO - __main__ - Step 13500: {'lr': 0.000492589472011044, 'samples': 2592000, 'steps': 13499, 'loss/train': 2.0992441177368164}}} 11/06/2021 23:06:47 - INFO - __main__ - Step 13505: {'lr': 0.0004925830581753964, 'samples': 2592960, 'steps': 13504, 'loss/train': 2.066511631011963}}} 11/06/2021 23:06:49 - INFO - __main__ - Step 13509: {'lr': 0.0004925779251393995, 'samples': 2593728, 'steps': 13508, 'loss/train': 2.0398895740509033}} 11/06/2021 23:06:51 - INFO - __main__ - Step 13513: {'lr': 0.0004925727903545727, 'samples': 2594496, 'steps': 13512, 'loss/train': 1.662975788116455}}} 11/06/2021 23:06:52 - INFO - __main__ - Step 13517: {'lr': 0.0004925676538209531, 'samples': 2595264, 'steps': 13516, 'loss/train': 2.147181510925293}}} 11/06/2021 23:06:54 - INFO - __main__ - Step 13521: {'lr': 0.0004925625155385775, 'samples': 2596032, 'steps': 13520, 'loss/train': 1.926583170890808}}} 11/06/2021 23:06:57 - INFO - __main__ - Step 13526: {'lr': 0.0004925560902264766, 'samples': 2596992, 'steps': 13525, 'loss/train': 1.8510551452636719}} 11/06/2021 23:06:57 - INFO - __main__ - Step 13526: {'lr': 0.0004925560902264766, 'samples': 2596992, 'steps': 13525, 'loss/train': 1.8510551452636719}} 11/06/2021 23:06:57 - INFO - __main__ - Step 13526: {'lr': 0.0004925560902264766, 'samples': 2596992, 'steps': 13525, 'loss/train': 1.8510551452636719}} 11/06/2021 23:07:02 - INFO - __main__ - Step 13536: {'lr': 0.0004925432314054448, 'samples': 2598912, 'steps': 13535, 'loss/train': 1.898144006729126}}} 11/06/2021 23:07:05 - INFO - __main__ - Step 13541: {'lr': 0.0004925367978966588, 'samples': 2599872, 'steps': 13540, 'loss/train': 2.184403419494629}}} 11/06/2021 23:07:07 - INFO - __main__ - Step 13546: {'lr': 0.0004925303616557893, 'samples': 2600832, 'steps': 13545, 'loss/train': 1.6563913822174072}} 11/06/2021 23:07:09 - INFO - __main__ - Step 13550: {'lr': 0.0004925252106960425, 'samples': 2601600, 'steps': 13549, 'loss/train': 1.596949577331543}}} 11/06/2021 23:07:09 - INFO - __main__ - Step 13550: {'lr': 0.0004925252106960425, 'samples': 2601600, 'steps': 13549, 'loss/train': 1.596949577331543}}} 11/06/2021 23:07:12 - INFO - __main__ - Step 13557: {'lr': 0.0004925161923093001, 'samples': 2602944, 'steps': 13556, 'loss/train': 1.5300875902175903}} 11/06/2021 23:07:15 - INFO - __main__ - Step 13562: {'lr': 0.0004925097473262509, 'samples': 2603904, 'steps': 13561, 'loss/train': 1.773667573928833}}} 11/06/2021 23:07:17 - INFO - __main__ - Step 13567: {'lr': 0.000492503299611423, 'samples': 2604864, 'steps': 13566, 'loss/train': 1.2261930704116821}}} 11/06/2021 23:07:19 - INFO - __main__ - Step 13571: {'lr': 0.0004924981394727288, 'samples': 2605632, 'steps': 13570, 'loss/train': 1.4361501932144165}} 11/06/2021 23:07:19 - INFO - __main__ - Step 13571: {'lr': 0.0004924981394727288, 'samples': 2605632, 'steps': 13570, 'loss/train': 1.4361501932144165}} 11/06/2021 23:07:22 - INFO - __main__ - Step 13578: {'lr': 0.0004924891050232984, 'samples': 2606976, 'steps': 13577, 'loss/train': 1.8606226444244385}} 11/06/2021 23:07:25 - INFO - __main__ - Step 13583: {'lr': 0.0004924826485672667, 'samples': 2607936, 'steps': 13582, 'loss/train': 0.9627843499183655}} 11/06/2021 23:07:27 - INFO - __main__ - Step 13587: {'lr': 0.0004924774814357768, 'samples': 2608704, 'steps': 13586, 'loss/train': 1.7782405614852905}} 11/06/2021 23:07:27 - INFO - __main__ - Step 13587: {'lr': 0.0004924774814357768, 'samples': 2608704, 'steps': 13586, 'loss/train': 1.7782405614852905}} 11/06/2021 23:07:31 - INFO - __main__ - Step 13594: {'lr': 0.0004924684347493126, 'samples': 2610048, 'steps': 13593, 'loss/train': 1.741284966468811}}} 11/06/2021 23:07:33 - INFO - __main__ - Step 13598: {'lr': 0.0004924632628106217, 'samples': 2610816, 'steps': 13597, 'loss/train': 1.9725139141082764}} 11/06/2021 23:07:35 - INFO - __main__ - Step 13604: {'lr': 0.0004924555016250908, 'samples': 2611968, 'steps': 13603, 'loss/train': 1.9242398738861084}} 11/06/2021 23:07:35 - INFO - __main__ - Step 13604: {'lr': 0.0004924555016250908, 'samples': 2611968, 'steps': 13603, 'loss/train': 1.9242398738861084}} 11/06/2021 23:07:39 - INFO - __main__ - Step 13609: {'lr': 0.0004924490309661918, 'samples': 2612928, 'steps': 13608, 'loss/train': 0.6431192755699158}} 11/06/2021 23:07:41 - INFO - __main__ - Step 13614: {'lr': 0.000492442557576198, 'samples': 2613888, 'steps': 13613, 'loss/train': 1.7973626852035522}}} 11/06/2021 23:07:43 - INFO - __main__ - Step 13618: {'lr': 0.0004924373768978638, 'samples': 2614656, 'steps': 13617, 'loss/train': 2.0704801082611084}} 11/06/2021 23:07:45 - INFO - __main__ - Step 13622: {'lr': 0.0004924321944717129, 'samples': 2615424, 'steps': 13621, 'loss/train': 1.9337289333343506}} 11/06/2021 23:07:47 - INFO - __main__ - Step 13626: {'lr': 0.0004924270102977827, 'samples': 2616192, 'steps': 13625, 'loss/train': 2.1077988147735596}} 11/06/2021 23:07:47 - INFO - __main__ - Step 13626: {'lr': 0.0004924270102977827, 'samples': 2616192, 'steps': 13625, 'loss/train': 2.1077988147735596}} 11/06/2021 23:07:51 - INFO - __main__ - Step 13634: {'lr': 0.000492416636706734, 'samples': 2617728, 'steps': 13633, 'loss/train': 1.84334397315979}96}} 11/06/2021 23:07:53 - INFO - __main__ - Step 13638: {'lr': 0.0004924114472896902, 'samples': 2618496, 'steps': 13637, 'loss/train': 1.547655701637268}}} 11/06/2021 23:07:55 - INFO - __main__ - Step 13642: {'lr': 0.0004924062561250167, 'samples': 2619264, 'steps': 13641, 'loss/train': 1.4436631202697754}} 11/06/2021 23:07:57 - INFO - __main__ - Step 13647: {'lr': 0.0004923997647116276, 'samples': 2620224, 'steps': 13646, 'loss/train': 0.61855548620224}4}} 11/06/2021 23:07:57 - INFO - __main__ - Step 13647: {'lr': 0.0004923997647116276, 'samples': 2620224, 'steps': 13646, 'loss/train': 0.61855548620224}4}} 11/06/2021 23:07:57 - INFO - __main__ - Step 13647: {'lr': 0.0004923997647116276, 'samples': 2620224, 'steps': 13646, 'loss/train': 0.61855548620224}4}} 11/06/2021 23:08:02 - INFO - __main__ - Step 13658: {'lr': 0.0004923854739907743, 'samples': 2622336, 'steps': 13657, 'loss/train': 1.2240080833435059}} 11/06/2021 23:08:05 - INFO - __main__ - Step 13663: {'lr': 0.0004923789738399152, 'samples': 2623296, 'steps': 13662, 'loss/train': 1.5047688484191895}} 11/06/2021 23:08:05 - INFO - __main__ - Step 13663: {'lr': 0.0004923789738399152, 'samples': 2623296, 'steps': 13662, 'loss/train': 1.5047688484191895}} 11/06/2021 23:08:09 - INFO - __main__ - Step 13670: {'lr': 0.0004923698690418154, 'samples': 2624640, 'steps': 13669, 'loss/train': 0.9396169781684875}} 11/06/2021 23:08:10 - INFO - __main__ - Step 13674: {'lr': 0.0004923646638974524, 'samples': 2625408, 'steps': 13673, 'loss/train': 1.3942358493804932}} 11/06/2021 23:08:13 - INFO - __main__ - Step 13679: {'lr': 0.0004923581550098733, 'samples': 2626368, 'steps': 13678, 'loss/train': 1.6954731941223145}} 11/06/2021 23:08:15 - INFO - __main__ - Step 13684: {'lr': 0.000492351643392223, 'samples': 2627328, 'steps': 13683, 'loss/train': 2.214824914932251}5}} 11/06/2021 23:08:17 - INFO - __main__ - Step 13688: {'lr': 0.0004923464321325008, 'samples': 2628096, 'steps': 13687, 'loss/train': 2.2057156562805176}} 11/06/2021 23:08:19 - INFO - __main__ - Step 13692: {'lr': 0.0004923412191256176, 'samples': 2628864, 'steps': 13691, 'loss/train': 1.6855508089065552}} 11/06/2021 23:08:21 - INFO - __main__ - Step 13696: {'lr': 0.000492336004371611, 'samples': 2629632, 'steps': 13695, 'loss/train': 1.891566276550293}2}} 11/06/2021 23:08:23 - INFO - __main__ - Step 13700: {'lr': 0.0004923307878705186, 'samples': 2630400, 'steps': 13699, 'loss/train': 1.4887804985046387}} 11/06/2021 23:08:25 - INFO - __main__ - Step 13704: {'lr': 0.000492325569622378, 'samples': 2631168, 'steps': 13703, 'loss/train': 1.5167887210845947}}} 11/06/2021 23:08:25 - INFO - __main__ - Step 13704: {'lr': 0.000492325569622378, 'samples': 2631168, 'steps': 13703, 'loss/train': 1.5167887210845947}}} 11/06/2021 23:08:29 - INFO - __main__ - Step 13712: {'lr': 0.0004923151278851025, 'samples': 2632704, 'steps': 13711, 'loss/train': 0.9735046029090881}} 11/06/2021 23:08:29 - INFO - __main__ - Step 13712: {'lr': 0.0004923151278851025, 'samples': 2632704, 'steps': 13711, 'loss/train': 0.9735046029090881}} 11/06/2021 23:08:33 - INFO - __main__ - Step 13719: {'lr': 0.0004923059856328447, 'samples': 2634048, 'steps': 13718, 'loss/train': 1.5310945510864258}} 11/06/2021 23:08:35 - INFO - __main__ - Step 13724: {'lr': 0.0004922994521772687, 'samples': 2635008, 'steps': 13723, 'loss/train': 1.856695532798767}}} 11/06/2021 23:08:35 - INFO - __main__ - Step 13724: {'lr': 0.0004922994521772687, 'samples': 2635008, 'steps': 13723, 'loss/train': 1.856695532798767}}} 11/06/2021 23:08:35 - INFO - __main__ - Step 13724: {'lr': 0.0004922994521772687, 'samples': 2635008, 'steps': 13723, 'loss/train': 1.856695532798767}}} 11/06/2021 23:08:41 - INFO - __main__ - Step 13735: {'lr': 0.0004922850689675823, 'samples': 2637120, 'steps': 13734, 'loss/train': 0.9880645275115967}} 11/06/2021 23:08:43 - INFO - __main__ - Step 13741: {'lr': 0.000492277218012765, 'samples': 2638272, 'steps': 13740, 'loss/train': 1.6142842769622803}}} 11/06/2021 23:08:43 - INFO - __main__ - Step 13741: {'lr': 0.000492277218012765, 'samples': 2638272, 'steps': 13740, 'loss/train': 1.6142842769622803}}} 11/06/2021 23:08:47 - INFO - __main__ - Step 13748: {'lr': 0.000492268053598417, 'samples': 2639616, 'steps': 13747, 'loss/train': 2.627610445022583}}}} 11/06/2021 23:08:49 - INFO - __main__ - Step 13752: {'lr': 0.0004922628143886358, 'samples': 2640384, 'steps': 13751, 'loss/train': 2.049717903137207}}} 11/06/2021 23:08:51 - INFO - __main__ - Step 13757: {'lr': 0.0004922562629203161, 'samples': 2641344, 'steps': 13756, 'loss/train': 1.271485447883606}}} 11/06/2021 23:08:54 - INFO - __main__ - Step 13762: {'lr': 0.0004922497087230732, 'samples': 2642304, 'steps': 13761, 'loss/train': 1.9483925104141235}} 11/06/2021 23:08:54 - INFO - __main__ - Step 13762: {'lr': 0.0004922497087230732, 'samples': 2642304, 'steps': 13761, 'loss/train': 1.9483925104141235}} 11/06/2021 23:08:58 - INFO - __main__ - Step 13769: {'lr': 0.0004922405282624825, 'samples': 2643648, 'steps': 13768, 'loss/train': 0.8302810788154602}} 11/06/2021 23:08:59 - INFO - __main__ - Step 13773: {'lr': 0.0004922352798836924, 'samples': 2644416, 'steps': 13772, 'loss/train': 2.0496914386749268}} 11/06/2021 23:09:01 - INFO - __main__ - Step 13777: {'lr': 0.0004922300297585428, 'samples': 2645184, 'steps': 13776, 'loss/train': 2.127047061920166}}} 11/06/2021 23:09:04 - INFO - __main__ - Step 13782: {'lr': 0.0004922234646463451, 'samples': 2646144, 'steps': 13781, 'loss/train': 1.9466784000396729}} 11/06/2021 23:09:06 - INFO - __main__ - Step 13786: {'lr': 0.0004922182105920246, 'samples': 2646912, 'steps': 13785, 'loss/train': 1.4474611282348633}} 11/06/2021 23:09:08 - INFO - __main__ - Step 13790: {'lr': 0.0004922129547914675, 'samples': 2647680, 'steps': 13789, 'loss/train': 2.097891330718994}}} 11/06/2021 23:09:09 - INFO - __main__ - Step 13794: {'lr': 0.0004922076972447117, 'samples': 2648448, 'steps': 13793, 'loss/train': 1.157942533493042}}} 11/06/2021 23:09:11 - INFO - __main__ - Step 13798: {'lr': 0.000492202437951795, 'samples': 2649216, 'steps': 13797, 'loss/train': 1.639959454536438}}}} 11/06/2021 23:09:14 - INFO - __main__ - Step 13803: {'lr': 0.0004921958613801683, 'samples': 2650176, 'steps': 13802, 'loss/train': 1.5847387313842773}} 11/06/2021 23:09:16 - INFO - __main__ - Step 13807: {'lr': 0.0004921905981585286, 'samples': 2650944, 'steps': 13806, 'loss/train': 1.3894824981689453}} 11/06/2021 23:09:18 - INFO - __main__ - Step 13811: {'lr': 0.0004921853331908512, 'samples': 2651712, 'steps': 13810, 'loss/train': 1.9072991609573364}} 11/06/2021 23:09:19 - INFO - __main__ - Step 13815: {'lr': 0.0004921800664771743, 'samples': 2652480, 'steps': 13814, 'loss/train': 1.7779452800750732}} 11/06/2021 23:09:21 - INFO - __main__ - Step 13819: {'lr': 0.0004921747980175357, 'samples': 2653248, 'steps': 13818, 'loss/train': 1.8573100566864014}} 11/06/2021 23:09:24 - INFO - __main__ - Step 13824: {'lr': 0.0004921682099877869, 'samples': 2654208, 'steps': 13823, 'loss/train': 1.7149298191070557}} 11/06/2021 23:09:26 - INFO - __main__ - Step 13828: {'lr': 0.0004921629375998736, 'samples': 2654976, 'steps': 13827, 'loss/train': 1.5501351356506348}} 11/06/2021 23:09:28 - INFO - __main__ - Step 13832: {'lr': 0.0004921576634661221, 'samples': 2655744, 'steps': 13831, 'loss/train': 1.670548439025879}}} 11/06/2021 23:09:30 - INFO - __main__ - Step 13836: {'lr': 0.0004921523875865706, 'samples': 2656512, 'steps': 13835, 'loss/train': 1.0246696472167969}} 11/06/2021 23:09:31 - INFO - __main__ - Step 13840: {'lr': 0.0004921471099612571, 'samples': 2657280, 'steps': 13839, 'loss/train': 1.2678571939468384}} 11/06/2021 23:09:33 - INFO - __main__ - Step 13844: {'lr': 0.0004921418305902194, 'samples': 2658048, 'steps': 13843, 'loss/train': 1.7268887758255005}} 11/06/2021 23:09:36 - INFO - __main__ - Step 13849: {'lr': 0.0004921352289215561, 'samples': 2659008, 'steps': 13848, 'loss/train': 1.6378384828567505}} 11/06/2021 23:09:38 - INFO - __main__ - Step 13853: {'lr': 0.0004921299456227785, 'samples': 2659776, 'steps': 13852, 'loss/train': 1.6967663764953613}} 11/06/2021 23:09:38 - INFO - __main__ - Step 13853: {'lr': 0.0004921299456227785, 'samples': 2659776, 'steps': 13852, 'loss/train': 1.6967663764953613}} 11/06/2021 23:09:41 - INFO - __main__ - Step 13860: {'lr': 0.0004921206956495903, 'samples': 2661120, 'steps': 13859, 'loss/train': 0.5547469854354858}} 11/06/2021 23:09:44 - INFO - __main__ - Step 13866: {'lr': 0.0004921127628463972, 'samples': 2662272, 'steps': 13865, 'loss/train': 1.8773669004440308}} 11/06/2021 23:09:46 - INFO - __main__ - Step 13870: {'lr': 0.0004921074721290819, 'samples': 2663040, 'steps': 13869, 'loss/train': 1.8711230754852295}} 11/06/2021 23:09:46 - INFO - __main__ - Step 13870: {'lr': 0.0004921074721290819, 'samples': 2663040, 'steps': 13869, 'loss/train': 1.8711230754852295}} 11/06/2021 23:09:49 - INFO - __main__ - Step 13877: {'lr': 0.000492098209173842, 'samples': 2664384, 'steps': 13876, 'loss/train': 1.0459492206573486}}} 11/06/2021 23:09:51 - INFO - __main__ - Step 13881: {'lr': 0.0004920929136566632, 'samples': 2665152, 'steps': 13880, 'loss/train': 1.556997299194336}}} 11/06/2021 23:09:53 - INFO - __main__ - Step 13885: {'lr': 0.0004920876163941511, 'samples': 2665920, 'steps': 13884, 'loss/train': 1.1255569458007812}} 11/06/2021 23:09:56 - INFO - __main__ - Step 13889: {'lr': 0.0004920823173863439, 'samples': 2666688, 'steps': 13888, 'loss/train': 1.4370955228805542}} 11/06/2021 23:09:57 - INFO - __main__ - Step 13893: {'lr': 0.0004920770166332798, 'samples': 2667456, 'steps': 13892, 'loss/train': 1.972109317779541}}} 11/06/2021 23:10:00 - INFO - __main__ - Step 13898: {'lr': 0.0004920703882377403, 'samples': 2668416, 'steps': 13897, 'loss/train': 1.741361141204834}}} 11/06/2021 23:10:02 - INFO - __main__ - Step 13902: {'lr': 0.000492065083557988, 'samples': 2669184, 'steps': 13901, 'loss/train': 1.932618260383606}}}} 11/06/2021 23:10:02 - INFO - __main__ - Step 13902: {'lr': 0.000492065083557988, 'samples': 2669184, 'steps': 13901, 'loss/train': 1.932618260383606}}}} 11/06/2021 23:10:02 - INFO - __main__ - Step 13902: {'lr': 0.000492065083557988, 'samples': 2669184, 'steps': 13901, 'loss/train': 1.932618260383606}}}} 11/06/2021 23:10:07 - INFO - __main__ - Step 13912: {'lr': 0.0004920518142237352, 'samples': 2671104, 'steps': 13911, 'loss/train': 1.33417809009552}}}} 11/06/2021 23:10:10 - INFO - __main__ - Step 13917: {'lr': 0.000492045175466641, 'samples': 2672064, 'steps': 13916, 'loss/train': 1.871131181716919}}}} 11/06/2021 23:10:12 - INFO - __main__ - Step 13921: {'lr': 0.0004920398624978493, 'samples': 2672832, 'steps': 13920, 'loss/train': 1.4717364311218262}} 11/06/2021 23:10:14 - INFO - __main__ - Step 13925: {'lr': 0.0004920345477841067, 'samples': 2673600, 'steps': 13924, 'loss/train': 2.0693888664245605}} 11/06/2021 23:10:16 - INFO - __main__ - Step 13929: {'lr': 0.0004920292313254516, 'samples': 2674368, 'steps': 13928, 'loss/train': 2.1696088314056396}} 11/06/2021 23:10:17 - INFO - __main__ - Step 13933: {'lr': 0.0004920239131219223, 'samples': 2675136, 'steps': 13932, 'loss/train': 0.7648064494132996}} 11/06/2021 23:10:19 - INFO - __main__ - Step 13937: {'lr': 0.0004920185931735572, 'samples': 2675904, 'steps': 13936, 'loss/train': 1.8334205150604248}} 11/06/2021 23:10:19 - INFO - __main__ - Step 13937: {'lr': 0.0004920185931735572, 'samples': 2675904, 'steps': 13936, 'loss/train': 1.8334205150604248}} 11/06/2021 23:10:24 - INFO - __main__ - Step 13946: {'lr': 0.0004920066169103783, 'samples': 2677632, 'steps': 13945, 'loss/train': 1.0051872730255127}} 11/06/2021 23:10:26 - INFO - __main__ - Step 13950: {'lr': 0.0004920012912915616, 'samples': 2678400, 'steps': 13949, 'loss/train': 1.9164284467697144}} 11/06/2021 23:10:27 - INFO - __main__ - Step 13954: {'lr': 0.0004919959639280722, 'samples': 2679168, 'steps': 13953, 'loss/train': 3.453373432159424}}} 11/06/2021 23:10:29 - INFO - __main__ - Step 13958: {'lr': 0.0004919906348199483, 'samples': 2679936, 'steps': 13957, 'loss/train': 1.8448896408081055}} 11/06/2021 23:10:32 - INFO - __main__ - Step 13963: {'lr': 0.00049198397098146, 'samples': 2680896, 'steps': 13962, 'loss/train': 2.475085496902466}55}} 11/06/2021 23:10:34 - INFO - __main__ - Step 13967: {'lr': 0.0004919786379480494, 'samples': 2681664, 'steps': 13966, 'loss/train': 2.0461950302124023}} 11/06/2021 23:10:34 - INFO - __main__ - Step 13967: {'lr': 0.0004919786379480494, 'samples': 2681664, 'steps': 13966, 'loss/train': 2.0461950302124023}} 11/06/2021 23:10:37 - INFO - __main__ - Step 13974: {'lr': 0.0004919693009418782, 'samples': 2683008, 'steps': 13973, 'loss/train': 2.224214553833008}}} 11/06/2021 23:10:40 - INFO - __main__ - Step 13979: {'lr': 0.0004919626283809149, 'samples': 2683968, 'steps': 13978, 'loss/train': 1.7867261171340942}} 11/06/2021 23:10:40 - INFO - __main__ - Step 13979: {'lr': 0.0004919626283809149, 'samples': 2683968, 'steps': 13978, 'loss/train': 1.7867261171340942}} 11/06/2021 23:10:44 - INFO - __main__ - Step 13987: {'lr': 0.0004919519466141242, 'samples': 2685504, 'steps': 13986, 'loss/train': 2.206554889678955}}} 11/06/2021 23:10:45 - INFO - __main__ - Step 13991: {'lr': 0.0004919466031142342, 'samples': 2686272, 'steps': 13990, 'loss/train': 1.2076705694198608}} 11/06/2021 23:10:48 - INFO - __main__ - Step 13995: {'lr': 0.0004919412578700654, 'samples': 2687040, 'steps': 13994, 'loss/train': 1.2231806516647339}} 11/06/2021 23:10:50 - INFO - __main__ - Step 14000: {'lr': 0.0004919345738620218, 'samples': 2688000, 'steps': 13999, 'loss/train': 1.2765216827392578}} 11/06/2021 23:10:52 - INFO - __main__ - Step 14004: {'lr': 0.0004919292246933675, 'samples': 2688768, 'steps': 14003, 'loss/train': 1.9574697017669678}} 11/06/2021 23:10:52 - INFO - __main__ - Step 14004: {'lr': 0.0004919292246933675, 'samples': 2688768, 'steps': 14003, 'loss/train': 1.9574697017669678}} 11/06/2021 23:10:55 - INFO - __main__ - Step 14011: {'lr': 0.0004919198594513771, 'samples': 2690112, 'steps': 14010, 'loss/train': 1.3401371240615845}} 11/06/2021 23:10:58 - INFO - __main__ - Step 14016: {'lr': 0.0004919131667226398, 'samples': 2691072, 'steps': 14015, 'loss/train': 1.5199148654937744}} 11/06/2021 23:11:00 - INFO - __main__ - Step 14021: {'lr': 0.0004919064712688439, 'samples': 2692032, 'steps': 14020, 'loss/train': 1.8053239583969116}} 11/06/2021 23:11:02 - INFO - __main__ - Step 14025: {'lr': 0.0004919011129438158, 'samples': 2692800, 'steps': 14024, 'loss/train': 1.4526000022888184}} 11/06/2021 23:11:04 - INFO - __main__ - Step 14029: {'lr': 0.0004918957528748371, 'samples': 2693568, 'steps': 14028, 'loss/train': 1.5124090909957886}} 11/06/2021 23:11:06 - INFO - __main__ - Step 14033: {'lr': 0.0004918903910619465, 'samples': 2694336, 'steps': 14032, 'loss/train': 1.157104253768921}}} 11/06/2021 23:11:08 - INFO - __main__ - Step 14037: {'lr': 0.0004918850275051829, 'samples': 2695104, 'steps': 14036, 'loss/train': 2.457850217819214}}} 11/06/2021 23:11:10 - INFO - __main__ - Step 14042: {'lr': 0.0004918783206069652, 'samples': 2696064, 'steps': 14041, 'loss/train': 1.7770543098449707}} 11/06/2021 23:11:12 - INFO - __main__ - Step 14046: {'lr': 0.000491872953126628, 'samples': 2696832, 'steps': 14045, 'loss/train': 1.677998661994934}7}} 11/06/2021 23:11:12 - INFO - __main__ - Step 14046: {'lr': 0.000491872953126628, 'samples': 2696832, 'steps': 14045, 'loss/train': 1.677998661994934}7}} 11/06/2021 23:11:15 - INFO - __main__ - Step 14053: {'lr': 0.0004918635558401687, 'samples': 2698176, 'steps': 14052, 'loss/train': 2.0729172229766846}} 11/06/2021 23:11:18 - INFO - __main__ - Step 14058: {'lr': 0.0004918568402232863, 'samples': 2699136, 'steps': 14057, 'loss/train': 1.5368565320968628}} 11/06/2021 23:11:18 - INFO - __main__ - Step 14058: {'lr': 0.0004918568402232863, 'samples': 2699136, 'steps': 14057, 'loss/train': 1.5368565320968628}} 11/06/2021 23:11:22 - INFO - __main__ - Step 14066: {'lr': 0.0004918460895695037, 'samples': 2700672, 'steps': 14065, 'loss/train': 1.585329294204712}}} 11/06/2021 23:11:24 - INFO - __main__ - Step 14070: {'lr': 0.0004918407116272622, 'samples': 2701440, 'steps': 14069, 'loss/train': 1.0781822204589844}} 11/06/2021 23:11:26 - INFO - __main__ - Step 14075: {'lr': 0.0004918339867476469, 'samples': 2702400, 'steps': 14074, 'loss/train': 1.5994923114776611}} 11/06/2021 23:11:26 - INFO - __main__ - Step 14075: {'lr': 0.0004918339867476469, 'samples': 2702400, 'steps': 14074, 'loss/train': 1.5994923114776611}} 11/06/2021 23:11:26 - INFO - __main__ - Step 14075: {'lr': 0.0004918339867476469, 'samples': 2702400, 'steps': 14074, 'loss/train': 1.5994923114776611}} 11/06/2021 23:11:32 - INFO - __main__ - Step 14086: {'lr': 0.0004918191824235335, 'samples': 2704512, 'steps': 14085, 'loss/train': 1.54026198387146}1}} 11/06/2021 23:11:34 - INFO - __main__ - Step 14091: {'lr': 0.000491812448826852, 'samples': 2705472, 'steps': 14090, 'loss/train': 1.8842805624008179}}} 11/06/2021 23:11:36 - INFO - __main__ - Step 14095: {'lr': 0.0004918070599882778, 'samples': 2706240, 'steps': 14094, 'loss/train': 1.7472896575927734}} 11/06/2021 23:11:38 - INFO - __main__ - Step 14099: {'lr': 0.0004918016694064313, 'samples': 2707008, 'steps': 14098, 'loss/train': 1.9474259614944458}} 11/06/2021 23:11:40 - INFO - __main__ - Step 14103: {'lr': 0.000491796277081351, 'samples': 2707776, 'steps': 14102, 'loss/train': 1.817934274673462}8}} 11/06/2021 23:11:42 - INFO - __main__ - Step 14107: {'lr': 0.000491790883013076, 'samples': 2708544, 'steps': 14106, 'loss/train': 1.3012551069259644}}} 11/06/2021 23:11:44 - INFO - __main__ - Step 14111: {'lr': 0.0004917854872016451, 'samples': 2709312, 'steps': 14110, 'loss/train': 1.7014143466949463}} 11/06/2021 23:11:46 - INFO - __main__ - Step 14115: {'lr': 0.0004917800896470974, 'samples': 2710080, 'steps': 14114, 'loss/train': 1.9659100770950317}} 11/06/2021 23:11:47 - INFO - __main__ - Step 14119: {'lr': 0.0004917746903494717, 'samples': 2710848, 'steps': 14118, 'loss/train': 0.9476760029792786}} 11/06/2021 23:11:49 - INFO - __main__ - Step 14123: {'lr': 0.0004917692893088067, 'samples': 2711616, 'steps': 14122, 'loss/train': 1.9275217056274414}} 11/06/2021 23:11:49 - INFO - __main__ - Step 14123: {'lr': 0.0004917692893088067, 'samples': 2711616, 'steps': 14122, 'loss/train': 1.9275217056274414}} 11/06/2021 23:11:54 - INFO - __main__ - Step 14131: {'lr': 0.0004917584819985153, 'samples': 2713152, 'steps': 14130, 'loss/train': 1.5905683040618896}} 11/06/2021 23:11:54 - INFO - __main__ - Step 14131: {'lr': 0.0004917584819985153, 'samples': 2713152, 'steps': 14130, 'loss/train': 1.5905683040618896}} 11/06/2021 23:11:57 - INFO - __main__ - Step 14138: {'lr': 0.000491749019883036, 'samples': 2714496, 'steps': 14137, 'loss/train': 1.6772507429122925}}} 11/06/2021 23:11:57 - INFO - __main__ - Step 14138: {'lr': 0.000491749019883036, 'samples': 2714496, 'steps': 14137, 'loss/train': 1.6772507429122925}}} 11/06/2021 23:12:02 - INFO - __main__ - Step 14147: {'lr': 0.0004917368464631772, 'samples': 2716224, 'steps': 14146, 'loss/train': 1.4266034364700317}} 11/06/2021 23:12:04 - INFO - __main__ - Step 14151: {'lr': 0.0004917314332223295, 'samples': 2716992, 'steps': 14150, 'loss/train': 1.8678841590881348}} 11/06/2021 23:12:05 - INFO - __main__ - Step 14155: {'lr': 0.0004917260182387545, 'samples': 2717760, 'steps': 14154, 'loss/train': 1.4603012800216675}} 11/06/2021 23:12:08 - INFO - __main__ - Step 14159: {'lr': 0.0004917206015124913, 'samples': 2718528, 'steps': 14158, 'loss/train': 0.9594613313674927}} 11/06/2021 23:12:10 - INFO - __main__ - Step 14164: {'lr': 0.0004917138281540664, 'samples': 2719488, 'steps': 14163, 'loss/train': 2.4625284671783447}} 11/06/2021 23:12:10 - INFO - __main__ - Step 14164: {'lr': 0.0004917138281540664, 'samples': 2719488, 'steps': 14163, 'loss/train': 2.4625284671783447}} 11/06/2021 23:12:13 - INFO - __main__ - Step 14171: {'lr': 0.0004917043408779629, 'samples': 2720832, 'steps': 14170, 'loss/train': 1.746435523033142}}} 11/06/2021 23:12:15 - INFO - __main__ - Step 14175: {'lr': 0.0004916989171813374, 'samples': 2721600, 'steps': 14174, 'loss/train': 1.6893455982208252}} 11/06/2021 23:12:18 - INFO - __main__ - Step 14180: {'lr': 0.0004916921351101796, 'samples': 2722560, 'steps': 14179, 'loss/train': 1.529984951019287}}} 11/06/2021 23:12:18 - INFO - __main__ - Step 14180: {'lr': 0.0004916921351101796, 'samples': 2722560, 'steps': 14179, 'loss/train': 1.529984951019287}}} 11/06/2021 23:12:22 - INFO - __main__ - Step 14188: {'lr': 0.0004916812781334161, 'samples': 2724096, 'steps': 14187, 'loss/train': 1.5331050157546997}} 11/06/2021 23:12:23 - INFO - __main__ - Step 14192: {'lr': 0.0004916758470314662, 'samples': 2724864, 'steps': 14191, 'loss/train': 1.8545008897781372}} 11/06/2021 23:12:25 - INFO - __main__ - Step 14196: {'lr': 0.0004916704141871899, 'samples': 2725632, 'steps': 14195, 'loss/train': 1.6380163431167603}} 11/06/2021 23:12:28 - INFO - __main__ - Step 14201: {'lr': 0.0004916636206817575, 'samples': 2726592, 'steps': 14200, 'loss/train': 1.805939793586731}}} 11/06/2021 23:12:28 - INFO - __main__ - Step 14201: {'lr': 0.0004916636206817575, 'samples': 2726592, 'steps': 14200, 'loss/train': 1.805939793586731}}} 11/06/2021 23:12:31 - INFO - __main__ - Step 14208: {'lr': 0.0004916541052007936, 'samples': 2727936, 'steps': 14207, 'loss/train': 1.0482618808746338}} 11/06/2021 23:12:33 - INFO - __main__ - Step 14212: {'lr': 0.0004916486653876029, 'samples': 2728704, 'steps': 14211, 'loss/train': 1.552703619003296}}} 11/06/2021 23:12:36 - INFO - __main__ - Step 14217: {'lr': 0.0004916418631712481, 'samples': 2729664, 'steps': 14216, 'loss/train': 1.3125375509262085}} 11/06/2021 23:12:38 - INFO - __main__ - Step 14221: {'lr': 0.000491636419438319, 'samples': 2730432, 'steps': 14220, 'loss/train': 1.322358250617981}5}} 11/06/2021 23:12:40 - INFO - __main__ - Step 14225: {'lr': 0.0004916309739633475, 'samples': 2731200, 'steps': 14224, 'loss/train': 1.5547466278076172}} 11/06/2021 23:12:40 - INFO - __main__ - Step 14225: {'lr': 0.0004916309739633475, 'samples': 2731200, 'steps': 14224, 'loss/train': 1.5547466278076172}} 11/06/2021 23:12:43 - INFO - __main__ - Step 14232: {'lr': 0.0004916214401904763, 'samples': 2732544, 'steps': 14231, 'loss/train': 1.461196780204773}}} 11/06/2021 23:12:46 - INFO - __main__ - Step 14237: {'lr': 0.0004916146270865721, 'samples': 2733504, 'steps': 14236, 'loss/train': 2.5321567058563232}} 11/06/2021 23:12:46 - INFO - __main__ - Step 14237: {'lr': 0.0004916146270865721, 'samples': 2733504, 'steps': 14236, 'loss/train': 2.5321567058563232}} 11/06/2021 23:12:49 - INFO - __main__ - Step 14244: {'lr': 0.0004916050841686748, 'samples': 2734848, 'steps': 14243, 'loss/train': 1.6944113969802856}} 11/06/2021 23:12:52 - INFO - __main__ - Step 14249: {'lr': 0.0004915982645328304, 'samples': 2735808, 'steps': 14248, 'loss/train': 1.6930842399597168}} 11/06/2021 23:12:52 - INFO - __main__ - Step 14249: {'lr': 0.0004915982645328304, 'samples': 2735808, 'steps': 14248, 'loss/train': 1.6930842399597168}} 11/06/2021 23:12:57 - INFO - __main__ - Step 14258: {'lr': 0.0004915859823301535, 'samples': 2737536, 'steps': 14257, 'loss/train': 1.91822350025177}8}} 11/06/2021 23:12:58 - INFO - __main__ - Step 14262: {'lr': 0.0004915805207431537, 'samples': 2738304, 'steps': 14261, 'loss/train': 1.4041943550109863}} 11/06/2021 23:13:00 - INFO - __main__ - Step 14266: {'lr': 0.0004915750574145148, 'samples': 2739072, 'steps': 14265, 'loss/train': 1.9792180061340332}} 11/06/2021 23:13:03 - INFO - __main__ - Step 14271: {'lr': 0.0004915682258045958, 'samples': 2740032, 'steps': 14270, 'loss/train': 1.8175733089447021}} 11/06/2021 23:13:05 - INFO - __main__ - Step 14275: {'lr': 0.0004915627585574124, 'samples': 2740800, 'steps': 14274, 'loss/train': 1.643608570098877}}} 11/06/2021 23:13:07 - INFO - __main__ - Step 14279: {'lr': 0.0004915572895687179, 'samples': 2741568, 'steps': 14278, 'loss/train': 1.9221125841140747}} 11/06/2021 23:13:09 - INFO - __main__ - Step 14283: {'lr': 0.0004915518188385514, 'samples': 2742336, 'steps': 14282, 'loss/train': 1.557709813117981}}} 11/06/2021 23:13:10 - INFO - __main__ - Step 14287: {'lr': 0.0004915463463669527, 'samples': 2743104, 'steps': 14286, 'loss/train': 2.307543992996216}}} 11/06/2021 23:13:13 - INFO - __main__ - Step 14292: {'lr': 0.0004915395033286251, 'samples': 2744064, 'steps': 14291, 'loss/train': 1.8184715509414673}} 11/06/2021 23:13:15 - INFO - __main__ - Step 14296: {'lr': 0.000491534026938948, 'samples': 2744832, 'steps': 14295, 'loss/train': 1.8665683269500732}}} 11/06/2021 23:13:17 - INFO - __main__ - Step 14300: {'lr': 0.0004915285488079666, 'samples': 2745600, 'steps': 14299, 'loss/train': 1.6357394456863403}} 11/06/2021 23:13:19 - INFO - __main__ - Step 14304: {'lr': 0.0004915230689357206, 'samples': 2746368, 'steps': 14303, 'loss/train': 1.9127839803695679}} 11/06/2021 23:13:20 - INFO - __main__ - Step 14308: {'lr': 0.0004915175873222497, 'samples': 2747136, 'steps': 14307, 'loss/train': 1.7802067995071411}} 11/06/2021 23:13:20 - INFO - __main__ - Step 14308: {'lr': 0.0004915175873222497, 'samples': 2747136, 'steps': 14307, 'loss/train': 1.7802067995071411}} 11/06/2021 23:13:24 - INFO - __main__ - Step 14316: {'lr': 0.0004915066188717905, 'samples': 2748672, 'steps': 14315, 'loss/train': 1.671004056930542}}} 11/06/2021 23:13:24 - INFO - __main__ - Step 14316: {'lr': 0.0004915066188717905, 'samples': 2748672, 'steps': 14315, 'loss/train': 1.671004056930542}}} 11/06/2021 23:13:28 - INFO - __main__ - Step 14323: {'lr': 0.0004914970157646222, 'samples': 2750016, 'steps': 14322, 'loss/train': 1.5857484340667725}} 11/06/2021 23:13:30 - INFO - __main__ - Step 14328: {'lr': 0.0004914901531379019, 'samples': 2750976, 'steps': 14327, 'loss/train': 1.843706727027893}}} 11/06/2021 23:13:30 - INFO - __main__ - Step 14328: {'lr': 0.0004914901531379019, 'samples': 2750976, 'steps': 14327, 'loss/train': 1.843706727027893}}} 11/06/2021 23:13:34 - INFO - __main__ - Step 14336: {'lr': 0.0004914791672769713, 'samples': 2752512, 'steps': 14335, 'loss/train': 1.6169592142105103}} 11/06/2021 23:13:36 - INFO - __main__ - Step 14340: {'lr': 0.0004914736717351233, 'samples': 2753280, 'steps': 14339, 'loss/train': 0.8791477084159851}} 11/06/2021 23:13:38 - INFO - __main__ - Step 14344: {'lr': 0.0004914681744524064, 'samples': 2754048, 'steps': 14343, 'loss/train': 1.9771595001220703}} 11/06/2021 23:13:40 - INFO - __main__ - Step 14349: {'lr': 0.0004914613004009736, 'samples': 2755008, 'steps': 14348, 'loss/train': 1.3072386980056763}} 11/06/2021 23:13:40 - INFO - __main__ - Step 14349: {'lr': 0.0004914613004009736, 'samples': 2755008, 'steps': 14348, 'loss/train': 1.3072386980056763}} 11/06/2021 23:13:44 - INFO - __main__ - Step 14356: {'lr': 0.0004914516721594382, 'samples': 2756352, 'steps': 14355, 'loss/train': 1.8118442296981812}} 11/06/2021 23:13:46 - INFO - __main__ - Step 14360: {'lr': 0.0004914461679136419, 'samples': 2757120, 'steps': 14359, 'loss/train': 1.5644958019256592}} 11/06/2021 23:13:48 - INFO - __main__ - Step 14365: {'lr': 0.0004914392851585829, 'samples': 2758080, 'steps': 14364, 'loss/train': 1.7351176738739014}} 11/06/2021 23:13:51 - INFO - __main__ - Step 14370: {'lr': 0.0004914323996838036, 'samples': 2759040, 'steps': 14369, 'loss/train': 1.368338942527771}}} 11/06/2021 23:13:53 - INFO - __main__ - Step 14374: {'lr': 0.0004914268893458336, 'samples': 2759808, 'steps': 14373, 'loss/train': 1.7623484134674072}} 11/06/2021 23:13:53 - INFO - __main__ - Step 14374: {'lr': 0.0004914268893458336, 'samples': 2759808, 'steps': 14373, 'loss/train': 1.7623484134674072}} 11/06/2021 23:13:57 - INFO - __main__ - Step 14381: {'lr': 0.0004914172420662556, 'samples': 2761152, 'steps': 14380, 'loss/train': 1.8425896167755127}} 11/06/2021 23:13:59 - INFO - __main__ - Step 14385: {'lr': 0.000491411726941919, 'samples': 2761920, 'steps': 14384, 'loss/train': 1.7564046382904053}}} 11/06/2021 23:14:00 - INFO - __main__ - Step 14389: {'lr': 0.00049140621007716, 'samples': 2762688, 'steps': 14388, 'loss/train': 1.5696700811386108}}}} 11/06/2021 23:14:02 - INFO - __main__ - Step 14393: {'lr': 0.0004914006914720184, 'samples': 2763456, 'steps': 14392, 'loss/train': 1.1342967748641968}} 11/06/2021 23:14:05 - INFO - __main__ - Step 14398: {'lr': 0.0004913937907682391, 'samples': 2764416, 'steps': 14397, 'loss/train': 1.6365022659301758}} 11/06/2021 23:14:07 - INFO - __main__ - Step 14402: {'lr': 0.0004913882682473821, 'samples': 2765184, 'steps': 14401, 'loss/train': 1.6088601350784302}} 11/06/2021 23:14:09 - INFO - __main__ - Step 14406: {'lr': 0.000491382743986272, 'samples': 2765952, 'steps': 14405, 'loss/train': 2.084760904312134}2}} 11/06/2021 23:14:10 - INFO - __main__ - Step 14410: {'lr': 0.0004913772179849483, 'samples': 2766720, 'steps': 14409, 'loss/train': 1.7048285007476807}} 11/06/2021 23:14:12 - INFO - __main__ - Step 14414: {'lr': 0.000491371690243451, 'samples': 2767488, 'steps': 14413, 'loss/train': 1.3805193901062012}}} 11/06/2021 23:14:15 - INFO - __main__ - Step 14419: {'lr': 0.0004913647781195212, 'samples': 2768448, 'steps': 14418, 'loss/train': 1.5809930562973022}} 11/06/2021 23:14:15 - INFO - __main__ - Step 14419: {'lr': 0.0004913647781195212, 'samples': 2768448, 'steps': 14418, 'loss/train': 1.5809930562973022}} 11/06/2021 23:14:18 - INFO - __main__ - Step 14426: {'lr': 0.0004913550965783165, 'samples': 2769792, 'steps': 14425, 'loss/train': 1.8378410339355469}} 11/06/2021 23:14:20 - INFO - __main__ - Step 14430: {'lr': 0.0004913495618765235, 'samples': 2770560, 'steps': 14429, 'loss/train': 1.6800792217254639}} 11/06/2021 23:14:23 - INFO - __main__ - Step 14435: {'lr': 0.0004913426410524482, 'samples': 2771520, 'steps': 14434, 'loss/train': 0.5304341316223145}} 11/06/2021 23:14:25 - INFO - __main__ - Step 14439: {'lr': 0.0004913371024357694, 'samples': 2772288, 'steps': 14438, 'loss/train': 1.7349668741226196}} 11/06/2021 23:14:27 - INFO - __main__ - Step 14443: {'lr': 0.0004913315620792061, 'samples': 2773056, 'steps': 14442, 'loss/train': 1.784220576286316}}} 11/06/2021 23:14:28 - INFO - __main__ - Step 14447: {'lr': 0.0004913260199827986, 'samples': 2773824, 'steps': 14446, 'loss/train': 2.1198136806488037}} 11/06/2021 23:14:31 - INFO - __main__ - Step 14452: {'lr': 0.0004913190899156936, 'samples': 2774784, 'steps': 14451, 'loss/train': 1.592616081237793}}} 11/06/2021 23:14:33 - INFO - __main__ - Step 14456: {'lr': 0.0004913135439047821, 'samples': 2775552, 'steps': 14455, 'loss/train': 1.7716397047042847}} 11/06/2021 23:14:35 - INFO - __main__ - Step 14460: {'lr': 0.000491307996154156, 'samples': 2776320, 'steps': 14459, 'loss/train': 1.87995445728302}47}} 11/06/2021 23:14:37 - INFO - __main__ - Step 14464: {'lr': 0.0004913024466638553, 'samples': 2777088, 'steps': 14463, 'loss/train': 1.7339403629302979}} 11/06/2021 23:14:38 - INFO - __main__ - Step 14468: {'lr': 0.0004912968954339202, 'samples': 2777856, 'steps': 14467, 'loss/train': 1.555679202079773}}} 11/06/2021 23:14:40 - INFO - __main__ - Step 14472: {'lr': 0.0004912913424643904, 'samples': 2778624, 'steps': 14471, 'loss/train': 1.8691980838775635}} 11/06/2021 23:14:40 - INFO - __main__ - Step 14472: {'lr': 0.0004912913424643904, 'samples': 2778624, 'steps': 14471, 'loss/train': 1.8691980838775635}} 11/06/2021 23:14:45 - INFO - __main__ - Step 14480: {'lr': 0.0004912802313067076, 'samples': 2780160, 'steps': 14479, 'loss/train': 1.160788893699646}}} 11/06/2021 23:14:46 - INFO - __main__ - Step 14484: {'lr': 0.0004912746731186346, 'samples': 2780928, 'steps': 14483, 'loss/train': 1.4715304374694824}} 11/06/2021 23:14:49 - INFO - __main__ - Step 14488: {'lr': 0.0004912691131911272, 'samples': 2781696, 'steps': 14487, 'loss/train': 1.073344111442566}}} 11/06/2021 23:14:49 - INFO - __main__ - Step 14488: {'lr': 0.0004912691131911272, 'samples': 2781696, 'steps': 14487, 'loss/train': 1.073344111442566}}} 11/06/2021 23:14:52 - INFO - __main__ - Step 14495: {'lr': 0.0004912593791325962, 'samples': 2783040, 'steps': 14494, 'loss/train': 0.6017135977745056}} 11/06/2021 23:14:54 - INFO - __main__ - Step 14500: {'lr': 0.0004912524229724002, 'samples': 2784000, 'steps': 14499, 'loss/train': 1.174378752708435}}} 11/06/2021 23:14:57 - INFO - __main__ - Step 14505: {'lr': 0.0004912454640945889, 'samples': 2784960, 'steps': 14504, 'loss/train': 1.8055917024612427}} 11/06/2021 23:14:59 - INFO - __main__ - Step 14509: {'lr': 0.0004912398950357094, 'samples': 2785728, 'steps': 14508, 'loss/train': 2.2110788822174072}} 11/06/2021 23:14:59 - INFO - __main__ - Step 14509: {'lr': 0.0004912398950357094, 'samples': 2785728, 'steps': 14508, 'loss/train': 2.2110788822174072}} 11/06/2021 23:15:02 - INFO - __main__ - Step 14516: {'lr': 0.0004912301449977837, 'samples': 2787072, 'steps': 14515, 'loss/train': 1.7214689254760742}} 11/06/2021 23:15:05 - INFO - __main__ - Step 14521: {'lr': 0.0004912231774241298, 'samples': 2788032, 'steps': 14520, 'loss/train': 1.797451138496399}}} 11/06/2021 23:15:07 - INFO - __main__ - Step 14526: {'lr': 0.0004912162071331898, 'samples': 2788992, 'steps': 14525, 'loss/train': 1.6764172315597534}} 11/06/2021 23:15:09 - INFO - __main__ - Step 14530: {'lr': 0.0004912106289440446, 'samples': 2789760, 'steps': 14529, 'loss/train': 1.904273271560669}}} 11/06/2021 23:15:11 - INFO - __main__ - Step 14534: {'lr': 0.0004912050490159268, 'samples': 2790528, 'steps': 14533, 'loss/train': 1.7847373485565186}} 11/06/2021 23:15:12 - INFO - __main__ - Step 14538: {'lr': 0.0004911994673488766, 'samples': 2791296, 'steps': 14537, 'loss/train': 1.377058982849121}}} 11/06/2021 23:15:15 - INFO - __main__ - Step 14542: {'lr': 0.0004911938839429344, 'samples': 2792064, 'steps': 14541, 'loss/train': 1.8177210092544556}} 11/06/2021 23:15:17 - INFO - __main__ - Step 14547: {'lr': 0.0004911869022402508, 'samples': 2793024, 'steps': 14546, 'loss/train': 1.8774888515472412}} 11/06/2021 23:15:19 - INFO - __main__ - Step 14551: {'lr': 0.0004911813149219485, 'samples': 2793792, 'steps': 14550, 'loss/train': 1.7999236583709717}} 11/06/2021 23:15:19 - INFO - __main__ - Step 14551: {'lr': 0.0004911813149219485, 'samples': 2793792, 'steps': 14550, 'loss/train': 1.7999236583709717}} 11/06/2021 23:15:22 - INFO - __main__ - Step 14557: {'lr': 0.0004911729306843302, 'samples': 2794944, 'steps': 14556, 'loss/train': 1.9681403636932373}} 11/06/2021 23:15:25 - INFO - __main__ - Step 14563: {'lr': 0.000491164542534635, 'samples': 2796096, 'steps': 14562, 'loss/train': 1.9331468343734741}}} 11/06/2021 23:15:27 - INFO - __main__ - Step 14567: {'lr': 0.0004911589482615294, 'samples': 2796864, 'steps': 14566, 'loss/train': 0.8703035116195679}} 11/06/2021 23:15:29 - INFO - __main__ - Step 14571: {'lr': 0.0004911533522498239, 'samples': 2797632, 'steps': 14570, 'loss/train': 1.7870599031448364}} 11/06/2021 23:15:30 - INFO - __main__ - Step 14575: {'lr': 0.0004911477544995585, 'samples': 2798400, 'steps': 14574, 'loss/train': 1.705611228942871}}} 11/06/2021 23:15:32 - INFO - __main__ - Step 14579: {'lr': 0.0004911421550107739, 'samples': 2799168, 'steps': 14578, 'loss/train': 1.5826555490493774}} 11/06/2021 23:15:35 - INFO - __main__ - Step 14584: {'lr': 0.000491135153205062, 'samples': 2800128, 'steps': 14583, 'loss/train': 1.6225132942199707}}} 11/06/2021 23:15:37 - INFO - __main__ - Step 14588: {'lr': 0.0004911295498047565, 'samples': 2800896, 'steps': 14587, 'loss/train': 2.0302164554595947}} 11/06/2021 23:15:39 - INFO - __main__ - Step 14592: {'lr': 0.000491123944666063, 'samples': 2801664, 'steps': 14591, 'loss/train': 1.9390480518341064}}} 11/06/2021 23:15:41 - INFO - __main__ - Step 14596: {'lr': 0.0004911183377890218, 'samples': 2802432, 'steps': 14595, 'loss/train': 1.3001048564910889}} 11/06/2021 23:15:43 - INFO - __main__ - Step 14600: {'lr': 0.0004911127291736735, 'samples': 2803200, 'steps': 14599, 'loss/train': 1.4260157346725464}} 11/06/2021 23:15:45 - INFO - __main__ - Step 14605: {'lr': 0.0004911057159600551, 'samples': 2804160, 'steps': 14604, 'loss/train': 1.5187530517578125}} 11/06/2021 23:15:47 - INFO - __main__ - Step 14609: {'lr': 0.0004911001034336633, 'samples': 2804928, 'steps': 14608, 'loss/train': 1.6968247890472412}} 11/06/2021 23:15:49 - INFO - __main__ - Step 14613: {'lr': 0.0004910944891690956, 'samples': 2805696, 'steps': 14612, 'loss/train': 1.5665946006774902}} 11/06/2021 23:15:51 - INFO - __main__ - Step 14617: {'lr': 0.0004910888731663928, 'samples': 2806464, 'steps': 14616, 'loss/train': 1.519328236579895}}} 11/06/2021 23:15:53 - INFO - __main__ - Step 14621: {'lr': 0.0004910832554255951, 'samples': 2807232, 'steps': 14620, 'loss/train': 1.7540532350540161}} 11/06/2021 23:15:53 - INFO - __main__ - Step 14621: {'lr': 0.0004910832554255951, 'samples': 2807232, 'steps': 14620, 'loss/train': 1.7540532350540161}} 11/06/2021 23:15:57 - INFO - __main__ - Step 14629: {'lr': 0.0004910720147298772, 'samples': 2808768, 'steps': 14628, 'loss/train': 1.5132771730422974}} 11/06/2021 23:15:59 - INFO - __main__ - Step 14633: {'lr': 0.0004910663917750382, 'samples': 2809536, 'steps': 14632, 'loss/train': 1.8392850160598755}} 11/06/2021 23:16:01 - INFO - __main__ - Step 14638: {'lr': 0.0004910593606375261, 'samples': 2810496, 'steps': 14637, 'loss/train': 2.147937297821045}}} 11/06/2021 23:16:03 - INFO - __main__ - Step 14642: {'lr': 0.0004910537337723954, 'samples': 2811264, 'steps': 14641, 'loss/train': 1.8136534690856934}} 11/06/2021 23:16:05 - INFO - __main__ - Step 14646: {'lr': 0.0004910481051694231, 'samples': 2812032, 'steps': 14645, 'loss/train': 1.9416497945785522}} 11/06/2021 23:16:05 - INFO - __main__ - Step 14646: {'lr': 0.0004910481051694231, 'samples': 2812032, 'steps': 14645, 'loss/train': 1.9416497945785522}} 11/06/2021 23:16:09 - INFO - __main__ - Step 14653: {'lr': 0.0004910382509326627, 'samples': 2813376, 'steps': 14652, 'loss/train': 2.0839693546295166}} 11/06/2021 23:16:11 - INFO - __main__ - Step 14658: {'lr': 0.0004910312089338634, 'samples': 2814336, 'steps': 14657, 'loss/train': 1.369263768196106}}} 11/06/2021 23:16:13 - INFO - __main__ - Step 14663: {'lr': 0.0004910241642199406, 'samples': 2815296, 'steps': 14662, 'loss/train': 1.4840021133422852}} 11/06/2021 23:16:16 - INFO - __main__ - Step 14667: {'lr': 0.0004910185264939667, 'samples': 2816064, 'steps': 14666, 'loss/train': 1.7304264307022095}} 11/06/2021 23:16:16 - INFO - __main__ - Step 14667: {'lr': 0.0004910185264939667, 'samples': 2816064, 'steps': 14666, 'loss/train': 1.7304264307022095}} 11/06/2021 23:16:19 - INFO - __main__ - Step 14674: {'lr': 0.0004910086562924663, 'samples': 2817408, 'steps': 14673, 'loss/train': 1.5062963962554932}} 11/06/2021 23:16:21 - INFO - __main__ - Step 14679: {'lr': 0.0004910016028906813, 'samples': 2818368, 'steps': 14678, 'loss/train': 2.185955047607422}}} 11/06/2021 23:16:21 - INFO - __main__ - Step 14679: {'lr': 0.0004910016028906813, 'samples': 2818368, 'steps': 14678, 'loss/train': 2.185955047607422}}} 11/06/2021 23:16:21 - INFO - __main__ - Step 14679: {'lr': 0.0004910016028906813, 'samples': 2818368, 'steps': 14678, 'loss/train': 2.185955047607422}}} 11/06/2021 23:16:26 - INFO - __main__ - Step 14690: {'lr': 0.0004909860758508052, 'samples': 2820480, 'steps': 14689, 'loss/train': 1.798073172569275}}} 11/06/2021 23:16:29 - INFO - __main__ - Step 14695: {'lr': 0.0004909790137619719, 'samples': 2821440, 'steps': 14694, 'loss/train': 1.473230242729187}}} 11/06/2021 23:16:31 - INFO - __main__ - Step 14700: {'lr': 0.0004909719489586029, 'samples': 2822400, 'steps': 14699, 'loss/train': 1.9752013683319092}} 11/06/2021 23:16:31 - INFO - __main__ - Step 14700: {'lr': 0.0004909719489586029, 'samples': 2822400, 'steps': 14699, 'loss/train': 1.9752013683319092}} 11/06/2021 23:16:31 - INFO - __main__ - Step 14700: {'lr': 0.0004909719489586029, 'samples': 2822400, 'steps': 14699, 'loss/train': 1.9752013683319092}} 11/06/2021 23:16:37 - INFO - __main__ - Step 14711: {'lr': 0.0004909563968364179, 'samples': 2824512, 'steps': 14710, 'loss/train': 1.516960859298706}}} 11/06/2021 23:16:40 - INFO - __main__ - Step 14717: {'lr': 0.0004909479083234936, 'samples': 2825664, 'steps': 14716, 'loss/train': 1.4705344438552856}} 11/06/2021 23:16:40 - INFO - __main__ - Step 14717: {'lr': 0.0004909479083234936, 'samples': 2825664, 'steps': 14716, 'loss/train': 1.4705344438552856}} 11/06/2021 23:16:42 - INFO - __main__ - Step 14723: {'lr': 0.0004909394159021425, 'samples': 2826816, 'steps': 14722, 'loss/train': 1.6616466045379639}} 11/06/2021 23:16:44 - INFO - __main__ - Step 14727: {'lr': 0.0004909337521166282, 'samples': 2827584, 'steps': 14726, 'loss/train': 1.4586058855056763}} 11/06/2021 23:16:47 - INFO - __main__ - Step 14733: {'lr': 0.0004909252531815388, 'samples': 2828736, 'steps': 14732, 'loss/train': 1.6438379287719727}} 11/06/2021 23:16:47 - INFO - __main__ - Step 14733: {'lr': 0.0004909252531815388, 'samples': 2828736, 'steps': 14732, 'loss/train': 1.6438379287719727}} 11/06/2021 23:16:51 - INFO - __main__ - Step 14740: {'lr': 0.0004909153328179248, 'samples': 2830080, 'steps': 14739, 'loss/train': 1.9582070112228394}} 11/06/2021 23:16:53 - INFO - __main__ - Step 14744: {'lr': 0.0004909096616505426, 'samples': 2830848, 'steps': 14743, 'loss/train': 1.605130672454834}}} 11/06/2021 23:16:55 - INFO - __main__ - Step 14749: {'lr': 0.0004909025702489407, 'samples': 2831808, 'steps': 14748, 'loss/train': 0.686924934387207}}} 11/06/2021 23:16:57 - INFO - __main__ - Step 14753: {'lr': 0.0004908968951738098, 'samples': 2832576, 'steps': 14752, 'loss/train': 2.133230209350586}}} 11/06/2021 23:16:57 - INFO - __main__ - Step 14753: {'lr': 0.0004908968951738098, 'samples': 2832576, 'steps': 14752, 'loss/train': 2.133230209350586}}} 11/06/2021 23:17:01 - INFO - __main__ - Step 14760: {'lr': 0.0004908869596133948, 'samples': 2833920, 'steps': 14759, 'loss/train': 2.303706407546997}}} 11/06/2021 23:17:03 - INFO - __main__ - Step 14765: {'lr': 0.0004908798595283159, 'samples': 2834880, 'steps': 14764, 'loss/train': 2.077310562133789}}} 11/06/2021 23:17:03 - INFO - __main__ - Step 14765: {'lr': 0.0004908798595283159, 'samples': 2834880, 'steps': 14764, 'loss/train': 2.077310562133789}}} 11/06/2021 23:17:07 - INFO - __main__ - Step 14773: {'lr': 0.0004908684937483119, 'samples': 2836416, 'steps': 14772, 'loss/train': 1.799191951751709}}} 11/06/2021 23:17:09 - INFO - __main__ - Step 14777: {'lr': 0.0004908628082535303, 'samples': 2837184, 'steps': 14776, 'loss/train': 1.8874455690383911}} 11/06/2021 23:17:11 - INFO - __main__ - Step 14781: {'lr': 0.0004908571210222837, 'samples': 2837952, 'steps': 14780, 'loss/train': 1.470587968826294}}} 11/06/2021 23:17:11 - INFO - __main__ - Step 14781: {'lr': 0.0004908571210222837, 'samples': 2837952, 'steps': 14780, 'loss/train': 1.470587968826294}}} 11/06/2021 23:17:15 - INFO - __main__ - Step 14789: {'lr': 0.0004908457413505596, 'samples': 2839488, 'steps': 14788, 'loss/train': 1.7249447107315063}} 11/06/2021 23:17:17 - INFO - __main__ - Step 14793: {'lr': 0.000490840048910164, 'samples': 2840256, 'steps': 14792, 'loss/train': 1.8218657970428467}}} 11/06/2021 23:17:19 - INFO - __main__ - Step 14797: {'lr': 0.0004908343547334674, 'samples': 2841024, 'steps': 14796, 'loss/train': 1.586082100868225}}} 11/06/2021 23:17:21 - INFO - __main__ - Step 14802: {'lr': 0.0004908272345709861, 'samples': 2841984, 'steps': 14801, 'loss/train': 1.824029564857483}}} 11/06/2021 23:17:21 - INFO - __main__ - Step 14802: {'lr': 0.0004908272345709861, 'samples': 2841984, 'steps': 14801, 'loss/train': 1.824029564857483}}} 11/06/2021 23:17:25 - INFO - __main__ - Step 14810: {'lr': 0.0004908158366683714, 'samples': 2843520, 'steps': 14809, 'loss/train': 2.220637798309326}}} 11/06/2021 23:17:27 - INFO - __main__ - Step 14814: {'lr': 0.000490810135112854, 'samples': 2844288, 'steps': 14813, 'loss/train': 1.816659688949585}}}} 11/06/2021 23:17:29 - INFO - __main__ - Step 14818: {'lr': 0.0004908044318212512, 'samples': 2845056, 'steps': 14817, 'loss/train': 1.5969890356063843}} 11/06/2021 23:17:31 - INFO - __main__ - Step 14823: {'lr': 0.0004907973002654404, 'samples': 2846016, 'steps': 14822, 'loss/train': 1.708099603652954}}} 11/06/2021 23:17:33 - INFO - __main__ - Step 14827: {'lr': 0.0004907915930677961, 'samples': 2846784, 'steps': 14826, 'loss/train': 1.3716732263565063}} 11/06/2021 23:17:33 - INFO - __main__ - Step 14827: {'lr': 0.0004907915930677961, 'samples': 2846784, 'steps': 14826, 'loss/train': 1.3716732263565063}} 11/06/2021 23:17:37 - INFO - __main__ - Step 14834: {'lr': 0.0004907816012948098, 'samples': 2848128, 'steps': 14833, 'loss/train': 2.050675630569458}}} 11/06/2021 23:17:39 - INFO - __main__ - Step 14839: {'lr': 0.0004907744610593181, 'samples': 2849088, 'steps': 14838, 'loss/train': 1.936640739440918}}} 11/06/2021 23:17:41 - INFO - __main__ - Step 14844: {'lr': 0.000490767318111595, 'samples': 2850048, 'steps': 14843, 'loss/train': 1.6642917394638062}}} 11/06/2021 23:17:43 - INFO - __main__ - Step 14848: {'lr': 0.000490761601800664, 'samples': 2850816, 'steps': 14847, 'loss/train': 1.7513176202774048}}} 11/06/2021 23:17:45 - INFO - __main__ - Step 14852: {'lr': 0.0004907558837539976, 'samples': 2851584, 'steps': 14851, 'loss/train': 1.7864621877670288}} 11/06/2021 23:17:45 - INFO - __main__ - Step 14852: {'lr': 0.0004907558837539976, 'samples': 2851584, 'steps': 14851, 'loss/train': 1.7864621877670288}} 11/06/2021 23:17:49 - INFO - __main__ - Step 14859: {'lr': 0.0004907458729958422, 'samples': 2852928, 'steps': 14858, 'loss/train': 1.4801957607269287}} 11/06/2021 23:17:51 - INFO - __main__ - Step 14864: {'lr': 0.0004907387191999984, 'samples': 2853888, 'steps': 14863, 'loss/train': 1.4284332990646362}} 11/06/2021 23:17:51 - INFO - __main__ - Step 14864: {'lr': 0.0004907387191999984, 'samples': 2853888, 'steps': 14863, 'loss/train': 1.4284332990646362}} 11/06/2021 23:17:56 - INFO - __main__ - Step 14872: {'lr': 0.0004907272674860779, 'samples': 2855424, 'steps': 14871, 'loss/train': 2.0826239585876465}} 11/06/2021 23:17:57 - INFO - __main__ - Step 14876: {'lr': 0.0004907215390258652, 'samples': 2856192, 'steps': 14875, 'loss/train': 1.654037356376648}}} 11/06/2021 23:17:59 - INFO - __main__ - Step 14880: {'lr': 0.0004907158088302059, 'samples': 2856960, 'steps': 14879, 'loss/train': 1.6989234685897827}} 11/06/2021 23:18:02 - INFO - __main__ - Step 14885: {'lr': 0.0004907086436452231, 'samples': 2857920, 'steps': 14884, 'loss/train': 1.5355026721954346}} 11/06/2021 23:18:04 - INFO - __main__ - Step 14889: {'lr': 0.0004907029095449602, 'samples': 2858688, 'steps': 14888, 'loss/train': 1.847086787223816}}} 11/06/2021 23:18:06 - INFO - __main__ - Step 14893: {'lr': 0.0004906971737093849, 'samples': 2859456, 'steps': 14892, 'loss/train': 1.9280325174331665}} 11/06/2021 23:18:08 - INFO - __main__ - Step 14897: {'lr': 0.0004906914361385387, 'samples': 2860224, 'steps': 14896, 'loss/train': 1.7625758647918701}} 11/06/2021 23:18:09 - INFO - __main__ - Step 14901: {'lr': 0.000490685696832463, 'samples': 2860992, 'steps': 14900, 'loss/train': 1.9356876611709595}}} 11/06/2021 23:18:11 - INFO - __main__ - Step 14905: {'lr': 0.0004906799557911992, 'samples': 2861760, 'steps': 14904, 'loss/train': 1.366396427154541}}} 11/06/2021 23:18:14 - INFO - __main__ - Step 14910: {'lr': 0.0004906727770495739, 'samples': 2862720, 'steps': 14909, 'loss/train': 1.645467758178711}}} 11/06/2021 23:18:14 - INFO - __main__ - Step 14910: {'lr': 0.0004906727770495739, 'samples': 2862720, 'steps': 14909, 'loss/train': 1.645467758178711}}} 11/06/2021 23:18:17 - INFO - __main__ - Step 14917: {'lr': 0.0004906627222566924, 'samples': 2864064, 'steps': 14916, 'loss/train': 1.4498852491378784}} 11/06/2021 23:18:19 - INFO - __main__ - Step 14921: {'lr': 0.0004906569742750899, 'samples': 2864832, 'steps': 14920, 'loss/train': 1.7183939218521118}} 11/06/2021 23:18:22 - INFO - __main__ - Step 14926: {'lr': 0.0004906497868582743, 'samples': 2865792, 'steps': 14925, 'loss/train': 1.4507989883422852}} 11/06/2021 23:18:24 - INFO - __main__ - Step 14930: {'lr': 0.0004906440349730226, 'samples': 2866560, 'steps': 14929, 'loss/train': 1.1155771017074585}} 11/06/2021 23:18:24 - INFO - __main__ - Step 14930: {'lr': 0.0004906440349730226, 'samples': 2866560, 'steps': 14929, 'loss/train': 1.1155771017074585}} 11/06/2021 23:18:28 - INFO - __main__ - Step 14938: {'lr': 0.000490632525997897, 'samples': 2868096, 'steps': 14937, 'loss/train': 1.6703389883041382}}} 11/06/2021 23:18:29 - INFO - __main__ - Step 14942: {'lr': 0.0004906267689081063, 'samples': 2868864, 'steps': 14941, 'loss/train': 1.9198617935180664}} 11/06/2021 23:18:31 - INFO - __main__ - Step 14946: {'lr': 0.0004906210100835522, 'samples': 2869632, 'steps': 14945, 'loss/train': 1.7185678482055664}} 11/06/2021 23:18:34 - INFO - __main__ - Step 14950: {'lr': 0.0004906152495242763, 'samples': 2870400, 'steps': 14949, 'loss/train': 1.3722232580184937}} 11/06/2021 23:18:36 - INFO - __main__ - Step 14954: {'lr': 0.00049060948723032, 'samples': 2871168, 'steps': 14953, 'loss/train': 1.7364459037780762}7}} 11/06/2021 23:18:38 - INFO - __main__ - Step 14958: {'lr': 0.000490603723201725, 'samples': 2871936, 'steps': 14957, 'loss/train': 3.7174441814422607}}} 11/06/2021 23:18:39 - INFO - __main__ - Step 14962: {'lr': 0.0004905979574385328, 'samples': 2872704, 'steps': 14961, 'loss/train': 1.4080133438110352}} 11/06/2021 23:18:42 - INFO - __main__ - Step 14967: {'lr': 0.0004905907477953286, 'samples': 2873664, 'steps': 14966, 'loss/train': 1.8428308963775635}} 11/06/2021 23:18:44 - INFO - __main__ - Step 14972: {'lr': 0.0004905835354419625, 'samples': 2874624, 'steps': 14971, 'loss/train': 1.7392559051513672}} 11/06/2021 23:18:44 - INFO - __main__ - Step 14972: {'lr': 0.0004905835354419625, 'samples': 2874624, 'steps': 14971, 'loss/train': 1.7392559051513672}} 11/06/2021 23:18:48 - INFO - __main__ - Step 14980: {'lr': 0.0004905719900396426, 'samples': 2876160, 'steps': 14979, 'loss/train': 1.410271406173706}}} 11/06/2021 23:18:50 - INFO - __main__ - Step 14984: {'lr': 0.0004905662147369091, 'samples': 2876928, 'steps': 14983, 'loss/train': 2.0281472206115723}} 11/06/2021 23:18:50 - INFO - __main__ - Step 14984: {'lr': 0.0004905662147369091, 'samples': 2876928, 'steps': 14983, 'loss/train': 2.0281472206115723}} 11/06/2021 23:18:50 - INFO - __main__ - Step 14984: {'lr': 0.0004905662147369091, 'samples': 2876928, 'steps': 14983, 'loss/train': 2.0281472206115723}} 11/06/2021 23:18:55 - INFO - __main__ - Step 14995: {'lr': 0.000490550323711895, 'samples': 2879040, 'steps': 14994, 'loss/train': 1.996849536895752}3}} 11/06/2021 23:18:57 - INFO - __main__ - Step 14999: {'lr': 0.000490544541905651, 'samples': 2879808, 'steps': 14998, 'loss/train': 1.6460684537887573}}} 11/06/2021 23:18:57 - INFO - __main__ - Step 14999: {'lr': 0.000490544541905651, 'samples': 2879808, 'steps': 14998, 'loss/train': 1.6460684537887573}}} 11/06/2021 23:18:57 - INFO - __main__ - Step 14999: {'lr': 0.000490544541905651, 'samples': 2879808, 'steps': 14998, 'loss/train': 1.6460684537887573}}} 11/06/2021 23:18:57 - INFO - __main__ - Step 14999: {'lr': 0.000490544541905651, 'samples': 2879808, 'steps': 14998, 'loss/train': 1.6460684537887573}}} huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks... To disable this warning, you can either: - Avoid using `tokenizers` before the fork if possible - Explicitly set the environment variable TOKENIZERS_PARALLELISM=(true | false) huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks... To disable this warning, you can either: - Avoid using `tokenizers` before the fork if possible - Explicitly set the environment variable TOKENIZERS_PARALLELISM=(true | false) huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks... To disable this warning, you can either: - Avoid using `tokenizers` before the fork if possible - Explicitly set the environment variable TOKENIZERS_PARALLELISM=(true | false) huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks... To disable this warning, you can either: - Avoid using `tokenizers` before the fork if possible - Explicitly set the environment variable TOKENIZERS_PARALLELISM=(true | false) huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks... To disable this warning, you can either: - Avoid using `tokenizers` before the fork if possible - Explicitly set the environment variable TOKENIZERS_PARALLELISM=(true | false) huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks... To disable this warning, you can either: - Avoid using `tokenizers` before the fork if possible - Explicitly set the environment variable TOKENIZERS_PARALLELISM=(true | false) huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks... To disable this warning, you can either: - Avoid using `tokenizers` before the fork if possible - Explicitly set the environment variable TOKENIZERS_PARALLELISM=(true | false) huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks... To disable this warning, you can either: - Avoid using `tokenizers` before the fork if possible - Explicitly set the environment variable TOKENIZERS_PARALLELISM=(true | false) 11/06/2021 23:18:57 - INFO - __main__ - Step 14999: {'lr': 0.000490544541905651, 'samples': 2879808, 'steps': 14998, 'loss/train': 1.6460684537887573}}} 11/06/2021 23:18:57 - INFO - __main__ - Step 14999: {'lr': 0.000490544541905651, 'samples': 2879808, 'steps': 14998, 'loss/train': 1.6460684537887573}}} 11/06/2021 23:18:57 - INFO - __main__ - Step 14999: {'lr': 0.000490544541905651, 'samples': 2879808, 'steps': 14998, 'loss/train': 1.6460684537887573}}} 11/06/2021 23:18:57 - INFO - __main__ - Step 14999: {'lr': 0.000490544541905651, 'samples': 2879808, 'steps': 14998, 'loss/train': 1.6460684537887573}}} 11/06/2021 23:18:57 - INFO - __main__ - Step 14999: {'lr': 0.000490544541905651, 'samples': 2879808, 'steps': 14998, 'loss/train': 1.6460684537887573}}} 11/06/2021 23:18:57 - INFO - __main__ - Step 14999: {'lr': 0.000490544541905651, 'samples': 2879808, 'steps': 14998, 'loss/train': 1.6460684537887573}}} 11/06/2021 23:18:57 - INFO - __main__ - Step 14999: {'lr': 0.000490544541905651, 'samples': 2879808, 'steps': 14998, 'loss/train': 1.6460684537887573}}} 11/06/2021 23:18:57 - INFO - __main__ - Step 14999: {'lr': 0.000490544541905651, 'samples': 2879808, 'steps': 14998, 'loss/train': 1.6460684537887573}}} 11/06/2021 23:18:57 - INFO - __main__ - Step 14999: {'lr': 0.000490544541905651, 'samples': 2879808, 'steps': 14998, 'loss/train': 1.6460684537887573}}} remote: ---------------------------------------------------------- pos ples': 2879808, 'steps': 14998, 'loss/train': 1.6460684537887573}}} remote: ---------------------------------------------------------- pos ples': 2879808, 'steps': 14998, 'loss/train': 1.6460684537887573}}} huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks... To disable this warning, you can either: - Avoid using `tokenizers` before the fork if possible - Explicitly set the environment variable TOKENIZERS_PARALLELISM=(true | false) huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks... To disable this warning, you can either: - Avoid using `tokenizers` before the fork if possible 11/06/2021 23:22:44 - INFO - __main__ - Step 15003: {'lr': 0.0004905387583652363, 'samples': 2880576, 'steps': 15002, 'loss/train': 1.7260355949401855}} 11/06/2021 23:22:44 - INFO - __main__ - Step 15003: {'lr': 0.0004905387583652363, 'samples': 2880576, 'steps': 15002, 'loss/train': 1.7260355949401855}} 11/06/2021 23:22:47 - INFO - __main__ - Step 15009: {'lr': 0.0004905300798031359, 'samples': 2881728, 'steps': 15008, 'loss/train': 2.0110666751861572}} 11/06/2021 23:22:50 - INFO - __main__ - Step 15015: {'lr': 0.0004905213973393863, 'samples': 2882880, 'steps': 15014, 'loss/train': 2.3514349460601807}} 11/06/2021 23:22:52 - INFO - __main__ - Step 15021: {'lr': 0.0004905127109741284, 'samples': 2884032, 'steps': 15020, 'loss/train': 1.9690922498703003}} 11/06/2021 23:22:52 - INFO - __main__ - Step 15021: {'lr': 0.0004905127109741284, 'samples': 2884032, 'steps': 15020, 'loss/train': 1.9690922498703003}} 11/06/2021 23:22:55 - INFO - __main__ - Step 15027: {'lr': 0.0004905040207075032, 'samples': 2885184, 'steps': 15026, 'loss/train': 1.7792967557907104}} 11/06/2021 23:22:55 - INFO - __main__ - Step 15027: {'lr': 0.0004905040207075032, 'samples': 2885184, 'steps': 15026, 'loss/train': 1.7792967557907104}} 11/06/2021 23:22:59 - INFO - __main__ - Step 15033: {'lr': 0.0004904953265396515, 'samples': 2886336, 'steps': 15032, 'loss/train': 1.9477510452270508}} 11/06/2021 23:23:02 - INFO - __main__ - Step 15039: {'lr': 0.0004904866284707144, 'samples': 2887488, 'steps': 15038, 'loss/train': 1.706793189048767}}} 11/06/2021 23:23:02 - INFO - __main__ - Step 15039: {'lr': 0.0004904866284707144, 'samples': 2887488, 'steps': 15038, 'loss/train': 1.706793189048767}}} 11/06/2021 23:23:06 - INFO - __main__ - Step 15048: {'lr': 0.0004904735740530825, 'samples': 2889216, 'steps': 15047, 'loss/train': 1.7291828393936157}} 11/06/2021 23:23:08 - INFO - __main__ - Step 15052: {'lr': 0.0004904677692724664, 'samples': 2889984, 'steps': 15051, 'loss/train': 1.3407013416290283}} 11/06/2021 23:23:11 - INFO - __main__ - Step 15057: {'lr': 0.0004904605108588023, 'samples': 2890944, 'steps': 15056, 'loss/train': 1.5444527864456177}} 11/06/2021 23:23:13 - INFO - __main__ - Step 15062: {'lr': 0.0004904532497364432, 'samples': 2891904, 'steps': 15061, 'loss/train': 1.4568783044815063}} 11/06/2021 23:23:13 - INFO - __main__ - Step 15062: {'lr': 0.0004904532497364432, 'samples': 2891904, 'steps': 15061, 'loss/train': 1.4568783044815063}} 11/06/2021 23:23:16 - INFO - __main__ - Step 15069: {'lr': 0.0004904430796146889, 'samples': 2893248, 'steps': 15068, 'loss/train': 1.6384254693984985}} 11/06/2021 23:23:18 - INFO - __main__ - Step 15073: {'lr': 0.0004904372657330504, 'samples': 2894016, 'steps': 15072, 'loss/train': 1.440606713294983}}} 11/06/2021 23:23:21 - INFO - __main__ - Step 15078: {'lr': 0.0004904299959434175, 'samples': 2894976, 'steps': 15077, 'loss/train': 1.727868676185608}}} 11/06/2021 23:23:21 - INFO - __main__ - Step 15078: {'lr': 0.0004904299959434175, 'samples': 2894976, 'steps': 15077, 'loss/train': 1.727868676185608}}} 11/06/2021 23:23:24 - INFO - __main__ - Step 15085: {'lr': 0.00049041981368792, 'samples': 2896320, 'steps': 15084, 'loss/train': 1.955782175064087}8}}} 11/06/2021 23:23:26 - INFO - __main__ - Step 15089: {'lr': 0.0004904139928729445, 'samples': 2897088, 'steps': 15088, 'loss/train': 1.4871602058410645}} 11/06/2021 23:23:29 - INFO - __main__ - Step 15094: {'lr': 0.0004904067144168763, 'samples': 2898048, 'steps': 15093, 'loss/train': 1.7554247379302979}} 11/06/2021 23:23:29 - INFO - __main__ - Step 15094: {'lr': 0.0004904067144168763, 'samples': 2898048, 'steps': 15093, 'loss/train': 1.7554247379302979}} 11/06/2021 23:23:33 - INFO - __main__ - Step 15102: {'lr': 0.0004903950632543766, 'samples': 2899584, 'steps': 15101, 'loss/train': 1.921471357345581}}} 11/06/2021 23:23:34 - INFO - __main__ - Step 15106: {'lr': 0.0004903892350734663, 'samples': 2900352, 'steps': 15105, 'loss/train': 1.4163860082626343}} 11/06/2021 23:23:37 - INFO - __main__ - Step 15110: {'lr': 0.0004903834051595052, 'samples': 2901120, 'steps': 15109, 'loss/train': 1.6631858348846436}} 11/06/2021 23:23:39 - INFO - __main__ - Step 15115: {'lr': 0.0004903761153300149, 'samples': 2902080, 'steps': 15114, 'loss/train': 1.5683375597000122}} 11/06/2021 23:23:41 - INFO - __main__ - Step 15119: {'lr': 0.000490370281516843, 'samples': 2902848, 'steps': 15118, 'loss/train': 1.9191677570343018}}} 11/06/2021 23:23:41 - INFO - __main__ - Step 15119: {'lr': 0.000490370281516843, 'samples': 2902848, 'steps': 15118, 'loss/train': 1.9191677570343018}}} 11/06/2021 23:23:44 - INFO - __main__ - Step 15126: {'lr': 0.0004903600681739926, 'samples': 2904192, 'steps': 15125, 'loss/train': 1.5869134664535522}} 11/06/2021 23:23:47 - INFO - __main__ - Step 15131: {'lr': 0.0004903527696800102, 'samples': 2905152, 'steps': 15130, 'loss/train': 1.154365062713623}}} 11/06/2021 23:23:49 - INFO - __main__ - Step 15136: {'lr': 0.0004903454684785465, 'samples': 2906112, 'steps': 15135, 'loss/train': 1.4386088848114014}} 11/06/2021 23:23:49 - INFO - __main__ - Step 15136: {'lr': 0.0004903454684785465, 'samples': 2906112, 'steps': 15135, 'loss/train': 1.4386088848114014}} 11/06/2021 23:23:49 - INFO - __main__ - Step 15136: {'lr': 0.0004903454684785465, 'samples': 2906112, 'steps': 15135, 'loss/train': 1.4386088848114014}} 11/06/2021 23:23:54 - INFO - __main__ - Step 15146: {'lr': 0.0004903308579535045, 'samples': 2908032, 'steps': 15145, 'loss/train': 1.6294296979904175}} 11/06/2021 23:23:57 - INFO - __main__ - Step 15151: {'lr': 0.0004903235486300908, 'samples': 2908992, 'steps': 15150, 'loss/train': 1.674008846282959}}} 11/06/2021 23:23:59 - INFO - __main__ - Step 15156: {'lr': 0.000490316236599525, 'samples': 2909952, 'steps': 15155, 'loss/train': 1.7094593048095703}}} 11/06/2021 23:24:01 - INFO - __main__ - Step 15160: {'lr': 0.0004903103850259781, 'samples': 2910720, 'steps': 15159, 'loss/train': 1.470730185508728}}} 11/06/2021 23:24:01 - INFO - __main__ - Step 15160: {'lr': 0.0004903103850259781, 'samples': 2910720, 'steps': 15159, 'loss/train': 1.470730185508728}}} 11/06/2021 23:24:04 - INFO - __main__ - Step 15167: {'lr': 0.0004903001406035109, 'samples': 2912064, 'steps': 15166, 'loss/train': 1.7047207355499268}} 11/06/2021 23:24:07 - INFO - __main__ - Step 15172: {'lr': 0.0004902928199106121, 'samples': 2913024, 'steps': 15171, 'loss/train': 1.4685401916503906}} 11/06/2021 23:24:07 - INFO - __main__ - Step 15172: {'lr': 0.0004902928199106121, 'samples': 2913024, 'steps': 15171, 'loss/train': 1.4685401916503906}} 11/06/2021 23:24:11 - INFO - __main__ - Step 15180: {'lr': 0.0004902811011718521, 'samples': 2914560, 'steps': 15179, 'loss/train': 1.366045355796814}}} 11/06/2021 23:24:13 - INFO - __main__ - Step 15184: {'lr': 0.000490275239204044, 'samples': 2915328, 'steps': 15183, 'loss/train': 1.9920772314071655}}} 11/06/2021 23:24:15 - INFO - __main__ - Step 15188: {'lr': 0.0004902693755040069, 'samples': 2916096, 'steps': 15187, 'loss/train': 1.1258271932601929}} 11/06/2021 23:24:17 - INFO - __main__ - Step 15193: {'lr': 0.0004902620434430778, 'samples': 2917056, 'steps': 15192, 'loss/train': 0.23766328394412994} 11/06/2021 23:24:17 - INFO - __main__ - Step 15193: {'lr': 0.0004902620434430778, 'samples': 2917056, 'steps': 15192, 'loss/train': 0.23766328394412994} 11/06/2021 23:24:21 - INFO - __main__ - Step 15199: {'lr': 0.000490253241397444, 'samples': 2918208, 'steps': 15198, 'loss/train': 1.277166724205017}94} 11/06/2021 23:24:23 - INFO - __main__ - Step 15204: {'lr': 0.0004902459033824137, 'samples': 2919168, 'steps': 15203, 'loss/train': 1.7561739683151245}} 11/06/2021 23:24:26 - INFO - __main__ - Step 15208: {'lr': 0.0004902400310218657, 'samples': 2919936, 'steps': 15207, 'loss/train': 1.5054603815078735}} 11/06/2021 23:24:27 - INFO - __main__ - Step 15212: {'lr': 0.0004902341569293425, 'samples': 2920704, 'steps': 15211, 'loss/train': 1.7645387649536133}} 11/06/2021 23:24:29 - INFO - __main__ - Step 15216: {'lr': 0.0004902282811048864, 'samples': 2921472, 'steps': 15215, 'loss/train': 1.9115432500839233}} 11/06/2021 23:24:32 - INFO - __main__ - Step 15221: {'lr': 0.0004902209338888503, 'samples': 2922432, 'steps': 15220, 'loss/train': 2.0752274990081787}} 11/06/2021 23:24:32 - INFO - __main__ - Step 15221: {'lr': 0.0004902209338888503, 'samples': 2922432, 'steps': 15220, 'loss/train': 2.0752274990081787}} 11/06/2021 23:24:35 - INFO - __main__ - Step 15228: {'lr': 0.0004902106432403448, 'samples': 2923776, 'steps': 15227, 'loss/train': 1.354243516921997}}} 11/06/2021 23:24:37 - INFO - __main__ - Step 15232: {'lr': 0.0004902047604885811, 'samples': 2924544, 'steps': 15231, 'loss/train': 1.9354302883148193}} 11/06/2021 23:24:39 - INFO - __main__ - Step 15237: {'lr': 0.0004901974046136488, 'samples': 2925504, 'steps': 15236, 'loss/train': 1.4140377044677734}} 11/06/2021 23:24:39 - INFO - __main__ - Step 15237: {'lr': 0.0004901974046136488, 'samples': 2925504, 'steps': 15236, 'loss/train': 1.4140377044677734}} 11/06/2021 23:24:43 - INFO - __main__ - Step 15245: {'lr': 0.0004901856295858708, 'samples': 2927040, 'steps': 15244, 'loss/train': 1.7202751636505127}} 11/06/2021 23:24:46 - INFO - __main__ - Step 15249: {'lr': 0.0004901797394745861, 'samples': 2927808, 'steps': 15248, 'loss/train': 1.667556643486023}}} 11/06/2021 23:24:47 - INFO - __main__ - Step 15253: {'lr': 0.000490173847631761, 'samples': 2928576, 'steps': 15252, 'loss/train': 1.8147984743118286}}} 11/06/2021 23:24:49 - INFO - __main__ - Step 15258: {'lr': 0.0004901664803933153, 'samples': 2929536, 'steps': 15257, 'loss/train': 1.265826940536499}}} 11/06/2021 23:24:52 - INFO - __main__ - Step 15262: {'lr': 0.0004901605846546791, 'samples': 2930304, 'steps': 15261, 'loss/train': 1.0735132694244385}} 11/06/2021 23:24:52 - INFO - __main__ - Step 15262: {'lr': 0.0004901605846546791, 'samples': 2930304, 'steps': 15261, 'loss/train': 1.0735132694244385}} 11/06/2021 23:24:55 - INFO - __main__ - Step 15269: {'lr': 0.0004901502629459042, 'samples': 2931648, 'steps': 15268, 'loss/train': 0.25619640946388245} 11/06/2021 23:24:57 - INFO - __main__ - Step 15273: {'lr': 0.0004901443624460136, 'samples': 2932416, 'steps': 15272, 'loss/train': 1.692050576210022}5} 11/06/2021 23:25:00 - INFO - __main__ - Step 15278: {'lr': 0.0004901369843865351, 'samples': 2933376, 'steps': 15277, 'loss/train': 1.9054416418075562}} 11/06/2021 23:25:02 - INFO - __main__ - Step 15282: {'lr': 0.0004901310799913121, 'samples': 2934144, 'steps': 15281, 'loss/train': 1.4008394479751587}} 11/06/2021 23:25:03 - INFO - __main__ - Step 15286: {'lr': 0.000490125173864899, 'samples': 2934912, 'steps': 15285, 'loss/train': 1.5084561109542847}}} 11/06/2021 23:25:06 - INFO - __main__ - Step 15290: {'lr': 0.000490119266007339, 'samples': 2935680, 'steps': 15289, 'loss/train': 1.3966121673583984}}} 11/06/2021 23:25:06 - INFO - __main__ - Step 15290: {'lr': 0.000490119266007339, 'samples': 2935680, 'steps': 15289, 'loss/train': 1.3966121673583984}}} 11/06/2021 23:25:09 - INFO - __main__ - Step 15295: {'lr': 0.0004901118787510281, 'samples': 2936640, 'steps': 15294, 'loss/train': 3.1649906635284424}} 11/06/2021 23:25:12 - INFO - __main__ - Step 15301: {'lr': 0.0004901030104731691, 'samples': 2937792, 'steps': 15300, 'loss/train': 2.0078108310699463}} 11/06/2021 23:25:14 - INFO - __main__ - Step 15305: {'lr': 0.0004900970961241866, 'samples': 2938560, 'steps': 15304, 'loss/train': 1.3544918298721313}} 11/06/2021 23:25:16 - INFO - __main__ - Step 15309: {'lr': 0.0004900911800442593, 'samples': 2939328, 'steps': 15308, 'loss/train': 1.6788253784179688}} 11/06/2021 23:25:16 - INFO - __main__ - Step 15309: {'lr': 0.0004900911800442593, 'samples': 2939328, 'steps': 15308, 'loss/train': 1.6788253784179688}} 11/06/2021 23:25:19 - INFO - __main__ - Step 15316: {'lr': 0.0004900808227394293, 'samples': 2940672, 'steps': 15315, 'loss/train': 1.51724112033844}8}} 11/06/2021 23:25:22 - INFO - __main__ - Step 15321: {'lr': 0.0004900734214192358, 'samples': 2941632, 'steps': 15320, 'loss/train': 1.9147781133651733}} 11/06/2021 23:25:22 - INFO - __main__ - Step 15321: {'lr': 0.0004900734214192358, 'samples': 2941632, 'steps': 15320, 'loss/train': 1.9147781133651733}} 11/06/2021 23:25:25 - INFO - __main__ - Step 15328: {'lr': 0.0004900630550277018, 'samples': 2942976, 'steps': 15327, 'loss/train': 2.0773236751556396}} 11/06/2021 23:25:28 - INFO - __main__ - Step 15333: {'lr': 0.0004900556472172457, 'samples': 2943936, 'steps': 15332, 'loss/train': 2.6426515579223633}} 11/06/2021 23:25:28 - INFO - __main__ - Step 15333: {'lr': 0.0004900556472172457, 'samples': 2943936, 'steps': 15332, 'loss/train': 2.6426515579223633}} 11/06/2021 23:25:31 - INFO - __main__ - Step 15340: {'lr': 0.0004900452717396803, 'samples': 2945280, 'steps': 15339, 'loss/train': 1.5828317403793335}} 11/06/2021 23:25:33 - INFO - __main__ - Step 15344: {'lr': 0.0004900393405158073, 'samples': 2946048, 'steps': 15343, 'loss/train': 2.0135602951049805}} 11/06/2021 23:25:36 - INFO - __main__ - Step 15350: {'lr': 0.0004900304404352704, 'samples': 2947200, 'steps': 15349, 'loss/train': 2.0036559104919434}} 11/06/2021 23:25:36 - INFO - __main__ - Step 15350: {'lr': 0.0004900304404352704, 'samples': 2947200, 'steps': 15349, 'loss/train': 2.0036559104919434}} 11/06/2021 23:25:40 - INFO - __main__ - Step 15357: {'lr': 0.00049002005208698, 'samples': 2948544, 'steps': 15356, 'loss/train': 1.4935647249221802}4}} 11/06/2021 23:25:41 - INFO - __main__ - Step 15361: {'lr': 0.0004900141135086569, 'samples': 2949312, 'steps': 15360, 'loss/train': 1.6635363101959229}} 11/06/2021 23:25:44 - INFO - __main__ - Step 15365: {'lr': 0.0004900081731999872, 'samples': 2950080, 'steps': 15364, 'loss/train': 1.6182117462158203}} 11/06/2021 23:25:46 - INFO - __main__ - Step 15370: {'lr': 0.0004900007453809157, 'samples': 2951040, 'steps': 15369, 'loss/train': 1.5732970237731934}} 11/06/2021 23:25:48 - INFO - __main__ - Step 15374: {'lr': 0.000489994801179123, 'samples': 2951808, 'steps': 15373, 'loss/train': 1.610868215560913}4}} 11/06/2021 23:25:48 - INFO - __main__ - Step 15374: {'lr': 0.000489994801179123, 'samples': 2951808, 'steps': 15373, 'loss/train': 1.610868215560913}4}} 11/06/2021 23:25:51 - INFO - __main__ - Step 15380: {'lr': 0.0004899858816323089, 'samples': 2952960, 'steps': 15379, 'loss/train': 1.6182414293289185}} 11/06/2021 23:25:54 - INFO - __main__ - Step 15386: {'lr': 0.000489976958192673, 'samples': 2954112, 'steps': 15385, 'loss/train': 1.6987992525100708}}} 11/06/2021 23:25:54 - INFO - __main__ - Step 15386: {'lr': 0.000489976958192673, 'samples': 2954112, 'steps': 15385, 'loss/train': 1.6987992525100708}}} 11/06/2021 23:25:57 - INFO - __main__ - Step 15393: {'lr': 0.000489966542593197, 'samples': 2955456, 'steps': 15392, 'loss/train': 1.6282072067260742}}} 11/06/2021 23:25:59 - INFO - __main__ - Step 15397: {'lr': 0.0004899605884432983, 'samples': 2956224, 'steps': 15396, 'loss/train': 1.6448665857315063}} 11/06/2021 23:26:01 - INFO - __main__ - Step 15402: {'lr': 0.0004899531433231728, 'samples': 2957184, 'steps': 15401, 'loss/train': 1.210938572883606}}} 11/06/2021 23:26:04 - INFO - __main__ - Step 15406: {'lr': 0.000489947185280923, 'samples': 2957952, 'steps': 15405, 'loss/train': 1.348501443862915}}}} 11/06/2021 23:26:06 - INFO - __main__ - Step 15410: {'lr': 0.0004899412255088091, 'samples': 2958720, 'steps': 15409, 'loss/train': 1.6057772636413574}} 11/06/2021 23:26:07 - INFO - __main__ - Step 15414: {'lr': 0.0004899352640068743, 'samples': 2959488, 'steps': 15413, 'loss/train': 1.6085702180862427}} 11/06/2021 23:26:09 - INFO - __main__ - Step 15418: {'lr': 0.0004899293007751616, 'samples': 2960256, 'steps': 15417, 'loss/train': 1.2673732042312622}} 11/06/2021 23:26:12 - INFO - __main__ - Step 15423: {'lr': 0.0004899218443030857, 'samples': 2961216, 'steps': 15422, 'loss/train': 0.9601808786392212}} 11/06/2021 23:26:12 - INFO - __main__ - Step 15423: {'lr': 0.0004899218443030857, 'samples': 2961216, 'steps': 15422, 'loss/train': 0.9601808786392212}} 11/06/2021 23:26:15 - INFO - __main__ - Step 15430: {'lr': 0.0004899114007017849, 'samples': 2962560, 'steps': 15429, 'loss/train': 1.9394744634628296}} 11/06/2021 23:26:17 - INFO - __main__ - Step 15434: {'lr': 0.0004899054305513899, 'samples': 2963328, 'steps': 15433, 'loss/train': 2.0248918533325195}} 11/06/2021 23:26:20 - INFO - __main__ - Step 15439: {'lr': 0.0004898979654312034, 'samples': 2964288, 'steps': 15438, 'loss/train': 1.7832825183868408}} 11/06/2021 23:26:20 - INFO - __main__ - Step 15439: {'lr': 0.0004898979654312034, 'samples': 2964288, 'steps': 15438, 'loss/train': 1.7832825183868408}} 11/06/2021 23:26:24 - INFO - __main__ - Step 15447: {'lr': 0.0004898860156180351, 'samples': 2965824, 'steps': 15446, 'loss/train': 1.3978043794631958}} 11/06/2021 23:26:26 - INFO - __main__ - Step 15451: {'lr': 0.0004898800381172951, 'samples': 2966592, 'steps': 15450, 'loss/train': 1.2126389741897583}} 11/06/2021 23:26:27 - INFO - __main__ - Step 15455: {'lr': 0.000489874058887175, 'samples': 2967360, 'steps': 15454, 'loss/train': 2.1697914600372314}}} 11/06/2021 23:26:30 - INFO - __main__ - Step 15459: {'lr': 0.0004898680779277182, 'samples': 2968128, 'steps': 15458, 'loss/train': 1.4359500408172607}} 11/06/2021 23:26:32 - INFO - __main__ - Step 15464: {'lr': 0.000489860599296583, 'samples': 2969088, 'steps': 15463, 'loss/train': 1.6961749792099}607}} 11/06/2021 23:26:34 - INFO - __main__ - Step 15469: {'lr': 0.0004898531179635108, 'samples': 2970048, 'steps': 15468, 'loss/train': 1.0643110275268555}} 11/06/2021 23:26:36 - INFO - __main__ - Step 15473: {'lr': 0.0004898471309517148, 'samples': 2970816, 'steps': 15472, 'loss/train': 1.8764305114746094}} 11/06/2021 23:26:36 - INFO - __main__ - Step 15473: {'lr': 0.0004898471309517148, 'samples': 2970816, 'steps': 15472, 'loss/train': 1.8764305114746094}} 11/06/2021 23:26:40 - INFO - __main__ - Step 15480: {'lr': 0.0004898366495203483, 'samples': 2972160, 'steps': 15479, 'loss/train': 1.967870831489563}}} 11/06/2021 23:26:42 - INFO - __main__ - Step 15485: {'lr': 0.0004898291595416438, 'samples': 2973120, 'steps': 15484, 'loss/train': 1.7662090063095093}} 11/06/2021 23:26:44 - INFO - __main__ - Step 15490: {'lr': 0.0004898216668613562, 'samples': 2974080, 'steps': 15489, 'loss/train': 1.7685637474060059}} 11/06/2021 23:26:46 - INFO - __main__ - Step 15494: {'lr': 0.0004898156707720432, 'samples': 2974848, 'steps': 15493, 'loss/train': 1.5298535823822021}} 11/06/2021 23:26:46 - INFO - __main__ - Step 15494: {'lr': 0.0004898156707720432, 'samples': 2974848, 'steps': 15493, 'loss/train': 1.5298535823822021}} 11/06/2021 23:26:50 - INFO - __main__ - Step 15501: {'lr': 0.0004898051734555676, 'samples': 2976192, 'steps': 15500, 'loss/train': 1.9325761795043945}} 11/06/2021 23:26:52 - INFO - __main__ - Step 15506: {'lr': 0.0004897976721307818, 'samples': 2977152, 'steps': 15505, 'loss/train': 1.7012150287628174}} 11/06/2021 23:26:52 - INFO - __main__ - Step 15506: {'lr': 0.0004897976721307818, 'samples': 2977152, 'steps': 15505, 'loss/train': 1.7012150287628174}} 11/06/2021 23:26:56 - INFO - __main__ - Step 15514: {'lr': 0.0004897856643926051, 'samples': 2978688, 'steps': 15513, 'loss/train': 1.6480381488800049}} 11/06/2021 23:26:58 - INFO - __main__ - Step 15518: {'lr': 0.0004897796579304458, 'samples': 2979456, 'steps': 15517, 'loss/train': 2.0190608501434326}} 11/06/2021 23:27:00 - INFO - __main__ - Step 15522: {'lr': 0.0004897736497396303, 'samples': 2980224, 'steps': 15521, 'loss/train': 1.6107763051986694}} 11/06/2021 23:27:02 - INFO - __main__ - Step 15527: {'lr': 0.000489766137070254, 'samples': 2981184, 'steps': 15526, 'loss/train': 1.7062444686889648}}} 11/06/2021 23:27:05 - INFO - __main__ - Step 15532: {'lr': 0.0004897586217000047, 'samples': 2982144, 'steps': 15531, 'loss/train': 1.5027600526809692}} 11/06/2021 23:27:05 - INFO - __main__ - Step 15532: {'lr': 0.0004897586217000047, 'samples': 2982144, 'steps': 15531, 'loss/train': 1.5027600526809692}} 11/06/2021 23:27:08 - INFO - __main__ - Step 15539: {'lr': 0.0004897480956443503, 'samples': 2983488, 'steps': 15538, 'loss/train': 1.4675657749176025}} 11/06/2021 23:27:10 - INFO - __main__ - Step 15543: {'lr': 0.0004897420783788006, 'samples': 2984256, 'steps': 15542, 'loss/train': 1.0916229486465454}} 11/06/2021 23:27:12 - INFO - __main__ - Step 15548: {'lr': 0.0004897345543663266, 'samples': 2985216, 'steps': 15547, 'loss/train': 0.22535571455955505} 11/06/2021 23:27:12 - INFO - __main__ - Step 15548: {'lr': 0.0004897345543663266, 'samples': 2985216, 'steps': 15547, 'loss/train': 0.22535571455955505} 11/06/2021 23:27:16 - INFO - __main__ - Step 15555: {'lr': 0.000489724016212013, 'samples': 2986560, 'steps': 15554, 'loss/train': 0.7079336643218994}5} 11/06/2021 23:27:18 - INFO - __main__ - Step 15559: {'lr': 0.0004897179920331826, 'samples': 2987328, 'steps': 15558, 'loss/train': 1.4157203435897827}} 11/06/2021 23:27:20 - INFO - __main__ - Step 15564: {'lr': 0.0004897104593793518, 'samples': 2988288, 'steps': 15563, 'loss/train': 1.779022455215454}}} 11/06/2021 23:27:23 - INFO - __main__ - Step 15568: {'lr': 0.0004897044313121061, 'samples': 2989056, 'steps': 15567, 'loss/train': 5.84039306640625}}}} 11/06/2021 23:27:25 - INFO - __main__ - Step 15572: {'lr': 0.0004896984015167466, 'samples': 2989824, 'steps': 15571, 'loss/train': 1.4174809455871582}} 11/06/2021 23:27:26 - INFO - __main__ - Step 15576: {'lr': 0.0004896923699933167, 'samples': 2990592, 'steps': 15575, 'loss/train': 1.7249995470046997}} 11/06/2021 23:27:28 - INFO - __main__ - Step 15580: {'lr': 0.0004896863367418598, 'samples': 2991360, 'steps': 15579, 'loss/train': 1.1707956790924072}} 11/06/2021 23:27:31 - INFO - __main__ - Step 15585: {'lr': 0.0004896787927475671, 'samples': 2992320, 'steps': 15584, 'loss/train': 1.8630757331848145}} 11/06/2021 23:27:31 - INFO - __main__ - Step 15585: {'lr': 0.0004896787927475671, 'samples': 2992320, 'steps': 15584, 'loss/train': 1.8630757331848145}} 11/06/2021 23:27:34 - INFO - __main__ - Step 15591: {'lr': 0.0004896697363905697, 'samples': 2993472, 'steps': 15590, 'loss/train': 1.653200387954712}}} 11/06/2021 23:27:36 - INFO - __main__ - Step 15596: {'lr': 0.0004896621864566331, 'samples': 2994432, 'steps': 15595, 'loss/train': 1.6735210418701172}} 11/06/2021 23:27:39 - INFO - __main__ - Step 15601: {'lr': 0.0004896546338229945, 'samples': 2995392, 'steps': 15600, 'loss/train': 1.6731430292129517}} 11/06/2021 23:27:41 - INFO - __main__ - Step 15605: {'lr': 0.0004896485897723552, 'samples': 2996160, 'steps': 15604, 'loss/train': 1.8443701267242432}} 11/06/2021 23:27:41 - INFO - __main__ - Step 15605: {'lr': 0.0004896485897723552, 'samples': 2996160, 'steps': 15604, 'loss/train': 1.8443701267242432}} 11/06/2021 23:27:44 - INFO - __main__ - Step 15612: {'lr': 0.0004896380085264573, 'samples': 2997504, 'steps': 15611, 'loss/train': 1.4598520994186401}} 11/06/2021 23:27:47 - INFO - __main__ - Step 15617: {'lr': 0.0004896304472543439, 'samples': 2998464, 'steps': 15616, 'loss/train': 1.2389189004898071}} 11/06/2021 23:27:47 - INFO - __main__ - Step 15617: {'lr': 0.0004896304472543439, 'samples': 2998464, 'steps': 15616, 'loss/train': 1.2389189004898071}} 11/06/2021 23:27:51 - INFO - __main__ - Step 15625: {'lr': 0.0004896183436043613, 'samples': 3000000, 'steps': 15624, 'loss/train': 1.905300498008728}}} 11/06/2021 23:27:52 - INFO - __main__ - Step 15629: {'lr': 0.0004896122891881083, 'samples': 3000768, 'steps': 15628, 'loss/train': 1.4601120948791504}} 11/06/2021 23:27:54 - INFO - __main__ - Step 15633: {'lr': 0.0004896062330444057, 'samples': 3001536, 'steps': 15632, 'loss/train': 1.8706539869308472}} 11/06/2021 23:27:57 - INFO - __main__ - Step 15638: {'lr': 0.0004895986604356178, 'samples': 3002496, 'steps': 15637, 'loss/train': 1.6697584390640259}} 11/06/2021 23:27:59 - INFO - __main__ - Step 15642: {'lr': 0.0004895926004053133, 'samples': 3003264, 'steps': 15641, 'loss/train': 1.5293596982955933}} 11/06/2021 23:27:59 - INFO - __main__ - Step 15642: {'lr': 0.0004895926004053133, 'samples': 3003264, 'steps': 15641, 'loss/train': 1.5293596982955933}} 11/06/2021 23:28:02 - INFO - __main__ - Step 15649: {'lr': 0.0004895819911959725, 'samples': 3004608, 'steps': 15648, 'loss/train': 0.838192343711853}}} 11/06/2021 23:28:05 - INFO - __main__ - Step 15654: {'lr': 0.0004895744099507284, 'samples': 3005568, 'steps': 15653, 'loss/train': 1.9209213256835938}} 11/06/2021 23:28:05 - INFO - __main__ - Step 15654: {'lr': 0.0004895744099507284, 'samples': 3005568, 'steps': 15653, 'loss/train': 1.9209213256835938}} 11/06/2021 23:28:09 - INFO - __main__ - Step 15662: {'lr': 0.0004895622743450497, 'samples': 3007104, 'steps': 15661, 'loss/train': 1.9682589769363403}} 11/06/2021 23:28:10 - INFO - __main__ - Step 15666: {'lr': 0.000489556203951555, 'samples': 3007872, 'steps': 15665, 'loss/train': 1.7595479488372803}}} 11/06/2021 23:28:12 - INFO - __main__ - Step 15670: {'lr': 0.000489550131831015, 'samples': 3008640, 'steps': 15669, 'loss/train': 1.0655637979507446}}} 11/06/2021 23:28:12 - INFO - __main__ - Step 15670: {'lr': 0.000489550131831015, 'samples': 3008640, 'steps': 15669, 'loss/train': 1.0655637979507446}}} 11/06/2021 23:28:17 - INFO - __main__ - Step 15678: {'lr': 0.0004895379824089743, 'samples': 3010176, 'steps': 15677, 'loss/train': 1.4177005290985107}} 11/06/2021 23:28:18 - INFO - __main__ - Step 15682: {'lr': 0.0004895319051075612, 'samples': 3010944, 'steps': 15681, 'loss/train': 1.2223986387252808}} 11/06/2021 23:28:20 - INFO - __main__ - Step 15686: {'lr': 0.0004895258260792781, 'samples': 3011712, 'steps': 15685, 'loss/train': 1.674849271774292}}} 11/06/2021 23:28:23 - INFO - __main__ - Step 15690: {'lr': 0.0004895197453241687, 'samples': 3012480, 'steps': 15689, 'loss/train': 1.8645319938659668}} 11/06/2021 23:28:23 - INFO - __main__ - Step 15690: {'lr': 0.0004895197453241687, 'samples': 3012480, 'steps': 15689, 'loss/train': 1.8645319938659668}} 11/06/2021 23:28:26 - INFO - __main__ - Step 15697: {'lr': 0.0004895090998476833, 'samples': 3013824, 'steps': 15696, 'loss/train': 1.7760313749313354}} 11/06/2021 23:28:28 - INFO - __main__ - Step 15701: {'lr': 0.0004895030143440278, 'samples': 3014592, 'steps': 15700, 'loss/train': 1.9887073040008545}} 11/06/2021 23:28:28 - INFO - __main__ - Step 15701: {'lr': 0.0004895030143440278, 'samples': 3014592, 'steps': 15700, 'loss/train': 1.9887073040008545}} 11/06/2021 23:28:32 - INFO - __main__ - Step 15708: {'lr': 0.000489492360557877, 'samples': 3015936, 'steps': 15707, 'loss/train': 1.384521722793579}5}} 11/06/2021 23:28:35 - INFO - __main__ - Step 15713: {'lr': 0.0004894847474732658, 'samples': 3016896, 'steps': 15712, 'loss/train': 1.6129860877990723}} 11/06/2021 23:28:37 - INFO - __main__ - Step 15717: {'lr': 0.0004894786550632264, 'samples': 3017664, 'steps': 15716, 'loss/train': 1.774351954460144}}} 11/06/2021 23:28:37 - INFO - __main__ - Step 15717: {'lr': 0.0004894786550632264, 'samples': 3017664, 'steps': 15716, 'loss/train': 1.774351954460144}}} 11/06/2021 23:28:40 - INFO - __main__ - Step 15724: {'lr': 0.0004894679891913266, 'samples': 3019008, 'steps': 15723, 'loss/train': 1.9029018878936768}} 11/06/2021 23:28:43 - INFO - __main__ - Step 15729: {'lr': 0.0004894603674743668, 'samples': 3019968, 'steps': 15728, 'loss/train': 1.7608121633529663}} 11/06/2021 23:28:43 - INFO - __main__ - Step 15729: {'lr': 0.0004894603674743668, 'samples': 3019968, 'steps': 15728, 'loss/train': 1.7608121633529663}} 11/06/2021 23:28:47 - INFO - __main__ - Step 15737: {'lr': 0.0004894481671166155, 'samples': 3021504, 'steps': 15736, 'loss/train': 1.9780317544937134}} 11/06/2021 23:28:48 - INFO - __main__ - Step 15741: {'lr': 0.0004894420643483184, 'samples': 3022272, 'steps': 15740, 'loss/train': 1.8506090641021729}} 11/06/2021 23:28:51 - INFO - __main__ - Step 15745: {'lr': 0.0004894359598537987, 'samples': 3023040, 'steps': 15744, 'loss/train': 1.7577073574066162}} 11/06/2021 23:28:51 - INFO - __main__ - Step 15745: {'lr': 0.0004894359598537987, 'samples': 3023040, 'steps': 15744, 'loss/train': 1.7577073574066162}} 11/06/2021 23:28:55 - INFO - __main__ - Step 15752: {'lr': 0.0004894252728347992, 'samples': 3024384, 'steps': 15751, 'loss/train': 1.6653718948364258}} 11/06/2021 23:28:56 - INFO - __main__ - Step 15756: {'lr': 0.0004894191635933949, 'samples': 3025152, 'steps': 15755, 'loss/train': 2.1717846393585205}} 11/06/2021 23:28:58 - INFO - __main__ - Step 15760: {'lr': 0.0004894130526259334, 'samples': 3025920, 'steps': 15759, 'loss/train': 1.284491777420044}}} 11/06/2021 23:29:01 - INFO - __main__ - Step 15765: {'lr': 0.0004894054114894055, 'samples': 3026880, 'steps': 15764, 'loss/train': 1.9538586139678955}} 11/06/2021 23:29:01 - INFO - __main__ - Step 15765: {'lr': 0.0004894054114894055, 'samples': 3026880, 'steps': 15764, 'loss/train': 1.9538586139678955}} 11/06/2021 23:29:05 - INFO - __main__ - Step 15772: {'lr': 0.0004893947093676458, 'samples': 3028224, 'steps': 15771, 'loss/train': 1.4160466194152832}} 11/06/2021 23:29:06 - INFO - __main__ - Step 15776: {'lr': 0.0004893885914963958, 'samples': 3028992, 'steps': 15775, 'loss/train': 1.977927565574646}}} 11/06/2021 23:29:09 - INFO - __main__ - Step 15781: {'lr': 0.0004893809417303803, 'samples': 3029952, 'steps': 15780, 'loss/train': 1.5243641138076782}} 11/06/2021 23:29:11 - INFO - __main__ - Step 15785: {'lr': 0.0004893748199760594, 'samples': 3030720, 'steps': 15784, 'loss/train': 2.882068157196045}}} 11/06/2021 23:29:11 - INFO - __main__ - Step 15785: {'lr': 0.0004893748199760594, 'samples': 3030720, 'steps': 15784, 'loss/train': 2.882068157196045}}} 11/06/2021 23:29:14 - INFO - __main__ - Step 15792: {'lr': 0.0004893641027534682, 'samples': 3032064, 'steps': 15791, 'loss/train': 1.4932310581207275}} 11/06/2021 23:29:17 - INFO - __main__ - Step 15797: {'lr': 0.0004893564443588476, 'samples': 3033024, 'steps': 15796, 'loss/train': 2.3122026920318604}} 11/06/2021 23:29:17 - INFO - __main__ - Step 15797: {'lr': 0.0004893564443588476, 'samples': 3033024, 'steps': 15796, 'loss/train': 2.3122026920318604}} 11/06/2021 23:29:21 - INFO - __main__ - Step 15805: {'lr': 0.0004893441853192739, 'samples': 3034560, 'steps': 15804, 'loss/train': 1.941593885421753}}} 11/06/2021 23:29:22 - INFO - __main__ - Step 15809: {'lr': 0.0004893380532111898, 'samples': 3035328, 'steps': 15808, 'loss/train': 1.792184591293335}}} 11/06/2021 23:29:24 - INFO - __main__ - Step 15813: {'lr': 0.0004893319193776331, 'samples': 3036096, 'steps': 15812, 'loss/train': 1.3216885328292847}} 11/06/2021 23:29:27 - INFO - __main__ - Step 15818: {'lr': 0.0004893242496593089, 'samples': 3037056, 'steps': 15817, 'loss/train': 1.5864753723144531}} 11/06/2021 23:29:29 - INFO - __main__ - Step 15822: {'lr': 0.0004893181119436007, 'samples': 3037824, 'steps': 15821, 'loss/train': 1.765951156616211}}} 11/06/2021 23:29:29 - INFO - __main__ - Step 15822: {'lr': 0.0004893181119436007, 'samples': 3037824, 'steps': 15821, 'loss/train': 1.765951156616211}}} 11/06/2021 23:29:32 - INFO - __main__ - Step 15829: {'lr': 0.0004893073667895658, 'samples': 3039168, 'steps': 15828, 'loss/train': 1.2464898824691772}} 11/06/2021 23:29:35 - INFO - __main__ - Step 15834: {'lr': 0.0004892996884446807, 'samples': 3040128, 'steps': 15833, 'loss/train': 1.350760817527771}}} 11/06/2021 23:29:35 - INFO - __main__ - Step 15834: {'lr': 0.0004892996884446807, 'samples': 3040128, 'steps': 15833, 'loss/train': 1.350760817527771}}} 11/06/2021 23:29:39 - INFO - __main__ - Step 15842: {'lr': 0.000489287397486014, 'samples': 3041664, 'steps': 15841, 'loss/train': 1.3404470682144165}}} 11/06/2021 23:29:40 - INFO - __main__ - Step 15846: {'lr': 0.0004892812494189973, 'samples': 3042432, 'steps': 15845, 'loss/train': 1.8682761192321777}} 11/06/2021 23:29:43 - INFO - __main__ - Step 15850: {'lr': 0.0004892750996269177, 'samples': 3043200, 'steps': 15849, 'loss/train': 1.7898610830307007}} 11/06/2021 23:29:45 - INFO - __main__ - Step 15854: {'lr': 0.0004892689481098193, 'samples': 3043968, 'steps': 15853, 'loss/train': 1.5801000595092773}} 11/06/2021 23:29:47 - INFO - __main__ - Step 15858: {'lr': 0.0004892627948677467, 'samples': 3044736, 'steps': 15857, 'loss/train': 1.1623653173446655}} 11/06/2021 23:29:49 - INFO - __main__ - Step 15862: {'lr': 0.0004892566399007441, 'samples': 3045504, 'steps': 15861, 'loss/train': 1.6186326742172241}} 11/06/2021 23:29:50 - INFO - __main__ - Step 15866: {'lr': 0.000489250483208856, 'samples': 3046272, 'steps': 15865, 'loss/train': 1.8032313585281372}}} 11/06/2021 23:29:53 - INFO - __main__ - Step 15870: {'lr': 0.0004892443247921265, 'samples': 3047040, 'steps': 15869, 'loss/train': 1.867167353630066}}} 11/06/2021 23:29:55 - INFO - __main__ - Step 15875: {'lr': 0.0004892366243457244, 'samples': 3048000, 'steps': 15874, 'loss/train': 1.7657088041305542}} 11/06/2021 23:29:55 - INFO - __main__ - Step 15875: {'lr': 0.0004892366243457244, 'samples': 3048000, 'steps': 15874, 'loss/train': 1.7657088041305542}} 11/06/2021 23:29:58 - INFO - __main__ - Step 15882: {'lr': 0.000489225839193335, 'samples': 3049344, 'steps': 15881, 'loss/train': 2.681311845779419}2}} 11/06/2021 23:30:00 - INFO - __main__ - Step 15886: {'lr': 0.0004892196738776848, 'samples': 3050112, 'steps': 15885, 'loss/train': 1.6380057334899902}} 11/06/2021 23:30:03 - INFO - __main__ - Step 15891: {'lr': 0.0004892119648078817, 'samples': 3051072, 'steps': 15890, 'loss/train': 1.6009021997451782}} 11/06/2021 23:30:05 - INFO - __main__ - Step 15895: {'lr': 0.0004892057956119012, 'samples': 3051840, 'steps': 15894, 'loss/train': 1.6373045444488525}} 11/06/2021 23:30:07 - INFO - __main__ - Step 15899: {'lr': 0.0004891996246914014, 'samples': 3052608, 'steps': 15898, 'loss/train': 1.63882315158844}5}} 11/06/2021 23:30:08 - INFO - __main__ - Step 15903: {'lr': 0.0004891934520464273, 'samples': 3053376, 'steps': 15902, 'loss/train': 1.785117268562317}}} 11/06/2021 23:30:11 - INFO - __main__ - Step 15908: {'lr': 0.000489185733815235, 'samples': 3054336, 'steps': 15907, 'loss/train': 1.645919919013977}}}} 11/06/2021 23:30:13 - INFO - __main__ - Step 15912: {'lr': 0.0004891795572903557, 'samples': 3055104, 'steps': 15911, 'loss/train': 1.5487264394760132}} 11/06/2021 23:30:13 - INFO - __main__ - Step 15912: {'lr': 0.0004891795572903557, 'samples': 3055104, 'steps': 15911, 'loss/train': 1.5487264394760132}} 11/06/2021 23:30:16 - INFO - __main__ - Step 15918: {'lr': 0.0004891702892699323, 'samples': 3056256, 'steps': 15917, 'loss/train': 1.2524698972702026}} 11/06/2021 23:30:19 - INFO - __main__ - Step 15924: {'lr': 0.0004891610173699169, 'samples': 3057408, 'steps': 15923, 'loss/train': 1.4933257102966309}} 11/06/2021 23:30:21 - INFO - __main__ - Step 15928: {'lr': 0.0004891548339479854, 'samples': 3058176, 'steps': 15927, 'loss/train': 1.7549539804458618}} 11/06/2021 23:30:21 - INFO - __main__ - Step 15928: {'lr': 0.0004891548339479854, 'samples': 3058176, 'steps': 15927, 'loss/train': 1.7549539804458618}} 11/06/2021 23:30:24 - INFO - __main__ - Step 15935: {'lr': 0.0004891440088108923, 'samples': 3059520, 'steps': 15934, 'loss/train': 1.4407297372817993}} 11/06/2021 23:30:27 - INFO - __main__ - Step 15939: {'lr': 0.0004891378206476511, 'samples': 3060288, 'steps': 15938, 'loss/train': 1.4875487089157104}} 11/06/2021 23:30:29 - INFO - __main__ - Step 15944: {'lr': 0.000489130083019189, 'samples': 3061248, 'steps': 15943, 'loss/train': 1.3536427021026611}}} 11/06/2021 23:30:31 - INFO - __main__ - Step 15948: {'lr': 0.0004891238909769454, 'samples': 3062016, 'steps': 15947, 'loss/train': 1.6971629858016968}} 11/06/2021 23:30:31 - INFO - __main__ - Step 15948: {'lr': 0.0004891238909769454, 'samples': 3062016, 'steps': 15947, 'loss/train': 1.6971629858016968}} 11/06/2021 23:30:35 - INFO - __main__ - Step 15955: {'lr': 0.0004891130507548427, 'samples': 3063360, 'steps': 15954, 'loss/train': 2.0542542934417725}} 11/06/2021 23:30:37 - INFO - __main__ - Step 15960: {'lr': 0.0004891053045068217, 'samples': 3064320, 'steps': 15959, 'loss/train': 2.0735301971435547}} 11/06/2021 23:30:39 - INFO - __main__ - Step 15964: {'lr': 0.0004890991055691318, 'samples': 3065088, 'steps': 15963, 'loss/train': 1.987470030784607}}} 11/06/2021 23:30:41 - INFO - __main__ - Step 15968: {'lr': 0.0004890929049076919, 'samples': 3065856, 'steps': 15967, 'loss/train': 1.5248794555664062}} 11/06/2021 23:30:43 - INFO - __main__ - Step 15972: {'lr': 0.0004890867025225469, 'samples': 3066624, 'steps': 15971, 'loss/train': 1.434366226196289}}} 11/06/2021 23:30:43 - INFO - __main__ - Step 15972: {'lr': 0.0004890867025225469, 'samples': 3066624, 'steps': 15971, 'loss/train': 1.434366226196289}}} 11/06/2021 23:30:47 - INFO - __main__ - Step 15980: {'lr': 0.0004890742925813202, 'samples': 3068160, 'steps': 15979, 'loss/train': 1.9059771299362183}} 11/06/2021 23:30:49 - INFO - __main__ - Step 15984: {'lr': 0.0004890680850253281, 'samples': 3068928, 'steps': 15983, 'loss/train': 1.7431037425994873}} 11/06/2021 23:30:51 - INFO - __main__ - Step 15988: {'lr': 0.0004890618757458096, 'samples': 3069696, 'steps': 15987, 'loss/train': 1.3658758401870728}} 11/06/2021 23:30:53 - INFO - __main__ - Step 15992: {'lr': 0.0004890556647428097, 'samples': 3070464, 'steps': 15991, 'loss/train': 1.804748296737671}}} 11/06/2021 23:30:55 - INFO - __main__ - Step 15997: {'lr': 0.0004890478985654823, 'samples': 3071424, 'steps': 15996, 'loss/train': 1.6601943969726562}} 11/06/2021 23:30:55 - INFO - __main__ - Step 15997: {'lr': 0.0004890478985654823, 'samples': 3071424, 'steps': 15996, 'loss/train': 1.6601943969726562}} 11/06/2021 23:30:59 - INFO - __main__ - Step 16005: {'lr': 0.0004890354670808074, 'samples': 3072960, 'steps': 16004, 'loss/train': 2.4110894203186035}} 11/06/2021 23:31:01 - INFO - __main__ - Step 16009: {'lr': 0.0004890292487535108, 'samples': 3073728, 'steps': 16008, 'loss/train': 2.2022879123687744}} 11/06/2021 23:31:03 - INFO - __main__ - Step 16013: {'lr': 0.0004890230287029681, 'samples': 3074496, 'steps': 16012, 'loss/train': 1.8367836475372314}} 11/06/2021 23:31:05 - INFO - __main__ - Step 16017: {'lr': 0.0004890168069292241, 'samples': 3075264, 'steps': 16016, 'loss/train': 1.898835301399231}}} 11/06/2021 23:31:07 - INFO - __main__ - Step 16021: {'lr': 0.0004890105834323233, 'samples': 3076032, 'steps': 16020, 'loss/train': 1.5474944114685059}} 11/06/2021 23:31:07 - INFO - __main__ - Step 16021: {'lr': 0.0004890105834323233, 'samples': 3076032, 'steps': 16020, 'loss/train': 1.5474944114685059}} 11/06/2021 23:31:10 - INFO - __main__ - Step 16028: {'lr': 0.0004889996881665366, 'samples': 3077376, 'steps': 16027, 'loss/train': 1.3393259048461914}} 11/06/2021 23:31:13 - INFO - __main__ - Step 16034: {'lr': 0.0004889903451673884, 'samples': 3078528, 'steps': 16033, 'loss/train': 1.6132980585098267}} 11/06/2021 23:31:16 - INFO - __main__ - Step 16039: {'lr': 0.000488982556373411, 'samples': 3079488, 'steps': 16038, 'loss/train': 1.3169190883636475}}} 11/06/2021 23:31:16 - INFO - __main__ - Step 16039: {'lr': 0.000488982556373411, 'samples': 3079488, 'steps': 16038, 'loss/train': 1.3169190883636475}}} 11/06/2021 23:31:20 - INFO - __main__ - Step 16047: {'lr': 0.0004889700887036275, 'samples': 3081024, 'steps': 16046, 'loss/train': 2.1141278743743896}} 11/06/2021 23:31:20 - INFO - __main__ - Step 16047: {'lr': 0.0004889700887036275, 'samples': 3081024, 'steps': 16046, 'loss/train': 2.1141278743743896}} 11/06/2021 23:31:23 - INFO - __main__ - Step 16054: {'lr': 0.0004889591738395522, 'samples': 3082368, 'steps': 16053, 'loss/train': 1.535683512687683}}} 11/06/2021 23:31:23 - INFO - __main__ - Step 16054: {'lr': 0.0004889591738395522, 'samples': 3082368, 'steps': 16053, 'loss/train': 1.535683512687683}}} 11/06/2021 23:31:28 - INFO - __main__ - Step 16063: {'lr': 0.0004889451326905831, 'samples': 3084096, 'steps': 16062, 'loss/train': 1.7582470178604126}} 11/06/2021 23:31:29 - INFO - __main__ - Step 16067: {'lr': 0.0004889388893806099, 'samples': 3084864, 'steps': 16066, 'loss/train': 1.8277945518493652}} 11/06/2021 23:31:29 - INFO - __main__ - Step 16067: {'lr': 0.0004889388893806099, 'samples': 3084864, 'steps': 16066, 'loss/train': 1.8277945518493652}} 11/06/2021 23:31:33 - INFO - __main__ - Step 16074: {'lr': 0.0004889279594431903, 'samples': 3086208, 'steps': 16073, 'loss/train': 1.7310936450958252}} 11/06/2021 23:31:35 - INFO - __main__ - Step 16078: {'lr': 0.0004889217113961896, 'samples': 3086976, 'steps': 16077, 'loss/train': 1.6018173694610596}} 11/06/2021 23:31:37 - INFO - __main__ - Step 16082: {'lr': 0.0004889154616267181, 'samples': 3087744, 'steps': 16081, 'loss/train': 1.7039793729782104}} 11/06/2021 23:31:39 - INFO - __main__ - Step 16086: {'lr': 0.0004889092101348206, 'samples': 3088512, 'steps': 16085, 'loss/train': 1.1894793510437012}} 11/06/2021 23:31:41 - INFO - __main__ - Step 16091: {'lr': 0.0004889013933478559, 'samples': 3089472, 'steps': 16090, 'loss/train': 1.9759714603424072}} 11/06/2021 23:31:44 - INFO - __main__ - Step 16096: {'lr': 0.0004888935738697593, 'samples': 3090432, 'steps': 16095, 'loss/train': 1.3398206233978271}} 11/06/2021 23:31:44 - INFO - __main__ - Step 16096: {'lr': 0.0004888935738697593, 'samples': 3090432, 'steps': 16095, 'loss/train': 1.3398206233978271}} 11/06/2021 23:31:48 - INFO - __main__ - Step 16103: {'lr': 0.0004888826220794899, 'samples': 3091776, 'steps': 16102, 'loss/train': 1.991464376449585}}} 11/06/2021 23:31:49 - INFO - __main__ - Step 16107: {'lr': 0.0004888763615455959, 'samples': 3092544, 'steps': 16106, 'loss/train': 1.7099063396453857}} 11/06/2021 23:31:49 - INFO - __main__ - Step 16107: {'lr': 0.0004888763615455959, 'samples': 3092544, 'steps': 16106, 'loss/train': 1.7099063396453857}} 11/06/2021 23:31:53 - INFO - __main__ - Step 16114: {'lr': 0.0004888654014673998, 'samples': 3093888, 'steps': 16113, 'loss/train': 1.8334771394729614}} 11/06/2021 23:31:55 - INFO - __main__ - Step 16119: {'lr': 0.0004888575696112308, 'samples': 3094848, 'steps': 16118, 'loss/train': 1.839583158493042}}} 11/06/2021 23:31:55 - INFO - __main__ - Step 16119: {'lr': 0.0004888575696112308, 'samples': 3094848, 'steps': 16118, 'loss/train': 1.839583158493042}}} 11/06/2021 23:32:00 - INFO - __main__ - Step 16127: {'lr': 0.0004888450330448692, 'samples': 3096384, 'steps': 16126, 'loss/train': 1.4687092304229736}} 11/06/2021 23:32:02 - INFO - __main__ - Step 16131: {'lr': 0.0004888387621787885, 'samples': 3097152, 'steps': 16130, 'loss/train': 2.1754300594329834}} 11/06/2021 23:32:03 - INFO - __main__ - Step 16135: {'lr': 0.0004888324895908349, 'samples': 3097920, 'steps': 16134, 'loss/train': 1.8906220197677612}} 11/06/2021 23:32:05 - INFO - __main__ - Step 16139: {'lr': 0.0004888262152810534, 'samples': 3098688, 'steps': 16138, 'loss/train': 1.4442768096923828}} 11/06/2021 23:32:08 - INFO - __main__ - Step 16144: {'lr': 0.0004888183699725755, 'samples': 3099648, 'steps': 16143, 'loss/train': 1.240419864654541}}} 11/06/2021 23:32:08 - INFO - __main__ - Step 16144: {'lr': 0.0004888183699725755, 'samples': 3099648, 'steps': 16143, 'loss/train': 1.240419864654541}}} 11/06/2021 23:32:11 - INFO - __main__ - Step 16151: {'lr': 0.0004888073820211952, 'samples': 3100992, 'steps': 16150, 'loss/train': 1.5844863653182983}} 11/06/2021 23:32:13 - INFO - __main__ - Step 16155: {'lr': 0.0004888011008245554, 'samples': 3101760, 'steps': 16154, 'loss/train': 1.5103989839553833}} 11/06/2021 23:32:16 - INFO - __main__ - Step 16161: {'lr': 0.0004887916758016069, 'samples': 3102912, 'steps': 16160, 'loss/train': 1.7102398872375488}} 11/06/2021 23:32:16 - INFO - __main__ - Step 16161: {'lr': 0.0004887916758016069, 'samples': 3102912, 'steps': 16160, 'loss/train': 1.7102398872375488}} 11/06/2021 23:32:20 - INFO - __main__ - Step 16168: {'lr': 0.0004887806750459002, 'samples': 3104256, 'steps': 16167, 'loss/train': 0.9959037899971008}} 11/06/2021 23:32:21 - INFO - __main__ - Step 16172: {'lr': 0.000488774386532767, 'samples': 3105024, 'steps': 16171, 'loss/train': 1.583954930305481}8}} 11/06/2021 23:32:24 - INFO - __main__ - Step 16176: {'lr': 0.0004887680962982249, 'samples': 3105792, 'steps': 16175, 'loss/train': 1.8155841827392578}} 11/06/2021 23:32:26 - INFO - __main__ - Step 16181: {'lr': 0.0004887602310843852, 'samples': 3106752, 'steps': 16180, 'loss/train': 2.0834195613861084}} 11/06/2021 23:32:28 - INFO - __main__ - Step 16185: {'lr': 0.000488753936976839, 'samples': 3107520, 'steps': 16184, 'loss/train': 5.809494495391846}4}} 11/06/2021 23:32:30 - INFO - __main__ - Step 16189: {'lr': 0.0004887476411480314, 'samples': 3108288, 'steps': 16188, 'loss/train': 1.7173161506652832}} 11/06/2021 23:32:32 - INFO - __main__ - Step 16193: {'lr': 0.0004887413435980077, 'samples': 3109056, 'steps': 16192, 'loss/train': 1.82635498046875}2}} 11/06/2021 23:32:32 - INFO - __main__ - Step 16193: {'lr': 0.0004887413435980077, 'samples': 3109056, 'steps': 16192, 'loss/train': 1.82635498046875}2}} 11/06/2021 23:32:36 - INFO - __main__ - Step 16201: {'lr': 0.0004887287433344939, 'samples': 3110592, 'steps': 16200, 'loss/train': 1.2824130058288574}} 11/06/2021 23:32:38 - INFO - __main__ - Step 16205: {'lr': 0.0004887224406210945, 'samples': 3111360, 'steps': 16204, 'loss/train': 1.9707239866256714}} 11/06/2021 23:32:40 - INFO - __main__ - Step 16209: {'lr': 0.0004887161361866607, 'samples': 3112128, 'steps': 16208, 'loss/train': 1.5540908575057983}} 11/06/2021 23:32:42 - INFO - __main__ - Step 16214: {'lr': 0.0004887082532234832, 'samples': 3113088, 'steps': 16213, 'loss/train': 1.5670133829116821}} 11/06/2021 23:32:44 - INFO - __main__ - Step 16218: {'lr': 0.0004887019449168884, 'samples': 3113856, 'steps': 16217, 'loss/train': 1.4380708932876587}} 11/06/2021 23:32:46 - INFO - __main__ - Step 16222: {'lr': 0.0004886956348894069, 'samples': 3114624, 'steps': 16221, 'loss/train': 1.3726708889007568}} 11/06/2021 23:32:48 - INFO - __main__ - Step 16226: {'lr': 0.0004886893231410844, 'samples': 3115392, 'steps': 16225, 'loss/train': 2.2528281211853027}} 11/06/2021 23:32:50 - INFO - __main__ - Step 16230: {'lr': 0.0004886830096719662, 'samples': 3116160, 'steps': 16229, 'loss/train': 2.0018463134765625}} 11/06/2021 23:32:52 - INFO - __main__ - Step 16234: {'lr': 0.0004886766944820979, 'samples': 3116928, 'steps': 16233, 'loss/train': 1.961445927619934}}} 11/06/2021 23:32:54 - INFO - __main__ - Step 16238: {'lr': 0.000488670377571525, 'samples': 3117696, 'steps': 16237, 'loss/train': 1.2586119174957275}}} 11/06/2021 23:32:56 - INFO - __main__ - Step 16242: {'lr': 0.0004886640589402932, 'samples': 3118464, 'steps': 16241, 'loss/train': 1.874056339263916}}} 11/06/2021 23:32:57 - INFO - __main__ - Step 16246: {'lr': 0.0004886577385884478, 'samples': 3119232, 'steps': 16245, 'loss/train': 1.6517481803894043}} 11/06/2021 23:33:00 - INFO - __main__ - Step 16250: {'lr': 0.0004886514165160345, 'samples': 3120000, 'steps': 16249, 'loss/train': 1.7493343353271484}} 11/06/2021 23:33:02 - INFO - __main__ - Step 16255: {'lr': 0.0004886435115060388, 'samples': 3120960, 'steps': 16254, 'loss/train': 1.7781718969345093}} 11/06/2021 23:33:02 - INFO - __main__ - Step 16255: {'lr': 0.0004886435115060388, 'samples': 3120960, 'steps': 16254, 'loss/train': 1.7781718969345093}} 11/06/2021 23:33:02 - INFO - __main__ - Step 16255: {'lr': 0.0004886435115060388, 'samples': 3120960, 'steps': 16254, 'loss/train': 1.7781718969345093}} 11/06/2021 23:33:08 - INFO - __main__ - Step 16266: {'lr': 0.0004886261110216141, 'samples': 3123072, 'steps': 16265, 'loss/train': 1.357380986213684}}} 11/06/2021 23:33:10 - INFO - __main__ - Step 16272: {'lr': 0.0004886166143646476, 'samples': 3124224, 'steps': 16271, 'loss/train': 1.217058777809143}}} 11/06/2021 23:33:10 - INFO - __main__ - Step 16272: {'lr': 0.0004886166143646476, 'samples': 3124224, 'steps': 16271, 'loss/train': 1.217058777809143}}} 11/06/2021 23:33:14 - INFO - __main__ - Step 16279: {'lr': 0.000488605530039509, 'samples': 3125568, 'steps': 16278, 'loss/train': 2.0787177085876465}}} 11/06/2021 23:33:16 - INFO - __main__ - Step 16283: {'lr': 0.0004885991937741506, 'samples': 3126336, 'steps': 16282, 'loss/train': 1.2737128734588623}} 11/06/2021 23:33:18 - INFO - __main__ - Step 16287: {'lr': 0.0004885928557886466, 'samples': 3127104, 'steps': 16286, 'loss/train': 1.347070574760437}}} 11/06/2021 23:33:20 - INFO - __main__ - Step 16292: {'lr': 0.0004885849308878809, 'samples': 3128064, 'steps': 16291, 'loss/train': 1.1847530603408813}} 11/06/2021 23:33:22 - INFO - __main__ - Step 16296: {'lr': 0.0004885785890322158, 'samples': 3128832, 'steps': 16295, 'loss/train': 1.4970622062683105}} 11/06/2021 23:33:22 - INFO - __main__ - Step 16296: {'lr': 0.0004885785890322158, 'samples': 3128832, 'steps': 16295, 'loss/train': 1.4970622062683105}} 11/06/2021 23:33:26 - INFO - __main__ - Step 16303: {'lr': 0.0004885674866460858, 'samples': 3130176, 'steps': 16302, 'loss/train': 1.6252819299697876}} 11/06/2021 23:33:28 - INFO - __main__ - Step 16308: {'lr': 0.0004885595531454195, 'samples': 3131136, 'steps': 16307, 'loss/train': 1.55535888671875}6}} 11/06/2021 23:33:30 - INFO - __main__ - Step 16312: {'lr': 0.0004885532044100396, 'samples': 3131904, 'steps': 16311, 'loss/train': 2.0965306758880615}} 11/06/2021 23:33:32 - INFO - __main__ - Step 16316: {'lr': 0.0004885468539548455, 'samples': 3132672, 'steps': 16315, 'loss/train': 1.3534947633743286}} 11/06/2021 23:33:34 - INFO - __main__ - Step 16320: {'lr': 0.0004885405017798828, 'samples': 3133440, 'steps': 16319, 'loss/train': 2.023437023162842}}} 11/06/2021 23:33:37 - INFO - __main__ - Step 16325: {'lr': 0.0004885325591428248, 'samples': 3134400, 'steps': 16324, 'loss/train': 0.9680758118629456}} 11/06/2021 23:33:37 - INFO - __main__ - Step 16325: {'lr': 0.0004885325591428248, 'samples': 3134400, 'steps': 16324, 'loss/train': 0.9680758118629456}} 11/06/2021 23:33:40 - INFO - __main__ - Step 16332: {'lr': 0.0004885214349368419, 'samples': 3135744, 'steps': 16331, 'loss/train': 1.477682113647461}}} 11/06/2021 23:33:42 - INFO - __main__ - Step 16336: {'lr': 0.0004885150758832632, 'samples': 3136512, 'steps': 16335, 'loss/train': 1.7622499465942383}} 11/06/2021 23:33:44 - INFO - __main__ - Step 16340: {'lr': 0.0004885087151101453, 'samples': 3137280, 'steps': 16339, 'loss/train': 1.7774170637130737}} 11/06/2021 23:33:46 - INFO - __main__ - Step 16345: {'lr': 0.0004885007617257154, 'samples': 3138240, 'steps': 16344, 'loss/train': 1.7933660745620728}} 11/06/2021 23:33:49 - INFO - __main__ - Step 16350: {'lr': 0.0004884928056546663, 'samples': 3139200, 'steps': 16349, 'loss/train': 1.1514432430267334}} 11/06/2021 23:33:49 - INFO - __main__ - Step 16350: {'lr': 0.0004884928056546663, 'samples': 3139200, 'steps': 16349, 'loss/train': 1.1514432430267334}} 11/06/2021 23:33:52 - INFO - __main__ - Step 16356: {'lr': 0.0004884832548231966, 'samples': 3140352, 'steps': 16355, 'loss/train': 1.732427716255188}}} 11/06/2021 23:33:55 - INFO - __main__ - Step 16361: {'lr': 0.0004884752928419012, 'samples': 3141312, 'steps': 16360, 'loss/train': 1.7363823652267456}} 11/06/2021 23:33:55 - INFO - __main__ - Step 16361: {'lr': 0.0004884752928419012, 'samples': 3141312, 'steps': 16360, 'loss/train': 1.7363823652267456}} 11/06/2021 23:33:58 - INFO - __main__ - Step 16368: {'lr': 0.0004884641415550696, 'samples': 3142656, 'steps': 16367, 'loss/train': 1.6368969678878784}} 11/06/2021 23:33:58 - INFO - __main__ - Step 16368: {'lr': 0.0004884641415550696, 'samples': 3142656, 'steps': 16367, 'loss/train': 1.6368969678878784}} 11/06/2021 23:33:58 - INFO - __main__ - Step 16368: {'lr': 0.0004884641415550696, 'samples': 3142656, 'steps': 16367, 'loss/train': 1.6368969678878784}} 11/06/2021 23:34:04 - INFO - __main__ - Step 16379: {'lr': 0.0004884466074670512, 'samples': 3144768, 'steps': 16378, 'loss/train': 1.9465229511260986}} 11/06/2021 23:34:06 - INFO - __main__ - Step 16383: {'lr': 0.0004884402282117833, 'samples': 3145536, 'steps': 16382, 'loss/train': 0.8137797117233276}} 11/06/2021 23:34:08 - INFO - __main__ - Step 16388: {'lr': 0.0004884322517253604, 'samples': 3146496, 'steps': 16387, 'loss/train': 1.888465404510498}}} 11/06/2021 23:34:10 - INFO - __main__ - Step 16392: {'lr': 0.0004884258686024077, 'samples': 3147264, 'steps': 16391, 'loss/train': 2.085355758666992}}} 11/06/2021 23:34:12 - INFO - __main__ - Step 16396: {'lr': 0.0004884194837605587, 'samples': 3148032, 'steps': 16395, 'loss/train': 1.6716073751449585}} 11/06/2021 23:34:14 - INFO - __main__ - Step 16400: {'lr': 0.0004884130971998595, 'samples': 3148800, 'steps': 16399, 'loss/train': 1.464718222618103}}} 11/06/2021 23:34:16 - INFO - __main__ - Step 16405: {'lr': 0.0004884051115819224, 'samples': 3149760, 'steps': 16404, 'loss/train': 2.2028331756591797}} 11/06/2021 23:34:16 - INFO - __main__ - Step 16405: {'lr': 0.0004884051115819224, 'samples': 3149760, 'steps': 16404, 'loss/train': 2.2028331756591797}} 11/06/2021 23:34:20 - INFO - __main__ - Step 16412: {'lr': 0.0004883939272051208, 'samples': 3151104, 'steps': 16411, 'loss/train': 0.9273844957351685}} 11/06/2021 23:34:22 - INFO - __main__ - Step 16416: {'lr': 0.000488387533769481, 'samples': 3151872, 'steps': 16415, 'loss/train': 1.3187230825424194}}} 11/06/2021 23:34:24 - INFO - __main__ - Step 16421: {'lr': 0.0004883795395581277, 'samples': 3152832, 'steps': 16420, 'loss/train': 1.746578574180603}}} 11/06/2021 23:34:24 - INFO - __main__ - Step 16421: {'lr': 0.0004883795395581277, 'samples': 3152832, 'steps': 16420, 'loss/train': 1.746578574180603}}} 11/06/2021 23:34:29 - INFO - __main__ - Step 16429: {'lr': 0.0004883667432346723, 'samples': 3154368, 'steps': 16428, 'loss/train': 1.2394202947616577}} 11/06/2021 23:34:30 - INFO - __main__ - Step 16433: {'lr': 0.0004883603424952165, 'samples': 3155136, 'steps': 16432, 'loss/train': 1.8419935703277588}} 11/06/2021 23:34:32 - INFO - __main__ - Step 16437: {'lr': 0.0004883539400373369, 'samples': 3155904, 'steps': 16436, 'loss/train': 1.708992600440979}}} 11/06/2021 23:34:35 - INFO - __main__ - Step 16442: {'lr': 0.000488345934548524, 'samples': 3156864, 'steps': 16441, 'loss/train': 1.2319788932800293}}} 11/06/2021 23:34:37 - INFO - __main__ - Step 16446: {'lr': 0.0004883395282243595, 'samples': 3157632, 'steps': 16445, 'loss/train': 1.54921293258667}}}} 11/06/2021 23:34:39 - INFO - __main__ - Step 16450: {'lr': 0.0004883331201819211, 'samples': 3158400, 'steps': 16449, 'loss/train': 2.188420057296753}}} 11/06/2021 23:34:40 - INFO - __main__ - Step 16454: {'lr': 0.0004883267104212551, 'samples': 3159168, 'steps': 16453, 'loss/train': 0.8045353889465332}} 11/06/2021 23:34:40 - INFO - __main__ - Step 16454: {'lr': 0.0004883267104212551, 'samples': 3159168, 'steps': 16453, 'loss/train': 0.8045353889465332}} torch.nn.utils.clip_grad_norm_(parameters, max_norm, norm_type=norm_type)12551, 'samples': 3159168, 'steps': 16453, 'loss/train': 0.8045353889465332}} 11/06/2021 23:34:46 - INFO - __main__ - Step 16465: {'lr': 0.0004883090747201897, 'samples': 3161280, 'steps': 16464, 'loss/train': 1.9212692975997925}} 11/06/2021 23:34:49 - INFO - __main__ - Step 16470: {'lr': 0.0004883010541972392, 'samples': 3162240, 'steps': 16469, 'loss/train': 1.4929072856903076}} 11/06/2021 23:34:51 - INFO - __main__ - Step 16474: {'lr': 0.0004882946358461285, 'samples': 3163008, 'steps': 16473, 'loss/train': 2.0677337646484375}} 11/06/2021 23:34:52 - INFO - __main__ - Step 16478: {'lr': 0.0004882882157770676, 'samples': 3163776, 'steps': 16477, 'loss/train': 1.4509589672088623}} 11/06/2021 23:34:54 - INFO - __main__ - Step 16482: {'lr': 0.0004882817939901027, 'samples': 3164544, 'steps': 16481, 'loss/train': 1.7016760110855103}} 11/06/2021 23:34:54 - INFO - __main__ - Step 16482: {'lr': 0.0004882817939901027, 'samples': 3164544, 'steps': 16481, 'loss/train': 1.7016760110855103}} 11/06/2021 23:34:59 - INFO - __main__ - Step 16490: {'lr': 0.0004882689452626468, 'samples': 3166080, 'steps': 16489, 'loss/train': 1.9867051839828491}} 11/06/2021 23:35:00 - INFO - __main__ - Step 16494: {'lr': 0.0004882625183222481, 'samples': 3166848, 'steps': 16493, 'loss/train': 1.7978031635284424}} 11/06/2021 23:35:02 - INFO - __main__ - Step 16498: {'lr': 0.00048825608966413095, 'samples': 3167616, 'steps': 16497, 'loss/train': 1.3743228912353516} 11/06/2021 23:35:05 - INFO - __main__ - Step 16503: {'lr': 0.0004882480514260131, 'samples': 3168576, 'steps': 16502, 'loss/train': 1.7174887657165527}} 11/06/2021 23:35:07 - INFO - __main__ - Step 16507: {'lr': 0.00048824161890319854, 'samples': 3169344, 'steps': 16506, 'loss/train': 1.6323604583740234} 11/06/2021 23:35:09 - INFO - __main__ - Step 16511: {'lr': 0.00048823518466281586, 'samples': 3170112, 'steps': 16510, 'loss/train': 1.5818568468093872} 11/06/2021 23:35:10 - INFO - __main__ - Step 16515: {'lr': 0.0004882287487049117, 'samples': 3170880, 'steps': 16514, 'loss/train': 2.055966854095459}2} 11/06/2021 23:35:12 - INFO - __main__ - Step 16519: {'lr': 0.0004882223110295323, 'samples': 3171648, 'steps': 16518, 'loss/train': 1.8688184022903442}} 11/06/2021 23:35:15 - INFO - __main__ - Step 16524: {'lr': 0.0004882142615201793, 'samples': 3172608, 'steps': 16523, 'loss/train': 1.3749724626541138}} 11/06/2021 23:35:17 - INFO - __main__ - Step 16528: {'lr': 0.00048820781998065054, 'samples': 3173376, 'steps': 16527, 'loss/train': 1.1366714239120483} 11/06/2021 23:35:17 - INFO - __main__ - Step 16528: {'lr': 0.00048820781998065054, 'samples': 3173376, 'steps': 16527, 'loss/train': 1.1366714239120483} 11/06/2021 23:35:20 - INFO - __main__ - Step 16535: {'lr': 0.0004881965431541916, 'samples': 3174720, 'steps': 16534, 'loss/train': 1.4096086025238037}} 11/06/2021 23:35:22 - INFO - __main__ - Step 16539: {'lr': 0.0004881900968921328, 'samples': 3175488, 'steps': 16538, 'loss/train': 1.8910950422286987}} 11/06/2021 23:35:22 - INFO - __main__ - Step 16539: {'lr': 0.0004881900968921328, 'samples': 3175488, 'steps': 16538, 'loss/train': 1.8910950422286987}} 11/06/2021 23:35:27 - INFO - __main__ - Step 16547: {'lr': 0.0004881771992164722, 'samples': 3177024, 'steps': 16546, 'loss/train': 1.7142109870910645}} 11/06/2021 23:35:28 - INFO - __main__ - Step 16551: {'lr': 0.0004881707478029634, 'samples': 3177792, 'steps': 16550, 'loss/train': 1.6136027574539185}} 11/06/2021 23:35:30 - INFO - __main__ - Step 16555: {'lr': 0.0004881642946723975, 'samples': 3178560, 'steps': 16554, 'loss/train': 1.7432516813278198}} 11/06/2021 23:35:33 - INFO - __main__ - Step 16560: {'lr': 0.0004881562258446496, 'samples': 3179520, 'steps': 16559, 'loss/train': 1.6638139486312866}} 11/06/2021 23:35:33 - INFO - __main__ - Step 16560: {'lr': 0.0004881562258446496, 'samples': 3179520, 'steps': 16559, 'loss/train': 1.6638139486312866}} 11/06/2021 23:35:36 - INFO - __main__ - Step 16566: {'lr': 0.000488146539710146, 'samples': 3180672, 'steps': 16565, 'loss/train': 1.3595619201660156}}} 11/06/2021 23:35:39 - INFO - __main__ - Step 16571: {'lr': 0.0004881384649804945, 'samples': 3181632, 'steps': 16570, 'loss/train': 1.6895521879196167}} 11/06/2021 23:35:41 - INFO - __main__ - Step 16576: {'lr': 0.00048813038756830506, 'samples': 3182592, 'steps': 16575, 'loss/train': 1.7620586156845093} 11/06/2021 23:35:41 - INFO - __main__ - Step 16576: {'lr': 0.00048813038756830506, 'samples': 3182592, 'steps': 16575, 'loss/train': 1.7620586156845093} 11/06/2021 23:35:44 - INFO - __main__ - Step 16582: {'lr': 0.00048812069113285573, 'samples': 3183744, 'steps': 16581, 'loss/train': 1.8036795854568481} 11/06/2021 23:35:47 - INFO - __main__ - Step 16587: {'lr': 0.00048811260781940317, 'samples': 3184704, 'steps': 16586, 'loss/train': 1.7502573728561401} 11/06/2021 23:35:49 - INFO - __main__ - Step 16591: {'lr': 0.0004881061392374192, 'samples': 3185472, 'steps': 16590, 'loss/train': 1.1119537353515625}} 11/06/2021 23:35:51 - INFO - __main__ - Step 16595: {'lr': 0.00048809966893884396, 'samples': 3186240, 'steps': 16594, 'loss/train': 1.6212115287780762} 11/06/2021 23:35:53 - INFO - __main__ - Step 16599: {'lr': 0.00048809319692372406, 'samples': 3187008, 'steps': 16598, 'loss/train': 1.8749748468399048} 11/06/2021 23:35:53 - INFO - __main__ - Step 16599: {'lr': 0.00048809319692372406, 'samples': 3187008, 'steps': 16598, 'loss/train': 1.8749748468399048} 11/06/2021 23:35:57 - INFO - __main__ - Step 16607: {'lr': 0.00048808024774403726, 'samples': 3188544, 'steps': 16606, 'loss/train': 1.502026915550232}} 11/06/2021 23:35:59 - INFO - __main__ - Step 16611: {'lr': 0.00048807377057956365, 'samples': 3189312, 'steps': 16610, 'loss/train': 1.7892169952392578} 11/06/2021 23:36:01 - INFO - __main__ - Step 16615: {'lr': 0.0004880672916987322, 'samples': 3190080, 'steps': 16614, 'loss/train': 1.4086577892303467}} 11/06/2021 23:36:03 - INFO - __main__ - Step 16620: {'lr': 0.00048805919068413574, 'samples': 3191040, 'steps': 16619, 'loss/train': 3.036583185195923}} 11/06/2021 23:36:03 - INFO - __main__ - Step 16620: {'lr': 0.00048805919068413574, 'samples': 3191040, 'steps': 16619, 'loss/train': 3.036583185195923}} 11/06/2021 23:36:07 - INFO - __main__ - Step 16628: {'lr': 0.00048804622348299785, 'samples': 3192576, 'steps': 16627, 'loss/train': 1.4593732357025146} 11/06/2021 23:36:09 - INFO - __main__ - Step 16632: {'lr': 0.0004880397373081666, 'samples': 3193344, 'steps': 16631, 'loss/train': 1.5343999862670898}} 11/06/2021 23:36:11 - INFO - __main__ - Step 16636: {'lr': 0.00048803324941722295, 'samples': 3194112, 'steps': 16635, 'loss/train': 2.057948350906372}} 11/06/2021 23:36:13 - INFO - __main__ - Step 16641: {'lr': 0.0004880251371403313, 'samples': 3195072, 'steps': 16640, 'loss/train': 1.6309278011322021}} 11/06/2021 23:36:15 - INFO - __main__ - Step 16645: {'lr': 0.0004880186453883054, 'samples': 3195840, 'steps': 16644, 'loss/train': 1.6673327684402466}} 11/06/2021 23:36:15 - INFO - __main__ - Step 16645: {'lr': 0.0004880186453883054, 'samples': 3195840, 'steps': 16644, 'loss/train': 1.6673327684402466}} 11/06/2021 23:36:19 - INFO - __main__ - Step 16652: {'lr': 0.0004880072806932585, 'samples': 3197184, 'steps': 16651, 'loss/train': 1.279304027557373}}} 11/06/2021 23:36:19 - INFO - __main__ - Step 16652: {'lr': 0.0004880072806932585, 'samples': 3197184, 'steps': 16651, 'loss/train': 1.279304027557373}}} 11/06/2021 23:36:23 - INFO - __main__ - Step 16660: {'lr': 0.00048799428603581786, 'samples': 3198720, 'steps': 16659, 'loss/train': 1.5537936687469482} 11/06/2021 23:36:24 - INFO - __main__ - Step 16664: {'lr': 0.0004879877861333969, 'samples': 3199488, 'steps': 16663, 'loss/train': 1.791337013244629}2} 11/06/2021 23:36:26 - INFO - __main__ - Step 16668: {'lr': 0.0004879812845152379, 'samples': 3200256, 'steps': 16667, 'loss/train': 1.8223564624786377}} 11/06/2021 23:36:29 - INFO - __main__ - Step 16673: {'lr': 0.000487973155079854, 'samples': 3201216, 'steps': 16672, 'loss/train': 1.7812671661376953}}} 11/06/2021 23:36:31 - INFO - __main__ - Step 16677: {'lr': 0.00048796664960145596, 'samples': 3201984, 'steps': 16676, 'loss/train': 1.4314196109771729} 11/06/2021 23:36:33 - INFO - __main__ - Step 16681: {'lr': 0.00048796014240747227, 'samples': 3202752, 'steps': 16680, 'loss/train': 2.0574045181274414} 11/06/2021 23:36:33 - INFO - __main__ - Step 16681: {'lr': 0.00048796014240747227, 'samples': 3202752, 'steps': 16680, 'loss/train': 2.0574045181274414} 11/06/2021 23:36:36 - INFO - __main__ - Step 16688: {'lr': 0.0004879487506900141, 'samples': 3204096, 'steps': 16687, 'loss/train': 1.1915156841278076}} 11/06/2021 23:36:39 - INFO - __main__ - Step 16694: {'lr': 0.0004879389821793294, 'samples': 3205248, 'steps': 16693, 'loss/train': 1.6574643850326538}} 11/06/2021 23:36:39 - INFO - __main__ - Step 16694: {'lr': 0.0004879389821793294, 'samples': 3205248, 'steps': 16693, 'loss/train': 1.6574643850326538}} 11/06/2021 23:36:42 - INFO - __main__ - Step 16701: {'lr': 0.00048792758070541234, 'samples': 3206592, 'steps': 16700, 'loss/train': 1.749446153640747}} 11/06/2021 23:36:45 - INFO - __main__ - Step 16706: {'lr': 0.0004879194335792619, 'samples': 3207552, 'steps': 16705, 'loss/train': 1.3388835191726685}} 11/06/2021 23:36:47 - INFO - __main__ - Step 16710: {'lr': 0.00048791291394868644, 'samples': 3208320, 'steps': 16709, 'loss/train': 2.0529398918151855} 11/06/2021 23:36:49 - INFO - __main__ - Step 16714: {'lr': 0.0004879063926029127, 'samples': 3209088, 'steps': 16713, 'loss/train': 1.708368182182312}5} 11/06/2021 23:36:51 - INFO - __main__ - Step 16718: {'lr': 0.0004878998695419877, 'samples': 3209856, 'steps': 16717, 'loss/train': 1.689518690109253}5} 11/06/2021 23:36:53 - INFO - __main__ - Step 16722: {'lr': 0.0004878933447659587, 'samples': 3210624, 'steps': 16721, 'loss/train': 2.208752155303955}5} 11/06/2021 23:36:55 - INFO - __main__ - Step 16727: {'lr': 0.0004878851863841287, 'samples': 3211584, 'steps': 16726, 'loss/train': 1.4157750606536865}} 11/06/2021 23:36:57 - INFO - __main__ - Step 16731: {'lr': 0.0004878786577492873, 'samples': 3212352, 'steps': 16730, 'loss/train': 2.0975825786590576}} 11/06/2021 23:36:57 - INFO - __main__ - Step 16731: {'lr': 0.0004878786577492873, 'samples': 3212352, 'steps': 16730, 'loss/train': 2.0975825786590576}} 11/06/2021 23:37:00 - INFO - __main__ - Step 16738: {'lr': 0.0004878672285117417, 'samples': 3213696, 'steps': 16737, 'loss/train': 1.9334702491760254}} 11/06/2021 23:37:02 - INFO - __main__ - Step 16742: {'lr': 0.0004878606951608976, 'samples': 3214464, 'steps': 16741, 'loss/train': 2.05916428565979}4}} 11/06/2021 23:37:02 - INFO - __main__ - Step 16742: {'lr': 0.0004878606951608976, 'samples': 3214464, 'steps': 16741, 'loss/train': 2.05916428565979}4}} 11/06/2021 23:37:07 - INFO - __main__ - Step 16750: {'lr': 0.0004878476233147914, 'samples': 3216000, 'steps': 16749, 'loss/train': 1.612733006477356}}} 11/06/2021 23:37:09 - INFO - __main__ - Step 16754: {'lr': 0.00048784108481962347, 'samples': 3216768, 'steps': 16753, 'loss/train': 1.4702725410461426} 11/06/2021 23:37:11 - INFO - __main__ - Step 16759: {'lr': 0.00048783290928939985, 'samples': 3217728, 'steps': 16758, 'loss/train': 1.8196167945861816} 11/06/2021 23:37:14 - INFO - __main__ - Step 16763: {'lr': 0.00048782636693626736, 'samples': 3218496, 'steps': 16762, 'loss/train': 1.4822304248809814} 11/06/2021 23:37:15 - INFO - __main__ - Step 16767: {'lr': 0.0004878198228685607, 'samples': 3219264, 'steps': 16766, 'loss/train': 0.8852767944335938}} 11/06/2021 23:37:17 - INFO - __main__ - Step 16771: {'lr': 0.00048781327708632695, 'samples': 3220032, 'steps': 16770, 'loss/train': 1.9142979383468628} 11/06/2021 23:37:17 - INFO - __main__ - Step 16771: {'lr': 0.00048781327708632695, 'samples': 3220032, 'steps': 16770, 'loss/train': 1.9142979383468628} 11/06/2021 23:37:21 - INFO - __main__ - Step 16779: {'lr': 0.0004878001803784669, 'samples': 3221568, 'steps': 16778, 'loss/train': 1.6661590337753296}} 11/06/2021 23:37:23 - INFO - __main__ - Step 16783: {'lr': 0.0004877936294529351, 'samples': 3222336, 'steps': 16782, 'loss/train': 2.207812786102295}}} 11/06/2021 23:37:25 - INFO - __main__ - Step 16787: {'lr': 0.0004877870768130651, 'samples': 3223104, 'steps': 16786, 'loss/train': 1.743882417678833}}} 11/06/2021 23:37:27 - INFO - __main__ - Step 16791: {'lr': 0.00048778052245890404, 'samples': 3223872, 'steps': 16790, 'loss/train': 1.9870336055755615} 11/06/2021 23:37:29 - INFO - __main__ - Step 16796: {'lr': 0.00048777232710555296, 'samples': 3224832, 'steps': 16795, 'loss/train': 0.9855947494506836} 11/06/2021 23:37:32 - INFO - __main__ - Step 16800: {'lr': 0.0004877657688944099, 'samples': 3225600, 'steps': 16799, 'loss/train': 1.6870945692062378}} 11/06/2021 23:37:32 - INFO - __main__ - Step 16800: {'lr': 0.0004877657688944099, 'samples': 3225600, 'steps': 16799, 'loss/train': 1.6870945692062378}} 11/06/2021 23:37:35 - INFO - __main__ - Step 16807: {'lr': 0.0004877542879002951, 'samples': 3226944, 'steps': 16806, 'loss/train': 1.4899168014526367}} 11/06/2021 23:37:35 - INFO - __main__ - Step 16807: {'lr': 0.0004877542879002951, 'samples': 3226944, 'steps': 16806, 'loss/train': 1.4899168014526367}} 11/06/2021 23:37:38 - INFO - __main__ - Step 16815: {'lr': 0.00048774116033647373, 'samples': 3228480, 'steps': 16814, 'loss/train': 1.6762943267822266} 11/06/2021 23:37:41 - INFO - __main__ - Step 16819: {'lr': 0.0004877345939835995, 'samples': 3229248, 'steps': 16818, 'loss/train': 1.668867826461792}6} 11/06/2021 23:37:43 - INFO - __main__ - Step 16824: {'lr': 0.0004877263836323226, 'samples': 3230208, 'steps': 16823, 'loss/train': 1.8889883756637573}} 11/06/2021 23:37:43 - INFO - __main__ - Step 16824: {'lr': 0.0004877263836323226, 'samples': 3230208, 'steps': 16823, 'loss/train': 1.8889883756637573}} 11/06/2021 23:37:47 - INFO - __main__ - Step 16831: {'lr': 0.0004877148846416903, 'samples': 3231552, 'steps': 16830, 'loss/train': 1.4326417446136475}} 11/06/2021 23:37:49 - INFO - __main__ - Step 16835: {'lr': 0.0004877083114334496, 'samples': 3232320, 'steps': 16834, 'loss/train': 1.6590317487716675}} 11/06/2021 23:37:51 - INFO - __main__ - Step 16840: {'lr': 0.0004877000925132312, 'samples': 3233280, 'steps': 16839, 'loss/train': 2.102421760559082}}} 11/06/2021 23:37:51 - INFO - __main__ - Step 16840: {'lr': 0.0004877000925132312, 'samples': 3233280, 'steps': 16839, 'loss/train': 2.102421760559082}}} 11/06/2021 23:37:55 - INFO - __main__ - Step 16848: {'lr': 0.0004876869366715125, 'samples': 3234816, 'steps': 16847, 'loss/train': 1.9071651697158813}} 11/06/2021 23:37:57 - INFO - __main__ - Step 16852: {'lr': 0.00048768035618027597, 'samples': 3235584, 'steps': 16851, 'loss/train': 1.7523447275161743} 11/06/2021 23:37:59 - INFO - __main__ - Step 16856: {'lr': 0.00048767377397551773, 'samples': 3236352, 'steps': 16855, 'loss/train': 2.0163164138793945} 11/06/2021 23:38:01 - INFO - __main__ - Step 16861: {'lr': 0.0004876655438100024, 'samples': 3237312, 'steps': 16860, 'loss/train': 1.7778635025024414}} 11/06/2021 23:38:04 - INFO - __main__ - Step 16866: {'lr': 0.0004876573109672765, 'samples': 3238272, 'steps': 16865, 'loss/train': 1.67001473903656}4}} 11/06/2021 23:38:06 - INFO - __main__ - Step 16870: {'lr': 0.0004876507227655664, 'samples': 3239040, 'steps': 16869, 'loss/train': 1.5355979204177856}} 11/06/2021 23:38:08 - INFO - __main__ - Step 16874: {'lr': 0.0004876441328505483, 'samples': 3239808, 'steps': 16873, 'loss/train': 1.5508947372436523}} 11/06/2021 23:38:08 - INFO - __main__ - Step 16874: {'lr': 0.0004876441328505483, 'samples': 3239808, 'steps': 16873, 'loss/train': 1.5508947372436523}} 11/06/2021 23:38:11 - INFO - __main__ - Step 16881: {'lr': 0.00048763259637676226, 'samples': 3241152, 'steps': 16880, 'loss/train': 1.099325180053711}} 11/06/2021 23:38:14 - INFO - __main__ - Step 16887: {'lr': 0.0004876227037947807, 'samples': 3242304, 'steps': 16886, 'loss/train': 1.3176175355911255}} 11/06/2021 23:38:16 - INFO - __main__ - Step 16891: {'lr': 0.00048761610659873387, 'samples': 3243072, 'steps': 16890, 'loss/train': 1.4026546478271484} 11/06/2021 23:38:18 - INFO - __main__ - Step 16895: {'lr': 0.00048760950768962863, 'samples': 3243840, 'steps': 16894, 'loss/train': 1.6721173524856567} 11/06/2021 23:38:19 - INFO - __main__ - Step 16899: {'lr': 0.0004876029070675126, 'samples': 3244608, 'steps': 16898, 'loss/train': 1.2345243692398071}} 11/06/2021 23:38:21 - INFO - __main__ - Step 16903: {'lr': 0.00048759630473243327, 'samples': 3245376, 'steps': 16902, 'loss/train': 1.6020948886871338} 11/06/2021 23:38:21 - INFO - __main__ - Step 16903: {'lr': 0.00048759630473243327, 'samples': 3245376, 'steps': 16902, 'loss/train': 1.6020948886871338} 11/06/2021 23:38:26 - INFO - __main__ - Step 16911: {'lr': 0.00048758309492357533, 'samples': 3246912, 'steps': 16910, 'loss/train': 1.5082074403762817} 11/06/2021 23:38:28 - INFO - __main__ - Step 16915: {'lr': 0.0004875764874498919, 'samples': 3247680, 'steps': 16914, 'loss/train': 1.2739442586898804}} 11/06/2021 23:38:29 - INFO - __main__ - Step 16919: {'lr': 0.0004875698782634357, 'samples': 3248448, 'steps': 16918, 'loss/train': 1.927436113357544}}} 11/06/2021 23:38:31 - INFO - __main__ - Step 16923: {'lr': 0.00048756326736425427, 'samples': 3249216, 'steps': 16922, 'loss/train': 1.6660298109054565} 11/06/2021 23:38:31 - INFO - __main__ - Step 16923: {'lr': 0.00048756326736425427, 'samples': 3249216, 'steps': 16922, 'loss/train': 1.6660298109054565} 11/06/2021 23:38:35 - INFO - __main__ - Step 16931: {'lr': 0.00048755004042790685, 'samples': 3250752, 'steps': 16930, 'loss/train': 1.3983508348464966} 11/06/2021 23:38:37 - INFO - __main__ - Step 16935: {'lr': 0.0004875434243908361, 'samples': 3251520, 'steps': 16934, 'loss/train': 1.8356921672821045}} 11/06/2021 23:38:39 - INFO - __main__ - Step 16939: {'lr': 0.0004875368066412309, 'samples': 3252288, 'steps': 16938, 'loss/train': 1.4638780355453491}} 11/06/2021 23:38:41 - INFO - __main__ - Step 16944: {'lr': 0.00048752853204604555, 'samples': 3253248, 'steps': 16943, 'loss/train': 1.238154649734497}} 11/06/2021 23:38:41 - INFO - __main__ - Step 16944: {'lr': 0.00048752853204604555, 'samples': 3253248, 'steps': 16943, 'loss/train': 1.238154649734497}} 11/06/2021 23:38:45 - INFO - __main__ - Step 16952: {'lr': 0.0004875152871283999, 'samples': 3254784, 'steps': 16951, 'loss/train': 0.9848697185516357}} 11/06/2021 23:38:47 - INFO - __main__ - Step 16956: {'lr': 0.00048750866210105583, 'samples': 3255552, 'steps': 16955, 'loss/train': 1.563040018081665}} 11/06/2021 23:38:49 - INFO - __main__ - Step 16960: {'lr': 0.0004875020353614279, 'samples': 3256320, 'steps': 16959, 'loss/train': 1.5973029136657715}} 11/06/2021 23:38:51 - INFO - __main__ - Step 16965: {'lr': 0.00048749374952906677, 'samples': 3257280, 'steps': 16964, 'loss/train': 1.4671553373336792} 11/06/2021 23:38:54 - INFO - __main__ - Step 16970: {'lr': 0.0004874854610214301, 'samples': 3258240, 'steps': 16969, 'loss/train': 1.7520792484283447}} 11/06/2021 23:38:54 - INFO - __main__ - Step 16970: {'lr': 0.0004874854610214301, 'samples': 3258240, 'steps': 16969, 'loss/train': 1.7520792484283447}} 11/06/2021 23:38:57 - INFO - __main__ - Step 16977: {'lr': 0.00048747385261645377, 'samples': 3259584, 'steps': 16976, 'loss/train': 1.7434256076812744} 11/06/2021 23:38:59 - INFO - __main__ - Step 16981: {'lr': 0.00048746721688812004, 'samples': 3260352, 'steps': 16980, 'loss/train': 1.5293978452682495} 11/06/2021 23:39:01 - INFO - __main__ - Step 16986: {'lr': 0.0004874589198202294, 'samples': 3261312, 'steps': 16985, 'loss/train': 1.3058313131332397}} 11/06/2021 23:39:01 - INFO - __main__ - Step 16986: {'lr': 0.0004874589198202294, 'samples': 3261312, 'steps': 16985, 'loss/train': 1.3058313131332397}} 11/06/2021 23:39:06 - INFO - __main__ - Step 16994: {'lr': 0.0004874456389478865, 'samples': 3262848, 'steps': 16993, 'loss/train': 1.9466887712478638}} 11/06/2021 23:39:07 - INFO - __main__ - Step 16998: {'lr': 0.0004874389959439469, 'samples': 3263616, 'steps': 16997, 'loss/train': 1.821245789527893}}} 11/06/2021 23:39:09 - INFO - __main__ - Step 17002: {'lr': 0.0004874323512282258, 'samples': 3264384, 'steps': 17001, 'loss/train': 1.7762051820755005}} 11/06/2021 23:39:12 - INFO - __main__ - Step 17007: {'lr': 0.0004874240429264545, 'samples': 3265344, 'steps': 17006, 'loss/train': 2.030019760131836}}} 11/06/2021 23:39:12 - INFO - __main__ - Step 17007: {'lr': 0.0004874240429264545, 'samples': 3265344, 'steps': 17006, 'loss/train': 2.030019760131836}}} 11/06/2021 23:39:16 - INFO - __main__ - Step 17014: {'lr': 0.0004874124068108521, 'samples': 3266688, 'steps': 17013, 'loss/train': 1.6101415157318115}} 11/06/2021 23:39:17 - INFO - __main__ - Step 17018: {'lr': 0.0004874057552484839, 'samples': 3267456, 'steps': 17017, 'loss/train': 1.9977906942367554}} 11/06/2021 23:39:20 - INFO - __main__ - Step 17023: {'lr': 0.00048739743838867344, 'samples': 3268416, 'steps': 17022, 'loss/train': 1.543113112449646}} 11/06/2021 23:39:22 - INFO - __main__ - Step 17028: {'lr': 0.00048738911885467243, 'samples': 3269376, 'steps': 17027, 'loss/train': 1.5371849536895752} 11/06/2021 23:39:22 - INFO - __main__ - Step 17028: {'lr': 0.00048738911885467243, 'samples': 3269376, 'steps': 17027, 'loss/train': 1.5371849536895752} 11/06/2021 23:39:26 - INFO - __main__ - Step 17035: {'lr': 0.00048737746701460927, 'samples': 3270720, 'steps': 17034, 'loss/train': 1.8075716495513916} 11/06/2021 23:39:27 - INFO - __main__ - Step 17039: {'lr': 0.0004873708064671812, 'samples': 3271488, 'steps': 17038, 'loss/train': 1.6623574495315552}} 11/06/2021 23:39:27 - INFO - __main__ - Step 17039: {'lr': 0.0004873708064671812, 'samples': 3271488, 'steps': 17038, 'loss/train': 1.6623574495315552}} 11/06/2021 23:39:32 - INFO - __main__ - Step 17047: {'lr': 0.00048735748023850337, 'samples': 3273024, 'steps': 17046, 'loss/train': 1.769322156906128}} 11/06/2021 23:39:34 - INFO - __main__ - Step 17051: {'lr': 0.0004873508145573495, 'samples': 3273792, 'steps': 17050, 'loss/train': 1.3549208641052246}} 11/06/2021 23:39:35 - INFO - __main__ - Step 17055: {'lr': 0.0004873441471650499, 'samples': 3274560, 'steps': 17054, 'loss/train': 1.7989593744277954}} 11/06/2021 23:39:38 - INFO - __main__ - Step 17060: {'lr': 0.00048733581051844976, 'samples': 3275520, 'steps': 17059, 'loss/train': 0.27972447872161865} 11/06/2021 23:39:38 - INFO - __main__ - Step 17060: {'lr': 0.00048733581051844976, 'samples': 3275520, 'steps': 17059, 'loss/train': 0.27972447872161865} 11/06/2021 23:39:42 - INFO - __main__ - Step 17067: {'lr': 0.0004873241347217567, 'samples': 3276864, 'steps': 17066, 'loss/train': 2.148686647415161}65} 11/06/2021 23:39:43 - INFO - __main__ - Step 17071: {'lr': 0.0004873174604853546, 'samples': 3277632, 'steps': 17070, 'loss/train': 1.536741018295288}65} 11/06/2021 23:39:45 - INFO - __main__ - Step 17075: {'lr': 0.0004873107845380471, 'samples': 3278400, 'steps': 17074, 'loss/train': 1.820310115814209}65} 11/06/2021 23:39:45 - INFO - __main__ - Step 17075: {'lr': 0.0004873107845380471, 'samples': 3278400, 'steps': 17074, 'loss/train': 1.820310115814209}65} 11/06/2021 23:39:49 - INFO - __main__ - Step 17083: {'lr': 0.0004872974275109085, 'samples': 3279936, 'steps': 17082, 'loss/train': 1.463983178138733}65} 11/06/2021 23:39:51 - INFO - __main__ - Step 17087: {'lr': 0.0004872907464311737, 'samples': 3280704, 'steps': 17086, 'loss/train': 1.4752380847930908}5} 11/06/2021 23:39:54 - INFO - __main__ - Step 17092: {'lr': 0.00048728239267582096, 'samples': 3281664, 'steps': 17091, 'loss/train': 1.1213079690933228}} 11/06/2021 23:39:54 - INFO - __main__ - Step 17092: {'lr': 0.00048728239267582096, 'samples': 3281664, 'steps': 17091, 'loss/train': 1.1213079690933228}} 11/06/2021 23:39:54 - INFO - __main__ - Step 17092: {'lr': 0.00048728239267582096, 'samples': 3281664, 'steps': 17091, 'loss/train': 1.1213079690933228}} 11/06/2021 23:39:59 - INFO - __main__ - Step 17103: {'lr': 0.00048726400500558856, 'samples': 3283776, 'steps': 17102, 'loss/train': 1.9495559930801392}} 11/06/2021 23:40:02 - INFO - __main__ - Step 17108: {'lr': 0.0004872556426973044, 'samples': 3284736, 'steps': 17107, 'loss/train': 1.5045108795166016}}} 11/06/2021 23:40:02 - INFO - __main__ - Step 17108: {'lr': 0.0004872556426973044, 'samples': 3284736, 'steps': 17107, 'loss/train': 1.5045108795166016}}} 11/06/2021 23:40:02 - INFO - __main__ - Step 17108: {'lr': 0.0004872556426973044, 'samples': 3284736, 'steps': 17107, 'loss/train': 1.5045108795166016}}} 11/06/2021 23:40:07 - INFO - __main__ - Step 17119: {'lr': 0.0004872372362116838, 'samples': 3286848, 'steps': 17118, 'loss/train': 1.6649097204208374}}} 11/06/2021 23:40:10 - INFO - __main__ - Step 17124: {'lr': 0.0004872288653514329, 'samples': 3287808, 'steps': 17123, 'loss/train': 1.8238401412963867}}} 11/06/2021 23:40:12 - INFO - __main__ - Step 17129: {'lr': 0.00048722049181889037, 'samples': 3288768, 'steps': 17128, 'loss/train': 1.9449520111083984}} 11/06/2021 23:40:14 - INFO - __main__ - Step 17133: {'lr': 0.00048721379106886976, 'samples': 3289536, 'steps': 17132, 'loss/train': 1.65146005153656}4}} 11/06/2021 23:40:14 - INFO - __main__ - Step 17133: {'lr': 0.00048721379106886976, 'samples': 3289536, 'steps': 17132, 'loss/train': 1.65146005153656}4}} 11/06/2021 23:40:17 - INFO - __main__ - Step 17140: {'lr': 0.00048720206064129516, 'samples': 3290880, 'steps': 17139, 'loss/train': 1.6057360172271729}} 11/06/2021 23:40:20 - INFO - __main__ - Step 17145: {'lr': 0.0004871936785580533, 'samples': 3291840, 'steps': 17144, 'loss/train': 1.0786106586456299}}} 11/06/2021 23:40:20 - INFO - __main__ - Step 17145: {'lr': 0.0004871936785580533, 'samples': 3291840, 'steps': 17144, 'loss/train': 1.0786106586456299}}} 11/06/2021 23:40:24 - INFO - __main__ - Step 17151: {'lr': 0.00048718361653126975, 'samples': 3292992, 'steps': 17150, 'loss/train': 1.692865252494812}}} 11/06/2021 23:40:26 - INFO - __main__ - Step 17156: {'lr': 0.000487175228569983, 'samples': 3293952, 'steps': 17155, 'loss/train': 1.9590970277786255}}}} 11/06/2021 23:40:28 - INFO - __main__ - Step 17160: {'lr': 0.00048716851627733404, 'samples': 3294720, 'steps': 17159, 'loss/train': 1.938184142112732}}} 11/06/2021 23:40:29 - INFO - __main__ - Step 17164: {'lr': 0.00048716180227485365, 'samples': 3295488, 'steps': 17163, 'loss/train': 1.6158461570739746}} 11/06/2021 23:40:32 - INFO - __main__ - Step 17168: {'lr': 0.00048715508656259, 'samples': 3296256, 'steps': 17167, 'loss/train': 1.054592490196228}746}} 11/06/2021 23:40:34 - INFO - __main__ - Step 17172: {'lr': 0.0004871483691405916, 'samples': 3297024, 'steps': 17171, 'loss/train': 1.7745710611343384}}} 11/06/2021 23:40:36 - INFO - __main__ - Step 17176: {'lr': 0.00048714165000890685, 'samples': 3297792, 'steps': 17175, 'loss/train': 1.890425682067871}}} 11/06/2021 23:40:37 - INFO - __main__ - Step 17180: {'lr': 0.00048713492916758425, 'samples': 3298560, 'steps': 17179, 'loss/train': 1.000144124031067}}} 11/06/2021 23:40:40 - INFO - __main__ - Step 17184: {'lr': 0.00048712820661667215, 'samples': 3299328, 'steps': 17183, 'loss/train': 1.9780511856079102}} 11/06/2021 23:40:42 - INFO - __main__ - Step 17189: {'lr': 0.0004871198010239958, 'samples': 3300288, 'steps': 17188, 'loss/train': 1.8698750734329224}}} 11/06/2021 23:40:42 - INFO - __main__ - Step 17189: {'lr': 0.0004871198010239958, 'samples': 3300288, 'steps': 17188, 'loss/train': 1.8698750734329224}}} 11/06/2021 23:40:46 - INFO - __main__ - Step 17197: {'lr': 0.00048710634651994176, 'samples': 3301824, 'steps': 17196, 'loss/train': 1.7191940546035767}} 11/06/2021 23:40:48 - INFO - __main__ - Step 17201: {'lr': 0.0004870996167038154, 'samples': 3302592, 'steps': 17200, 'loss/train': 1.7400089502334595}}} 11/06/2021 23:40:50 - INFO - __main__ - Step 17205: {'lr': 0.0004870928851783543, 'samples': 3303360, 'steps': 17204, 'loss/train': 1.7238097190856934}}} 11/06/2021 23:40:52 - INFO - __main__ - Step 17210: {'lr': 0.0004870844683678496, 'samples': 3304320, 'steps': 17209, 'loss/train': 1.492447018623352}}}} 11/06/2021 23:40:54 - INFO - __main__ - Step 17215: {'lr': 0.00048707604888667983, 'samples': 3305280, 'steps': 17214, 'loss/train': 0.9866694211959839}} 11/06/2021 23:40:54 - INFO - __main__ - Step 17215: {'lr': 0.00048707604888667983, 'samples': 3305280, 'steps': 17214, 'loss/train': 0.9866694211959839}} 11/06/2021 23:40:58 - INFO - __main__ - Step 17222: {'lr': 0.0004870642571265054, 'samples': 3306624, 'steps': 17221, 'loss/train': 1.514723539352417}9}} 11/06/2021 23:41:00 - INFO - __main__ - Step 17226: {'lr': 0.0004870575166278327, 'samples': 3307392, 'steps': 17225, 'loss/train': 1.4333207607269287}}} 11/06/2021 23:41:02 - INFO - __main__ - Step 17231: {'lr': 0.0004870490886011723, 'samples': 3308352, 'steps': 17230, 'loss/train': 1.821036696434021}}}} 11/06/2021 23:41:02 - INFO - __main__ - Step 17231: {'lr': 0.0004870490886011723, 'samples': 3308352, 'steps': 17230, 'loss/train': 1.821036696434021}}}} 11/06/2021 23:41:06 - INFO - __main__ - Step 17239: {'lr': 0.00048703559820440054, 'samples': 3309888, 'steps': 17238, 'loss/train': 1.221914291381836}}} 11/06/2021 23:41:08 - INFO - __main__ - Step 17243: {'lr': 0.0004870288504426804, 'samples': 3310656, 'steps': 17242, 'loss/train': 1.3708194494247437}}} 11/06/2021 23:41:10 - INFO - __main__ - Step 17247: {'lr': 0.0004870221009721356, 'samples': 3311424, 'steps': 17246, 'loss/train': 2.0922882556915283}}} 11/06/2021 23:41:12 - INFO - __main__ - Step 17251: {'lr': 0.0004870153497928147, 'samples': 3312192, 'steps': 17250, 'loss/train': 1.8932836055755615}}} 11/06/2021 23:41:14 - INFO - __main__ - Step 17255: {'lr': 0.0004870085969047665, 'samples': 3312960, 'steps': 17254, 'loss/train': 1.5786818265914917}}} 11/06/2021 23:41:16 - INFO - __main__ - Step 17259: {'lr': 0.0004870018423080397, 'samples': 3313728, 'steps': 17258, 'loss/train': 2.0775022506713867}}} 11/06/2021 23:41:18 - INFO - __main__ - Step 17263: {'lr': 0.00048699508600268284, 'samples': 3314496, 'steps': 17262, 'loss/train': 1.1747856140136719}} 11/06/2021 23:41:20 - INFO - __main__ - Step 17267: {'lr': 0.00048698832798874477, 'samples': 3315264, 'steps': 17266, 'loss/train': 1.6333647966384888}} 11/06/2021 23:41:20 - INFO - __main__ - Step 17267: {'lr': 0.00048698832798874477, 'samples': 3315264, 'steps': 17266, 'loss/train': 1.6333647966384888}} 11/06/2021 23:41:24 - INFO - __main__ - Step 17274: {'lr': 0.000486976497353226, 'samples': 3316608, 'steps': 17273, 'loss/train': 1.683788537979126}88}} 11/06/2021 23:41:26 - INFO - __main__ - Step 17279: {'lr': 0.00048696804369593023, 'samples': 3317568, 'steps': 17278, 'loss/train': 1.604174017906189}}} 11/06/2021 23:41:28 - INFO - __main__ - Step 17283: {'lr': 0.0004869612788481544, 'samples': 3318336, 'steps': 17282, 'loss/train': 1.6801536083221436}}} 11/06/2021 23:41:28 - INFO - __main__ - Step 17283: {'lr': 0.0004869612788481544, 'samples': 3318336, 'steps': 17282, 'loss/train': 1.6801536083221436}}} 11/06/2021 23:41:32 - INFO - __main__ - Step 17290: {'lr': 0.000486949436253889, 'samples': 3319680, 'steps': 17289, 'loss/train': 1.3601343631744385}}}} 11/06/2021 23:41:34 - INFO - __main__ - Step 17295: {'lr': 0.00048694097405499703, 'samples': 3320640, 'steps': 17294, 'loss/train': 1.6262609958648682}} 11/06/2021 23:41:36 - INFO - __main__ - Step 17300: {'lr': 0.00048693250918705643, 'samples': 3321600, 'steps': 17299, 'loss/train': 1.0434212684631348}} 11/06/2021 23:41:38 - INFO - __main__ - Step 17304: {'lr': 0.000486925735371053, 'samples': 3322368, 'steps': 17303, 'loss/train': 1.3537814617156982}8}} 11/06/2021 23:41:40 - INFO - __main__ - Step 17308: {'lr': 0.0004869189598469683, 'samples': 3323136, 'steps': 17307, 'loss/train': 1.5620085000991821}}} 11/06/2021 23:41:42 - INFO - __main__ - Step 17312: {'lr': 0.00048691218261485113, 'samples': 3323904, 'steps': 17311, 'loss/train': 1.9944758415222168}} 11/06/2021 23:41:44 - INFO - __main__ - Step 17316: {'lr': 0.00048690540367475046, 'samples': 3324672, 'steps': 17315, 'loss/train': 1.5609761476516724}} 11/06/2021 23:41:46 - INFO - __main__ - Step 17320: {'lr': 0.00048689862302671495, 'samples': 3325440, 'steps': 17319, 'loss/train': 1.8871047496795654}} 11/06/2021 23:41:48 - INFO - __main__ - Step 17325: {'lr': 0.00048689014481496197, 'samples': 3326400, 'steps': 17324, 'loss/train': 1.9357455968856812}} 11/06/2021 23:41:48 - INFO - __main__ - Step 17325: {'lr': 0.00048689014481496197, 'samples': 3326400, 'steps': 17324, 'loss/train': 1.9357455968856812}} 11/06/2021 23:41:52 - INFO - __main__ - Step 17332: {'lr': 0.0004868782708354893, 'samples': 3327744, 'steps': 17331, 'loss/train': 1.734197735786438}2}} 11/06/2021 23:41:55 - INFO - __main__ - Step 17337: {'lr': 0.00048686978621955416, 'samples': 3328704, 'steps': 17336, 'loss/train': 1.2255845069885254}} 11/06/2021 23:41:55 - INFO - __main__ - Step 17337: {'lr': 0.00048686978621955416, 'samples': 3328704, 'steps': 17336, 'loss/train': 1.2255845069885254}} 11/06/2021 23:41:58 - INFO - __main__ - Step 17344: {'lr': 0.00048685790327461184, 'samples': 3330048, 'steps': 17343, 'loss/train': 1.853256106376648}}} 11/06/2021 23:42:00 - INFO - __main__ - Step 17348: {'lr': 0.00048685111067240283, 'samples': 3330816, 'steps': 17347, 'loss/train': 1.716973900794983}}} 11/06/2021 23:42:00 - INFO - __main__ - Step 17348: {'lr': 0.00048685111067240283, 'samples': 3330816, 'steps': 17347, 'loss/train': 1.716973900794983}}} 11/06/2021 23:42:05 - INFO - __main__ - Step 17357: {'lr': 0.00048683582107430227, 'samples': 3332544, 'steps': 17356, 'loss/train': 1.4887734651565552}} 11/06/2021 23:42:07 - INFO - __main__ - Step 17361: {'lr': 0.00048682902292275667, 'samples': 3333312, 'steps': 17360, 'loss/train': 1.5254342555999756}} 11/06/2021 23:42:08 - INFO - __main__ - Step 17365: {'lr': 0.00048682222306382705, 'samples': 3334080, 'steps': 17364, 'loss/train': 1.5252821445465088}} 11/06/2021 23:42:10 - INFO - __main__ - Step 17369: {'lr': 0.00048681542149756253, 'samples': 3334848, 'steps': 17368, 'loss/train': 1.49398672580719}8}} 11/06/2021 23:42:12 - INFO - __main__ - Step 17374: {'lr': 0.00048680691713886653, 'samples': 3335808, 'steps': 17373, 'loss/train': 1.8448344469070435}} 11/06/2021 23:42:12 - INFO - __main__ - Step 17374: {'lr': 0.00048680691713886653, 'samples': 3335808, 'steps': 17373, 'loss/train': 1.8448344469070435}} 11/06/2021 23:42:17 - INFO - __main__ - Step 17382: {'lr': 0.00048679330461651275, 'samples': 3337344, 'steps': 17381, 'loss/train': 1.6298061609268188}} 11/06/2021 23:42:18 - INFO - __main__ - Step 17386: {'lr': 0.0004867864957946214, 'samples': 3338112, 'steps': 17385, 'loss/train': 1.0904605388641357}}} 11/06/2021 23:42:18 - INFO - __main__ - Step 17386: {'lr': 0.0004867864957946214, 'samples': 3338112, 'steps': 17385, 'loss/train': 1.0904605388641357}}} 11/06/2021 23:42:22 - INFO - __main__ - Step 17394: {'lr': 0.0004867728730296556, 'samples': 3339648, 'steps': 17393, 'loss/train': 1.5756046772003174}}} 11/06/2021 23:42:24 - INFO - __main__ - Step 17398: {'lr': 0.00048676605908667926, 'samples': 3340416, 'steps': 17397, 'loss/train': 1.3067097663879395}} 11/06/2021 23:42:26 - INFO - __main__ - Step 17402: {'lr': 0.0004867592434367728, 'samples': 3341184, 'steps': 17401, 'loss/train': 1.5143498182296753}}} 11/06/2021 23:42:28 - INFO - __main__ - Step 17407: {'lr': 0.00048675072147409405, 'samples': 3342144, 'steps': 17406, 'loss/train': 1.472575068473816}}} 11/06/2021 23:42:30 - INFO - __main__ - Step 17411: {'lr': 0.0004867439019837745, 'samples': 3342912, 'steps': 17410, 'loss/train': 1.6748888492584229}}} 11/06/2021 23:42:30 - INFO - __main__ - Step 17411: {'lr': 0.0004867439019837745, 'samples': 3342912, 'steps': 17410, 'loss/train': 1.6748888492584229}}} 11/06/2021 23:42:34 - INFO - __main__ - Step 17418: {'lr': 0.0004867319637688286, 'samples': 3344256, 'steps': 17417, 'loss/train': 1.8701719045639038}}} 11/06/2021 23:42:36 - INFO - __main__ - Step 17423: {'lr': 0.00048672343327239024, 'samples': 3345216, 'steps': 17422, 'loss/train': 1.4655964374542236}} 11/06/2021 23:42:36 - INFO - __main__ - Step 17423: {'lr': 0.00048672343327239024, 'samples': 3345216, 'steps': 17422, 'loss/train': 1.4655964374542236}} 11/06/2021 23:42:41 - INFO - __main__ - Step 17431: {'lr': 0.0004867097789316046, 'samples': 3346752, 'steps': 17430, 'loss/train': 1.3610199689865112}}} 11/06/2021 23:42:42 - INFO - __main__ - Step 17435: {'lr': 0.00048670294920140063, 'samples': 3347520, 'steps': 17434, 'loss/train': 1.677636981010437}}} 11/06/2021 23:42:44 - INFO - __main__ - Step 17440: {'lr': 0.00048669440963892074, 'samples': 3348480, 'steps': 17439, 'loss/train': 1.8037528991699219}} 11/06/2021 23:42:46 - INFO - __main__ - Step 17444: {'lr': 0.000486687576069217, 'samples': 3349248, 'steps': 17443, 'loss/train': 1.418339729309082}19}} 11/06/2021 23:42:46 - INFO - __main__ - Step 17444: {'lr': 0.000486687576069217, 'samples': 3349248, 'steps': 17443, 'loss/train': 1.418339729309082}19}} 11/06/2021 23:42:50 - INFO - __main__ - Step 17451: {'lr': 0.0004866756132163259, 'samples': 3350592, 'steps': 17450, 'loss/train': 1.5698901414871216}}} 11/06/2021 23:42:50 - INFO - __main__ - Step 17451: {'lr': 0.0004866756132163259, 'samples': 3350592, 'steps': 17450, 'loss/train': 1.5698901414871216}}} 11/06/2021 23:42:55 - INFO - __main__ - Step 17460: {'lr': 0.0004866602247272516, 'samples': 3352320, 'steps': 17459, 'loss/train': 1.7726678848266602}}} 11/06/2021 23:42:55 - INFO - __main__ - Step 17460: {'lr': 0.0004866602247272516, 'samples': 3352320, 'steps': 17459, 'loss/train': 1.7726678848266602}}} 11/06/2021 23:42:58 - INFO - __main__ - Step 17467: {'lr': 0.0004866482499308023, 'samples': 3353664, 'steps': 17466, 'loss/train': 1.636150598526001}}}} 11/06/2021 23:43:00 - INFO - __main__ - Step 17472: {'lr': 0.0004866396933058502, 'samples': 3354624, 'steps': 17471, 'loss/train': 1.5251487493515015}}} 11/06/2021 23:43:03 - INFO - __main__ - Step 17477: {'lr': 0.0004866311340152433, 'samples': 3355584, 'steps': 17476, 'loss/train': 1.4653677940368652}}} 11/06/2021 23:43:05 - INFO - __main__ - Step 17481: {'lr': 0.00048662428466355104, 'samples': 3356352, 'steps': 17480, 'loss/train': 1.3983826637268066}} 11/06/2021 23:43:07 - INFO - __main__ - Step 17485: {'lr': 0.0004866174336059507, 'samples': 3357120, 'steps': 17484, 'loss/train': 1.5222047567367554}}} 11/06/2021 23:43:08 - INFO - __main__ - Step 17489: {'lr': 0.0004866105808424918, 'samples': 3357888, 'steps': 17488, 'loss/train': 1.581131100654602}}}} 11/06/2021 23:43:10 - INFO - __main__ - Step 17493: {'lr': 0.0004866037263732237, 'samples': 3358656, 'steps': 17492, 'loss/train': 1.8439452648162842}}} 11/06/2021 23:43:13 - INFO - __main__ - Step 17498: {'lr': 0.0004865951558879196, 'samples': 3359616, 'steps': 17497, 'loss/train': 1.65242600440979}2}}} 11/06/2021 23:43:15 - INFO - __main__ - Step 17502: {'lr': 0.0004865882975807614, 'samples': 3360384, 'steps': 17501, 'loss/train': 1.3170610666275024}}} 11/06/2021 23:43:17 - INFO - __main__ - Step 17506: {'lr': 0.00048658143756795456, 'samples': 3361152, 'steps': 17505, 'loss/train': 1.6811589002609253}} 11/06/2021 23:43:19 - INFO - __main__ - Step 17510: {'lr': 0.0004865745758495487, 'samples': 3361920, 'steps': 17509, 'loss/train': 1.487194299697876}3}} 11/06/2021 23:43:20 - INFO - __main__ - Step 17514: {'lr': 0.00048656771242559316, 'samples': 3362688, 'steps': 17513, 'loss/train': 1.190946340560913}}} 11/06/2021 23:43:23 - INFO - __main__ - Step 17519: {'lr': 0.0004865591307472949, 'samples': 3363648, 'steps': 17518, 'loss/train': 1.7990758419036865}}} 11/06/2021 23:43:25 - INFO - __main__ - Step 17523: {'lr': 0.0004865522634860335, 'samples': 3364416, 'steps': 17522, 'loss/train': 1.9060850143432617}}} 11/06/2021 23:43:25 - INFO - __main__ - Step 17523: {'lr': 0.0004865522634860335, 'samples': 3364416, 'steps': 17522, 'loss/train': 1.9060850143432617}}} 11/06/2021 23:43:28 - INFO - __main__ - Step 17530: {'lr': 0.0004865402416752642, 'samples': 3365760, 'steps': 17529, 'loss/train': 1.7213939428329468}}} 11/06/2021 23:43:30 - INFO - __main__ - Step 17534: {'lr': 0.00048653336972430297, 'samples': 3366528, 'steps': 17533, 'loss/train': 1.5995358228683472}} 11/06/2021 23:43:33 - INFO - __main__ - Step 17539: {'lr': 0.0004865247773875956, 'samples': 3367488, 'steps': 17538, 'loss/train': 1.8571265935897827}}} 11/06/2021 23:43:35 - INFO - __main__ - Step 17543: {'lr': 0.00048651790159988563, 'samples': 3368256, 'steps': 17542, 'loss/train': 1.915865421295166}}} 11/06/2021 23:43:35 - INFO - __main__ - Step 17543: {'lr': 0.00048651790159988563, 'samples': 3368256, 'steps': 17542, 'loss/train': 1.915865421295166}}} 11/06/2021 23:43:38 - INFO - __main__ - Step 17550: {'lr': 0.0004865058648684273, 'samples': 3369600, 'steps': 17549, 'loss/train': 1.9157060384750366}}} 11/06/2021 23:43:41 - INFO - __main__ - Step 17555: {'lr': 0.0004864972640061077, 'samples': 3370560, 'steps': 17554, 'loss/train': 2.118384838104248}}}} 11/06/2021 23:43:43 - INFO - __main__ - Step 17560: {'lr': 0.00048648866047973756, 'samples': 3371520, 'steps': 17559, 'loss/train': 1.6892518997192383}} 11/06/2021 23:43:43 - INFO - __main__ - Step 17560: {'lr': 0.00048648866047973756, 'samples': 3371520, 'steps': 17559, 'loss/train': 1.6892518997192383}} 11/06/2021 23:43:46 - INFO - __main__ - Step 17567: {'lr': 0.0004864766110673992, 'samples': 3372864, 'steps': 17566, 'loss/train': 1.8728556632995605}}} 11/06/2021 23:43:48 - INFO - __main__ - Step 17571: {'lr': 0.00048646972334474343, 'samples': 3373632, 'steps': 17570, 'loss/train': 1.59146249294281}}}} 11/06/2021 23:43:51 - INFO - __main__ - Step 17576: {'lr': 0.00048646111129406336, 'samples': 3374592, 'steps': 17575, 'loss/train': 1.4696110486984253}} 11/06/2021 23:43:53 - INFO - __main__ - Step 17581: {'lr': 0.00048645249657974007, 'samples': 3375552, 'steps': 17580, 'loss/train': 1.566757082939148}}} 11/06/2021 23:43:55 - INFO - __main__ - Step 17585: {'lr': 0.00048644560289052354, 'samples': 3376320, 'steps': 17584, 'loss/train': 0.6992756128311157}} 11/06/2021 23:43:55 - INFO - __main__ - Step 17585: {'lr': 0.00048644560289052354, 'samples': 3376320, 'steps': 17584, 'loss/train': 0.6992756128311157}} 11/06/2021 23:43:58 - INFO - __main__ - Step 17592: {'lr': 0.00048643353483268306, 'samples': 3377664, 'steps': 17591, 'loss/train': 1.770916223526001}}} 11/06/2021 23:44:01 - INFO - __main__ - Step 17597: {'lr': 0.00048642491159535373, 'samples': 3378624, 'steps': 17596, 'loss/train': 1.6837226152420044}} 11/06/2021 23:44:03 - INFO - __main__ - Step 17602: {'lr': 0.00048641628569478916, 'samples': 3379584, 'steps': 17601, 'loss/train': 1.8393548727035522}} 11/06/2021 23:44:05 - INFO - __main__ - Step 17606: {'lr': 0.00048640938305687315, 'samples': 3380352, 'steps': 17605, 'loss/train': 1.4664735794067383}} 11/06/2021 23:44:07 - INFO - __main__ - Step 17610: {'lr': 0.0004864024787145985, 'samples': 3381120, 'steps': 17609, 'loss/train': 1.6443487405776978}}} 11/06/2021 23:44:07 - INFO - __main__ - Step 17610: {'lr': 0.0004864024787145985, 'samples': 3381120, 'steps': 17609, 'loss/train': 1.6443487405776978}}} 11/06/2021 23:44:10 - INFO - __main__ - Step 17617: {'lr': 0.0004863903920146544, 'samples': 3382464, 'steps': 17616, 'loss/train': 2.09637188911438}8}}} 11/06/2021 23:44:13 - INFO - __main__ - Step 17622: {'lr': 0.00048638175546212, 'samples': 3383424, 'steps': 17621, 'loss/train': 1.2425532341003418}8}}} 11/06/2021 23:44:15 - INFO - __main__ - Step 17627: {'lr': 0.00048637311624683634, 'samples': 3384384, 'steps': 17626, 'loss/train': 1.8352149724960327}} 11/06/2021 23:44:17 - INFO - __main__ - Step 17631: {'lr': 0.00048636620295749533, 'samples': 3385152, 'steps': 17630, 'loss/train': 1.7116636037826538}} 11/06/2021 23:44:17 - INFO - __main__ - Step 17631: {'lr': 0.00048636620295749533, 'samples': 3385152, 'steps': 17630, 'loss/train': 1.7116636037826538}} 11/06/2021 23:44:21 - INFO - __main__ - Step 17638: {'lr': 0.0004863541006008144, 'samples': 3386496, 'steps': 17637, 'loss/train': 1.8092470169067383}}} 11/06/2021 23:44:21 - INFO - __main__ - Step 17638: {'lr': 0.0004863541006008144, 'samples': 3386496, 'steps': 17637, 'loss/train': 1.8092470169067383}}} 11/06/2021 23:44:25 - INFO - __main__ - Step 17646: {'lr': 0.00048634026294620125, 'samples': 3388032, 'steps': 17645, 'loss/train': 1.1031410694122314}} 11/06/2021 23:44:26 - INFO - __main__ - Step 17650: {'lr': 0.00048633334156307907, 'samples': 3388800, 'steps': 17649, 'loss/train': 1.1769516468048096}} 11/06/2021 23:44:28 - INFO - __main__ - Step 17654: {'lr': 0.00048632641847614645, 'samples': 3389568, 'steps': 17653, 'loss/train': 1.3956372737884521}} 11/06/2021 23:44:31 - INFO - __main__ - Step 17659: {'lr': 0.0004863177622215731, 'samples': 3390528, 'steps': 17658, 'loss/train': 1.276356816291809}1}} 11/06/2021 23:44:33 - INFO - __main__ - Step 17663: {'lr': 0.00048631083530124934, 'samples': 3391296, 'steps': 17662, 'loss/train': 1.7655481100082397}} 11/06/2021 23:44:33 - INFO - __main__ - Step 17663: {'lr': 0.00048631083530124934, 'samples': 3391296, 'steps': 17662, 'loss/train': 1.7655481100082397}} 11/06/2021 23:44:36 - INFO - __main__ - Step 17670: {'lr': 0.0004862987090913091, 'samples': 3392640, 'steps': 17669, 'loss/train': 1.4978746175765991}}} 11/06/2021 23:44:39 - INFO - __main__ - Step 17675: {'lr': 0.0004862900443185882, 'samples': 3393600, 'steps': 17674, 'loss/train': 1.705519199371338}}}} 11/06/2021 23:44:41 - INFO - __main__ - Step 17680: {'lr': 0.0004862813768841511, 'samples': 3394560, 'steps': 17679, 'loss/train': 1.2477961778640747}}} 11/06/2021 23:44:41 - INFO - __main__ - Step 17680: {'lr': 0.0004862813768841511, 'samples': 3394560, 'steps': 17679, 'loss/train': 1.2477961778640747}}} 11/06/2021 23:44:45 - INFO - __main__ - Step 17686: {'lr': 0.0004862709724494987, 'samples': 3395712, 'steps': 17685, 'loss/train': 0.83989018201828}7}}} 11/06/2021 23:44:47 - INFO - __main__ - Step 17691: {'lr': 0.00048626229915962974, 'samples': 3396672, 'steps': 17690, 'loss/train': 1.9429171085357666}} 11/06/2021 23:44:49 - INFO - __main__ - Step 17695: {'lr': 0.0004862553586115192, 'samples': 3397440, 'steps': 17694, 'loss/train': 1.4430689811706543}}} 11/06/2021 23:44:50 - INFO - __main__ - Step 17699: {'lr': 0.0004862484163601604, 'samples': 3398208, 'steps': 17698, 'loss/train': 1.2866634130477905}}} 11/06/2021 23:44:53 - INFO - __main__ - Step 17704: {'lr': 0.00048623973615084516, 'samples': 3399168, 'steps': 17703, 'loss/train': 1.6480399370193481}} 11/06/2021 23:44:53 - INFO - __main__ - Step 17704: {'lr': 0.00048623973615084516, 'samples': 3399168, 'steps': 17703, 'loss/train': 1.6480399370193481}} 11/06/2021 23:44:56 - INFO - __main__ - Step 17711: {'lr': 0.00048622757938709466, 'samples': 3400512, 'steps': 17710, 'loss/train': 1.562470555305481}}} 11/06/2021 23:44:59 - INFO - __main__ - Step 17715: {'lr': 0.00048622063032324324, 'samples': 3401280, 'steps': 17714, 'loss/train': 1.4588117599487305}} 11/06/2021 23:45:01 - INFO - __main__ - Step 17720: {'lr': 0.00048621194159859403, 'samples': 3402240, 'steps': 17719, 'loss/train': 1.776808738708496}}} 11/06/2021 23:45:01 - INFO - __main__ - Step 17720: {'lr': 0.00048621194159859403, 'samples': 3402240, 'steps': 17719, 'loss/train': 1.776808738708496}}} 11/06/2021 23:45:04 - INFO - __main__ - Step 17727: {'lr': 0.00048619977291390186, 'samples': 3403584, 'steps': 17726, 'loss/train': 1.5515477657318115}} 11/06/2021 23:45:06 - INFO - __main__ - Step 17731: {'lr': 0.0004861928170383594, 'samples': 3404352, 'steps': 17730, 'loss/train': 2.0993919372558594}}} 11/06/2021 23:45:09 - INFO - __main__ - Step 17736: {'lr': 0.0004861841197993784, 'samples': 3405312, 'steps': 17735, 'loss/train': 1.55830979347229}4}}} 11/06/2021 23:45:11 - INFO - __main__ - Step 17741: {'lr': 0.00048617541989987435, 'samples': 3406272, 'steps': 17740, 'loss/train': 2.2797935009002686}} 11/06/2021 23:45:13 - INFO - __main__ - Step 17745: {'lr': 0.0004861684580647605, 'samples': 3407040, 'steps': 17744, 'loss/train': 1.9319639205932617}}} 11/06/2021 23:45:13 - INFO - __main__ - Step 17745: {'lr': 0.0004861684580647605, 'samples': 3407040, 'steps': 17744, 'loss/train': 1.9319639205932617}}} 11/06/2021 23:45:16 - INFO - __main__ - Step 17752: {'lr': 0.00048615627075640754, 'samples': 3408384, 'steps': 17751, 'loss/train': 1.6517837047576904}} 11/06/2021 23:45:18 - INFO - __main__ - Step 17756: {'lr': 0.0004861493042392045, 'samples': 3409152, 'steps': 17755, 'loss/train': 1.853451132774353}4}} 11/06/2021 23:45:21 - INFO - __main__ - Step 17762: {'lr': 0.0004861388512712586, 'samples': 3410304, 'steps': 17761, 'loss/train': 1.6699413061141968}}} 11/06/2021 23:45:23 - INFO - __main__ - Step 17766: {'lr': 0.00048613188049794045, 'samples': 3411072, 'steps': 17765, 'loss/train': 1.2881523370742798}} 11/06/2021 23:45:25 - INFO - __main__ - Step 17770: {'lr': 0.00048612490802226415, 'samples': 3411840, 'steps': 17769, 'loss/train': 1.651924729347229}}} 11/06/2021 23:45:25 - INFO - __main__ - Step 17770: {'lr': 0.00048612490802226415, 'samples': 3411840, 'steps': 17769, 'loss/train': 1.651924729347229}}} 11/06/2021 23:45:28 - INFO - __main__ - Step 17777: {'lr': 0.00048611270209368264, 'samples': 3413184, 'steps': 17776, 'loss/train': 1.560634732246399}}} 11/06/2021 23:45:31 - INFO - __main__ - Step 17782: {'lr': 0.00048610398038158943, 'samples': 3414144, 'steps': 17781, 'loss/train': 1.194156527519226}}} 11/06/2021 23:45:33 - INFO - __main__ - Step 17787: {'lr': 0.0004860952560098759, 'samples': 3415104, 'steps': 17786, 'loss/train': 2.101297378540039}}}} 11/06/2021 23:45:33 - INFO - __main__ - Step 17787: {'lr': 0.0004860952560098759, 'samples': 3415104, 'steps': 17786, 'loss/train': 2.101297378540039}}}} 11/06/2021 23:45:36 - INFO - __main__ - Step 17793: {'lr': 0.0004860847832532593, 'samples': 3416256, 'steps': 17792, 'loss/train': 1.5424718856811523}}} 11/06/2021 23:45:39 - INFO - __main__ - Step 17799: {'lr': 0.00048607430666710097, 'samples': 3417408, 'steps': 17798, 'loss/train': 1.1966557502746582}} 11/06/2021 23:45:39 - INFO - __main__ - Step 17799: {'lr': 0.00048607430666710097, 'samples': 3417408, 'steps': 17798, 'loss/train': 1.1966557502746582}} 11/06/2021 23:45:43 - INFO - __main__ - Step 17805: {'lr': 0.00048606382625157075, 'samples': 3418560, 'steps': 17804, 'loss/train': 2.160865306854248}}} 11/06/2021 23:45:45 - INFO - __main__ - Step 17811: {'lr': 0.00048605334200683883, 'samples': 3419712, 'steps': 17810, 'loss/train': 1.910631537437439}}} 11/06/2021 23:45:45 - INFO - __main__ - Step 17811: {'lr': 0.00048605334200683883, 'samples': 3419712, 'steps': 17810, 'loss/train': 1.910631537437439}}} 11/06/2021 23:45:45 - INFO - __main__ - Step 17811: {'lr': 0.00048605334200683883, 'samples': 3419712, 'steps': 17810, 'loss/train': 1.910631537437439}}} 11/06/2021 23:45:51 - INFO - __main__ - Step 17821: {'lr': 0.00048603585975674334, 'samples': 3421632, 'steps': 17820, 'loss/train': 0.8659923076629639}} 11/06/2021 23:45:53 - INFO - __main__ - Step 17826: {'lr': 0.000486027114643367, 'samples': 3422592, 'steps': 17825, 'loss/train': 1.745100498199463}39}} 11/06/2021 23:45:53 - INFO - __main__ - Step 17826: {'lr': 0.000486027114643367, 'samples': 3422592, 'steps': 17825, 'loss/train': 1.745100498199463}39}} 11/06/2021 23:45:57 - INFO - __main__ - Step 17834: {'lr': 0.0004860131169317968, 'samples': 3424128, 'steps': 17833, 'loss/train': 1.9183335304260254}}} 11/06/2021 23:45:59 - INFO - __main__ - Step 17838: {'lr': 0.0004860061155237336, 'samples': 3424896, 'steps': 17837, 'loss/train': 1.628971815109253}}}} 11/06/2021 23:46:01 - INFO - __main__ - Step 17842: {'lr': 0.000485999112414219, 'samples': 3425664, 'steps': 17841, 'loss/train': 1.1723436117172241}}}} 11/06/2021 23:46:03 - INFO - __main__ - Step 17847: {'lr': 0.00048599035613473656, 'samples': 3426624, 'steps': 17846, 'loss/train': 1.792570948600769}}} 11/06/2021 23:46:03 - INFO - __main__ - Step 17847: {'lr': 0.00048599035613473656, 'samples': 3426624, 'steps': 17846, 'loss/train': 1.792570948600769}}} 11/06/2021 23:46:07 - INFO - __main__ - Step 17854: {'lr': 0.00048597809287747153, 'samples': 3427968, 'steps': 17853, 'loss/train': 1.4261943101882935}} 11/06/2021 23:46:10 - INFO - __main__ - Step 17859: {'lr': 0.00048596933021813815, 'samples': 3428928, 'steps': 17858, 'loss/train': 1.1924867630004883}} 11/06/2021 23:46:10 - INFO - __main__ - Step 17859: {'lr': 0.00048596933021813815, 'samples': 3428928, 'steps': 17858, 'loss/train': 1.1924867630004883}} 11/06/2021 23:46:13 - INFO - __main__ - Step 17866: {'lr': 0.00048595705802947963, 'samples': 3430272, 'steps': 17865, 'loss/train': 1.4125962257385254}} 11/06/2021 23:46:15 - INFO - __main__ - Step 17870: {'lr': 0.0004859500430112194, 'samples': 3431040, 'steps': 17869, 'loss/train': 1.7616888284683228}}} 11/06/2021 23:46:15 - INFO - __main__ - Step 17870: {'lr': 0.0004859500430112194, 'samples': 3431040, 'steps': 17869, 'loss/train': 1.7616888284683228}}} 11/06/2021 23:46:20 - INFO - __main__ - Step 17878: {'lr': 0.00048593600787160806, 'samples': 3432576, 'steps': 17877, 'loss/train': 1.8523950576782227}} 11/06/2021 23:46:22 - INFO - __main__ - Step 17882: {'lr': 0.0004859289877503581, 'samples': 3433344, 'steps': 17881, 'loss/train': 2.83292293548584}27}} 11/06/2021 23:46:23 - INFO - __main__ - Step 17886: {'lr': 0.0004859219659282127, 'samples': 3434112, 'steps': 17885, 'loss/train': 1.591639518737793}7}} 11/06/2021 23:46:26 - INFO - __main__ - Step 17891: {'lr': 0.00048591318625872403, 'samples': 3435072, 'steps': 17890, 'loss/train': 1.9429192543029785}} 11/06/2021 23:46:28 - INFO - __main__ - Step 17895: {'lr': 0.00048590616060974917, 'samples': 3435840, 'steps': 17894, 'loss/train': 2.1065878868103027}} 11/06/2021 23:46:30 - INFO - __main__ - Step 17899: {'lr': 0.00048589913326004355, 'samples': 3436608, 'steps': 17898, 'loss/train': 1.7949775457382202}} 11/06/2021 23:46:31 - INFO - __main__ - Step 17903: {'lr': 0.00048589210420965775, 'samples': 3437376, 'steps': 17902, 'loss/train': 1.1888090372085571}} 11/06/2021 23:46:33 - INFO - __main__ - Step 17907: {'lr': 0.00048588507345864246, 'samples': 3438144, 'steps': 17906, 'loss/train': 1.5869488716125488}} 11/06/2021 23:46:36 - INFO - __main__ - Step 17912: {'lr': 0.0004858762826284404, 'samples': 3439104, 'steps': 17911, 'loss/train': 1.5381463766098022}}} 11/06/2021 23:46:36 - INFO - __main__ - Step 17912: {'lr': 0.0004858762826284404, 'samples': 3439104, 'steps': 17911, 'loss/train': 1.5381463766098022}}} 11/06/2021 23:46:39 - INFO - __main__ - Step 17917: {'lr': 0.00048586748914118303, 'samples': 3440064, 'steps': 17916, 'loss/train': 1.3594805002212524}} 11/06/2021 23:46:42 - INFO - __main__ - Step 17923: {'lr': 0.0004858569334493006, 'samples': 3441216, 'steps': 17922, 'loss/train': 0.7357732653617859}}} 11/06/2021 23:46:42 - INFO - __main__ - Step 17923: {'lr': 0.0004858569334493006, 'samples': 3441216, 'steps': 17922, 'loss/train': 0.7357732653617859}}} 11/06/2021 23:46:42 - INFO - __main__ - Step 17923: {'lr': 0.0004858569334493006, 'samples': 3441216, 'steps': 17922, 'loss/train': 0.7357732653617859}}} 11/06/2021 23:46:47 - INFO - __main__ - Step 17933: {'lr': 0.00048583933212770154, 'samples': 3443136, 'steps': 17932, 'loss/train': 1.8300704956054688}} 11/06/2021 23:46:49 - INFO - __main__ - Step 17937: {'lr': 0.0004858322886235817, 'samples': 3443904, 'steps': 17936, 'loss/train': 1.5555347204208374}}} 11/06/2021 23:46:51 - INFO - __main__ - Step 17942: {'lr': 0.0004858234818525341, 'samples': 3444864, 'steps': 17941, 'loss/train': 1.5800806283950806}}} 11/06/2021 23:46:54 - INFO - __main__ - Step 17947: {'lr': 0.000485814672425026, 'samples': 3445824, 'steps': 17946, 'loss/train': 1.5922514200210571}}}} 11/06/2021 23:46:56 - INFO - __main__ - Step 17951: {'lr': 0.00048580762297043456, 'samples': 3446592, 'steps': 17950, 'loss/train': 1.3768843412399292}} 11/06/2021 23:46:58 - INFO - __main__ - Step 17955: {'lr': 0.0004858005718158227, 'samples': 3447360, 'steps': 17954, 'loss/train': 1.441080093383789}2}} 11/06/2021 23:46:59 - INFO - __main__ - Step 17959: {'lr': 0.00048579351896124127, 'samples': 3448128, 'steps': 17958, 'loss/train': 1.4108787775039673}} 11/06/2021 23:47:01 - INFO - __main__ - Step 17963: {'lr': 0.00048578646440674113, 'samples': 3448896, 'steps': 17962, 'loss/train': 1.7330888509750366}} 11/06/2021 23:47:01 - INFO - __main__ - Step 17963: {'lr': 0.00048578646440674113, 'samples': 3448896, 'steps': 17962, 'loss/train': 1.7330888509750366}} 11/06/2021 23:47:05 - INFO - __main__ - Step 17969: {'lr': 0.0004857758793877545, 'samples': 3450048, 'steps': 17968, 'loss/train': 1.6525585651397705}}} 11/06/2021 23:47:07 - INFO - __main__ - Step 17974: {'lr': 0.0004857670556170749, 'samples': 3451008, 'steps': 17973, 'loss/train': 1.7699105739593506}}} 11/06/2021 23:47:10 - INFO - __main__ - Step 17979: {'lr': 0.0004857582291905704, 'samples': 3451968, 'steps': 17978, 'loss/train': 1.7393509149551392}}} 11/06/2021 23:47:11 - INFO - __main__ - Step 17983: {'lr': 0.0004857511661372397, 'samples': 3452736, 'steps': 17982, 'loss/train': 1.7817578315734863}}} 11/06/2021 23:47:14 - INFO - __main__ - Step 17987: {'lr': 0.0004857441013842956, 'samples': 3453504, 'steps': 17986, 'loss/train': 1.843093991279602}}}} 11/06/2021 23:47:15 - INFO - __main__ - Step 17991: {'lr': 0.000485737034931789, 'samples': 3454272, 'steps': 17990, 'loss/train': 1.8824354410171509}}}} 11/06/2021 23:47:17 - INFO - __main__ - Step 17995: {'lr': 0.0004857299667797709, 'samples': 3455040, 'steps': 17994, 'loss/train': 1.633614420890808}}}} 11/06/2021 23:47:19 - INFO - __main__ - Step 17999: {'lr': 0.00048572289692829217, 'samples': 3455808, 'steps': 17998, 'loss/train': 0.954354465007782}}} 11/06/2021 23:47:19 - INFO - __main__ - Step 17999: {'lr': 0.00048572289692829217, 'samples': 3455808, 'steps': 17998, 'loss/train': 0.954354465007782}}} 11/06/2021 23:47:24 - INFO - __main__ - Step 18007: {'lr': 0.00048570875212715706, 'samples': 3457344, 'steps': 18006, 'loss/train': 1.4929587841033936}} 11/06/2021 23:47:25 - INFO - __main__ - Step 18011: {'lr': 0.0004857016771776025, 'samples': 3458112, 'steps': 18010, 'loss/train': 1.862199068069458}6}} 11/06/2021 23:47:28 - INFO - __main__ - Step 18015: {'lr': 0.00048569460052879136, 'samples': 3458880, 'steps': 18014, 'loss/train': 1.917330265045166}}} 11/06/2021 23:47:30 - INFO - __main__ - Step 18019: {'lr': 0.0004856875221807746, 'samples': 3459648, 'steps': 18018, 'loss/train': 1.3662374019622803}}} 11/06/2021 23:47:31 - INFO - __main__ - Step 18023: {'lr': 0.0004856804421336033, 'samples': 3460416, 'steps': 18022, 'loss/train': 1.4246373176574707}}} 11/06/2021 23:47:33 - INFO - __main__ - Step 18027: {'lr': 0.00048567336038732843, 'samples': 3461184, 'steps': 18026, 'loss/train': 1.299382209777832}}} 11/06/2021 23:47:36 - INFO - __main__ - Step 18032: {'lr': 0.0004856645058151984, 'samples': 3462144, 'steps': 18031, 'loss/train': 1.7649421691894531}}} 11/06/2021 23:47:36 - INFO - __main__ - Step 18032: {'lr': 0.0004856645058151984, 'samples': 3462144, 'steps': 18031, 'loss/train': 1.7649421691894531}}} 11/06/2021 23:47:39 - INFO - __main__ - Step 18039: {'lr': 0.00048565210495439337, 'samples': 3463488, 'steps': 18038, 'loss/train': 1.2623989582061768}} 11/06/2021 23:47:41 - INFO - __main__ - Step 18043: {'lr': 0.00048564501641221516, 'samples': 3464256, 'steps': 18042, 'loss/train': 1.9328662157058716}} 11/06/2021 23:47:43 - INFO - __main__ - Step 18048: {'lr': 0.00048563615334549316, 'samples': 3465216, 'steps': 18047, 'loss/train': 1.3910892009735107}} 11/06/2021 23:47:43 - INFO - __main__ - Step 18048: {'lr': 0.00048563615334549316, 'samples': 3465216, 'steps': 18047, 'loss/train': 1.3910892009735107}} 11/06/2021 23:47:48 - INFO - __main__ - Step 18056: {'lr': 0.00048562196691773066, 'samples': 3466752, 'steps': 18055, 'loss/train': 1.7463195323944092}} 11/06/2021 23:47:48 - INFO - __main__ - Step 18056: {'lr': 0.00048562196691773066, 'samples': 3466752, 'steps': 18055, 'loss/train': 1.7463195323944092}} 11/06/2021 23:47:51 - INFO - __main__ - Step 18063: {'lr': 0.00048560954821962434, 'samples': 3468096, 'steps': 18062, 'loss/train': 1.7900665998458862}} 11/06/2021 23:47:51 - INFO - __main__ - Step 18063: {'lr': 0.00048560954821962434, 'samples': 3468096, 'steps': 18062, 'loss/train': 1.7900665998458862}} 11/06/2021 23:47:56 - INFO - __main__ - Step 18072: {'lr': 0.0004855935736784316, 'samples': 3469824, 'steps': 18071, 'loss/train': 1.6132595539093018}}} 11/06/2021 23:47:58 - INFO - __main__ - Step 18076: {'lr': 0.0004855864711222857, 'samples': 3470592, 'steps': 18075, 'loss/train': 1.7564913034439087}}} 11/06/2021 23:47:59 - INFO - __main__ - Step 18080: {'lr': 0.00048557936686771376, 'samples': 3471360, 'steps': 18079, 'loss/train': 1.4804255962371826}} 11/06/2021 23:48:01 - INFO - __main__ - Step 18084: {'lr': 0.00048557226091476704, 'samples': 3472128, 'steps': 18083, 'loss/train': 1.7548338174819946}} 11/06/2021 23:48:01 - INFO - __main__ - Step 18084: {'lr': 0.00048557226091476704, 'samples': 3472128, 'steps': 18083, 'loss/train': 1.7548338174819946}} 11/06/2021 23:48:05 - INFO - __main__ - Step 18090: {'lr': 0.00048556159880100604, 'samples': 3473280, 'steps': 18089, 'loss/train': 1.8687024116516113}} 11/06/2021 23:48:08 - INFO - __main__ - Step 18095: {'lr': 0.00048555271078733637, 'samples': 3474240, 'steps': 18094, 'loss/train': 1.76602303981781}3}} 11/06/2021 23:48:10 - INFO - __main__ - Step 18099: {'lr': 0.00048554559846594026, 'samples': 3475008, 'steps': 18098, 'loss/train': 1.5190415382385254}} 11/06/2021 23:48:11 - INFO - __main__ - Step 18103: {'lr': 0.0004855384844464128, 'samples': 3475776, 'steps': 18102, 'loss/train': 0.6320077776908875}}} 11/06/2021 23:48:14 - INFO - __main__ - Step 18108: {'lr': 0.00048552958953408437, 'samples': 3476736, 'steps': 18107, 'loss/train': 1.5354968309402466}} 11/06/2021 23:48:16 - INFO - __main__ - Step 18112: {'lr': 0.0004855224716939488, 'samples': 3477504, 'steps': 18111, 'loss/train': 1.6454395055770874}}} 11/06/2021 23:48:18 - INFO - __main__ - Step 18116: {'lr': 0.00048551535215584865, 'samples': 3478272, 'steps': 18115, 'loss/train': 2.0963571071624756}} 11/06/2021 23:48:20 - INFO - __main__ - Step 18120: {'lr': 0.00048550823091983507, 'samples': 3479040, 'steps': 18119, 'loss/train': 1.7439087629318237}} 11/06/2021 23:48:21 - INFO - __main__ - Step 18124: {'lr': 0.00048550110798595953, 'samples': 3479808, 'steps': 18123, 'loss/train': 1.469502568244934}}} 11/06/2021 23:48:24 - INFO - __main__ - Step 18128: {'lr': 0.00048549398335427337, 'samples': 3480576, 'steps': 18127, 'loss/train': 1.1223576068878174}} 11/06/2021 23:48:26 - INFO - __main__ - Step 18132: {'lr': 0.0004854868570248279, 'samples': 3481344, 'steps': 18131, 'loss/train': 1.1974247694015503}}} 11/06/2021 23:48:28 - INFO - __main__ - Step 18136: {'lr': 0.00048547972899767454, 'samples': 3482112, 'steps': 18135, 'loss/train': 1.999104380607605}}} 11/06/2021 23:48:30 - INFO - __main__ - Step 18140: {'lr': 0.0004854725992728647, 'samples': 3482880, 'steps': 18139, 'loss/train': 1.5731804370880127}}} 11/06/2021 23:48:31 - INFO - __main__ - Step 18144: {'lr': 0.00048546546785044965, 'samples': 3483648, 'steps': 18143, 'loss/train': 1.4351774454116821}} 11/06/2021 23:48:34 - INFO - __main__ - Step 18149: {'lr': 0.00048545655118525206, 'samples': 3484608, 'steps': 18148, 'loss/train': 1.6034213304519653}} 11/06/2021 23:48:34 - INFO - __main__ - Step 18149: {'lr': 0.00048545655118525206, 'samples': 3484608, 'steps': 18148, 'loss/train': 1.6034213304519653}} 11/06/2021 23:48:38 - INFO - __main__ - Step 18157: {'lr': 0.00048544227900413706, 'samples': 3486144, 'steps': 18156, 'loss/train': 1.761330008506775}}} 11/06/2021 23:48:40 - INFO - __main__ - Step 18161: {'lr': 0.00048543514036747404, 'samples': 3486912, 'steps': 18160, 'loss/train': 1.5005325078964233}} 11/06/2021 23:48:41 - INFO - __main__ - Step 18165: {'lr': 0.000485428000033476, 'samples': 3487680, 'steps': 18164, 'loss/train': 1.6403565406799316}3}} 11/06/2021 23:48:43 - INFO - __main__ - Step 18169: {'lr': 0.0004854208580021944, 'samples': 3488448, 'steps': 18168, 'loss/train': 1.6747791767120361}}} 11/06/2021 23:48:46 - INFO - __main__ - Step 18174: {'lr': 0.0004854119280763657, 'samples': 3489408, 'steps': 18173, 'loss/train': 2.03631329536438}1}}} 11/06/2021 23:48:46 - INFO - __main__ - Step 18174: {'lr': 0.0004854119280763657, 'samples': 3489408, 'steps': 18173, 'loss/train': 2.03631329536438}1}}} 11/06/2021 23:48:50 - INFO - __main__ - Step 18182: {'lr': 0.00048539763467928665, 'samples': 3490944, 'steps': 18181, 'loss/train': 1.8729602098464966}} 11/06/2021 23:48:51 - INFO - __main__ - Step 18186: {'lr': 0.00048539048543512443, 'samples': 3491712, 'steps': 18185, 'loss/train': 1.6529532670974731}} 11/06/2021 23:48:53 - INFO - __main__ - Step 18190: {'lr': 0.000485383334493949, 'samples': 3492480, 'steps': 18189, 'loss/train': 1.6987266540527344}1}} 11/06/2021 23:48:56 - INFO - __main__ - Step 18195: {'lr': 0.00048537439343113354, 'samples': 3493440, 'steps': 18194, 'loss/train': 1.648136854171753}}} 11/06/2021 23:48:58 - INFO - __main__ - Step 18200: {'lr': 0.0004853654497169163, 'samples': 3494400, 'steps': 18199, 'loss/train': 1.4523322582244873}}} 11/06/2021 23:48:58 - INFO - __main__ - Step 18200: {'lr': 0.0004853654497169163, 'samples': 3494400, 'steps': 18199, 'loss/train': 1.4523322582244873}}} 11/06/2021 23:49:01 - INFO - __main__ - Step 18207: {'lr': 0.0004853529240628493, 'samples': 3495744, 'steps': 18206, 'loss/train': 1.7013306617736816}}} 11/06/2021 23:49:04 - INFO - __main__ - Step 18211: {'lr': 0.000485345764213201, 'samples': 3496512, 'steps': 18210, 'loss/train': 1.694120168685913}6}}} 11/06/2021 23:49:06 - INFO - __main__ - Step 18216: {'lr': 0.0004853368120151754, 'samples': 3497472, 'steps': 18215, 'loss/train': 2.0772011280059814}}} 11/06/2021 23:49:06 - INFO - __main__ - Step 18216: {'lr': 0.0004853368120151754, 'samples': 3497472, 'steps': 18215, 'loss/train': 2.0772011280059814}}} 11/06/2021 23:49:10 - INFO - __main__ - Step 18224: {'lr': 0.0004853224829843414, 'samples': 3499008, 'steps': 18223, 'loss/train': 1.524446964263916}}}} 11/06/2021 23:49:12 - INFO - __main__ - Step 18228: {'lr': 0.0004853153159241143, 'samples': 3499776, 'steps': 18227, 'loss/train': 2.811218500137329}}}} 11/06/2021 23:49:14 - INFO - __main__ - Step 18232: {'lr': 0.0004853081471674159, 'samples': 3500544, 'steps': 18231, 'loss/train': 1.558580994606018}}}} 11/06/2021 23:49:16 - INFO - __main__ - Step 18237: {'lr': 0.00048529918383595906, 'samples': 3501504, 'steps': 18236, 'loss/train': 1.4741995334625244}} 11/06/2021 23:49:18 - INFO - __main__ - Step 18241: {'lr': 0.0004852920112623895, 'samples': 3502272, 'steps': 18240, 'loss/train': 1.721117615699768}4}} 11/06/2021 23:49:20 - INFO - __main__ - Step 18245: {'lr': 0.0004852848369925167, 'samples': 3503040, 'steps': 18244, 'loss/train': 1.5317625999450684}}} 11/06/2021 23:49:22 - INFO - __main__ - Step 18249: {'lr': 0.0004852776610263925, 'samples': 3503808, 'steps': 18248, 'loss/train': 1.6794166564941406}}} 11/06/2021 23:49:24 - INFO - __main__ - Step 18253: {'lr': 0.00048527048336406855, 'samples': 3504576, 'steps': 18252, 'loss/train': 1.830647587776184}}} 11/06/2021 23:49:24 - INFO - __main__ - Step 18253: {'lr': 0.00048527048336406855, 'samples': 3504576, 'steps': 18252, 'loss/train': 1.830647587776184}}} 11/06/2021 23:49:28 - INFO - __main__ - Step 18261: {'lr': 0.00048525612295102836, 'samples': 3506112, 'steps': 18260, 'loss/train': 1.6988489627838135}} 11/06/2021 23:49:30 - INFO - __main__ - Step 18265: {'lr': 0.0004852489402004157, 'samples': 3506880, 'steps': 18264, 'loss/train': 1.6482011079788208}}} 11/06/2021 23:49:31 - INFO - __main__ - Step 18269: {'lr': 0.0004852417557538104, 'samples': 3507648, 'steps': 18268, 'loss/train': 1.8137376308441162}}} 11/06/2021 23:49:33 - INFO - __main__ - Step 18273: {'lr': 0.0004852345696112642, 'samples': 3508416, 'steps': 18272, 'loss/train': 1.773221492767334}}}} 11/06/2021 23:49:36 - INFO - __main__ - Step 18278: {'lr': 0.0004852255845482435, 'samples': 3509376, 'steps': 18277, 'loss/train': 1.9277666807174683}}} 11/06/2021 23:49:36 - INFO - __main__ - Step 18278: {'lr': 0.0004852255845482435, 'samples': 3509376, 'steps': 18277, 'loss/train': 1.9277666807174683}}} 11/06/2021 23:49:36 - INFO - __main__ - Step 18278: {'lr': 0.0004852255845482435, 'samples': 3509376, 'steps': 18277, 'loss/train': 1.9277666807174683}}} 11/06/2021 23:49:41 - INFO - __main__ - Step 18289: {'lr': 0.00048520580808270687, 'samples': 3511488, 'steps': 18288, 'loss/train': 1.6178340911865234}} 11/06/2021 23:49:44 - INFO - __main__ - Step 18294: {'lr': 0.0004851968145409211, 'samples': 3512448, 'steps': 18293, 'loss/train': 1.8689836263656616}}} 11/06/2021 23:49:46 - INFO - __main__ - Step 18299: {'lr': 0.00048518781834973405, 'samples': 3513408, 'steps': 18298, 'loss/train': 1.8023676872253418}} 11/06/2021 23:49:48 - INFO - __main__ - Step 18303: {'lr': 0.00048518061948928337, 'samples': 3514176, 'steps': 18302, 'loss/train': 1.6613613367080688}} 11/06/2021 23:49:50 - INFO - __main__ - Step 18307: {'lr': 0.00048517341893333267, 'samples': 3514944, 'steps': 18306, 'loss/train': 1.4218535423278809}} 11/06/2021 23:49:50 - INFO - __main__ - Step 18307: {'lr': 0.00048517341893333267, 'samples': 3514944, 'steps': 18306, 'loss/train': 1.4218535423278809}} 11/06/2021 23:49:53 - INFO - __main__ - Step 18314: {'lr': 0.0004851608138807778, 'samples': 3516288, 'steps': 18313, 'loss/train': 1.4870017766952515}}} 11/06/2021 23:49:56 - INFO - __main__ - Step 18320: {'lr': 0.0004851500054175725, 'samples': 3517440, 'steps': 18319, 'loss/train': 1.655856966972351}}}} 11/06/2021 23:49:56 - INFO - __main__ - Step 18320: {'lr': 0.0004851500054175725, 'samples': 3517440, 'steps': 18319, 'loss/train': 1.655856966972351}}}} 11/06/2021 23:49:59 - INFO - __main__ - Step 18326: {'lr': 0.0004851391931399884, 'samples': 3518592, 'steps': 18325, 'loss/train': 1.7464122772216797}}} 11/06/2021 23:50:01 - INFO - __main__ - Step 18330: {'lr': 0.0004851319828359198, 'samples': 3519360, 'steps': 18329, 'loss/train': 1.1017811298370361}}} 11/06/2021 23:50:04 - INFO - __main__ - Step 18335: {'lr': 0.000485122967572036, 'samples': 3520320, 'steps': 18334, 'loss/train': 1.6032307147979736}}}} 11/06/2021 23:50:04 - INFO - __main__ - Step 18335: {'lr': 0.000485122967572036, 'samples': 3520320, 'steps': 18334, 'loss/train': 1.6032307147979736}}}} 11/06/2021 23:50:08 - INFO - __main__ - Step 18343: {'lr': 0.0004851085376408396, 'samples': 3521856, 'steps': 18342, 'loss/train': 2.004495143890381}}}} 11/06/2021 23:50:08 - INFO - __main__ - Step 18343: {'lr': 0.0004851085376408396, 'samples': 3521856, 'steps': 18342, 'loss/train': 2.004495143890381}}}} 11/06/2021 23:50:12 - INFO - __main__ - Step 18350: {'lr': 0.000485095905889374, 'samples': 3523200, 'steps': 18349, 'loss/train': 1.9649345874786377}}}} 11/06/2021 23:50:14 - INFO - __main__ - Step 18354: {'lr': 0.0004850886854151885, 'samples': 3523968, 'steps': 18353, 'loss/train': 1.0320509672164917}}} 11/06/2021 23:50:16 - INFO - __main__ - Step 18359: {'lr': 0.0004850796574390977, 'samples': 3524928, 'steps': 18358, 'loss/train': 2.392427682876587}}}} 11/06/2021 23:50:16 - INFO - __main__ - Step 18359: {'lr': 0.0004850796574390977, 'samples': 3524928, 'steps': 18358, 'loss/train': 2.392427682876587}}}} 11/06/2021 23:50:20 - INFO - __main__ - Step 18367: {'lr': 0.00048506520716938496, 'samples': 3526464, 'steps': 18366, 'loss/train': 1.616496205329895}}} 11/06/2021 23:50:22 - INFO - __main__ - Step 18371: {'lr': 0.0004850579794925004, 'samples': 3527232, 'steps': 18370, 'loss/train': 1.0218453407287598}}} 11/06/2021 23:50:24 - INFO - __main__ - Step 18375: {'lr': 0.000485050750121, 'samples': 3528000, 'steps': 18374, 'loss/train': 1.681142807006836}7598}}} 11/06/2021 23:50:26 - INFO - __main__ - Step 18380: {'lr': 0.00048504171102365, 'samples': 3528960, 'steps': 18379, 'loss/train': 2.1345901489257812}8}}} 11/06/2021 23:50:26 - INFO - __main__ - Step 18380: {'lr': 0.00048504171102365, 'samples': 3528960, 'steps': 18379, 'loss/train': 2.1345901489257812}8}}} 11/06/2021 23:50:30 - INFO - __main__ - Step 18388: {'lr': 0.0004850272429608117, 'samples': 3530496, 'steps': 18387, 'loss/train': 1.9355649948120117}}} 11/06/2021 23:50:32 - INFO - __main__ - Step 18392: {'lr': 0.00048502000638777487, 'samples': 3531264, 'steps': 18391, 'loss/train': 1.8195085525512695}} 11/06/2021 23:50:34 - INFO - __main__ - Step 18396: {'lr': 0.00048501276812039585, 'samples': 3532032, 'steps': 18395, 'loss/train': 1.2785509824752808}} 11/06/2021 23:50:36 - INFO - __main__ - Step 18401: {'lr': 0.0004850037179035829, 'samples': 3532992, 'steps': 18400, 'loss/train': 1.3907315731048584}}} 11/06/2021 23:50:38 - INFO - __main__ - Step 18405: {'lr': 0.00048499647582412475, 'samples': 3533760, 'steps': 18404, 'loss/train': 1.2877929210662842}} 11/06/2021 23:50:41 - INFO - __main__ - Step 18409: {'lr': 0.000484989232050494, 'samples': 3534528, 'steps': 18408, 'loss/train': 1.753796935081482}42}} 11/06/2021 23:50:41 - INFO - __main__ - Step 18409: {'lr': 0.000484989232050494, 'samples': 3534528, 'steps': 18408, 'loss/train': 1.753796935081482}42}} 11/06/2021 23:50:44 - INFO - __main__ - Step 18416: {'lr': 0.00048497655137019454, 'samples': 3535872, 'steps': 18415, 'loss/train': 1.747482180595398}}} 11/06/2021 23:50:46 - INFO - __main__ - Step 18421: {'lr': 0.0004849674905650886, 'samples': 3536832, 'steps': 18420, 'loss/train': 1.2444932460784912}}} 11/06/2021 23:50:49 - INFO - __main__ - Step 18426: {'lr': 0.0004849584271131646, 'samples': 3537792, 'steps': 18425, 'loss/train': 2.1302671432495117}}} 11/06/2021 23:50:50 - INFO - __main__ - Step 18430: {'lr': 0.0004849511744459849, 'samples': 3538560, 'steps': 18429, 'loss/train': 1.3633182048797607}}} 11/06/2021 23:50:50 - INFO - __main__ - Step 18430: {'lr': 0.0004849511744459849, 'samples': 3538560, 'steps': 18429, 'loss/train': 1.3633182048797607}}} 11/06/2021 23:50:54 - INFO - __main__ - Step 18437: {'lr': 0.000484938478202635, 'samples': 3539904, 'steps': 18436, 'loss/train': 1.955151915550232}7}}} 11/06/2021 23:50:56 - INFO - __main__ - Step 18442: {'lr': 0.0004849294062815792, 'samples': 3540864, 'steps': 18441, 'loss/train': 1.1858590841293335}}} 11/06/2021 23:50:59 - INFO - __main__ - Step 18446: {'lr': 0.0004849221468393294, 'samples': 3541632, 'steps': 18445, 'loss/train': 1.6834259033203125}}} 11/06/2021 23:51:01 - INFO - __main__ - Step 18450: {'lr': 0.000484914885703443, 'samples': 3542400, 'steps': 18449, 'loss/train': 0.9411190748214722}}}} 11/06/2021 23:51:01 - INFO - __main__ - Step 18450: {'lr': 0.000484914885703443, 'samples': 3542400, 'steps': 18449, 'loss/train': 0.9411190748214722}}}} 11/06/2021 23:51:04 - INFO - __main__ - Step 18457: {'lr': 0.0004849021746404859, 'samples': 3543744, 'steps': 18456, 'loss/train': 1.9116475582122803}}} 11/06/2021 23:51:06 - INFO - __main__ - Step 18462: {'lr': 0.00048489309213448696, 'samples': 3544704, 'steps': 18461, 'loss/train': 1.5725513696670532}} 11/06/2021 23:51:06 - INFO - __main__ - Step 18462: {'lr': 0.00048489309213448696, 'samples': 3544704, 'steps': 18461, 'loss/train': 1.5725513696670532}} 11/06/2021 23:51:10 - INFO - __main__ - Step 18470: {'lr': 0.0004848785546212927, 'samples': 3546240, 'steps': 18469, 'loss/train': 1.7710777521133423}}} 11/06/2021 23:51:12 - INFO - __main__ - Step 18474: {'lr': 0.00048487128332468576, 'samples': 3547008, 'steps': 18473, 'loss/train': 2.0391478538513184}} 11/06/2021 23:51:14 - INFO - __main__ - Step 18478: {'lr': 0.0004848640103348088, 'samples': 3547776, 'steps': 18477, 'loss/train': 1.4348716735839844}}} 11/06/2021 23:51:16 - INFO - __main__ - Step 18483: {'lr': 0.00048485491671638146, 'samples': 3548736, 'steps': 18482, 'loss/train': 2.1826088428497314}} 11/06/2021 23:51:18 - INFO - __main__ - Step 18487: {'lr': 0.0004848476399168387, 'samples': 3549504, 'steps': 18486, 'loss/train': 1.0934349298477173}}} 11/06/2021 23:51:21 - INFO - __main__ - Step 18491: {'lr': 0.0004848403614241964, 'samples': 3550272, 'steps': 18490, 'loss/train': 1.4911553859710693}}} 11/06/2021 23:51:21 - INFO - __main__ - Step 18491: {'lr': 0.0004848403614241964, 'samples': 3550272, 'steps': 18490, 'loss/train': 1.4911553859710693}}} 11/06/2021 23:51:24 - INFO - __main__ - Step 18498: {'lr': 0.0004848276199882093, 'samples': 3551616, 'steps': 18497, 'loss/train': 1.7423537969589233}}} 11/06/2021 23:51:24 - INFO - __main__ - Step 18498: {'lr': 0.0004848276199882093, 'samples': 3551616, 'steps': 18497, 'loss/train': 1.7423537969589233}}} 11/06/2021 23:51:28 - INFO - __main__ - Step 18505: {'lr': 0.0004848148733675468, 'samples': 3552960, 'steps': 18504, 'loss/train': 0.7332248687744141}}} 11/06/2021 23:51:31 - INFO - __main__ - Step 18511: {'lr': 0.0004848039435663282, 'samples': 3554112, 'steps': 18510, 'loss/train': 1.341917634010315}}}} 11/06/2021 23:51:33 - INFO - __main__ - Step 18515: {'lr': 0.0004847966549161909, 'samples': 3554880, 'steps': 18514, 'loss/train': 1.137990117073059}}}} 11/06/2021 23:51:33 - INFO - __main__ - Step 18515: {'lr': 0.0004847966549161909, 'samples': 3554880, 'steps': 18514, 'loss/train': 1.137990117073059}}}} 11/06/2021 23:51:36 - INFO - __main__ - Step 18522: {'lr': 0.00048478389570534575, 'samples': 3556224, 'steps': 18521, 'loss/train': 2.240158796310425}}} 11/06/2021 23:51:38 - INFO - __main__ - Step 18527: {'lr': 0.00048477477880959715, 'samples': 3557184, 'steps': 18526, 'loss/train': 1.8836625814437866}} 11/06/2021 23:51:41 - INFO - __main__ - Step 18532: {'lr': 0.0004847656592692012, 'samples': 3558144, 'steps': 18531, 'loss/train': 1.590442419052124}6}} 11/06/2021 23:51:43 - INFO - __main__ - Step 18536: {'lr': 0.0004847583617328074, 'samples': 3558912, 'steps': 18535, 'loss/train': 1.7262592315673828}}} 11/06/2021 23:51:45 - INFO - __main__ - Step 18540: {'lr': 0.0004847510625039577, 'samples': 3559680, 'steps': 18539, 'loss/train': 1.090240716934204}}}} 11/06/2021 23:51:46 - INFO - __main__ - Step 18544: {'lr': 0.0004847437615827046, 'samples': 3560448, 'steps': 18543, 'loss/train': 1.538326621055603}}}} 11/06/2021 23:51:49 - INFO - __main__ - Step 18548: {'lr': 0.00048473645896910094, 'samples': 3561216, 'steps': 18547, 'loss/train': 1.8863868713378906}} 11/06/2021 23:51:51 - INFO - __main__ - Step 18553: {'lr': 0.0004847273283223084, 'samples': 3562176, 'steps': 18552, 'loss/train': 1.689874291419983}6}} 11/06/2021 23:51:51 - INFO - __main__ - Step 18553: {'lr': 0.0004847273283223084, 'samples': 3562176, 'steps': 18552, 'loss/train': 1.689874291419983}6}} 11/06/2021 23:51:55 - INFO - __main__ - Step 18560: {'lr': 0.0004847145409747125, 'samples': 3563520, 'steps': 18559, 'loss/train': 1.2590919733047485}}} 11/06/2021 23:51:56 - INFO - __main__ - Step 18564: {'lr': 0.00048470723159223266, 'samples': 3564288, 'steps': 18563, 'loss/train': 1.4612468481063843}} 11/06/2021 23:51:59 - INFO - __main__ - Step 18569: {'lr': 0.00048469809248464135, 'samples': 3565248, 'steps': 18568, 'loss/train': 1.9893124103546143}} 11/06/2021 23:52:01 - INFO - __main__ - Step 18574: {'lr': 0.00048468895073326663, 'samples': 3566208, 'steps': 18573, 'loss/train': 1.2497248649597168}} 11/06/2021 23:52:03 - INFO - __main__ - Step 18578: {'lr': 0.0004846816354287119, 'samples': 3566976, 'steps': 18577, 'loss/train': 1.6888341903686523}}} 11/06/2021 23:52:03 - INFO - __main__ - Step 18578: {'lr': 0.0004846816354287119, 'samples': 3566976, 'steps': 18577, 'loss/train': 1.6888341903686523}}} 11/06/2021 23:52:06 - INFO - __main__ - Step 18584: {'lr': 0.00048467065929957867, 'samples': 3568128, 'steps': 18583, 'loss/train': 4.058262348175049}}} 11/06/2021 23:52:09 - INFO - __main__ - Step 18590: {'lr': 0.00048465967936384217, 'samples': 3569280, 'steps': 18589, 'loss/train': 1.7790948152542114}} 11/06/2021 23:52:09 - INFO - __main__ - Step 18590: {'lr': 0.00048465967936384217, 'samples': 3569280, 'steps': 18589, 'loss/train': 1.7790948152542114}} 11/06/2021 23:52:12 - INFO - __main__ - Step 18597: {'lr': 0.0004846468646279304, 'samples': 3570624, 'steps': 18596, 'loss/train': 1.4772709608078003}}} 11/06/2021 23:52:15 - INFO - __main__ - Step 18601: {'lr': 0.0004846395395956553, 'samples': 3571392, 'steps': 18600, 'loss/train': 1.8090925216674805}}} 11/06/2021 23:52:17 - INFO - __main__ - Step 18606: {'lr': 0.0004846303809265061, 'samples': 3572352, 'steps': 18605, 'loss/train': 3.0497324466705322}}} 11/06/2021 23:52:19 - INFO - __main__ - Step 18610: {'lr': 0.0004846230520882069, 'samples': 3573120, 'steps': 18609, 'loss/train': 1.321014642715454}}}} 11/06/2021 23:52:19 - INFO - __main__ - Step 18610: {'lr': 0.0004846230520882069, 'samples': 3573120, 'steps': 18609, 'loss/train': 1.321014642715454}}}} 11/06/2021 23:52:22 - INFO - __main__ - Step 18617: {'lr': 0.0004846102225510903, 'samples': 3574464, 'steps': 18616, 'loss/train': 1.6835120916366577}}} 11/06/2021 23:52:24 - INFO - __main__ - Step 18621: {'lr': 0.0004846028890613471, 'samples': 3575232, 'steps': 18620, 'loss/train': 1.4925874471664429}}} 11/06/2021 23:52:27 - INFO - __main__ - Step 18626: {'lr': 0.0004845937198207343, 'samples': 3576192, 'steps': 18625, 'loss/train': 1.2264925241470337}}} 11/06/2021 23:52:29 - INFO - __main__ - Step 18630: {'lr': 0.00048458638252556153, 'samples': 3576960, 'steps': 18629, 'loss/train': 1.786418080329895}}} 11/06/2021 23:52:31 - INFO - __main__ - Step 18634: {'lr': 0.00048457904353917277, 'samples': 3577728, 'steps': 18633, 'loss/train': 1.4519388675689697}} 11/06/2021 23:52:33 - INFO - __main__ - Step 18638: {'lr': 0.0004845717028616208, 'samples': 3578496, 'steps': 18637, 'loss/train': 1.1412678956985474}}} 11/06/2021 23:52:35 - INFO - __main__ - Step 18642: {'lr': 0.0004845643604929586, 'samples': 3579264, 'steps': 18641, 'loss/train': 1.6623493432998657}}} 11/06/2021 23:52:37 - INFO - __main__ - Step 18647: {'lr': 0.00048455518015408773, 'samples': 3580224, 'steps': 18646, 'loss/train': 1.586424708366394}}} 11/06/2021 23:52:37 - INFO - __main__ - Step 18647: {'lr': 0.00048455518015408773, 'samples': 3580224, 'steps': 18646, 'loss/train': 1.586424708366394}}} 11/06/2021 23:52:41 - INFO - __main__ - Step 18654: {'lr': 0.00048454232324084004, 'samples': 3581568, 'steps': 18653, 'loss/train': 1.283732295036316}}} 11/06/2021 23:52:43 - INFO - __main__ - Step 18658: {'lr': 0.0004845349741082663, 'samples': 3582336, 'steps': 18657, 'loss/train': 1.749588131904602}}}} 11/06/2021 23:52:43 - INFO - __main__ - Step 18658: {'lr': 0.0004845349741082663, 'samples': 3582336, 'steps': 18657, 'loss/train': 1.749588131904602}}}} 11/06/2021 23:52:47 - INFO - __main__ - Step 18666: {'lr': 0.0004845202707706356, 'samples': 3583872, 'steps': 18665, 'loss/train': 2.0066657066345215}}} 11/06/2021 23:52:48 - INFO - __main__ - Step 18670: {'lr': 0.0004845129165656846, 'samples': 3584640, 'steps': 18669, 'loss/train': 1.3592866659164429}}} 11/06/2021 23:52:50 - INFO - __main__ - Step 18674: {'lr': 0.0004845055606700472, 'samples': 3585408, 'steps': 18673, 'loss/train': 1.6786866188049316}}} 11/06/2021 23:52:53 - INFO - __main__ - Step 18679: {'lr': 0.00048449636342305343, 'samples': 3586368, 'steps': 18678, 'loss/train': 1.582143783569336}}} 11/06/2021 23:52:53 - INFO - __main__ - Step 18679: {'lr': 0.00048449636342305343, 'samples': 3586368, 'steps': 18678, 'loss/train': 1.582143783569336}}} 11/06/2021 23:52:57 - INFO - __main__ - Step 18687: {'lr': 0.00048448164233356344, 'samples': 3587904, 'steps': 18686, 'loss/train': 1.7413592338562012}} 11/06/2021 23:52:59 - INFO - __main__ - Step 18691: {'lr': 0.0004844742792531005, 'samples': 3588672, 'steps': 18690, 'loss/train': 1.1845340728759766}}} 11/06/2021 23:53:01 - INFO - __main__ - Step 18695: {'lr': 0.0004844669144822297, 'samples': 3589440, 'steps': 18694, 'loss/train': 1.8748408555984497}}} 11/06/2021 23:53:03 - INFO - __main__ - Step 18699: {'lr': 0.00048445954802100414, 'samples': 3590208, 'steps': 18698, 'loss/train': 1.5242774486541748}} 11/06/2021 23:53:05 - INFO - __main__ - Step 18704: {'lr': 0.0004844503375674916, 'samples': 3591168, 'steps': 18703, 'loss/train': 1.0163267850875854}}} 11/06/2021 23:53:07 - INFO - __main__ - Step 18708: {'lr': 0.00048444296730316196, 'samples': 3591936, 'steps': 18707, 'loss/train': 0.8280439972877502}} 11/06/2021 23:53:09 - INFO - __main__ - Step 18712: {'lr': 0.00048443559534865017, 'samples': 3592704, 'steps': 18711, 'loss/train': 1.2353262901306152}} 11/06/2021 23:53:11 - INFO - __main__ - Step 18716: {'lr': 0.0004844282217040094, 'samples': 3593472, 'steps': 18715, 'loss/train': 1.9265437126159668}}} 11/06/2021 23:53:13 - INFO - __main__ - Step 18720: {'lr': 0.0004844208463692928, 'samples': 3594240, 'steps': 18719, 'loss/train': 1.9108844995498657}}} 11/06/2021 23:53:15 - INFO - __main__ - Step 18725: {'lr': 0.0004844116248243089, 'samples': 3595200, 'steps': 18724, 'loss/train': 2.38761568069458}7}}} 11/06/2021 23:53:17 - INFO - __main__ - Step 18729: {'lr': 0.0004844042456871162, 'samples': 3595968, 'steps': 18728, 'loss/train': 1.1527280807495117}}} 11/06/2021 23:53:19 - INFO - __main__ - Step 18733: {'lr': 0.0004843968648600204, 'samples': 3596736, 'steps': 18732, 'loss/train': 1.6883686780929565}}} 11/06/2021 23:53:21 - INFO - __main__ - Step 18737: {'lr': 0.0004843894823430749, 'samples': 3597504, 'steps': 18736, 'loss/train': 1.7384228706359863}}} 11/06/2021 23:53:23 - INFO - __main__ - Step 18741: {'lr': 0.0004843820981363328, 'samples': 3598272, 'steps': 18740, 'loss/train': 1.6665148735046387}}} 11/06/2021 23:53:25 - INFO - __main__ - Step 18745: {'lr': 0.00048437471223984743, 'samples': 3599040, 'steps': 18744, 'loss/train': 1.527302861213684}}} 11/06/2021 23:53:27 - INFO - __main__ - Step 18749: {'lr': 0.000484367324653672, 'samples': 3599808, 'steps': 18748, 'loss/train': 1.6161614656448364}}}} 11/06/2021 23:53:29 - INFO - __main__ - Step 18753: {'lr': 0.00048435993537785976, 'samples': 3600576, 'steps': 18752, 'loss/train': 0.9509969353675842}} 11/06/2021 23:53:31 - INFO - __main__ - Step 18757: {'lr': 0.000484352544412464, 'samples': 3601344, 'steps': 18756, 'loss/train': 1.6753501892089844}2}} 11/06/2021 23:53:33 - INFO - __main__ - Step 18762: {'lr': 0.0004843433033298237, 'samples': 3602304, 'steps': 18761, 'loss/train': 0.3044757544994354}}} 11/06/2021 23:53:35 - INFO - __main__ - Step 18767: {'lr': 0.0004843340596073964, 'samples': 3603264, 'steps': 18766, 'loss/train': 1.588987946510315}}}} 11/06/2021 23:53:35 - INFO - __main__ - Step 18767: {'lr': 0.0004843340596073964, 'samples': 3603264, 'steps': 18766, 'loss/train': 1.588987946510315}}}} 11/06/2021 23:53:39 - INFO - __main__ - Step 18774: {'lr': 0.00048432111396135447, 'samples': 3604608, 'steps': 18773, 'loss/train': 1.7124041318893433}} 11/06/2021 23:53:41 - INFO - __main__ - Step 18778: {'lr': 0.0004843137141265197, 'samples': 3605376, 'steps': 18777, 'loss/train': 1.467863917350769}3}} 11/06/2021 23:53:43 - INFO - __main__ - Step 18783: {'lr': 0.00048430446195747424, 'samples': 3606336, 'steps': 18782, 'loss/train': 1.7206259965896606}} 11/06/2021 23:53:45 - INFO - __main__ - Step 18788: {'lr': 0.0004842952071490794, 'samples': 3607296, 'steps': 18787, 'loss/train': 1.8691179752349854}}} 11/06/2021 23:53:45 - INFO - __main__ - Step 18788: {'lr': 0.0004842952071490794, 'samples': 3607296, 'steps': 18787, 'loss/train': 1.8691179752349854}}} 11/06/2021 23:53:49 - INFO - __main__ - Step 18795: {'lr': 0.00048428224598341815, 'samples': 3608640, 'steps': 18794, 'loss/train': 1.8937662839889526}} 11/06/2021 23:53:51 - INFO - __main__ - Step 18799: {'lr': 0.0004842748372806147, 'samples': 3609408, 'steps': 18798, 'loss/train': 1.7149622440338135}}} 11/06/2021 23:53:53 - INFO - __main__ - Step 18804: {'lr': 0.0004842655740270026, 'samples': 3610368, 'steps': 18803, 'loss/train': 1.4389371871948242}}} 11/06/2021 23:53:55 - INFO - __main__ - Step 18808: {'lr': 0.00048425816152409173, 'samples': 3611136, 'steps': 18807, 'loss/train': 1.5551064014434814}} 11/06/2021 23:53:57 - INFO - __main__ - Step 18812: {'lr': 0.0004842507473323311, 'samples': 3611904, 'steps': 18811, 'loss/train': 1.5817826986312866}}} 11/06/2021 23:53:57 - INFO - __main__ - Step 18812: {'lr': 0.0004842507473323311, 'samples': 3611904, 'steps': 18811, 'loss/train': 1.5817826986312866}}} 11/06/2021 23:54:01 - INFO - __main__ - Step 18819: {'lr': 0.00048423776843311585, 'samples': 3613248, 'steps': 18818, 'loss/train': 1.7637563943862915}} 11/06/2021 23:54:01 - INFO - __main__ - Step 18819: {'lr': 0.00048423776843311585, 'samples': 3613248, 'steps': 18818, 'loss/train': 1.7637563943862915}} 11/06/2021 23:54:05 - INFO - __main__ - Step 18827: {'lr': 0.0004842229290728226, 'samples': 3614784, 'steps': 18826, 'loss/train': 1.6318795680999756}}} 11/06/2021 23:54:07 - INFO - __main__ - Step 18832: {'lr': 0.0004842136510426519, 'samples': 3615744, 'steps': 18831, 'loss/train': 1.83372962474823}6}}} 11/06/2021 23:54:10 - INFO - __main__ - Step 18837: {'lr': 0.00048420437037415486, 'samples': 3616704, 'steps': 18836, 'loss/train': 1.6292400360107422}} 11/06/2021 23:54:12 - INFO - __main__ - Step 18841: {'lr': 0.00048419694393983244, 'samples': 3617472, 'steps': 18840, 'loss/train': 1.542981505393982}}} 11/06/2021 23:54:14 - INFO - __main__ - Step 18845: {'lr': 0.00048418951581710154, 'samples': 3618240, 'steps': 18844, 'loss/train': 1.0846983194351196}} 11/06/2021 23:54:16 - INFO - __main__ - Step 18849: {'lr': 0.0004841820860060157, 'samples': 3619008, 'steps': 18848, 'loss/train': 1.4073768854141235}}} 11/06/2021 23:54:17 - INFO - __main__ - Step 18853: {'lr': 0.00048417465450662856, 'samples': 3619776, 'steps': 18852, 'loss/train': 1.5022872686386108}} 11/06/2021 23:54:19 - INFO - __main__ - Step 18857: {'lr': 0.0004841672213189936, 'samples': 3620544, 'steps': 18856, 'loss/train': 1.4299774169921875}}} 11/06/2021 23:54:22 - INFO - __main__ - Step 18861: {'lr': 0.0004841597864431645, 'samples': 3621312, 'steps': 18860, 'loss/train': 1.5714701414108276}}} 11/06/2021 23:54:23 - INFO - __main__ - Step 18865: {'lr': 0.00048415234987919474, 'samples': 3622080, 'steps': 18864, 'loss/train': 1.3690611124038696}} 11/06/2021 23:54:25 - INFO - __main__ - Step 18869: {'lr': 0.00048414491162713814, 'samples': 3622848, 'steps': 18868, 'loss/train': 1.482108473777771}}} 11/06/2021 23:54:27 - INFO - __main__ - Step 18873: {'lr': 0.0004841374716870481, 'samples': 3623616, 'steps': 18872, 'loss/train': 1.7237398624420166}}} 11/06/2021 23:54:29 - INFO - __main__ - Step 18878: {'lr': 0.0004841281693882204, 'samples': 3624576, 'steps': 18877, 'loss/train': 1.2580933570861816}}} 11/06/2021 23:54:29 - INFO - __main__ - Step 18878: {'lr': 0.0004841281693882204, 'samples': 3624576, 'steps': 18877, 'loss/train': 1.2580933570861816}}} 11/06/2021 23:54:33 - INFO - __main__ - Step 18885: {'lr': 0.0004841151417391144, 'samples': 3625920, 'steps': 18884, 'loss/train': 1.3188947439193726}}} 11/06/2021 23:54:35 - INFO - __main__ - Step 18890: {'lr': 0.0004841058331107904, 'samples': 3626880, 'steps': 18889, 'loss/train': 1.5939115285873413}}} 11/06/2021 23:54:37 - INFO - __main__ - Step 18895: {'lr': 0.00048409652184535447, 'samples': 3627840, 'steps': 18894, 'loss/train': 1.5139254331588745}} 11/06/2021 23:54:37 - INFO - __main__ - Step 18895: {'lr': 0.00048409652184535447, 'samples': 3627840, 'steps': 18894, 'loss/train': 1.5139254331588745}} 11/06/2021 23:54:41 - INFO - __main__ - Step 18902: {'lr': 0.00048408348164359594, 'samples': 3629184, 'steps': 18901, 'loss/train': 1.845752477645874}}} 11/06/2021 23:54:43 - INFO - __main__ - Step 18906: {'lr': 0.00048407602777927856, 'samples': 3629952, 'steps': 18905, 'loss/train': 1.2008774280548096}} 11/06/2021 23:54:45 - INFO - __main__ - Step 18910: {'lr': 0.0004840685722274244, 'samples': 3630720, 'steps': 18909, 'loss/train': 1.252254605293274}6}} 11/06/2021 23:54:45 - INFO - __main__ - Step 18910: {'lr': 0.0004840685722274244, 'samples': 3630720, 'steps': 18909, 'loss/train': 1.252254605293274}6}} 11/06/2021 23:54:49 - INFO - __main__ - Step 18918: {'lr': 0.00048405365606132096, 'samples': 3632256, 'steps': 18917, 'loss/train': 1.8452666997909546}} 11/06/2021 23:54:51 - INFO - __main__ - Step 18922: {'lr': 0.0004840461954471792, 'samples': 3633024, 'steps': 18921, 'loss/train': 1.8817951679229736}}} 11/06/2021 23:54:53 - INFO - __main__ - Step 18926: {'lr': 0.0004840387331457157, 'samples': 3633792, 'steps': 18925, 'loss/train': 1.4895330667495728}}} 11/06/2021 23:54:55 - INFO - __main__ - Step 18931: {'lr': 0.00048402940289617223, 'samples': 3634752, 'steps': 18930, 'loss/train': 1.1204884052276611}} 11/06/2021 23:54:58 - INFO - __main__ - Step 18935: {'lr': 0.00048402193679843175, 'samples': 3635520, 'steps': 18934, 'loss/train': 1.0119448900222778}} 11/06/2021 23:55:00 - INFO - __main__ - Step 18939: {'lr': 0.00048401446901354453, 'samples': 3636288, 'steps': 18938, 'loss/train': 1.4605997800827026}} 11/06/2021 23:55:01 - INFO - __main__ - Step 18943: {'lr': 0.0004840069995415643, 'samples': 3637056, 'steps': 18942, 'loss/train': 1.3233683109283447}}} 11/06/2021 23:55:03 - INFO - __main__ - Step 18947: {'lr': 0.000483999528382545, 'samples': 3637824, 'steps': 18946, 'loss/train': 1.5244332551956177}}}} 11/06/2021 23:55:06 - INFO - __main__ - Step 18952: {'lr': 0.0004839901870614543, 'samples': 3638784, 'steps': 18951, 'loss/train': 1.4252386093139648}}} 11/06/2021 23:55:08 - INFO - __main__ - Step 18956: {'lr': 0.00048398271210679393, 'samples': 3639552, 'steps': 18955, 'loss/train': 1.3582264184951782}} 11/06/2021 23:55:10 - INFO - __main__ - Step 18960: {'lr': 0.00048397523546526966, 'samples': 3640320, 'steps': 18959, 'loss/train': 1.5042515993118286}} 11/06/2021 23:55:10 - INFO - __main__ - Step 18960: {'lr': 0.00048397523546526966, 'samples': 3640320, 'steps': 18959, 'loss/train': 1.5042515993118286}} 11/06/2021 23:55:13 - INFO - __main__ - Step 18967: {'lr': 0.00048396214728374786, 'samples': 3641664, 'steps': 18966, 'loss/train': 1.5210620164871216}} 11/06/2021 23:55:15 - INFO - __main__ - Step 18972: {'lr': 0.000483952795420052, 'samples': 3642624, 'steps': 18971, 'loss/train': 1.4770593643188477}6}} 11/06/2021 23:55:18 - INFO - __main__ - Step 18977: {'lr': 0.00048394344092096816, 'samples': 3643584, 'steps': 18976, 'loss/train': 1.8595176935195923}} 11/06/2021 23:55:18 - INFO - __main__ - Step 18977: {'lr': 0.00048394344092096816, 'samples': 3643584, 'steps': 18976, 'loss/train': 1.8595176935195923}} 11/06/2021 23:55:21 - INFO - __main__ - Step 18984: {'lr': 0.0004839303401949996, 'samples': 3644928, 'steps': 18983, 'loss/train': 2.2249767780303955}}} 11/06/2021 23:55:23 - INFO - __main__ - Step 18988: {'lr': 0.00048392285174693727, 'samples': 3645696, 'steps': 18987, 'loss/train': 0.914047360420227}}} 11/06/2021 23:55:26 - INFO - __main__ - Step 18993: {'lr': 0.0004839134888153202, 'samples': 3646656, 'steps': 18992, 'loss/train': 1.9287071228027344}}} 11/06/2021 23:55:28 - INFO - __main__ - Step 18997: {'lr': 0.0004839059965728608, 'samples': 3647424, 'steps': 18996, 'loss/train': 1.8564058542251587}}} 11/06/2021 23:55:28 - INFO - __main__ - Step 18997: {'lr': 0.0004839059965728608, 'samples': 3647424, 'steps': 18996, 'loss/train': 1.8564058542251587}}} 11/06/2021 23:55:31 - INFO - __main__ - Step 19004: {'lr': 0.00048389288109090383, 'samples': 3648768, 'steps': 19003, 'loss/train': 1.1992428302764893}} 11/06/2021 23:55:34 - INFO - __main__ - Step 19009: {'lr': 0.00048388350972783346, 'samples': 3649728, 'steps': 19008, 'loss/train': 1.362441062927246}}} 11/06/2021 23:55:36 - INFO - __main__ - Step 19014: {'lr': 0.0004838741357301555, 'samples': 3650688, 'steps': 19013, 'loss/train': 1.4588598012924194}}} 11/06/2021 23:55:38 - INFO - __main__ - Step 19018: {'lr': 0.0004838666346351667, 'samples': 3651456, 'steps': 19017, 'loss/train': 1.271425724029541}}}} 11/06/2021 23:55:38 - INFO - __main__ - Step 19018: {'lr': 0.0004838666346351667, 'samples': 3651456, 'steps': 19017, 'loss/train': 1.271425724029541}}}} 11/06/2021 23:55:41 - INFO - __main__ - Step 19025: {'lr': 0.000483853503661966, 'samples': 3652800, 'steps': 19024, 'loss/train': 1.6738646030426025}}}} 11/06/2021 23:55:44 - INFO - __main__ - Step 19030: {'lr': 0.0004838441212342538, 'samples': 3653760, 'steps': 19029, 'loss/train': 1.3828831911087036}}} 11/06/2021 23:55:46 - INFO - __main__ - Step 19035: {'lr': 0.0004838347361723778, 'samples': 3654720, 'steps': 19034, 'loss/train': 1.3136703968048096}}} 11/06/2021 23:55:48 - INFO - __main__ - Step 19039: {'lr': 0.00048382722622635014, 'samples': 3655488, 'steps': 19038, 'loss/train': 1.4272637367248535}} 11/06/2021 23:55:48 - INFO - __main__ - Step 19039: {'lr': 0.00048382722622635014, 'samples': 3655488, 'steps': 19038, 'loss/train': 1.4272637367248535}} 11/06/2021 23:55:51 - INFO - __main__ - Step 19046: {'lr': 0.000483814079764515, 'samples': 3656832, 'steps': 19045, 'loss/train': 1.874314308166504}35}} 11/06/2021 23:55:53 - INFO - __main__ - Step 19050: {'lr': 0.0004838065651828242, 'samples': 3657600, 'steps': 19049, 'loss/train': 1.7106494903564453}}} 11/06/2021 23:55:56 - INFO - __main__ - Step 19055: {'lr': 0.00048379716958535043, 'samples': 3658560, 'steps': 19054, 'loss/train': 1.7493064403533936}} 11/06/2021 23:55:58 - INFO - __main__ - Step 19060: {'lr': 0.00048378777135424166, 'samples': 3659520, 'steps': 19059, 'loss/train': 1.3405288457870483}} 11/06/2021 23:55:58 - INFO - __main__ - Step 19060: {'lr': 0.00048378777135424166, 'samples': 3659520, 'steps': 19059, 'loss/train': 1.3405288457870483}} 11/06/2021 23:56:01 - INFO - __main__ - Step 19067: {'lr': 0.0004837746094063844, 'samples': 3660864, 'steps': 19066, 'loss/train': 1.9289908409118652}}} 11/06/2021 23:56:03 - INFO - __main__ - Step 19071: {'lr': 0.0004837670859759294, 'samples': 3661632, 'steps': 19070, 'loss/train': 1.667305827140808}}}} 11/06/2021 23:56:06 - INFO - __main__ - Step 19076: {'lr': 0.0004837576793179005, 'samples': 3662592, 'steps': 19075, 'loss/train': 0.8432311415672302}}} 11/06/2021 23:56:08 - INFO - __main__ - Step 19081: {'lr': 0.00048374827002668156, 'samples': 3663552, 'steps': 19080, 'loss/train': 1.3041592836380005}} 11/06/2021 23:56:10 - INFO - __main__ - Step 19085: {'lr': 0.00048374074069788077, 'samples': 3664320, 'steps': 19084, 'loss/train': 0.9480411410331726}} 11/06/2021 23:56:10 - INFO - __main__ - Step 19085: {'lr': 0.00048374074069788077, 'samples': 3664320, 'steps': 19084, 'loss/train': 0.9480411410331726}} 11/06/2021 23:56:13 - INFO - __main__ - Step 19091: {'lr': 0.0004837294435450974, 'samples': 3665472, 'steps': 19090, 'loss/train': 1.4970214366912842}}} 11/06/2021 23:56:16 - INFO - __main__ - Step 19097: {'lr': 0.00048371814260097834, 'samples': 3666624, 'steps': 19096, 'loss/train': 1.3919228315353394}} 11/06/2021 23:56:16 - INFO - __main__ - Step 19097: {'lr': 0.00048371814260097834, 'samples': 3666624, 'steps': 19096, 'loss/train': 1.3919228315353394}} 11/06/2021 23:56:20 - INFO - __main__ - Step 19104: {'lr': 0.0004837049533745903, 'samples': 3667968, 'steps': 19103, 'loss/train': 1.8435603380203247}}} 11/06/2021 23:56:20 - INFO - __main__ - Step 19104: {'lr': 0.0004837049533745903, 'samples': 3667968, 'steps': 19103, 'loss/train': 1.8435603380203247}}} 11/06/2021 23:56:23 - INFO - __main__ - Step 19112: {'lr': 0.0004836898736547902, 'samples': 3669504, 'steps': 19111, 'loss/train': 1.8408292531967163}}} 11/06/2021 23:56:26 - INFO - __main__ - Step 19117: {'lr': 0.0004836804454077334, 'samples': 3670464, 'steps': 19116, 'loss/train': 1.6894001960754395}}} 11/06/2021 23:56:28 - INFO - __main__ - Step 19122: {'lr': 0.0004836710145283565, 'samples': 3671424, 'steps': 19121, 'loss/train': 1.6386430263519287}}} 11/06/2021 23:56:28 - INFO - __main__ - Step 19122: {'lr': 0.0004836710145283565, 'samples': 3671424, 'steps': 19121, 'loss/train': 1.6386430263519287}}} 11/06/2021 23:56:31 - INFO - __main__ - Step 19129: {'lr': 0.00048365780687513346, 'samples': 3672768, 'steps': 19128, 'loss/train': 0.9183880686759949}} 11/06/2021 23:56:33 - INFO - __main__ - Step 19133: {'lr': 0.00048365025732848433, 'samples': 3673536, 'steps': 19132, 'loss/train': 1.1418144702911377}} 11/06/2021 23:56:36 - INFO - __main__ - Step 19138: {'lr': 0.00048364081802639724, 'samples': 3674496, 'steps': 19137, 'loss/train': 1.5668922662734985}} 11/06/2021 23:56:38 - INFO - __main__ - Step 19142: {'lr': 0.00048363326468977343, 'samples': 3675264, 'steps': 19141, 'loss/train': 2.229715585708618}}} 11/06/2021 23:56:38 - INFO - __main__ - Step 19142: {'lr': 0.00048363326468977343, 'samples': 3675264, 'steps': 19141, 'loss/train': 2.229715585708618}}} 11/06/2021 23:56:41 - INFO - __main__ - Step 19148: {'lr': 0.00048362193152670847, 'samples': 3676416, 'steps': 19147, 'loss/train': 1.4134140014648438}} 11/06/2021 23:56:44 - INFO - __main__ - Step 19154: {'lr': 0.00048361059457405176, 'samples': 3677568, 'steps': 19153, 'loss/train': 1.160764217376709}}} 11/06/2021 23:56:44 - INFO - __main__ - Step 19154: {'lr': 0.00048361059457405176, 'samples': 3677568, 'steps': 19153, 'loss/train': 1.160764217376709}}} 11/06/2021 23:56:47 - INFO - __main__ - Step 19160: {'lr': 0.00048359925383198714, 'samples': 3678720, 'steps': 19159, 'loss/train': 1.9010136127471924}} 11/06/2021 23:56:50 - INFO - __main__ - Step 19165: {'lr': 0.0004835898003190462, 'samples': 3679680, 'steps': 19164, 'loss/train': 1.830366611480713}4}} 11/06/2021 23:56:52 - INFO - __main__ - Step 19170: {'lr': 0.0004835803441748062, 'samples': 3680640, 'steps': 19169, 'loss/train': 1.7084600925445557}}} 11/06/2021 23:56:52 - INFO - __main__ - Step 19170: {'lr': 0.0004835803441748062, 'samples': 3680640, 'steps': 19169, 'loss/train': 1.7084600925445557}}} 11/06/2021 23:56:55 - INFO - __main__ - Step 19177: {'lr': 0.0004835671011524908, 'samples': 3681984, 'steps': 19176, 'loss/train': 1.6103253364562988}}} 11/06/2021 23:56:57 - INFO - __main__ - Step 19181: {'lr': 0.00048355953139583087, 'samples': 3682752, 'steps': 19180, 'loss/train': 1.4923588037490845}} 11/06/2021 23:57:00 - INFO - __main__ - Step 19186: {'lr': 0.0004835500668321501, 'samples': 3683712, 'steps': 19185, 'loss/train': 1.4003887176513672}}} 11/06/2021 23:57:02 - INFO - __main__ - Step 19190: {'lr': 0.00048354249328698743, 'samples': 3684480, 'steps': 19189, 'loss/train': 1.5004618167877197}} 11/06/2021 23:57:02 - INFO - __main__ - Step 19190: {'lr': 0.00048354249328698743, 'samples': 3684480, 'steps': 19189, 'loss/train': 1.5004618167877197}} 11/06/2021 23:57:06 - INFO - __main__ - Step 19198: {'lr': 0.0004835273411456456, 'samples': 3686016, 'steps': 19197, 'loss/train': 1.6436011791229248}}} 11/06/2021 23:57:08 - INFO - __main__ - Step 19202: {'lr': 0.00048351976254957585, 'samples': 3686784, 'steps': 19201, 'loss/train': 1.810278296470642}}} 11/06/2021 23:57:10 - INFO - __main__ - Step 19207: {'lr': 0.000483510286937036, 'samples': 3687744, 'steps': 19206, 'loss/train': 1.5403071641921997}}}} 11/06/2021 23:57:12 - INFO - __main__ - Step 19211: {'lr': 0.00048350270455310864, 'samples': 3688512, 'steps': 19210, 'loss/train': 1.5911760330200195}} 11/06/2021 23:57:12 - INFO - __main__ - Step 19211: {'lr': 0.00048350270455310864, 'samples': 3688512, 'steps': 19210, 'loss/train': 1.5911760330200195}} 11/06/2021 23:57:15 - INFO - __main__ - Step 19218: {'lr': 0.00048348943133057903, 'samples': 3689856, 'steps': 19217, 'loss/train': 1.2457647323608398}} 11/06/2021 23:57:18 - INFO - __main__ - Step 19222: {'lr': 0.00048348184431742377, 'samples': 3690624, 'steps': 19221, 'loss/train': 1.4660913944244385}} 11/06/2021 23:57:20 - INFO - __main__ - Step 19227: {'lr': 0.00048347235818391144, 'samples': 3691584, 'steps': 19226, 'loss/train': 1.223244071006775}}} 11/06/2021 23:57:22 - INFO - __main__ - Step 19231: {'lr': 0.0004834647673835137, 'samples': 3692352, 'steps': 19230, 'loss/train': 1.7404580116271973}}} 11/06/2021 23:57:24 - INFO - __main__ - Step 19235: {'lr': 0.000483457174899986, 'samples': 3693120, 'steps': 19234, 'loss/train': 1.780066728591919}3}}} 11/06/2021 23:57:26 - INFO - __main__ - Step 19239: {'lr': 0.00048344958073338315, 'samples': 3693888, 'steps': 19238, 'loss/train': 0.8222588896751404}} 11/06/2021 23:57:28 - INFO - __main__ - Step 19243: {'lr': 0.0004834419848837598, 'samples': 3694656, 'steps': 19242, 'loss/train': 1.6340835094451904}}} 11/06/2021 23:57:30 - INFO - __main__ - Step 19248: {'lr': 0.00048343248770506655, 'samples': 3695616, 'steps': 19247, 'loss/train': 2.032688856124878}}} 11/06/2021 23:57:32 - INFO - __main__ - Step 19252: {'lr': 0.0004834248880688474, 'samples': 3696384, 'steps': 19251, 'loss/train': 1.692755103111267}}}} 11/06/2021 23:57:32 - INFO - __main__ - Step 19252: {'lr': 0.0004834248880688474, 'samples': 3696384, 'steps': 19251, 'loss/train': 1.692755103111267}}}} 11/06/2021 23:57:36 - INFO - __main__ - Step 19259: {'lr': 0.0004834115846561572, 'samples': 3697728, 'steps': 19258, 'loss/train': 1.2784446477890015}}} 11/06/2021 23:57:38 - INFO - __main__ - Step 19264: {'lr': 0.0004834020790633545, 'samples': 3698688, 'steps': 19263, 'loss/train': 1.5909192562103271}}} 11/06/2021 23:57:38 - INFO - __main__ - Step 19264: {'lr': 0.0004834020790633545, 'samples': 3698688, 'steps': 19263, 'loss/train': 1.5909192562103271}}} 11/06/2021 23:57:42 - INFO - __main__ - Step 19271: {'lr': 0.00048338876681642504, 'samples': 3700032, 'steps': 19270, 'loss/train': 2.3842742443084717}} 11/06/2021 23:57:44 - INFO - __main__ - Step 19276: {'lr': 0.0004833792549137598, 'samples': 3700992, 'steps': 19275, 'loss/train': 0.9541102051734924}}} 11/06/2021 23:57:47 - INFO - __main__ - Step 19281: {'lr': 0.0004833697403821672, 'samples': 3701952, 'steps': 19280, 'loss/train': 1.5551531314849854}}} 11/06/2021 23:57:47 - INFO - __main__ - Step 19281: {'lr': 0.0004833697403821672, 'samples': 3701952, 'steps': 19280, 'loss/train': 1.5551531314849854}}} 11/06/2021 23:57:49 - INFO - __main__ - Step 19287: {'lr': 0.0004833583194742231, 'samples': 3703104, 'steps': 19286, 'loss/train': 2.003950595855713}}}} 11/06/2021 23:57:52 - INFO - __main__ - Step 19292: {'lr': 0.0004833487991593679, 'samples': 3704064, 'steps': 19291, 'loss/train': 1.7094693183898926}}} 11/06/2021 23:57:54 - INFO - __main__ - Step 19297: {'lr': 0.00048333927621592844, 'samples': 3705024, 'steps': 19296, 'loss/train': 1.6878700256347656}} 11/06/2021 23:57:56 - INFO - __main__ - Step 19301: {'lr': 0.00048333165596866837, 'samples': 3705792, 'steps': 19300, 'loss/train': 1.9592984914779663}} 11/06/2021 23:57:56 - INFO - __main__ - Step 19301: {'lr': 0.00048333165596866837, 'samples': 3705792, 'steps': 19300, 'loss/train': 1.9592984914779663}} 11/06/2021 23:57:59 - INFO - __main__ - Step 19308: {'lr': 0.000483318316488274, 'samples': 3707136, 'steps': 19307, 'loss/train': 1.7169498205184937}3}} 11/06/2021 23:58:02 - INFO - __main__ - Step 19313: {'lr': 0.00048330878513408616, 'samples': 3708096, 'steps': 19312, 'loss/train': 1.7465753555297852}} 11/06/2021 23:58:02 - INFO - __main__ - Step 19313: {'lr': 0.00048330878513408616, 'samples': 3708096, 'steps': 19312, 'loss/train': 1.7465753555297852}} 11/06/2021 23:58:06 - INFO - __main__ - Step 19321: {'lr': 0.0004832935295009127, 'samples': 3709632, 'steps': 19320, 'loss/train': 1.843841791152954}2}} 11/06/2021 23:58:08 - INFO - __main__ - Step 19325: {'lr': 0.0004832858991614553, 'samples': 3710400, 'steps': 19324, 'loss/train': 1.6283198595046997}}} 11/06/2021 23:58:10 - INFO - __main__ - Step 19329: {'lr': 0.00048327826714015756, 'samples': 3711168, 'steps': 19328, 'loss/train': 3.4698245525360107}} 11/06/2021 23:58:12 - INFO - __main__ - Step 19334: {'lr': 0.000483268724748531, 'samples': 3712128, 'steps': 19333, 'loss/train': 1.5540878772735596}7}} 11/06/2021 23:58:14 - INFO - __main__ - Step 19338: {'lr': 0.00048326108894329345, 'samples': 3712896, 'steps': 19337, 'loss/train': 1.6094346046447754}} 11/06/2021 23:58:16 - INFO - __main__ - Step 19342: {'lr': 0.0004832534514563943, 'samples': 3713664, 'steps': 19341, 'loss/train': 1.700616717338562}4}} 11/06/2021 23:58:18 - INFO - __main__ - Step 19346: {'lr': 0.0004832458122878888, 'samples': 3714432, 'steps': 19345, 'loss/train': 0.8863457441329956}}} 11/06/2021 23:58:20 - INFO - __main__ - Step 19350: {'lr': 0.00048323817143783174, 'samples': 3715200, 'steps': 19349, 'loss/train': 1.5020644664764404}} 11/06/2021 23:58:22 - INFO - __main__ - Step 19355: {'lr': 0.00048322861801066265, 'samples': 3716160, 'steps': 19354, 'loss/train': 1.8485344648361206}} 11/06/2021 23:58:24 - INFO - __main__ - Step 19359: {'lr': 0.0004832209733773164, 'samples': 3716928, 'steps': 19358, 'loss/train': 1.7394636869430542}}} 11/06/2021 23:58:24 - INFO - __main__ - Step 19359: {'lr': 0.0004832209733773164, 'samples': 3716928, 'steps': 19358, 'loss/train': 1.7394636869430542}}} 11/06/2021 23:58:28 - INFO - __main__ - Step 19366: {'lr': 0.0004832075912231913, 'samples': 3718272, 'steps': 19365, 'loss/train': 1.71169114112854}2}}} 11/06/2021 23:58:30 - INFO - __main__ - Step 19370: {'lr': 0.0004831999419662037, 'samples': 3719040, 'steps': 19369, 'loss/train': 1.3964022397994995}}} 11/06/2021 23:58:32 - INFO - __main__ - Step 19375: {'lr': 0.000483190378030759, 'samples': 3720000, 'steps': 19374, 'loss/train': 1.5655479431152344}}}} 11/06/2021 23:58:32 - INFO - __main__ - Step 19375: {'lr': 0.000483190378030759, 'samples': 3720000, 'steps': 19374, 'loss/train': 1.5655479431152344}}}} 11/06/2021 23:58:36 - INFO - __main__ - Step 19383: {'lr': 0.00048317507027034913, 'samples': 3721536, 'steps': 19382, 'loss/train': 1.783972978591919}}} 11/06/2021 23:58:38 - INFO - __main__ - Step 19387: {'lr': 0.00048316741386855445, 'samples': 3722304, 'steps': 19386, 'loss/train': 1.541589379310608}}} 11/06/2021 23:58:40 - INFO - __main__ - Step 19391: {'lr': 0.0004831597557857735, 'samples': 3723072, 'steps': 19390, 'loss/train': 2.5333597660064697}}} 11/06/2021 23:58:42 - INFO - __main__ - Step 19395: {'lr': 0.00048315209602206165, 'samples': 3723840, 'steps': 19394, 'loss/train': 1.3035863637924194}} 11/06/2021 23:58:45 - INFO - __main__ - Step 19399: {'lr': 0.0004831444345774739, 'samples': 3724608, 'steps': 19398, 'loss/train': 1.9155510663986206}}} 11/06/2021 23:58:46 - INFO - __main__ - Step 19403: {'lr': 0.0004831367714520657, 'samples': 3725376, 'steps': 19402, 'loss/train': 1.216731071472168}}}} 11/06/2021 23:58:48 - INFO - __main__ - Step 19407: {'lr': 0.00048312910664589215, 'samples': 3726144, 'steps': 19406, 'loss/train': 1.7577673196792603}} 11/06/2021 23:58:48 - INFO - __main__ - Step 19407: {'lr': 0.00048312910664589215, 'samples': 3726144, 'steps': 19406, 'loss/train': 1.7577673196792603}} 11/06/2021 23:58:52 - INFO - __main__ - Step 19415: {'lr': 0.00048311377199147023, 'samples': 3727680, 'steps': 19414, 'loss/train': 1.9940916299819946}} 11/06/2021 23:58:54 - INFO - __main__ - Step 19419: {'lr': 0.0004831061021433323, 'samples': 3728448, 'steps': 19418, 'loss/train': 1.7884739637374878}}} 11/06/2021 23:58:56 - INFO - __main__ - Step 19423: {'lr': 0.0004830984306146503, 'samples': 3729216, 'steps': 19422, 'loss/train': 1.1024407148361206}}} 11/06/2021 23:59:00 - INFO - __main__ - Step 19428: {'lr': 0.0004830888388406166, 'samples': 3730176, 'steps': 19427, 'loss/train': 1.8020360469818115}}} 11/06/2021 23:59:02 - INFO - __main__ - Step 19434: {'lr': 0.0004830773252459201, 'samples': 3731328, 'steps': 19433, 'loss/train': 1.6783347129821777}}} 11/06/2021 23:59:05 - INFO - __main__ - Step 19438: {'lr': 0.00048306964741568994, 'samples': 3732096, 'steps': 19437, 'loss/train': 1.5111713409423828}} 11/06/2021 23:59:06 - INFO - __main__ - Step 19442: {'lr': 0.00048306196790517844, 'samples': 3732864, 'steps': 19441, 'loss/train': 1.871138334274292}}} 11/06/2021 23:59:08 - INFO - __main__ - Step 19446: {'lr': 0.00048305428671444083, 'samples': 3733632, 'steps': 19445, 'loss/train': 1.0744739770889282}} 11/06/2021 23:59:11 - INFO - __main__ - Step 19451: {'lr': 0.0004830446828632854, 'samples': 3734592, 'steps': 19450, 'loss/train': 1.22652006149292}82}} 11/06/2021 23:59:11 - INFO - __main__ - Step 19451: {'lr': 0.0004830446828632854, 'samples': 3734592, 'steps': 19450, 'loss/train': 1.22652006149292}82}} 11/06/2021 23:59:14 - INFO - __main__ - Step 19458: {'lr': 0.0004830312330614259, 'samples': 3735936, 'steps': 19457, 'loss/train': 1.7021613121032715}}} 11/06/2021 23:59:16 - INFO - __main__ - Step 19462: {'lr': 0.00048302354515033813, 'samples': 3736704, 'steps': 19461, 'loss/train': 1.8570998907089233}} 11/06/2021 23:59:18 - INFO - __main__ - Step 19467: {'lr': 0.00048301393289905663, 'samples': 3737664, 'steps': 19466, 'loss/train': 1.6262980699539185}} 11/06/2021 23:59:21 - INFO - __main__ - Step 19472: {'lr': 0.0004830043180229631, 'samples': 3738624, 'steps': 19471, 'loss/train': 1.8613708019256592}}} 11/06/2021 23:59:21 - INFO - __main__ - Step 19472: {'lr': 0.0004830043180229631, 'samples': 3738624, 'steps': 19471, 'loss/train': 1.8613708019256592}}} 11/06/2021 23:59:21 - INFO - __main__ - Step 19472: {'lr': 0.0004830043180229631, 'samples': 3738624, 'steps': 19471, 'loss/train': 1.8613708019256592}}} 11/06/2021 23:59:26 - INFO - __main__ - Step 19482: {'lr': 0.000482985080396773, 'samples': 3740544, 'steps': 19481, 'loss/train': 1.5462687015533447}}}} 11/06/2021 23:59:29 - INFO - __main__ - Step 19487: {'lr': 0.00048297545764689327, 'samples': 3741504, 'steps': 19486, 'loss/train': 1.5216996669769287}} 11/06/2021 23:59:29 - INFO - __main__ - Step 19487: {'lr': 0.00048297545764689327, 'samples': 3741504, 'steps': 19486, 'loss/train': 1.5216996669769287}} 11/06/2021 23:59:32 - INFO - __main__ - Step 19494: {'lr': 0.00048296198138812974, 'samples': 3742848, 'steps': 19493, 'loss/train': 1.4425673484802246}} 11/06/2021 23:59:34 - INFO - __main__ - Step 19498: {'lr': 0.00048295427835949757, 'samples': 3743616, 'steps': 19497, 'loss/train': 1.531845211982727}}} 11/06/2021 23:59:37 - INFO - __main__ - Step 19503: {'lr': 0.0004829446472119878, 'samples': 3744576, 'steps': 19502, 'loss/train': 1.3352776765823364}}} 11/06/2021 23:59:37 - INFO - __main__ - Step 19503: {'lr': 0.0004829446472119878, 'samples': 3744576, 'steps': 19502, 'loss/train': 1.3352776765823364}}} 11/06/2021 23:59:40 - INFO - __main__ - Step 19510: {'lr': 0.0004829311591971254, 'samples': 3745920, 'steps': 19509, 'loss/train': 1.8190715312957764}}} 11/06/2021 23:59:42 - INFO - __main__ - Step 19514: {'lr': 0.00048292344945102795, 'samples': 3746688, 'steps': 19513, 'loss/train': 1.670436143875122}}} 11/06/2021 23:59:45 - INFO - __main__ - Step 19519: {'lr': 0.0004829138099069991, 'samples': 3747648, 'steps': 19518, 'loss/train': 1.420883059501648}}}} 11/06/2021 23:59:45 - INFO - __main__ - Step 19519: {'lr': 0.0004829138099069991, 'samples': 3747648, 'steps': 19518, 'loss/train': 1.420883059501648}}}} 11/06/2021 23:59:49 - INFO - __main__ - Step 19527: {'lr': 0.00048289838117933505, 'samples': 3749184, 'steps': 19526, 'loss/train': 1.8481258153915405}} 11/06/2021 23:59:50 - INFO - __main__ - Step 19531: {'lr': 0.0004828906642969052, 'samples': 3749952, 'steps': 19530, 'loss/train': 1.5103822946548462}}} 11/06/2021 23:59:52 - INFO - __main__ - Step 19535: {'lr': 0.0004828829457354843, 'samples': 3750720, 'steps': 19534, 'loss/train': 1.6254938840866089}}} 11/06/2021 23:59:55 - INFO - __main__ - Step 19540: {'lr': 0.0004828732951727119, 'samples': 3751680, 'steps': 19539, 'loss/train': 1.4148067235946655}}} 11/06/2021 23:59:57 - INFO - __main__ - Step 19544: {'lr': 0.00048286557283376465, 'samples': 3752448, 'steps': 19543, 'loss/train': 1.6993317604064941}} 11/06/2021 23:59:57 - INFO - __main__ - Step 19544: {'lr': 0.00048286557283376465, 'samples': 3752448, 'steps': 19543, 'loss/train': 1.6993317604064941}} 11/07/2021 00:00:01 - INFO - __main__ - Step 19550: {'lr': 0.0004828539861775922, 'samples': 3753600, 'steps': 19549, 'loss/train': 2.087019205093384}1}} 11/07/2021 00:00:01 - INFO - __main__ - Step 19550: {'lr': 0.0004828539861775922, 'samples': 3753600, 'steps': 19549, 'loss/train': 2.087019205093384}1}} 11/07/2021 00:00:04 - INFO - __main__ - Step 19558: {'lr': 0.000482838531427185, 'samples': 3755136, 'steps': 19557, 'loss/train': 2.2403976917266846}1}} 11/07/2021 00:00:07 - INFO - __main__ - Step 19563: {'lr': 0.0004828288687984651, 'samples': 3756096, 'steps': 19562, 'loss/train': 1.5286688804626465}}} 11/07/2021 00:00:07 - INFO - __main__ - Step 19563: {'lr': 0.0004828288687984651, 'samples': 3756096, 'steps': 19562, 'loss/train': 1.5286688804626465}}} 11/07/2021 00:00:11 - INFO - __main__ - Step 19571: {'lr': 0.0004828134031372855, 'samples': 3757632, 'steps': 19570, 'loss/train': 1.9711503982543945}}} 11/07/2021 00:00:12 - INFO - __main__ - Step 19575: {'lr': 0.00048280566778901684, 'samples': 3758400, 'steps': 19574, 'loss/train': 1.4834022521972656}} 11/07/2021 00:00:15 - INFO - __main__ - Step 19579: {'lr': 0.0004827979307623699, 'samples': 3759168, 'steps': 19578, 'loss/train': 1.9877415895462036}}} 11/07/2021 00:00:17 - INFO - __main__ - Step 19584: {'lr': 0.0004827882571189268, 'samples': 3760128, 'steps': 19583, 'loss/train': 1.6104263067245483}}} 11/07/2021 00:00:17 - INFO - __main__ - Step 19584: {'lr': 0.0004827882571189268, 'samples': 3760128, 'steps': 19583, 'loss/train': 1.6104263067245483}}} 11/07/2021 00:00:17 - INFO - __main__ - Step 19584: {'lr': 0.0004827882571189268, 'samples': 3760128, 'steps': 19583, 'loss/train': 1.6104263067245483}}} 11/07/2021 00:00:22 - INFO - __main__ - Step 19595: {'lr': 0.00048276696587311525, 'samples': 3762240, 'steps': 19594, 'loss/train': 1.380395770072937}}} 11/07/2021 00:00:25 - INFO - __main__ - Step 19600: {'lr': 0.00048275728383879215, 'samples': 3763200, 'steps': 19599, 'loss/train': 1.4692695140838623}} 11/07/2021 00:00:25 - INFO - __main__ - Step 19600: {'lr': 0.00048275728383879215, 'samples': 3763200, 'steps': 19599, 'loss/train': 1.4692695140838623}} 11/07/2021 00:00:29 - INFO - __main__ - Step 19608: {'lr': 0.0004827417871303248, 'samples': 3764736, 'steps': 19607, 'loss/train': 1.5020709037780762}}} 11/07/2021 00:00:29 - INFO - __main__ - Step 19608: {'lr': 0.0004827417871303248, 'samples': 3764736, 'steps': 19607, 'loss/train': 1.5020709037780762}}} 11/07/2021 00:00:32 - INFO - __main__ - Step 19616: {'lr': 0.0004827262837101866, 'samples': 3766272, 'steps': 19615, 'loss/train': 1.532706618309021}}}} 11/07/2021 00:00:32 - INFO - __main__ - Step 19616: {'lr': 0.0004827262837101866, 'samples': 3766272, 'steps': 19615, 'loss/train': 1.532706618309021}}}} 11/07/2021 00:00:37 - INFO - __main__ - Step 19624: {'lr': 0.00048271077357882455, 'samples': 3767808, 'steps': 19623, 'loss/train': 1.7045823335647583}} 11/07/2021 00:00:39 - INFO - __main__ - Step 19628: {'lr': 0.00048270301599657436, 'samples': 3768576, 'steps': 19627, 'loss/train': 1.0141645669937134}} 11/07/2021 00:00:40 - INFO - __main__ - Step 19632: {'lr': 0.00048269525673668595, 'samples': 3769344, 'steps': 19631, 'loss/train': 1.7020254135131836}} 11/07/2021 00:00:42 - INFO - __main__ - Step 19636: {'lr': 0.00048268749579921536, 'samples': 3770112, 'steps': 19635, 'loss/train': 1.066512107849121}}} 11/07/2021 00:00:45 - INFO - __main__ - Step 19642: {'lr': 0.0004826758512476649, 'samples': 3771264, 'steps': 19641, 'loss/train': 1.6785047054290771}}} 11/07/2021 00:00:47 - INFO - __main__ - Step 19646: {'lr': 0.0004826680861164834, 'samples': 3772032, 'steps': 19645, 'loss/train': 1.3264529705047607}}} 11/07/2021 00:00:47 - INFO - __main__ - Step 19646: {'lr': 0.0004826680861164834, 'samples': 3772032, 'steps': 19645, 'loss/train': 1.3264529705047607}}} 11/07/2021 00:00:50 - INFO - __main__ - Step 19653: {'lr': 0.00048265449310073847, 'samples': 3773376, 'steps': 19652, 'loss/train': 1.2029601335525513}} 11/07/2021 00:00:52 - INFO - __main__ - Step 19657: {'lr': 0.0004826467233568791, 'samples': 3774144, 'steps': 19656, 'loss/train': 1.3596159219741821}}} 11/07/2021 00:00:55 - INFO - __main__ - Step 19662: {'lr': 0.00048263700881845346, 'samples': 3775104, 'steps': 19661, 'loss/train': 2.0011045932769775}} 11/07/2021 00:00:57 - INFO - __main__ - Step 19666: {'lr': 0.00048262923530090007, 'samples': 3775872, 'steps': 19665, 'loss/train': 1.7530802488327026}} 11/07/2021 00:00:59 - INFO - __main__ - Step 19670: {'lr': 0.00048262146010624035, 'samples': 3776640, 'steps': 19669, 'loss/train': 1.950370192527771}}} 11/07/2021 00:01:00 - INFO - __main__ - Step 19674: {'lr': 0.0004826136832345304, 'samples': 3777408, 'steps': 19673, 'loss/train': 1.9064000844955444}}} 11/07/2021 00:01:02 - INFO - __main__ - Step 19678: {'lr': 0.00048260590468582624, 'samples': 3778176, 'steps': 19677, 'loss/train': 1.5107479095458984}} 11/07/2021 00:01:05 - INFO - __main__ - Step 19683: {'lr': 0.00048259617914175846, 'samples': 3779136, 'steps': 19682, 'loss/train': 1.4910651445388794}} 11/07/2021 00:01:07 - INFO - __main__ - Step 19687: {'lr': 0.00048258839682002253, 'samples': 3779904, 'steps': 19686, 'loss/train': 0.8131615519523621}} 11/07/2021 00:01:09 - INFO - __main__ - Step 19691: {'lr': 0.0004825806128214747, 'samples': 3780672, 'steps': 19690, 'loss/train': 1.7099860906600952}}} 11/07/2021 00:01:11 - INFO - __main__ - Step 19695: {'lr': 0.000482572827146171, 'samples': 3781440, 'steps': 19694, 'loss/train': 1.8137426376342773}}}} 11/07/2021 00:01:13 - INFO - __main__ - Step 19699: {'lr': 0.00048256503979416776, 'samples': 3782208, 'steps': 19698, 'loss/train': 1.5465449094772339}} 11/07/2021 00:01:15 - INFO - __main__ - Step 19704: {'lr': 0.0004825553032463904, 'samples': 3783168, 'steps': 19703, 'loss/train': 1.48812735080719}39}} 11/07/2021 00:01:17 - INFO - __main__ - Step 19709: {'lr': 0.0004825455640789672, 'samples': 3784128, 'steps': 19708, 'loss/train': 2.0251336097717285}}} 11/07/2021 00:01:17 - INFO - __main__ - Step 19709: {'lr': 0.0004825455640789672, 'samples': 3784128, 'steps': 19708, 'loss/train': 2.0251336097717285}}} 11/07/2021 00:01:21 - INFO - __main__ - Step 19716: {'lr': 0.00048253192484377884, 'samples': 3785472, 'steps': 19715, 'loss/train': 1.6705501079559326}} 11/07/2021 00:01:22 - INFO - __main__ - Step 19720: {'lr': 0.0004825241286900238, 'samples': 3786240, 'steps': 19719, 'loss/train': 1.398619294166565}6}} 11/07/2021 00:01:25 - INFO - __main__ - Step 19724: {'lr': 0.0004825163308599203, 'samples': 3787008, 'steps': 19723, 'loss/train': 1.8848963975906372}}} 11/07/2021 00:01:25 - INFO - __main__ - Step 19724: {'lr': 0.0004825163308599203, 'samples': 3787008, 'steps': 19723, 'loss/train': 1.8848963975906372}}} 11/07/2021 00:01:25 - INFO - __main__ - Step 19724: {'lr': 0.0004825163308599203, 'samples': 3787008, 'steps': 19723, 'loss/train': 1.8848963975906372}}} 11/07/2021 00:01:30 - INFO - __main__ - Step 19735: {'lr': 0.0004824948781839225, 'samples': 3789120, 'steps': 19734, 'loss/train': 1.7068781852722168}}} 11/07/2021 00:01:33 - INFO - __main__ - Step 19740: {'lr': 0.0004824851227771453, 'samples': 3790080, 'steps': 19739, 'loss/train': 1.4696168899536133}}} 11/07/2021 00:01:33 - INFO - __main__ - Step 19740: {'lr': 0.0004824851227771453, 'samples': 3790080, 'steps': 19739, 'loss/train': 1.4696168899536133}}} 11/07/2021 00:01:37 - INFO - __main__ - Step 19748: {'lr': 0.00048246950867912873, 'samples': 3791616, 'steps': 19747, 'loss/train': 1.7688283920288086}} 11/07/2021 00:01:39 - INFO - __main__ - Step 19752: {'lr': 0.00048246169911616015, 'samples': 3792384, 'steps': 19751, 'loss/train': 1.7890325784683228}} 11/07/2021 00:01:40 - INFO - __main__ - Step 19756: {'lr': 0.00048245388787729316, 'samples': 3793152, 'steps': 19755, 'loss/train': 1.5922755002975464}} 11/07/2021 00:01:42 - INFO - __main__ - Step 19760: {'lr': 0.0004824460749625839, 'samples': 3793920, 'steps': 19759, 'loss/train': 1.4646573066711426}}} 11/07/2021 00:01:42 - INFO - __main__ - Step 19760: {'lr': 0.0004824460749625839, 'samples': 3793920, 'steps': 19759, 'loss/train': 1.4646573066711426}}} 11/07/2021 00:01:47 - INFO - __main__ - Step 19768: {'lr': 0.00048243044410586433, 'samples': 3795456, 'steps': 19767, 'loss/train': 1.8924862146377563}} 11/07/2021 00:01:48 - INFO - __main__ - Step 19772: {'lr': 0.0004824226261639666, 'samples': 3796224, 'steps': 19771, 'loss/train': 1.2419971227645874}}} 11/07/2021 00:01:50 - INFO - __main__ - Step 19776: {'lr': 0.0004824148065464522, 'samples': 3796992, 'steps': 19775, 'loss/train': 0.828349232673645}}}} 11/07/2021 00:01:53 - INFO - __main__ - Step 19781: {'lr': 0.0004824050296683089, 'samples': 3797952, 'steps': 19780, 'loss/train': 1.6432030200958252}}} 11/07/2021 00:01:53 - INFO - __main__ - Step 19781: {'lr': 0.0004824050296683089, 'samples': 3797952, 'steps': 19780, 'loss/train': 1.6432030200958252}}} 11/07/2021 00:01:57 - INFO - __main__ - Step 19789: {'lr': 0.00048238938121798313, 'samples': 3799488, 'steps': 19788, 'loss/train': 1.7748910188674927}} 11/07/2021 00:01:59 - INFO - __main__ - Step 19793: {'lr': 0.0004823815544797265, 'samples': 3800256, 'steps': 19792, 'loss/train': 1.7088088989257812}}} 11/07/2021 00:02:01 - INFO - __main__ - Step 19797: {'lr': 0.0004823737260661491, 'samples': 3801024, 'steps': 19796, 'loss/train': 2.3506391048431396}}} 11/07/2021 00:02:03 - INFO - __main__ - Step 19802: {'lr': 0.00048236393819334363, 'samples': 3801984, 'steps': 19801, 'loss/train': 1.7285722494125366}} 11/07/2021 00:02:05 - INFO - __main__ - Step 19806: {'lr': 0.0004823561060105011, 'samples': 3802752, 'steps': 19805, 'loss/train': 1.6271388530731201}}} 11/07/2021 00:02:05 - INFO - __main__ - Step 19806: {'lr': 0.0004823561060105011, 'samples': 3802752, 'steps': 19805, 'loss/train': 1.6271388530731201}}} 11/07/2021 00:02:08 - INFO - __main__ - Step 19813: {'lr': 0.0004823423956597617, 'samples': 3804096, 'steps': 19812, 'loss/train': 1.3268944025039673}}} 11/07/2021 00:02:11 - INFO - __main__ - Step 19818: {'lr': 0.0004823325994113761, 'samples': 3805056, 'steps': 19817, 'loss/train': 1.7103739976882935}}} 11/07/2021 00:02:13 - INFO - __main__ - Step 19822: {'lr': 0.0004823247605283236, 'samples': 3805824, 'steps': 19821, 'loss/train': 1.4720865488052368}}} 11/07/2021 00:02:15 - INFO - __main__ - Step 19826: {'lr': 0.00048231691997035987, 'samples': 3806592, 'steps': 19825, 'loss/train': 1.6825331449508667}} 11/07/2021 00:02:16 - INFO - __main__ - Step 19830: {'lr': 0.0004823090777375414, 'samples': 3807360, 'steps': 19829, 'loss/train': 2.1048223972320557}}} 11/07/2021 00:02:19 - INFO - __main__ - Step 19834: {'lr': 0.0004823012338299248, 'samples': 3808128, 'steps': 19833, 'loss/train': 1.5451176166534424}}} 11/07/2021 00:02:21 - INFO - __main__ - Step 19839: {'lr': 0.00048229142659030527, 'samples': 3809088, 'steps': 19838, 'loss/train': 1.6088569164276123}} 11/07/2021 00:02:23 - INFO - __main__ - Step 19843: {'lr': 0.00048228357891459954, 'samples': 3809856, 'steps': 19842, 'loss/train': 1.5152689218521118}} 11/07/2021 00:02:25 - INFO - __main__ - Step 19847: {'lr': 0.0004822757295642795, 'samples': 3810624, 'steps': 19846, 'loss/train': 1.0797271728515625}}} 11/07/2021 00:02:27 - INFO - __main__ - Step 19851: {'lr': 0.0004822678785394017, 'samples': 3811392, 'steps': 19850, 'loss/train': 0.8418163657188416}}} 11/07/2021 00:02:29 - INFO - __main__ - Step 19855: {'lr': 0.00048226002584002276, 'samples': 3812160, 'steps': 19854, 'loss/train': 1.4608242511749268}} 11/07/2021 00:02:31 - INFO - __main__ - Step 19859: {'lr': 0.0004822521714661993, 'samples': 3812928, 'steps': 19858, 'loss/train': 1.6047289371490479}}} 11/07/2021 00:02:33 - INFO - __main__ - Step 19863: {'lr': 0.00048224431541798784, 'samples': 3813696, 'steps': 19862, 'loss/train': 1.8354872465133667}} 11/07/2021 00:02:35 - INFO - __main__ - Step 19868: {'lr': 0.0004822344930032019, 'samples': 3814656, 'steps': 19867, 'loss/train': 1.6406512260437012}}} 11/07/2021 00:02:35 - INFO - __main__ - Step 19868: {'lr': 0.0004822344930032019, 'samples': 3814656, 'steps': 19867, 'loss/train': 1.6406512260437012}}} 11/07/2021 00:02:39 - INFO - __main__ - Step 19874: {'lr': 0.00048222270265230627, 'samples': 3815808, 'steps': 19873, 'loss/train': 2.2236526012420654}} 11/07/2021 00:02:41 - INFO - __main__ - Step 19879: {'lr': 0.00048221287448239604, 'samples': 3816768, 'steps': 19878, 'loss/train': 1.5209888219833374}} 11/07/2021 00:02:43 - INFO - __main__ - Step 19883: {'lr': 0.0004822050100630949, 'samples': 3817536, 'steps': 19882, 'loss/train': 1.7728437185287476}}} 11/07/2021 00:02:45 - INFO - __main__ - Step 19887: {'lr': 0.00048219714396974587, 'samples': 3818304, 'steps': 19886, 'loss/train': 1.7361443042755127}} 11/07/2021 00:02:47 - INFO - __main__ - Step 19891: {'lr': 0.00048218927620240557, 'samples': 3819072, 'steps': 19890, 'loss/train': 2.8923377990722656}} 11/07/2021 00:02:49 - INFO - __main__ - Step 19896: {'lr': 0.00048217943913926646, 'samples': 3820032, 'steps': 19895, 'loss/train': 1.4981663227081299}} 11/07/2021 00:02:51 - INFO - __main__ - Step 19900: {'lr': 0.0004821715676056534, 'samples': 3820800, 'steps': 19899, 'loss/train': 1.760026216506958}9}} 11/07/2021 00:02:51 - INFO - __main__ - Step 19900: {'lr': 0.0004821715676056534, 'samples': 3820800, 'steps': 19899, 'loss/train': 1.760026216506958}9}} 11/07/2021 00:02:55 - INFO - __main__ - Step 19908: {'lr': 0.0004821558195170636, 'samples': 3822336, 'steps': 19907, 'loss/train': 1.3953006267547607}}} 11/07/2021 00:02:57 - INFO - __main__ - Step 19912: {'lr': 0.00048214794296220045, 'samples': 3823104, 'steps': 19911, 'loss/train': 1.2514482736587524}} 11/07/2021 00:02:59 - INFO - __main__ - Step 19916: {'lr': 0.0004821400647337007, 'samples': 3823872, 'steps': 19915, 'loss/train': 1.1114486455917358}}} 11/07/2021 00:02:59 - INFO - __main__ - Step 19916: {'lr': 0.0004821400647337007, 'samples': 3823872, 'steps': 19915, 'loss/train': 1.1114486455917358}}} 11/07/2021 00:03:03 - INFO - __main__ - Step 19924: {'lr': 0.00048212430325601905, 'samples': 3825408, 'steps': 19923, 'loss/train': 1.4564239978790283}} 11/07/2021 00:03:05 - INFO - __main__ - Step 19928: {'lr': 0.00048211642000695065, 'samples': 3826176, 'steps': 19927, 'loss/train': 1.8800297975540161}} 11/07/2021 00:03:07 - INFO - __main__ - Step 19932: {'lr': 0.0004821085350844731, 'samples': 3826944, 'steps': 19931, 'loss/train': 1.8413258790969849}}} 11/07/2021 00:03:09 - INFO - __main__ - Step 19937: {'lr': 0.000482098676578231, 'samples': 3827904, 'steps': 19936, 'loss/train': 1.7007976770401}849}}} 11/07/2021 00:03:11 - INFO - __main__ - Step 19941: {'lr': 0.00048209078789079055, 'samples': 3828672, 'steps': 19940, 'loss/train': 1.5990064144134521}} 11/07/2021 00:03:14 - INFO - __main__ - Step 19945: {'lr': 0.0004820828975301256, 'samples': 3829440, 'steps': 19944, 'loss/train': 1.8021361827850342}}} 11/07/2021 00:03:16 - INFO - __main__ - Step 19949: {'lr': 0.0004820750054962931, 'samples': 3830208, 'steps': 19948, 'loss/train': 1.7571378946304321}}} 11/07/2021 00:03:17 - INFO - __main__ - Step 19953: {'lr': 0.00048206711178934994, 'samples': 3830976, 'steps': 19952, 'loss/train': 1.4717843532562256}} 11/07/2021 00:03:19 - INFO - __main__ - Step 19957: {'lr': 0.000482059216409353, 'samples': 3831744, 'steps': 19956, 'loss/train': 1.5581938028335571}6}} 11/07/2021 00:03:22 - INFO - __main__ - Step 19962: {'lr': 0.00048204934483171176, 'samples': 3832704, 'steps': 19961, 'loss/train': 1.6191062927246094}} 11/07/2021 00:03:24 - INFO - __main__ - Step 19966: {'lr': 0.000482041445687552, 'samples': 3833472, 'steps': 19965, 'loss/train': 1.8509191274642944}4}} 11/07/2021 00:03:26 - INFO - __main__ - Step 19970: {'lr': 0.00048203354487052363, 'samples': 3834240, 'steps': 19969, 'loss/train': 1.5672111511230469}} 11/07/2021 00:03:27 - INFO - __main__ - Step 19974: {'lr': 0.0004820256423806835, 'samples': 3835008, 'steps': 19973, 'loss/train': 1.4316328763961792}}} 11/07/2021 00:03:29 - INFO - __main__ - Step 19978: {'lr': 0.0004820177382180885, 'samples': 3835776, 'steps': 19977, 'loss/train': 1.506314992904663}}}} 11/07/2021 00:03:29 - INFO - __main__ - Step 19978: {'lr': 0.0004820177382180885, 'samples': 3835776, 'steps': 19977, 'loss/train': 1.506314992904663}}}} 11/07/2021 00:03:29 - INFO - __main__ - Step 19978: {'lr': 0.0004820177382180885, 'samples': 3835776, 'steps': 19977, 'loss/train': 1.506314992904663}}}} 11/07/2021 00:03:35 - INFO - __main__ - Step 19989: {'lr': 0.00048199599314627576, 'samples': 3837888, 'steps': 19988, 'loss/train': 0.23624520003795624} 11/07/2021 00:03:38 - INFO - __main__ - Step 19994: {'lr': 0.0004819861048413006, 'samples': 3838848, 'steps': 19993, 'loss/train': 2.013521671295166}24} 11/07/2021 00:03:38 - INFO - __main__ - Step 19994: {'lr': 0.0004819861048413006, 'samples': 3838848, 'steps': 19993, 'loss/train': 2.013521671295166}24} 11/07/2021 00:03:41 - INFO - __main__ - Step 20000: {'lr': 0.00048197423542587143, 'samples': 3840000, 'steps': 19999, 'loss/train': 1.3565977811813354}} 11/07/2021 00:03:43 - INFO - __main__ - Step 20005: {'lr': 0.0004819643413719287, 'samples': 3840960, 'steps': 20004, 'loss/train': 1.347116470336914}4}} 11/07/2021 00:03:43 - INFO - __main__ - Step 20005: {'lr': 0.0004819643413719287, 'samples': 3840960, 'steps': 20004, 'loss/train': 1.347116470336914}4}} 11/07/2021 00:03:47 - INFO - __main__ - Step 20013: {'lr': 0.0004819485054506498, 'samples': 3842496, 'steps': 20012, 'loss/train': 1.8437927961349487}}} 11/07/2021 00:03:49 - INFO - __main__ - Step 20017: {'lr': 0.0004819405849816839, 'samples': 3843264, 'steps': 20016, 'loss/train': 1.6164615154266357}}} 11/07/2021 00:03:51 - INFO - __main__ - Step 20021: {'lr': 0.00048193266284057634, 'samples': 3844032, 'steps': 20020, 'loss/train': 1.0968644618988037}} 11/07/2021 00:03:51 - INFO - __main__ - Step 20021: {'lr': 0.00048193266284057634, 'samples': 3844032, 'steps': 20020, 'loss/train': 1.0968644618988037}} 11/07/2021 00:03:55 - INFO - __main__ - Step 20029: {'lr': 0.00048191681354216504, 'samples': 3845568, 'steps': 20028, 'loss/train': 1.8787384033203125}} 11/07/2021 00:03:57 - INFO - __main__ - Step 20033: {'lr': 0.00048190888638497553, 'samples': 3846336, 'steps': 20032, 'loss/train': 1.8794498443603516}} 11/07/2021 00:03:59 - INFO - __main__ - Step 20037: {'lr': 0.0004819009575558729, 'samples': 3847104, 'steps': 20036, 'loss/train': 1.7073240280151367}}} 11/07/2021 00:04:01 - INFO - __main__ - Step 20042: {'lr': 0.000481891044168454, 'samples': 3848064, 'steps': 20041, 'loss/train': 1.7178666591644287}}}} 11/07/2021 00:04:03 - INFO - __main__ - Step 20046: {'lr': 0.0004818831115777561, 'samples': 3848832, 'steps': 20045, 'loss/train': 1.6248598098754883}}} 11/07/2021 00:04:06 - INFO - __main__ - Step 20050: {'lr': 0.0004818751773153309, 'samples': 3849600, 'steps': 20049, 'loss/train': 1.623940348625183}}}} 11/07/2021 00:04:08 - INFO - __main__ - Step 20054: {'lr': 0.00048186724138123577, 'samples': 3850368, 'steps': 20053, 'loss/train': 1.556071162223816}}} 11/07/2021 00:04:09 - INFO - __main__ - Step 20058: {'lr': 0.0004818593037755278, 'samples': 3851136, 'steps': 20057, 'loss/train': 1.833410620689392}}}} 11/07/2021 00:04:11 - INFO - __main__ - Step 20063: {'lr': 0.0004818493794177744, 'samples': 3852096, 'steps': 20062, 'loss/train': 2.7569499015808105}}} 11/07/2021 00:04:11 - INFO - __main__ - Step 20063: {'lr': 0.0004818493794177744, 'samples': 3852096, 'steps': 20062, 'loss/train': 2.7569499015808105}}} 11/07/2021 00:04:16 - INFO - __main__ - Step 20071: {'lr': 0.0004818334950130925, 'samples': 3853632, 'steps': 20070, 'loss/train': 1.792672038078308}}}} 11/07/2021 00:04:18 - INFO - __main__ - Step 20075: {'lr': 0.00048182555030366854, 'samples': 3854400, 'steps': 20074, 'loss/train': 1.7799201011657715}} 11/07/2021 00:04:19 - INFO - __main__ - Step 20079: {'lr': 0.0004818176039229324, 'samples': 3855168, 'steps': 20078, 'loss/train': 1.445941686630249}5}} 11/07/2021 00:04:21 - INFO - __main__ - Step 20083: {'lr': 0.00048180965587094125, 'samples': 3855936, 'steps': 20082, 'loss/train': 1.6008447408676147}} 11/07/2021 00:04:24 - INFO - __main__ - Step 20088: {'lr': 0.00048179971845583734, 'samples': 3856896, 'steps': 20087, 'loss/train': 1.7061606645584106}} 11/07/2021 00:04:26 - INFO - __main__ - Step 20092: {'lr': 0.00048179176664373214, 'samples': 3857664, 'steps': 20091, 'loss/train': 1.464991569519043}}} 11/07/2021 00:04:28 - INFO - __main__ - Step 20096: {'lr': 0.0004817838131605582, 'samples': 3858432, 'steps': 20095, 'loss/train': 1.6248929500579834}}} 11/07/2021 00:04:29 - INFO - __main__ - Step 20100: {'lr': 0.00048177585800637286, 'samples': 3859200, 'steps': 20099, 'loss/train': 1.5155513286590576}} 11/07/2021 00:04:31 - INFO - __main__ - Step 20104: {'lr': 0.0004817679011812336, 'samples': 3859968, 'steps': 20103, 'loss/train': 1.7872097492218018}}} 11/07/2021 00:04:34 - INFO - __main__ - Step 20109: {'lr': 0.00048175795280011775, 'samples': 3860928, 'steps': 20108, 'loss/train': 1.485747218132019}}} 11/07/2021 00:04:36 - INFO - __main__ - Step 20114: {'lr': 0.000481748001808338, 'samples': 3861888, 'steps': 20113, 'loss/train': 1.2486543655395508}}}} 11/07/2021 00:04:36 - INFO - __main__ - Step 20114: {'lr': 0.000481748001808338, 'samples': 3861888, 'steps': 20113, 'loss/train': 1.2486543655395508}}}} 11/07/2021 00:04:39 - INFO - __main__ - Step 20120: {'lr': 0.0004817360571722838, 'samples': 3863040, 'steps': 20119, 'loss/train': 1.5528236627578735}}} 11/07/2021 00:04:42 - INFO - __main__ - Step 20125: {'lr': 0.000481726100437438, 'samples': 3864000, 'steps': 20124, 'loss/train': 1.859569787979126}5}}} 11/07/2021 00:04:44 - INFO - __main__ - Step 20130: {'lr': 0.00048171614109228714, 'samples': 3864960, 'steps': 20129, 'loss/train': 1.369523048400879}}} 11/07/2021 00:04:44 - INFO - __main__ - Step 20130: {'lr': 0.00048171614109228714, 'samples': 3864960, 'steps': 20129, 'loss/train': 1.369523048400879}}} 11/07/2021 00:04:47 - INFO - __main__ - Step 20137: {'lr': 0.00048170219362397685, 'samples': 3866304, 'steps': 20136, 'loss/train': 1.84674870967865}}}} 11/07/2021 00:04:49 - INFO - __main__ - Step 20141: {'lr': 0.00048169422134523404, 'samples': 3867072, 'steps': 20140, 'loss/train': 1.5689804553985596}} 11/07/2021 00:04:52 - INFO - __main__ - Step 20146: {'lr': 0.0004816842536478608, 'samples': 3868032, 'steps': 20145, 'loss/train': 2.007317066192627}6}} 11/07/2021 00:04:54 - INFO - __main__ - Step 20151: {'lr': 0.0004816742833406538, 'samples': 3868992, 'steps': 20150, 'loss/train': 2.0192196369171143}}} 11/07/2021 00:04:54 - INFO - __main__ - Step 20151: {'lr': 0.0004816742833406538, 'samples': 3868992, 'steps': 20150, 'loss/train': 2.0192196369171143}}} 11/07/2021 00:04:57 - INFO - __main__ - Step 20158: {'lr': 0.0004816603205262572, 'samples': 3870336, 'steps': 20157, 'loss/train': 1.8653146028518677}}} 11/07/2021 00:05:00 - INFO - __main__ - Step 20162: {'lr': 0.0004816523394787372, 'samples': 3871104, 'steps': 20161, 'loss/train': 1.504123568534851}}}} 11/07/2021 00:05:02 - INFO - __main__ - Step 20167: {'lr': 0.00048164236082081713, 'samples': 3872064, 'steps': 20166, 'loss/train': 1.5199638605117798}} 11/07/2021 00:05:02 - INFO - __main__ - Step 20167: {'lr': 0.00048164236082081713, 'samples': 3872064, 'steps': 20166, 'loss/train': 1.5199638605117798}} 11/07/2021 00:05:06 - INFO - __main__ - Step 20174: {'lr': 0.00048162838631602643, 'samples': 3873408, 'steps': 20173, 'loss/train': 1.643215298652649}}} 11/07/2021 00:05:07 - INFO - __main__ - Step 20178: {'lr': 0.0004816203985885977, 'samples': 3874176, 'steps': 20177, 'loss/train': 1.7557024955749512}}} 11/07/2021 00:05:09 - INFO - __main__ - Step 20182: {'lr': 0.00048161240919133573, 'samples': 3874944, 'steps': 20181, 'loss/train': 1.5637428760528564}} 11/07/2021 00:05:12 - INFO - __main__ - Step 20187: {'lr': 0.0004816024200966431, 'samples': 3875904, 'steps': 20186, 'loss/train': 1.409131646156311}4}} 11/07/2021 00:05:14 - INFO - __main__ - Step 20191: {'lr': 0.000481594426942467, 'samples': 3876672, 'steps': 20190, 'loss/train': 1.5956690311431885}4}} 11/07/2021 00:05:16 - INFO - __main__ - Step 20195: {'lr': 0.00048158643211864495, 'samples': 3877440, 'steps': 20194, 'loss/train': 1.642877221107483}}} 11/07/2021 00:05:17 - INFO - __main__ - Step 20199: {'lr': 0.0004815784356252344, 'samples': 3878208, 'steps': 20198, 'loss/train': 1.2568089962005615}}} 11/07/2021 00:05:19 - INFO - __main__ - Step 20203: {'lr': 0.00048157043746229324, 'samples': 3878976, 'steps': 20202, 'loss/train': 1.2760602235794067}} 11/07/2021 00:05:19 - INFO - __main__ - Step 20203: {'lr': 0.00048157043746229324, 'samples': 3878976, 'steps': 20202, 'loss/train': 1.2760602235794067}} 11/07/2021 00:05:23 - INFO - __main__ - Step 20211: {'lr': 0.0004815544361280494, 'samples': 3880512, 'steps': 20210, 'loss/train': 1.3515102863311768}}} 11/07/2021 00:05:26 - INFO - __main__ - Step 20215: {'lr': 0.0004815464329568621, 'samples': 3881280, 'steps': 20214, 'loss/train': 1.4815483093261719}}} 11/07/2021 00:05:26 - INFO - __main__ - Step 20215: {'lr': 0.0004815464329568621, 'samples': 3881280, 'steps': 20214, 'loss/train': 1.4815483093261719}}} 11/07/2021 00:05:30 - INFO - __main__ - Step 20223: {'lr': 0.0004815304216066453, 'samples': 3882816, 'steps': 20222, 'loss/train': 1.10834538936615}9}}} 11/07/2021 00:05:31 - INFO - __main__ - Step 20227: {'lr': 0.0004815224134277311, 'samples': 3883584, 'steps': 20226, 'loss/train': 0.39159658551216125}} 11/07/2021 00:05:33 - INFO - __main__ - Step 20231: {'lr': 0.0004815144035796901, 'samples': 3884352, 'steps': 20230, 'loss/train': 1.4114503860473633}}} 11/07/2021 00:05:36 - INFO - __main__ - Step 20236: {'lr': 0.00048150438892251724, 'samples': 3885312, 'steps': 20235, 'loss/train': 1.5953601598739624}} 11/07/2021 00:05:38 - INFO - __main__ - Step 20240: {'lr': 0.00048149637531915215, 'samples': 3886080, 'steps': 20239, 'loss/train': 1.6984490156173706}} 11/07/2021 00:05:40 - INFO - __main__ - Step 20244: {'lr': 0.0004814883600468478, 'samples': 3886848, 'steps': 20243, 'loss/train': 1.2742701768875122}}} 11/07/2021 00:05:42 - INFO - __main__ - Step 20248: {'lr': 0.0004814803431056622, 'samples': 3887616, 'steps': 20247, 'loss/train': 1.6258333921432495}}} 11/07/2021 00:05:44 - INFO - __main__ - Step 20252: {'lr': 0.00048147232449565305, 'samples': 3888384, 'steps': 20251, 'loss/train': 1.609937310218811}}} 11/07/2021 00:05:46 - INFO - __main__ - Step 20257: {'lr': 0.00048146229888644656, 'samples': 3889344, 'steps': 20256, 'loss/train': 1.4912571907043457}} 11/07/2021 00:05:48 - INFO - __main__ - Step 20261: {'lr': 0.00048145427652179583, 'samples': 3890112, 'steps': 20260, 'loss/train': 2.0702147483825684}} 11/07/2021 00:05:50 - INFO - __main__ - Step 20265: {'lr': 0.00048144625248850955, 'samples': 3890880, 'steps': 20264, 'loss/train': 1.8271234035491943}} 11/07/2021 00:05:50 - INFO - __main__ - Step 20265: {'lr': 0.00048144625248850955, 'samples': 3890880, 'steps': 20264, 'loss/train': 1.8271234035491943}} 11/07/2021 00:05:54 - INFO - __main__ - Step 20272: {'lr': 0.00048143220641527805, 'samples': 3892224, 'steps': 20271, 'loss/train': 0.8694155812263489}} 11/07/2021 00:05:56 - INFO - __main__ - Step 20277: {'lr': 0.0004814221703774155, 'samples': 3893184, 'steps': 20276, 'loss/train': 1.9482049942016602}}} 11/07/2021 00:05:56 - INFO - __main__ - Step 20277: {'lr': 0.0004814221703774155, 'samples': 3893184, 'steps': 20276, 'loss/train': 1.9482049942016602}}} 11/07/2021 00:06:00 - INFO - __main__ - Step 20285: {'lr': 0.000481406107294569, 'samples': 3894720, 'steps': 20284, 'loss/train': 1.877029538154602}2}}} 11/07/2021 00:06:02 - INFO - __main__ - Step 20289: {'lr': 0.00048139807325068423, 'samples': 3895488, 'steps': 20288, 'loss/train': 0.5814390778541565}} 11/07/2021 00:06:04 - INFO - __main__ - Step 20293: {'lr': 0.0004813900375385691, 'samples': 3896256, 'steps': 20292, 'loss/train': 2.788299083709717}5}} 11/07/2021 00:06:06 - INFO - __main__ - Step 20298: {'lr': 0.00048137999055256444, 'samples': 3897216, 'steps': 20297, 'loss/train': 1.6339629888534546}} 11/07/2021 00:06:06 - INFO - __main__ - Step 20298: {'lr': 0.00048137999055256444, 'samples': 3897216, 'steps': 20297, 'loss/train': 1.6339629888534546}} 11/07/2021 00:06:10 - INFO - __main__ - Step 20305: {'lr': 0.00048136592039342053, 'samples': 3898560, 'steps': 20304, 'loss/train': 1.7422436475753784}} 11/07/2021 00:06:12 - INFO - __main__ - Step 20309: {'lr': 0.0004813578780089632, 'samples': 3899328, 'steps': 20308, 'loss/train': 1.690520167350769}4}} 11/07/2021 00:06:14 - INFO - __main__ - Step 20314: {'lr': 0.00048134782268285676, 'samples': 3900288, 'steps': 20313, 'loss/train': 1.62480890750885}4}} 11/07/2021 00:06:17 - INFO - __main__ - Step 20319: {'lr': 0.00048133776475070637, 'samples': 3901248, 'steps': 20318, 'loss/train': 1.7565277814865112}} 11/07/2021 00:06:17 - INFO - __main__ - Step 20319: {'lr': 0.00048133776475070637, 'samples': 3901248, 'steps': 20318, 'loss/train': 1.7565277814865112}} 11/07/2021 00:06:20 - INFO - __main__ - Step 20326: {'lr': 0.0004813236792677577, 'samples': 3902592, 'steps': 20325, 'loss/train': 1.7275755405426025}}} 11/07/2021 00:06:22 - INFO - __main__ - Step 20330: {'lr': 0.00048131562812725904, 'samples': 3903360, 'steps': 20329, 'loss/train': 1.7746409177780151}} 11/07/2021 00:06:24 - INFO - __main__ - Step 20335: {'lr': 0.00048130556185652947, 'samples': 3904320, 'steps': 20334, 'loss/train': 1.9235178232192993}} 11/07/2021 00:06:27 - INFO - __main__ - Step 20339: {'lr': 0.00048129750696393144, 'samples': 3905088, 'steps': 20338, 'loss/train': 1.6730066537857056}} 11/07/2021 00:06:29 - INFO - __main__ - Step 20343: {'lr': 0.000481289450403828, 'samples': 3905856, 'steps': 20342, 'loss/train': 1.8831218481063843}6}} 11/07/2021 00:06:30 - INFO - __main__ - Step 20347: {'lr': 0.00048128139217627725, 'samples': 3906624, 'steps': 20346, 'loss/train': 1.4639637470245361}} 11/07/2021 00:06:32 - INFO - __main__ - Step 20351: {'lr': 0.0004812733322813373, 'samples': 3907392, 'steps': 20350, 'loss/train': 1.5133908987045288}}} 11/07/2021 00:06:35 - INFO - __main__ - Step 20356: {'lr': 0.0004812632550679848, 'samples': 3908352, 'steps': 20355, 'loss/train': 1.9811984300613403}}} 11/07/2021 00:06:37 - INFO - __main__ - Step 20360: {'lr': 0.00048125519142163157, 'samples': 3909120, 'steps': 20359, 'loss/train': 1.5944174528121948}} 11/07/2021 00:06:37 - INFO - __main__ - Step 20360: {'lr': 0.00048125519142163157, 'samples': 3909120, 'steps': 20359, 'loss/train': 1.5944174528121948}} 11/07/2021 00:06:40 - INFO - __main__ - Step 20366: {'lr': 0.0004812430928261192, 'samples': 3910272, 'steps': 20365, 'loss/train': 1.7087429761886597}}} 11/07/2021 00:06:42 - INFO - __main__ - Step 20372: {'lr': 0.0004812309904796024, 'samples': 3911424, 'steps': 20371, 'loss/train': 1.5013188123703003}}} 11/07/2021 00:06:42 - INFO - __main__ - Step 20372: {'lr': 0.0004812309904796024, 'samples': 3911424, 'steps': 20371, 'loss/train': 1.5013188123703003}}} 11/07/2021 00:06:46 - INFO - __main__ - Step 20379: {'lr': 0.0004812168663347418, 'samples': 3912768, 'steps': 20378, 'loss/train': 1.824082374572754}}}} 11/07/2021 00:06:48 - INFO - __main__ - Step 20383: {'lr': 0.00048120879310278094, 'samples': 3913536, 'steps': 20382, 'loss/train': 0.8189948797225952}} 11/07/2021 00:06:50 - INFO - __main__ - Step 20388: {'lr': 0.00048119869921880656, 'samples': 3914496, 'steps': 20387, 'loss/train': 1.6406468152999878}} 11/07/2021 00:06:50 - INFO - __main__ - Step 20388: {'lr': 0.00048119869921880656, 'samples': 3914496, 'steps': 20387, 'loss/train': 1.6406468152999878}} 11/07/2021 00:06:54 - INFO - __main__ - Step 20396: {'lr': 0.0004811825435874174, 'samples': 3916032, 'steps': 20395, 'loss/train': 1.985606074333191}8}} 11/07/2021 00:06:56 - INFO - __main__ - Step 20400: {'lr': 0.0004811744632716789, 'samples': 3916800, 'steps': 20399, 'loss/train': 1.566263198852539}8}} 11/07/2021 00:06:58 - INFO - __main__ - Step 20404: {'lr': 0.000481166381289322, 'samples': 3917568, 'steps': 20403, 'loss/train': 1.3343929052352905}8}} 11/07/2021 00:06:58 - INFO - __main__ - Step 20404: {'lr': 0.000481166381289322, 'samples': 3917568, 'steps': 20403, 'loss/train': 1.3343929052352905}8}} 11/07/2021 00:07:02 - INFO - __main__ - Step 20412: {'lr': 0.0004811502123249862, 'samples': 3919104, 'steps': 20411, 'loss/train': 1.4090980291366577}}} 11/07/2021 00:07:04 - INFO - __main__ - Step 20416: {'lr': 0.000481142125343124, 'samples': 3919872, 'steps': 20415, 'loss/train': 1.5453555583953857}}}} 11/07/2021 00:07:06 - INFO - __main__ - Step 20420: {'lr': 0.00048113403669487655, 'samples': 3920640, 'steps': 20419, 'loss/train': 1.8343051671981812}} 11/07/2021 00:07:08 - INFO - __main__ - Step 20425: {'lr': 0.00048112392354130194, 'samples': 3921600, 'steps': 20424, 'loss/train': 1.6747199296951294}} 11/07/2021 00:07:08 - INFO - __main__ - Step 20425: {'lr': 0.00048112392354130194, 'samples': 3921600, 'steps': 20424, 'loss/train': 1.6747199296951294}} 11/07/2021 00:07:13 - INFO - __main__ - Step 20433: {'lr': 0.00048110773708030444, 'samples': 3923136, 'steps': 20432, 'loss/train': 1.7761973142623901}} 11/07/2021 00:07:14 - INFO - __main__ - Step 20437: {'lr': 0.0004810996413505706, 'samples': 3923904, 'steps': 20436, 'loss/train': 1.4574823379516602}}} 11/07/2021 00:07:16 - INFO - __main__ - Step 20441: {'lr': 0.00048109154395475787, 'samples': 3924672, 'steps': 20440, 'loss/train': 1.6449775695800781}} 11/07/2021 00:07:18 - INFO - __main__ - Step 20446: {'lr': 0.0004810814198671574, 'samples': 3925632, 'steps': 20445, 'loss/train': 1.5799506902694702}}} 11/07/2021 00:07:21 - INFO - __main__ - Step 20451: {'lr': 0.0004810712931765139, 'samples': 3926592, 'steps': 20450, 'loss/train': 1.519484519958496}}}} 11/07/2021 00:07:21 - INFO - __main__ - Step 20451: {'lr': 0.0004810712931765139, 'samples': 3926592, 'steps': 20450, 'loss/train': 1.519484519958496}}}} 11/07/2021 00:07:24 - INFO - __main__ - Step 20458: {'lr': 0.00048105711143671783, 'samples': 3927936, 'steps': 20457, 'loss/train': 1.615772008895874}}} 11/07/2021 00:07:26 - INFO - __main__ - Step 20462: {'lr': 0.0004810490052949488, 'samples': 3928704, 'steps': 20461, 'loss/train': 1.2645617723464966}}} 11/07/2021 00:07:28 - INFO - __main__ - Step 20467: {'lr': 0.0004810388702753342, 'samples': 3929664, 'steps': 20466, 'loss/train': 1.3941420316696167}}} 11/07/2021 00:07:28 - INFO - __main__ - Step 20467: {'lr': 0.0004810388702753342, 'samples': 3929664, 'steps': 20466, 'loss/train': 1.3941420316696167}}} 11/07/2021 00:07:33 - INFO - __main__ - Step 20475: {'lr': 0.0004810226488306659, 'samples': 3931200, 'steps': 20474, 'loss/train': 1.388344407081604}}}} 11/07/2021 00:07:34 - INFO - __main__ - Step 20479: {'lr': 0.00048101453561001667, 'samples': 3931968, 'steps': 20478, 'loss/train': 2.0187954902648926}} 11/07/2021 00:07:36 - INFO - __main__ - Step 20483: {'lr': 0.0004810064207239021, 'samples': 3932736, 'steps': 20482, 'loss/train': 1.6259307861328125}}} 11/07/2021 00:07:39 - INFO - __main__ - Step 20488: {'lr': 0.00048099627477428744, 'samples': 3933696, 'steps': 20487, 'loss/train': 1.9467196464538574}} 11/07/2021 00:07:41 - INFO - __main__ - Step 20492: {'lr': 0.0004809881561410897, 'samples': 3934464, 'steps': 20491, 'loss/train': 1.471197485923767}4}} 11/07/2021 00:07:43 - INFO - __main__ - Step 20496: {'lr': 0.00048098003584261684, 'samples': 3935232, 'steps': 20495, 'loss/train': 1.3832452297210693}} 11/07/2021 00:07:45 - INFO - __main__ - Step 20500: {'lr': 0.0004809719138789273, 'samples': 3936000, 'steps': 20499, 'loss/train': 1.6991946697235107}}} 11/07/2021 00:07:46 - INFO - __main__ - Step 20504: {'lr': 0.0004809637902500797, 'samples': 3936768, 'steps': 20503, 'loss/train': 1.8379428386688232}}} 11/07/2021 00:07:49 - INFO - __main__ - Step 20509: {'lr': 0.0004809536333724809, 'samples': 3937728, 'steps': 20508, 'loss/train': 1.3580818176269531}}} 11/07/2021 00:07:49 - INFO - __main__ - Step 20509: {'lr': 0.0004809536333724809, 'samples': 3937728, 'steps': 20508, 'loss/train': 1.3580818176269531}}} 11/07/2021 00:07:53 - INFO - __main__ - Step 20516: {'lr': 0.00048093940937317414, 'samples': 3939072, 'steps': 20515, 'loss/train': 0.4304567873477936}} 11/07/2021 00:07:54 - INFO - __main__ - Step 20520: {'lr': 0.00048093127908428, 'samples': 3939840, 'steps': 20519, 'loss/train': 1.905431866645813}936}} 11/07/2021 00:07:56 - INFO - __main__ - Step 20524: {'lr': 0.0004809231471305208, 'samples': 3940608, 'steps': 20523, 'loss/train': 1.7005736827850342}}} 11/07/2021 00:07:58 - INFO - __main__ - Step 20528: {'lr': 0.00048091501351195495, 'samples': 3941376, 'steps': 20527, 'loss/train': 1.3537952899932861}} 11/07/2021 00:07:58 - INFO - __main__ - Step 20528: {'lr': 0.00048091501351195495, 'samples': 3941376, 'steps': 20527, 'loss/train': 1.3537952899932861}} 11/07/2021 00:08:02 - INFO - __main__ - Step 20536: {'lr': 0.0004808987412806384, 'samples': 3942912, 'steps': 20535, 'loss/train': 1.644826889038086}1}} 11/07/2021 00:08:04 - INFO - __main__ - Step 20540: {'lr': 0.000480890602668005, 'samples': 3943680, 'steps': 20539, 'loss/train': 1.3758116960525513}1}} 11/07/2021 00:08:07 - INFO - __main__ - Step 20545: {'lr': 0.0004808804270614159, 'samples': 3944640, 'steps': 20544, 'loss/train': 1.7366575002670288}}} 11/07/2021 00:08:09 - INFO - __main__ - Step 20549: {'lr': 0.00048087228470357823, 'samples': 3945408, 'steps': 20548, 'loss/train': 1.7939064502716064}} 11/07/2021 00:08:11 - INFO - __main__ - Step 20553: {'lr': 0.00048086414068130077, 'samples': 3946176, 'steps': 20552, 'loss/train': 1.5216528177261353}} 11/07/2021 00:08:12 - INFO - __main__ - Step 20557: {'lr': 0.00048085599499464216, 'samples': 3946944, 'steps': 20556, 'loss/train': 1.8088922500610352}} 11/07/2021 00:08:14 - INFO - __main__ - Step 20561: {'lr': 0.0004808478476436612, 'samples': 3947712, 'steps': 20560, 'loss/train': 0.45990151166915894}} 11/07/2021 00:08:14 - INFO - __main__ - Step 20561: {'lr': 0.0004808478476436612, 'samples': 3947712, 'steps': 20560, 'loss/train': 0.45990151166915894}} 11/07/2021 00:08:19 - INFO - __main__ - Step 20569: {'lr': 0.0004808315479489671, 'samples': 3949248, 'steps': 20568, 'loss/train': 1.7102299928665161}}} 11/07/2021 00:08:20 - INFO - __main__ - Step 20573: {'lr': 0.00048082339560537145, 'samples': 3950016, 'steps': 20572, 'loss/train': 1.6746459007263184}} 11/07/2021 00:08:22 - INFO - __main__ - Step 20577: {'lr': 0.0004808152415976885, 'samples': 3950784, 'steps': 20576, 'loss/train': 1.443601369857788}4}} 11/07/2021 00:08:25 - INFO - __main__ - Step 20582: {'lr': 0.0004808050467480515, 'samples': 3951744, 'steps': 20581, 'loss/train': 1.2447943687438965}}} 11/07/2021 00:08:27 - INFO - __main__ - Step 20587: {'lr': 0.0004807948492984846, 'samples': 3952704, 'steps': 20586, 'loss/train': 2.091362476348877}}}} 11/07/2021 00:08:29 - INFO - __main__ - Step 20591: {'lr': 0.00048078668946695887, 'samples': 3953472, 'steps': 20590, 'loss/train': 1.240483283996582}}} 11/07/2021 00:08:31 - INFO - __main__ - Step 20595: {'lr': 0.00048077852797161034, 'samples': 3954240, 'steps': 20594, 'loss/train': 1.3747129440307617}} 11/07/2021 00:08:33 - INFO - __main__ - Step 20599: {'lr': 0.000480770364812498, 'samples': 3955008, 'steps': 20598, 'loss/train': 1.6940776109695435}7}} 11/07/2021 00:08:35 - INFO - __main__ - Step 20603: {'lr': 0.00048076219998968055, 'samples': 3955776, 'steps': 20602, 'loss/train': 2.141200304031372}}} 11/07/2021 00:08:37 - INFO - __main__ - Step 20607: {'lr': 0.0004807540335032169, 'samples': 3956544, 'steps': 20606, 'loss/train': 1.51893949508667}2}}} 11/07/2021 00:08:37 - INFO - __main__ - Step 20607: {'lr': 0.0004807540335032169, 'samples': 3956544, 'steps': 20606, 'loss/train': 1.51893949508667}2}}} 11/07/2021 00:08:40 - INFO - __main__ - Step 20614: {'lr': 0.0004807397381489341, 'samples': 3957888, 'steps': 20613, 'loss/train': 1.6169627904891968}}} 11/07/2021 00:08:42 - INFO - __main__ - Step 20618: {'lr': 0.0004807315670877471, 'samples': 3958656, 'steps': 20617, 'loss/train': 1.8260729312896729}}} 11/07/2021 00:08:45 - INFO - __main__ - Step 20623: {'lr': 0.0004807213509220784, 'samples': 3959616, 'steps': 20622, 'loss/train': 1.9633917808532715}}} 11/07/2021 00:08:47 - INFO - __main__ - Step 20628: {'lr': 0.00048071113215742263, 'samples': 3960576, 'steps': 20627, 'loss/train': 1.8338042497634888}} 11/07/2021 00:08:49 - INFO - __main__ - Step 20632: {'lr': 0.00048070295527450474, 'samples': 3961344, 'steps': 20631, 'loss/train': 1.6593570709228516}} 11/07/2021 00:08:51 - INFO - __main__ - Step 20636: {'lr': 0.0004806947767283678, 'samples': 3962112, 'steps': 20635, 'loss/train': 1.4025685787200928}}} 11/07/2021 00:08:53 - INFO - __main__ - Step 20640: {'lr': 0.00048068659651907076, 'samples': 3962880, 'steps': 20639, 'loss/train': 1.4923686981201172}} 11/07/2021 00:08:55 - INFO - __main__ - Step 20644: {'lr': 0.0004806784146466726, 'samples': 3963648, 'steps': 20643, 'loss/train': 1.530687928199768}2}} 11/07/2021 00:08:55 - INFO - __main__ - Step 20644: {'lr': 0.0004806784146466726, 'samples': 3963648, 'steps': 20643, 'loss/train': 1.530687928199768}2}} 11/07/2021 00:08:59 - INFO - __main__ - Step 20650: {'lr': 0.00048066613871988967, 'samples': 3964800, 'steps': 20649, 'loss/train': 0.8895652890205383}} 11/07/2021 00:09:01 - INFO - __main__ - Step 20656: {'lr': 0.00048065385905146114, 'samples': 3965952, 'steps': 20655, 'loss/train': 1.5131564140319824}} 11/07/2021 00:09:03 - INFO - __main__ - Step 20660: {'lr': 0.0004806456705272484, 'samples': 3966720, 'steps': 20659, 'loss/train': 1.0221384763717651}}} 11/07/2021 00:09:03 - INFO - __main__ - Step 20660: {'lr': 0.0004806456705272484, 'samples': 3966720, 'steps': 20659, 'loss/train': 1.0221384763717651}}} 11/07/2021 00:09:07 - INFO - __main__ - Step 20667: {'lr': 0.00048063133660878455, 'samples': 3968064, 'steps': 20666, 'loss/train': 2.1483118534088135}} 11/07/2021 00:09:09 - INFO - __main__ - Step 20672: {'lr': 0.00048062109497800997, 'samples': 3969024, 'steps': 20671, 'loss/train': 1.9477202892303467}} 11/07/2021 00:09:11 - INFO - __main__ - Step 20676: {'lr': 0.0004806128998029272, 'samples': 3969792, 'steps': 20675, 'loss/train': 1.3123760223388672}}} 11/07/2021 00:09:13 - INFO - __main__ - Step 20680: {'lr': 0.0004806047029652747, 'samples': 3970560, 'steps': 20679, 'loss/train': 1.983790636062622}}}} 11/07/2021 00:09:15 - INFO - __main__ - Step 20684: {'lr': 0.00048059650446511136, 'samples': 3971328, 'steps': 20683, 'loss/train': 1.7126437425613403}} 11/07/2021 00:09:17 - INFO - __main__ - Step 20688: {'lr': 0.0004805883043024965, 'samples': 3972096, 'steps': 20687, 'loss/train': 1.5301727056503296}}} 11/07/2021 00:09:19 - INFO - __main__ - Step 20693: {'lr': 0.0004805780517614954, 'samples': 3973056, 'steps': 20692, 'loss/train': 1.8313792943954468}}} 11/07/2021 00:09:21 - INFO - __main__ - Step 20697: {'lr': 0.00048056984785858046, 'samples': 3973824, 'steps': 20696, 'loss/train': 1.9138669967651367}} 11/07/2021 00:09:24 - INFO - __main__ - Step 20701: {'lr': 0.00048056164229340613, 'samples': 3974592, 'steps': 20700, 'loss/train': 1.3749315738677979}} 11/07/2021 00:09:24 - INFO - __main__ - Step 20701: {'lr': 0.00048056164229340613, 'samples': 3974592, 'steps': 20700, 'loss/train': 1.3749315738677979}} 11/07/2021 00:09:27 - INFO - __main__ - Step 20708: {'lr': 0.00048054727855471717, 'samples': 3975936, 'steps': 20707, 'loss/train': 1.785535216331482}}} 11/07/2021 00:09:30 - INFO - __main__ - Step 20714: {'lr': 0.0004805349627273253, 'samples': 3977088, 'steps': 20713, 'loss/train': 1.3053054809570312}}} 11/07/2021 00:09:32 - INFO - __main__ - Step 20718: {'lr': 0.00048052675009820837, 'samples': 3977856, 'steps': 20717, 'loss/train': 1.3425061702728271}} 11/07/2021 00:09:34 - INFO - __main__ - Step 20722: {'lr': 0.0004805185358071428, 'samples': 3978624, 'steps': 20721, 'loss/train': 1.3219795227050781}}} 11/07/2021 00:09:36 - INFO - __main__ - Step 20726: {'lr': 0.00048051031985418764, 'samples': 3979392, 'steps': 20725, 'loss/train': 2.1667885780334473}} 11/07/2021 00:09:37 - INFO - __main__ - Step 20730: {'lr': 0.0004805021022394022, 'samples': 3980160, 'steps': 20729, 'loss/train': 1.715214729309082}3}} 11/07/2021 00:09:39 - INFO - __main__ - Step 20734: {'lr': 0.00048049388296284576, 'samples': 3980928, 'steps': 20733, 'loss/train': 1.0420160293579102}} 11/07/2021 00:09:39 - INFO - __main__ - Step 20734: {'lr': 0.00048049388296284576, 'samples': 3980928, 'steps': 20733, 'loss/train': 1.0420160293579102}} 11/07/2021 00:09:44 - INFO - __main__ - Step 20742: {'lr': 0.0004804774394246567, 'samples': 3982464, 'steps': 20741, 'loss/train': 0.8025193214416504}}} 11/07/2021 00:09:46 - INFO - __main__ - Step 20746: {'lr': 0.0004804692151631427, 'samples': 3983232, 'steps': 20745, 'loss/train': 1.8971871137619019}}} 11/07/2021 00:09:47 - INFO - __main__ - Step 20750: {'lr': 0.00048046098924009467, 'samples': 3984000, 'steps': 20749, 'loss/train': 1.5476644039154053}} 11/07/2021 00:09:49 - INFO - __main__ - Step 20754: {'lr': 0.0004804527616555721, 'samples': 3984768, 'steps': 20753, 'loss/train': 1.7650110721588135}}} 11/07/2021 00:09:52 - INFO - __main__ - Step 20759: {'lr': 0.00048044247483856043, 'samples': 3985728, 'steps': 20758, 'loss/train': 1.6131950616836548}} 11/07/2021 00:09:52 - INFO - __main__ - Step 20759: {'lr': 0.00048044247483856043, 'samples': 3985728, 'steps': 20758, 'loss/train': 1.6131950616836548}} 11/07/2021 00:09:55 - INFO - __main__ - Step 20766: {'lr': 0.0004804280689337496, 'samples': 3987072, 'steps': 20765, 'loss/train': 1.503300428390503}8}} 11/07/2021 00:09:57 - INFO - __main__ - Step 20770: {'lr': 0.0004804198347039216, 'samples': 3987840, 'steps': 20769, 'loss/train': 1.3138035535812378}}} 11/07/2021 00:09:59 - INFO - __main__ - Step 20775: {'lr': 0.0004804095395806122, 'samples': 3988800, 'steps': 20774, 'loss/train': 1.7506057024002075}}} 11/07/2021 00:10:02 - INFO - __main__ - Step 20779: {'lr': 0.00048040130161321724, 'samples': 3989568, 'steps': 20778, 'loss/train': 1.8434571027755737}} 11/07/2021 00:10:04 - INFO - __main__ - Step 20783: {'lr': 0.00048039306198477817, 'samples': 3990336, 'steps': 20782, 'loss/train': 1.4748272895812988}} 11/07/2021 00:10:05 - INFO - __main__ - Step 20787: {'lr': 0.00048038482069535406, 'samples': 3991104, 'steps': 20786, 'loss/train': 1.664506435394287}}} 11/07/2021 00:10:07 - INFO - __main__ - Step 20791: {'lr': 0.0004803765777450044, 'samples': 3991872, 'steps': 20790, 'loss/train': 2.179063081741333}}}} 11/07/2021 00:10:07 - INFO - __main__ - Step 20791: {'lr': 0.0004803765777450044, 'samples': 3991872, 'steps': 20790, 'loss/train': 2.179063081741333}}}} 11/07/2021 00:10:12 - INFO - __main__ - Step 20799: {'lr': 0.00048036008686176636, 'samples': 3993408, 'steps': 20798, 'loss/train': 1.1961723566055298}} 11/07/2021 00:10:13 - INFO - __main__ - Step 20803: {'lr': 0.00048035183892899676, 'samples': 3994176, 'steps': 20802, 'loss/train': 1.7721303701400757}} 11/07/2021 00:10:15 - INFO - __main__ - Step 20807: {'lr': 0.0004803435893355394, 'samples': 3994944, 'steps': 20806, 'loss/train': 1.882066011428833}7}} 11/07/2021 00:10:15 - INFO - __main__ - Step 20807: {'lr': 0.0004803435893355394, 'samples': 3994944, 'steps': 20806, 'loss/train': 1.882066011428833}7}} 11/07/2021 00:10:19 - INFO - __main__ - Step 20815: {'lr': 0.00048032708516679946, 'samples': 3996480, 'steps': 20814, 'loss/train': 1.8684601783752441}} 11/07/2021 00:10:21 - INFO - __main__ - Step 20819: {'lr': 0.00048031883059163576, 'samples': 3997248, 'steps': 20818, 'loss/train': 1.5640259981155396}} 11/07/2021 00:10:23 - INFO - __main__ - Step 20823: {'lr': 0.00048031057435602234, 'samples': 3998016, 'steps': 20822, 'loss/train': 1.4730157852172852}} 11/07/2021 00:10:25 - INFO - __main__ - Step 20827: {'lr': 0.00048030231646001867, 'samples': 3998784, 'steps': 20826, 'loss/train': 1.6759287118911743}} 11/07/2021 00:10:27 - INFO - __main__ - Step 20831: {'lr': 0.0004802940569036842, 'samples': 3999552, 'steps': 20830, 'loss/train': 2.0138940811157227}}} 11/07/2021 00:10:29 - INFO - __main__ - Step 20835: {'lr': 0.0004802857956870786, 'samples': 4000320, 'steps': 20834, 'loss/train': 1.5970216989517212}}} 11/07/2021 00:10:31 - INFO - __main__ - Step 20839: {'lr': 0.00048027753281026144, 'samples': 4001088, 'steps': 20838, 'loss/train': 1.6905814409255981}} 11/07/2021 00:10:33 - INFO - __main__ - Step 20843: {'lr': 0.0004802692682732922, 'samples': 4001856, 'steps': 20842, 'loss/train': 1.7693142890930176}}} 11/07/2021 00:10:35 - INFO - __main__ - Step 20848: {'lr': 0.0004802589352675826, 'samples': 4002816, 'steps': 20847, 'loss/train': 1.9966727495193481}}} 11/07/2021 00:10:37 - INFO - __main__ - Step 20852: {'lr': 0.0004802506669954891, 'samples': 4003584, 'steps': 20851, 'loss/train': 1.6507529020309448}}} 11/07/2021 00:10:37 - INFO - __main__ - Step 20852: {'lr': 0.0004802506669954891, 'samples': 4003584, 'steps': 20851, 'loss/train': 1.6507529020309448}}} 11/07/2021 00:10:41 - INFO - __main__ - Step 20859: {'lr': 0.0004802361935250865, 'samples': 4004928, 'steps': 20858, 'loss/train': 1.6424322128295898}}} 11/07/2021 00:10:43 - INFO - __main__ - Step 20863: {'lr': 0.00048022792068825107, 'samples': 4005696, 'steps': 20862, 'loss/train': 1.2037049531936646}} 11/07/2021 00:10:46 - INFO - __main__ - Step 20869: {'lr': 0.000480215508320902, 'samples': 4006848, 'steps': 20868, 'loss/train': 1.958810806274414}46}} 11/07/2021 00:10:46 - INFO - __main__ - Step 20869: {'lr': 0.000480215508320902, 'samples': 4006848, 'steps': 20868, 'loss/train': 1.958810806274414}46}} 11/07/2021 00:10:49 - INFO - __main__ - Step 20876: {'lr': 0.00048020102250588976, 'samples': 4008192, 'steps': 20875, 'loss/train': 1.2552753686904907}} 11/07/2021 00:10:51 - INFO - __main__ - Step 20880: {'lr': 0.0004801927426153402, 'samples': 4008960, 'steps': 20879, 'loss/train': 1.5084853172302246}}} 11/07/2021 00:10:53 - INFO - __main__ - Step 20884: {'lr': 0.0004801844610652499, 'samples': 4009728, 'steps': 20883, 'loss/train': 1.8874273300170898}}} 11/07/2021 00:10:56 - INFO - __main__ - Step 20890: {'lr': 0.00048017203562860614, 'samples': 4010880, 'steps': 20889, 'loss/train': 1.7974549531936646}} 11/07/2021 00:10:58 - INFO - __main__ - Step 20894: {'lr': 0.00048016374992992516, 'samples': 4011648, 'steps': 20893, 'loss/train': 1.2430540323257446}} 11/07/2021 00:10:58 - INFO - __main__ - Step 20894: {'lr': 0.00048016374992992516, 'samples': 4011648, 'steps': 20893, 'loss/train': 1.2430540323257446}} 11/07/2021 00:11:01 - INFO - __main__ - Step 20901: {'lr': 0.0004801492459645024, 'samples': 4012992, 'steps': 20900, 'loss/train': 1.8301000595092773}}} 11/07/2021 00:11:03 - INFO - __main__ - Step 20906: {'lr': 0.0004801388828781307, 'samples': 4013952, 'steps': 20905, 'loss/train': 1.4361201524734497}}} 11/07/2021 00:11:06 - INFO - __main__ - Step 20911: {'lr': 0.00048012851719933335, 'samples': 4014912, 'steps': 20910, 'loss/train': 1.4883944988250732}} 11/07/2021 00:11:08 - INFO - __main__ - Step 20915: {'lr': 0.0004801202227898274, 'samples': 4015680, 'steps': 20914, 'loss/train': 1.5604429244995117}}} 11/07/2021 00:11:10 - INFO - __main__ - Step 20919: {'lr': 0.00048011192672130356, 'samples': 4016448, 'steps': 20918, 'loss/train': 1.5356251001358032}} 11/07/2021 00:11:10 - INFO - __main__ - Step 20919: {'lr': 0.00048011192672130356, 'samples': 4016448, 'steps': 20918, 'loss/train': 1.5356251001358032}} 11/07/2021 00:11:13 - INFO - __main__ - Step 20926: {'lr': 0.00048009740460955465, 'samples': 4017792, 'steps': 20925, 'loss/train': 1.7685003280639648}} 11/07/2021 00:11:15 - INFO - __main__ - Step 20930: {'lr': 0.0004800891039790399, 'samples': 4018560, 'steps': 20929, 'loss/train': 1.3246406316757202}}} 11/07/2021 00:11:15 - INFO - __main__ - Step 20930: {'lr': 0.0004800891039790399, 'samples': 4018560, 'steps': 20929, 'loss/train': 1.3246406316757202}}} 11/07/2021 00:11:20 - INFO - __main__ - Step 20938: {'lr': 0.0004800724977416894, 'samples': 4020096, 'steps': 20937, 'loss/train': 2.168165445327759}}}} 11/07/2021 00:11:21 - INFO - __main__ - Step 20942: {'lr': 0.00048006419213497334, 'samples': 4020864, 'steps': 20941, 'loss/train': 1.8388252258300781}} 11/07/2021 00:11:23 - INFO - __main__ - Step 20947: {'lr': 0.0004800538077941594, 'samples': 4021824, 'steps': 20946, 'loss/train': 1.4030572175979614}}} 11/07/2021 00:11:23 - INFO - __main__ - Step 20947: {'lr': 0.0004800538077941594, 'samples': 4021824, 'steps': 20946, 'loss/train': 1.4030572175979614}}} 11/07/2021 00:11:27 - INFO - __main__ - Step 20955: {'lr': 0.0004800371874586535, 'samples': 4023360, 'steps': 20954, 'loss/train': 1.8569425344467163}}} 11/07/2021 00:11:29 - INFO - __main__ - Step 20959: {'lr': 0.00048002887480324175, 'samples': 4024128, 'steps': 20958, 'loss/train': 1.5606435537338257}} 11/07/2021 00:11:31 - INFO - __main__ - Step 20963: {'lr': 0.00048002056048947054, 'samples': 4024896, 'steps': 20962, 'loss/train': 1.690382957458496}}} 11/07/2021 00:11:33 - INFO - __main__ - Step 20967: {'lr': 0.0004800122445173999, 'samples': 4025664, 'steps': 20966, 'loss/train': 1.5580840110778809}}} 11/07/2021 00:11:35 - INFO - __main__ - Step 20972: {'lr': 0.00048000184722041934, 'samples': 4026624, 'steps': 20971, 'loss/train': 1.4988945722579956}} 11/07/2021 00:11:38 - INFO - __main__ - Step 20977: {'lr': 0.0004799914473325567, 'samples': 4027584, 'steps': 20976, 'loss/train': 1.4126200675964355}}} 11/07/2021 00:11:38 - INFO - __main__ - Step 20977: {'lr': 0.0004799914473325567, 'samples': 4027584, 'steps': 20976, 'loss/train': 1.4126200675964355}}} 11/07/2021 00:11:41 - INFO - __main__ - Step 20984: {'lr': 0.0004799768831370902, 'samples': 4028928, 'steps': 20983, 'loss/train': 1.8679004907608032}}} 11/07/2021 00:11:43 - INFO - __main__ - Step 20988: {'lr': 0.0004799685584599313, 'samples': 4029696, 'steps': 20987, 'loss/train': 1.1187705993652344}}} 11/07/2021 00:11:46 - INFO - __main__ - Step 20993: {'lr': 0.00047995815028203346, 'samples': 4030656, 'steps': 20992, 'loss/train': 0.9504069685935974}} 11/07/2021 00:11:48 - INFO - __main__ - Step 20997: {'lr': 0.00047994982187462876, 'samples': 4031424, 'steps': 20996, 'loss/train': 1.1650660037994385}} 11/07/2021 00:11:50 - INFO - __main__ - Step 21001: {'lr': 0.0004799414918094347, 'samples': 4032192, 'steps': 21000, 'loss/train': 1.828909993171692}5}} 11/07/2021 00:11:50 - INFO - __main__ - Step 21001: {'lr': 0.0004799414918094347, 'samples': 4032192, 'steps': 21000, 'loss/train': 1.828909993171692}5}} 11/07/2021 00:11:54 - INFO - __main__ - Step 21008: {'lr': 0.0004799269102064698, 'samples': 4033536, 'steps': 21007, 'loss/train': 1.620564579963684}5}} 11/07/2021 00:11:54 - INFO - __main__ - Step 21008: {'lr': 0.0004799269102064698, 'samples': 4033536, 'steps': 21007, 'loss/train': 1.620564579963684}5}} 11/07/2021 00:11:57 - INFO - __main__ - Step 21015: {'lr': 0.0004799123235270305, 'samples': 4034880, 'steps': 21014, 'loss/train': 2.6511857509613037}}} 11/07/2021 00:12:00 - INFO - __main__ - Step 21021: {'lr': 0.0004798998166187246, 'samples': 4036032, 'steps': 21020, 'loss/train': 1.4598809480667114}}} 11/07/2021 00:12:02 - INFO - __main__ - Step 21025: {'lr': 0.0004798914766080553, 'samples': 4036800, 'steps': 21024, 'loss/train': 2.5650393962860107}}} 11/07/2021 00:12:02 - INFO - __main__ - Step 21025: {'lr': 0.0004798914766080553, 'samples': 4036800, 'steps': 21024, 'loss/train': 2.5650393962860107}}} 11/07/2021 00:12:05 - INFO - __main__ - Step 21031: {'lr': 0.00047987896348450354, 'samples': 4037952, 'steps': 21030, 'loss/train': 1.8528162240982056}} 11/07/2021 00:12:08 - INFO - __main__ - Step 21036: {'lr': 0.0004798685330330876, 'samples': 4038912, 'steps': 21035, 'loss/train': 2.236529588699341}6}} 11/07/2021 00:12:08 - INFO - __main__ - Step 21036: {'lr': 0.0004798685330330876, 'samples': 4038912, 'steps': 21035, 'loss/train': 2.236529588699341}6}} 11/07/2021 00:12:12 - INFO - __main__ - Step 21044: {'lr': 0.00047985183892495977, 'samples': 4040448, 'steps': 21043, 'loss/train': 1.7737886905670166}} 11/07/2021 00:12:13 - INFO - __main__ - Step 21048: {'lr': 0.00047984348938524113, 'samples': 4041216, 'steps': 21047, 'loss/train': 1.8316309452056885}} 11/07/2021 00:12:15 - INFO - __main__ - Step 21052: {'lr': 0.00047983513818849967, 'samples': 4041984, 'steps': 21051, 'loss/train': 1.7306580543518066}} 11/07/2021 00:12:17 - INFO - __main__ - Step 21057: {'lr': 0.0004798246968624761, 'samples': 4042944, 'steps': 21056, 'loss/train': 1.7298862934112549}}} 11/07/2021 00:12:17 - INFO - __main__ - Step 21057: {'lr': 0.0004798246968624761, 'samples': 4042944, 'steps': 21056, 'loss/train': 1.7298862934112549}}} 11/07/2021 00:12:22 - INFO - __main__ - Step 21065: {'lr': 0.00047980798535600334, 'samples': 4044480, 'steps': 21064, 'loss/train': 2.1295764446258545}} 11/07/2021 00:12:22 - INFO - __main__ - Step 21065: {'lr': 0.00047980798535600334, 'samples': 4044480, 'steps': 21064, 'loss/train': 2.1295764446258545}} 11/07/2021 00:12:25 - INFO - __main__ - Step 21073: {'lr': 0.00047979126722246294, 'samples': 4046016, 'steps': 21072, 'loss/train': 1.4065769910812378}} 11/07/2021 00:12:27 - INFO - __main__ - Step 21077: {'lr': 0.00047978290567069306, 'samples': 4046784, 'steps': 21076, 'loss/train': 2.6458792686462402}} 11/07/2021 00:12:30 - INFO - __main__ - Step 21082: {'lr': 0.00047977245140141354, 'samples': 4047744, 'steps': 21081, 'loss/train': 2.114461660385132}}} 11/07/2021 00:12:32 - INFO - __main__ - Step 21087: {'lr': 0.00047976199454383595, 'samples': 4048704, 'steps': 21086, 'loss/train': 2.107619285583496}}} 11/07/2021 00:12:32 - INFO - __main__ - Step 21087: {'lr': 0.00047976199454383595, 'samples': 4048704, 'steps': 21086, 'loss/train': 2.107619285583496}}} 11/07/2021 00:12:35 - INFO - __main__ - Step 21094: {'lr': 0.000479747350595111, 'samples': 4050048, 'steps': 21093, 'loss/train': 1.4000967741012573}}}} 11/07/2021 00:12:37 - INFO - __main__ - Step 21098: {'lr': 0.0004797389803469369, 'samples': 4050816, 'steps': 21097, 'loss/train': 1.7806404829025269}}} 11/07/2021 00:12:40 - INFO - __main__ - Step 21103: {'lr': 0.0004797285152075973, 'samples': 4051776, 'steps': 21102, 'loss/train': 1.7230778932571411}}} 11/07/2021 00:12:40 - INFO - __main__ - Step 21103: {'lr': 0.0004797285152075973, 'samples': 4051776, 'steps': 21102, 'loss/train': 1.7230778932571411}}} 11/07/2021 00:12:44 - INFO - __main__ - Step 21111: {'lr': 0.0004797117656020727, 'samples': 4053312, 'steps': 21110, 'loss/train': 1.8342883586883545}}} 11/07/2021 00:12:45 - INFO - __main__ - Step 21115: {'lr': 0.0004797033883151703, 'samples': 4054080, 'steps': 21114, 'loss/train': 1.3049585819244385}}} 11/07/2021 00:12:47 - INFO - __main__ - Step 21119: {'lr': 0.0004796950093722552, 'samples': 4054848, 'steps': 21118, 'loss/train': 1.7003940343856812}}} 11/07/2021 00:12:50 - INFO - __main__ - Step 21124: {'lr': 0.0004796845333649352, 'samples': 4055808, 'steps': 21123, 'loss/train': 1.552304744720459}}}} 11/07/2021 00:12:50 - INFO - __main__ - Step 21124: {'lr': 0.0004796845333649352, 'samples': 4055808, 'steps': 21123, 'loss/train': 1.552304744720459}}}} 11/07/2021 00:12:53 - INFO - __main__ - Step 21131: {'lr': 0.00047966986260803676, 'samples': 4057152, 'steps': 21130, 'loss/train': 1.6884819269180298}} 11/07/2021 00:12:56 - INFO - __main__ - Step 21135: {'lr': 0.0004796614770416744, 'samples': 4057920, 'steps': 21134, 'loss/train': 1.308184027671814}8}} 11/07/2021 00:12:58 - INFO - __main__ - Step 21140: {'lr': 0.0004796509927553854, 'samples': 4058880, 'steps': 21139, 'loss/train': 1.2639167308807373}}} 11/07/2021 00:13:00 - INFO - __main__ - Step 21145: {'lr': 0.0004796405058821666, 'samples': 4059840, 'steps': 21144, 'loss/train': 1.1046626567840576}}} 11/07/2021 00:13:02 - INFO - __main__ - Step 21149: {'lr': 0.00047963211452108144, 'samples': 4060608, 'steps': 21148, 'loss/train': 1.298509955406189}}} 11/07/2021 00:13:02 - INFO - __main__ - Step 21149: {'lr': 0.00047963211452108144, 'samples': 4060608, 'steps': 21148, 'loss/train': 1.298509955406189}}} 11/07/2021 00:13:06 - INFO - __main__ - Step 21156: {'lr': 0.0004796174256556744, 'samples': 4061952, 'steps': 21155, 'loss/train': 1.778990626335144}}}} 11/07/2021 00:13:08 - INFO - __main__ - Step 21161: {'lr': 0.0004796069305050741, 'samples': 4062912, 'steps': 21160, 'loss/train': 1.7508039474487305}}} 11/07/2021 00:13:10 - INFO - __main__ - Step 21166: {'lr': 0.00047959643276804026, 'samples': 4063872, 'steps': 21165, 'loss/train': 2.0346102714538574}} 11/07/2021 00:13:10 - INFO - __main__ - Step 21166: {'lr': 0.00047959643276804026, 'samples': 4063872, 'steps': 21165, 'loss/train': 2.0346102714538574}} 11/07/2021 00:13:14 - INFO - __main__ - Step 21173: {'lr': 0.00047958173159120984, 'samples': 4065216, 'steps': 21172, 'loss/train': 1.3294357061386108}} 11/07/2021 00:13:16 - INFO - __main__ - Step 21178: {'lr': 0.00047957122764721817, 'samples': 4066176, 'steps': 21177, 'loss/train': 1.3946402072906494}} 11/07/2021 00:13:18 - INFO - __main__ - Step 21182: {'lr': 0.00047956282263007663, 'samples': 4066944, 'steps': 21181, 'loss/train': 2.069216728210449}}} 11/07/2021 00:13:20 - INFO - __main__ - Step 21186: {'lr': 0.00047955441595793556, 'samples': 4067712, 'steps': 21185, 'loss/train': 1.556066632270813}}} 11/07/2021 00:13:22 - INFO - __main__ - Step 21190: {'lr': 0.00047954600763085577, 'samples': 4068480, 'steps': 21189, 'loss/train': 1.792773723602295}}} 11/07/2021 00:13:24 - INFO - __main__ - Step 21194: {'lr': 0.0004795375976488977, 'samples': 4069248, 'steps': 21193, 'loss/train': 2.0459771156311035}}} 11/07/2021 00:13:26 - INFO - __main__ - Step 21198: {'lr': 0.000479529186012122, 'samples': 4070016, 'steps': 21197, 'loss/train': 1.546166181564331}5}}} 11/07/2021 00:13:28 - INFO - __main__ - Step 21203: {'lr': 0.00047951866913915767, 'samples': 4070976, 'steps': 21202, 'loss/train': 1.6653965711593628}} 11/07/2021 00:13:30 - INFO - __main__ - Step 21207: {'lr': 0.0004795102537792641, 'samples': 4071744, 'steps': 21206, 'loss/train': 1.7479983568191528}}} 11/07/2021 00:13:30 - INFO - __main__ - Step 21207: {'lr': 0.0004795102537792641, 'samples': 4071744, 'steps': 21206, 'loss/train': 1.7479983568191528}}} 11/07/2021 00:13:35 - INFO - __main__ - Step 21215: {'lr': 0.0004794934180956764, 'samples': 4073280, 'steps': 21214, 'loss/train': 1.7211107015609741}}} 11/07/2021 00:13:36 - INFO - __main__ - Step 21219: {'lr': 0.0004794849977721036, 'samples': 4074048, 'steps': 21218, 'loss/train': 1.3072047233581543}}} 11/07/2021 00:13:38 - INFO - __main__ - Step 21223: {'lr': 0.0004794765757940924, 'samples': 4074816, 'steps': 21222, 'loss/train': 1.7698161602020264}}} 11/07/2021 00:13:41 - INFO - __main__ - Step 21228: {'lr': 0.0004794660459951169, 'samples': 4075776, 'steps': 21227, 'loss/train': 1.253070592880249}}}} 11/07/2021 00:13:43 - INFO - __main__ - Step 21232: {'lr': 0.0004794576202948414, 'samples': 4076544, 'steps': 21231, 'loss/train': 1.383471131324768}}}} 11/07/2021 00:13:43 - INFO - __main__ - Step 21232: {'lr': 0.0004794576202948414, 'samples': 4076544, 'steps': 21231, 'loss/train': 1.383471131324768}}}} 11/07/2021 00:13:46 - INFO - __main__ - Step 21239: {'lr': 0.00047944287133887834, 'samples': 4077888, 'steps': 21238, 'loss/train': 1.5501058101654053}} 11/07/2021 00:13:48 - INFO - __main__ - Step 21243: {'lr': 0.00047943444108958623, 'samples': 4078656, 'steps': 21242, 'loss/train': 3.5779306888580322}} 11/07/2021 00:13:51 - INFO - __main__ - Step 21248: {'lr': 0.0004794239009519368, 'samples': 4079616, 'steps': 21247, 'loss/train': 1.7059050798416138}}} 11/07/2021 00:13:53 - INFO - __main__ - Step 21252: {'lr': 0.00047941546698106386, 'samples': 4080384, 'steps': 21251, 'loss/train': 1.0373538732528687}} 11/07/2021 00:13:55 - INFO - __main__ - Step 21256: {'lr': 0.00047940703135625386, 'samples': 4081152, 'steps': 21255, 'loss/train': 1.5795459747314453}} 11/07/2021 00:13:56 - INFO - __main__ - Step 21260: {'lr': 0.0004793985940775676, 'samples': 4081920, 'steps': 21259, 'loss/train': 1.8833643198013306}}} 11/07/2021 00:13:59 - INFO - __main__ - Step 21264: {'lr': 0.0004793901551450658, 'samples': 4082688, 'steps': 21263, 'loss/train': 1.888841152191162}}}} 11/07/2021 00:14:01 - INFO - __main__ - Step 21268: {'lr': 0.0004793817145588094, 'samples': 4083456, 'steps': 21267, 'loss/train': 1.3523656129837036}}} 11/07/2021 00:14:03 - INFO - __main__ - Step 21272: {'lr': 0.00047937327231885925, 'samples': 4084224, 'steps': 21271, 'loss/train': 1.488318920135498}}} 11/07/2021 00:14:04 - INFO - __main__ - Step 21276: {'lr': 0.00047936482842527616, 'samples': 4084992, 'steps': 21275, 'loss/train': 1.7852537631988525}} 11/07/2021 00:14:06 - INFO - __main__ - Step 21280: {'lr': 0.00047935638287812104, 'samples': 4085760, 'steps': 21279, 'loss/train': 1.6133617162704468}} 11/07/2021 00:14:09 - INFO - __main__ - Step 21285: {'lr': 0.00047934582361893423, 'samples': 4086720, 'steps': 21284, 'loss/train': 1.3281868696212769}} 11/07/2021 00:14:11 - INFO - __main__ - Step 21289: {'lr': 0.0004793373743514647, 'samples': 4087488, 'steps': 21288, 'loss/train': 1.4470610618591309}}} 11/07/2021 00:14:13 - INFO - __main__ - Step 21293: {'lr': 0.00047932892343062103, 'samples': 4088256, 'steps': 21292, 'loss/train': 1.7325321435928345}} 11/07/2021 00:14:15 - INFO - __main__ - Step 21297: {'lr': 0.00047932047085646416, 'samples': 4089024, 'steps': 21296, 'loss/train': 1.4018278121948242}} 11/07/2021 00:14:16 - INFO - __main__ - Step 21301: {'lr': 0.00047931201662905503, 'samples': 4089792, 'steps': 21300, 'loss/train': 1.8875302076339722}} 11/07/2021 00:14:19 - INFO - __main__ - Step 21306: {'lr': 0.0004793014465200005, 'samples': 4090752, 'steps': 21305, 'loss/train': 1.727817177772522}2}} 11/07/2021 00:14:21 - INFO - __main__ - Step 21310: {'lr': 0.00047929298857299677, 'samples': 4091520, 'steps': 21309, 'loss/train': 1.4541163444519043}} 11/07/2021 00:14:21 - INFO - __main__ - Step 21310: {'lr': 0.00047929298857299677, 'samples': 4091520, 'steps': 21309, 'loss/train': 1.4541163444519043}} 11/07/2021 00:14:24 - INFO - __main__ - Step 21316: {'lr': 0.0004792802985530337, 'samples': 4092672, 'steps': 21315, 'loss/train': 1.7252967357635498}}} 11/07/2021 00:14:26 - INFO - __main__ - Step 21321: {'lr': 0.00047926972069535945, 'samples': 4093632, 'steps': 21320, 'loss/train': 1.4405698776245117}} 11/07/2021 00:14:26 - INFO - __main__ - Step 21321: {'lr': 0.00047926972069535945, 'samples': 4093632, 'steps': 21320, 'loss/train': 1.4405698776245117}} 11/07/2021 00:14:30 - INFO - __main__ - Step 21329: {'lr': 0.00047925279075124963, 'samples': 4095168, 'steps': 21328, 'loss/train': 1.454081654548645}}} 11/07/2021 00:14:32 - INFO - __main__ - Step 21333: {'lr': 0.00047924432330001776, 'samples': 4095936, 'steps': 21332, 'loss/train': 1.5046299695968628}} 11/07/2021 00:14:34 - INFO - __main__ - Step 21337: {'lr': 0.0004792358541960826, 'samples': 4096704, 'steps': 21336, 'loss/train': 2.1345772743225098}}} 11/07/2021 00:14:34 - INFO - __main__ - Step 21337: {'lr': 0.0004792358541960826, 'samples': 4096704, 'steps': 21336, 'loss/train': 2.1345772743225098}}} 11/07/2021 00:14:38 - INFO - __main__ - Step 21345: {'lr': 0.00047921891103034665, 'samples': 4098240, 'steps': 21344, 'loss/train': 1.3660441637039185}} 11/07/2021 00:14:40 - INFO - __main__ - Step 21349: {'lr': 0.000479210436968668, 'samples': 4099008, 'steps': 21348, 'loss/train': 0.2855474650859833}5}} 11/07/2021 00:14:42 - INFO - __main__ - Step 21353: {'lr': 0.0004792019612545304, 'samples': 4099776, 'steps': 21352, 'loss/train': 1.8609702587127686}}} 11/07/2021 00:14:44 - INFO - __main__ - Step 21357: {'lr': 0.000479193483887995, 'samples': 4100544, 'steps': 21356, 'loss/train': 1.198959469795227}6}}} 11/07/2021 00:14:46 - INFO - __main__ - Step 21362: {'lr': 0.00047918288485623427, 'samples': 4101504, 'steps': 21361, 'loss/train': 1.4473345279693604}} 11/07/2021 00:14:48 - INFO - __main__ - Step 21366: {'lr': 0.0004791744037720271, 'samples': 4102272, 'steps': 21365, 'loss/train': 1.810196042060852}4}} 11/07/2021 00:14:48 - INFO - __main__ - Step 21366: {'lr': 0.0004791744037720271, 'samples': 4102272, 'steps': 21365, 'loss/train': 1.810196042060852}4}} 11/07/2021 00:14:52 - INFO - __main__ - Step 21374: {'lr': 0.00047915743664707626, 'samples': 4103808, 'steps': 21373, 'loss/train': 0.219575896859169}}} 11/07/2021 00:14:54 - INFO - __main__ - Step 21378: {'lr': 0.000479148950606455, 'samples': 4104576, 'steps': 21377, 'loss/train': 1.7246084213256836}}}} 11/07/2021 00:14:57 - INFO - __main__ - Step 21383: {'lr': 0.0004791383407325384, 'samples': 4105536, 'steps': 21382, 'loss/train': 1.6584688425064087}}} 11/07/2021 00:14:59 - INFO - __main__ - Step 21388: {'lr': 0.00047912772827746685, 'samples': 4106496, 'steps': 21387, 'loss/train': 1.1820610761642456}} 11/07/2021 00:15:01 - INFO - __main__ - Step 21392: {'lr': 0.0004791192364550584, 'samples': 4107264, 'steps': 21391, 'loss/train': 0.3875841498374939}}} 11/07/2021 00:15:01 - INFO - __main__ - Step 21392: {'lr': 0.0004791192364550584, 'samples': 4107264, 'steps': 21391, 'loss/train': 0.3875841498374939}}} 11/07/2021 00:15:04 - INFO - __main__ - Step 21399: {'lr': 0.000479104371791233, 'samples': 4108608, 'steps': 21398, 'loss/train': 1.3778434991836548}}}} 11/07/2021 00:15:07 - INFO - __main__ - Step 21404: {'lr': 0.00047909375107726894, 'samples': 4109568, 'steps': 21403, 'loss/train': 1.4478750228881836}} 11/07/2021 00:15:09 - INFO - __main__ - Step 21409: {'lr': 0.00047908312778265213, 'samples': 4110528, 'steps': 21408, 'loss/train': 1.5026124715805054}} 11/07/2021 00:15:11 - INFO - __main__ - Step 21413: {'lr': 0.0004790746272889691, 'samples': 4111296, 'steps': 21412, 'loss/train': 1.090224027633667}4}} 11/07/2021 00:15:11 - INFO - __main__ - Step 21413: {'lr': 0.0004790746272889691, 'samples': 4111296, 'steps': 21412, 'loss/train': 1.090224027633667}4}} 11/07/2021 00:15:14 - INFO - __main__ - Step 21420: {'lr': 0.0004790597474511873, 'samples': 4112640, 'steps': 21419, 'loss/train': 1.4230188131332397}}} 11/07/2021 00:15:16 - INFO - __main__ - Step 21424: {'lr': 0.0004790512424160821, 'samples': 4113408, 'steps': 21423, 'loss/train': 1.8626660108566284}}} 11/07/2021 00:15:19 - INFO - __main__ - Step 21429: {'lr': 0.0004790406088000514, 'samples': 4114368, 'steps': 21428, 'loss/train': 1.5563852787017822}}} 11/07/2021 00:15:19 - INFO - __main__ - Step 21429: {'lr': 0.0004790406088000514, 'samples': 4114368, 'steps': 21428, 'loss/train': 1.5563852787017822}}} 11/07/2021 00:15:22 - INFO - __main__ - Step 21436: {'lr': 0.00047902571740314427, 'samples': 4115712, 'steps': 21435, 'loss/train': 1.878816843032837}}} 11/07/2021 00:15:24 - INFO - __main__ - Step 21440: {'lr': 0.000479017205763162, 'samples': 4116480, 'steps': 21439, 'loss/train': 1.3909929990768433}}}} 11/07/2021 00:15:27 - INFO - __main__ - Step 21445: {'lr': 0.0004790065638913799, 'samples': 4117440, 'steps': 21444, 'loss/train': 1.9915907382965088}}} 11/07/2021 00:15:29 - INFO - __main__ - Step 21450: {'lr': 0.00047899591943992726, 'samples': 4118400, 'steps': 21449, 'loss/train': 1.6092162132263184}} 11/07/2021 00:15:29 - INFO - __main__ - Step 21450: {'lr': 0.00047899591943992726, 'samples': 4118400, 'steps': 21449, 'loss/train': 1.6092162132263184}} 11/07/2021 00:15:32 - INFO - __main__ - Step 21457: {'lr': 0.00047898101287427523, 'samples': 4119744, 'steps': 21456, 'loss/train': 1.6409664154052734}} 11/07/2021 00:15:34 - INFO - __main__ - Step 21461: {'lr': 0.0004789724925668818, 'samples': 4120512, 'steps': 21460, 'loss/train': 1.5798488855361938}}} 11/07/2021 00:15:37 - INFO - __main__ - Step 21466: {'lr': 0.0004789618398612891, 'samples': 4121472, 'steps': 21465, 'loss/train': 2.6186089515686035}}} 11/07/2021 00:15:39 - INFO - __main__ - Step 21470: {'lr': 0.0004789533158398091, 'samples': 4122240, 'steps': 21469, 'loss/train': 1.933680772781372}}}} 11/07/2021 00:15:41 - INFO - __main__ - Step 21474: {'lr': 0.0004789447901677238, 'samples': 4123008, 'steps': 21473, 'loss/train': 0.4565056264400482}}} 11/07/2021 00:15:42 - INFO - __main__ - Step 21478: {'lr': 0.00047893626284509466, 'samples': 4123776, 'steps': 21477, 'loss/train': 1.7401764392852783}} 11/07/2021 00:15:44 - INFO - __main__ - Step 21482: {'lr': 0.0004789277338719832, 'samples': 4124544, 'steps': 21481, 'loss/train': 1.8052656650543213}}} 11/07/2021 00:15:47 - INFO - __main__ - Step 21487: {'lr': 0.00047891707033469665, 'samples': 4125504, 'steps': 21486, 'loss/train': 1.8308603763580322}} 11/07/2021 00:15:47 - INFO - __main__ - Step 21487: {'lr': 0.00047891707033469665, 'samples': 4125504, 'steps': 21486, 'loss/train': 1.8308603763580322}} 11/07/2021 00:15:51 - INFO - __main__ - Step 21495: {'lr': 0.00047890000331147033, 'samples': 4127040, 'steps': 21494, 'loss/train': 1.1416540145874023}} 11/07/2021 00:15:53 - INFO - __main__ - Step 21499: {'lr': 0.00047889146732449497, 'samples': 4127808, 'steps': 21498, 'loss/train': 1.7818214893341064}} 11/07/2021 00:15:54 - INFO - __main__ - Step 21503: {'lr': 0.0004788829296873601, 'samples': 4128576, 'steps': 21502, 'loss/train': 1.6200863122940063}}} 11/07/2021 00:15:57 - INFO - __main__ - Step 21507: {'lr': 0.00047887439040012755, 'samples': 4129344, 'steps': 21506, 'loss/train': 1.625873327255249}}} 11/07/2021 00:15:59 - INFO - __main__ - Step 21513: {'lr': 0.00047886157837547975, 'samples': 4130496, 'steps': 21512, 'loss/train': 1.1308202743530273}} 11/07/2021 00:15:59 - INFO - __main__ - Step 21513: {'lr': 0.00047886157837547975, 'samples': 4130496, 'steps': 21512, 'loss/train': 1.1308202743530273}} 11/07/2021 00:16:02 - INFO - __main__ - Step 21519: {'lr': 0.0004788487626384581, 'samples': 4131648, 'steps': 21518, 'loss/train': 1.6291325092315674}}} 11/07/2021 00:16:04 - INFO - __main__ - Step 21523: {'lr': 0.00047884021675144987, 'samples': 4132416, 'steps': 21522, 'loss/train': 1.3714509010314941}} 11/07/2021 00:16:07 - INFO - __main__ - Step 21528: {'lr': 0.00047882953207267954, 'samples': 4133376, 'steps': 21527, 'loss/train': 1.8911947011947632}} 11/07/2021 00:16:07 - INFO - __main__ - Step 21528: {'lr': 0.00047882953207267954, 'samples': 4133376, 'steps': 21527, 'loss/train': 1.8911947011947632}} 11/07/2021 00:16:11 - INFO - __main__ - Step 21536: {'lr': 0.0004788124312251303, 'samples': 4134912, 'steps': 21535, 'loss/train': 1.6723524332046509}}} 11/07/2021 00:16:13 - INFO - __main__ - Step 21540: {'lr': 0.0004788038783269404, 'samples': 4135680, 'steps': 21539, 'loss/train': 1.685656189918518}}}} 11/07/2021 00:16:14 - INFO - __main__ - Step 21544: {'lr': 0.0004787953237792225, 'samples': 4136448, 'steps': 21543, 'loss/train': 1.9875998497009277}}} 11/07/2021 00:16:17 - INFO - __main__ - Step 21549: {'lr': 0.00047878462827502055, 'samples': 4137408, 'steps': 21548, 'loss/train': 1.7756991386413574}} 11/07/2021 00:16:17 - INFO - __main__ - Step 21549: {'lr': 0.00047878462827502055, 'samples': 4137408, 'steps': 21548, 'loss/train': 1.7756991386413574}} 11/07/2021 00:16:21 - INFO - __main__ - Step 21557: {'lr': 0.00047876751010783266, 'samples': 4138944, 'steps': 21556, 'loss/train': 1.8446043729782104}} 11/07/2021 00:16:23 - INFO - __main__ - Step 21561: {'lr': 0.00047875894855030923, 'samples': 4139712, 'steps': 21560, 'loss/train': 1.3773596286773682}} 11/07/2021 00:16:24 - INFO - __main__ - Step 21565: {'lr': 0.0004787503853435817, 'samples': 4140480, 'steps': 21564, 'loss/train': 1.6013624668121338}}} 11/07/2021 00:16:27 - INFO - __main__ - Step 21569: {'lr': 0.000478741820487712, 'samples': 4141248, 'steps': 21568, 'loss/train': 1.362804889678955}8}}} 11/07/2021 00:16:27 - INFO - __main__ - Step 21569: {'lr': 0.000478741820487712, 'samples': 4141248, 'steps': 21568, 'loss/train': 1.362804889678955}8}}} 11/07/2021 00:16:31 - INFO - __main__ - Step 21577: {'lr': 0.0004787246858287926, 'samples': 4142784, 'steps': 21576, 'loss/train': 2.4061551094055176}}} 11/07/2021 00:16:32 - INFO - __main__ - Step 21581: {'lr': 0.0004787161160258664, 'samples': 4143552, 'steps': 21580, 'loss/train': 1.6663235425949097}}} 11/07/2021 00:16:34 - INFO - __main__ - Step 21585: {'lr': 0.000478707544574045, 'samples': 4144320, 'steps': 21584, 'loss/train': 1.1830782890319824}}}} 11/07/2021 00:16:37 - INFO - __main__ - Step 21590: {'lr': 0.0004786968279406035, 'samples': 4145280, 'steps': 21589, 'loss/train': 1.5908639430999756}}} 11/07/2021 00:16:39 - INFO - __main__ - Step 21594: {'lr': 0.0004786882527789938, 'samples': 4146048, 'steps': 21593, 'loss/train': 2.0507147312164307}}} 11/07/2021 00:16:41 - INFO - __main__ - Step 21598: {'lr': 0.00047867967596868974, 'samples': 4146816, 'steps': 21597, 'loss/train': 1.0293926000595093}} 11/07/2021 00:16:43 - INFO - __main__ - Step 21602: {'lr': 0.0004786710975097531, 'samples': 4147584, 'steps': 21601, 'loss/train': 1.9969254732131958}}} 11/07/2021 00:16:44 - INFO - __main__ - Step 21606: {'lr': 0.0004786625174022458, 'samples': 4148352, 'steps': 21605, 'loss/train': 1.1935527324676514}}} 11/07/2021 00:16:44 - INFO - __main__ - Step 21606: {'lr': 0.0004786625174022458, 'samples': 4148352, 'steps': 21605, 'loss/train': 1.1935527324676514}}} 11/07/2021 00:16:49 - INFO - __main__ - Step 21614: {'lr': 0.00047864535224176666, 'samples': 4149888, 'steps': 21613, 'loss/train': 1.2140483856201172}} 11/07/2021 00:16:51 - INFO - __main__ - Step 21618: {'lr': 0.00047863676718891846, 'samples': 4150656, 'steps': 21617, 'loss/train': 1.6251708269119263}} 11/07/2021 00:16:53 - INFO - __main__ - Step 21622: {'lr': 0.0004786281804877471, 'samples': 4151424, 'steps': 21621, 'loss/train': 1.2028931379318237}}} 11/07/2021 00:16:55 - INFO - __main__ - Step 21626: {'lr': 0.00047861959213831446, 'samples': 4152192, 'steps': 21625, 'loss/train': 1.6223113536834717}} 11/07/2021 00:16:57 - INFO - __main__ - Step 21631: {'lr': 0.0004786088543837506, 'samples': 4153152, 'steps': 21630, 'loss/train': 1.5705360174179077}}} 11/07/2021 00:16:59 - INFO - __main__ - Step 21635: {'lr': 0.00047860026232595645, 'samples': 4153920, 'steps': 21634, 'loss/train': 1.7790687084197998}} 11/07/2021 00:17:01 - INFO - __main__ - Step 21639: {'lr': 0.0004785916686201023, 'samples': 4154688, 'steps': 21638, 'loss/train': 2.32190203666687}98}} 11/07/2021 00:17:03 - INFO - __main__ - Step 21643: {'lr': 0.00047858307326625014, 'samples': 4155456, 'steps': 21642, 'loss/train': 1.774276614189148}}} 11/07/2021 00:17:05 - INFO - __main__ - Step 21647: {'lr': 0.0004785744762644619, 'samples': 4156224, 'steps': 21646, 'loss/train': 1.1660066843032837}}} 11/07/2021 00:17:07 - INFO - __main__ - Step 21652: {'lr': 0.00047856372769491083, 'samples': 4157184, 'steps': 21651, 'loss/train': 1.6415342092514038}} 11/07/2021 00:17:09 - INFO - __main__ - Step 21656: {'lr': 0.00047855512698549295, 'samples': 4157952, 'steps': 21655, 'loss/train': 1.7791290283203125}} 11/07/2021 00:17:09 - INFO - __main__ - Step 21656: {'lr': 0.00047855512698549295, 'samples': 4157952, 'steps': 21655, 'loss/train': 1.7791290283203125}} 11/07/2021 00:17:13 - INFO - __main__ - Step 21663: {'lr': 0.0004785400717791877, 'samples': 4159296, 'steps': 21662, 'loss/train': 2.0939254760742188}}} 11/07/2021 00:17:13 - INFO - __main__ - Step 21663: {'lr': 0.0004785400717791877, 'samples': 4159296, 'steps': 21662, 'loss/train': 2.0939254760742188}}} 11/07/2021 00:17:17 - INFO - __main__ - Step 21671: {'lr': 0.00047852285965054606, 'samples': 4160832, 'steps': 21670, 'loss/train': 1.454700231552124}}} 11/07/2021 00:17:19 - INFO - __main__ - Step 21675: {'lr': 0.0004785142511149412, 'samples': 4161600, 'steps': 21674, 'loss/train': 1.6360929012298584}}} 11/07/2021 00:17:21 - INFO - __main__ - Step 21679: {'lr': 0.00047850564093189653, 'samples': 4162368, 'steps': 21678, 'loss/train': 1.9067273139953613}} 11/07/2021 00:17:23 - INFO - __main__ - Step 21684: {'lr': 0.0004784948758864727, 'samples': 4163328, 'steps': 21683, 'loss/train': 1.3767606019973755}}} 11/07/2021 00:17:25 - INFO - __main__ - Step 21688: {'lr': 0.00047848626199691513, 'samples': 4164096, 'steps': 21687, 'loss/train': 1.1082913875579834}} 11/07/2021 00:17:28 - INFO - __main__ - Step 21692: {'lr': 0.00047847764646011937, 'samples': 4164864, 'steps': 21691, 'loss/train': 2.253713846206665}}} 11/07/2021 00:17:29 - INFO - __main__ - Step 21696: {'lr': 0.00047846902927614767, 'samples': 4165632, 'steps': 21695, 'loss/train': 1.421078085899353}}} 11/07/2021 00:17:29 - INFO - __main__ - Step 21696: {'lr': 0.00047846902927614767, 'samples': 4165632, 'steps': 21695, 'loss/train': 1.421078085899353}}} 11/07/2021 00:17:34 - INFO - __main__ - Step 21703: {'lr': 0.0004784539452408666, 'samples': 4166976, 'steps': 21702, 'loss/train': 1.7695062160491943}}} 11/07/2021 00:17:35 - INFO - __main__ - Step 21707: {'lr': 0.00047844532352748115, 'samples': 4167744, 'steps': 21706, 'loss/train': 1.4688421487808228}} 11/07/2021 00:17:37 - INFO - __main__ - Step 21712: {'lr': 0.00047843454406974295, 'samples': 4168704, 'steps': 21711, 'loss/train': 0.41166508197784424} 11/07/2021 00:17:40 - INFO - __main__ - Step 21717: {'lr': 0.0004784237620387778, 'samples': 4169664, 'steps': 21716, 'loss/train': 1.5829322338104248}4} 11/07/2021 00:17:40 - INFO - __main__ - Step 21717: {'lr': 0.0004784237620387778, 'samples': 4169664, 'steps': 21716, 'loss/train': 1.5829322338104248}4} 11/07/2021 00:17:40 - INFO - __main__ - Step 21717: {'lr': 0.0004784237620387778, 'samples': 4169664, 'steps': 21716, 'loss/train': 1.5829322338104248}4} 11/07/2021 00:17:45 - INFO - __main__ - Step 21727: {'lr': 0.00047840219025765225, 'samples': 4171584, 'steps': 21726, 'loss/train': 1.7945908308029175}} 11/07/2021 00:17:47 - INFO - __main__ - Step 21732: {'lr': 0.0004783914005077349, 'samples': 4172544, 'steps': 21731, 'loss/train': 1.0612422227859497}}} 11/07/2021 00:17:49 - INFO - __main__ - Step 21736: {'lr': 0.00047838276685542157, 'samples': 4173312, 'steps': 21735, 'loss/train': 1.6839711666107178}} 11/07/2021 00:17:49 - INFO - __main__ - Step 21736: {'lr': 0.00047838276685542157, 'samples': 4173312, 'steps': 21735, 'loss/train': 1.6839711666107178}} 11/07/2021 00:17:53 - INFO - __main__ - Step 21743: {'lr': 0.00047836765400203953, 'samples': 4174656, 'steps': 21742, 'loss/train': 2.0440940856933594}} 11/07/2021 00:17:55 - INFO - __main__ - Step 21748: {'lr': 0.00047835685601977886, 'samples': 4175616, 'steps': 21747, 'loss/train': 1.6363345384597778}} 11/07/2021 00:17:57 - INFO - __main__ - Step 21752: {'lr': 0.0004783482157818711, 'samples': 4176384, 'steps': 21751, 'loss/train': 1.7143123149871826}}} 11/07/2021 00:17:59 - INFO - __main__ - Step 21756: {'lr': 0.00047833957389772046, 'samples': 4177152, 'steps': 21755, 'loss/train': 1.6110718250274658}} 11/07/2021 00:18:01 - INFO - __main__ - Step 21760: {'lr': 0.0004783309303673892, 'samples': 4177920, 'steps': 21759, 'loss/train': 1.4296553134918213}}} 11/07/2021 00:18:03 - INFO - __main__ - Step 21764: {'lr': 0.0004783222851909397, 'samples': 4178688, 'steps': 21763, 'loss/train': 1.790881872177124}}}} 11/07/2021 00:18:05 - INFO - __main__ - Step 21769: {'lr': 0.0004783114764056188, 'samples': 4179648, 'steps': 21768, 'loss/train': 1.2280904054641724}}} 11/07/2021 00:18:05 - INFO - __main__ - Step 21769: {'lr': 0.0004783114764056188, 'samples': 4179648, 'steps': 21768, 'loss/train': 1.2280904054641724}}} 11/07/2021 00:18:09 - INFO - __main__ - Step 21777: {'lr': 0.00047829417699972747, 'samples': 4181184, 'steps': 21776, 'loss/train': 1.57528555393219}}}} 11/07/2021 00:18:11 - INFO - __main__ - Step 21781: {'lr': 0.0004782855248279706, 'samples': 4181952, 'steps': 21780, 'loss/train': 2.070675849914551}}}} 11/07/2021 00:18:13 - INFO - __main__ - Step 21785: {'lr': 0.00047827687101042283, 'samples': 4182720, 'steps': 21784, 'loss/train': 1.162865400314331}}} 11/07/2021 00:18:15 - INFO - __main__ - Step 21789: {'lr': 0.00047826821554714644, 'samples': 4183488, 'steps': 21788, 'loss/train': 1.4995194673538208}} 11/07/2021 00:18:18 - INFO - __main__ - Step 21794: {'lr': 0.0004782573939038402, 'samples': 4184448, 'steps': 21793, 'loss/train': 1.5484576225280762}}} 11/07/2021 00:18:20 - INFO - __main__ - Step 21798: {'lr': 0.00047824873473790275, 'samples': 4185216, 'steps': 21797, 'loss/train': 1.65096914768219}}}} 11/07/2021 00:18:22 - INFO - __main__ - Step 21802: {'lr': 0.0004782400739264395, 'samples': 4185984, 'steps': 21801, 'loss/train': 1.8260188102722168}}} 11/07/2021 00:18:23 - INFO - __main__ - Step 21806: {'lr': 0.000478231411469513, 'samples': 4186752, 'steps': 21805, 'loss/train': 1.3001028299331665}}}} 11/07/2021 00:18:25 - INFO - __main__ - Step 21810: {'lr': 0.0004782227473671857, 'samples': 4187520, 'steps': 21809, 'loss/train': 1.851442575454712}}}} 11/07/2021 00:18:28 - INFO - __main__ - Step 21815: {'lr': 0.00047821191492552676, 'samples': 4188480, 'steps': 21814, 'loss/train': 1.5835494995117188}} 11/07/2021 00:18:30 - INFO - __main__ - Step 21819: {'lr': 0.00047820324712127593, 'samples': 4189248, 'steps': 21818, 'loss/train': 1.2986811399459839}} 11/07/2021 00:18:32 - INFO - __main__ - Step 21823: {'lr': 0.00047819457767182735, 'samples': 4190016, 'steps': 21822, 'loss/train': 1.800366997718811}}} 11/07/2021 00:18:32 - INFO - __main__ - Step 21823: {'lr': 0.00047819457767182735, 'samples': 4190016, 'steps': 21822, 'loss/train': 1.800366997718811}}} 11/07/2021 00:18:35 - INFO - __main__ - Step 21830: {'lr': 0.00047817940217672315, 'samples': 4191360, 'steps': 21829, 'loss/train': 1.6217126846313477}} 11/07/2021 00:18:37 - INFO - __main__ - Step 21835: {'lr': 0.0004781685594529199, 'samples': 4192320, 'steps': 21834, 'loss/train': 1.3522447347640991}}} 11/07/2021 00:18:40 - INFO - __main__ - Step 21840: {'lr': 0.0004781577141588859, 'samples': 4193280, 'steps': 21839, 'loss/train': 2.1624045372009277}}} 11/07/2021 00:18:40 - INFO - __main__ - Step 21840: {'lr': 0.0004781577141588859, 'samples': 4193280, 'steps': 21839, 'loss/train': 2.1624045372009277}}} 11/07/2021 00:18:43 - INFO - __main__ - Step 21847: {'lr': 0.0004781425264294831, 'samples': 4194624, 'steps': 21846, 'loss/train': 0.418106347322464}}}} 11/07/2021 00:18:45 - INFO - __main__ - Step 21852: {'lr': 0.00047813167496739363, 'samples': 4195584, 'steps': 21851, 'loss/train': 1.6310547590255737}} 11/07/2021 00:18:48 - INFO - __main__ - Step 21856: {'lr': 0.00047812299194744924, 'samples': 4196352, 'steps': 21855, 'loss/train': 1.4601771831512451}} 11/07/2021 00:18:50 - INFO - __main__ - Step 21861: {'lr': 0.00047811213585978023, 'samples': 4197312, 'steps': 21860, 'loss/train': 1.5927478075027466}} 11/07/2021 00:18:52 - INFO - __main__ - Step 21865: {'lr': 0.00047810344913953065, 'samples': 4198080, 'steps': 21864, 'loss/train': 1.4618579149246216}} 11/07/2021 00:18:52 - INFO - __main__ - Step 21865: {'lr': 0.00047810344913953065, 'samples': 4198080, 'steps': 21864, 'loss/train': 1.4618579149246216}} 11/07/2021 00:18:55 - INFO - __main__ - Step 21872: {'lr': 0.00047808824342210565, 'samples': 4199424, 'steps': 21871, 'loss/train': 2.10385799407959}6}} 11/07/2021 00:18:58 - INFO - __main__ - Step 21877: {'lr': 0.0004780773791121626, 'samples': 4200384, 'steps': 21876, 'loss/train': 1.5490812063217163}}} 11/07/2021 00:18:58 - INFO - __main__ - Step 21877: {'lr': 0.0004780773791121626, 'samples': 4200384, 'steps': 21876, 'loss/train': 1.5490812063217163}}} 11/07/2021 00:19:02 - INFO - __main__ - Step 21885: {'lr': 0.00047805999087236097, 'samples': 4201920, 'steps': 21884, 'loss/train': 2.2122480869293213}} 11/07/2021 00:19:03 - INFO - __main__ - Step 21889: {'lr': 0.0004780512942861813, 'samples': 4202688, 'steps': 21888, 'loss/train': 1.3067866563796997}}} 11/07/2021 00:19:06 - INFO - __main__ - Step 21893: {'lr': 0.0004780425960558994, 'samples': 4203456, 'steps': 21892, 'loss/train': 1.5470932722091675}}} 11/07/2021 00:19:06 - INFO - __main__ - Step 21893: {'lr': 0.0004780425960558994, 'samples': 4203456, 'steps': 21892, 'loss/train': 1.5470932722091675}}} 11/07/2021 00:19:10 - INFO - __main__ - Step 21901: {'lr': 0.00047802519466327945, 'samples': 4204992, 'steps': 21900, 'loss/train': 2.107398509979248}}} 11/07/2021 00:19:12 - INFO - __main__ - Step 21905: {'lr': 0.00047801649150106684, 'samples': 4205760, 'steps': 21904, 'loss/train': 1.9765163660049438}} 11/07/2021 00:19:13 - INFO - __main__ - Step 21909: {'lr': 0.0004780077866950029, 'samples': 4206528, 'steps': 21908, 'loss/train': 5.938828945159912}8}} 11/07/2021 00:19:16 - INFO - __main__ - Step 21914: {'lr': 0.0004779969033758525, 'samples': 4207488, 'steps': 21913, 'loss/train': 1.252291202545166}8}} 11/07/2021 00:19:18 - INFO - __main__ - Step 21918: {'lr': 0.0004779881948713524, 'samples': 4208256, 'steps': 21917, 'loss/train': 1.3182247877120972}}} 11/07/2021 00:19:20 - INFO - __main__ - Step 21922: {'lr': 0.0004779794847232048, 'samples': 4209024, 'steps': 21921, 'loss/train': 0.12900973856449127}} 11/07/2021 00:19:22 - INFO - __main__ - Step 21926: {'lr': 0.0004779707729314726, 'samples': 4209792, 'steps': 21925, 'loss/train': 1.3452696800231934}}} 11/07/2021 00:19:23 - INFO - __main__ - Step 21930: {'lr': 0.00047796205949621873, 'samples': 4210560, 'steps': 21929, 'loss/train': 0.7199652791023254}} 11/07/2021 00:19:25 - INFO - __main__ - Step 21934: {'lr': 0.0004779533444175058, 'samples': 4211328, 'steps': 21933, 'loss/train': 1.737036108970642}4}} 11/07/2021 00:19:28 - INFO - __main__ - Step 21939: {'lr': 0.00047794244825809614, 'samples': 4212288, 'steps': 21938, 'loss/train': 1.3799632787704468}} 11/07/2021 00:19:30 - INFO - __main__ - Step 21943: {'lr': 0.00047793372948183024, 'samples': 4213056, 'steps': 21942, 'loss/train': 1.6062308549880981}} 11/07/2021 00:19:32 - INFO - __main__ - Step 21947: {'lr': 0.00047792500906230963, 'samples': 4213824, 'steps': 21946, 'loss/train': 1.8816180229187012}} 11/07/2021 00:19:33 - INFO - __main__ - Step 21951: {'lr': 0.0004779162869995971, 'samples': 4214592, 'steps': 21950, 'loss/train': 1.45814049243927}12}} 11/07/2021 00:19:35 - INFO - __main__ - Step 21955: {'lr': 0.0004779075632937556, 'samples': 4215360, 'steps': 21954, 'loss/train': 1.2813609838485718}}} 11/07/2021 00:19:38 - INFO - __main__ - Step 21960: {'lr': 0.0004778966563508994, 'samples': 4216320, 'steps': 21959, 'loss/train': 1.7045584917068481}}} 11/07/2021 00:19:40 - INFO - __main__ - Step 21964: {'lr': 0.0004778879289482476, 'samples': 4217088, 'steps': 21963, 'loss/train': 1.4810577630996704}}} 11/07/2021 00:19:42 - INFO - __main__ - Step 21968: {'lr': 0.0004778791999026713, 'samples': 4217856, 'steps': 21967, 'loss/train': 1.728704810142517}}}} 11/07/2021 00:19:42 - INFO - __main__ - Step 21968: {'lr': 0.0004778791999026713, 'samples': 4217856, 'steps': 21967, 'loss/train': 1.728704810142517}}}} 11/07/2021 00:19:45 - INFO - __main__ - Step 21975: {'lr': 0.0004778639201198149, 'samples': 4219200, 'steps': 21974, 'loss/train': 1.5207573175430298}}} 11/07/2021 00:19:48 - INFO - __main__ - Step 21981: {'lr': 0.0004778508191588613, 'samples': 4220352, 'steps': 21980, 'loss/train': 1.6339510679244995}}} 11/07/2021 00:19:50 - INFO - __main__ - Step 21985: {'lr': 0.0004778420831315579, 'samples': 4221120, 'steps': 21984, 'loss/train': 1.1991043090820312}}} 11/07/2021 00:19:52 - INFO - __main__ - Step 21989: {'lr': 0.00047783334546166046, 'samples': 4221888, 'steps': 21988, 'loss/train': 1.6182039976119995}} 11/07/2021 00:19:52 - INFO - __main__ - Step 21989: {'lr': 0.00047783334546166046, 'samples': 4221888, 'steps': 21988, 'loss/train': 1.6182039976119995}} 11/07/2021 00:19:55 - INFO - __main__ - Step 21996: {'lr': 0.0004778180505870375, 'samples': 4223232, 'steps': 21995, 'loss/train': 1.802703619003296}5}} 11/07/2021 00:19:58 - INFO - __main__ - Step 22001: {'lr': 0.00047780712259703394, 'samples': 4224192, 'steps': 22000, 'loss/train': 1.5872262716293335}} 11/07/2021 00:20:00 - INFO - __main__ - Step 22005: {'lr': 0.00047779837835739043, 'samples': 4224960, 'steps': 22004, 'loss/train': 1.5473153591156006}} 11/07/2021 00:20:02 - INFO - __main__ - Step 22009: {'lr': 0.000477789632475468, 'samples': 4225728, 'steps': 22008, 'loss/train': 1.392614722251892}06}} 11/07/2021 00:20:03 - INFO - __main__ - Step 22013: {'lr': 0.00047778088495132963, 'samples': 4226496, 'steps': 22012, 'loss/train': 2.2597596645355225}} 11/07/2021 00:20:06 - INFO - __main__ - Step 22017: {'lr': 0.00047777213578503844, 'samples': 4227264, 'steps': 22016, 'loss/train': 1.7673450708389282}} 11/07/2021 00:20:08 - INFO - __main__ - Step 22022: {'lr': 0.00047776119701799317, 'samples': 4228224, 'steps': 22021, 'loss/train': 1.1082690954208374}} 11/07/2021 00:20:10 - INFO - __main__ - Step 22027: {'lr': 0.0004777502556853058, 'samples': 4229184, 'steps': 22026, 'loss/train': 0.7627279162406921}}} 11/07/2021 00:20:10 - INFO - __main__ - Step 22027: {'lr': 0.0004777502556853058, 'samples': 4229184, 'steps': 22026, 'loss/train': 0.7627279162406921}}} 11/07/2021 00:20:14 - INFO - __main__ - Step 22034: {'lr': 0.00047773493350949963, 'samples': 4230528, 'steps': 22033, 'loss/train': 2.4903831481933594}} 11/07/2021 00:20:16 - INFO - __main__ - Step 22038: {'lr': 0.00047772617572294123, 'samples': 4231296, 'steps': 22037, 'loss/train': 1.876652717590332}}} 11/07/2021 00:20:18 - INFO - __main__ - Step 22043: {'lr': 0.0004777152261810279, 'samples': 4232256, 'steps': 22042, 'loss/train': 1.7015565633773804}}} 11/07/2021 00:20:18 - INFO - __main__ - Step 22043: {'lr': 0.0004777152261810279, 'samples': 4232256, 'steps': 22042, 'loss/train': 1.7015565633773804}}} 11/07/2021 00:20:23 - INFO - __main__ - Step 22051: {'lr': 0.0004776977015785595, 'samples': 4233792, 'steps': 22050, 'loss/train': 1.2668564319610596}}} 11/07/2021 00:20:23 - INFO - __main__ - Step 22051: {'lr': 0.0004776977015785595, 'samples': 4233792, 'steps': 22050, 'loss/train': 1.2668564319610596}}} 11/07/2021 00:20:26 - INFO - __main__ - Step 22058: {'lr': 0.00047768236216503613, 'samples': 4235136, 'steps': 22057, 'loss/train': 1.7460706233978271}} 11/07/2021 00:20:28 - INFO - __main__ - Step 22064: {'lr': 0.00047766921009527284, 'samples': 4236288, 'steps': 22063, 'loss/train': 1.7506041526794434}} 11/07/2021 00:20:28 - INFO - __main__ - Step 22064: {'lr': 0.00047766921009527284, 'samples': 4236288, 'steps': 22063, 'loss/train': 1.7506041526794434}} 11/07/2021 00:20:33 - INFO - __main__ - Step 22071: {'lr': 0.0004776538613463147, 'samples': 4237632, 'steps': 22070, 'loss/train': 0.24791944026947021}} 11/07/2021 00:20:34 - INFO - __main__ - Step 22075: {'lr': 0.0004776450883759016, 'samples': 4238400, 'steps': 22074, 'loss/train': 1.0848828554153442}}} 11/07/2021 00:20:36 - INFO - __main__ - Step 22079: {'lr': 0.0004776363137643147, 'samples': 4239168, 'steps': 22078, 'loss/train': 1.6272341012954712}}} 11/07/2021 00:20:39 - INFO - __main__ - Step 22084: {'lr': 0.0004776253431920268, 'samples': 4240128, 'steps': 22083, 'loss/train': 1.3235876560211182}}} 11/07/2021 00:20:41 - INFO - __main__ - Step 22088: {'lr': 0.00047761656488803006, 'samples': 4240896, 'steps': 22087, 'loss/train': 1.8013110160827637}} 11/07/2021 00:20:41 - INFO - __main__ - Step 22088: {'lr': 0.00047761656488803006, 'samples': 4240896, 'steps': 22087, 'loss/train': 1.8013110160827637}} 11/07/2021 00:20:44 - INFO - __main__ - Step 22095: {'lr': 0.0004776011989074943, 'samples': 4242240, 'steps': 22094, 'loss/train': 1.7879666090011597}}} 11/07/2021 00:20:46 - INFO - __main__ - Step 22100: {'lr': 0.00047759022013048417, 'samples': 4243200, 'steps': 22099, 'loss/train': 1.2801927328109741}} 11/07/2021 00:20:46 - INFO - __main__ - Step 22100: {'lr': 0.00047759022013048417, 'samples': 4243200, 'steps': 22099, 'loss/train': 1.2801927328109741}} 11/07/2021 00:20:49 - INFO - __main__ - Step 22106: {'lr': 0.0004775770422139776, 'samples': 4244352, 'steps': 22105, 'loss/train': 1.5821597576141357}}} 11/07/2021 00:20:52 - INFO - __main__ - Step 22111: {'lr': 0.0004775660577969555, 'samples': 4245312, 'steps': 22110, 'loss/train': 0.2226470410823822}}} 11/07/2021 00:20:54 - INFO - __main__ - Step 22116: {'lr': 0.0004775550708164895, 'samples': 4246272, 'steps': 22115, 'loss/train': 1.5752651691436768}}} 11/07/2021 00:20:54 - INFO - __main__ - Step 22116: {'lr': 0.0004775550708164895, 'samples': 4246272, 'steps': 22115, 'loss/train': 1.5752651691436768}}} 11/07/2021 00:20:58 - INFO - __main__ - Step 22123: {'lr': 0.0004775396847374871, 'samples': 4247616, 'steps': 22122, 'loss/train': 1.7947843074798584}}} 11/07/2021 00:21:00 - INFO - __main__ - Step 22127: {'lr': 0.0004775308904367519, 'samples': 4248384, 'steps': 22126, 'loss/train': 1.7427397966384888}}} 11/07/2021 00:21:02 - INFO - __main__ - Step 22132: {'lr': 0.00047751989525409745, 'samples': 4249344, 'steps': 22131, 'loss/train': 1.9468735456466675}} 11/07/2021 00:21:02 - INFO - __main__ - Step 22132: {'lr': 0.00047751989525409745, 'samples': 4249344, 'steps': 22131, 'loss/train': 1.9468735456466675}} 11/07/2021 00:21:06 - INFO - __main__ - Step 22140: {'lr': 0.0004775022976310203, 'samples': 4250880, 'steps': 22139, 'loss/train': 1.6305336952209473}}} 11/07/2021 00:21:08 - INFO - __main__ - Step 22144: {'lr': 0.00047749349635923334, 'samples': 4251648, 'steps': 22143, 'loss/train': 1.6408462524414062}} 11/07/2021 00:21:10 - INFO - __main__ - Step 22148: {'lr': 0.00047748469344736547, 'samples': 4252416, 'steps': 22147, 'loss/train': 2.0035457611083984}} 11/07/2021 00:21:12 - INFO - __main__ - Step 22153: {'lr': 0.00047747368750126345, 'samples': 4253376, 'steps': 22152, 'loss/train': 1.6150118112564087}} 11/07/2021 00:21:15 - INFO - __main__ - Step 22158: {'lr': 0.0004774626789927582, 'samples': 4254336, 'steps': 22157, 'loss/train': 1.7074940204620361}}} 11/07/2021 00:21:15 - INFO - __main__ - Step 22158: {'lr': 0.0004774626789927582, 'samples': 4254336, 'steps': 22157, 'loss/train': 1.7074940204620361}}} 11/07/2021 00:21:19 - INFO - __main__ - Step 22165: {'lr': 0.00047744726277624926, 'samples': 4255680, 'steps': 22164, 'loss/train': 1.7479281425476074}} 11/07/2021 00:21:20 - INFO - __main__ - Step 22169: {'lr': 0.0004774384512549979, 'samples': 4256448, 'steps': 22168, 'loss/train': 1.6606671810150146}}} 11/07/2021 00:21:23 - INFO - __main__ - Step 22174: {'lr': 0.0004774274345476354, 'samples': 4257408, 'steps': 22173, 'loss/train': 1.6413216590881348}}} 11/07/2021 00:21:25 - INFO - __main__ - Step 22178: {'lr': 0.0004774186193371841, 'samples': 4258176, 'steps': 22177, 'loss/train': 0.964850664138794}}}} 11/07/2021 00:21:27 - INFO - __main__ - Step 22182: {'lr': 0.0004774098024871918, 'samples': 4258944, 'steps': 22181, 'loss/train': 1.5867414474487305}}} 11/07/2021 00:21:28 - INFO - __main__ - Step 22186: {'lr': 0.00047740098399772185, 'samples': 4259712, 'steps': 22185, 'loss/train': 0.4743185043334961}} 11/07/2021 00:21:30 - INFO - __main__ - Step 22190: {'lr': 0.00047739216386883797, 'samples': 4260480, 'steps': 22189, 'loss/train': 1.4760106801986694}} 11/07/2021 00:21:33 - INFO - __main__ - Step 22195: {'lr': 0.000477381136402404, 'samples': 4261440, 'steps': 22194, 'loss/train': 1.462497591972351}94}} 11/07/2021 00:21:35 - INFO - __main__ - Step 22199: {'lr': 0.00047737231258507116, 'samples': 4262208, 'steps': 22198, 'loss/train': 1.4578293561935425}} 11/07/2021 00:21:37 - INFO - __main__ - Step 22203: {'lr': 0.00047736348712853094, 'samples': 4262976, 'steps': 22202, 'loss/train': 1.5299713611602783}} 11/07/2021 00:21:38 - INFO - __main__ - Step 22207: {'lr': 0.0004773546600328471, 'samples': 4263744, 'steps': 22206, 'loss/train': 1.6162457466125488}}} 11/07/2021 00:21:40 - INFO - __main__ - Step 22211: {'lr': 0.00047734583129808327, 'samples': 4264512, 'steps': 22210, 'loss/train': 1.5364080667495728}} 11/07/2021 00:21:40 - INFO - __main__ - Step 22211: {'lr': 0.00047734583129808327, 'samples': 4264512, 'steps': 22210, 'loss/train': 1.5364080667495728}} 11/07/2021 00:21:45 - INFO - __main__ - Step 22219: {'lr': 0.0004773281689115701, 'samples': 4266048, 'steps': 22218, 'loss/train': 1.8526577949523926}}} 11/07/2021 00:21:46 - INFO - __main__ - Step 22223: {'lr': 0.00047731933525994814, 'samples': 4266816, 'steps': 22222, 'loss/train': 0.9544246196746826}} 11/07/2021 00:21:48 - INFO - __main__ - Step 22227: {'lr': 0.0004773104999695008, 'samples': 4267584, 'steps': 22226, 'loss/train': 1.5060862302780151}}} 11/07/2021 00:21:48 - INFO - __main__ - Step 22227: {'lr': 0.0004773104999695008, 'samples': 4267584, 'steps': 22226, 'loss/train': 1.5060862302780151}}} 11/07/2021 00:21:53 - INFO - __main__ - Step 22235: {'lr': 0.0004772928244723849, 'samples': 4269120, 'steps': 22234, 'loss/train': 1.5444846153259277}}} 11/07/2021 00:21:55 - INFO - __main__ - Step 22239: {'lr': 0.00047728398426584375, 'samples': 4269888, 'steps': 22238, 'loss/train': 1.3304921388626099}} 11/07/2021 00:21:56 - INFO - __main__ - Step 22243: {'lr': 0.000477275142420732, 'samples': 4270656, 'steps': 22242, 'loss/train': 0.41618475317955017}}} 11/07/2021 00:21:58 - INFO - __main__ - Step 22247: {'lr': 0.0004772662989371136, 'samples': 4271424, 'steps': 22246, 'loss/train': 1.7262928485870361}}} 11/07/2021 00:22:01 - INFO - __main__ - Step 22252: {'lr': 0.0004772552422785376, 'samples': 4272384, 'steps': 22251, 'loss/train': 1.52297043800354}1}}} 11/07/2021 00:22:01 - INFO - __main__ - Step 22252: {'lr': 0.0004772552422785376, 'samples': 4272384, 'steps': 22251, 'loss/train': 1.52297043800354}1}}} 11/07/2021 00:22:01 - INFO - __main__ - Step 22252: {'lr': 0.0004772552422785376, 'samples': 4272384, 'steps': 22251, 'loss/train': 1.52297043800354}1}}} 11/07/2021 00:22:06 - INFO - __main__ - Step 22263: {'lr': 0.00047723090861884773, 'samples': 4274496, 'steps': 22262, 'loss/train': 1.3939660787582397}} 11/07/2021 00:22:09 - INFO - __main__ - Step 22268: {'lr': 0.0004772198437688938, 'samples': 4275456, 'steps': 22267, 'loss/train': 1.468030571937561}7}} 11/07/2021 00:22:11 - INFO - __main__ - Step 22273: {'lr': 0.00047720877635939606, 'samples': 4276416, 'steps': 22272, 'loss/train': 1.9079005718231201}} 11/07/2021 00:22:13 - INFO - __main__ - Step 22277: {'lr': 0.00047719992058901006, 'samples': 4277184, 'steps': 22276, 'loss/train': 1.6172411441802979}} 11/07/2021 00:22:13 - INFO - __main__ - Step 22277: {'lr': 0.00047719992058901006, 'samples': 4277184, 'steps': 22276, 'loss/train': 1.6172411441802979}} 11/07/2021 00:22:16 - INFO - __main__ - Step 22284: {'lr': 0.0004771844190495209, 'samples': 4278528, 'steps': 22283, 'loss/train': 1.577344298362732}9}} 11/07/2021 00:22:16 - INFO - __main__ - Step 22284: {'lr': 0.0004771844190495209, 'samples': 4278528, 'steps': 22283, 'loss/train': 1.577344298362732}9}} 11/07/2021 00:22:21 - INFO - __main__ - Step 22292: {'lr': 0.00047716669686246287, 'samples': 4280064, 'steps': 22291, 'loss/train': 0.9953264594078064}} 11/07/2021 00:22:22 - INFO - __main__ - Step 22296: {'lr': 0.0004771578333123145, 'samples': 4280832, 'steps': 22295, 'loss/train': 0.9274198412895203}}} 11/07/2021 00:22:24 - INFO - __main__ - Step 22300: {'lr': 0.00047714896812450514, 'samples': 4281600, 'steps': 22299, 'loss/train': 1.509248971939087}}} 11/07/2021 00:22:27 - INFO - __main__ - Step 22305: {'lr': 0.0004771378843368799, 'samples': 4282560, 'steps': 22304, 'loss/train': 1.4086192846298218}}} 11/07/2021 00:22:29 - INFO - __main__ - Step 22310: {'lr': 0.0004771267979906341, 'samples': 4283520, 'steps': 22309, 'loss/train': 1.9466007947921753}}} 11/07/2021 00:22:29 - INFO - __main__ - Step 22310: {'lr': 0.0004771267979906341, 'samples': 4283520, 'steps': 22309, 'loss/train': 1.9466007947921753}}} 11/07/2021 00:22:33 - INFO - __main__ - Step 22317: {'lr': 0.00047711127280764497, 'samples': 4284864, 'steps': 22316, 'loss/train': 0.9728535413742065}} 11/07/2021 00:22:34 - INFO - __main__ - Step 22321: {'lr': 0.00047710239902316404, 'samples': 4285632, 'steps': 22320, 'loss/train': 1.1445330381393433}} 11/07/2021 00:22:36 - INFO - __main__ - Step 22325: {'lr': 0.0004770935236014217, 'samples': 4286400, 'steps': 22324, 'loss/train': 1.6623705625534058}}} 11/07/2021 00:22:39 - INFO - __main__ - Step 22330: {'lr': 0.0004770824270219424, 'samples': 4287360, 'steps': 22329, 'loss/train': 1.3073426485061646}}} 11/07/2021 00:22:41 - INFO - __main__ - Step 22334: {'lr': 0.00047707354791659594, 'samples': 4288128, 'steps': 22333, 'loss/train': 2.013827085494995}}} 11/07/2021 00:22:41 - INFO - __main__ - Step 22334: {'lr': 0.00047707354791659594, 'samples': 4288128, 'steps': 22333, 'loss/train': 2.013827085494995}}} 11/07/2021 00:22:44 - INFO - __main__ - Step 22341: {'lr': 0.00047705800554311836, 'samples': 4289472, 'steps': 22340, 'loss/train': 1.812485694885254}}} 11/07/2021 00:22:47 - INFO - __main__ - Step 22346: {'lr': 0.00047704690077849223, 'samples': 4290432, 'steps': 22345, 'loss/train': 1.5357767343521118}} 11/07/2021 00:22:49 - INFO - __main__ - Step 22351: {'lr': 0.00047703579345627036, 'samples': 4291392, 'steps': 22350, 'loss/train': 1.2991282939910889}} 11/07/2021 00:22:51 - INFO - __main__ - Step 22355: {'lr': 0.00047702690575710796, 'samples': 4292160, 'steps': 22354, 'loss/train': 1.6376055479049683}} 11/07/2021 00:22:53 - INFO - __main__ - Step 22359: {'lr': 0.0004770180164212284, 'samples': 4292928, 'steps': 22358, 'loss/train': 2.1267189979553223}}} 11/07/2021 00:22:55 - INFO - __main__ - Step 22363: {'lr': 0.00047700912544869595, 'samples': 4293696, 'steps': 22362, 'loss/train': 0.9005606174468994}} 11/07/2021 00:22:57 - INFO - __main__ - Step 22367: {'lr': 0.0004770002328395745, 'samples': 4294464, 'steps': 22366, 'loss/train': 1.8009976148605347}}} 11/07/2021 00:22:57 - INFO - __main__ - Step 22367: {'lr': 0.0004770002328395745, 'samples': 4294464, 'steps': 22366, 'loss/train': 1.8009976148605347}}} 11/07/2021 00:23:01 - INFO - __main__ - Step 22375: {'lr': 0.0004769824427118211, 'samples': 4296000, 'steps': 22374, 'loss/train': 1.9812889099121094}}} 11/07/2021 00:23:02 - INFO - __main__ - Step 22379: {'lr': 0.0004769735451933176, 'samples': 4296768, 'steps': 22378, 'loss/train': 1.2897299528121948}}} 11/07/2021 00:23:04 - INFO - __main__ - Step 22383: {'lr': 0.0004769646460384816, 'samples': 4297536, 'steps': 22382, 'loss/train': 1.3800374269485474}}} 11/07/2021 00:23:07 - INFO - __main__ - Step 22388: {'lr': 0.00047695351979394173, 'samples': 4298496, 'steps': 22387, 'loss/train': 1.2966810464859009}} 11/07/2021 00:23:07 - INFO - __main__ - Step 22388: {'lr': 0.00047695351979394173, 'samples': 4298496, 'steps': 22387, 'loss/train': 1.2966810464859009}} 11/07/2021 00:23:11 - INFO - __main__ - Step 22396: {'lr': 0.000476935712485119, 'samples': 4300032, 'steps': 22395, 'loss/train': 1.6139127016067505}9}} 11/07/2021 00:23:13 - INFO - __main__ - Step 22400: {'lr': 0.0004769268063765861, 'samples': 4300800, 'steps': 22399, 'loss/train': 1.9369391202926636}}} 11/07/2021 00:23:15 - INFO - __main__ - Step 22404: {'lr': 0.00047691789863205764, 'samples': 4301568, 'steps': 22403, 'loss/train': 2.1965112686157227}} 11/07/2021 00:23:17 - INFO - __main__ - Step 22409: {'lr': 0.0004769067616508763, 'samples': 4302528, 'steps': 22408, 'loss/train': 2.1145200729370117}}} 11/07/2021 00:23:19 - INFO - __main__ - Step 22414: {'lr': 0.0004768956221136778, 'samples': 4303488, 'steps': 22413, 'loss/train': 1.2305755615234375}}} 11/07/2021 00:23:19 - INFO - __main__ - Step 22414: {'lr': 0.0004768956221136778, 'samples': 4303488, 'steps': 22413, 'loss/train': 1.2305755615234375}}} 11/07/2021 00:23:23 - INFO - __main__ - Step 22421: {'lr': 0.0004768800224677301, 'samples': 4304832, 'steps': 22420, 'loss/train': 1.4588857889175415}}} 11/07/2021 00:23:25 - INFO - __main__ - Step 22425: {'lr': 0.00047687110613527924, 'samples': 4305600, 'steps': 22424, 'loss/train': 2.046144485473633}}} 11/07/2021 00:23:27 - INFO - __main__ - Step 22429: {'lr': 0.0004768621881672345, 'samples': 4306368, 'steps': 22428, 'loss/train': 1.1368001699447632}}} 11/07/2021 00:23:29 - INFO - __main__ - Step 22435: {'lr': 0.0004768488081485695, 'samples': 4307520, 'steps': 22434, 'loss/train': 0.8506284952163696}}} 11/07/2021 00:23:32 - INFO - __main__ - Step 22439: {'lr': 0.0004768398860918213, 'samples': 4308288, 'steps': 22438, 'loss/train': 1.650249719619751}}}} 11/07/2021 00:23:34 - INFO - __main__ - Step 22443: {'lr': 0.00047683096239970423, 'samples': 4309056, 'steps': 22442, 'loss/train': 1.7382694482803345}} 11/07/2021 00:23:34 - INFO - __main__ - Step 22443: {'lr': 0.00047683096239970423, 'samples': 4309056, 'steps': 22442, 'loss/train': 1.7382694482803345}} 11/07/2021 00:23:37 - INFO - __main__ - Step 22450: {'lr': 0.00047681534200358665, 'samples': 4310400, 'steps': 22449, 'loss/train': 1.75339674949646}5}} 11/07/2021 00:23:39 - INFO - __main__ - Step 22455: {'lr': 0.0004768041815117835, 'samples': 4311360, 'steps': 22454, 'loss/train': 1.6310304403305054}}} 11/07/2021 00:23:42 - INFO - __main__ - Step 22460: {'lr': 0.0004767930184651187, 'samples': 4312320, 'steps': 22459, 'loss/train': 1.8718630075454712}}} 11/07/2021 00:23:42 - INFO - __main__ - Step 22460: {'lr': 0.0004767930184651187, 'samples': 4312320, 'steps': 22459, 'loss/train': 1.8718630075454712}}} 11/07/2021 00:23:45 - INFO - __main__ - Step 22467: {'lr': 0.00047677738590786, 'samples': 4313664, 'steps': 22466, 'loss/train': 0.24449460208415985}}}} 11/07/2021 00:23:47 - INFO - __main__ - Step 22471: {'lr': 0.00047676845076996305, 'samples': 4314432, 'steps': 22470, 'loss/train': 2.423990488052368}}} 11/07/2021 00:23:47 - INFO - __main__ - Step 22471: {'lr': 0.00047676845076996305, 'samples': 4314432, 'steps': 22470, 'loss/train': 2.423990488052368}}} 11/07/2021 00:23:51 - INFO - __main__ - Step 22479: {'lr': 0.00047675057558967224, 'samples': 4315968, 'steps': 22478, 'loss/train': 1.8550174236297607}} 11/07/2021 00:23:53 - INFO - __main__ - Step 22483: {'lr': 0.0004767416355474071, 'samples': 4316736, 'steps': 22482, 'loss/train': 1.7197669744491577}}} 11/07/2021 00:23:55 - INFO - __main__ - Step 22487: {'lr': 0.0004767326938704816, 'samples': 4317504, 'steps': 22486, 'loss/train': 1.2819911241531372}}} 11/07/2021 00:23:57 - INFO - __main__ - Step 22492: {'lr': 0.0004767215144756814, 'samples': 4318464, 'steps': 22491, 'loss/train': 2.750136137008667}}}} 11/07/2021 00:23:59 - INFO - __main__ - Step 22497: {'lr': 0.00047671033252695083, 'samples': 4319424, 'steps': 22496, 'loss/train': 1.8200469017028809}} 11/07/2021 00:24:02 - INFO - __main__ - Step 22501: {'lr': 0.0004767013851292212, 'samples': 4320192, 'steps': 22500, 'loss/train': 1.605943202972412}9}} 11/07/2021 00:24:02 - INFO - __main__ - Step 22501: {'lr': 0.0004767013851292212, 'samples': 4320192, 'steps': 22500, 'loss/train': 1.605943202972412}9}} 11/07/2021 00:24:05 - INFO - __main__ - Step 22508: {'lr': 0.00047668572325052953, 'samples': 4321536, 'steps': 22507, 'loss/train': 1.3067169189453125}} 11/07/2021 00:24:07 - INFO - __main__ - Step 22513: {'lr': 0.00047667453313006826, 'samples': 4322496, 'steps': 22512, 'loss/train': 1.282457947731018}}} 11/07/2021 00:24:10 - INFO - __main__ - Step 22518: {'lr': 0.0004766633404562059, 'samples': 4323456, 'steps': 22517, 'loss/train': 1.046881079673767}}}} 11/07/2021 00:24:12 - INFO - __main__ - Step 22522: {'lr': 0.00047665438447875186, 'samples': 4324224, 'steps': 22521, 'loss/train': 1.4796826839447021}} 11/07/2021 00:24:12 - INFO - __main__ - Step 22522: {'lr': 0.00047665438447875186, 'samples': 4324224, 'steps': 22521, 'loss/train': 1.4796826839447021}} 11/07/2021 00:24:15 - INFO - __main__ - Step 22529: {'lr': 0.000476638707586358, 'samples': 4325568, 'steps': 22528, 'loss/train': 1.7667334079742432}1}} 11/07/2021 00:24:17 - INFO - __main__ - Step 22534: {'lr': 0.0004766275067424593, 'samples': 4326528, 'steps': 22533, 'loss/train': 2.0930192470550537}}} 11/07/2021 00:24:20 - INFO - __main__ - Step 22539: {'lr': 0.0004766163033456891, 'samples': 4327488, 'steps': 22538, 'loss/train': 1.8392276763916016}}} 11/07/2021 00:24:22 - INFO - __main__ - Step 22543: {'lr': 0.0004766073387902904, 'samples': 4328256, 'steps': 22542, 'loss/train': 1.3626255989074707}}} 11/07/2021 00:24:22 - INFO - __main__ - Step 22543: {'lr': 0.0004766073387902904, 'samples': 4328256, 'steps': 22542, 'loss/train': 1.3626255989074707}}} 11/07/2021 00:24:25 - INFO - __main__ - Step 22550: {'lr': 0.00047659164688730935, 'samples': 4329600, 'steps': 22549, 'loss/train': 1.728060007095337}}} 11/07/2021 00:24:27 - INFO - __main__ - Step 22554: {'lr': 0.00047658267783941223, 'samples': 4330368, 'steps': 22553, 'loss/train': 1.8560858964920044}} 11/07/2021 00:24:27 - INFO - __main__ - Step 22554: {'lr': 0.00047658267783941223, 'samples': 4330368, 'steps': 22553, 'loss/train': 1.8560858964920044}} 11/07/2021 00:24:31 - INFO - __main__ - Step 22561: {'lr': 0.00047656697807498693, 'samples': 4331712, 'steps': 22560, 'loss/train': 0.23743750154972076} 11/07/2021 00:24:33 - INFO - __main__ - Step 22565: {'lr': 0.0004765580045350805, 'samples': 4332480, 'steps': 22564, 'loss/train': 1.3348045349121094}6} 11/07/2021 00:24:35 - INFO - __main__ - Step 22570: {'lr': 0.00047654678531332544, 'samples': 4333440, 'steps': 22569, 'loss/train': 1.0381836891174316}} 11/07/2021 00:24:35 - INFO - __main__ - Step 22570: {'lr': 0.00047654678531332544, 'samples': 4333440, 'steps': 22569, 'loss/train': 1.0381836891174316}} 11/07/2021 00:24:35 - INFO - __main__ - Step 22570: {'lr': 0.00047654678531332544, 'samples': 4333440, 'steps': 22569, 'loss/train': 1.0381836891174316}} 11/07/2021 00:24:41 - INFO - __main__ - Step 22580: {'lr': 0.00047652433921405526, 'samples': 4335360, 'steps': 22579, 'loss/train': 1.555201530456543}}} 11/07/2021 00:24:43 - INFO - __main__ - Step 22586: {'lr': 0.00047651086665514655, 'samples': 4336512, 'steps': 22585, 'loss/train': 1.6485368013381958}} 11/07/2021 00:24:45 - INFO - __main__ - Step 22590: {'lr': 0.0004765018829079479, 'samples': 4337280, 'steps': 22589, 'loss/train': 2.1266448497772217}}} 11/07/2021 00:24:45 - INFO - __main__ - Step 22590: {'lr': 0.0004765018829079479, 'samples': 4337280, 'steps': 22589, 'loss/train': 2.1266448497772217}}} 11/07/2021 00:24:49 - INFO - __main__ - Step 22597: {'lr': 0.0004764861574211465, 'samples': 4338624, 'steps': 22596, 'loss/train': 1.4135661125183105}}} 11/07/2021 00:24:49 - INFO - __main__ - Step 22597: {'lr': 0.0004764861574211465, 'samples': 4338624, 'steps': 22596, 'loss/train': 1.4135661125183105}}} 11/07/2021 00:24:53 - INFO - __main__ - Step 22606: {'lr': 0.0004764659315904807, 'samples': 4340352, 'steps': 22605, 'loss/train': 1.906373143196106}}}} 11/07/2021 00:24:55 - INFO - __main__ - Step 22610: {'lr': 0.0004764569396792697, 'samples': 4341120, 'steps': 22609, 'loss/train': 1.2064729928970337}}} 11/07/2021 00:24:57 - INFO - __main__ - Step 22614: {'lr': 0.00047644794613545065, 'samples': 4341888, 'steps': 22613, 'loss/train': 1.790145993232727}}} 11/07/2021 00:24:59 - INFO - __main__ - Step 22619: {'lr': 0.0004764367019099206, 'samples': 4342848, 'steps': 22618, 'loss/train': 1.383577823638916}}}} 11/07/2021 00:24:59 - INFO - __main__ - Step 22619: {'lr': 0.0004764367019099206, 'samples': 4342848, 'steps': 22618, 'loss/train': 1.383577823638916}}}} 11/07/2021 00:25:03 - INFO - __main__ - Step 22627: {'lr': 0.00047641870584362323, 'samples': 4344384, 'steps': 22626, 'loss/train': 1.7284650802612305}} 11/07/2021 00:25:05 - INFO - __main__ - Step 22631: {'lr': 0.0004764097053619435, 'samples': 4345152, 'steps': 22630, 'loss/train': 1.6884974241256714}}} 11/07/2021 00:25:07 - INFO - __main__ - Step 22635: {'lr': 0.0004764007032479963, 'samples': 4345920, 'steps': 22634, 'loss/train': 1.8320151567459106}}} 11/07/2021 00:25:09 - INFO - __main__ - Step 22640: {'lr': 0.00047638944831028497, 'samples': 4346880, 'steps': 22639, 'loss/train': 1.7990508079528809}} 11/07/2021 00:25:12 - INFO - __main__ - Step 22645: {'lr': 0.0004763781908223838, 'samples': 4347840, 'steps': 22644, 'loss/train': 1.46024489402771}09}} 11/07/2021 00:25:12 - INFO - __main__ - Step 22645: {'lr': 0.0004763781908223838, 'samples': 4347840, 'steps': 22644, 'loss/train': 1.46024489402771}09}} 11/07/2021 00:25:15 - INFO - __main__ - Step 22652: {'lr': 0.00047636242605524477, 'samples': 4349184, 'steps': 22651, 'loss/train': 1.3490198850631714}} 11/07/2021 00:25:17 - INFO - __main__ - Step 22656: {'lr': 0.00047635341537295814, 'samples': 4349952, 'steps': 22655, 'loss/train': 1.369418740272522}}} 11/07/2021 00:25:19 - INFO - __main__ - Step 22661: {'lr': 0.0004763421497253019, 'samples': 4350912, 'steps': 22660, 'loss/train': 1.1604344844818115}}} 11/07/2021 00:25:22 - INFO - __main__ - Step 22666: {'lr': 0.00047633088152798875, 'samples': 4351872, 'steps': 22665, 'loss/train': 1.3631956577301025}} 11/07/2021 00:25:22 - INFO - __main__ - Step 22666: {'lr': 0.00047633088152798875, 'samples': 4351872, 'steps': 22665, 'loss/train': 1.3631956577301025}} 11/07/2021 00:25:25 - INFO - __main__ - Step 22673: {'lr': 0.0004763151017685682, 'samples': 4353216, 'steps': 22672, 'loss/train': 1.4709689617156982}}} 11/07/2021 00:25:27 - INFO - __main__ - Step 22677: {'lr': 0.00047630608251973265, 'samples': 4353984, 'steps': 22676, 'loss/train': 2.0414798259735107}} 11/07/2021 00:25:30 - INFO - __main__ - Step 22682: {'lr': 0.0004762948061643702, 'samples': 4354944, 'steps': 22681, 'loss/train': 1.2838383913040161}}} 11/07/2021 00:25:32 - INFO - __main__ - Step 22686: {'lr': 0.00047628578324470505, 'samples': 4355712, 'steps': 22685, 'loss/train': 1.5135688781738281}} 11/07/2021 00:25:32 - INFO - __main__ - Step 22686: {'lr': 0.00047628578324470505, 'samples': 4355712, 'steps': 22685, 'loss/train': 1.5135688781738281}} 11/07/2021 00:25:32 - INFO - __main__ - Step 22686: {'lr': 0.00047628578324470505, 'samples': 4355712, 'steps': 22685, 'loss/train': 1.5135688781738281}} 11/07/2021 00:25:37 - INFO - __main__ - Step 22697: {'lr': 0.00047626096180404895, 'samples': 4357824, 'steps': 22696, 'loss/train': 0.8824257850646973}} 11/07/2021 00:25:37 - INFO - __main__ - Step 22697: {'lr': 0.00047626096180404895, 'samples': 4357824, 'steps': 22696, 'loss/train': 0.8824257850646973}} 11/07/2021 00:25:42 - INFO - __main__ - Step 22706: {'lr': 0.00047624064417706917, 'samples': 4359552, 'steps': 22705, 'loss/train': 1.3341710567474365}} 11/07/2021 00:25:44 - INFO - __main__ - Step 22710: {'lr': 0.00047623161147013557, 'samples': 4360320, 'steps': 22709, 'loss/train': 1.6129733324050903}} 11/07/2021 00:25:45 - INFO - __main__ - Step 22714: {'lr': 0.00047622257713221826, 'samples': 4361088, 'steps': 22713, 'loss/train': 1.5603699684143066}} 11/07/2021 00:25:48 - INFO - __main__ - Step 22718: {'lr': 0.0004762135411633827, 'samples': 4361856, 'steps': 22717, 'loss/train': 1.6967964172363281}}} 11/07/2021 00:25:50 - INFO - __main__ - Step 22723: {'lr': 0.0004762022439089583, 'samples': 4362816, 'steps': 22722, 'loss/train': 1.5845859050750732}}} 11/07/2021 00:25:52 - INFO - __main__ - Step 22727: {'lr': 0.00047619320427079437, 'samples': 4363584, 'steps': 22726, 'loss/train': 1.7012308835983276}} 11/07/2021 00:25:54 - INFO - __main__ - Step 22731: {'lr': 0.00047618416300192375, 'samples': 4364352, 'steps': 22730, 'loss/train': 1.7706865072250366}} 11/07/2021 00:25:56 - INFO - __main__ - Step 22735: {'lr': 0.0004761751201024116, 'samples': 4365120, 'steps': 22734, 'loss/train': 1.7968497276306152}}} 11/07/2021 00:25:58 - INFO - __main__ - Step 22740: {'lr': 0.0004761638141850312, 'samples': 4366080, 'steps': 22739, 'loss/train': 0.6306785345077515}}} 11/07/2021 00:25:58 - INFO - __main__ - Step 22740: {'lr': 0.0004761638141850312, 'samples': 4366080, 'steps': 22739, 'loss/train': 0.6306785345077515}}} 11/07/2021 00:26:01 - INFO - __main__ - Step 22747: {'lr': 0.0004761479816206783, 'samples': 4367424, 'steps': 22746, 'loss/train': 1.8603065013885498}}} 11/07/2021 00:26:03 - INFO - __main__ - Step 22751: {'lr': 0.00047613893219925217, 'samples': 4368192, 'steps': 22750, 'loss/train': 1.6966766119003296}} 11/07/2021 00:26:06 - INFO - __main__ - Step 22756: {'lr': 0.00047612761812984626, 'samples': 4369152, 'steps': 22755, 'loss/train': 1.803318738937378}}} 11/07/2021 00:26:08 - INFO - __main__ - Step 22761: {'lr': 0.0004761163015131999, 'samples': 4370112, 'steps': 22760, 'loss/train': 1.2533822059631348}}} 11/07/2021 00:26:08 - INFO - __main__ - Step 22761: {'lr': 0.0004761163015131999, 'samples': 4370112, 'steps': 22760, 'loss/train': 1.2533822059631348}}} 11/07/2021 00:26:12 - INFO - __main__ - Step 22768: {'lr': 0.0004761004539707739, 'samples': 4371456, 'steps': 22767, 'loss/train': 1.0878734588623047}}} 11/07/2021 00:26:14 - INFO - __main__ - Step 22772: {'lr': 0.00047609139599092006, 'samples': 4372224, 'steps': 22771, 'loss/train': 1.7285878658294678}} 11/07/2021 00:26:14 - INFO - __main__ - Step 22772: {'lr': 0.00047609139599092006, 'samples': 4372224, 'steps': 22771, 'loss/train': 1.7285878658294678}} 11/07/2021 00:26:18 - INFO - __main__ - Step 22780: {'lr': 0.00047607327514135955, 'samples': 4373760, 'steps': 22779, 'loss/train': 1.6470462083816528}} 11/07/2021 00:26:20 - INFO - __main__ - Step 22784: {'lr': 0.00047606421227178354, 'samples': 4374528, 'steps': 22783, 'loss/train': 1.9405661821365356}} 11/07/2021 00:26:21 - INFO - __main__ - Step 22788: {'lr': 0.00047605514777243076, 'samples': 4375296, 'steps': 22787, 'loss/train': 1.6153793334960938}} 11/07/2021 00:26:23 - INFO - __main__ - Step 22792: {'lr': 0.0004760460816433666, 'samples': 4376064, 'steps': 22791, 'loss/train': 1.8664559125900269}}} 11/07/2021 00:26:26 - INFO - __main__ - Step 22797: {'lr': 0.0004760347466903544, 'samples': 4377024, 'steps': 22796, 'loss/train': 1.2284456491470337}}} 11/07/2021 00:26:28 - INFO - __main__ - Step 22801: {'lr': 0.0004760256768946787, 'samples': 4377792, 'steps': 22800, 'loss/train': 1.7655497789382935}}} 11/07/2021 00:26:30 - INFO - __main__ - Step 22805: {'lr': 0.00047601660546950396, 'samples': 4378560, 'steps': 22804, 'loss/train': 1.632975697517395}}} 11/07/2021 00:26:31 - INFO - __main__ - Step 22809: {'lr': 0.0004760075324148959, 'samples': 4379328, 'steps': 22808, 'loss/train': 1.4321844577789307}}} 11/07/2021 00:26:33 - INFO - __main__ - Step 22813: {'lr': 0.00047599845773091957, 'samples': 4380096, 'steps': 22812, 'loss/train': 1.912548303604126}}} 11/07/2021 00:26:36 - INFO - __main__ - Step 22818: {'lr': 0.00047598711208475, 'samples': 4381056, 'steps': 22817, 'loss/train': 1.697596549987793}26}}} 11/07/2021 00:26:36 - INFO - __main__ - Step 22818: {'lr': 0.00047598711208475, 'samples': 4381056, 'steps': 22817, 'loss/train': 1.697596549987793}26}}} 11/07/2021 00:26:39 - INFO - __main__ - Step 22825: {'lr': 0.0004759712239034364, 'samples': 4382400, 'steps': 22824, 'loss/train': 2.0725107192993164}}} 11/07/2021 00:26:41 - INFO - __main__ - Step 22829: {'lr': 0.00047596214270264204, 'samples': 4383168, 'steps': 22828, 'loss/train': 1.3580565452575684}} 11/07/2021 00:26:44 - INFO - __main__ - Step 22834: {'lr': 0.000475950788910818, 'samples': 4384128, 'steps': 22833, 'loss/train': 1.3071024417877197}4}} 11/07/2021 00:26:44 - INFO - __main__ - Step 22834: {'lr': 0.000475950788910818, 'samples': 4384128, 'steps': 22833, 'loss/train': 1.3071024417877197}4}} 11/07/2021 00:26:48 - INFO - __main__ - Step 22842: {'lr': 0.00047593261754983607, 'samples': 4385664, 'steps': 22841, 'loss/train': 1.7863192558288574}} 11/07/2021 00:26:49 - INFO - __main__ - Step 22846: {'lr': 0.0004759235294260703, 'samples': 4386432, 'steps': 22845, 'loss/train': 1.771543264389038}4}} 11/07/2021 00:26:51 - INFO - __main__ - Step 22850: {'lr': 0.00047591443967354196, 'samples': 4387200, 'steps': 22849, 'loss/train': 1.6243668794631958}} 11/07/2021 00:26:54 - INFO - __main__ - Step 22855: {'lr': 0.00047590307519253423, 'samples': 4388160, 'steps': 22854, 'loss/train': 1.9432487487792969}} 11/07/2021 00:26:56 - INFO - __main__ - Step 22859: {'lr': 0.0004758939817755299, 'samples': 4388928, 'steps': 22858, 'loss/train': 1.5929142236709595}}} 11/07/2021 00:26:56 - INFO - __main__ - Step 22859: {'lr': 0.0004758939817755299, 'samples': 4388928, 'steps': 22858, 'loss/train': 1.5929142236709595}}} 11/07/2021 00:27:00 - INFO - __main__ - Step 22867: {'lr': 0.0004758757900559385, 'samples': 4390464, 'steps': 22866, 'loss/train': 0.689471960067749}}}} 11/07/2021 00:27:01 - INFO - __main__ - Step 22871: {'lr': 0.00047586669175348254, 'samples': 4391232, 'steps': 22870, 'loss/train': 1.5636752843856812}} 11/07/2021 00:27:04 - INFO - __main__ - Step 22876: {'lr': 0.0004758553165855492, 'samples': 4392192, 'steps': 22875, 'loss/train': 1.644388198852539}2}} 11/07/2021 00:27:04 - INFO - __main__ - Step 22876: {'lr': 0.0004758553165855492, 'samples': 4392192, 'steps': 22875, 'loss/train': 1.644388198852539}2}} 11/07/2021 00:27:08 - INFO - __main__ - Step 22884: {'lr': 0.00047583711102502934, 'samples': 4393728, 'steps': 22883, 'loss/train': 1.5916904211044312}} 11/07/2021 00:27:10 - INFO - __main__ - Step 22888: {'lr': 0.0004758280058025274, 'samples': 4394496, 'steps': 22887, 'loss/train': 0.9584725499153137}}} 11/07/2021 00:27:11 - INFO - __main__ - Step 22892: {'lr': 0.00047581889895195154, 'samples': 4395264, 'steps': 22891, 'loss/train': 1.7565606832504272}} 11/07/2021 00:27:14 - INFO - __main__ - Step 22896: {'lr': 0.0004758097904733676, 'samples': 4396032, 'steps': 22895, 'loss/train': 1.64580500125885}72}} 11/07/2021 00:27:16 - INFO - __main__ - Step 22902: {'lr': 0.0004757961247031199, 'samples': 4397184, 'steps': 22901, 'loss/train': 1.7332442998886108}}} 11/07/2021 00:27:18 - INFO - __main__ - Step 22906: {'lr': 0.0004757870121548028, 'samples': 4397952, 'steps': 22905, 'loss/train': 1.6508798599243164}}} 11/07/2021 00:27:18 - INFO - __main__ - Step 22906: {'lr': 0.0004757870121548028, 'samples': 4397952, 'steps': 22905, 'loss/train': 1.6508798599243164}}} 11/07/2021 00:27:22 - INFO - __main__ - Step 22913: {'lr': 0.0004757710612784458, 'samples': 4399296, 'steps': 22912, 'loss/train': 1.82514488697052}4}}} 11/07/2021 00:27:24 - INFO - __main__ - Step 22917: {'lr': 0.00047576194425389654, 'samples': 4400064, 'steps': 22916, 'loss/train': 1.6657871007919312}} 11/07/2021 00:27:26 - INFO - __main__ - Step 22922: {'lr': 0.00047575054568440846, 'samples': 4401024, 'steps': 22921, 'loss/train': 1.4380706548690796}} 11/07/2021 00:27:29 - INFO - __main__ - Step 22927: {'lr': 0.0004757391445719277, 'samples': 4401984, 'steps': 22926, 'loss/train': 1.4672082662582397}}} 11/07/2021 00:27:29 - INFO - __main__ - Step 22927: {'lr': 0.0004757391445719277, 'samples': 4401984, 'steps': 22926, 'loss/train': 1.4672082662582397}}} 11/07/2021 00:27:32 - INFO - __main__ - Step 22934: {'lr': 0.00047572317874247107, 'samples': 4403328, 'steps': 22933, 'loss/train': 1.5558907985687256}} 11/07/2021 00:27:34 - INFO - __main__ - Step 22938: {'lr': 0.00047571405317376803, 'samples': 4404096, 'steps': 22937, 'loss/train': 1.4291313886642456}} 11/07/2021 00:27:36 - INFO - __main__ - Step 22943: {'lr': 0.0004757026439245735, 'samples': 4405056, 'steps': 22942, 'loss/train': 1.5172832012176514}}} 11/07/2021 00:27:39 - INFO - __main__ - Step 22948: {'lr': 0.0004756912321329256, 'samples': 4406016, 'steps': 22947, 'loss/train': 1.4924403429031372}}} 11/07/2021 00:27:39 - INFO - __main__ - Step 22948: {'lr': 0.0004756912321329256, 'samples': 4406016, 'steps': 22947, 'loss/train': 1.4924403429031372}}} 11/07/2021 00:27:42 - INFO - __main__ - Step 22954: {'lr': 0.00047567753462709095, 'samples': 4407168, 'steps': 22953, 'loss/train': 1.7961945533752441}} 11/07/2021 00:27:42 - INFO - __main__ - Step 22954: {'lr': 0.00047567753462709095, 'samples': 4407168, 'steps': 22953, 'loss/train': 1.7961945533752441}} 11/07/2021 00:27:47 - INFO - __main__ - Step 22963: {'lr': 0.00047565698150454845, 'samples': 4408896, 'steps': 22962, 'loss/train': 1.1223114728927612}} 11/07/2021 00:27:48 - INFO - __main__ - Step 22967: {'lr': 0.0004756478441397575, 'samples': 4409664, 'steps': 22966, 'loss/train': 1.8136224746704102}}} 11/07/2021 00:27:50 - INFO - __main__ - Step 22971: {'lr': 0.00047563870514819154, 'samples': 4410432, 'steps': 22970, 'loss/train': 0.8799536824226379}} 11/07/2021 00:27:52 - INFO - __main__ - Step 22976: {'lr': 0.00047562727912118206, 'samples': 4411392, 'steps': 22975, 'loss/train': 1.6678000688552856}} 11/07/2021 00:27:55 - INFO - __main__ - Step 22981: {'lr': 0.0004756158505525684, 'samples': 4412352, 'steps': 22980, 'loss/train': 1.1839544773101807}}} 11/07/2021 00:27:55 - INFO - __main__ - Step 22981: {'lr': 0.0004756158505525684, 'samples': 4412352, 'steps': 22980, 'loss/train': 1.1839544773101807}}} 11/07/2021 00:27:59 - INFO - __main__ - Step 22988: {'lr': 0.0004755998462868592, 'samples': 4413696, 'steps': 22987, 'loss/train': 1.766256332397461}}}} 11/07/2021 00:28:00 - INFO - __main__ - Step 22992: {'lr': 0.00047559069875580573, 'samples': 4414464, 'steps': 22991, 'loss/train': 1.68605637550354}}}} 11/07/2021 00:28:00 - INFO - __main__ - Step 22992: {'lr': 0.00047559069875580573, 'samples': 4414464, 'steps': 22991, 'loss/train': 1.68605637550354}}}} 11/07/2021 00:28:04 - INFO - __main__ - Step 23000: {'lr': 0.00047557239881467584, 'samples': 4416000, 'steps': 22999, 'loss/train': 1.4137681722640991}} 11/07/2021 00:28:06 - INFO - __main__ - Step 23004: {'lr': 0.00047556324640473134, 'samples': 4416768, 'steps': 23003, 'loss/train': 1.8332017660140991}} 11/07/2021 00:28:08 - INFO - __main__ - Step 23008: {'lr': 0.0004755540923686217, 'samples': 4417536, 'steps': 23007, 'loss/train': 1.5133399963378906}}} 11/07/2021 00:28:10 - INFO - __main__ - Step 23013: {'lr': 0.0004755426475367905, 'samples': 4418496, 'steps': 23012, 'loss/train': 1.3803929090499878}}} 11/07/2021 00:28:10 - INFO - __main__ - Step 23013: {'lr': 0.0004755426475367905, 'samples': 4418496, 'steps': 23012, 'loss/train': 1.3803929090499878}}} 11/07/2021 00:28:14 - INFO - __main__ - Step 23021: {'lr': 0.00047552433052136034, 'samples': 4420032, 'steps': 23020, 'loss/train': 1.6080424785614014}} 11/07/2021 00:28:16 - INFO - __main__ - Step 23025: {'lr': 0.00047551516957478545, 'samples': 4420800, 'steps': 23024, 'loss/train': 1.6900150775909424}} 11/07/2021 00:28:18 - INFO - __main__ - Step 23029: {'lr': 0.0004755060070023921, 'samples': 4421568, 'steps': 23028, 'loss/train': 0.19474640488624573}} 11/07/2021 00:28:20 - INFO - __main__ - Step 23034: {'lr': 0.0004754945515006938, 'samples': 4422528, 'steps': 23033, 'loss/train': 1.348042368888855}3}} 11/07/2021 00:28:20 - INFO - __main__ - Step 23034: {'lr': 0.0004754945515006938, 'samples': 4422528, 'steps': 23033, 'loss/train': 1.348042368888855}3}} 11/07/2021 00:28:24 - INFO - __main__ - Step 23042: {'lr': 0.0004754762174146032, 'samples': 4424064, 'steps': 23041, 'loss/train': 1.841143012046814}3}} 11/07/2021 00:28:26 - INFO - __main__ - Step 23046: {'lr': 0.00047546704793321835, 'samples': 4424832, 'steps': 23045, 'loss/train': 1.5525362491607666}} 11/07/2021 00:28:29 - INFO - __main__ - Step 23051: {'lr': 0.00047545558379567565, 'samples': 4425792, 'steps': 23050, 'loss/train': 1.7283782958984375}} 11/07/2021 00:28:31 - INFO - __main__ - Step 23055: {'lr': 0.0004754464106570727, 'samples': 4426560, 'steps': 23054, 'loss/train': 1.412814974784851}5}} 11/07/2021 00:28:33 - INFO - __main__ - Step 23059: {'lr': 0.0004754372358931471, 'samples': 4427328, 'steps': 23058, 'loss/train': 1.6849995851516724}}} 11/07/2021 00:28:34 - INFO - __main__ - Step 23063: {'lr': 0.00047542805950396476, 'samples': 4428096, 'steps': 23062, 'loss/train': 1.4607219696044922}} 11/07/2021 00:28:36 - INFO - __main__ - Step 23067: {'lr': 0.000475418881489592, 'samples': 4428864, 'steps': 23066, 'loss/train': 1.926710605621338}22}} 11/07/2021 00:28:39 - INFO - __main__ - Step 23072: {'lr': 0.0004754074066863027, 'samples': 4429824, 'steps': 23071, 'loss/train': 1.2372655868530273}}} 11/07/2021 00:28:41 - INFO - __main__ - Step 23076: {'lr': 0.0004753982250154933, 'samples': 4430592, 'steps': 23075, 'loss/train': 1.4330766201019287}}} 11/07/2021 00:28:43 - INFO - __main__ - Step 23080: {'lr': 0.00047538904171970847, 'samples': 4431360, 'steps': 23079, 'loss/train': 1.5698622465133667}} 11/07/2021 00:28:43 - INFO - __main__ - Step 23080: {'lr': 0.00047538904171970847, 'samples': 4431360, 'steps': 23079, 'loss/train': 1.5698622465133667}} 11/07/2021 00:28:46 - INFO - __main__ - Step 23087: {'lr': 0.0004753729670421871, 'samples': 4432704, 'steps': 23086, 'loss/train': 1.9568339586257935}}} 11/07/2021 00:28:49 - INFO - __main__ - Step 23092: {'lr': 0.0004753614820831638, 'samples': 4433664, 'steps': 23091, 'loss/train': 1.3575159311294556}}} 11/07/2021 00:28:49 - INFO - __main__ - Step 23092: {'lr': 0.0004753614820831638, 'samples': 4433664, 'steps': 23091, 'loss/train': 1.3575159311294556}}} 11/07/2021 00:28:53 - INFO - __main__ - Step 23100: {'lr': 0.00047534310086847116, 'samples': 4435200, 'steps': 23099, 'loss/train': 1.7464089393615723}} 11/07/2021 00:28:54 - INFO - __main__ - Step 23104: {'lr': 0.0004753339078242247, 'samples': 4435968, 'steps': 23103, 'loss/train': 1.5717073678970337}}} 11/07/2021 00:28:56 - INFO - __main__ - Step 23108: {'lr': 0.00047532471315546654, 'samples': 4436736, 'steps': 23107, 'loss/train': 1.3140676021575928}} 11/07/2021 00:28:59 - INFO - __main__ - Step 23113: {'lr': 0.00047531321753515026, 'samples': 4437696, 'steps': 23112, 'loss/train': 1.4464131593704224}} 11/07/2021 00:29:01 - INFO - __main__ - Step 23117: {'lr': 0.0004753040192114831, 'samples': 4438464, 'steps': 23116, 'loss/train': 1.6963107585906982}}} 11/07/2021 00:29:01 - INFO - __main__ - Step 23117: {'lr': 0.0004753040192114831, 'samples': 4438464, 'steps': 23116, 'loss/train': 1.6963107585906982}}} 11/07/2021 00:29:04 - INFO - __main__ - Step 23124: {'lr': 0.0004752879182366429, 'samples': 4439808, 'steps': 23123, 'loss/train': 1.7153961658477783}}} 11/07/2021 00:29:07 - INFO - __main__ - Step 23129: {'lr': 0.0004752764144949698, 'samples': 4440768, 'steps': 23128, 'loss/train': 1.5948010683059692}}} 11/07/2021 00:29:09 - INFO - __main__ - Step 23133: {'lr': 0.00047526720967451573, 'samples': 4441536, 'steps': 23132, 'loss/train': 1.695439100265503}}} 11/07/2021 00:29:09 - INFO - __main__ - Step 23133: {'lr': 0.00047526720967451573, 'samples': 4441536, 'steps': 23132, 'loss/train': 1.695439100265503}}} 11/07/2021 00:29:12 - INFO - __main__ - Step 23140: {'lr': 0.0004752510973309369, 'samples': 4442880, 'steps': 23139, 'loss/train': 1.5068085193634033}}} 11/07/2021 00:29:14 - INFO - __main__ - Step 23144: {'lr': 0.00047524188804455776, 'samples': 4443648, 'steps': 23143, 'loss/train': 1.1105417013168335}} 11/07/2021 00:29:17 - INFO - __main__ - Step 23149: {'lr': 0.00047523037415305494, 'samples': 4444608, 'steps': 23148, 'loss/train': 1.4677647352218628}} 11/07/2021 00:29:19 - INFO - __main__ - Step 23153: {'lr': 0.0004752211612131104, 'samples': 4445376, 'steps': 23152, 'loss/train': 0.6896765828132629}}} 11/07/2021 00:29:21 - INFO - __main__ - Step 23157: {'lr': 0.0004752119466494671, 'samples': 4446144, 'steps': 23156, 'loss/train': 1.9892706871032715}}} 11/07/2021 00:29:22 - INFO - __main__ - Step 23161: {'lr': 0.0004752027304621913, 'samples': 4446912, 'steps': 23160, 'loss/train': 0.3823089003562927}}} 11/07/2021 00:29:24 - INFO - __main__ - Step 23165: {'lr': 0.00047519351265134954, 'samples': 4447680, 'steps': 23164, 'loss/train': 1.934503197669983}}} 11/07/2021 00:29:27 - INFO - __main__ - Step 23170: {'lr': 0.00047518198810475885, 'samples': 4448640, 'steps': 23169, 'loss/train': 1.5390900373458862}} 11/07/2021 00:29:29 - INFO - __main__ - Step 23174: {'lr': 0.00047517276664113653, 'samples': 4449408, 'steps': 23173, 'loss/train': 1.5295782089233398}} 11/07/2021 00:29:31 - INFO - __main__ - Step 23178: {'lr': 0.00047516354355416426, 'samples': 4450176, 'steps': 23177, 'loss/train': 1.5092757940292358}} 11/07/2021 00:29:33 - INFO - __main__ - Step 23182: {'lr': 0.00047515431884390845, 'samples': 4450944, 'steps': 23181, 'loss/train': 1.668994665145874}}} 11/07/2021 00:29:34 - INFO - __main__ - Step 23186: {'lr': 0.0004751450925104357, 'samples': 4451712, 'steps': 23185, 'loss/train': 1.8284626007080078}}} 11/07/2021 00:29:37 - INFO - __main__ - Step 23191: {'lr': 0.00047513355731104717, 'samples': 4452672, 'steps': 23190, 'loss/train': 1.6222933530807495}} 11/07/2021 00:29:39 - INFO - __main__ - Step 23195: {'lr': 0.0004751243273255794, 'samples': 4453440, 'steps': 23194, 'loss/train': 1.7216869592666626}}} 11/07/2021 00:29:41 - INFO - __main__ - Step 23199: {'lr': 0.00047511509571711085, 'samples': 4454208, 'steps': 23198, 'loss/train': 1.2735344171524048}} 11/07/2021 00:29:43 - INFO - __main__ - Step 23203: {'lr': 0.00047510586248570815, 'samples': 4454976, 'steps': 23202, 'loss/train': 1.2748398780822754}} 11/07/2021 00:29:44 - INFO - __main__ - Step 23207: {'lr': 0.00047509662763143775, 'samples': 4455744, 'steps': 23206, 'loss/train': 1.393203854560852}}} 11/07/2021 00:29:46 - INFO - __main__ - Step 23211: {'lr': 0.0004750873911543663, 'samples': 4456512, 'steps': 23210, 'loss/train': 1.6986215114593506}}} 11/07/2021 00:29:49 - INFO - __main__ - Step 23216: {'lr': 0.0004750758432760644, 'samples': 4457472, 'steps': 23215, 'loss/train': 1.3327617645263672}}} 11/07/2021 00:29:51 - INFO - __main__ - Step 23220: {'lr': 0.000475066603147934, 'samples': 4458240, 'steps': 23219, 'loss/train': 1.4702428579330444}}}} 11/07/2021 00:29:53 - INFO - __main__ - Step 23224: {'lr': 0.000475057361397219, 'samples': 4459008, 'steps': 23223, 'loss/train': 1.4242885112762451}}}} 11/07/2021 00:29:55 - INFO - __main__ - Step 23228: {'lr': 0.00047504811802398603, 'samples': 4459776, 'steps': 23227, 'loss/train': 0.9568268656730652}} 11/07/2021 00:29:57 - INFO - __main__ - Step 23232: {'lr': 0.0004750388730283016, 'samples': 4460544, 'steps': 23231, 'loss/train': 1.7405587434768677}}} 11/07/2021 00:29:57 - INFO - __main__ - Step 23232: {'lr': 0.0004750388730283016, 'samples': 4460544, 'steps': 23231, 'loss/train': 1.7405587434768677}}} 11/07/2021 00:29:57 - INFO - __main__ - Step 23232: {'lr': 0.0004750388730283016, 'samples': 4460544, 'steps': 23231, 'loss/train': 1.7405587434768677}}} 11/07/2021 00:30:02 - INFO - __main__ - Step 23243: {'lr': 0.00047501344092494915, 'samples': 4462656, 'steps': 23242, 'loss/train': 1.8841161727905273}} 11/07/2021 00:30:05 - INFO - __main__ - Step 23248: {'lr': 0.000475001876822384, 'samples': 4463616, 'steps': 23247, 'loss/train': 1.821009635925293}73}} 11/07/2021 00:30:07 - INFO - __main__ - Step 23253: {'lr': 0.00047499031018525953, 'samples': 4464576, 'steps': 23252, 'loss/train': 0.6648880839347839}} 11/07/2021 00:30:09 - INFO - __main__ - Step 23257: {'lr': 0.00047498105505076475, 'samples': 4465344, 'steps': 23256, 'loss/train': 1.2903780937194824}} 11/07/2021 00:30:11 - INFO - __main__ - Step 23261: {'lr': 0.00047497179829430217, 'samples': 4466112, 'steps': 23260, 'loss/train': 1.524401307106018}}} 11/07/2021 00:30:13 - INFO - __main__ - Step 23265: {'lr': 0.0004749625399159384, 'samples': 4466880, 'steps': 23264, 'loss/train': 0.34565478563308716}} 11/07/2021 00:30:15 - INFO - __main__ - Step 23269: {'lr': 0.00047495327991574034, 'samples': 4467648, 'steps': 23268, 'loss/train': 1.6375641822814941}} 11/07/2021 00:30:17 - INFO - __main__ - Step 23274: {'lr': 0.0004749417026348897, 'samples': 4468608, 'steps': 23273, 'loss/train': 1.6376235485076904}}} 11/07/2021 00:30:19 - INFO - __main__ - Step 23278: {'lr': 0.0004749324389858083, 'samples': 4469376, 'steps': 23277, 'loss/train': 1.7561227083206177}}} 11/07/2021 00:30:19 - INFO - __main__ - Step 23278: {'lr': 0.0004749324389858083, 'samples': 4469376, 'steps': 23277, 'loss/train': 1.7561227083206177}}} 11/07/2021 00:30:23 - INFO - __main__ - Step 23285: {'lr': 0.0004749162236979393, 'samples': 4470720, 'steps': 23284, 'loss/train': 1.2017252445220947}}} 11/07/2021 00:30:25 - INFO - __main__ - Step 23290: {'lr': 0.00047490463830912713, 'samples': 4471680, 'steps': 23289, 'loss/train': 1.5626459121704102}} 11/07/2021 00:30:27 - INFO - __main__ - Step 23294: {'lr': 0.00047489536817397706, 'samples': 4472448, 'steps': 23293, 'loss/train': 1.6595637798309326}} 11/07/2021 00:30:29 - INFO - __main__ - Step 23298: {'lr': 0.0004748860964174768, 'samples': 4473216, 'steps': 23297, 'loss/train': 1.3847980499267578}}} 11/07/2021 00:30:29 - INFO - __main__ - Step 23298: {'lr': 0.0004748860964174768, 'samples': 4473216, 'steps': 23297, 'loss/train': 1.3847980499267578}}} 11/07/2021 00:30:32 - INFO - __main__ - Step 23305: {'lr': 0.00047486986694242887, 'samples': 4474560, 'steps': 23304, 'loss/train': 1.4229025840759277}} 11/07/2021 00:30:35 - INFO - __main__ - Step 23311: {'lr': 0.0004748559520122099, 'samples': 4475712, 'steps': 23310, 'loss/train': 1.7451789379119873}}} 11/07/2021 00:30:35 - INFO - __main__ - Step 23311: {'lr': 0.0004748559520122099, 'samples': 4475712, 'steps': 23310, 'loss/train': 1.7451789379119873}}} 11/07/2021 00:30:39 - INFO - __main__ - Step 23318: {'lr': 0.000474839713317064, 'samples': 4477056, 'steps': 23317, 'loss/train': 1.7453724145889282}}}} 11/07/2021 00:30:39 - INFO - __main__ - Step 23318: {'lr': 0.000474839713317064, 'samples': 4477056, 'steps': 23317, 'loss/train': 1.7453724145889282}}}} 11/07/2021 00:30:43 - INFO - __main__ - Step 23326: {'lr': 0.0004748211487297884, 'samples': 4478592, 'steps': 23325, 'loss/train': 1.8467061519622803}}} 11/07/2021 00:30:45 - INFO - __main__ - Step 23331: {'lr': 0.00047480954257042666, 'samples': 4479552, 'steps': 23330, 'loss/train': 1.509041428565979}}} 11/07/2021 00:30:48 - INFO - __main__ - Step 23336: {'lr': 0.0004747979338786721, 'samples': 4480512, 'steps': 23335, 'loss/train': 1.405137062072754}}}} 11/07/2021 00:30:48 - INFO - __main__ - Step 23336: {'lr': 0.0004747979338786721, 'samples': 4480512, 'steps': 23335, 'loss/train': 1.405137062072754}}}} 11/07/2021 00:30:51 - INFO - __main__ - Step 23343: {'lr': 0.00047478167745604495, 'samples': 4481856, 'steps': 23342, 'loss/train': 1.4305206537246704}} 11/07/2021 00:30:53 - INFO - __main__ - Step 23347: {'lr': 0.00047477238584343407, 'samples': 4482624, 'steps': 23346, 'loss/train': 1.2408764362335205}} 11/07/2021 00:30:55 - INFO - __main__ - Step 23352: {'lr': 0.0004747607690489015, 'samples': 4483584, 'steps': 23351, 'loss/train': 1.5466938018798828}}} 11/07/2021 00:30:58 - INFO - __main__ - Step 23357: {'lr': 0.0004747491497225257, 'samples': 4484544, 'steps': 23356, 'loss/train': 1.5523254871368408}}} 11/07/2021 00:30:58 - INFO - __main__ - Step 23357: {'lr': 0.0004747491497225257, 'samples': 4484544, 'steps': 23356, 'loss/train': 1.5523254871368408}}} 11/07/2021 00:30:58 - INFO - __main__ - Step 23357: {'lr': 0.0004747491497225257, 'samples': 4484544, 'steps': 23356, 'loss/train': 1.5523254871368408}}} 11/07/2021 00:31:03 - INFO - __main__ - Step 23367: {'lr': 0.0004747259034747675, 'samples': 4486464, 'steps': 23366, 'loss/train': 1.718819499015808}}}} 11/07/2021 00:31:05 - INFO - __main__ - Step 23372: {'lr': 0.000474714276553647, 'samples': 4487424, 'steps': 23371, 'loss/train': 1.8126929998397827}}}} 11/07/2021 00:31:05 - INFO - __main__ - Step 23372: {'lr': 0.000474714276553647, 'samples': 4487424, 'steps': 23371, 'loss/train': 1.8126929998397827}}}} 11/07/2021 00:31:09 - INFO - __main__ - Step 23380: {'lr': 0.000474695668214764, 'samples': 4488960, 'steps': 23379, 'loss/train': 1.5567258596420288}}}} 11/07/2021 00:31:10 - INFO - __main__ - Step 23384: {'lr': 0.00047468636161542325, 'samples': 4489728, 'steps': 23383, 'loss/train': 1.4994603395462036}} 11/07/2021 00:31:13 - INFO - __main__ - Step 23388: {'lr': 0.0004746770533962391, 'samples': 4490496, 'steps': 23387, 'loss/train': 1.6473876237869263}}} 11/07/2021 00:31:15 - INFO - __main__ - Step 23393: {'lr': 0.00047466541584445667, 'samples': 4491456, 'steps': 23392, 'loss/train': 1.4724624156951904}} 11/07/2021 00:31:15 - INFO - __main__ - Step 23393: {'lr': 0.00047466541584445667, 'samples': 4491456, 'steps': 23392, 'loss/train': 1.4724624156951904}} 11/07/2021 00:31:19 - INFO - __main__ - Step 23401: {'lr': 0.00047464679049765926, 'samples': 4492992, 'steps': 23400, 'loss/train': 1.7427978515625}04}} 11/07/2021 00:31:21 - INFO - __main__ - Step 23405: {'lr': 0.0004746374753948899, 'samples': 4493760, 'steps': 23404, 'loss/train': 1.66446852684021}04}} 11/07/2021 00:31:23 - INFO - __main__ - Step 23409: {'lr': 0.00047462815867262967, 'samples': 4494528, 'steps': 23408, 'loss/train': 1.623984456062317}}} 11/07/2021 00:31:25 - INFO - __main__ - Step 23414: {'lr': 0.00047461651049249764, 'samples': 4495488, 'steps': 23413, 'loss/train': 1.349570393562317}}} 11/07/2021 00:31:25 - INFO - __main__ - Step 23414: {'lr': 0.00047461651049249764, 'samples': 4495488, 'steps': 23413, 'loss/train': 1.349570393562317}}} 11/07/2021 00:31:29 - INFO - __main__ - Step 23421: {'lr': 0.0004746001987895755, 'samples': 4496832, 'steps': 23420, 'loss/train': 0.7658491134643555}}} 11/07/2021 00:31:31 - INFO - __main__ - Step 23425: {'lr': 0.00047459087559002355, 'samples': 4497600, 'steps': 23424, 'loss/train': 0.885067343711853}}} 11/07/2021 00:31:33 - INFO - __main__ - Step 23430: {'lr': 0.0004745792193136549, 'samples': 4498560, 'steps': 23429, 'loss/train': 1.2394837141036987}}} 11/07/2021 00:31:35 - INFO - __main__ - Step 23434: {'lr': 0.0004745698924710988, 'samples': 4499328, 'steps': 23433, 'loss/train': 1.6531399488449097}}} 11/07/2021 00:31:37 - INFO - __main__ - Step 23438: {'lr': 0.0004745605640095392, 'samples': 4500096, 'steps': 23437, 'loss/train': 1.2304730415344238}}} 11/07/2021 00:31:39 - INFO - __main__ - Step 23442: {'lr': 0.0004745512339290432, 'samples': 4500864, 'steps': 23441, 'loss/train': 1.948983073234558}}}} 11/07/2021 00:31:41 - INFO - __main__ - Step 23446: {'lr': 0.000474541902229678, 'samples': 4501632, 'steps': 23445, 'loss/train': 1.2001657485961914}}}} 11/07/2021 00:31:43 - INFO - __main__ - Step 23451: {'lr': 0.00047453023532903927, 'samples': 4502592, 'steps': 23450, 'loss/train': 1.027752161026001}}} 11/07/2021 00:31:45 - INFO - __main__ - Step 23455: {'lr': 0.00047452089998746463, 'samples': 4503360, 'steps': 23454, 'loss/train': 1.2960706949234009}} 11/07/2021 00:31:47 - INFO - __main__ - Step 23459: {'lr': 0.0004745115630272394, 'samples': 4504128, 'steps': 23458, 'loss/train': 1.5691437721252441}}} 11/07/2021 00:31:47 - INFO - __main__ - Step 23459: {'lr': 0.0004745115630272394, 'samples': 4504128, 'steps': 23458, 'loss/train': 1.5691437721252441}}} 11/07/2021 00:31:51 - INFO - __main__ - Step 23466: {'lr': 0.00047449521945217016, 'samples': 4505472, 'steps': 23465, 'loss/train': 1.0448362827301025}} 11/07/2021 00:31:53 - INFO - __main__ - Step 23471: {'lr': 0.0004744835424353344, 'samples': 4506432, 'steps': 23470, 'loss/train': 1.616047739982605}5}} 11/07/2021 00:31:56 - INFO - __main__ - Step 23475: {'lr': 0.00047447419900118067, 'samples': 4507200, 'steps': 23474, 'loss/train': 1.3570666313171387}} 11/07/2021 00:31:56 - INFO - __main__ - Step 23475: {'lr': 0.00047447419900118067, 'samples': 4507200, 'steps': 23474, 'loss/train': 1.3570666313171387}} 11/07/2021 00:31:59 - INFO - __main__ - Step 23482: {'lr': 0.00047445784409738467, 'samples': 4508544, 'steps': 23481, 'loss/train': 1.8529151678085327}} 11/07/2021 00:32:02 - INFO - __main__ - Step 23488: {'lr': 0.00047444382166405067, 'samples': 4509696, 'steps': 23487, 'loss/train': 5.782830238342285}}} 11/07/2021 00:32:02 - INFO - __main__ - Step 23488: {'lr': 0.00047444382166405067, 'samples': 4509696, 'steps': 23487, 'loss/train': 5.782830238342285}}} 11/07/2021 00:32:05 - INFO - __main__ - Step 23495: {'lr': 0.00047442745755705326, 'samples': 4511040, 'steps': 23494, 'loss/train': 1.4900996685028076}} 11/07/2021 00:32:07 - INFO - __main__ - Step 23499: {'lr': 0.00047441810441402777, 'samples': 4511808, 'steps': 23498, 'loss/train': 0.658106803894043}}} 11/07/2021 00:32:09 - INFO - __main__ - Step 23504: {'lr': 0.00047440641071006874, 'samples': 4512768, 'steps': 23503, 'loss/train': 1.6008127927780151}} 11/07/2021 00:32:09 - INFO - __main__ - Step 23504: {'lr': 0.00047440641071006874, 'samples': 4512768, 'steps': 23503, 'loss/train': 1.6008127927780151}} 11/07/2021 00:32:13 - INFO - __main__ - Step 23512: {'lr': 0.0004743876955258578, 'samples': 4514304, 'steps': 23511, 'loss/train': 1.8968209028244019}}} 11/07/2021 00:32:15 - INFO - __main__ - Step 23516: {'lr': 0.00047437833550718336, 'samples': 4515072, 'steps': 23515, 'loss/train': 1.3097409009933472}} 11/07/2021 00:32:17 - INFO - __main__ - Step 23520: {'lr': 0.0004743689738708863, 'samples': 4515840, 'steps': 23519, 'loss/train': 0.6383938193321228}}} 11/07/2021 00:32:19 - INFO - __main__ - Step 23525: {'lr': 0.00047435726955083593, 'samples': 4516800, 'steps': 23524, 'loss/train': 1.865501880645752}}} 11/07/2021 00:32:22 - INFO - __main__ - Step 23530: {'lr': 0.0004743455627034875, 'samples': 4517760, 'steps': 23529, 'loss/train': 1.602501392364502}}}} 11/07/2021 00:32:22 - INFO - __main__ - Step 23530: {'lr': 0.0004743455627034875, 'samples': 4517760, 'steps': 23529, 'loss/train': 1.602501392364502}}}} 11/07/2021 00:32:25 - INFO - __main__ - Step 23537: {'lr': 0.00047432916887158995, 'samples': 4519104, 'steps': 23536, 'loss/train': 1.1145853996276855}} 11/07/2021 00:32:27 - INFO - __main__ - Step 23541: {'lr': 0.00047431979874388154, 'samples': 4519872, 'steps': 23540, 'loss/train': 1.2911465167999268}} 11/07/2021 00:32:29 - INFO - __main__ - Step 23545: {'lr': 0.00047431042699897245, 'samples': 4520640, 'steps': 23544, 'loss/train': 1.5103429555892944}} 11/07/2021 00:32:32 - INFO - __main__ - Step 23550: {'lr': 0.0004742987100437507, 'samples': 4521600, 'steps': 23549, 'loss/train': 1.565748691558838}4}} 11/07/2021 00:32:34 - INFO - __main__ - Step 23555: {'lr': 0.00047428699056189047, 'samples': 4522560, 'steps': 23554, 'loss/train': 1.76802396774292}4}} 11/07/2021 00:32:34 - INFO - __main__ - Step 23555: {'lr': 0.00047428699056189047, 'samples': 4522560, 'steps': 23554, 'loss/train': 1.76802396774292}4}} 11/07/2021 00:32:37 - INFO - __main__ - Step 23562: {'lr': 0.0004742705790427849, 'samples': 4523904, 'steps': 23561, 'loss/train': 1.4885367155075073}}} 11/07/2021 00:32:39 - INFO - __main__ - Step 23566: {'lr': 0.00047426119880868123, 'samples': 4524672, 'steps': 23565, 'loss/train': 1.8396615982055664}} 11/07/2021 00:32:42 - INFO - __main__ - Step 23571: {'lr': 0.0004742494712424653, 'samples': 4525632, 'steps': 23570, 'loss/train': 1.6598799228668213}}} 11/07/2021 00:32:44 - INFO - __main__ - Step 23576: {'lr': 0.0004742377411501656, 'samples': 4526592, 'steps': 23575, 'loss/train': 1.6603103876113892}}} 11/07/2021 00:32:44 - INFO - __main__ - Step 23576: {'lr': 0.0004742377411501656, 'samples': 4526592, 'steps': 23575, 'loss/train': 1.6603103876113892}}} 11/07/2021 00:32:48 - INFO - __main__ - Step 23583: {'lr': 0.00047422131477737684, 'samples': 4527936, 'steps': 23582, 'loss/train': 1.6390395164489746}} 11/07/2021 00:32:49 - INFO - __main__ - Step 23587: {'lr': 0.0004742119260559424, 'samples': 4528704, 'steps': 23586, 'loss/train': 2.1219773292541504}}} 11/07/2021 00:32:51 - INFO - __main__ - Step 23591: {'lr': 0.0004742025357180852, 'samples': 4529472, 'steps': 23590, 'loss/train': 1.342724084854126}}}} 11/07/2021 00:32:53 - INFO - __main__ - Step 23595: {'lr': 0.0004741931437638727, 'samples': 4530240, 'steps': 23594, 'loss/train': 1.5737087726593018}}} 11/07/2021 00:32:53 - INFO - __main__ - Step 23595: {'lr': 0.0004741931437638727, 'samples': 4530240, 'steps': 23594, 'loss/train': 1.5737087726593018}}} 11/07/2021 00:32:57 - INFO - __main__ - Step 23603: {'lr': 0.0004741743550066527, 'samples': 4531776, 'steps': 23602, 'loss/train': 1.2754849195480347}}} 11/07/2021 00:32:59 - INFO - __main__ - Step 23607: {'lr': 0.0004741649582037808, 'samples': 4532544, 'steps': 23606, 'loss/train': 1.8504846096038818}}} 11/07/2021 00:33:01 - INFO - __main__ - Step 23612: {'lr': 0.00047415320992758025, 'samples': 4533504, 'steps': 23611, 'loss/train': 1.6855032444000244}} 11/07/2021 00:33:01 - INFO - __main__ - Step 23612: {'lr': 0.00047415320992758025, 'samples': 4533504, 'steps': 23611, 'loss/train': 1.6855032444000244}} 11/07/2021 00:33:06 - INFO - __main__ - Step 23620: {'lr': 0.0004741344074337155, 'samples': 4535040, 'steps': 23619, 'loss/train': 1.7315502166748047}}} 11/07/2021 00:33:08 - INFO - __main__ - Step 23624: {'lr': 0.0004741250037629531, 'samples': 4535808, 'steps': 23623, 'loss/train': 1.7665811777114868}}} 11/07/2021 00:33:09 - INFO - __main__ - Step 23628: {'lr': 0.00047411559847639447, 'samples': 4536576, 'steps': 23627, 'loss/train': 2.0532431602478027}} 11/07/2021 00:33:11 - INFO - __main__ - Step 23632: {'lr': 0.0004741061915741073, 'samples': 4537344, 'steps': 23631, 'loss/train': 1.8703787326812744}}} 11/07/2021 00:33:14 - INFO - __main__ - Step 23637: {'lr': 0.0004740944306742335, 'samples': 4538304, 'steps': 23636, 'loss/train': 1.346545934677124}}}} 11/07/2021 00:33:14 - INFO - __main__ - Step 23637: {'lr': 0.0004740944306742335, 'samples': 4538304, 'steps': 23636, 'loss/train': 1.346545934677124}}}} 11/07/2021 00:33:18 - INFO - __main__ - Step 23645: {'lr': 0.00047407560798386894, 'samples': 4539840, 'steps': 23644, 'loss/train': 1.8145923614501953}} 11/07/2021 00:33:19 - INFO - __main__ - Step 23649: {'lr': 0.00047406619421549247, 'samples': 4540608, 'steps': 23648, 'loss/train': 1.5111690759658813}} 11/07/2021 00:33:21 - INFO - __main__ - Step 23653: {'lr': 0.0004740567788317437, 'samples': 4541376, 'steps': 23652, 'loss/train': 0.5793235301971436}}} 11/07/2021 00:33:24 - INFO - __main__ - Step 23658: {'lr': 0.0004740450073305438, 'samples': 4542336, 'steps': 23657, 'loss/train': 0.7227396368980408}}} 11/07/2021 00:33:24 - INFO - __main__ - Step 23658: {'lr': 0.0004740450073305438, 'samples': 4542336, 'steps': 23657, 'loss/train': 0.7227396368980408}}} 11/07/2021 00:33:27 - INFO - __main__ - Step 23665: {'lr': 0.0004740285229889423, 'samples': 4543680, 'steps': 23664, 'loss/train': 1.558956503868103}}}} 11/07/2021 00:33:30 - INFO - __main__ - Step 23669: {'lr': 0.00047401910114438313, 'samples': 4544448, 'steps': 23668, 'loss/train': 1.3126306533813477}} 11/07/2021 00:33:32 - INFO - __main__ - Step 23674: {'lr': 0.0004740073215675523, 'samples': 4545408, 'steps': 23673, 'loss/train': 2.9824087619781494}}} 11/07/2021 00:33:34 - INFO - __main__ - Step 23678: {'lr': 0.0004739978960892649, 'samples': 4546176, 'steps': 23677, 'loss/train': 1.305275797843933}}}} 11/07/2021 00:33:36 - INFO - __main__ - Step 23682: {'lr': 0.00047398846899609755, 'samples': 4546944, 'steps': 23681, 'loss/train': 1.5400047302246094}} 11/07/2021 00:33:38 - INFO - __main__ - Step 23686: {'lr': 0.00047397904028811824, 'samples': 4547712, 'steps': 23685, 'loss/train': 0.9328953623771667}} 11/07/2021 00:33:40 - INFO - __main__ - Step 23691: {'lr': 0.00047396725213241835, 'samples': 4548672, 'steps': 23690, 'loss/train': 1.7212902307510376}} 11/07/2021 00:33:40 - INFO - __main__ - Step 23691: {'lr': 0.00047396725213241835, 'samples': 4548672, 'steps': 23690, 'loss/train': 1.7212902307510376}} 11/07/2021 00:33:43 - INFO - __main__ - Step 23697: {'lr': 0.000473953103015356, 'samples': 4549824, 'steps': 23696, 'loss/train': 1.1808024644851685}6}} 11/07/2021 00:33:45 - INFO - __main__ - Step 23701: {'lr': 0.0004739436682524373, 'samples': 4550592, 'steps': 23700, 'loss/train': 1.2520087957382202}}} 11/07/2021 00:33:48 - INFO - __main__ - Step 23706: {'lr': 0.00047393187252842183, 'samples': 4551552, 'steps': 23705, 'loss/train': 1.6185001134872437}} 11/07/2021 00:33:50 - INFO - __main__ - Step 23710: {'lr': 0.0004739224341329987, 'samples': 4552320, 'steps': 23709, 'loss/train': 1.2958784103393555}}} 11/07/2021 00:33:52 - INFO - __main__ - Step 23714: {'lr': 0.0004739129941232396, 'samples': 4553088, 'steps': 23713, 'loss/train': 1.1649988889694214}}} 11/07/2021 00:33:54 - INFO - __main__ - Step 23718: {'lr': 0.0004739035524992127, 'samples': 4553856, 'steps': 23717, 'loss/train': 1.6495895385742188}}} 11/07/2021 00:33:56 - INFO - __main__ - Step 23722: {'lr': 0.000473894109260986, 'samples': 4554624, 'steps': 23721, 'loss/train': 1.5855025053024292}}}} 11/07/2021 00:33:58 - INFO - __main__ - Step 23726: {'lr': 0.00047388466440862755, 'samples': 4555392, 'steps': 23725, 'loss/train': 1.599613070487976}}} 11/07/2021 00:34:00 - INFO - __main__ - Step 23731: {'lr': 0.00047387285607341064, 'samples': 4556352, 'steps': 23730, 'loss/train': 1.5931893587112427}} 11/07/2021 00:34:02 - INFO - __main__ - Step 23735: {'lr': 0.00047386340758950494, 'samples': 4557120, 'steps': 23734, 'loss/train': 1.7338857650756836}} 11/07/2021 00:34:04 - INFO - __main__ - Step 23739: {'lr': 0.00047385395749168885, 'samples': 4557888, 'steps': 23738, 'loss/train': 0.8991847038269043}} 11/07/2021 00:34:05 - INFO - __main__ - Step 23743: {'lr': 0.00047384450578003055, 'samples': 4558656, 'steps': 23742, 'loss/train': 1.8437845706939697}} 11/07/2021 00:34:08 - INFO - __main__ - Step 23747: {'lr': 0.0004738350524545982, 'samples': 4559424, 'steps': 23746, 'loss/train': 1.2148529291152954}}} 11/07/2021 00:34:10 - INFO - __main__ - Step 23752: {'lr': 0.0004738232335285417, 'samples': 4560384, 'steps': 23751, 'loss/train': 1.4667108058929443}}} 11/07/2021 00:34:10 - INFO - __main__ - Step 23752: {'lr': 0.0004738232335285417, 'samples': 4560384, 'steps': 23751, 'loss/train': 1.4667108058929443}}} 11/07/2021 00:34:14 - INFO - __main__ - Step 23759: {'lr': 0.00047380668279633814, 'samples': 4561728, 'steps': 23758, 'loss/train': 1.7122327089309692}} 11/07/2021 00:34:16 - INFO - __main__ - Step 23763: {'lr': 0.0004737972230164911, 'samples': 4562496, 'steps': 23762, 'loss/train': 1.7650277614593506}}} 11/07/2021 00:34:18 - INFO - __main__ - Step 23768: {'lr': 0.0004737853960227998, 'samples': 4563456, 'steps': 23767, 'loss/train': 1.617723822593689}}}} 11/07/2021 00:34:21 - INFO - __main__ - Step 23773: {'lr': 0.00047377356650825245, 'samples': 4564416, 'steps': 23772, 'loss/train': 1.3861149549484253}} 11/07/2021 00:34:23 - INFO - __main__ - Step 23777: {'lr': 0.00047376410108168756, 'samples': 4565184, 'steps': 23776, 'loss/train': 1.6561030149459839}} 11/07/2021 00:34:25 - INFO - __main__ - Step 23781: {'lr': 0.0004737546340419283, 'samples': 4565952, 'steps': 23780, 'loss/train': 1.6509655714035034}}} 11/07/2021 00:34:25 - INFO - __main__ - Step 23781: {'lr': 0.0004737546340419283, 'samples': 4565952, 'steps': 23780, 'loss/train': 1.6509655714035034}}} 11/07/2021 00:34:28 - INFO - __main__ - Step 23788: {'lr': 0.0004737380628408059, 'samples': 4567296, 'steps': 23787, 'loss/train': 1.7185180187225342}}} 11/07/2021 00:34:31 - INFO - __main__ - Step 23793: {'lr': 0.0004737262232441667, 'samples': 4568256, 'steps': 23792, 'loss/train': 0.1296090930700302}}} 11/07/2021 00:34:31 - INFO - __main__ - Step 23793: {'lr': 0.0004737262232441667, 'samples': 4568256, 'steps': 23792, 'loss/train': 0.1296090930700302}}} 11/07/2021 00:34:34 - INFO - __main__ - Step 23800: {'lr': 0.00047370964357498313, 'samples': 4569600, 'steps': 23799, 'loss/train': 1.482106328010559}}} 11/07/2021 00:34:36 - INFO - __main__ - Step 23804: {'lr': 0.00047370016726068086, 'samples': 4570368, 'steps': 23803, 'loss/train': 2.384243965148926}}} 11/07/2021 00:34:38 - INFO - __main__ - Step 23809: {'lr': 0.00047368831959990453, 'samples': 4571328, 'steps': 23808, 'loss/train': 1.5713011026382446}} 11/07/2021 00:34:38 - INFO - __main__ - Step 23809: {'lr': 0.00047368831959990453, 'samples': 4571328, 'steps': 23808, 'loss/train': 1.5713011026382446}} 11/07/2021 00:34:43 - INFO - __main__ - Step 23817: {'lr': 0.0004736693581016117, 'samples': 4572864, 'steps': 23816, 'loss/train': 1.0271397829055786}}} 11/07/2021 00:34:44 - INFO - __main__ - Step 23821: {'lr': 0.000473659874933664, 'samples': 4573632, 'steps': 23820, 'loss/train': 2.0048911571502686}}}} 11/07/2021 00:34:46 - INFO - __main__ - Step 23825: {'lr': 0.0004736503901532734, 'samples': 4574400, 'steps': 23824, 'loss/train': 2.036221504211426}}}} 11/07/2021 00:34:49 - INFO - __main__ - Step 23830: {'lr': 0.0004736385319103912, 'samples': 4575360, 'steps': 23829, 'loss/train': 0.9364742636680603}}} 11/07/2021 00:34:51 - INFO - __main__ - Step 23834: {'lr': 0.00047362904350225376, 'samples': 4576128, 'steps': 23833, 'loss/train': 0.8249647617340088}} 11/07/2021 00:34:51 - INFO - __main__ - Step 23834: {'lr': 0.00047362904350225376, 'samples': 4576128, 'steps': 23833, 'loss/train': 0.8249647617340088}} 11/07/2021 00:34:54 - INFO - __main__ - Step 23841: {'lr': 0.00047361243490864826, 'samples': 4577472, 'steps': 23840, 'loss/train': 0.14377540349960327} 11/07/2021 00:34:54 - INFO - __main__ - Step 23841: {'lr': 0.00047361243490864826, 'samples': 4577472, 'steps': 23840, 'loss/train': 0.14377540349960327} 11/07/2021 00:34:58 - INFO - __main__ - Step 23849: {'lr': 0.0004735934476134561, 'samples': 4579008, 'steps': 23848, 'loss/train': 2.162973165512085}27} 11/07/2021 00:35:00 - INFO - __main__ - Step 23853: {'lr': 0.0004735839515478796, 'samples': 4579776, 'steps': 23852, 'loss/train': 1.931373119354248}27} 11/07/2021 00:35:02 - INFO - __main__ - Step 23857: {'lr': 0.00047357445387040745, 'samples': 4580544, 'steps': 23856, 'loss/train': 1.524977445602417}7} 11/07/2021 00:35:04 - INFO - __main__ - Step 23861: {'lr': 0.00047356495458110806, 'samples': 4581312, 'steps': 23860, 'loss/train': 1.451634168624878}7} 11/07/2021 00:35:06 - INFO - __main__ - Step 23866: {'lr': 0.00047355307820295625, 'samples': 4582272, 'steps': 23865, 'loss/train': 1.762752890586853}7} 11/07/2021 00:35:08 - INFO - __main__ - Step 23870: {'lr': 0.0004735435752872962, 'samples': 4583040, 'steps': 23869, 'loss/train': 1.8624155521392822}7} 11/07/2021 00:35:11 - INFO - __main__ - Step 23875: {'lr': 0.0004735316943764102, 'samples': 4584000, 'steps': 23874, 'loss/train': 1.478752851486206}}7} 11/07/2021 00:35:11 - INFO - __main__ - Step 23875: {'lr': 0.0004735316943764102, 'samples': 4584000, 'steps': 23874, 'loss/train': 1.478752851486206}}7} 11/07/2021 00:35:14 - INFO - __main__ - Step 23882: {'lr': 0.00047351505687096257, 'samples': 4585344, 'steps': 23881, 'loss/train': 1.5504294633865356}} 11/07/2021 00:35:16 - INFO - __main__ - Step 23887: {'lr': 0.0004735031699171055, 'samples': 4586304, 'steps': 23886, 'loss/train': 1.0130150318145752}}} 11/07/2021 00:35:19 - INFO - __main__ - Step 23892: {'lr': 0.00047349128044557153, 'samples': 4587264, 'steps': 23891, 'loss/train': 1.682618498802185}}} 11/07/2021 00:35:21 - INFO - __main__ - Step 23896: {'lr': 0.0004734817670557069, 'samples': 4588032, 'steps': 23895, 'loss/train': 1.325243353843689}}}} 11/07/2021 00:35:23 - INFO - __main__ - Step 23900: {'lr': 0.00047347225205468323, 'samples': 4588800, 'steps': 23899, 'loss/train': 1.0516488552093506}} 11/07/2021 00:35:23 - INFO - __main__ - Step 23900: {'lr': 0.00047347225205468323, 'samples': 4588800, 'steps': 23899, 'loss/train': 1.0516488552093506}} 11/07/2021 00:35:26 - INFO - __main__ - Step 23907: {'lr': 0.000473455596926247, 'samples': 4590144, 'steps': 23906, 'loss/train': 1.787940502166748}06}} 11/07/2021 00:35:29 - INFO - __main__ - Step 23911: {'lr': 0.00047344607749489, 'samples': 4590912, 'steps': 23910, 'loss/train': 1.5765531063079834}06}} 11/07/2021 00:35:31 - INFO - __main__ - Step 23915: {'lr': 0.0004734365564526313, 'samples': 4591680, 'steps': 23914, 'loss/train': 1.8650660514831543}}} 11/07/2021 00:35:32 - INFO - __main__ - Step 23919: {'lr': 0.0004734270337995395, 'samples': 4592448, 'steps': 23918, 'loss/train': 1.1809464693069458}}} 11/07/2021 00:35:32 - INFO - __main__ - Step 23919: {'lr': 0.0004734270337995395, 'samples': 4592448, 'steps': 23918, 'loss/train': 1.1809464693069458}}} 11/07/2021 00:35:36 - INFO - __main__ - Step 23924: {'lr': 0.0004734151282180454, 'samples': 4593408, 'steps': 23923, 'loss/train': 1.8191454410552979}}} 11/07/2021 00:35:39 - INFO - __main__ - Step 23930: {'lr': 0.0004734008381982399, 'samples': 4594560, 'steps': 23929, 'loss/train': 1.452463150024414}}}} 11/07/2021 00:35:41 - INFO - __main__ - Step 23934: {'lr': 0.0004733913095051358, 'samples': 4595328, 'steps': 23933, 'loss/train': 1.8170708417892456}}} 11/07/2021 00:35:43 - INFO - __main__ - Step 23938: {'lr': 0.0004733817792015249, 'samples': 4596096, 'steps': 23937, 'loss/train': 1.0775352716445923}}} 11/07/2021 00:35:44 - INFO - __main__ - Step 23942: {'lr': 0.0004733722472874759, 'samples': 4596864, 'steps': 23941, 'loss/train': 1.8950859308242798}}} 11/07/2021 00:35:47 - INFO - __main__ - Step 23946: {'lr': 0.0004733627137630574, 'samples': 4597632, 'steps': 23945, 'loss/train': 2.1152851581573486}}} 11/07/2021 00:35:47 - INFO - __main__ - Step 23946: {'lr': 0.0004733627137630574, 'samples': 4597632, 'steps': 23945, 'loss/train': 2.1152851581573486}}} 11/07/2021 00:35:51 - INFO - __main__ - Step 23954: {'lr': 0.00047334364188338725, 'samples': 4599168, 'steps': 23953, 'loss/train': 1.7158654928207397}} 11/07/2021 00:35:53 - INFO - __main__ - Step 23958: {'lr': 0.000473334103528273, 'samples': 4599936, 'steps': 23957, 'loss/train': 1.7731554508209229}7}} 11/07/2021 00:35:54 - INFO - __main__ - Step 23962: {'lr': 0.0004733245635630644, 'samples': 4600704, 'steps': 23961, 'loss/train': 1.7841717004776}9}7}} 11/07/2021 00:35:56 - INFO - __main__ - Step 23966: {'lr': 0.0004733150219878301, 'samples': 4601472, 'steps': 23965, 'loss/train': 1.1554112434387207}}} 11/07/2021 00:35:59 - INFO - __main__ - Step 23972: {'lr': 0.00047330070660633113, 'samples': 4602624, 'steps': 23971, 'loss/train': 1.6697735786437988}} 11/07/2021 00:35:59 - INFO - __main__ - Step 23972: {'lr': 0.00047330070660633113, 'samples': 4602624, 'steps': 23971, 'loss/train': 1.6697735786437988}} 11/07/2021 00:36:03 - INFO - __main__ - Step 23979: {'lr': 0.00047328400074991064, 'samples': 4603968, 'steps': 23978, 'loss/train': 1.5901334285736084}} 11/07/2021 00:36:04 - INFO - __main__ - Step 23983: {'lr': 0.00047327445233283496, 'samples': 4604736, 'steps': 23982, 'loss/train': 1.675779938697815}}} 11/07/2021 00:36:06 - INFO - __main__ - Step 23987: {'lr': 0.00047326490230609495, 'samples': 4605504, 'steps': 23986, 'loss/train': 1.521935224533081}}} 11/07/2021 00:36:09 - INFO - __main__ - Step 23992: {'lr': 0.0004732529625091843, 'samples': 4606464, 'steps': 23991, 'loss/train': 0.9550225734710693}}} 11/07/2021 00:36:11 - INFO - __main__ - Step 23996: {'lr': 0.0004732434088609512, 'samples': 4607232, 'steps': 23995, 'loss/train': 1.5826812982559204}}} 11/07/2021 00:36:11 - INFO - __main__ - Step 23996: {'lr': 0.0004732434088609512, 'samples': 4607232, 'steps': 23995, 'loss/train': 1.5826812982559204}}} 11/07/2021 00:36:14 - INFO - __main__ - Step 24003: {'lr': 0.0004732266861038684, 'samples': 4608576, 'steps': 24002, 'loss/train': 1.3503319025039673}}} 11/07/2021 00:36:17 - INFO - __main__ - Step 24008: {'lr': 0.0004732147382598842, 'samples': 4609536, 'steps': 24007, 'loss/train': 1.3278956413269043}}} 11/07/2021 00:36:19 - INFO - __main__ - Step 24013: {'lr': 0.00047320278790147197, 'samples': 4610496, 'steps': 24012, 'loss/train': 1.3343220949172974}} 11/07/2021 00:36:21 - INFO - __main__ - Step 24017: {'lr': 0.0004731932258044446, 'samples': 4611264, 'steps': 24016, 'loss/train': 1.6866132020950317}}} 11/07/2021 00:36:23 - INFO - __main__ - Step 24021: {'lr': 0.0004731836620983384, 'samples': 4612032, 'steps': 24020, 'loss/train': 1.8780555725097656}}} 11/07/2021 00:36:23 - INFO - __main__ - Step 24021: {'lr': 0.0004731836620983384, 'samples': 4612032, 'steps': 24020, 'loss/train': 1.8780555725097656}}} 11/07/2021 00:36:26 - INFO - __main__ - Step 24028: {'lr': 0.0004731669217410142, 'samples': 4613376, 'steps': 24027, 'loss/train': 1.6594531536102295}}} 11/07/2021 00:36:29 - INFO - __main__ - Step 24033: {'lr': 0.0004731549613262368, 'samples': 4614336, 'steps': 24032, 'loss/train': 1.5604989528656006}}} 11/07/2021 00:36:31 - INFO - __main__ - Step 24037: {'lr': 0.00047314539118450516, 'samples': 4615104, 'steps': 24036, 'loss/train': 1.7597320079803467}} 11/07/2021 00:36:33 - INFO - __main__ - Step 24041: {'lr': 0.00047313581943403963, 'samples': 4615872, 'steps': 24040, 'loss/train': 1.5378525257110596}} 11/07/2021 00:36:34 - INFO - __main__ - Step 24045: {'lr': 0.00047312624607490913, 'samples': 4616640, 'steps': 24044, 'loss/train': 1.356187105178833}}} 11/07/2021 00:36:36 - INFO - __main__ - Step 24049: {'lr': 0.0004731166711071827, 'samples': 4617408, 'steps': 24048, 'loss/train': 1.6041375398635864}}} 11/07/2021 00:36:39 - INFO - __main__ - Step 24054: {'lr': 0.00047310470013554195, 'samples': 4618368, 'steps': 24053, 'loss/train': 1.6378589868545532}} 11/07/2021 00:36:41 - INFO - __main__ - Step 24059: {'lr': 0.0004730927266507128, 'samples': 4619328, 'steps': 24058, 'loss/train': 1.7581125497817993}}} 11/07/2021 00:36:43 - INFO - __main__ - Step 24063: {'lr': 0.00047308314605344447, 'samples': 4620096, 'steps': 24062, 'loss/train': 1.4223600625991821}} 11/07/2021 00:36:43 - INFO - __main__ - Step 24063: {'lr': 0.00047308314605344447, 'samples': 4620096, 'steps': 24062, 'loss/train': 1.4223600625991821}} 11/07/2021 00:36:46 - INFO - __main__ - Step 24070: {'lr': 0.00047306637613833024, 'samples': 4621440, 'steps': 24069, 'loss/train': 1.380083441734314}}} 11/07/2021 00:36:49 - INFO - __main__ - Step 24075: {'lr': 0.00047305439461220477, 'samples': 4622400, 'steps': 24074, 'loss/train': 1.5761562585830688}} 11/07/2021 00:36:49 - INFO - __main__ - Step 24075: {'lr': 0.00047305439461220477, 'samples': 4622400, 'steps': 24074, 'loss/train': 1.5761562585830688}} 11/07/2021 00:36:53 - INFO - __main__ - Step 24083: {'lr': 0.00047303521894420707, 'samples': 4623936, 'steps': 24082, 'loss/train': 1.7466765642166138}} 11/07/2021 00:36:53 - INFO - __main__ - Step 24083: {'lr': 0.00047303521894420707, 'samples': 4623936, 'steps': 24082, 'loss/train': 1.7466765642166138}} 11/07/2021 00:36:57 - INFO - __main__ - Step 24090: {'lr': 0.0004730184349586382, 'samples': 4625280, 'steps': 24089, 'loss/train': 0.1390826255083084}}} 11/07/2021 00:36:59 - INFO - __main__ - Step 24095: {'lr': 0.00047300644338283597, 'samples': 4626240, 'steps': 24094, 'loss/train': 1.0832172632217407}} 11/07/2021 00:37:01 - INFO - __main__ - Step 24100: {'lr': 0.0004729944492949523, 'samples': 4627200, 'steps': 24099, 'loss/train': 1.3924369812011719}}} 11/07/2021 00:37:03 - INFO - __main__ - Step 24104: {'lr': 0.00047298485221603735, 'samples': 4627968, 'steps': 24103, 'loss/train': 1.4751757383346558}} 11/07/2021 00:37:03 - INFO - __main__ - Step 24104: {'lr': 0.00047298485221603735, 'samples': 4627968, 'steps': 24103, 'loss/train': 1.4751757383346558}} 11/07/2021 00:37:07 - INFO - __main__ - Step 24111: {'lr': 0.0004729680534597468, 'samples': 4629312, 'steps': 24110, 'loss/train': 1.310340166091919}8}} 11/07/2021 00:37:09 - INFO - __main__ - Step 24115: {'lr': 0.000472958451960163, 'samples': 4630080, 'steps': 24114, 'loss/train': 1.6103174686431885}8}} 11/07/2021 00:37:11 - INFO - __main__ - Step 24120: {'lr': 0.00047294644782530437, 'samples': 4631040, 'steps': 24119, 'loss/train': 1.7983100414276123}} 11/07/2021 00:37:11 - INFO - __main__ - Step 24120: {'lr': 0.00047294644782530437, 'samples': 4631040, 'steps': 24119, 'loss/train': 1.7983100414276123}} 11/07/2021 00:37:15 - INFO - __main__ - Step 24128: {'lr': 0.00047292723598586295, 'samples': 4632576, 'steps': 24127, 'loss/train': 1.7393834590911865}} 11/07/2021 00:37:17 - INFO - __main__ - Step 24132: {'lr': 0.0004729176276553659, 'samples': 4633344, 'steps': 24131, 'loss/train': 1.6785240173339844}}} 11/07/2021 00:37:19 - INFO - __main__ - Step 24136: {'lr': 0.0004729080177177769, 'samples': 4634112, 'steps': 24135, 'loss/train': 1.8441475629806519}}} 11/07/2021 00:37:21 - INFO - __main__ - Step 24141: {'lr': 0.00047289600303592334, 'samples': 4635072, 'steps': 24140, 'loss/train': 1.7222843170166016}} 11/07/2021 00:37:21 - INFO - __main__ - Step 24141: {'lr': 0.00047289600303592334, 'samples': 4635072, 'steps': 24140, 'loss/train': 1.7222843170166016}} 11/07/2021 00:37:24 - INFO - __main__ - Step 24147: {'lr': 0.0004728815821034055, 'samples': 4636224, 'steps': 24146, 'loss/train': 1.7480651140213013}}} 11/07/2021 00:37:27 - INFO - __main__ - Step 24152: {'lr': 0.00047286956189788803, 'samples': 4637184, 'steps': 24151, 'loss/train': 1.6570688486099243}} 11/07/2021 00:37:29 - INFO - __main__ - Step 24157: {'lr': 0.00047285753918183105, 'samples': 4638144, 'steps': 24156, 'loss/train': 1.6647619009017944}} 11/07/2021 00:37:29 - INFO - __main__ - Step 24157: {'lr': 0.00047285753918183105, 'samples': 4638144, 'steps': 24156, 'loss/train': 1.6647619009017944}} 11/07/2021 00:37:32 - INFO - __main__ - Step 24163: {'lr': 0.00047284310860884097, 'samples': 4639296, 'steps': 24162, 'loss/train': 1.4184315204620361}} 11/07/2021 00:37:35 - INFO - __main__ - Step 24168: {'lr': 0.0004728310803700735, 'samples': 4640256, 'steps': 24167, 'loss/train': 1.5293444395065308}}} 11/07/2021 00:37:37 - INFO - __main__ - Step 24172: {'lr': 0.0004728214559717766, 'samples': 4641024, 'steps': 24171, 'loss/train': 0.9853553771972656}}} 11/07/2021 00:37:39 - INFO - __main__ - Step 24176: {'lr': 0.0004728118299670812, 'samples': 4641792, 'steps': 24175, 'loss/train': 1.3534637689590454}}} 11/07/2021 00:37:41 - INFO - __main__ - Step 24180: {'lr': 0.00047280220235605653, 'samples': 4642560, 'steps': 24179, 'loss/train': 1.5663083791732788}} 11/07/2021 00:37:43 - INFO - __main__ - Step 24184: {'lr': 0.00047279257313877216, 'samples': 4643328, 'steps': 24183, 'loss/train': 1.3279659748077393}} 11/07/2021 00:37:45 - INFO - __main__ - Step 24188: {'lr': 0.00047278294231529745, 'samples': 4644096, 'steps': 24187, 'loss/train': 1.364418387413025}}} 11/07/2021 00:37:47 - INFO - __main__ - Step 24192: {'lr': 0.0004727733098857019, 'samples': 4644864, 'steps': 24191, 'loss/train': 1.4970722198486328}}} 11/07/2021 00:37:49 - INFO - __main__ - Step 24196: {'lr': 0.0004727636758500548, 'samples': 4645632, 'steps': 24195, 'loss/train': 2.057713031768799}}}} 11/07/2021 00:37:51 - INFO - __main__ - Step 24200: {'lr': 0.0004727540402084258, 'samples': 4646400, 'steps': 24199, 'loss/train': 1.6688822507858276}}} 11/07/2021 00:37:53 - INFO - __main__ - Step 24204: {'lr': 0.0004727444029608842, 'samples': 4647168, 'steps': 24203, 'loss/train': 1.4829760789871216}}} 11/07/2021 00:37:55 - INFO - __main__ - Step 24209: {'lr': 0.0004727323541432486, 'samples': 4648128, 'steps': 24208, 'loss/train': 1.4723137617111206}}} 11/07/2021 00:37:58 - INFO - __main__ - Step 24213: {'lr': 0.0004727227132826579, 'samples': 4648896, 'steps': 24212, 'loss/train': 1.7764638662338257}}} 11/07/2021 00:37:58 - INFO - __main__ - Step 24213: {'lr': 0.0004727227132826579, 'samples': 4648896, 'steps': 24212, 'loss/train': 1.7764638662338257}}} 11/07/2021 00:38:01 - INFO - __main__ - Step 24220: {'lr': 0.0004727058379129824, 'samples': 4650240, 'steps': 24219, 'loss/train': 1.6307088136672974}}} 11/07/2021 00:38:03 - INFO - __main__ - Step 24224: {'lr': 0.00047269619263692056, 'samples': 4651008, 'steps': 24223, 'loss/train': 1.6635693311691284}} 11/07/2021 00:38:05 - INFO - __main__ - Step 24229: {'lr': 0.0004726841337841234, 'samples': 4651968, 'steps': 24228, 'loss/train': 1.3337944746017456}}} 11/07/2021 00:38:07 - INFO - __main__ - Step 24233: {'lr': 0.00047267448489579455, 'samples': 4652736, 'steps': 24232, 'loss/train': 1.5871590375900269}} 11/07/2021 00:38:09 - INFO - __main__ - Step 24237: {'lr': 0.0004726648344021267, 'samples': 4653504, 'steps': 24236, 'loss/train': 1.330606460571289}9}} 11/07/2021 00:38:12 - INFO - __main__ - Step 24241: {'lr': 0.0004726551823031894, 'samples': 4654272, 'steps': 24240, 'loss/train': 1.6037311553955078}}} 11/07/2021 00:38:13 - INFO - __main__ - Step 24245: {'lr': 0.0004726455285990523, 'samples': 4655040, 'steps': 24244, 'loss/train': 1.6974471807479858}}} 11/07/2021 00:38:15 - INFO - __main__ - Step 24249: {'lr': 0.00047263587328978495, 'samples': 4655808, 'steps': 24248, 'loss/train': 0.9708447456359863}} 11/07/2021 00:38:18 - INFO - __main__ - Step 24254: {'lr': 0.00047262380189609253, 'samples': 4656768, 'steps': 24253, 'loss/train': 1.8602099418640137}} 11/07/2021 00:38:20 - INFO - __main__ - Step 24258: {'lr': 0.0004726141429755367, 'samples': 4657536, 'steps': 24257, 'loss/train': 0.9711540937423706}}} 11/07/2021 00:38:22 - INFO - __main__ - Step 24262: {'lr': 0.0004726044824500769, 'samples': 4658304, 'steps': 24261, 'loss/train': 1.469969391822815}}}} 11/07/2021 00:38:23 - INFO - __main__ - Step 24266: {'lr': 0.0004725948203197828, 'samples': 4659072, 'steps': 24265, 'loss/train': 1.1550707817077637}}} 11/07/2021 00:38:25 - INFO - __main__ - Step 24270: {'lr': 0.000472585156584724, 'samples': 4659840, 'steps': 24269, 'loss/train': 1.5329806804656982}}}} 11/07/2021 00:38:25 - INFO - __main__ - Step 24270: {'lr': 0.000472585156584724, 'samples': 4659840, 'steps': 24269, 'loss/train': 1.5329806804656982}}}} 11/07/2021 00:38:29 - INFO - __main__ - Step 24276: {'lr': 0.0004725706579733546, 'samples': 4660992, 'steps': 24275, 'loss/train': 1.0492531061172485}}} 11/07/2021 00:38:32 - INFO - __main__ - Step 24281: {'lr': 0.00047255857303931347, 'samples': 4661952, 'steps': 24280, 'loss/train': 1.8590461015701294}} 11/07/2021 00:38:34 - INFO - __main__ - Step 24285: {'lr': 0.0004725489032870079, 'samples': 4662720, 'steps': 24284, 'loss/train': 1.6883810758590698}}} 11/07/2021 00:38:36 - INFO - __main__ - Step 24289: {'lr': 0.0004725392319302686, 'samples': 4663488, 'steps': 24288, 'loss/train': 1.825372576713562}}}} 11/07/2021 00:38:37 - INFO - __main__ - Step 24293: {'lr': 0.00047252955896916546, 'samples': 4664256, 'steps': 24292, 'loss/train': 1.183845043182373}}} 11/07/2021 00:38:39 - INFO - __main__ - Step 24297: {'lr': 0.0004725198844037681, 'samples': 4665024, 'steps': 24296, 'loss/train': 1.612744927406311}}}} 11/07/2021 00:38:42 - INFO - __main__ - Step 24302: {'lr': 0.00047250778894108905, 'samples': 4665984, 'steps': 24301, 'loss/train': 0.9645561575889587}} 11/07/2021 00:38:44 - INFO - __main__ - Step 24306: {'lr': 0.00047249811076628483, 'samples': 4666752, 'steps': 24305, 'loss/train': 2.0788919925689697}} 11/07/2021 00:38:44 - INFO - __main__ - Step 24306: {'lr': 0.00047249811076628483, 'samples': 4666752, 'steps': 24305, 'loss/train': 2.0788919925689697}} 11/07/2021 00:38:47 - INFO - __main__ - Step 24313: {'lr': 0.0004724811701006322, 'samples': 4668096, 'steps': 24312, 'loss/train': 0.16031067073345184}} 11/07/2021 00:38:50 - INFO - __main__ - Step 24318: {'lr': 0.0004724690666177468, 'samples': 4669056, 'steps': 24317, 'loss/train': 1.5593488216400146}}} 11/07/2021 00:38:52 - INFO - __main__ - Step 24322: {'lr': 0.0004724593820270916, 'samples': 4669824, 'steps': 24321, 'loss/train': 1.7909736633300781}}} 11/07/2021 00:38:54 - INFO - __main__ - Step 24326: {'lr': 0.0004724496958326482, 'samples': 4670592, 'steps': 24325, 'loss/train': 0.8827059268951416}}} 11/07/2021 00:38:56 - INFO - __main__ - Step 24330: {'lr': 0.00047244000803448635, 'samples': 4671360, 'steps': 24329, 'loss/train': 1.240768551826477}}} 11/07/2021 00:38:57 - INFO - __main__ - Step 24334: {'lr': 0.00047243031863267594, 'samples': 4672128, 'steps': 24333, 'loss/train': 1.7888855934143066}} 11/07/2021 00:38:59 - INFO - __main__ - Step 24338: {'lr': 0.0004724206276272868, 'samples': 4672896, 'steps': 24337, 'loss/train': 1.5123300552368164}}} 11/07/2021 00:39:02 - INFO - __main__ - Step 24343: {'lr': 0.00047240851161562433, 'samples': 4673856, 'steps': 24342, 'loss/train': 1.980881929397583}}} 11/07/2021 00:39:04 - INFO - __main__ - Step 24347: {'lr': 0.0004723988170024386, 'samples': 4674624, 'steps': 24346, 'loss/train': 1.3413307666778564}}} 11/07/2021 00:39:06 - INFO - __main__ - Step 24351: {'lr': 0.0004723891207859012, 'samples': 4675392, 'steps': 24350, 'loss/train': 1.4413059949874878}}} 11/07/2021 00:39:07 - INFO - __main__ - Step 24355: {'lr': 0.00047237942296608223, 'samples': 4676160, 'steps': 24354, 'loss/train': 1.353247046470642}}} 11/07/2021 00:39:09 - INFO - __main__ - Step 24359: {'lr': 0.0004723697235430514, 'samples': 4676928, 'steps': 24358, 'loss/train': 1.738546371459961}}}} 11/07/2021 00:39:12 - INFO - __main__ - Step 24364: {'lr': 0.0004723575970098528, 'samples': 4677888, 'steps': 24363, 'loss/train': 1.7367303371429443}}} 11/07/2021 00:39:12 - INFO - __main__ - Step 24364: {'lr': 0.0004723575970098528, 'samples': 4677888, 'steps': 24363, 'loss/train': 1.7367303371429443}}} 11/07/2021 00:39:15 - INFO - __main__ - Step 24371: {'lr': 0.00047234061565538753, 'samples': 4679232, 'steps': 24370, 'loss/train': 1.5303529500961304}} 11/07/2021 00:39:17 - INFO - __main__ - Step 24375: {'lr': 0.000472330909820209, 'samples': 4680000, 'steps': 24374, 'loss/train': 1.458146095275879}04}} 11/07/2021 00:39:20 - INFO - __main__ - Step 24380: {'lr': 0.0004723187752722193, 'samples': 4680960, 'steps': 24379, 'loss/train': 1.0134202241897583}}} 11/07/2021 00:39:22 - INFO - __main__ - Step 24385: {'lr': 0.0004723066382198943, 'samples': 4681920, 'steps': 24384, 'loss/train': 1.359289288520813}}}} 11/07/2021 00:39:24 - INFO - __main__ - Step 24389: {'lr': 0.0004722969267750048, 'samples': 4682688, 'steps': 24388, 'loss/train': 1.652161717414856}}}} 11/07/2021 00:39:24 - INFO - __main__ - Step 24389: {'lr': 0.0004722969267750048, 'samples': 4682688, 'steps': 24388, 'loss/train': 1.652161717414856}}}} 11/07/2021 00:39:27 - INFO - __main__ - Step 24395: {'lr': 0.0004722823566027855, 'samples': 4683840, 'steps': 24394, 'loss/train': 1.5702743530273438}}} 11/07/2021 00:39:29 - INFO - __main__ - Step 24400: {'lr': 0.00047227021203827523, 'samples': 4684800, 'steps': 24399, 'loss/train': 1.2277770042419434}} 11/07/2021 00:39:29 - INFO - __main__ - Step 24400: {'lr': 0.00047227021203827523, 'samples': 4684800, 'steps': 24399, 'loss/train': 1.2277770042419434}} 11/07/2021 00:39:33 - INFO - __main__ - Step 24408: {'lr': 0.0004722507755272364, 'samples': 4686336, 'steps': 24407, 'loss/train': 2.113673210144043}4}} 11/07/2021 00:39:35 - INFO - __main__ - Step 24412: {'lr': 0.00047224105486825543, 'samples': 4687104, 'steps': 24411, 'loss/train': 1.7264853715896606}} 11/07/2021 00:39:37 - INFO - __main__ - Step 24416: {'lr': 0.0004722313326070602, 'samples': 4687872, 'steps': 24415, 'loss/train': 1.982115626335144}6}} 11/07/2021 00:39:39 - INFO - __main__ - Step 24420: {'lr': 0.0004722216087437208, 'samples': 4688640, 'steps': 24419, 'loss/train': 1.2910867929458618}}} 11/07/2021 00:39:42 - INFO - __main__ - Step 24424: {'lr': 0.0004722118832783074, 'samples': 4689408, 'steps': 24423, 'loss/train': 1.6926751136779785}}} 11/07/2021 00:39:42 - INFO - __main__ - Step 24424: {'lr': 0.0004722118832783074, 'samples': 4689408, 'steps': 24423, 'loss/train': 1.6926751136779785}}} 11/07/2021 00:39:45 - INFO - __main__ - Step 24431: {'lr': 0.0004721948598590542, 'samples': 4690752, 'steps': 24430, 'loss/train': 1.5635464191436768}}} 11/07/2021 00:39:48 - INFO - __main__ - Step 24436: {'lr': 0.00047218269727032413, 'samples': 4691712, 'steps': 24435, 'loss/train': 1.4665985107421875}} 11/07/2021 00:39:50 - INFO - __main__ - Step 24440: {'lr': 0.0004721729653973158, 'samples': 4692480, 'steps': 24439, 'loss/train': 1.4278517961502075}}} 11/07/2021 00:39:52 - INFO - __main__ - Step 24444: {'lr': 0.00047216323192258416, 'samples': 4693248, 'steps': 24443, 'loss/train': 1.642939567565918}}} 11/07/2021 00:39:53 - INFO - __main__ - Step 24448: {'lr': 0.0004721534968461992, 'samples': 4694016, 'steps': 24447, 'loss/train': 1.7327210903167725}}} 11/07/2021 00:39:55 - INFO - __main__ - Step 24452: {'lr': 0.00047214376016823143, 'samples': 4694784, 'steps': 24451, 'loss/train': 1.277727484703064}}} 11/07/2021 00:39:58 - INFO - __main__ - Step 24457: {'lr': 0.00047213158706865246, 'samples': 4695744, 'steps': 24456, 'loss/train': 1.4079968929290771}} 11/07/2021 00:40:00 - INFO - __main__ - Step 24461: {'lr': 0.00047212184678737946, 'samples': 4696512, 'steps': 24460, 'loss/train': 1.7175503969192505}} 11/07/2021 00:40:00 - INFO - __main__ - Step 24461: {'lr': 0.00047212184678737946, 'samples': 4696512, 'steps': 24460, 'loss/train': 1.7175503969192505}} 11/07/2021 00:40:03 - INFO - __main__ - Step 24468: {'lr': 0.00047210479744193404, 'samples': 4697856, 'steps': 24467, 'loss/train': 1.6929770708084106}} 11/07/2021 00:40:05 - INFO - __main__ - Step 24472: {'lr': 0.0004720950527571043, 'samples': 4698624, 'steps': 24471, 'loss/train': 2.0582070350646973}}} 11/07/2021 00:40:08 - INFO - __main__ - Step 24477: {'lr': 0.0004720828696494418, 'samples': 4699584, 'steps': 24476, 'loss/train': 1.696797251701355}}}} 11/07/2021 00:40:10 - INFO - __main__ - Step 24481: {'lr': 0.0004720731213620972, 'samples': 4700352, 'steps': 24480, 'loss/train': 1.7137320041656494}}} 11/07/2021 00:40:10 - INFO - __main__ - Step 24481: {'lr': 0.0004720731213620972, 'samples': 4700352, 'steps': 24480, 'loss/train': 1.7137320041656494}}} 11/07/2021 00:40:13 - INFO - __main__ - Step 24488: {'lr': 0.00047205605800687154, 'samples': 4701696, 'steps': 24487, 'loss/train': 0.6376531720161438}} 11/07/2021 00:40:15 - INFO - __main__ - Step 24493: {'lr': 0.0004720438668943232, 'samples': 4702656, 'steps': 24492, 'loss/train': 1.3901020288467407}}} 11/07/2021 00:40:18 - INFO - __main__ - Step 24498: {'lr': 0.00047203167328053634, 'samples': 4703616, 'steps': 24497, 'loss/train': 1.4276381731033325}} 11/07/2021 00:40:18 - INFO - __main__ - Step 24498: {'lr': 0.00047203167328053634, 'samples': 4703616, 'steps': 24497, 'loss/train': 1.4276381731033325}} 11/07/2021 00:40:22 - INFO - __main__ - Step 24506: {'lr': 0.000472012158296244, 'samples': 4705152, 'steps': 24505, 'loss/train': 1.6829310655593872}5}} 11/07/2021 00:40:22 - INFO - __main__ - Step 24506: {'lr': 0.000472012158296244, 'samples': 4705152, 'steps': 24505, 'loss/train': 1.6829310655593872}5}} 11/07/2021 00:40:25 - INFO - __main__ - Step 24513: {'lr': 0.0004719950774331183, 'samples': 4706496, 'steps': 24512, 'loss/train': 1.3297889232635498}}} 11/07/2021 00:40:28 - INFO - __main__ - Step 24518: {'lr': 0.0004719828738157512, 'samples': 4707456, 'steps': 24517, 'loss/train': 1.7587006092071533}}} 11/07/2021 00:40:30 - INFO - __main__ - Step 24523: {'lr': 0.00047197066769783284, 'samples': 4708416, 'steps': 24522, 'loss/train': 2.5680861473083496}} 11/07/2021 00:40:32 - INFO - __main__ - Step 24527: {'lr': 0.00047196090100319333, 'samples': 4709184, 'steps': 24526, 'loss/train': 2.022808313369751}}} 11/07/2021 00:40:32 - INFO - __main__ - Step 24527: {'lr': 0.00047196090100319333, 'samples': 4709184, 'steps': 24526, 'loss/train': 2.022808313369751}}} 11/07/2021 00:40:35 - INFO - __main__ - Step 24534: {'lr': 0.0004719438054371487, 'samples': 4710528, 'steps': 24533, 'loss/train': 1.3666858673095703}}} 11/07/2021 00:40:38 - INFO - __main__ - Step 24539: {'lr': 0.0004719315913183897, 'samples': 4711488, 'steps': 24538, 'loss/train': 1.300622582435608}}}} 11/07/2021 00:40:40 - INFO - __main__ - Step 24544: {'lr': 0.000471919374699657, 'samples': 4712448, 'steps': 24543, 'loss/train': 1.5135279893875122}}}} 11/07/2021 00:40:43 - INFO - __main__ - Step 24549: {'lr': 0.0004719071555810881, 'samples': 4713408, 'steps': 24548, 'loss/train': 1.8267337083816528}}} 11/07/2021 00:40:43 - INFO - __main__ - Step 24549: {'lr': 0.0004719071555810881, 'samples': 4713408, 'steps': 24548, 'loss/train': 1.8267337083816528}}} 11/07/2021 00:40:46 - INFO - __main__ - Step 24556: {'lr': 0.0004718900446156291, 'samples': 4714752, 'steps': 24555, 'loss/train': 1.4986615180969238}}} 11/07/2021 00:40:48 - INFO - __main__ - Step 24560: {'lr': 0.00047188026472149184, 'samples': 4715520, 'steps': 24559, 'loss/train': 1.8036824464797974}} 11/07/2021 00:40:51 - INFO - __main__ - Step 24565: {'lr': 0.0004718680376043724, 'samples': 4716480, 'steps': 24564, 'loss/train': 0.17302846908569336}} 11/07/2021 00:40:53 - INFO - __main__ - Step 24569: {'lr': 0.00047185825411120454, 'samples': 4717248, 'steps': 24568, 'loss/train': 1.6954476833343506}} 11/07/2021 00:40:55 - INFO - __main__ - Step 24573: {'lr': 0.00047184846901858225, 'samples': 4718016, 'steps': 24572, 'loss/train': 1.6777819395065308}} 11/07/2021 00:40:56 - INFO - __main__ - Step 24577: {'lr': 0.000471838682326576, 'samples': 4718784, 'steps': 24576, 'loss/train': 1.5703866481781006}8}} 11/07/2021 00:40:58 - INFO - __main__ - Step 24581: {'lr': 0.0004718288940352564, 'samples': 4719552, 'steps': 24580, 'loss/train': 1.4074021577835083}}} 11/07/2021 00:41:01 - INFO - __main__ - Step 24586: {'lr': 0.0004718166564221799, 'samples': 4720512, 'steps': 24585, 'loss/train': 1.821776032447815}}}} 11/07/2021 00:41:01 - INFO - __main__ - Step 24586: {'lr': 0.0004718166564221799, 'samples': 4720512, 'steps': 24585, 'loss/train': 1.821776032447815}}}} 11/07/2021 00:41:04 - INFO - __main__ - Step 24593: {'lr': 0.0004717995195661229, 'samples': 4721856, 'steps': 24592, 'loss/train': 3.1901888847351074}}} 11/07/2021 00:41:06 - INFO - __main__ - Step 24597: {'lr': 0.0004717897248782555, 'samples': 4722624, 'steps': 24596, 'loss/train': 1.4173022508621216}}} 11/07/2021 00:41:09 - INFO - __main__ - Step 24602: {'lr': 0.00047177747926989134, 'samples': 4723584, 'steps': 24601, 'loss/train': 1.7290757894515991}} 11/07/2021 00:41:11 - INFO - __main__ - Step 24606: {'lr': 0.00047176768098446234, 'samples': 4724352, 'steps': 24605, 'loss/train': 1.3547577857971191}} 11/07/2021 00:41:11 - INFO - __main__ - Step 24606: {'lr': 0.00047176768098446234, 'samples': 4724352, 'steps': 24605, 'loss/train': 1.3547577857971191}} 11/07/2021 00:41:14 - INFO - __main__ - Step 24613: {'lr': 0.0004717505301378877, 'samples': 4725696, 'steps': 24612, 'loss/train': 1.8070693016052246}}} 11/07/2021 00:41:17 - INFO - __main__ - Step 24618: {'lr': 0.0004717382765356485, 'samples': 4726656, 'steps': 24617, 'loss/train': 1.8763755559921265}}} 11/07/2021 00:41:19 - INFO - __main__ - Step 24622: {'lr': 0.0004717284718554373, 'samples': 4727424, 'steps': 24621, 'loss/train': 1.660263180732727}}}} 11/07/2021 00:41:21 - INFO - __main__ - Step 24626: {'lr': 0.0004717186655767073, 'samples': 4728192, 'steps': 24625, 'loss/train': 1.1362905502319336}}} 11/07/2021 00:41:22 - INFO - __main__ - Step 24630: {'lr': 0.00047170885769952907, 'samples': 4728960, 'steps': 24629, 'loss/train': 1.637890338897705}}} 11/07/2021 00:41:25 - INFO - __main__ - Step 24634: {'lr': 0.0004716990482239735, 'samples': 4729728, 'steps': 24633, 'loss/train': 1.6207184791564941}}} 11/07/2021 00:41:25 - INFO - __main__ - Step 24634: {'lr': 0.0004716990482239735, 'samples': 4729728, 'steps': 24633, 'loss/train': 1.6207184791564941}}} 11/07/2021 00:41:29 - INFO - __main__ - Step 24642: {'lr': 0.0004716794244780127, 'samples': 4731264, 'steps': 24641, 'loss/train': 1.5986493825912476}}} 11/07/2021 00:41:31 - INFO - __main__ - Step 24646: {'lr': 0.0004716696102077491, 'samples': 4732032, 'steps': 24645, 'loss/train': 0.7927688956260681}}} 11/07/2021 00:41:32 - INFO - __main__ - Step 24650: {'lr': 0.000471659794339391, 'samples': 4732800, 'steps': 24649, 'loss/train': 1.5934165716171265}}}} 11/07/2021 00:41:34 - INFO - __main__ - Step 24654: {'lr': 0.0004716499768730092, 'samples': 4733568, 'steps': 24653, 'loss/train': 1.840376853942871}}}} 11/07/2021 00:41:36 - INFO - __main__ - Step 24658: {'lr': 0.00047164015780867444, 'samples': 4734336, 'steps': 24657, 'loss/train': 1.7362557649612427}} 11/07/2021 00:41:38 - INFO - __main__ - Step 24662: {'lr': 0.0004716303371464575, 'samples': 4735104, 'steps': 24661, 'loss/train': 2.1853394508361816}}} 11/07/2021 00:41:40 - INFO - __main__ - Step 24666: {'lr': 0.0004716205148864292, 'samples': 4735872, 'steps': 24665, 'loss/train': 1.3622562885284424}}} 11/07/2021 00:41:42 - INFO - __main__ - Step 24670: {'lr': 0.00047161069102866037, 'samples': 4736640, 'steps': 24669, 'loss/train': 1.5919777154922485}} 11/07/2021 00:41:44 - INFO - __main__ - Step 24674: {'lr': 0.00047160086557322185, 'samples': 4737408, 'steps': 24673, 'loss/train': 1.5195907354354858}} 11/07/2021 00:41:46 - INFO - __main__ - Step 24679: {'lr': 0.00047158858150730856, 'samples': 4738368, 'steps': 24678, 'loss/train': 1.2686253786087036}} 11/07/2021 00:41:46 - INFO - __main__ - Step 24679: {'lr': 0.00047158858150730856, 'samples': 4738368, 'steps': 24678, 'loss/train': 1.2686253786087036}} 11/07/2021 00:41:51 - INFO - __main__ - Step 24687: {'lr': 0.00047156892180999624, 'samples': 4739904, 'steps': 24686, 'loss/train': 1.6515963077545166}} 11/07/2021 00:41:52 - INFO - __main__ - Step 24691: {'lr': 0.00047155908956525173, 'samples': 4740672, 'steps': 24690, 'loss/train': 1.6381641626358032}} 11/07/2021 00:41:54 - INFO - __main__ - Step 24695: {'lr': 0.00047154925572320957, 'samples': 4741440, 'steps': 24694, 'loss/train': 1.3117071390151978}} 11/07/2021 00:41:54 - INFO - __main__ - Step 24695: {'lr': 0.00047154925572320957, 'samples': 4741440, 'steps': 24694, 'loss/train': 1.3117071390151978}} 11/07/2021 00:41:59 - INFO - __main__ - Step 24703: {'lr': 0.0004715295832475156, 'samples': 4742976, 'steps': 24702, 'loss/train': 1.516232967376709}8}} 11/07/2021 00:42:00 - INFO - __main__ - Step 24707: {'lr': 0.0004715197446140057, 'samples': 4743744, 'steps': 24706, 'loss/train': 0.7026177644729614}}} 11/07/2021 00:42:02 - INFO - __main__ - Step 24711: {'lr': 0.0004715099043834818, 'samples': 4744512, 'steps': 24710, 'loss/train': 1.6812454462051392}}} 11/07/2021 00:42:04 - INFO - __main__ - Step 24716: {'lr': 0.00047149760184963385, 'samples': 4745472, 'steps': 24715, 'loss/train': 1.8487763404846191}} 11/07/2021 00:42:07 - INFO - __main__ - Step 24721: {'lr': 0.00047148529682070094, 'samples': 4746432, 'steps': 24720, 'loss/train': 0.9560302495956421}} 11/07/2021 00:42:07 - INFO - __main__ - Step 24721: {'lr': 0.00047148529682070094, 'samples': 4746432, 'steps': 24720, 'loss/train': 0.9560302495956421}} 11/07/2021 00:42:11 - INFO - __main__ - Step 24728: {'lr': 0.00047146806558871594, 'samples': 4747776, 'steps': 24727, 'loss/train': 1.5736908912658691}} 11/07/2021 00:42:12 - INFO - __main__ - Step 24732: {'lr': 0.00047145821697503235, 'samples': 4748544, 'steps': 24731, 'loss/train': 1.6949464082717896}} 11/07/2021 00:42:14 - INFO - __main__ - Step 24737: {'lr': 0.00047144590396275895, 'samples': 4749504, 'steps': 24736, 'loss/train': 1.4250119924545288}} 11/07/2021 00:42:17 - INFO - __main__ - Step 24742: {'lr': 0.00047143358845598283, 'samples': 4750464, 'steps': 24741, 'loss/train': 1.4710402488708496}} 11/07/2021 00:42:19 - INFO - __main__ - Step 24746: {'lr': 0.0004714237342546133, 'samples': 4751232, 'steps': 24745, 'loss/train': 1.678723692893982}6}} 11/07/2021 00:42:19 - INFO - __main__ - Step 24746: {'lr': 0.0004714237342546133, 'samples': 4751232, 'steps': 24745, 'loss/train': 1.678723692893982}6}} 11/07/2021 00:42:22 - INFO - __main__ - Step 24753: {'lr': 0.00047140648556110966, 'samples': 4752576, 'steps': 24752, 'loss/train': 1.7825654745101929}} 11/07/2021 00:42:24 - INFO - __main__ - Step 24757: {'lr': 0.0004713966269700259, 'samples': 4753344, 'steps': 24756, 'loss/train': 1.1757967472076416}}} 11/07/2021 00:42:27 - INFO - __main__ - Step 24762: {'lr': 0.00047138430148662666, 'samples': 4754304, 'steps': 24761, 'loss/train': 1.994339942932129}}} 11/07/2021 00:42:27 - INFO - __main__ - Step 24762: {'lr': 0.00047138430148662666, 'samples': 4754304, 'steps': 24761, 'loss/train': 1.994339942932129}}} 11/07/2021 00:42:30 - INFO - __main__ - Step 24769: {'lr': 0.0004713670416203001, 'samples': 4755648, 'steps': 24768, 'loss/train': 1.62954580783844}9}}} 11/07/2021 00:42:32 - INFO - __main__ - Step 24773: {'lr': 0.00047135717664513704, 'samples': 4756416, 'steps': 24772, 'loss/train': 1.426689863204956}}} 11/07/2021 00:42:35 - INFO - __main__ - Step 24778: {'lr': 0.0004713448431820387, 'samples': 4757376, 'steps': 24777, 'loss/train': 1.5333009958267212}}} 11/07/2021 00:42:37 - INFO - __main__ - Step 24782: {'lr': 0.000471334974616331, 'samples': 4758144, 'steps': 24781, 'loss/train': 1.3281073570251465}}}} 11/07/2021 00:42:39 - INFO - __main__ - Step 24786: {'lr': 0.0004713251044549414, 'samples': 4758912, 'steps': 24785, 'loss/train': 1.5033193826675415}}} 11/07/2021 00:42:40 - INFO - __main__ - Step 24790: {'lr': 0.000471315232697941, 'samples': 4759680, 'steps': 24789, 'loss/train': 1.7934147119522095}}}} 11/07/2021 00:42:42 - INFO - __main__ - Step 24794: {'lr': 0.00047130535934540086, 'samples': 4760448, 'steps': 24793, 'loss/train': 1.7444671392440796}} 11/07/2021 00:42:45 - INFO - __main__ - Step 24799: {'lr': 0.0004712930154111065, 'samples': 4761408, 'steps': 24798, 'loss/train': 1.6242315769195557}}} 11/07/2021 00:42:47 - INFO - __main__ - Step 24804: {'lr': 0.00047128066898403166, 'samples': 4762368, 'steps': 24803, 'loss/train': 1.2261611223220825}} 11/07/2021 00:42:47 - INFO - __main__ - Step 24804: {'lr': 0.00047128066898403166, 'samples': 4762368, 'steps': 24803, 'loss/train': 1.2261611223220825}} 11/07/2021 00:42:51 - INFO - __main__ - Step 24811: {'lr': 0.0004712633797985206, 'samples': 4763712, 'steps': 24810, 'loss/train': 2.1266679763793945}}} 11/07/2021 00:42:52 - INFO - __main__ - Step 24815: {'lr': 0.0004712534980705654, 'samples': 4764480, 'steps': 24814, 'loss/train': 1.4523907899856567}}} 11/07/2021 00:42:54 - INFO - __main__ - Step 24819: {'lr': 0.0004712436147475155, 'samples': 4765248, 'steps': 24818, 'loss/train': 1.3041859865188599}}} 11/07/2021 00:42:57 - INFO - __main__ - Step 24824: {'lr': 0.00047123125835071004, 'samples': 4766208, 'steps': 24823, 'loss/train': 1.301546573638916}}} 11/07/2021 00:42:57 - INFO - __main__ - Step 24824: {'lr': 0.00047123125835071004, 'samples': 4766208, 'steps': 24823, 'loss/train': 1.301546573638916}}} 11/07/2021 00:43:01 - INFO - __main__ - Step 24832: {'lr': 0.00047121148293234274, 'samples': 4767744, 'steps': 24831, 'loss/train': 1.7730728387832642}} 11/07/2021 00:43:03 - INFO - __main__ - Step 24836: {'lr': 0.0004712015928309359, 'samples': 4768512, 'steps': 24835, 'loss/train': 1.6604273319244385}}} 11/07/2021 00:43:05 - INFO - __main__ - Step 24840: {'lr': 0.00047119170113480867, 'samples': 4769280, 'steps': 24839, 'loss/train': 1.0669902563095093}} 11/07/2021 00:43:07 - INFO - __main__ - Step 24845: {'lr': 0.0004711793342721828, 'samples': 4770240, 'steps': 24844, 'loss/train': 1.9089672565460205}}} 11/07/2021 00:43:09 - INFO - __main__ - Step 24849: {'lr': 0.0004711694389881955, 'samples': 4771008, 'steps': 24848, 'loss/train': 1.607546329498291}}}} 11/07/2021 00:43:11 - INFO - __main__ - Step 24853: {'lr': 0.00047115954210971955, 'samples': 4771776, 'steps': 24852, 'loss/train': 1.1933653354644775}} 11/07/2021 00:43:13 - INFO - __main__ - Step 24857: {'lr': 0.0004711496436368264, 'samples': 4772544, 'steps': 24856, 'loss/train': 1.4110667705535889}}} 11/07/2021 00:43:15 - INFO - __main__ - Step 24861: {'lr': 0.00047113974356958744, 'samples': 4773312, 'steps': 24860, 'loss/train': 1.6529386043548584}} 11/07/2021 00:43:17 - INFO - __main__ - Step 24865: {'lr': 0.0004711298419080739, 'samples': 4774080, 'steps': 24864, 'loss/train': 1.6152267456054688}}} 11/07/2021 00:43:19 - INFO - __main__ - Step 24870: {'lr': 0.0004711174625893423, 'samples': 4775040, 'steps': 24869, 'loss/train': 1.597435474395752}}}} 11/07/2021 00:43:19 - INFO - __main__ - Step 24870: {'lr': 0.0004711174625893423, 'samples': 4775040, 'steps': 24869, 'loss/train': 1.597435474395752}}}} 11/07/2021 00:43:22 - INFO - __main__ - Step 24877: {'lr': 0.0004711001273586003, 'samples': 4776384, 'steps': 24876, 'loss/train': 1.7008123397827148}}} 11/07/2021 00:43:25 - INFO - __main__ - Step 24881: {'lr': 0.00047109021932070284, 'samples': 4777152, 'steps': 24880, 'loss/train': 1.713632345199585}}} 11/07/2021 00:43:27 - INFO - __main__ - Step 24886: {'lr': 0.00047107783203189285, 'samples': 4778112, 'steps': 24885, 'loss/train': 1.2156516313552856}} 11/07/2021 00:43:27 - INFO - __main__ - Step 24886: {'lr': 0.00047107783203189285, 'samples': 4778112, 'steps': 24885, 'loss/train': 1.2156516313552856}} 11/07/2021 00:43:30 - INFO - __main__ - Step 24892: {'lr': 0.0004710629639980626, 'samples': 4779264, 'steps': 24891, 'loss/train': 1.6932626962661743}}} 11/07/2021 00:43:32 - INFO - __main__ - Step 24897: {'lr': 0.0004710505712306526, 'samples': 4780224, 'steps': 24896, 'loss/train': 1.4603281021118164}}} 11/07/2021 00:43:32 - INFO - __main__ - Step 24897: {'lr': 0.0004710505712306526, 'samples': 4780224, 'steps': 24896, 'loss/train': 1.4603281021118164}}} 11/07/2021 00:43:37 - INFO - __main__ - Step 24905: {'lr': 0.00047103073762355186, 'samples': 4781760, 'steps': 24904, 'loss/train': 1.4467488527297974}} 11/07/2021 00:43:39 - INFO - __main__ - Step 24909: {'lr': 0.0004710208184297329, 'samples': 4782528, 'steps': 24908, 'loss/train': 0.9432609677314758}}} 11/07/2021 00:43:40 - INFO - __main__ - Step 24913: {'lr': 0.00047101089764249674, 'samples': 4783296, 'steps': 24912, 'loss/train': 1.6054977178573608}} 11/07/2021 00:43:43 - INFO - __main__ - Step 24917: {'lr': 0.00047100097526191486, 'samples': 4784064, 'steps': 24916, 'loss/train': 0.7655185461044312}} 11/07/2021 00:43:45 - INFO - __main__ - Step 24921: {'lr': 0.00047099105128805906, 'samples': 4784832, 'steps': 24920, 'loss/train': 1.7223882675170898}} 11/07/2021 00:43:47 - INFO - __main__ - Step 24925: {'lr': 0.0004709811257210007, 'samples': 4785600, 'steps': 24924, 'loss/train': 1.4822044372558594}}} 11/07/2021 00:43:48 - INFO - __main__ - Step 24929: {'lr': 0.0004709711985608114, 'samples': 4786368, 'steps': 24928, 'loss/train': 1.8573694229125977}}} 11/07/2021 00:43:50 - INFO - __main__ - Step 24933: {'lr': 0.0004709612698075627, 'samples': 4787136, 'steps': 24932, 'loss/train': 2.014444351196289}}}} 11/07/2021 00:43:53 - INFO - __main__ - Step 24938: {'lr': 0.00047094885662587104, 'samples': 4788096, 'steps': 24937, 'loss/train': 1.4386154413223267}} 11/07/2021 00:43:53 - INFO - __main__ - Step 24938: {'lr': 0.00047094885662587104, 'samples': 4788096, 'steps': 24937, 'loss/train': 1.4386154413223267}} 11/07/2021 00:43:53 - INFO - __main__ - Step 24938: {'lr': 0.00047094885662587104, 'samples': 4788096, 'steps': 24937, 'loss/train': 1.4386154413223267}} 11/07/2021 00:43:59 - INFO - __main__ - Step 24949: {'lr': 0.00047092153886540554, 'samples': 4790208, 'steps': 24948, 'loss/train': 0.7883797883987427}} 11/07/2021 00:43:59 - INFO - __main__ - Step 24949: {'lr': 0.00047092153886540554, 'samples': 4790208, 'steps': 24948, 'loss/train': 0.7883797883987427}} 11/07/2021 00:44:02 - INFO - __main__ - Step 24957: {'lr': 0.0004709016638378323, 'samples': 4791744, 'steps': 24956, 'loss/train': 1.393989086151123}7}} 11/07/2021 00:44:04 - INFO - __main__ - Step 24961: {'lr': 0.0004708917239351727, 'samples': 4792512, 'steps': 24960, 'loss/train': 1.5697776079177856}}} 11/07/2021 00:44:07 - INFO - __main__ - Step 24966: {'lr': 0.00047087929681742253, 'samples': 4793472, 'steps': 24965, 'loss/train': 1.5869165658950806}} 11/07/2021 00:44:09 - INFO - __main__ - Step 24971: {'lr': 0.00047086686721155237, 'samples': 4794432, 'steps': 24970, 'loss/train': 1.0224506855010986}} 11/07/2021 00:44:09 - INFO - __main__ - Step 24971: {'lr': 0.00047086686721155237, 'samples': 4794432, 'steps': 24970, 'loss/train': 1.0224506855010986}} 11/07/2021 00:44:12 - INFO - __main__ - Step 24978: {'lr': 0.0004708494615835589, 'samples': 4795776, 'steps': 24977, 'loss/train': 1.6182292699813843}}} 11/07/2021 00:44:14 - INFO - __main__ - Step 24982: {'lr': 0.0004708395133211452, 'samples': 4796544, 'steps': 24981, 'loss/train': 1.6060330867767334}}} 11/07/2021 00:44:17 - INFO - __main__ - Step 24987: {'lr': 0.00047082707575423177, 'samples': 4797504, 'steps': 24986, 'loss/train': 1.502872109413147}}} 11/07/2021 00:44:19 - INFO - __main__ - Step 24992: {'lr': 0.00047081463569978655, 'samples': 4798464, 'steps': 24991, 'loss/train': 1.200833797454834}}} 11/07/2021 00:44:21 - INFO - __main__ - Step 24996: {'lr': 0.0004708046818653017, 'samples': 4799232, 'steps': 24995, 'loss/train': 1.973433256149292}}}} 11/07/2021 00:44:21 - INFO - __main__ - Step 24996: {'lr': 0.0004708046818653017, 'samples': 4799232, 'steps': 24995, 'loss/train': 1.973433256149292}}}} 11/07/2021 00:44:25 - INFO - __main__ - Step 25002: {'lr': 0.0004707897481288612, 'samples': 4800384, 'steps': 25001, 'loss/train': 1.7442258596420288}}} 11/07/2021 00:44:25 - INFO - __main__ - Step 25002: {'lr': 0.0004707897481288612, 'samples': 4800384, 'steps': 25001, 'loss/train': 1.7442258596420288}}} 11/07/2021 00:44:29 - INFO - __main__ - Step 25011: {'lr': 0.00047076734080907576, 'samples': 4802112, 'steps': 25010, 'loss/train': 2.856903314590454}}} 11/07/2021 00:44:31 - INFO - __main__ - Step 25015: {'lr': 0.0004707573794139003, 'samples': 4802880, 'steps': 25014, 'loss/train': 1.6455538272857666}}} 11/07/2021 00:44:33 - INFO - __main__ - Step 25020: {'lr': 0.0004707449254318673, 'samples': 4803840, 'steps': 25019, 'loss/train': 1.0126532316207886}}} 11/07/2021 00:44:33 - INFO - __main__ - Step 25020: {'lr': 0.0004707449254318673, 'samples': 4803840, 'steps': 25019, 'loss/train': 1.0126532316207886}}} 11/07/2021 00:44:37 - INFO - __main__ - Step 25028: {'lr': 0.00047072499388853164, 'samples': 4805376, 'steps': 25027, 'loss/train': 1.6089210510253906}} 11/07/2021 00:44:39 - INFO - __main__ - Step 25032: {'lr': 0.0004707150257299012, 'samples': 4806144, 'steps': 25031, 'loss/train': 1.7053147554397583}}} 11/07/2021 00:44:41 - INFO - __main__ - Step 25036: {'lr': 0.0004707050559800582, 'samples': 4806912, 'steps': 25035, 'loss/train': 1.3381532430648804}}} 11/07/2021 00:44:43 - INFO - __main__ - Step 25040: {'lr': 0.0004706950846390746, 'samples': 4807680, 'steps': 25039, 'loss/train': 1.9020907878875732}}} 11/07/2021 00:44:45 - INFO - __main__ - Step 25044: {'lr': 0.0004706851117070221, 'samples': 4808448, 'steps': 25043, 'loss/train': 1.604849100112915}}}} 11/07/2021 00:44:47 - INFO - __main__ - Step 25049: {'lr': 0.0004706726433046256, 'samples': 4809408, 'steps': 25048, 'loss/train': 1.6697512865066528}}} 11/07/2021 00:44:49 - INFO - __main__ - Step 25053: {'lr': 0.00047066266679293125, 'samples': 4810176, 'steps': 25052, 'loss/train': 1.6824945211410522}} 11/07/2021 00:44:51 - INFO - __main__ - Step 25057: {'lr': 0.0004706526886904019, 'samples': 4810944, 'steps': 25056, 'loss/train': 1.7684483528137207}}} 11/07/2021 00:44:53 - INFO - __main__ - Step 25062: {'lr': 0.0004706402138252379, 'samples': 4811904, 'steps': 25061, 'loss/train': 1.537904143333435}}}} 11/07/2021 00:44:55 - INFO - __main__ - Step 25066: {'lr': 0.0004706302321435926, 'samples': 4812672, 'steps': 25065, 'loss/train': 1.943956971168518}}}} 11/07/2021 00:44:57 - INFO - __main__ - Step 25070: {'lr': 0.000470620248871346, 'samples': 4813440, 'steps': 25069, 'loss/train': 2.15901517868042}8}}}} 11/07/2021 00:44:57 - INFO - __main__ - Step 25070: {'lr': 0.000470620248871346, 'samples': 4813440, 'steps': 25069, 'loss/train': 2.15901517868042}8}}}} 11/07/2021 00:45:00 - INFO - __main__ - Step 25077: {'lr': 0.0004706027743177467, 'samples': 4814784, 'steps': 25076, 'loss/train': 1.1344234943389893}}} 11/07/2021 00:45:03 - INFO - __main__ - Step 25083: {'lr': 0.00047058779225232474, 'samples': 4815936, 'steps': 25082, 'loss/train': 1.377502679824829}}} 11/07/2021 00:45:05 - INFO - __main__ - Step 25087: {'lr': 0.0004705778022208259, 'samples': 4816704, 'steps': 25086, 'loss/train': 1.7859622240066528}}} 11/07/2021 00:45:07 - INFO - __main__ - Step 25091: {'lr': 0.0004705678105991039, 'samples': 4817472, 'steps': 25090, 'loss/train': 1.7953321933746338}}} 11/07/2021 00:45:07 - INFO - __main__ - Step 25091: {'lr': 0.0004705678105991039, 'samples': 4817472, 'steps': 25090, 'loss/train': 1.7953321933746338}}} 11/07/2021 00:45:11 - INFO - __main__ - Step 25098: {'lr': 0.0004705503214348323, 'samples': 4818816, 'steps': 25097, 'loss/train': 1.6429988145828247}}} 11/07/2021 00:45:13 - INFO - __main__ - Step 25103: {'lr': 0.0004705378261933186, 'samples': 4819776, 'steps': 25102, 'loss/train': 1.6882308721542358}}} 11/07/2021 00:45:15 - INFO - __main__ - Step 25108: {'lr': 0.0004705253284675314, 'samples': 4820736, 'steps': 25107, 'loss/train': 1.394099473953247}}}} 11/07/2021 00:45:17 - INFO - __main__ - Step 25112: {'lr': 0.0004705153284983192, 'samples': 4821504, 'steps': 25111, 'loss/train': 1.374193787574768}}}} 11/07/2021 00:45:17 - INFO - __main__ - Step 25112: {'lr': 0.0004705153284983192, 'samples': 4821504, 'steps': 25111, 'loss/train': 1.374193787574768}}}} 11/07/2021 00:45:21 - INFO - __main__ - Step 25119: {'lr': 0.0004704978247268505, 'samples': 4822848, 'steps': 25118, 'loss/train': 1.158705472946167}}}} 11/07/2021 00:45:23 - INFO - __main__ - Step 25124: {'lr': 0.0004704853190523342, 'samples': 4823808, 'steps': 25123, 'loss/train': 0.8902316093444824}}} 11/07/2021 00:45:25 - INFO - __main__ - Step 25129: {'lr': 0.0004704728108941358, 'samples': 4824768, 'steps': 25128, 'loss/train': 1.8579035997390747}}} 11/07/2021 00:45:28 - INFO - __main__ - Step 25133: {'lr': 0.00047046280257942067, 'samples': 4825536, 'steps': 25132, 'loss/train': 1.7930819988250732}} 11/07/2021 00:45:30 - INFO - __main__ - Step 25137: {'lr': 0.0004704527926753114, 'samples': 4826304, 'steps': 25136, 'loss/train': 1.6575398445129395}}} 11/07/2021 00:45:32 - INFO - __main__ - Step 25141: {'lr': 0.00047044278118188004, 'samples': 4827072, 'steps': 25140, 'loss/train': 1.7771896123886108}} 11/07/2021 00:45:33 - INFO - __main__ - Step 25145: {'lr': 0.0004704327680991989, 'samples': 4827840, 'steps': 25144, 'loss/train': 1.3845484256744385}}} 11/07/2021 00:45:35 - INFO - __main__ - Step 25149: {'lr': 0.00047042275342734006, 'samples': 4828608, 'steps': 25148, 'loss/train': 1.718420147895813}}} 11/07/2021 00:45:37 - INFO - __main__ - Step 25154: {'lr': 0.00047041023285284545, 'samples': 4829568, 'steps': 25153, 'loss/train': 1.5758517980575562}} 11/07/2021 00:45:40 - INFO - __main__ - Step 25158: {'lr': 0.0004704002146056009, 'samples': 4830336, 'steps': 25157, 'loss/train': 1.5786726474761963}}} 11/07/2021 00:45:42 - INFO - __main__ - Step 25162: {'lr': 0.0004703901947694134, 'samples': 4831104, 'steps': 25161, 'loss/train': 1.6837447881698608}}} 11/07/2021 00:45:43 - INFO - __main__ - Step 25166: {'lr': 0.00047038017334435504, 'samples': 4831872, 'steps': 25165, 'loss/train': 1.4276436567306519}} 11/07/2021 00:45:45 - INFO - __main__ - Step 25170: {'lr': 0.0004703701503304983, 'samples': 4832640, 'steps': 25169, 'loss/train': 1.633947491645813}9}} 11/07/2021 00:45:47 - INFO - __main__ - Step 25174: {'lr': 0.0004703601257279153, 'samples': 4833408, 'steps': 25173, 'loss/train': 1.7189385890960693}}} 11/07/2021 00:45:49 - INFO - __main__ - Step 25178: {'lr': 0.0004703500995366784, 'samples': 4834176, 'steps': 25177, 'loss/train': 1.363921046257019}}}} 11/07/2021 00:45:51 - INFO - __main__ - Step 25182: {'lr': 0.00047034007175685976, 'samples': 4834944, 'steps': 25181, 'loss/train': 1.1298303604125977}} 11/07/2021 00:45:53 - INFO - __main__ - Step 25186: {'lr': 0.0004703300423885318, 'samples': 4835712, 'steps': 25185, 'loss/train': 1.8879923820495605}}} 11/07/2021 00:45:55 - INFO - __main__ - Step 25190: {'lr': 0.0004703200114317667, 'samples': 4836480, 'steps': 25189, 'loss/train': 1.608988881111145}}}} 11/07/2021 00:45:57 - INFO - __main__ - Step 25195: {'lr': 0.00047030747050218094, 'samples': 4837440, 'steps': 25194, 'loss/train': 1.4449735879898071}} 11/07/2021 00:45:59 - INFO - __main__ - Step 25199: {'lr': 0.00047029743597169684, 'samples': 4838208, 'steps': 25198, 'loss/train': 1.051732063293457}}} 11/07/2021 00:45:59 - INFO - __main__ - Step 25199: {'lr': 0.00047029743597169684, 'samples': 4838208, 'steps': 25198, 'loss/train': 1.051732063293457}}} 11/07/2021 00:46:03 - INFO - __main__ - Step 25206: {'lr': 0.0004702798717217822, 'samples': 4839552, 'steps': 25205, 'loss/train': 1.5200536251068115}}} 11/07/2021 00:46:05 - INFO - __main__ - Step 25211: {'lr': 0.0004702673228513221, 'samples': 4840512, 'steps': 25210, 'loss/train': 0.9686245322227478}}} 11/07/2021 00:46:07 - INFO - __main__ - Step 25216: {'lr': 0.0004702547714996355, 'samples': 4841472, 'steps': 25215, 'loss/train': 1.6218972206115723}}} 11/07/2021 00:46:09 - INFO - __main__ - Step 25220: {'lr': 0.0004702447286318983, 'samples': 4842240, 'steps': 25219, 'loss/train': 1.173579454421997}}}} 11/07/2021 00:46:09 - INFO - __main__ - Step 25220: {'lr': 0.0004702447286318983, 'samples': 4842240, 'steps': 25219, 'loss/train': 1.173579454421997}}}} 11/07/2021 00:46:13 - INFO - __main__ - Step 25227: {'lr': 0.00047022714979270497, 'samples': 4843584, 'steps': 25226, 'loss/train': 1.372851848602295}}} 11/07/2021 00:46:15 - INFO - __main__ - Step 25231: {'lr': 0.00047021710255863144, 'samples': 4844352, 'steps': 25230, 'loss/train': 1.5361984968185425}} 11/07/2021 00:46:17 - INFO - __main__ - Step 25237: {'lr': 0.00047020202873075093, 'samples': 4845504, 'steps': 25236, 'loss/train': 1.9136335849761963}} 11/07/2021 00:46:17 - INFO - __main__ - Step 25237: {'lr': 0.00047020202873075093, 'samples': 4845504, 'steps': 25236, 'loss/train': 1.9136335849761963}} 11/07/2021 00:46:21 - INFO - __main__ - Step 25244: {'lr': 0.0004701844380837474, 'samples': 4846848, 'steps': 25243, 'loss/train': 1.9548184871673584}}} 11/07/2021 00:46:23 - INFO - __main__ - Step 25248: {'lr': 0.0004701743841027771, 'samples': 4847616, 'steps': 25247, 'loss/train': 1.7109966278076172}}} 11/07/2021 00:46:25 - INFO - __main__ - Step 25252: {'lr': 0.000470164328534492, 'samples': 4848384, 'steps': 25251, 'loss/train': 1.9276931285858154}}}} 11/07/2021 00:46:28 - INFO - __main__ - Step 25258: {'lr': 0.0004701492422060074, 'samples': 4849536, 'steps': 25257, 'loss/train': 1.896033525466919}}}} 11/07/2021 00:46:30 - INFO - __main__ - Step 25262: {'lr': 0.0004701391826697523, 'samples': 4850304, 'steps': 25261, 'loss/train': 2.0049753189086914}}} 11/07/2021 00:46:30 - INFO - __main__ - Step 25262: {'lr': 0.0004701391826697523, 'samples': 4850304, 'steps': 25261, 'loss/train': 2.0049753189086914}}} 11/07/2021 00:46:33 - INFO - __main__ - Step 25269: {'lr': 0.0004701215746624836, 'samples': 4851648, 'steps': 25268, 'loss/train': 1.712836503982544}}}} 11/07/2021 00:46:35 - INFO - __main__ - Step 25273: {'lr': 0.000470111510761985, 'samples': 4852416, 'steps': 25272, 'loss/train': 1.329630970954895}}}}} 11/07/2021 00:46:38 - INFO - __main__ - Step 25279: {'lr': 0.00047009641193589423, 'samples': 4853568, 'steps': 25278, 'loss/train': 1.3197660446166992}} 11/07/2021 00:46:38 - INFO - __main__ - Step 25279: {'lr': 0.00047009641193589423, 'samples': 4853568, 'steps': 25278, 'loss/train': 1.3197660446166992}} 11/07/2021 00:46:42 - INFO - __main__ - Step 25286: {'lr': 0.00047007879212647744, 'samples': 4854912, 'steps': 25285, 'loss/train': 1.4370410442352295}} 11/07/2021 00:46:43 - INFO - __main__ - Step 25290: {'lr': 0.00047006872148231814, 'samples': 4855680, 'steps': 25289, 'loss/train': 1.8425540924072266}} 11/07/2021 00:46:45 - INFO - __main__ - Step 25294: {'lr': 0.0004700586492516058, 'samples': 4856448, 'steps': 25293, 'loss/train': 1.5009207725524902}}} 11/07/2021 00:46:47 - INFO - __main__ - Step 25299: {'lr': 0.00047004605673223567, 'samples': 4857408, 'steps': 25298, 'loss/train': 1.1666847467422485}} 11/07/2021 00:46:50 - INFO - __main__ - Step 25304: {'lr': 0.0004700334617341316, 'samples': 4858368, 'steps': 25303, 'loss/train': 1.4991850852966309}}} 11/07/2021 00:46:50 - INFO - __main__ - Step 25304: {'lr': 0.0004700334617341316, 'samples': 4858368, 'steps': 25303, 'loss/train': 1.4991850852966309}}} 11/07/2021 00:46:53 - INFO - __main__ - Step 25311: {'lr': 0.000470015824572783, 'samples': 4859712, 'steps': 25310, 'loss/train': 1.4917999505996704}}}} 11/07/2021 00:46:55 - INFO - __main__ - Step 25315: {'lr': 0.00047000574401385835, 'samples': 4860480, 'steps': 25314, 'loss/train': 1.6544361114501953}} 11/07/2021 00:46:57 - INFO - __main__ - Step 25319: {'lr': 0.00046999566186883466, 'samples': 4861248, 'steps': 25318, 'loss/train': 1.403463363647461}}} 11/07/2021 00:46:59 - INFO - __main__ - Step 25323: {'lr': 0.0004699855781377845, 'samples': 4862016, 'steps': 25322, 'loss/train': 0.9190696477890015}}} 11/07/2021 00:47:01 - INFO - __main__ - Step 25327: {'lr': 0.0004699754928207807, 'samples': 4862784, 'steps': 25326, 'loss/train': 1.365442156791687}}}} 11/07/2021 00:47:03 - INFO - __main__ - Step 25331: {'lr': 0.00046996540591789584, 'samples': 4863552, 'steps': 25330, 'loss/train': 1.7588785886764526}} 11/07/2021 00:47:03 - INFO - __main__ - Step 25331: {'lr': 0.00046996540591789584, 'samples': 4863552, 'steps': 25330, 'loss/train': 1.7588785886764526}} 11/07/2021 00:47:07 - INFO - __main__ - Step 25337: {'lr': 0.00046995027259020075, 'samples': 4864704, 'steps': 25336, 'loss/train': 1.8280538320541382}} 11/07/2021 00:47:10 - INFO - __main__ - Step 25343: {'lr': 0.0004699351356946825, 'samples': 4865856, 'steps': 25342, 'loss/train': 1.4869623184204102}}} 11/07/2021 00:47:10 - INFO - __main__ - Step 25343: {'lr': 0.0004699351356946825, 'samples': 4865856, 'steps': 25342, 'loss/train': 1.4869623184204102}}} 11/07/2021 00:47:13 - INFO - __main__ - Step 25350: {'lr': 0.0004699174714742401, 'samples': 4867200, 'steps': 25349, 'loss/train': 1.6201637983322144}}} 11/07/2021 00:47:15 - INFO - __main__ - Step 25354: {'lr': 0.0004699073754539511, 'samples': 4867968, 'steps': 25353, 'loss/train': 1.281815767288208}}}} 11/07/2021 00:47:18 - INFO - __main__ - Step 25359: {'lr': 0.0004698947531991438, 'samples': 4868928, 'steps': 25358, 'loss/train': 1.5658401250839233}}} 11/07/2021 00:47:18 - INFO - __main__ - Step 25359: {'lr': 0.0004698947531991438, 'samples': 4868928, 'steps': 25358, 'loss/train': 1.5658401250839233}}} 11/07/2021 00:47:22 - INFO - __main__ - Step 25366: {'lr': 0.0004698770778810357, 'samples': 4870272, 'steps': 25365, 'loss/train': 1.8698196411132812}}} 11/07/2021 00:47:23 - INFO - __main__ - Step 25370: {'lr': 0.0004698669755196239, 'samples': 4871040, 'steps': 25369, 'loss/train': 1.3435550928115845}}} 11/07/2021 00:47:25 - INFO - __main__ - Step 25374: {'lr': 0.0004698568715731133, 'samples': 4871808, 'steps': 25373, 'loss/train': 1.8538455963134766}}} 11/07/2021 00:47:28 - INFO - __main__ - Step 25379: {'lr': 0.0004698442394110411, 'samples': 4872768, 'steps': 25378, 'loss/train': 1.3638150691986084}}} 11/07/2021 00:47:30 - INFO - __main__ - Step 25383: {'lr': 0.0004698341318983249, 'samples': 4873536, 'steps': 25382, 'loss/train': 0.7573006749153137}}} 11/07/2021 00:47:32 - INFO - __main__ - Step 25387: {'lr': 0.0004698240228007469, 'samples': 4874304, 'steps': 25386, 'loss/train': 1.6194496154785156}}} 11/07/2021 00:47:33 - INFO - __main__ - Step 25391: {'lr': 0.0004698139121183798, 'samples': 4875072, 'steps': 25390, 'loss/train': 1.7080029249191284}}} 11/07/2021 00:47:35 - INFO - __main__ - Step 25395: {'lr': 0.0004698037998512966, 'samples': 4875840, 'steps': 25394, 'loss/train': 1.2428244352340698}}} 11/07/2021 00:47:38 - INFO - __main__ - Step 25400: {'lr': 0.00046979115728904675, 'samples': 4876800, 'steps': 25399, 'loss/train': 1.4458239078521729}} 11/07/2021 00:47:40 - INFO - __main__ - Step 25404: {'lr': 0.00046978104145661885, 'samples': 4877568, 'steps': 25403, 'loss/train': 1.4307785034179688}} 11/07/2021 00:47:42 - INFO - __main__ - Step 25408: {'lr': 0.0004697709240397119, 'samples': 4878336, 'steps': 25407, 'loss/train': 1.5929255485534668}}} 11/07/2021 00:47:43 - INFO - __main__ - Step 25412: {'lr': 0.00046976080503839874, 'samples': 4879104, 'steps': 25411, 'loss/train': 1.5920426845550537}} 11/07/2021 00:47:45 - INFO - __main__ - Step 25416: {'lr': 0.0004697506844527523, 'samples': 4879872, 'steps': 25415, 'loss/train': 1.047653079032898}7}} 11/07/2021 00:47:48 - INFO - __main__ - Step 25421: {'lr': 0.00046973803149283686, 'samples': 4880832, 'steps': 25420, 'loss/train': 1.7239376306533813}} 11/07/2021 00:47:50 - INFO - __main__ - Step 25425: {'lr': 0.00046972790734270745, 'samples': 4881600, 'steps': 25424, 'loss/train': 1.1467617750167847}} 11/07/2021 00:47:52 - INFO - __main__ - Step 25429: {'lr': 0.00046971778160848196, 'samples': 4882368, 'steps': 25428, 'loss/train': 1.0612508058547974}} 11/07/2021 00:47:54 - INFO - __main__ - Step 25433: {'lr': 0.00046970765429023336, 'samples': 4883136, 'steps': 25432, 'loss/train': 1.723750352859497}}} 11/07/2021 00:47:55 - INFO - __main__ - Step 25437: {'lr': 0.00046969752538803477, 'samples': 4883904, 'steps': 25436, 'loss/train': 1.1509177684783936}} 11/07/2021 00:47:58 - INFO - __main__ - Step 25441: {'lr': 0.0004696873949019591, 'samples': 4884672, 'steps': 25440, 'loss/train': 1.9094892740249634}}} 11/07/2021 00:47:58 - INFO - __main__ - Step 25441: {'lr': 0.0004696873949019591, 'samples': 4884672, 'steps': 25440, 'loss/train': 1.9094892740249634}}} 11/07/2021 00:48:01 - INFO - __main__ - Step 25449: {'lr': 0.00046966712917846887, 'samples': 4886208, 'steps': 25448, 'loss/train': 1.662883996963501}}} 11/07/2021 00:48:03 - INFO - __main__ - Step 25453: {'lr': 0.00046965699394120033, 'samples': 4886976, 'steps': 25452, 'loss/train': 1.3922358751296997}} 11/07/2021 00:48:06 - INFO - __main__ - Step 25458: {'lr': 0.00046964432266770713, 'samples': 4887936, 'steps': 25457, 'loss/train': 2.6210789680480957}} 11/07/2021 00:48:08 - INFO - __main__ - Step 25463: {'lr': 0.0004696316489200053, 'samples': 4888896, 'steps': 25462, 'loss/train': 1.4440584182739258}}} 11/07/2021 00:48:10 - INFO - __main__ - Step 25467: {'lr': 0.00046962150814050963, 'samples': 4889664, 'steps': 25466, 'loss/train': 1.4495553970336914}} 11/07/2021 00:48:12 - INFO - __main__ - Step 25471: {'lr': 0.000469611365777685, 'samples': 4890432, 'steps': 25470, 'loss/train': 1.69800865650177}914}} 11/07/2021 00:48:14 - INFO - __main__ - Step 25475: {'lr': 0.00046960122183160446, 'samples': 4891200, 'steps': 25474, 'loss/train': 1.417409062385559}}} 11/07/2021 00:48:16 - INFO - __main__ - Step 25479: {'lr': 0.0004695910763023412, 'samples': 4891968, 'steps': 25478, 'loss/train': 1.392314076423645}}}} 11/07/2021 00:48:16 - INFO - __main__ - Step 25479: {'lr': 0.0004695910763023412, 'samples': 4891968, 'steps': 25478, 'loss/train': 1.392314076423645}}}} 11/07/2021 00:48:16 - INFO - __main__ - Step 25479: {'lr': 0.0004695910763023412, 'samples': 4891968, 'steps': 25478, 'loss/train': 1.392314076423645}}}} 11/07/2021 00:48:21 - INFO - __main__ - Step 25489: {'lr': 0.0004695657055532384, 'samples': 4893888, 'steps': 25488, 'loss/train': 1.464012622833252}}}} 11/07/2021 00:48:21 - INFO - __main__ - Step 25489: {'lr': 0.0004695657055532384, 'samples': 4893888, 'steps': 25488, 'loss/train': 1.464012622833252}}}} 11/07/2021 00:48:25 - INFO - __main__ - Step 25496: {'lr': 0.0004695479401422898, 'samples': 4895232, 'steps': 25495, 'loss/train': 1.751136302947998}}}} 11/07/2021 00:48:25 - INFO - __main__ - Step 25496: {'lr': 0.0004695479401422898, 'samples': 4895232, 'steps': 25495, 'loss/train': 1.751136302947998}}}} 11/07/2021 00:48:30 - INFO - __main__ - Step 25505: {'lr': 0.00046952509177710267, 'samples': 4896960, 'steps': 25504, 'loss/train': 1.435947299003601}}} 11/07/2021 00:48:30 - INFO - __main__ - Step 25505: {'lr': 0.00046952509177710267, 'samples': 4896960, 'steps': 25504, 'loss/train': 1.435947299003601}}} 11/07/2021 00:48:33 - INFO - __main__ - Step 25512: {'lr': 0.0004695073152871403, 'samples': 4898304, 'steps': 25511, 'loss/train': 1.6699031591415405}}} 11/07/2021 00:48:36 - INFO - __main__ - Step 25517: {'lr': 0.00046949461482708875, 'samples': 4899264, 'steps': 25516, 'loss/train': 1.4564876556396484}} 11/07/2021 00:48:36 - INFO - __main__ - Step 25517: {'lr': 0.00046949461482708875, 'samples': 4899264, 'steps': 25516, 'loss/train': 1.4564876556396484}} 11/07/2021 00:48:40 - INFO - __main__ - Step 25525: {'lr': 0.0004694742889482199, 'samples': 4900800, 'steps': 25524, 'loss/train': 1.5668319463729858}}} 11/07/2021 00:48:41 - INFO - __main__ - Step 25529: {'lr': 0.00046946412363534735, 'samples': 4901568, 'steps': 25528, 'loss/train': 1.9022449254989624}} 11/07/2021 00:48:44 - INFO - __main__ - Step 25533: {'lr': 0.00046945395674028047, 'samples': 4902336, 'steps': 25532, 'loss/train': 1.9562991857528687}} 11/07/2021 00:48:44 - INFO - __main__ - Step 25533: {'lr': 0.00046945395674028047, 'samples': 4902336, 'steps': 25532, 'loss/train': 1.9562991857528687}} 11/07/2021 00:48:47 - INFO - __main__ - Step 25541: {'lr': 0.0004694336182038567, 'samples': 4903872, 'steps': 25540, 'loss/train': 1.7164955139160156}}} 11/07/2021 00:48:49 - INFO - __main__ - Step 25545: {'lr': 0.00046942344656264657, 'samples': 4904640, 'steps': 25544, 'loss/train': 1.6937353610992432}} 11/07/2021 00:48:52 - INFO - __main__ - Step 25550: {'lr': 0.00046941072978659397, 'samples': 4905600, 'steps': 25549, 'loss/train': 1.1602411270141602}} 11/07/2021 00:48:52 - INFO - __main__ - Step 25550: {'lr': 0.00046941072978659397, 'samples': 4905600, 'steps': 25549, 'loss/train': 1.1602411270141602}} 11/07/2021 00:48:56 - INFO - __main__ - Step 25558: {'lr': 0.0004693903778040889, 'samples': 4907136, 'steps': 25557, 'loss/train': 1.0258276462554932}}} 11/07/2021 00:48:57 - INFO - __main__ - Step 25562: {'lr': 0.00046938019944030556, 'samples': 4907904, 'steps': 25561, 'loss/train': 1.0206115245819092}} 11/07/2021 00:48:59 - INFO - __main__ - Step 25566: {'lr': 0.00046937001949493294, 'samples': 4908672, 'steps': 25565, 'loss/train': 1.5839568376541138}} 11/07/2021 00:49:02 - INFO - __main__ - Step 25571: {'lr': 0.000469357292339219, 'samples': 4909632, 'steps': 25570, 'loss/train': 1.6451069116592407}8}} 11/07/2021 00:49:04 - INFO - __main__ - Step 25575: {'lr': 0.00046934710883553884, 'samples': 4910400, 'steps': 25574, 'loss/train': 1.674824595451355}}} 11/07/2021 00:49:06 - INFO - __main__ - Step 25579: {'lr': 0.00046933692375050783, 'samples': 4911168, 'steps': 25578, 'loss/train': 0.6779226064682007}} 11/07/2021 00:49:08 - INFO - __main__ - Step 25583: {'lr': 0.0004693267370841995, 'samples': 4911936, 'steps': 25582, 'loss/train': 1.6979387998580933}}} 11/07/2021 00:49:10 - INFO - __main__ - Step 25587: {'lr': 0.0004693165488366873, 'samples': 4912704, 'steps': 25586, 'loss/train': 1.040775179862976}}}} 11/07/2021 00:49:12 - INFO - __main__ - Step 25591: {'lr': 0.00046930635900804466, 'samples': 4913472, 'steps': 25590, 'loss/train': 1.1948552131652832}} 11/07/2021 00:49:14 - INFO - __main__ - Step 25596: {'lr': 0.00046929361949888857, 'samples': 4914432, 'steps': 25595, 'loss/train': 1.9493794441223145}} 11/07/2021 00:49:16 - INFO - __main__ - Step 25601: {'lr': 0.00046928087751947444, 'samples': 4915392, 'steps': 25600, 'loss/train': 1.4352083206176758}} 11/07/2021 00:49:18 - INFO - __main__ - Step 25605: {'lr': 0.0004692706821574538, 'samples': 4916160, 'steps': 25604, 'loss/train': 1.0325216054916382}}} 11/07/2021 00:49:21 - INFO - __main__ - Step 25609: {'lr': 0.00046926048521463344, 'samples': 4916928, 'steps': 25608, 'loss/train': 1.3411468267440796}} 11/07/2021 00:49:23 - INFO - __main__ - Step 25613: {'lr': 0.0004692502866910868, 'samples': 4917696, 'steps': 25612, 'loss/train': 1.538058876991272}6}} 11/07/2021 00:49:24 - INFO - __main__ - Step 25617: {'lr': 0.00046924008658688745, 'samples': 4918464, 'steps': 25616, 'loss/train': 2.023993968963623}}} 11/07/2021 00:49:26 - INFO - __main__ - Step 25621: {'lr': 0.0004692298849021088, 'samples': 4919232, 'steps': 25620, 'loss/train': 1.6011896133422852}}} 11/07/2021 00:49:28 - INFO - __main__ - Step 25626: {'lr': 0.00046921713057355817, 'samples': 4920192, 'steps': 25625, 'loss/train': 1.2167147397994995}} 11/07/2021 00:49:28 - INFO - __main__ - Step 25626: {'lr': 0.00046921713057355817, 'samples': 4920192, 'steps': 25625, 'loss/train': 1.2167147397994995}} 11/07/2021 00:49:32 - INFO - __main__ - Step 25633: {'lr': 0.00046919927036503353, 'samples': 4921536, 'steps': 25632, 'loss/train': 1.6610690355300903}} 11/07/2021 00:49:34 - INFO - __main__ - Step 25637: {'lr': 0.0004691890623586737, 'samples': 4922304, 'steps': 25636, 'loss/train': 0.921523928642273}3}} 11/07/2021 00:49:36 - INFO - __main__ - Step 25641: {'lr': 0.0004691788527721026, 'samples': 4923072, 'steps': 25640, 'loss/train': 1.4608443975448608}}} 11/07/2021 00:49:36 - INFO - __main__ - Step 25641: {'lr': 0.0004691788527721026, 'samples': 4923072, 'steps': 25640, 'loss/train': 1.4608443975448608}}} 11/07/2021 00:49:41 - INFO - __main__ - Step 25650: {'lr': 0.0004691558754250511, 'samples': 4924800, 'steps': 25649, 'loss/train': 0.6195133328437805}}} 11/07/2021 00:49:43 - INFO - __main__ - Step 25654: {'lr': 0.00046914566070330144, 'samples': 4925568, 'steps': 25653, 'loss/train': 2.113215446472168}}} 11/07/2021 00:49:44 - INFO - __main__ - Step 25658: {'lr': 0.0004691354444016534, 'samples': 4926336, 'steps': 25657, 'loss/train': 1.2985042333602905}}} 11/07/2021 00:49:46 - INFO - __main__ - Step 25662: {'lr': 0.0004691252265201805, 'samples': 4927104, 'steps': 25661, 'loss/train': 1.4861279726028442}}} 11/07/2021 00:49:46 - INFO - __main__ - Step 25662: {'lr': 0.0004691252265201805, 'samples': 4927104, 'steps': 25661, 'loss/train': 1.4861279726028442}}} 11/07/2021 00:49:51 - INFO - __main__ - Step 25671: {'lr': 0.0004691022305110138, 'samples': 4928832, 'steps': 25670, 'loss/train': 1.5334299802780151}}} 11/07/2021 00:49:52 - INFO - __main__ - Step 25675: {'lr': 0.00046909200749561914, 'samples': 4929600, 'steps': 25674, 'loss/train': 1.751390814781189}}} 11/07/2021 00:49:54 - INFO - __main__ - Step 25679: {'lr': 0.0004690817829007129, 'samples': 4930368, 'steps': 25678, 'loss/train': 1.2293751239776611}}} 11/07/2021 00:49:57 - INFO - __main__ - Step 25684: {'lr': 0.00046906899993600406, 'samples': 4931328, 'steps': 25683, 'loss/train': 1.7066028118133545}} 11/07/2021 00:49:59 - INFO - __main__ - Step 25688: {'lr': 0.00046905877178746614, 'samples': 4932096, 'steps': 25687, 'loss/train': 1.6840893030166626}} 11/07/2021 00:50:01 - INFO - __main__ - Step 25692: {'lr': 0.0004690485420596561, 'samples': 4932864, 'steps': 25691, 'loss/train': 1.3529332876205444}}} 11/07/2021 00:50:02 - INFO - __main__ - Step 25696: {'lr': 0.0004690383107526479, 'samples': 4933632, 'steps': 25695, 'loss/train': 1.5363796949386597}}} 11/07/2021 00:50:04 - INFO - __main__ - Step 25700: {'lr': 0.00046902807786651507, 'samples': 4934400, 'steps': 25699, 'loss/train': 1.112980604171753}}} 11/07/2021 00:50:07 - INFO - __main__ - Step 25705: {'lr': 0.00046901528453831764, 'samples': 4935360, 'steps': 25704, 'loss/train': 1.5190848112106323}} 11/07/2021 00:50:09 - INFO - __main__ - Step 25709: {'lr': 0.00046900504809942433, 'samples': 4936128, 'steps': 25708, 'loss/train': 0.857280969619751}}} 11/07/2021 00:50:09 - INFO - __main__ - Step 25709: {'lr': 0.00046900504809942433, 'samples': 4936128, 'steps': 25708, 'loss/train': 0.857280969619751}}} 11/07/2021 00:50:12 - INFO - __main__ - Step 25716: {'lr': 0.0004689871305322143, 'samples': 4937472, 'steps': 25715, 'loss/train': 1.7055093050003052}}} 11/07/2021 00:50:14 - INFO - __main__ - Step 25720: {'lr': 0.0004689768897515657, 'samples': 4938240, 'steps': 25719, 'loss/train': 1.4422892332077026}}} 11/07/2021 00:50:17 - INFO - __main__ - Step 25725: {'lr': 0.0004689640865557424, 'samples': 4939200, 'steps': 25724, 'loss/train': 1.4920438528060913}}} 11/07/2021 00:50:17 - INFO - __main__ - Step 25725: {'lr': 0.0004689640865557424, 'samples': 4939200, 'steps': 25724, 'loss/train': 1.4920438528060913}}} 11/07/2021 00:50:21 - INFO - __main__ - Step 25733: {'lr': 0.0004689435963120696, 'samples': 4940736, 'steps': 25732, 'loss/train': 0.15897658467292786}} 11/07/2021 00:50:22 - INFO - __main__ - Step 25737: {'lr': 0.0004689333488225337, 'samples': 4941504, 'steps': 25736, 'loss/train': 1.6334772109985352}}} 11/07/2021 00:50:24 - INFO - __main__ - Step 25741: {'lr': 0.00046892309975463, 'samples': 4942272, 'steps': 25740, 'loss/train': 1.3078798055648804}2}}} 11/07/2021 00:50:24 - INFO - __main__ - Step 25741: {'lr': 0.00046892309975463, 'samples': 4942272, 'steps': 25740, 'loss/train': 1.3078798055648804}2}}} 11/07/2021 00:50:24 - INFO - __main__ - Step 25741: {'lr': 0.00046892309975463, 'samples': 4942272, 'steps': 25740, 'loss/train': 1.3078798055648804}2}}} 11/07/2021 00:50:31 - INFO - __main__ - Step 25752: {'lr': 0.00046889490668003896, 'samples': 4944384, 'steps': 25751, 'loss/train': 1.8758376836776733}} 11/07/2021 00:50:31 - INFO - __main__ - Step 25752: {'lr': 0.00046889490668003896, 'samples': 4944384, 'steps': 25751, 'loss/train': 1.8758376836776733}} 11/07/2021 00:50:31 - INFO - __main__ - Step 25752: {'lr': 0.00046889490668003896, 'samples': 4944384, 'steps': 25751, 'loss/train': 1.8758376836776733}} 11/07/2021 00:50:36 - INFO - __main__ - Step 25763: {'lr': 0.00046886670167113734, 'samples': 4946496, 'steps': 25762, 'loss/train': 1.8013571500778198}} 11/07/2021 00:50:39 - INFO - __main__ - Step 25768: {'lr': 0.00046885387726773494, 'samples': 4947456, 'steps': 25767, 'loss/train': 0.9453241229057312}} 11/07/2021 00:50:39 - INFO - __main__ - Step 25768: {'lr': 0.00046885387726773494, 'samples': 4947456, 'steps': 25767, 'loss/train': 0.9453241229057312}} 11/07/2021 00:50:42 - INFO - __main__ - Step 25775: {'lr': 0.0004688359189612923, 'samples': 4948800, 'steps': 25774, 'loss/train': 1.6210196018218994}}} 11/07/2021 00:50:44 - INFO - __main__ - Step 25779: {'lr': 0.00046882565490258125, 'samples': 4949568, 'steps': 25778, 'loss/train': 1.9911561012268066}} 11/07/2021 00:50:46 - INFO - __main__ - Step 25784: {'lr': 0.000468812822610713, 'samples': 4950528, 'steps': 25783, 'loss/train': 1.4683573246002197}6}} 11/07/2021 00:50:49 - INFO - __main__ - Step 25789: {'lr': 0.0004687999878540028, 'samples': 4951488, 'steps': 25788, 'loss/train': 1.8325769901275635}}} 11/07/2021 00:50:49 - INFO - __main__ - Step 25789: {'lr': 0.0004687999878540028, 'samples': 4951488, 'steps': 25788, 'loss/train': 1.8325769901275635}}} 11/07/2021 00:50:53 - INFO - __main__ - Step 25796: {'lr': 0.00046878201505394913, 'samples': 4952832, 'steps': 25795, 'loss/train': 1.5337727069854736}} 11/07/2021 00:50:54 - INFO - __main__ - Step 25800: {'lr': 0.00046877174271370894, 'samples': 4953600, 'steps': 25799, 'loss/train': 1.3573501110076904}} 11/07/2021 00:50:56 - INFO - __main__ - Step 25804: {'lr': 0.0004687614687962659, 'samples': 4954368, 'steps': 25803, 'loss/train': 1.5000717639923096}}} 11/07/2021 00:50:59 - INFO - __main__ - Step 25809: {'lr': 0.00046874862418163363, 'samples': 4955328, 'steps': 25808, 'loss/train': 1.6389553546905518}} 11/07/2021 00:51:01 - INFO - __main__ - Step 25813: {'lr': 0.0004687383467157553, 'samples': 4956096, 'steps': 25812, 'loss/train': 1.514067530632019}8}} 11/07/2021 00:51:02 - INFO - __main__ - Step 25817: {'lr': 0.000468728067672915, 'samples': 4956864, 'steps': 25816, 'loss/train': 1.031846284866333}}8}} 11/07/2021 00:51:04 - INFO - __main__ - Step 25821: {'lr': 0.00046871778705318673, 'samples': 4957632, 'steps': 25820, 'loss/train': 1.3064281940460205}} 11/07/2021 00:51:07 - INFO - __main__ - Step 25826: {'lr': 0.00046870493406114084, 'samples': 4958592, 'steps': 25825, 'loss/train': 1.2004915475845337}} 11/07/2021 00:51:09 - INFO - __main__ - Step 25830: {'lr': 0.0004686946498936859, 'samples': 4959360, 'steps': 25829, 'loss/train': 1.9060460329055786}}} 11/07/2021 00:51:09 - INFO - __main__ - Step 25830: {'lr': 0.0004686946498936859, 'samples': 4959360, 'steps': 25829, 'loss/train': 1.9060460329055786}}} 11/07/2021 00:51:12 - INFO - __main__ - Step 25837: {'lr': 0.00046867664880687775, 'samples': 4960704, 'steps': 25836, 'loss/train': 2.374248743057251}}} 11/07/2021 00:51:15 - INFO - __main__ - Step 25842: {'lr': 0.00046866378793173616, 'samples': 4961664, 'steps': 25841, 'loss/train': 1.5353429317474365}} 11/07/2021 00:51:15 - INFO - __main__ - Step 25842: {'lr': 0.00046866378793173616, 'samples': 4961664, 'steps': 25841, 'loss/train': 1.5353429317474365}} 11/07/2021 00:51:19 - INFO - __main__ - Step 25850: {'lr': 0.0004686432054081904, 'samples': 4963200, 'steps': 25849, 'loss/train': 1.765515685081482}5}} 11/07/2021 00:51:21 - INFO - __main__ - Step 25854: {'lr': 0.00046863291178196625, 'samples': 4963968, 'steps': 25853, 'loss/train': 1.1156255006790161}} 11/07/2021 00:51:22 - INFO - __main__ - Step 25858: {'lr': 0.00046862261657954033, 'samples': 4964736, 'steps': 25857, 'loss/train': 1.3477070331573486}} 11/07/2021 00:51:24 - INFO - __main__ - Step 25862: {'lr': 0.0004686123198009867, 'samples': 4965504, 'steps': 25861, 'loss/train': 1.3693437576293945}}} 11/07/2021 00:51:27 - INFO - __main__ - Step 25867: {'lr': 0.00046859944661147837, 'samples': 4966464, 'steps': 25866, 'loss/train': 1.6364786624908447}} 11/07/2021 00:51:29 - INFO - __main__ - Step 25871: {'lr': 0.00046858914628690896, 'samples': 4967232, 'steps': 25870, 'loss/train': 1.5514227151870728}} 11/07/2021 00:51:31 - INFO - __main__ - Step 25875: {'lr': 0.00046857884438645327, 'samples': 4968000, 'steps': 25874, 'loss/train': 1.4419405460357666}} 11/07/2021 00:51:32 - INFO - __main__ - Step 25879: {'lr': 0.0004685685409101855, 'samples': 4968768, 'steps': 25878, 'loss/train': 1.3810776472091675}}} 11/07/2021 00:51:34 - INFO - __main__ - Step 25883: {'lr': 0.00046855823585818004, 'samples': 4969536, 'steps': 25882, 'loss/train': 0.21515725553035736} 11/07/2021 00:51:37 - INFO - __main__ - Step 25888: {'lr': 0.00046854535232740505, 'samples': 4970496, 'steps': 25887, 'loss/train': 1.6409744024276733}} 11/07/2021 00:51:39 - INFO - __main__ - Step 25892: {'lr': 0.00046853504373026107, 'samples': 4971264, 'steps': 25891, 'loss/train': 1.618612289428711}}} 11/07/2021 00:51:41 - INFO - __main__ - Step 25896: {'lr': 0.0004685247335576209, 'samples': 4972032, 'steps': 25895, 'loss/train': 0.9556924104690552}}} 11/07/2021 00:51:43 - INFO - __main__ - Step 25900: {'lr': 0.0004685144218095587, 'samples': 4972800, 'steps': 25899, 'loss/train': 1.3517341613769531}}} 11/07/2021 00:51:45 - INFO - __main__ - Step 25904: {'lr': 0.0004685041084861489, 'samples': 4973568, 'steps': 25903, 'loss/train': 1.6833339929580688}}} 11/07/2021 00:51:47 - INFO - __main__ - Step 25909: {'lr': 0.00046849121461666734, 'samples': 4974528, 'steps': 25908, 'loss/train': 1.144979476928711}}} 11/07/2021 00:51:49 - INFO - __main__ - Step 25913: {'lr': 0.0004684808977489973, 'samples': 4975296, 'steps': 25912, 'loss/train': 1.625329613685608}}}} 11/07/2021 00:51:49 - INFO - __main__ - Step 25913: {'lr': 0.0004684808977489973, 'samples': 4975296, 'steps': 25912, 'loss/train': 1.625329613685608}}}} 11/07/2021 00:51:53 - INFO - __main__ - Step 25920: {'lr': 0.00046846283944052073, 'samples': 4976640, 'steps': 25919, 'loss/train': 1.417981743812561}}} 11/07/2021 00:51:55 - INFO - __main__ - Step 25924: {'lr': 0.00046845251824148825, 'samples': 4977408, 'steps': 25923, 'loss/train': 1.748632788658142}}} 11/07/2021 00:51:57 - INFO - __main__ - Step 25930: {'lr': 0.00046843703349002286, 'samples': 4978560, 'steps': 25929, 'loss/train': 1.4150608777999878}} 11/07/2021 00:51:57 - INFO - __main__ - Step 25930: {'lr': 0.00046843703349002286, 'samples': 4978560, 'steps': 25929, 'loss/train': 1.4150608777999878}} 11/07/2021 00:52:01 - INFO - __main__ - Step 25937: {'lr': 0.000468418963468356, 'samples': 4979904, 'steps': 25936, 'loss/train': 1.8304779529571533}8}} 11/07/2021 00:52:03 - INFO - __main__ - Step 25941: {'lr': 0.0004684086355765069, 'samples': 4980672, 'steps': 25940, 'loss/train': 1.4727085828781128}}} 11/07/2021 00:52:05 - INFO - __main__ - Step 25945: {'lr': 0.00046839830611007297, 'samples': 4981440, 'steps': 25944, 'loss/train': 1.7420538663864136}} 11/07/2021 00:52:07 - INFO - __main__ - Step 25950: {'lr': 0.00046838539206288366, 'samples': 4982400, 'steps': 25949, 'loss/train': 1.9250874519348145}} 11/07/2021 00:52:07 - INFO - __main__ - Step 25950: {'lr': 0.00046838539206288366, 'samples': 4982400, 'steps': 25949, 'loss/train': 1.9250874519348145}} 11/07/2021 00:52:11 - INFO - __main__ - Step 25957: {'lr': 0.00046836730826400565, 'samples': 4983744, 'steps': 25956, 'loss/train': 2.7427055835723877}} 11/07/2021 00:52:13 - INFO - __main__ - Step 25961: {'lr': 0.0004683569724999765, 'samples': 4984512, 'steps': 25960, 'loss/train': 1.4902448654174805}}} 11/07/2021 00:52:15 - INFO - __main__ - Step 25966: {'lr': 0.00046834405058121244, 'samples': 4985472, 'steps': 25965, 'loss/train': 1.5850639343261719}} 11/07/2021 00:52:15 - INFO - __main__ - Step 25966: {'lr': 0.00046834405058121244, 'samples': 4985472, 'steps': 25965, 'loss/train': 1.5850639343261719}} 11/07/2021 00:52:19 - INFO - __main__ - Step 25974: {'lr': 0.0004683233703953626, 'samples': 4987008, 'steps': 25973, 'loss/train': 1.5884449481964111}}} 11/07/2021 00:52:21 - INFO - __main__ - Step 25978: {'lr': 0.00046831302794144504, 'samples': 4987776, 'steps': 25977, 'loss/train': 1.1016311645507812}} 11/07/2021 00:52:23 - INFO - __main__ - Step 25982: {'lr': 0.00046830268391363176, 'samples': 4988544, 'steps': 25981, 'loss/train': 1.5783252716064453}} 11/07/2021 00:52:25 - INFO - __main__ - Step 25986: {'lr': 0.0004682923383119973, 'samples': 4989312, 'steps': 25985, 'loss/train': 1.7473276853561401}}} 11/07/2021 00:52:27 - INFO - __main__ - Step 25991: {'lr': 0.0004682794040968819, 'samples': 4990272, 'steps': 25990, 'loss/train': 1.1837917566299438}}} 11/07/2021 00:52:30 - INFO - __main__ - Step 25995: {'lr': 0.00046826905495442263, 'samples': 4991040, 'steps': 25994, 'loss/train': 1.8412169218063354}} 11/07/2021 00:52:32 - INFO - __main__ - Step 25999: {'lr': 0.00046825870423838466, 'samples': 4991808, 'steps': 25998, 'loss/train': 1.5395135879516602}} 11/07/2021 00:52:33 - INFO - __main__ - Step 26003: {'lr': 0.00046824835194884273, 'samples': 4992576, 'steps': 26002, 'loss/train': 1.9249521493911743}} 11/07/2021 00:52:35 - INFO - __main__ - Step 26007: {'lr': 0.00046823799808587126, 'samples': 4993344, 'steps': 26006, 'loss/train': 2.0492119789123535}} 11/07/2021 00:52:35 - INFO - __main__ - Step 26007: {'lr': 0.00046823799808587126, 'samples': 4993344, 'steps': 26006, 'loss/train': 2.0492119789123535}} 11/07/2021 00:52:39 - INFO - __main__ - Step 26015: {'lr': 0.00046821728563993867, 'samples': 4994880, 'steps': 26014, 'loss/train': 1.8622995615005493}} 11/07/2021 00:52:41 - INFO - __main__ - Step 26019: {'lr': 0.00046820692705712685, 'samples': 4995648, 'steps': 26018, 'loss/train': 2.186852216720581}}} 11/07/2021 00:52:43 - INFO - __main__ - Step 26023: {'lr': 0.00046819656690118424, 'samples': 4996416, 'steps': 26022, 'loss/train': 1.6353843212127686}} 11/07/2021 00:52:45 - INFO - __main__ - Step 26027: {'lr': 0.00046818620517218544, 'samples': 4997184, 'steps': 26026, 'loss/train': 1.5888382196426392}} 11/07/2021 00:52:47 - INFO - __main__ - Step 26032: {'lr': 0.0004681732507989408, 'samples': 4998144, 'steps': 26031, 'loss/train': 1.2013065814971924}}} 11/07/2021 00:52:47 - INFO - __main__ - Step 26032: {'lr': 0.0004681732507989408, 'samples': 4998144, 'steps': 26031, 'loss/train': 1.2013065814971924}}} 11/07/2021 00:52:50 - INFO - __main__ - Step 26039: {'lr': 0.0004681551105475999, 'samples': 4999488, 'steps': 26038, 'loss/train': 1.6333225965499878}}} 11/07/2021 00:52:52 - INFO - __main__ - Step 26043: {'lr': 0.0004681447425271239, 'samples': 5000256, 'steps': 26042, 'loss/train': 1.1876591444015503}}} 11/07/2021 00:52:55 - INFO - __main__ - Step 26048: {'lr': 0.000468131780289953, 'samples': 5001216, 'steps': 26047, 'loss/train': 1.707362174987793}3}}} 11/07/2021 00:52:57 - INFO - __main__ - Step 26052: {'lr': 0.00046812140873104657, 'samples': 5001984, 'steps': 26051, 'loss/train': 0.976646363735199}}} 11/07/2021 00:52:59 - INFO - __main__ - Step 26056: {'lr': 0.00046811103559962585, 'samples': 5002752, 'steps': 26055, 'loss/train': 1.3633968830108643}} 11/07/2021 00:53:01 - INFO - __main__ - Step 26060: {'lr': 0.00046810066089576573, 'samples': 5003520, 'steps': 26059, 'loss/train': 1.6209688186645508}} 11/07/2021 00:53:03 - INFO - __main__ - Step 26065: {'lr': 0.00046808769030481153, 'samples': 5004480, 'steps': 26064, 'loss/train': 1.38559091091156}8}} 11/07/2021 00:53:05 - INFO - __main__ - Step 26069: {'lr': 0.00046807731206323605, 'samples': 5005248, 'steps': 26068, 'loss/train': 1.3988279104232788}} 11/07/2021 00:53:05 - INFO - __main__ - Step 26069: {'lr': 0.00046807731206323605, 'samples': 5005248, 'steps': 26068, 'loss/train': 1.3988279104232788}} 11/07/2021 00:53:09 - INFO - __main__ - Step 26076: {'lr': 0.00046805914635742656, 'samples': 5006592, 'steps': 26075, 'loss/train': 1.3818391561508179}} 11/07/2021 00:53:11 - INFO - __main__ - Step 26080: {'lr': 0.0004680487637924912, 'samples': 5007360, 'steps': 26079, 'loss/train': 1.373033046722412}9}} 11/07/2021 00:53:13 - INFO - __main__ - Step 26085: {'lr': 0.00046803578337571917, 'samples': 5008320, 'steps': 26084, 'loss/train': 1.5039665699005127}} 11/07/2021 00:53:16 - INFO - __main__ - Step 26089: {'lr': 0.00046802539727391033, 'samples': 5009088, 'steps': 26088, 'loss/train': 1.5393263101577759}} 11/07/2021 00:53:18 - INFO - __main__ - Step 26093: {'lr': 0.00046801500960027957, 'samples': 5009856, 'steps': 26092, 'loss/train': 1.4452115297317505}} 11/07/2021 00:53:19 - INFO - __main__ - Step 26097: {'lr': 0.00046800462035490156, 'samples': 5010624, 'steps': 26096, 'loss/train': 1.4790648221969604}} 11/07/2021 00:53:21 - INFO - __main__ - Step 26101: {'lr': 0.00046799422953785124, 'samples': 5011392, 'steps': 26100, 'loss/train': 1.5862019062042236}} 11/07/2021 00:53:23 - INFO - __main__ - Step 26106: {'lr': 0.00046798123880648833, 'samples': 5012352, 'steps': 26105, 'loss/train': 1.7747282981872559}} 11/07/2021 00:53:26 - INFO - __main__ - Step 26110: {'lr': 0.0004679708444534493, 'samples': 5013120, 'steps': 26109, 'loss/train': 0.920870304107666}9}} 11/07/2021 00:53:28 - INFO - __main__ - Step 26114: {'lr': 0.00046796044852898144, 'samples': 5013888, 'steps': 26113, 'loss/train': 1.8528300523757935}} 11/07/2021 00:53:28 - INFO - __main__ - Step 26114: {'lr': 0.00046796044852898144, 'samples': 5013888, 'steps': 26113, 'loss/train': 1.8528300523757935}} 11/07/2021 00:53:31 - INFO - __main__ - Step 26121: {'lr': 0.00046794225188013773, 'samples': 5015232, 'steps': 26120, 'loss/train': 1.3868657350540161}} 11/07/2021 00:53:33 - INFO - __main__ - Step 26127: {'lr': 0.0004679266509226869, 'samples': 5016384, 'steps': 26126, 'loss/train': 1.6026920080184937}}} 11/07/2021 00:53:36 - INFO - __main__ - Step 26131: {'lr': 0.00046791624832048307, 'samples': 5017152, 'steps': 26130, 'loss/train': 1.457753300666809}}} 11/07/2021 00:53:38 - INFO - __main__ - Step 26135: {'lr': 0.00046790584414724404, 'samples': 5017920, 'steps': 26134, 'loss/train': 1.0392847061157227}} 11/07/2021 00:53:40 - INFO - __main__ - Step 26139: {'lr': 0.0004678954384030448, 'samples': 5018688, 'steps': 26138, 'loss/train': 1.807644248008728}7}} 11/07/2021 00:53:41 - INFO - __main__ - Step 26143: {'lr': 0.0004678850310879604, 'samples': 5019456, 'steps': 26142, 'loss/train': 1.6074074506759644}}} 11/07/2021 00:53:44 - INFO - __main__ - Step 26148: {'lr': 0.00046787201973516195, 'samples': 5020416, 'steps': 26147, 'loss/train': 1.5438616275787354}} 11/07/2021 00:53:46 - INFO - __main__ - Step 26152: {'lr': 0.0004678616088858603, 'samples': 5021184, 'steps': 26151, 'loss/train': 1.5728057622909546}}} 11/07/2021 00:53:46 - INFO - __main__ - Step 26152: {'lr': 0.0004678616088858603, 'samples': 5021184, 'steps': 26151, 'loss/train': 1.5728057622909546}}} 11/07/2021 00:53:49 - INFO - __main__ - Step 26159: {'lr': 0.0004678433861202721, 'samples': 5022528, 'steps': 26158, 'loss/train': 1.1330708265304565}}} 11/07/2021 00:53:51 - INFO - __main__ - Step 26164: {'lr': 0.0004678303669144081, 'samples': 5023488, 'steps': 26163, 'loss/train': 1.6530953645706177}}} 11/07/2021 00:53:53 - INFO - __main__ - Step 26168: {'lr': 0.0004678199497829919, 'samples': 5024256, 'steps': 26167, 'loss/train': 1.3655349016189575}}} 11/07/2021 00:53:53 - INFO - __main__ - Step 26168: {'lr': 0.0004678199497829919, 'samples': 5024256, 'steps': 26167, 'loss/train': 1.3655349016189575}}} 11/07/2021 00:53:53 - INFO - __main__ - Step 26168: {'lr': 0.0004678199497829919, 'samples': 5024256, 'steps': 26167, 'loss/train': 1.3655349016189575}}} 11/07/2021 00:53:59 - INFO - __main__ - Step 26179: {'lr': 0.0004677912945747527, 'samples': 5026368, 'steps': 26178, 'loss/train': 1.700212001800537}}}} 11/07/2021 00:54:02 - INFO - __main__ - Step 26184: {'lr': 0.0004677782655546687, 'samples': 5027328, 'steps': 26183, 'loss/train': 1.2588014602661133}}} 11/07/2021 00:54:02 - INFO - __main__ - Step 26184: {'lr': 0.0004677782655546687, 'samples': 5027328, 'steps': 26183, 'loss/train': 1.2588014602661133}}} 11/07/2021 00:54:06 - INFO - __main__ - Step 26192: {'lr': 0.0004677574140199642, 'samples': 5028864, 'steps': 26191, 'loss/train': 1.589673638343811}}}} 11/07/2021 00:54:07 - INFO - __main__ - Step 26196: {'lr': 0.0004677469858977391, 'samples': 5029632, 'steps': 26195, 'loss/train': 2.0205130577087402}}} 11/07/2021 00:54:09 - INFO - __main__ - Step 26200: {'lr': 0.00046773655620569924, 'samples': 5030400, 'steps': 26199, 'loss/train': 1.1757603883743286}} 11/07/2021 00:54:12 - INFO - __main__ - Step 26205: {'lr': 0.0004677235168832117, 'samples': 5031360, 'steps': 26204, 'loss/train': 1.6033073663711548}}} 11/07/2021 00:54:12 - INFO - __main__ - Step 26205: {'lr': 0.0004677235168832117, 'samples': 5031360, 'steps': 26204, 'loss/train': 1.6033073663711548}}} 11/07/2021 00:54:16 - INFO - __main__ - Step 26213: {'lr': 0.0004677026488659441, 'samples': 5032896, 'steps': 26212, 'loss/train': 1.433125376701355}}}} 11/07/2021 00:54:17 - INFO - __main__ - Step 26217: {'lr': 0.00046769221250302984, 'samples': 5033664, 'steps': 26216, 'loss/train': 1.13893723487854}}}} 11/07/2021 00:54:19 - INFO - __main__ - Step 26221: {'lr': 0.0004676817745706955, 'samples': 5034432, 'steps': 26220, 'loss/train': 1.638798475265503}}}} 11/07/2021 00:54:22 - INFO - __main__ - Step 26226: {'lr': 0.0004676687249483953, 'samples': 5035392, 'steps': 26225, 'loss/train': 1.5932910442352295}}} 11/07/2021 00:54:24 - INFO - __main__ - Step 26230: {'lr': 0.0004676582834851411, 'samples': 5036160, 'steps': 26229, 'loss/train': 1.6839022636413574}}} 11/07/2021 00:54:26 - INFO - __main__ - Step 26234: {'lr': 0.00046764784045271146, 'samples': 5036928, 'steps': 26233, 'loss/train': 1.1304795742034912}} 11/07/2021 00:54:27 - INFO - __main__ - Step 26238: {'lr': 0.0004676373958511817, 'samples': 5037696, 'steps': 26237, 'loss/train': 1.7899938821792603}}} 11/07/2021 00:54:30 - INFO - __main__ - Step 26242: {'lr': 0.00046762694968062706, 'samples': 5038464, 'steps': 26241, 'loss/train': 1.24952232837677}}}} 11/07/2021 00:54:32 - INFO - __main__ - Step 26247: {'lr': 0.00046761388976110737, 'samples': 5039424, 'steps': 26246, 'loss/train': 1.0271192789077759}} 11/07/2021 00:54:32 - INFO - __main__ - Step 26247: {'lr': 0.00046761388976110737, 'samples': 5039424, 'steps': 26246, 'loss/train': 1.0271192789077759}} 11/07/2021 00:54:36 - INFO - __main__ - Step 26255: {'lr': 0.0004675929887911571, 'samples': 5040960, 'steps': 26254, 'loss/train': 1.548449993133545}9}} 11/07/2021 00:54:38 - INFO - __main__ - Step 26259: {'lr': 0.0004675825359530872, 'samples': 5041728, 'steps': 26258, 'loss/train': 1.4623467922210693}}} 11/07/2021 00:54:40 - INFO - __main__ - Step 26263: {'lr': 0.0004675720815463881, 'samples': 5042496, 'steps': 26262, 'loss/train': 1.4288592338562012}}} 11/07/2021 00:54:42 - INFO - __main__ - Step 26267: {'lr': 0.0004675616255711349, 'samples': 5043264, 'steps': 26266, 'loss/train': 1.8957056999206543}}} 11/07/2021 00:54:44 - INFO - __main__ - Step 26272: {'lr': 0.00046754855339640436, 'samples': 5044224, 'steps': 26271, 'loss/train': 1.261167287826538}}} 11/07/2021 00:54:44 - INFO - __main__ - Step 26272: {'lr': 0.00046754855339640436, 'samples': 5044224, 'steps': 26271, 'loss/train': 1.261167287826538}}} 11/07/2021 00:54:47 - INFO - __main__ - Step 26279: {'lr': 0.0004675302482348056, 'samples': 5045568, 'steps': 26278, 'loss/train': 1.286010980606079}}}} 11/07/2021 00:54:49 - INFO - __main__ - Step 26283: {'lr': 0.00046751978598609056, 'samples': 5046336, 'steps': 26282, 'loss/train': 1.43919837474823}}}} 11/07/2021 00:54:52 - INFO - __main__ - Step 26288: {'lr': 0.0004675067059699567, 'samples': 5047296, 'steps': 26287, 'loss/train': 1.8886771202087402}}} 11/07/2021 00:54:54 - INFO - __main__ - Step 26293: {'lr': 0.0004674936235036938, 'samples': 5048256, 'steps': 26292, 'loss/train': 1.5952370166778564}}} 11/07/2021 00:54:56 - INFO - __main__ - Step 26297: {'lr': 0.00046748315576668946, 'samples': 5049024, 'steps': 26296, 'loss/train': 1.6581768989562988}} 11/07/2021 00:54:58 - INFO - __main__ - Step 26301: {'lr': 0.0004674726864617723, 'samples': 5049792, 'steps': 26300, 'loss/train': 1.4105170965194702}}} 11/07/2021 00:55:01 - INFO - __main__ - Step 26305: {'lr': 0.0004674622155890178, 'samples': 5050560, 'steps': 26304, 'loss/train': 1.6631202697753906}}} 11/07/2021 00:55:02 - INFO - __main__ - Step 26309: {'lr': 0.00046745174314850136, 'samples': 5051328, 'steps': 26308, 'loss/train': 0.5985563397407532}} 11/07/2021 00:55:04 - INFO - __main__ - Step 26313: {'lr': 0.0004674412691402985, 'samples': 5052096, 'steps': 26312, 'loss/train': 1.536624789237976}2}} 11/07/2021 00:55:06 - INFO - __main__ - Step 26317: {'lr': 0.00046743079356448476, 'samples': 5052864, 'steps': 26316, 'loss/train': 1.2478556632995605}} 11/07/2021 00:55:06 - INFO - __main__ - Step 26317: {'lr': 0.00046743079356448476, 'samples': 5052864, 'steps': 26316, 'loss/train': 1.2478556632995605}} 11/07/2021 00:55:10 - INFO - __main__ - Step 26324: {'lr': 0.0004674124575349742, 'samples': 5054208, 'steps': 26323, 'loss/train': 1.8238298892974854}}} 11/07/2021 00:55:10 - INFO - __main__ - Step 26324: {'lr': 0.0004674124575349742, 'samples': 5054208, 'steps': 26323, 'loss/train': 1.8238298892974854}}} 11/07/2021 00:55:13 - INFO - __main__ - Step 26332: {'lr': 0.0004673914961949381, 'samples': 5055744, 'steps': 26331, 'loss/train': 1.1333625316619873}}} 11/07/2021 00:55:15 - INFO - __main__ - Step 26336: {'lr': 0.00046738101317400415, 'samples': 5056512, 'steps': 26335, 'loss/train': 1.6568629741668701}} 11/07/2021 00:55:18 - INFO - __main__ - Step 26341: {'lr': 0.00046736790719400373, 'samples': 5057472, 'steps': 26340, 'loss/train': 1.307187557220459}}} 11/07/2021 00:55:20 - INFO - __main__ - Step 26345: {'lr': 0.00046735742064702904, 'samples': 5058240, 'steps': 26344, 'loss/train': 1.4895724058151245}} 11/07/2021 00:55:20 - INFO - __main__ - Step 26345: {'lr': 0.00046735742064702904, 'samples': 5058240, 'steps': 26344, 'loss/train': 1.4895724058151245}} 11/07/2021 00:55:23 - INFO - __main__ - Step 26352: {'lr': 0.00046733906541925963, 'samples': 5059584, 'steps': 26351, 'loss/train': 1.4118645191192627}} 11/07/2021 00:55:25 - INFO - __main__ - Step 26356: {'lr': 0.0004673285745631993, 'samples': 5060352, 'steps': 26355, 'loss/train': 1.95510995388031}27}} 11/07/2021 00:55:25 - INFO - __main__ - Step 26356: {'lr': 0.0004673285745631993, 'samples': 5060352, 'steps': 26355, 'loss/train': 1.95510995388031}27}} 11/07/2021 00:55:30 - INFO - __main__ - Step 26365: {'lr': 0.0004673049644085721, 'samples': 5062080, 'steps': 26364, 'loss/train': 1.914736032485962}7}} 11/07/2021 00:55:32 - INFO - __main__ - Step 26369: {'lr': 0.0004672944684606934, 'samples': 5062848, 'steps': 26368, 'loss/train': 1.4223899841308594}}} 11/07/2021 00:55:34 - INFO - __main__ - Step 26373: {'lr': 0.00046728397094626217, 'samples': 5063616, 'steps': 26372, 'loss/train': 1.6166425943374634}} 11/07/2021 00:55:36 - INFO - __main__ - Step 26378: {'lr': 0.00046727084685037394, 'samples': 5064576, 'steps': 26377, 'loss/train': 1.7713539600372314}} 11/07/2021 00:55:38 - INFO - __main__ - Step 26382: {'lr': 0.00046726034581147624, 'samples': 5065344, 'steps': 26381, 'loss/train': 1.748800277709961}}} 11/07/2021 00:55:38 - INFO - __main__ - Step 26382: {'lr': 0.00046726034581147624, 'samples': 5065344, 'steps': 26381, 'loss/train': 1.748800277709961}}} 11/07/2021 00:55:42 - INFO - __main__ - Step 26389: {'lr': 0.00046724196522452565, 'samples': 5066688, 'steps': 26388, 'loss/train': 0.40509405732154846} 11/07/2021 00:55:44 - INFO - __main__ - Step 26394: {'lr': 0.00046722883329724667, 'samples': 5067648, 'steps': 26393, 'loss/train': 1.5836986303329468}} 11/07/2021 00:55:46 - INFO - __main__ - Step 26399: {'lr': 0.00046721569892296875, 'samples': 5068608, 'steps': 26398, 'loss/train': 1.6494786739349365}} 11/07/2021 00:55:46 - INFO - __main__ - Step 26399: {'lr': 0.00046721569892296875, 'samples': 5068608, 'steps': 26398, 'loss/train': 1.6494786739349365}} 11/07/2021 00:55:50 - INFO - __main__ - Step 26406: {'lr': 0.00046719730668830293, 'samples': 5069952, 'steps': 26405, 'loss/train': 1.4376373291015625}} 11/07/2021 00:55:52 - INFO - __main__ - Step 26410: {'lr': 0.0004671867946868499, 'samples': 5070720, 'steps': 26409, 'loss/train': 1.9088515043258667}}} 11/07/2021 00:55:54 - INFO - __main__ - Step 26415: {'lr': 0.00046717365248316947, 'samples': 5071680, 'steps': 26414, 'loss/train': 1.0957757234573364}} 11/07/2021 00:55:56 - INFO - __main__ - Step 26420: {'lr': 0.00046716050783311166, 'samples': 5072640, 'steps': 26419, 'loss/train': 1.319366693496704}}} 11/07/2021 00:55:58 - INFO - __main__ - Step 26424: {'lr': 0.0004671499903517732, 'samples': 5073408, 'steps': 26423, 'loss/train': 1.5245121717453003}}} 11/07/2021 00:56:01 - INFO - __main__ - Step 26428: {'lr': 0.00046713947130492373, 'samples': 5074176, 'steps': 26427, 'loss/train': 1.628362774848938}}} 11/07/2021 00:56:02 - INFO - __main__ - Step 26432: {'lr': 0.00046712895069263917, 'samples': 5074944, 'steps': 26431, 'loss/train': 1.7663159370422363}} 11/07/2021 00:56:04 - INFO - __main__ - Step 26436: {'lr': 0.00046711842851499533, 'samples': 5075712, 'steps': 26435, 'loss/train': 1.3361963033676147}} 11/07/2021 00:56:04 - INFO - __main__ - Step 26436: {'lr': 0.00046711842851499533, 'samples': 5075712, 'steps': 26435, 'loss/train': 1.3361963033676147}} 11/07/2021 00:56:08 - INFO - __main__ - Step 26444: {'lr': 0.0004670973794639333, 'samples': 5077248, 'steps': 26443, 'loss/train': 1.4020780324935913}}} 11/07/2021 00:56:10 - INFO - __main__ - Step 26448: {'lr': 0.0004670868525906668, 'samples': 5078016, 'steps': 26447, 'loss/train': 2.0321922302246094}}} 11/07/2021 00:56:12 - INFO - __main__ - Step 26452: {'lr': 0.0004670763241523446, 'samples': 5078784, 'steps': 26451, 'loss/train': 1.4854100942611694}}} 11/07/2021 00:56:14 - INFO - __main__ - Step 26456: {'lr': 0.0004670657941490425, 'samples': 5079552, 'steps': 26455, 'loss/train': 1.230172038078308}}}} 11/07/2021 00:56:16 - INFO - __main__ - Step 26461: {'lr': 0.0004670526294442775, 'samples': 5080512, 'steps': 26460, 'loss/train': 1.234046459197998}}}} 11/07/2021 00:56:18 - INFO - __main__ - Step 26465: {'lr': 0.0004670420959200483, 'samples': 5081280, 'steps': 26464, 'loss/train': 1.194366693496704}}}} 11/07/2021 00:56:21 - INFO - __main__ - Step 26469: {'lr': 0.00046703156083108597, 'samples': 5082048, 'steps': 26468, 'loss/train': 1.4993257522583008}} 11/07/2021 00:56:22 - INFO - __main__ - Step 26473: {'lr': 0.0004670210241774664, 'samples': 5082816, 'steps': 26472, 'loss/train': 1.5517992973327637}}} 11/07/2021 00:56:24 - INFO - __main__ - Step 26477: {'lr': 0.00046701048595926574, 'samples': 5083584, 'steps': 26476, 'loss/train': 1.6932660341262817}} 11/07/2021 00:56:24 - INFO - __main__ - Step 26477: {'lr': 0.00046701048595926574, 'samples': 5083584, 'steps': 26476, 'loss/train': 1.6932660341262817}} 11/07/2021 00:56:24 - INFO - __main__ - Step 26477: {'lr': 0.00046701048595926574, 'samples': 5083584, 'steps': 26476, 'loss/train': 1.6932660341262817}} 11/07/2021 00:56:30 - INFO - __main__ - Step 26488: {'lr': 0.00046698149779246235, 'samples': 5085696, 'steps': 26487, 'loss/train': 1.7259870767593384}} 11/07/2021 00:56:33 - INFO - __main__ - Step 26493: {'lr': 0.00046696831744217065, 'samples': 5086656, 'steps': 26492, 'loss/train': 1.3535728454589844}} 11/07/2021 00:56:33 - INFO - __main__ - Step 26493: {'lr': 0.00046696831744217065, 'samples': 5086656, 'steps': 26492, 'loss/train': 1.3535728454589844}} 11/07/2021 00:56:36 - INFO - __main__ - Step 26500: {'lr': 0.0004669498608457674, 'samples': 5088000, 'steps': 26499, 'loss/train': 1.5733281373977661}}} 11/07/2021 00:56:38 - INFO - __main__ - Step 26505: {'lr': 0.0004669366746299707, 'samples': 5088960, 'steps': 26504, 'loss/train': 1.5311013460159302}}} 11/07/2021 00:56:38 - INFO - __main__ - Step 26505: {'lr': 0.0004669366746299707, 'samples': 5088960, 'steps': 26504, 'loss/train': 1.5311013460159302}}} 11/07/2021 00:56:43 - INFO - __main__ - Step 26513: {'lr': 0.00046691557160184516, 'samples': 5090496, 'steps': 26512, 'loss/train': 1.611854910850525}}} 11/07/2021 00:56:44 - INFO - __main__ - Step 26517: {'lr': 0.0004669050177420129, 'samples': 5091264, 'steps': 26516, 'loss/train': 1.6753724813461304}}} 11/07/2021 00:56:46 - INFO - __main__ - Step 26521: {'lr': 0.00046689446231843585, 'samples': 5092032, 'steps': 26520, 'loss/train': 5.975055694580078}}} 11/07/2021 00:56:46 - INFO - __main__ - Step 26521: {'lr': 0.00046689446231843585, 'samples': 5092032, 'steps': 26520, 'loss/train': 5.975055694580078}}} 11/07/2021 00:56:51 - INFO - __main__ - Step 26530: {'lr': 0.00046687070689833943, 'samples': 5093760, 'steps': 26529, 'loss/train': 1.699949026107788}}} 11/07/2021 00:56:53 - INFO - __main__ - Step 26534: {'lr': 0.0004668601463931172, 'samples': 5094528, 'steps': 26533, 'loss/train': 1.5847936868667603}}} 11/07/2021 00:56:54 - INFO - __main__ - Step 26538: {'lr': 0.00046684958432447355, 'samples': 5095296, 'steps': 26537, 'loss/train': 1.0727238655090332}} 11/07/2021 00:56:56 - INFO - __main__ - Step 26542: {'lr': 0.00046683902069248465, 'samples': 5096064, 'steps': 26541, 'loss/train': 1.3768486976623535}} 11/07/2021 00:56:59 - INFO - __main__ - Step 26547: {'lr': 0.0004668258139541604, 'samples': 5097024, 'steps': 26546, 'loss/train': 1.2917490005493164}}} 11/07/2021 00:57:01 - INFO - __main__ - Step 26551: {'lr': 0.00046681524680492327, 'samples': 5097792, 'steps': 26550, 'loss/train': 1.545667052268982}}} 11/07/2021 00:57:03 - INFO - __main__ - Step 26555: {'lr': 0.0004668046780925884, 'samples': 5098560, 'steps': 26554, 'loss/train': 1.1346396207809448}}} 11/07/2021 00:57:04 - INFO - __main__ - Step 26559: {'lr': 0.00046679410781723206, 'samples': 5099328, 'steps': 26558, 'loss/train': 1.526893973350525}}} 11/07/2021 00:57:06 - INFO - __main__ - Step 26563: {'lr': 0.00046678353597893053, 'samples': 5100096, 'steps': 26562, 'loss/train': 1.7865734100341797}} 11/07/2021 00:57:06 - INFO - __main__ - Step 26563: {'lr': 0.00046678353597893053, 'samples': 5100096, 'steps': 26562, 'loss/train': 1.7865734100341797}} 11/07/2021 00:57:10 - INFO - __main__ - Step 26571: {'lr': 0.0004667623876137965, 'samples': 5101632, 'steps': 26570, 'loss/train': 0.8069031238555908}}} 11/07/2021 00:57:12 - INFO - __main__ - Step 26575: {'lr': 0.0004667518110871164, 'samples': 5102400, 'steps': 26574, 'loss/train': 1.0139391422271729}}} 11/07/2021 00:57:14 - INFO - __main__ - Step 26579: {'lr': 0.00046674123299779603, 'samples': 5103168, 'steps': 26578, 'loss/train': 0.872469961643219}}} 11/07/2021 00:57:16 - INFO - __main__ - Step 26584: {'lr': 0.00046672800818879873, 'samples': 5104128, 'steps': 26583, 'loss/train': 1.5642935037612915}} 11/07/2021 00:57:19 - INFO - __main__ - Step 26589: {'lr': 0.000466714780938444, 'samples': 5105088, 'steps': 26588, 'loss/train': 1.3386892080307007}5}} 11/07/2021 00:57:19 - INFO - __main__ - Step 26589: {'lr': 0.000466714780938444, 'samples': 5105088, 'steps': 26588, 'loss/train': 1.3386892080307007}5}} 11/07/2021 00:57:22 - INFO - __main__ - Step 26596: {'lr': 0.0004666962586867507, 'samples': 5106432, 'steps': 26595, 'loss/train': 1.4996187686920166}}} 11/07/2021 00:57:24 - INFO - __main__ - Step 26600: {'lr': 0.00046668567239481994, 'samples': 5107200, 'steps': 26599, 'loss/train': 1.3034520149230957}} 11/07/2021 00:57:27 - INFO - __main__ - Step 26605: {'lr': 0.00046667243733312296, 'samples': 5108160, 'steps': 26604, 'loss/train': 1.6284083127975464}} 11/07/2021 00:57:27 - INFO - __main__ - Step 26605: {'lr': 0.00046667243733312296, 'samples': 5108160, 'steps': 26604, 'loss/train': 1.6284083127975464}} 11/07/2021 00:57:31 - INFO - __main__ - Step 26612: {'lr': 0.00046665390414635184, 'samples': 5109504, 'steps': 26611, 'loss/train': 1.829398274421692}}} 11/07/2021 00:57:32 - INFO - __main__ - Step 26616: {'lr': 0.000466643311606225, 'samples': 5110272, 'steps': 26615, 'loss/train': 1.795561671257019}2}}} 11/07/2021 00:57:34 - INFO - __main__ - Step 26620: {'lr': 0.0004666327175042401, 'samples': 5111040, 'steps': 26619, 'loss/train': 1.1487702131271362}}} 11/07/2021 00:57:37 - INFO - __main__ - Step 26625: {'lr': 0.0004666194726805122, 'samples': 5112000, 'steps': 26624, 'loss/train': 1.82273268699646}2}}} 11/07/2021 00:57:39 - INFO - __main__ - Step 26629: {'lr': 0.0004666088750646257, 'samples': 5112768, 'steps': 26628, 'loss/train': 1.5872939825057983}}} 11/07/2021 00:57:41 - INFO - __main__ - Step 26633: {'lr': 0.0004665982758871294, 'samples': 5113536, 'steps': 26632, 'loss/train': 1.1848700046539307}}} 11/07/2021 00:57:42 - INFO - __main__ - Step 26637: {'lr': 0.0004665876751480996, 'samples': 5114304, 'steps': 26636, 'loss/train': 1.8105500936508179}}} 11/07/2021 00:57:45 - INFO - __main__ - Step 26642: {'lr': 0.0004665744220285224, 'samples': 5115264, 'steps': 26641, 'loss/train': 0.37159693241119385}} 11/07/2021 00:57:47 - INFO - __main__ - Step 26646: {'lr': 0.00046656381777632173, 'samples': 5116032, 'steps': 26645, 'loss/train': 1.6319818496704102}} 11/07/2021 00:57:47 - INFO - __main__ - Step 26646: {'lr': 0.00046656381777632173, 'samples': 5116032, 'steps': 26645, 'loss/train': 1.6319818496704102}} 11/07/2021 00:57:50 - INFO - __main__ - Step 26653: {'lr': 0.00046654525657817457, 'samples': 5117376, 'steps': 26652, 'loss/train': 1.292604923248291}}} 11/07/2021 00:57:52 - INFO - __main__ - Step 26657: {'lr': 0.0004665346480326241, 'samples': 5118144, 'steps': 26656, 'loss/train': 1.6312495470046997}}} 11/07/2021 00:57:55 - INFO - __main__ - Step 26662: {'lr': 0.00046652138515543366, 'samples': 5119104, 'steps': 26661, 'loss/train': 1.6175771951675415}} 11/07/2021 00:57:57 - INFO - __main__ - Step 26666: {'lr': 0.00046651077309757256, 'samples': 5119872, 'steps': 26665, 'loss/train': 0.7705839276313782}} 11/07/2021 00:57:57 - INFO - __main__ - Step 26666: {'lr': 0.00046651077309757256, 'samples': 5119872, 'steps': 26665, 'loss/train': 0.7705839276313782}} 11/07/2021 00:58:00 - INFO - __main__ - Step 26673: {'lr': 0.00046649219824043984, 'samples': 5121216, 'steps': 26672, 'loss/train': 1.622520089149475}}} 11/07/2021 00:58:03 - INFO - __main__ - Step 26678: {'lr': 0.0004664789275588798, 'samples': 5122176, 'steps': 26677, 'loss/train': 1.651261806488037}}}} 11/07/2021 00:58:05 - INFO - __main__ - Step 26683: {'lr': 0.00046646565443876815, 'samples': 5123136, 'steps': 26682, 'loss/train': 1.6432113647460938}} 11/07/2021 00:58:07 - INFO - __main__ - Step 26687: {'lr': 0.0004664550341870222, 'samples': 5123904, 'steps': 26686, 'loss/train': 1.472825050354004}8}} 11/07/2021 00:58:09 - INFO - __main__ - Step 26691: {'lr': 0.00046644441237477544, 'samples': 5124672, 'steps': 26690, 'loss/train': 0.9981279373168945}} 11/07/2021 00:58:09 - INFO - __main__ - Step 26691: {'lr': 0.00046644441237477544, 'samples': 5124672, 'steps': 26690, 'loss/train': 0.9981279373168945}} 11/07/2021 00:58:12 - INFO - __main__ - Step 26698: {'lr': 0.0004664258204486189, 'samples': 5126016, 'steps': 26697, 'loss/train': 1.4459261894226074}}} 11/07/2021 00:58:15 - INFO - __main__ - Step 26704: {'lr': 0.00046640988070869053, 'samples': 5127168, 'steps': 26703, 'loss/train': 1.4754691123962402}} 11/07/2021 00:58:17 - INFO - __main__ - Step 26708: {'lr': 0.00046639925226517, 'samples': 5127936, 'steps': 26707, 'loss/train': 1.6880942583084106}02}} 11/07/2021 00:58:17 - INFO - __main__ - Step 26708: {'lr': 0.00046639925226517, 'samples': 5127936, 'steps': 26707, 'loss/train': 1.6880942583084106}02}} 11/07/2021 00:58:21 - INFO - __main__ - Step 26715: {'lr': 0.0004663806487350677, 'samples': 5129280, 'steps': 26714, 'loss/train': 1.3665399551391602}}} 11/07/2021 00:58:23 - INFO - __main__ - Step 26719: {'lr': 0.00046637001600146027, 'samples': 5130048, 'steps': 26718, 'loss/train': 1.4479613304138184}} 11/07/2021 00:58:25 - INFO - __main__ - Step 26723: {'lr': 0.00046635938170796505, 'samples': 5130816, 'steps': 26722, 'loss/train': 1.5450035333633423}} 11/07/2021 00:58:25 - INFO - __main__ - Step 26723: {'lr': 0.00046635938170796505, 'samples': 5130816, 'steps': 26722, 'loss/train': 1.5450035333633423}} 11/07/2021 00:58:29 - INFO - __main__ - Step 26731: {'lr': 0.0004663381084416177, 'samples': 5132352, 'steps': 26730, 'loss/train': 0.9006083607673645}}} 11/07/2021 00:58:30 - INFO - __main__ - Step 26735: {'lr': 0.000466327469468919, 'samples': 5133120, 'steps': 26734, 'loss/train': 1.420594573020935}5}}} 11/07/2021 00:58:32 - INFO - __main__ - Step 26739: {'lr': 0.0004663168289366391, 'samples': 5133888, 'steps': 26738, 'loss/train': 1.6448155641555786}}} 11/07/2021 00:58:35 - INFO - __main__ - Step 26744: {'lr': 0.0004663035260782452, 'samples': 5134848, 'steps': 26743, 'loss/train': 1.5186762809753418}}} 11/07/2021 00:58:37 - INFO - __main__ - Step 26749: {'lr': 0.00046629022078327557, 'samples': 5135808, 'steps': 26748, 'loss/train': 1.623582124710083}}} 11/07/2021 00:58:37 - INFO - __main__ - Step 26749: {'lr': 0.00046629022078327557, 'samples': 5135808, 'steps': 26748, 'loss/train': 1.623582124710083}}} 11/07/2021 00:58:40 - INFO - __main__ - Step 26756: {'lr': 0.0004662715892771561, 'samples': 5137152, 'steps': 26755, 'loss/train': 1.6319177150726318}}} 11/07/2021 00:58:43 - INFO - __main__ - Step 26760: {'lr': 0.00046626094055833426, 'samples': 5137920, 'steps': 26759, 'loss/train': 1.6341724395751953}} 11/07/2021 00:58:45 - INFO - __main__ - Step 26765: {'lr': 0.0004662476274673294, 'samples': 5138880, 'steps': 26764, 'loss/train': 0.9535731077194214}}} 11/07/2021 00:58:47 - INFO - __main__ - Step 26769: {'lr': 0.00046623697524063713, 'samples': 5139648, 'steps': 26768, 'loss/train': 1.5940784215927124}} 11/07/2021 00:58:49 - INFO - __main__ - Step 26773: {'lr': 0.0004662263214550162, 'samples': 5140416, 'steps': 26772, 'loss/train': 1.5235966444015503}}} 11/07/2021 00:58:51 - INFO - __main__ - Step 26777: {'lr': 0.0004662156661105433, 'samples': 5141184, 'steps': 26776, 'loss/train': 1.5256009101867676}}} 11/07/2021 00:58:53 - INFO - __main__ - Step 26781: {'lr': 0.0004662050092072954, 'samples': 5141952, 'steps': 26780, 'loss/train': 1.4966504573822021}}} 11/07/2021 00:58:53 - INFO - __main__ - Step 26781: {'lr': 0.0004662050092072954, 'samples': 5141952, 'steps': 26780, 'loss/train': 1.4966504573822021}}} 11/07/2021 00:58:57 - INFO - __main__ - Step 26789: {'lr': 0.00046618369072478163, 'samples': 5143488, 'steps': 26788, 'loss/train': 1.4976593255996704}} 11/07/2021 00:58:59 - INFO - __main__ - Step 26793: {'lr': 0.00046617302914566945, 'samples': 5144256, 'steps': 26792, 'loss/train': 1.5732007026672363}} 11/07/2021 00:59:01 - INFO - __main__ - Step 26797: {'lr': 0.0004661623660080896, 'samples': 5145024, 'steps': 26796, 'loss/train': 1.5374164581298828}}} 11/07/2021 00:59:03 - INFO - __main__ - Step 26802: {'lr': 0.00046614903489463667, 'samples': 5145984, 'steps': 26801, 'loss/train': 1.6035794019699097}} 11/07/2021 00:59:03 - INFO - __main__ - Step 26802: {'lr': 0.00046614903489463667, 'samples': 5145984, 'steps': 26801, 'loss/train': 1.6035794019699097}} 11/07/2021 00:59:07 - INFO - __main__ - Step 26810: {'lr': 0.00046612770004871663, 'samples': 5147520, 'steps': 26809, 'loss/train': 0.7488903403282166}} 11/07/2021 00:59:10 - INFO - __main__ - Step 26814: {'lr': 0.00046611703028850683, 'samples': 5148288, 'steps': 26813, 'loss/train': 1.94196617603302}6}} 11/07/2021 00:59:11 - INFO - __main__ - Step 26818: {'lr': 0.00046610635897023303, 'samples': 5149056, 'steps': 26817, 'loss/train': 0.9505210518836975}} 11/07/2021 00:59:13 - INFO - __main__ - Step 26823: {'lr': 0.0004660930176314805, 'samples': 5150016, 'steps': 26822, 'loss/train': 1.2598298788070679}}} 11/07/2021 00:59:13 - INFO - __main__ - Step 26823: {'lr': 0.0004660930176314805, 'samples': 5150016, 'steps': 26822, 'loss/train': 1.2598298788070679}}} 11/07/2021 00:59:18 - INFO - __main__ - Step 26830: {'lr': 0.00046607433566779713, 'samples': 5151360, 'steps': 26829, 'loss/train': 0.7690796256065369}} 11/07/2021 00:59:19 - INFO - __main__ - Step 26834: {'lr': 0.00046606365811803686, 'samples': 5152128, 'steps': 26833, 'loss/train': 1.5306406021118164}} 11/07/2021 00:59:21 - INFO - __main__ - Step 26838: {'lr': 0.0004660529790105974, 'samples': 5152896, 'steps': 26837, 'loss/train': 1.1704834699630737}}} 11/07/2021 00:59:21 - INFO - __main__ - Step 26838: {'lr': 0.0004660529790105974, 'samples': 5152896, 'steps': 26837, 'loss/train': 1.1704834699630737}}} 11/07/2021 00:59:21 - INFO - __main__ - Step 26838: {'lr': 0.0004660529790105974, 'samples': 5152896, 'steps': 26837, 'loss/train': 1.1704834699630737}}} 11/07/2021 00:59:27 - INFO - __main__ - Step 26849: {'lr': 0.00046602360343398397, 'samples': 5155008, 'steps': 26848, 'loss/train': 1.0663906335830688}} 11/07/2021 00:59:30 - INFO - __main__ - Step 26855: {'lr': 0.0004660075754279105, 'samples': 5156160, 'steps': 26854, 'loss/train': 1.432096242904663}8}} 11/07/2021 00:59:32 - INFO - __main__ - Step 26859: {'lr': 0.0004659968881439186, 'samples': 5156928, 'steps': 26858, 'loss/train': 1.651904582977295}8}} 11/07/2021 00:59:34 - INFO - __main__ - Step 26863: {'lr': 0.00046598619930272883, 'samples': 5157696, 'steps': 26862, 'loss/train': 1.7206814289093018}} 11/07/2021 00:59:34 - INFO - __main__ - Step 26863: {'lr': 0.00046598619930272883, 'samples': 5157696, 'steps': 26862, 'loss/train': 1.7206814289093018}} 11/07/2021 00:59:37 - INFO - __main__ - Step 26870: {'lr': 0.00046596749008387124, 'samples': 5159040, 'steps': 26869, 'loss/train': 1.3670252561569214}} 11/07/2021 00:59:39 - INFO - __main__ - Step 26875: {'lr': 0.00046595412343674317, 'samples': 5160000, 'steps': 26874, 'loss/train': 1.6973767280578613}} 11/07/2021 00:59:42 - INFO - __main__ - Step 26880: {'lr': 0.0004659407543569752, 'samples': 5160960, 'steps': 26879, 'loss/train': 1.7946914434432983}}} 11/07/2021 00:59:42 - INFO - __main__ - Step 26880: {'lr': 0.0004659407543569752, 'samples': 5160960, 'steps': 26879, 'loss/train': 1.7946914434432983}}} 11/07/2021 00:59:45 - INFO - __main__ - Step 26887: {'lr': 0.00046592203355875177, 'samples': 5162304, 'steps': 26886, 'loss/train': 1.3362911939620972}} 11/07/2021 00:59:47 - INFO - __main__ - Step 26891: {'lr': 0.00046591133381933546, 'samples': 5163072, 'steps': 26890, 'loss/train': 1.739495038986206}}} 11/07/2021 00:59:50 - INFO - __main__ - Step 26896: {'lr': 0.000465897956956132, 'samples': 5164032, 'steps': 26895, 'loss/train': 1.4558935165405273}}}} 11/07/2021 00:59:52 - INFO - __main__ - Step 26900: {'lr': 0.00046588725371451685, 'samples': 5164800, 'steps': 26899, 'loss/train': 1.7321528196334839}} 11/07/2021 00:59:52 - INFO - __main__ - Step 26900: {'lr': 0.00046588725371451685, 'samples': 5164800, 'steps': 26899, 'loss/train': 1.7321528196334839}} 11/07/2021 00:59:55 - INFO - __main__ - Step 26907: {'lr': 0.00046586851929663134, 'samples': 5166144, 'steps': 26906, 'loss/train': 1.7989552021026611}} 11/07/2021 00:59:57 - INFO - __main__ - Step 26911: {'lr': 0.00046585781177508137, 'samples': 5166912, 'steps': 26910, 'loss/train': 1.115211009979248}}} 11/07/2021 00:59:59 - INFO - __main__ - Step 26915: {'lr': 0.00046584710269733623, 'samples': 5167680, 'steps': 26914, 'loss/train': 1.373899221420288}}} 11/07/2021 00:59:59 - INFO - __main__ - Step 26915: {'lr': 0.00046584710269733623, 'samples': 5167680, 'steps': 26914, 'loss/train': 1.373899221420288}}} 11/07/2021 01:00:03 - INFO - __main__ - Step 26923: {'lr': 0.0004658256798735693, 'samples': 5169216, 'steps': 26922, 'loss/train': 1.5124496221542358}}} 11/07/2021 01:00:05 - INFO - __main__ - Step 26927: {'lr': 0.0004658149661277019, 'samples': 5169984, 'steps': 26926, 'loss/train': 1.4179913997650146}}} 11/07/2021 01:00:05 - INFO - __main__ - Step 26927: {'lr': 0.0004658149661277019, 'samples': 5169984, 'steps': 26926, 'loss/train': 1.4179913997650146}}} 11/07/2021 01:00:09 - INFO - __main__ - Step 26934: {'lr': 0.000465796213328629, 'samples': 5171328, 'steps': 26933, 'loss/train': 1.6522505283355713}}}} 11/07/2021 01:00:12 - INFO - __main__ - Step 26939: {'lr': 0.00046578281555509094, 'samples': 5172288, 'steps': 26938, 'loss/train': 1.3857241868972778}} 11/07/2021 01:00:14 - INFO - __main__ - Step 26943: {'lr': 0.0004657720955861419, 'samples': 5173056, 'steps': 26942, 'loss/train': 1.8198291063308716}}} 11/07/2021 01:00:16 - INFO - __main__ - Step 26947: {'lr': 0.0004657613740616157, 'samples': 5173824, 'steps': 26946, 'loss/train': 1.2829210758209229}}} 11/07/2021 01:00:18 - INFO - __main__ - Step 26951: {'lr': 0.00046575065098158945, 'samples': 5174592, 'steps': 26950, 'loss/train': 1.3539212942123413}} 11/07/2021 01:00:19 - INFO - __main__ - Step 26955: {'lr': 0.00046573992634614064, 'samples': 5175360, 'steps': 26954, 'loss/train': 1.4932307004928589}} 11/07/2021 01:00:21 - INFO - __main__ - Step 26959: {'lr': 0.0004657292001553465, 'samples': 5176128, 'steps': 26958, 'loss/train': 0.9837502837181091}}} 11/07/2021 01:00:21 - INFO - __main__ - Step 26959: {'lr': 0.0004657292001553465, 'samples': 5176128, 'steps': 26958, 'loss/train': 0.9837502837181091}}} 11/07/2021 01:00:25 - INFO - __main__ - Step 26965: {'lr': 0.00046571310795305213, 'samples': 5177280, 'steps': 26964, 'loss/train': 1.6530050039291382}} 11/07/2021 01:00:28 - INFO - __main__ - Step 26970: {'lr': 0.00046569969511154485, 'samples': 5178240, 'steps': 26969, 'loss/train': 1.4461259841918945}} 11/07/2021 01:00:30 - INFO - __main__ - Step 26974: {'lr': 0.0004656889630888946, 'samples': 5179008, 'steps': 26973, 'loss/train': 1.9120339155197144}}} 11/07/2021 01:00:31 - INFO - __main__ - Step 26978: {'lr': 0.00046567822951126646, 'samples': 5179776, 'steps': 26977, 'loss/train': 0.8509805798530579}} 11/07/2021 01:00:33 - INFO - __main__ - Step 26982: {'lr': 0.0004656674943787379, 'samples': 5180544, 'steps': 26981, 'loss/train': 1.6337497234344482}}} 11/07/2021 01:00:36 - INFO - __main__ - Step 26987: {'lr': 0.00046565407327661614, 'samples': 5181504, 'steps': 26986, 'loss/train': 1.3278738260269165}} 11/07/2021 01:00:38 - INFO - __main__ - Step 26991: {'lr': 0.0004656433346458444, 'samples': 5182272, 'steps': 26990, 'loss/train': 1.328795075416565}5}} 11/07/2021 01:00:39 - INFO - __main__ - Step 26995: {'lr': 0.0004656325944604236, 'samples': 5183040, 'steps': 26994, 'loss/train': 1.5736900568008423}}} 11/07/2021 01:00:42 - INFO - __main__ - Step 26999: {'lr': 0.00046562185272043137, 'samples': 5183808, 'steps': 26998, 'loss/train': 1.7131564617156982}} 11/07/2021 01:00:44 - INFO - __main__ - Step 27004: {'lr': 0.0004656084233594429, 'samples': 5184768, 'steps': 27003, 'loss/train': 1.6964117288589478}}} 11/07/2021 01:00:46 - INFO - __main__ - Step 27008: {'lr': 0.00046559767812194786, 'samples': 5185536, 'steps': 27007, 'loss/train': 1.5308825969696045}} 11/07/2021 01:00:46 - INFO - __main__ - Step 27008: {'lr': 0.00046559767812194786, 'samples': 5185536, 'steps': 27007, 'loss/train': 1.5308825969696045}} 11/07/2021 01:00:49 - INFO - __main__ - Step 27015: {'lr': 0.00046557887021629623, 'samples': 5186880, 'steps': 27014, 'loss/train': 1.6177726984024048}} 11/07/2021 01:00:52 - INFO - __main__ - Step 27019: {'lr': 0.0004655681207046083, 'samples': 5187648, 'steps': 27018, 'loss/train': 1.6902590990066528}}} 11/07/2021 01:00:54 - INFO - __main__ - Step 27024: {'lr': 0.0004655546816295448, 'samples': 5188608, 'steps': 27023, 'loss/train': 1.745589256286621}}}} 11/07/2021 01:00:56 - INFO - __main__ - Step 27028: {'lr': 0.0004655439286212257, 'samples': 5189376, 'steps': 27027, 'loss/train': 2.311471939086914}}}} 11/07/2021 01:00:56 - INFO - __main__ - Step 27028: {'lr': 0.0004655439286212257, 'samples': 5189376, 'steps': 27027, 'loss/train': 2.311471939086914}}}} 11/07/2021 01:00:59 - INFO - __main__ - Step 27035: {'lr': 0.00046552510711756444, 'samples': 5190720, 'steps': 27034, 'loss/train': 1.8001316785812378}} 11/07/2021 01:01:02 - INFO - __main__ - Step 27040: {'lr': 0.00046551166027298505, 'samples': 5191680, 'steps': 27039, 'loss/train': 2.7235004901885986}} 11/07/2021 01:01:02 - INFO - __main__ - Step 27040: {'lr': 0.00046551166027298505, 'samples': 5191680, 'steps': 27039, 'loss/train': 2.7235004901885986}} 11/07/2021 01:01:02 - INFO - __main__ - Step 27040: {'lr': 0.00046551166027298505, 'samples': 5191680, 'steps': 27039, 'loss/train': 2.7235004901885986}} 11/07/2021 01:01:08 - INFO - __main__ - Step 27051: {'lr': 0.0004654820686697754, 'samples': 5193792, 'steps': 27050, 'loss/train': 0.23133371770381927}} 11/07/2021 01:01:10 - INFO - __main__ - Step 27056: {'lr': 0.000465468614057231, 'samples': 5194752, 'steps': 27055, 'loss/train': 1.6699409484863281}7}} 11/07/2021 01:01:12 - INFO - __main__ - Step 27060: {'lr': 0.00046545784861962516, 'samples': 5195520, 'steps': 27059, 'loss/train': 1.312885046005249}}} 11/07/2021 01:01:14 - INFO - __main__ - Step 27064: {'lr': 0.0004654470816287076, 'samples': 5196288, 'steps': 27063, 'loss/train': 1.9853992462158203}}} 11/07/2021 01:01:14 - INFO - __main__ - Step 27064: {'lr': 0.0004654470816287076, 'samples': 5196288, 'steps': 27063, 'loss/train': 1.9853992462158203}}} 11/07/2021 01:01:17 - INFO - __main__ - Step 27071: {'lr': 0.00046542823565717914, 'samples': 5197632, 'steps': 27070, 'loss/train': 1.6977858543395996}} 11/07/2021 01:01:20 - INFO - __main__ - Step 27076: {'lr': 0.00046541477133686107, 'samples': 5198592, 'steps': 27075, 'loss/train': 1.4234788417816162}} 11/07/2021 01:01:20 - INFO - __main__ - Step 27076: {'lr': 0.00046541477133686107, 'samples': 5198592, 'steps': 27075, 'loss/train': 1.4234788417816162}} 11/07/2021 01:01:24 - INFO - __main__ - Step 27084: {'lr': 0.00046539322337716153, 'samples': 5200128, 'steps': 27083, 'loss/train': 2.0047829151153564}} 11/07/2021 01:01:25 - INFO - __main__ - Step 27088: {'lr': 0.0004653824470680043, 'samples': 5200896, 'steps': 27087, 'loss/train': 1.2486436367034912}}} 11/07/2021 01:01:28 - INFO - __main__ - Step 27092: {'lr': 0.00046537166920607886, 'samples': 5201664, 'steps': 27091, 'loss/train': 2.424959897994995}}} 11/07/2021 01:01:30 - INFO - __main__ - Step 27096: {'lr': 0.000465360889791463, 'samples': 5202432, 'steps': 27095, 'loss/train': 1.7497080564498901}}}} 11/07/2021 01:01:30 - INFO - __main__ - Step 27096: {'lr': 0.000465360889791463, 'samples': 5202432, 'steps': 27095, 'loss/train': 1.7497080564498901}}}} 11/07/2021 01:01:30 - INFO - __main__ - Step 27096: {'lr': 0.000465360889791463, 'samples': 5202432, 'steps': 27095, 'loss/train': 1.7497080564498901}}}} 11/07/2021 01:01:35 - INFO - __main__ - Step 27107: {'lr': 0.00046533123839584406, 'samples': 5204544, 'steps': 27106, 'loss/train': 1.4945223331451416}} 11/07/2021 01:01:38 - INFO - __main__ - Step 27113: {'lr': 0.0004653150599589498, 'samples': 5205696, 'steps': 27112, 'loss/train': 1.4433401823043823}}} 11/07/2021 01:01:38 - INFO - __main__ - Step 27113: {'lr': 0.0004653150599589498, 'samples': 5205696, 'steps': 27112, 'loss/train': 1.4433401823043823}}} 11/07/2021 01:01:38 - INFO - __main__ - Step 27113: {'lr': 0.0004653150599589498, 'samples': 5205696, 'steps': 27112, 'loss/train': 1.4433401823043823}}} 11/07/2021 01:01:43 - INFO - __main__ - Step 27124: {'lr': 0.00046528539042035, 'samples': 5207808, 'steps': 27123, 'loss/train': 1.5094414949417114}3}}} 11/07/2021 01:01:45 - INFO - __main__ - Step 27128: {'lr': 0.0004652745985870095, 'samples': 5208576, 'steps': 27127, 'loss/train': 1.5834203958511353}}} 11/07/2021 01:01:45 - INFO - __main__ - Step 27128: {'lr': 0.0004652745985870095, 'samples': 5208576, 'steps': 27127, 'loss/train': 1.5834203958511353}}} 11/07/2021 01:01:50 - INFO - __main__ - Step 27137: {'lr': 0.0004652503112876463, 'samples': 5210304, 'steps': 27136, 'loss/train': 1.476821780204773}}}} 11/07/2021 01:01:52 - INFO - __main__ - Step 27141: {'lr': 0.00046523951441062087, 'samples': 5211072, 'steps': 27140, 'loss/train': 1.591782569885254}}} 11/07/2021 01:01:53 - INFO - __main__ - Step 27145: {'lr': 0.0004652287159818577, 'samples': 5211840, 'steps': 27144, 'loss/train': 1.5447921752929688}}} 11/07/2021 01:01:55 - INFO - __main__ - Step 27149: {'lr': 0.00046521791600143483, 'samples': 5212608, 'steps': 27148, 'loss/train': 1.2859762907028198}} 11/07/2021 01:01:58 - INFO - __main__ - Step 27154: {'lr': 0.0004652044138440032, 'samples': 5213568, 'steps': 27153, 'loss/train': 1.6012340784072876}}} 11/07/2021 01:01:58 - INFO - __main__ - Step 27154: {'lr': 0.0004652044138440032, 'samples': 5213568, 'steps': 27153, 'loss/train': 1.6012340784072876}}} 11/07/2021 01:02:02 - INFO - __main__ - Step 27162: {'lr': 0.0004651828053498509, 'samples': 5215104, 'steps': 27161, 'loss/train': 1.4773788452148438}}} 11/07/2021 01:02:02 - INFO - __main__ - Step 27162: {'lr': 0.0004651828053498509, 'samples': 5215104, 'steps': 27161, 'loss/train': 1.4773788452148438}}} 11/07/2021 01:02:06 - INFO - __main__ - Step 27169: {'lr': 0.0004651638928271487, 'samples': 5216448, 'steps': 27168, 'loss/train': 1.6680691242218018}}} 11/07/2021 01:02:06 - INFO - __main__ - Step 27169: {'lr': 0.0004651638928271487, 'samples': 5216448, 'steps': 27168, 'loss/train': 1.6680691242218018}}} 11/07/2021 01:02:10 - INFO - __main__ - Step 27177: {'lr': 0.0004651422726985415, 'samples': 5217984, 'steps': 27176, 'loss/train': 1.8215045928955078}}} 11/07/2021 01:02:12 - INFO - __main__ - Step 27181: {'lr': 0.0004651314603076441, 'samples': 5218752, 'steps': 27180, 'loss/train': 1.4661346673965454}}} 11/07/2021 01:02:14 - INFO - __main__ - Step 27185: {'lr': 0.000465120646365788, 'samples': 5219520, 'steps': 27184, 'loss/train': 1.2242282629013062}}}} 11/07/2021 01:02:16 - INFO - __main__ - Step 27190: {'lr': 0.00046510712675755094, 'samples': 5220480, 'steps': 27189, 'loss/train': 1.5899168252944946}} 11/07/2021 01:02:18 - INFO - __main__ - Step 27194: {'lr': 0.00046509630932632293, 'samples': 5221248, 'steps': 27193, 'loss/train': 1.738891839981079}}} 11/07/2021 01:02:18 - INFO - __main__ - Step 27194: {'lr': 0.00046509630932632293, 'samples': 5221248, 'steps': 27193, 'loss/train': 1.738891839981079}}} 11/07/2021 01:02:22 - INFO - __main__ - Step 27201: {'lr': 0.0004650773750903363, 'samples': 5222592, 'steps': 27200, 'loss/train': 2.2038509845733643}}} 11/07/2021 01:02:24 - INFO - __main__ - Step 27206: {'lr': 0.00046506384772871935, 'samples': 5223552, 'steps': 27205, 'loss/train': 1.5792165994644165}} 11/07/2021 01:02:26 - INFO - __main__ - Step 27210: {'lr': 0.0004650530240951383, 'samples': 5224320, 'steps': 27209, 'loss/train': 1.2404732704162598}}} 11/07/2021 01:02:29 - INFO - __main__ - Step 27214: {'lr': 0.00046504219891116416, 'samples': 5225088, 'steps': 27213, 'loss/train': 1.3884916305541992}} 11/07/2021 01:02:29 - INFO - __main__ - Step 27214: {'lr': 0.00046504219891116416, 'samples': 5225088, 'steps': 27213, 'loss/train': 1.3884916305541992}} 11/07/2021 01:02:32 - INFO - __main__ - Step 27221: {'lr': 0.0004650232511088105, 'samples': 5226432, 'steps': 27220, 'loss/train': 1.6209487915039062}}} 11/07/2021 01:02:34 - INFO - __main__ - Step 27226: {'lr': 0.000465009714057663, 'samples': 5227392, 'steps': 27225, 'loss/train': 1.7976675033569336}}}} 11/07/2021 01:02:34 - INFO - __main__ - Step 27226: {'lr': 0.000465009714057663, 'samples': 5227392, 'steps': 27225, 'loss/train': 1.7976675033569336}}}} 11/07/2021 01:02:38 - INFO - __main__ - Step 27234: {'lr': 0.00046498804973812735, 'samples': 5228928, 'steps': 27233, 'loss/train': 5.008947372436523}}} 11/07/2021 01:02:38 - INFO - __main__ - Step 27234: {'lr': 0.00046498804973812735, 'samples': 5228928, 'steps': 27233, 'loss/train': 5.008947372436523}}} 11/07/2021 01:02:42 - INFO - __main__ - Step 27241: {'lr': 0.00046496908837282173, 'samples': 5230272, 'steps': 27240, 'loss/train': 1.9977402687072754}} 11/07/2021 01:02:44 - INFO - __main__ - Step 27247: {'lr': 0.0004649528319963641, 'samples': 5231424, 'steps': 27246, 'loss/train': 1.4237511157989502}}} 11/07/2021 01:02:47 - INFO - __main__ - Step 27251: {'lr': 0.0004649419924749541, 'samples': 5232192, 'steps': 27250, 'loss/train': 1.6707024574279785}}} 11/07/2021 01:02:47 - INFO - __main__ - Step 27251: {'lr': 0.0004649419924749541, 'samples': 5232192, 'steps': 27250, 'loss/train': 1.6707024574279785}}} 11/07/2021 01:02:50 - INFO - __main__ - Step 27258: {'lr': 0.0004649230195838261, 'samples': 5233536, 'steps': 27257, 'loss/train': 1.4425030946731567}}} 11/07/2021 01:02:52 - INFO - __main__ - Step 27262: {'lr': 0.00046491217580122427, 'samples': 5234304, 'steps': 27261, 'loss/train': 1.703986644744873}}} 11/07/2021 01:02:54 - INFO - __main__ - Step 27267: {'lr': 0.0004648986188941685, 'samples': 5235264, 'steps': 27266, 'loss/train': 1.7877448797225952}}} 11/07/2021 01:02:57 - INFO - __main__ - Step 27272: {'lr': 0.00046488505956636286, 'samples': 5236224, 'steps': 27271, 'loss/train': 1.777441143989563}}} 11/07/2021 01:02:59 - INFO - __main__ - Step 27276: {'lr': 0.00046487421036128085, 'samples': 5236992, 'steps': 27275, 'loss/train': 1.2483229637145996}} 11/07/2021 01:02:59 - INFO - __main__ - Step 27276: {'lr': 0.00046487421036128085, 'samples': 5236992, 'steps': 27275, 'loss/train': 1.2483229637145996}} 11/07/2021 01:03:02 - INFO - __main__ - Step 27283: {'lr': 0.0004648552205249029, 'samples': 5238336, 'steps': 27282, 'loss/train': 1.3607357740402222}}} 11/07/2021 01:03:04 - INFO - __main__ - Step 27288: {'lr': 0.0004648416534517236, 'samples': 5239296, 'steps': 27287, 'loss/train': 1.1679052114486694}}} 11/07/2021 01:03:07 - INFO - __main__ - Step 27292: {'lr': 0.0004648307980506948, 'samples': 5240064, 'steps': 27291, 'loss/train': 1.4171942472457886}}} 11/07/2021 01:03:09 - INFO - __main__ - Step 27296: {'lr': 0.000464819941100875, 'samples': 5240832, 'steps': 27295, 'loss/train': 1.562004566192627}6}}} 11/07/2021 01:03:09 - INFO - __main__ - Step 27296: {'lr': 0.000464819941100875, 'samples': 5240832, 'steps': 27295, 'loss/train': 1.562004566192627}6}}} 11/07/2021 01:03:12 - INFO - __main__ - Step 27303: {'lr': 0.00046480093771214716, 'samples': 5242176, 'steps': 27302, 'loss/train': 1.8879752159118652}} 11/07/2021 01:03:12 - INFO - __main__ - Step 27303: {'lr': 0.00046480093771214716, 'samples': 5242176, 'steps': 27302, 'loss/train': 1.8879752159118652}} 11/07/2021 01:03:17 - INFO - __main__ - Step 27311: {'lr': 0.00046477921374646624, 'samples': 5243712, 'steps': 27310, 'loss/train': 1.506710410118103}}} 11/07/2021 01:03:18 - INFO - __main__ - Step 27315: {'lr': 0.0004647683494409578, 'samples': 5244480, 'steps': 27314, 'loss/train': 1.5598256587982178}}} 11/07/2021 01:03:20 - INFO - __main__ - Step 27319: {'lr': 0.00046475748358710856, 'samples': 5245248, 'steps': 27318, 'loss/train': 1.410498023033142}}} 11/07/2021 01:03:23 - INFO - __main__ - Step 27324: {'lr': 0.000464743899092562, 'samples': 5246208, 'steps': 27323, 'loss/train': 1.4427452087402344}}}} 11/07/2021 01:03:25 - INFO - __main__ - Step 27328: {'lr': 0.00046473302975523224, 'samples': 5246976, 'steps': 27327, 'loss/train': 1.3807927370071411}} 11/07/2021 01:03:25 - INFO - __main__ - Step 27328: {'lr': 0.00046473302975523224, 'samples': 5246976, 'steps': 27327, 'loss/train': 1.3807927370071411}} 11/07/2021 01:03:28 - INFO - __main__ - Step 27335: {'lr': 0.0004647140046898697, 'samples': 5248320, 'steps': 27334, 'loss/train': 1.3127461671829224}}} 11/07/2021 01:03:31 - INFO - __main__ - Step 27340: {'lr': 0.00046470041245503895, 'samples': 5249280, 'steps': 27339, 'loss/train': 1.445697546005249}}} 11/07/2021 01:03:33 - INFO - __main__ - Step 27344: {'lr': 0.0004646895369258345, 'samples': 5250048, 'steps': 27343, 'loss/train': 1.6821507215499878}}} 11/07/2021 01:03:33 - INFO - __main__ - Step 27344: {'lr': 0.0004646895369258345, 'samples': 5250048, 'steps': 27343, 'loss/train': 1.6821507215499878}}} 11/07/2021 01:03:36 - INFO - __main__ - Step 27351: {'lr': 0.00046467050102544594, 'samples': 5251392, 'steps': 27350, 'loss/train': 1.8179465532302856}} 11/07/2021 01:03:38 - INFO - __main__ - Step 27355: {'lr': 0.00046465962124005535, 'samples': 5252160, 'steps': 27354, 'loss/train': 1.4978750944137573}} 11/07/2021 01:03:41 - INFO - __main__ - Step 27360: {'lr': 0.00046464601933207417, 'samples': 5253120, 'steps': 27359, 'loss/train': 1.4246464967727661}} 11/07/2021 01:03:43 - INFO - __main__ - Step 27365: {'lr': 0.00046463241500618846, 'samples': 5254080, 'steps': 27364, 'loss/train': 1.322129249572754}}} 11/07/2021 01:03:43 - INFO - __main__ - Step 27365: {'lr': 0.00046463241500618846, 'samples': 5254080, 'steps': 27364, 'loss/train': 1.322129249572754}}} 11/07/2021 01:03:46 - INFO - __main__ - Step 27372: {'lr': 0.0004646133648881606, 'samples': 5255424, 'steps': 27371, 'loss/train': 1.596226692199707}}}} 11/07/2021 01:03:48 - INFO - __main__ - Step 27376: {'lr': 0.000464602476978971, 'samples': 5256192, 'steps': 27375, 'loss/train': 1.6068828105926514}}}} 11/07/2021 01:03:50 - INFO - __main__ - Step 27380: {'lr': 0.00046459158752263643, 'samples': 5256960, 'steps': 27379, 'loss/train': 1.663011908531189}}} 11/07/2021 01:03:53 - INFO - __main__ - Step 27384: {'lr': 0.0004645806965192353, 'samples': 5257728, 'steps': 27383, 'loss/train': 0.6871659159660339}}} 11/07/2021 01:03:55 - INFO - __main__ - Step 27388: {'lr': 0.0004645698039688461, 'samples': 5258496, 'steps': 27387, 'loss/train': 1.952163815498352}}}} 11/07/2021 01:03:56 - INFO - __main__ - Step 27392: {'lr': 0.00046455890987154747, 'samples': 5259264, 'steps': 27391, 'loss/train': 1.5040371417999268}} 11/07/2021 01:03:56 - INFO - __main__ - Step 27392: {'lr': 0.00046455890987154747, 'samples': 5259264, 'steps': 27391, 'loss/train': 1.5040371417999268}} 11/07/2021 01:04:00 - INFO - __main__ - Step 27400: {'lr': 0.000464537117036536, 'samples': 5260800, 'steps': 27399, 'loss/train': 1.4849789142608643}8}} 11/07/2021 01:04:03 - INFO - __main__ - Step 27404: {'lr': 0.0004645262182989802, 'samples': 5261568, 'steps': 27403, 'loss/train': 1.6559795141220093}}} 11/07/2021 01:04:04 - INFO - __main__ - Step 27408: {'lr': 0.00046451531801482913, 'samples': 5262336, 'steps': 27407, 'loss/train': 1.6804707050323486}} 11/07/2021 01:04:07 - INFO - __main__ - Step 27413: {'lr': 0.00046450169048486045, 'samples': 5263296, 'steps': 27412, 'loss/train': 1.5773380994796753}} 11/07/2021 01:04:09 - INFO - __main__ - Step 27418: {'lr': 0.000464488060538613, 'samples': 5264256, 'steps': 27417, 'loss/train': 1.6511754989624023}3}} 11/07/2021 01:04:11 - INFO - __main__ - Step 27422: {'lr': 0.0004644771548419975, 'samples': 5265024, 'steps': 27421, 'loss/train': 1.4866039752960205}}} 11/07/2021 01:04:13 - INFO - __main__ - Step 27426: {'lr': 0.00046446624759914043, 'samples': 5265792, 'steps': 27425, 'loss/train': 1.6005641222000122}} 11/07/2021 01:04:13 - INFO - __main__ - Step 27426: {'lr': 0.00046446624759914043, 'samples': 5265792, 'steps': 27425, 'loss/train': 1.6005641222000122}} 11/07/2021 01:04:16 - INFO - __main__ - Step 27433: {'lr': 0.0004644471562037333, 'samples': 5267136, 'steps': 27432, 'loss/train': 1.556830644607544}2}} 11/07/2021 01:04:16 - INFO - __main__ - Step 27433: {'lr': 0.0004644471562037333, 'samples': 5267136, 'steps': 27432, 'loss/train': 1.556830644607544}2}} 11/07/2021 01:04:16 - INFO - __main__ - Step 27433: {'lr': 0.0004644471562037333, 'samples': 5267136, 'steps': 27432, 'loss/train': 1.556830644607544}2}} 11/07/2021 01:04:22 - INFO - __main__ - Step 27444: {'lr': 0.00046441714587365317, 'samples': 5269248, 'steps': 27443, 'loss/train': 1.5707968473434448}} 11/07/2021 01:04:24 - INFO - __main__ - Step 27449: {'lr': 0.0004644035009499052, 'samples': 5270208, 'steps': 27448, 'loss/train': 1.9587770700454712}}} 11/07/2021 01:04:24 - INFO - __main__ - Step 27449: {'lr': 0.0004644035009499052, 'samples': 5270208, 'steps': 27448, 'loss/train': 1.9587770700454712}}} 11/07/2021 01:04:29 - INFO - __main__ - Step 27457: {'lr': 0.0004643816640484131, 'samples': 5271744, 'steps': 27456, 'loss/train': 1.6021026372909546}}} 11/07/2021 01:04:30 - INFO - __main__ - Step 27461: {'lr': 0.00046437074327929795, 'samples': 5272512, 'steps': 27460, 'loss/train': 1.1870478391647339}} 11/07/2021 01:04:32 - INFO - __main__ - Step 27465: {'lr': 0.0004643598209647085, 'samples': 5273280, 'steps': 27464, 'loss/train': 1.6166335344314575}}} 11/07/2021 01:04:35 - INFO - __main__ - Step 27470: {'lr': 0.0004643461658982683, 'samples': 5274240, 'steps': 27469, 'loss/train': 1.721252202987671}}}} 11/07/2021 01:04:37 - INFO - __main__ - Step 27474: {'lr': 0.0004643352401066494, 'samples': 5275008, 'steps': 27473, 'loss/train': 1.6142194271087646}}} 11/07/2021 01:04:37 - INFO - __main__ - Step 27474: {'lr': 0.0004643352401066494, 'samples': 5275008, 'steps': 27473, 'loss/train': 1.6142194271087646}}} 11/07/2021 01:04:40 - INFO - __main__ - Step 27481: {'lr': 0.0004643161162531818, 'samples': 5276352, 'steps': 27480, 'loss/train': 1.2445502281188965}}} 11/07/2021 01:04:42 - INFO - __main__ - Step 27485: {'lr': 0.0004643051862124018, 'samples': 5277120, 'steps': 27484, 'loss/train': 1.5504546165466309}}} 11/07/2021 01:04:45 - INFO - __main__ - Step 27491: {'lr': 0.00046428878825437815, 'samples': 5278272, 'steps': 27490, 'loss/train': 1.4722696542739868}} 11/07/2021 01:04:47 - INFO - __main__ - Step 27495: {'lr': 0.00046427785435124147, 'samples': 5279040, 'steps': 27494, 'loss/train': 1.5332293510437012}} 11/07/2021 01:04:47 - INFO - __main__ - Step 27495: {'lr': 0.00046427785435124147, 'samples': 5279040, 'steps': 27494, 'loss/train': 1.5332293510437012}} 11/07/2021 01:04:50 - INFO - __main__ - Step 27502: {'lr': 0.00046425871630361343, 'samples': 5280384, 'steps': 27501, 'loss/train': 1.7171363830566406}} 11/07/2021 01:04:52 - INFO - __main__ - Step 27506: {'lr': 0.00046424777815245354, 'samples': 5281152, 'steps': 27505, 'loss/train': 3.3891711235046387}} 11/07/2021 01:04:55 - INFO - __main__ - Step 27511: {'lr': 0.0004642341032914362, 'samples': 5282112, 'steps': 27510, 'loss/train': 1.5339746475219727}}} 11/07/2021 01:04:55 - INFO - __main__ - Step 27511: {'lr': 0.0004642341032914362, 'samples': 5282112, 'steps': 27510, 'loss/train': 1.5339746475219727}}} 11/07/2021 01:04:59 - INFO - __main__ - Step 27519: {'lr': 0.0004642122184942824, 'samples': 5283648, 'steps': 27518, 'loss/train': 1.648287296295166}}}} 11/07/2021 01:05:00 - INFO - __main__ - Step 27523: {'lr': 0.00046420127377916863, 'samples': 5284416, 'steps': 27522, 'loss/train': 1.6656675338745117}} 11/07/2021 01:05:02 - INFO - __main__ - Step 27527: {'lr': 0.0004641903275198024, 'samples': 5285184, 'steps': 27526, 'loss/train': 0.6573544144630432}}} 11/07/2021 01:05:05 - INFO - __main__ - Step 27532: {'lr': 0.0004641766425241095, 'samples': 5286144, 'steps': 27531, 'loss/train': 1.555513858795166}}}} 11/07/2021 01:05:07 - INFO - __main__ - Step 27536: {'lr': 0.0004641656927904634, 'samples': 5286912, 'steps': 27535, 'loss/train': 1.411777377128601}}}} 11/07/2021 01:05:07 - INFO - __main__ - Step 27536: {'lr': 0.0004641656927904634, 'samples': 5286912, 'steps': 27535, 'loss/train': 1.411777377128601}}}} 11/07/2021 01:05:10 - INFO - __main__ - Step 27543: {'lr': 0.0004641465270413896, 'samples': 5288256, 'steps': 27542, 'loss/train': 0.9904304146766663}}} 11/07/2021 01:05:13 - INFO - __main__ - Step 27547: {'lr': 0.0004641355730619442, 'samples': 5289024, 'steps': 27546, 'loss/train': 1.589703917503357}}}} 11/07/2021 01:05:13 - INFO - __main__ - Step 27547: {'lr': 0.0004641355730619442, 'samples': 5289024, 'steps': 27546, 'loss/train': 1.589703917503357}}}} 11/07/2021 01:05:17 - INFO - __main__ - Step 27555: {'lr': 0.00046411366047179547, 'samples': 5290560, 'steps': 27554, 'loss/train': 1.6179206371307373}} 11/07/2021 01:05:18 - INFO - __main__ - Step 27559: {'lr': 0.00046410270186125014, 'samples': 5291328, 'steps': 27558, 'loss/train': 1.542672038078308}}} 11/07/2021 01:05:18 - INFO - __main__ - Step 27559: {'lr': 0.00046410270186125014, 'samples': 5291328, 'steps': 27558, 'loss/train': 1.542672038078308}}} 11/07/2021 01:05:22 - INFO - __main__ - Step 27564: {'lr': 0.0004640890014274718, 'samples': 5292288, 'steps': 27563, 'loss/train': 1.3283414840698242}}} 11/07/2021 01:05:25 - INFO - __main__ - Step 27570: {'lr': 0.0004640725577235998, 'samples': 5293440, 'steps': 27569, 'loss/train': 1.3322672843933105}}} 11/07/2021 01:05:27 - INFO - __main__ - Step 27574: {'lr': 0.00046406159332517956, 'samples': 5294208, 'steps': 27573, 'loss/train': 0.921796977519989}}} 11/07/2021 01:05:29 - INFO - __main__ - Step 27578: {'lr': 0.00046405062738351366, 'samples': 5294976, 'steps': 27577, 'loss/train': 1.8103362321853638}} 11/07/2021 01:05:31 - INFO - __main__ - Step 27582: {'lr': 0.00046403965989868124, 'samples': 5295744, 'steps': 27581, 'loss/train': 1.3456676006317139}} 11/07/2021 01:05:33 - INFO - __main__ - Step 27587: {'lr': 0.00046402594837268314, 'samples': 5296704, 'steps': 27586, 'loss/train': 1.1525654792785645}} 11/07/2021 01:05:35 - INFO - __main__ - Step 27591: {'lr': 0.00046401497741601505, 'samples': 5297472, 'steps': 27590, 'loss/train': 1.3765476942062378}} 11/07/2021 01:05:37 - INFO - __main__ - Step 27595: {'lr': 0.00046400400491643744, 'samples': 5298240, 'steps': 27594, 'loss/train': 1.5986907482147217}} 11/07/2021 01:05:37 - INFO - __main__ - Step 27595: {'lr': 0.00046400400491643744, 'samples': 5298240, 'steps': 27594, 'loss/train': 1.5986907482147217}} 11/07/2021 01:05:41 - INFO - __main__ - Step 27602: {'lr': 0.0004639847993297884, 'samples': 5299584, 'steps': 27601, 'loss/train': 1.5791829824447632}}} 11/07/2021 01:05:43 - INFO - __main__ - Step 27607: {'lr': 0.0004639710781610384, 'samples': 5300544, 'steps': 27606, 'loss/train': 0.813217043876648}}}} 11/07/2021 01:05:43 - INFO - __main__ - Step 27607: {'lr': 0.0004639710781610384, 'samples': 5300544, 'steps': 27606, 'loss/train': 0.813217043876648}}}} 11/07/2021 01:05:47 - INFO - __main__ - Step 27615: {'lr': 0.00046394911927767526, 'samples': 5302080, 'steps': 27614, 'loss/train': 1.3175309896469116}} 11/07/2021 01:05:49 - INFO - __main__ - Step 27619: {'lr': 0.000463938137522302, 'samples': 5302848, 'steps': 27618, 'loss/train': 1.403387188911438}16}} 11/07/2021 01:05:51 - INFO - __main__ - Step 27623: {'lr': 0.0004639271542245731, 'samples': 5303616, 'steps': 27622, 'loss/train': 1.7359400987625122}}} 11/07/2021 01:05:53 - INFO - __main__ - Step 27627: {'lr': 0.0004639161693845678, 'samples': 5304384, 'steps': 27626, 'loss/train': 1.8536758422851562}}} 11/07/2021 01:05:55 - INFO - __main__ - Step 27631: {'lr': 0.00046390518300236535, 'samples': 5305152, 'steps': 27630, 'loss/train': 1.5727344751358032}} 11/07/2021 01:05:57 - INFO - __main__ - Step 27635: {'lr': 0.00046389419507804493, 'samples': 5305920, 'steps': 27634, 'loss/train': 1.580593466758728}}} 11/07/2021 01:05:58 - INFO - __main__ - Step 27639: {'lr': 0.00046388320561168567, 'samples': 5306688, 'steps': 27638, 'loss/train': 1.0632545948028564}} 11/07/2021 01:06:01 - INFO - __main__ - Step 27643: {'lr': 0.0004638722146033669, 'samples': 5307456, 'steps': 27642, 'loss/train': 1.375866174697876}4}} 11/07/2021 01:06:03 - INFO - __main__ - Step 27648: {'lr': 0.0004638584736747085, 'samples': 5308416, 'steps': 27647, 'loss/train': 1.1750167608261108}}} 11/07/2021 01:06:03 - INFO - __main__ - Step 27648: {'lr': 0.0004638584736747085, 'samples': 5308416, 'steps': 27647, 'loss/train': 1.1750167608261108}}} 11/07/2021 01:06:06 - INFO - __main__ - Step 27655: {'lr': 0.00046383923232744565, 'samples': 5309760, 'steps': 27654, 'loss/train': 1.659050464630127}}} 11/07/2021 01:06:08 - INFO - __main__ - Step 27659: {'lr': 0.0004638282351520812, 'samples': 5310528, 'steps': 27658, 'loss/train': 0.984664261341095}}}} 11/07/2021 01:06:11 - INFO - __main__ - Step 27664: {'lr': 0.00046381448651506153, 'samples': 5311488, 'steps': 27663, 'loss/train': 1.6429427862167358}} 11/07/2021 01:06:11 - INFO - __main__ - Step 27664: {'lr': 0.00046381448651506153, 'samples': 5311488, 'steps': 27663, 'loss/train': 1.6429427862167358}} 11/07/2021 01:06:15 - INFO - __main__ - Step 27672: {'lr': 0.00046379248368613615, 'samples': 5313024, 'steps': 27671, 'loss/train': 1.229887843132019}}} 11/07/2021 01:06:15 - INFO - __main__ - Step 27672: {'lr': 0.00046379248368613615, 'samples': 5313024, 'steps': 27671, 'loss/train': 1.229887843132019}}} 11/07/2021 01:06:18 - INFO - __main__ - Step 27679: {'lr': 0.000463773226153396, 'samples': 5314368, 'steps': 27678, 'loss/train': 0.9993812441825867}}}} 11/07/2021 01:06:21 - INFO - __main__ - Step 27684: {'lr': 0.000463759467883155, 'samples': 5315328, 'steps': 27683, 'loss/train': 1.6087898015975952}}}} 11/07/2021 01:06:21 - INFO - __main__ - Step 27684: {'lr': 0.000463759467883155, 'samples': 5315328, 'steps': 27683, 'loss/train': 1.6087898015975952}}}} 11/07/2021 01:06:21 - INFO - __main__ - Step 27684: {'lr': 0.000463759467883155, 'samples': 5315328, 'steps': 27683, 'loss/train': 1.6087898015975952}}}} 11/07/2021 01:06:26 - INFO - __main__ - Step 27695: {'lr': 0.00046372919121297207, 'samples': 5317440, 'steps': 27694, 'loss/train': 1.4711743593215942}} 11/07/2021 01:06:29 - INFO - __main__ - Step 27700: {'lr': 0.0004637154252379394, 'samples': 5318400, 'steps': 27699, 'loss/train': 1.6177538633346558}}} 11/07/2021 01:06:29 - INFO - __main__ - Step 27700: {'lr': 0.0004637154252379394, 'samples': 5318400, 'steps': 27699, 'loss/train': 1.6177538633346558}}} 11/07/2021 01:06:33 - INFO - __main__ - Step 27706: {'lr': 0.00046369890289011696, 'samples': 5319552, 'steps': 27705, 'loss/train': 1.6031389236450195}} 11/07/2021 01:06:33 - INFO - __main__ - Step 27706: {'lr': 0.00046369890289011696, 'samples': 5319552, 'steps': 27705, 'loss/train': 1.6031389236450195}} 11/07/2021 01:06:37 - INFO - __main__ - Step 27715: {'lr': 0.0004636741128689308, 'samples': 5321280, 'steps': 27714, 'loss/train': 2.0229625701904297}}} 11/07/2021 01:06:39 - INFO - __main__ - Step 27719: {'lr': 0.0004636630925784484, 'samples': 5322048, 'steps': 27718, 'loss/train': 1.936221957206726}}}} 11/07/2021 01:06:41 - INFO - __main__ - Step 27723: {'lr': 0.00046365207074759344, 'samples': 5322816, 'steps': 27722, 'loss/train': 1.82841157913208}}}} 11/07/2021 01:06:43 - INFO - __main__ - Step 27728: {'lr': 0.00046363829129299655, 'samples': 5323776, 'steps': 27727, 'loss/train': 1.5989880561828613}} 11/07/2021 01:06:45 - INFO - __main__ - Step 27732: {'lr': 0.00046362726599659355, 'samples': 5324544, 'steps': 27731, 'loss/train': 1.2981113195419312}} 11/07/2021 01:06:45 - INFO - __main__ - Step 27732: {'lr': 0.00046362726599659355, 'samples': 5324544, 'steps': 27731, 'loss/train': 1.2981113195419312}} 11/07/2021 01:06:49 - INFO - __main__ - Step 27739: {'lr': 0.0004636079680220358, 'samples': 5325888, 'steps': 27738, 'loss/train': 1.7593157291412354}}} 11/07/2021 01:06:51 - INFO - __main__ - Step 27743: {'lr': 0.0004635969384905095, 'samples': 5326656, 'steps': 27742, 'loss/train': 1.571807622909546}}}} 11/07/2021 01:06:53 - INFO - __main__ - Step 27748: {'lr': 0.0004635831494106325, 'samples': 5327616, 'steps': 27747, 'loss/train': 1.4999247789382935}}} 11/07/2021 01:06:55 - INFO - __main__ - Step 27752: {'lr': 0.0004635721164144526, 'samples': 5328384, 'steps': 27751, 'loss/train': 1.455724835395813}}}} 11/07/2021 01:06:57 - INFO - __main__ - Step 27756: {'lr': 0.00046356108187855594, 'samples': 5329152, 'steps': 27755, 'loss/train': 1.769636869430542}}} 11/07/2021 01:06:59 - INFO - __main__ - Step 27760: {'lr': 0.000463550045803022, 'samples': 5329920, 'steps': 27759, 'loss/train': 1.2639565467834473}}}} 11/07/2021 01:07:01 - INFO - __main__ - Step 27764: {'lr': 0.0004635390081879303, 'samples': 5330688, 'steps': 27763, 'loss/train': 1.293779730796814}}}} 11/07/2021 01:07:03 - INFO - __main__ - Step 27768: {'lr': 0.0004635279690333606, 'samples': 5331456, 'steps': 27767, 'loss/train': 1.8142160177230835}}} 11/07/2021 01:07:03 - INFO - __main__ - Step 27768: {'lr': 0.0004635279690333606, 'samples': 5331456, 'steps': 27767, 'loss/train': 1.8142160177230835}}} 11/07/2021 01:07:08 - INFO - __main__ - Step 27776: {'lr': 0.0004635058861061051, 'samples': 5332992, 'steps': 27775, 'loss/train': 1.5662585496902466}}} 11/07/2021 01:07:09 - INFO - __main__ - Step 27780: {'lr': 0.00046349484233357854, 'samples': 5333760, 'steps': 27779, 'loss/train': 1.5026328563690186}} 11/07/2021 01:07:09 - INFO - __main__ - Step 27780: {'lr': 0.00046349484233357854, 'samples': 5333760, 'steps': 27779, 'loss/train': 1.5026328563690186}} 11/07/2021 01:07:14 - INFO - __main__ - Step 27788: {'lr': 0.000463472750171126, 'samples': 5335296, 'steps': 27787, 'loss/train': 1.5533576011657715}6}} 11/07/2021 01:07:16 - INFO - __main__ - Step 27792: {'lr': 0.0004634617017813593, 'samples': 5336064, 'steps': 27791, 'loss/train': 1.4849623441696167}}} 11/07/2021 01:07:17 - INFO - __main__ - Step 27796: {'lr': 0.0004634506518526718, 'samples': 5336832, 'steps': 27795, 'loss/train': 1.4754371643066406}}} 11/07/2021 01:07:19 - INFO - __main__ - Step 27800: {'lr': 0.0004634396003851431, 'samples': 5337600, 'steps': 27799, 'loss/train': 1.513837456703186}}}} 11/07/2021 01:07:19 - INFO - __main__ - Step 27800: {'lr': 0.0004634396003851431, 'samples': 5337600, 'steps': 27799, 'loss/train': 1.513837456703186}}}} 11/07/2021 01:07:23 - INFO - __main__ - Step 27807: {'lr': 0.0004634202566143712, 'samples': 5338944, 'steps': 27806, 'loss/train': 1.6079967021942139}}} 11/07/2021 01:07:25 - INFO - __main__ - Step 27812: {'lr': 0.0004634064367503072, 'samples': 5339904, 'steps': 27811, 'loss/train': 1.5238810777664185}}} 11/07/2021 01:07:25 - INFO - __main__ - Step 27812: {'lr': 0.0004634064367503072, 'samples': 5339904, 'steps': 27811, 'loss/train': 1.5238810777664185}}} 11/07/2021 01:07:25 - INFO - __main__ - Step 27812: {'lr': 0.0004634064367503072, 'samples': 5339904, 'steps': 27811, 'loss/train': 1.5238810777664185}}} 11/07/2021 01:07:31 - INFO - __main__ - Step 27823: {'lr': 0.0004633760245877129, 'samples': 5342016, 'steps': 27822, 'loss/train': 1.960642695426941}}}} 11/07/2021 01:07:33 - INFO - __main__ - Step 27828: {'lr': 0.00046336219703158526, 'samples': 5342976, 'steps': 27827, 'loss/train': 1.6386125087738037}} 11/07/2021 01:07:36 - INFO - __main__ - Step 27833: {'lr': 0.00046334836707201486, 'samples': 5343936, 'steps': 27832, 'loss/train': 1.515210509300232}}} 11/07/2021 01:07:36 - INFO - __main__ - Step 27833: {'lr': 0.00046334836707201486, 'samples': 5343936, 'steps': 27832, 'loss/train': 1.515210509300232}}} 11/07/2021 01:07:39 - INFO - __main__ - Step 27840: {'lr': 0.00046332900109112893, 'samples': 5345280, 'steps': 27839, 'loss/train': 1.3671104907989502}} 11/07/2021 01:07:41 - INFO - __main__ - Step 27844: {'lr': 0.00046331793270160885, 'samples': 5346048, 'steps': 27843, 'loss/train': 1.369606614112854}}} 11/07/2021 01:07:43 - INFO - __main__ - Step 27848: {'lr': 0.00046330686277420454, 'samples': 5346816, 'steps': 27847, 'loss/train': 2.1239893436431885}} 11/07/2021 01:07:46 - INFO - __main__ - Step 27853: {'lr': 0.0004632930232024209, 'samples': 5347776, 'steps': 27852, 'loss/train': 1.594679832458496}5}} 11/07/2021 01:07:46 - INFO - __main__ - Step 27853: {'lr': 0.0004632930232024209, 'samples': 5347776, 'steps': 27852, 'loss/train': 1.594679832458496}5}} 11/07/2021 01:07:49 - INFO - __main__ - Step 27860: {'lr': 0.00046327364376548384, 'samples': 5349120, 'steps': 27859, 'loss/train': 1.5656856298446655}} 11/07/2021 01:07:51 - INFO - __main__ - Step 27864: {'lr': 0.00046326256768734053, 'samples': 5349888, 'steps': 27863, 'loss/train': 1.8845356702804565}} 11/07/2021 01:07:54 - INFO - __main__ - Step 27869: {'lr': 0.0004632487204275822, 'samples': 5350848, 'steps': 27868, 'loss/train': 2.0191144943237305}}} 11/07/2021 01:07:54 - INFO - __main__ - Step 27869: {'lr': 0.0004632487204275822, 'samples': 5350848, 'steps': 27868, 'loss/train': 2.0191144943237305}}} 11/07/2021 01:07:58 - INFO - __main__ - Step 27877: {'lr': 0.0004632265598155315, 'samples': 5352384, 'steps': 27876, 'loss/train': 1.4199742078781128}}} 11/07/2021 01:07:59 - INFO - __main__ - Step 27881: {'lr': 0.0004632154772036279, 'samples': 5353152, 'steps': 27880, 'loss/train': 1.6037013530731201}}} 11/07/2021 01:08:01 - INFO - __main__ - Step 27885: {'lr': 0.0004632043930545785, 'samples': 5353920, 'steps': 27884, 'loss/train': 1.512109398841858}}}} 11/07/2021 01:08:01 - INFO - __main__ - Step 27885: {'lr': 0.0004632043930545785, 'samples': 5353920, 'steps': 27884, 'loss/train': 1.512109398841858}}}} 11/07/2021 01:08:05 - INFO - __main__ - Step 27892: {'lr': 0.0004631849920952259, 'samples': 5355264, 'steps': 27891, 'loss/train': 1.7205824851989746}}} 11/07/2021 01:08:08 - INFO - __main__ - Step 27897: {'lr': 0.00046317113138535584, 'samples': 5356224, 'steps': 27896, 'loss/train': 1.501318097114563}}} 11/07/2021 01:08:08 - INFO - __main__ - Step 27897: {'lr': 0.00046317113138535584, 'samples': 5356224, 'steps': 27896, 'loss/train': 1.501318097114563}}} 11/07/2021 01:08:12 - INFO - __main__ - Step 27905: {'lr': 0.0004631489492549443, 'samples': 5357760, 'steps': 27904, 'loss/train': 1.9023025035858154}}} 11/07/2021 01:08:12 - INFO - __main__ - Step 27905: {'lr': 0.0004631489492549443, 'samples': 5357760, 'steps': 27904, 'loss/train': 1.9023025035858154}}} 11/07/2021 01:08:15 - INFO - __main__ - Step 27912: {'lr': 0.000463129534848627, 'samples': 5359104, 'steps': 27911, 'loss/train': 1.0003376007080078}}}} 11/07/2021 01:08:18 - INFO - __main__ - Step 27917: {'lr': 0.0004631156645345318, 'samples': 5360064, 'steps': 27916, 'loss/train': 1.992263674736023}}}} 11/07/2021 01:08:20 - INFO - __main__ - Step 27922: {'lr': 0.0004631017918197709, 'samples': 5361024, 'steps': 27921, 'loss/train': 1.594227910041809}}}} 11/07/2021 01:08:20 - INFO - __main__ - Step 27922: {'lr': 0.0004631017918197709, 'samples': 5361024, 'steps': 27921, 'loss/train': 1.594227910041809}}}} 11/07/2021 01:08:24 - INFO - __main__ - Step 27929: {'lr': 0.0004630823659862846, 'samples': 5362368, 'steps': 27928, 'loss/train': 1.5739498138427734}}} 11/07/2021 01:08:26 - INFO - __main__ - Step 27934: {'lr': 0.00046306848751056346, 'samples': 5363328, 'steps': 27933, 'loss/train': 1.3461530208587646}} 11/07/2021 01:08:28 - INFO - __main__ - Step 27938: {'lr': 0.0004630573830018824, 'samples': 5364096, 'steps': 27937, 'loss/train': 1.4308301210403442}}} 11/07/2021 01:08:30 - INFO - __main__ - Step 27943: {'lr': 0.0004630435002060321, 'samples': 5365056, 'steps': 27942, 'loss/train': 1.272430419921875}}}} 11/07/2021 01:08:32 - INFO - __main__ - Step 27947: {'lr': 0.0004630323922414503, 'samples': 5365824, 'steps': 27946, 'loss/train': 1.3695404529571533}}} 11/07/2021 01:08:32 - INFO - __main__ - Step 27947: {'lr': 0.0004630323922414503, 'samples': 5365824, 'steps': 27946, 'loss/train': 1.3695404529571533}}} 11/07/2021 01:08:35 - INFO - __main__ - Step 27954: {'lr': 0.0004630129496078997, 'samples': 5367168, 'steps': 27953, 'loss/train': 1.6662789583206177}}} 11/07/2021 01:08:37 - INFO - __main__ - Step 27958: {'lr': 0.0004630018374199899, 'samples': 5367936, 'steps': 27957, 'loss/train': 1.4369566440582275}}} 11/07/2021 01:08:40 - INFO - __main__ - Step 27963: {'lr': 0.00046298794502566676, 'samples': 5368896, 'steps': 27962, 'loss/train': 1.5460844039916992}} 11/07/2021 01:08:42 - INFO - __main__ - Step 27967: {'lr': 0.00046297682938275733, 'samples': 5369664, 'steps': 27966, 'loss/train': 1.5414485931396484}} 11/07/2021 01:08:44 - INFO - __main__ - Step 27971: {'lr': 0.00046296571220442274, 'samples': 5370432, 'steps': 27970, 'loss/train': 1.4134701490402222}} 11/07/2021 01:08:46 - INFO - __main__ - Step 27975: {'lr': 0.00046295459349074316, 'samples': 5371200, 'steps': 27974, 'loss/train': 1.613280177116394}}} 11/07/2021 01:08:48 - INFO - __main__ - Step 27979: {'lr': 0.0004629434732417986, 'samples': 5371968, 'steps': 27978, 'loss/train': 1.2018623352050781}}} 11/07/2021 01:08:50 - INFO - __main__ - Step 27984: {'lr': 0.000462929570771774, 'samples': 5372928, 'steps': 27983, 'loss/train': 0.9213356375694275}}}} 11/07/2021 01:08:52 - INFO - __main__ - Step 27989: {'lr': 0.0004629156659031799, 'samples': 5373888, 'steps': 27988, 'loss/train': 1.552217721939087}}}} 11/07/2021 01:08:52 - INFO - __main__ - Step 27989: {'lr': 0.0004629156659031799, 'samples': 5373888, 'steps': 27988, 'loss/train': 1.552217721939087}}}} 11/07/2021 01:08:56 - INFO - __main__ - Step 27996: {'lr': 0.0004628961950578496, 'samples': 5375232, 'steps': 27995, 'loss/train': 1.5386420488357544}}} 11/07/2021 01:08:58 - INFO - __main__ - Step 28000: {'lr': 0.00046288506675008014, 'samples': 5376000, 'steps': 27999, 'loss/train': 1.262174367904663}}} 11/07/2021 01:09:00 - INFO - __main__ - Step 28004: {'lr': 0.0004628739369075471, 'samples': 5376768, 'steps': 28003, 'loss/train': 1.1522947549819946}}} 11/07/2021 01:09:00 - INFO - __main__ - Step 28004: {'lr': 0.0004628739369075471, 'samples': 5376768, 'steps': 28003, 'loss/train': 1.1522947549819946}}} 11/07/2021 01:09:03 - INFO - __main__ - Step 28011: {'lr': 0.00046285445599033063, 'samples': 5378112, 'steps': 28010, 'loss/train': 1.582600712776184}}} 11/07/2021 01:09:05 - INFO - __main__ - Step 28016: {'lr': 0.0004628405381721686, 'samples': 5379072, 'steps': 28015, 'loss/train': 1.4995726346969604}}} 11/07/2021 01:09:08 - INFO - __main__ - Step 28020: {'lr': 0.00046282940219138366, 'samples': 5379840, 'steps': 28019, 'loss/train': 1.337292194366455}}} 11/07/2021 01:09:10 - INFO - __main__ - Step 28025: {'lr': 0.00046281548005771476, 'samples': 5380800, 'steps': 28024, 'loss/train': 1.9095693826675415}} 11/07/2021 01:09:10 - INFO - __main__ - Step 28025: {'lr': 0.00046281548005771476, 'samples': 5380800, 'steps': 28024, 'loss/train': 1.9095693826675415}} 11/07/2021 01:09:13 - INFO - __main__ - Step 28032: {'lr': 0.0004627959850431759, 'samples': 5382144, 'steps': 28031, 'loss/train': 2.019195318222046}5}} 11/07/2021 01:09:15 - INFO - __main__ - Step 28036: {'lr': 0.00046278484292542346, 'samples': 5382912, 'steps': 28035, 'loss/train': 1.1731336116790771}} 11/07/2021 01:09:18 - INFO - __main__ - Step 28041: {'lr': 0.00046277091312099704, 'samples': 5383872, 'steps': 28040, 'loss/train': 1.6613887548446655}} 11/07/2021 01:09:18 - INFO - __main__ - Step 28041: {'lr': 0.00046277091312099704, 'samples': 5383872, 'steps': 28040, 'loss/train': 1.6613887548446655}} 11/07/2021 01:09:22 - INFO - __main__ - Step 28049: {'lr': 0.000462748620448673, 'samples': 5385408, 'steps': 28048, 'loss/train': 1.335105061531067}55}} 11/07/2021 01:09:23 - INFO - __main__ - Step 28053: {'lr': 0.0004627374718118009, 'samples': 5386176, 'steps': 28052, 'loss/train': 1.6419172286987305}}} 11/07/2021 01:09:25 - INFO - __main__ - Step 28057: {'lr': 0.0004627263216412292, 'samples': 5386944, 'steps': 28056, 'loss/train': 2.0487048625946045}}} 11/07/2021 01:09:28 - INFO - __main__ - Step 28062: {'lr': 0.00046271238177137216, 'samples': 5387904, 'steps': 28061, 'loss/train': 1.3745687007904053}} 11/07/2021 01:09:30 - INFO - __main__ - Step 28066: {'lr': 0.0004627012281502704, 'samples': 5388672, 'steps': 28065, 'loss/train': 1.640199065208435}3}} 11/07/2021 01:09:32 - INFO - __main__ - Step 28070: {'lr': 0.0004626900729957305, 'samples': 5389440, 'steps': 28069, 'loss/train': 1.433807611465454}3}} 11/07/2021 01:09:32 - INFO - __main__ - Step 28070: {'lr': 0.0004626900729957305, 'samples': 5389440, 'steps': 28069, 'loss/train': 1.433807611465454}3}} 11/07/2021 01:09:35 - INFO - __main__ - Step 28077: {'lr': 0.00046267054778569163, 'samples': 5390784, 'steps': 28076, 'loss/train': 1.3834558725357056}} 11/07/2021 01:09:38 - INFO - __main__ - Step 28082: {'lr': 0.00046265659833228523, 'samples': 5391744, 'steps': 28081, 'loss/train': 1.5451849699020386}} 11/07/2021 01:09:40 - INFO - __main__ - Step 28087: {'lr': 0.0004626426464833844, 'samples': 5392704, 'steps': 28086, 'loss/train': 1.1907124519348145}}} 11/07/2021 01:09:40 - INFO - __main__ - Step 28087: {'lr': 0.0004626426464833844, 'samples': 5392704, 'steps': 28086, 'loss/train': 1.1907124519348145}}} 11/07/2021 01:09:43 - INFO - __main__ - Step 28094: {'lr': 0.00046262310987079156, 'samples': 5394048, 'steps': 28093, 'loss/train': 1.2868162393569946}} 11/07/2021 01:09:46 - INFO - __main__ - Step 28099: {'lr': 0.00046260915227334503, 'samples': 5395008, 'steps': 28098, 'loss/train': 1.442736029624939}}} 11/07/2021 01:09:48 - INFO - __main__ - Step 28103: {'lr': 0.00046259798447100903, 'samples': 5395776, 'steps': 28102, 'loss/train': 0.6677843332290649}} 11/07/2021 01:09:50 - INFO - __main__ - Step 28108: {'lr': 0.0004625840225627476, 'samples': 5396736, 'steps': 28107, 'loss/train': 1.8279814720153809}}} 11/07/2021 01:09:52 - INFO - __main__ - Step 28112: {'lr': 0.0004625728513119635, 'samples': 5397504, 'steps': 28111, 'loss/train': 1.454695463180542}}}} 11/07/2021 01:09:52 - INFO - __main__ - Step 28112: {'lr': 0.0004625728513119635, 'samples': 5397504, 'steps': 28111, 'loss/train': 1.454695463180542}}}} 11/07/2021 01:09:55 - INFO - __main__ - Step 28119: {'lr': 0.0004625532979355309, 'samples': 5398848, 'steps': 28118, 'loss/train': 1.590772271156311}}}} 11/07/2021 01:09:58 - INFO - __main__ - Step 28124: {'lr': 0.0004625393283648568, 'samples': 5399808, 'steps': 28123, 'loss/train': 1.4818974733352661}}} 11/07/2021 01:10:00 - INFO - __main__ - Step 28129: {'lr': 0.0004625253564000092, 'samples': 5400768, 'steps': 28128, 'loss/train': 2.1188430786132812}}} 11/07/2021 01:10:00 - INFO - __main__ - Step 28129: {'lr': 0.0004625253564000092, 'samples': 5400768, 'steps': 28128, 'loss/train': 2.1188430786132812}}} 11/07/2021 01:10:04 - INFO - __main__ - Step 28136: {'lr': 0.0004625057916273107, 'samples': 5402112, 'steps': 28135, 'loss/train': 1.384974479675293}}}} 11/07/2021 01:10:06 - INFO - __main__ - Step 28140: {'lr': 0.00046249460965062917, 'samples': 5402880, 'steps': 28139, 'loss/train': 0.8357280492782593}} 11/07/2021 01:10:08 - INFO - __main__ - Step 28145: {'lr': 0.000462480630025484, 'samples': 5403840, 'steps': 28144, 'loss/train': 1.7505838871002197}3}} 11/07/2021 01:10:10 - INFO - __main__ - Step 28150: {'lr': 0.0004624666480068265, 'samples': 5404800, 'steps': 28149, 'loss/train': 1.492389440536499}3}} 11/07/2021 01:10:10 - INFO - __main__ - Step 28150: {'lr': 0.0004624666480068265, 'samples': 5404800, 'steps': 28149, 'loss/train': 1.492389440536499}3}} 11/07/2021 01:10:14 - INFO - __main__ - Step 28157: {'lr': 0.0004624470691599052, 'samples': 5406144, 'steps': 28156, 'loss/train': 1.7866506576538086}}} 11/07/2021 01:10:16 - INFO - __main__ - Step 28161: {'lr': 0.00046243587914139285, 'samples': 5406912, 'steps': 28160, 'loss/train': 1.6217018365859985}} 11/07/2021 01:10:18 - INFO - __main__ - Step 28166: {'lr': 0.00046242188946455444, 'samples': 5407872, 'steps': 28165, 'loss/train': 1.6223536729812622}} 11/07/2021 01:10:20 - INFO - __main__ - Step 28170: {'lr': 0.0004624106960002237, 'samples': 5408640, 'steps': 28169, 'loss/train': 1.0919755697250366}}} 11/07/2021 01:10:22 - INFO - __main__ - Step 28174: {'lr': 0.0004623995010045493, 'samples': 5409408, 'steps': 28173, 'loss/train': 0.9593502879142761}}} 11/07/2021 01:10:25 - INFO - __main__ - Step 28178: {'lr': 0.00046238830447761184, 'samples': 5410176, 'steps': 28177, 'loss/train': 1.2456005811691284}} 11/07/2021 01:10:26 - INFO - __main__ - Step 28182: {'lr': 0.0004623771064194921, 'samples': 5410944, 'steps': 28181, 'loss/train': 1.347800612449646}4}} 11/07/2021 01:10:28 - INFO - __main__ - Step 28186: {'lr': 0.0004623659068302708, 'samples': 5411712, 'steps': 28185, 'loss/train': 1.4337131977081299}}} 11/07/2021 01:10:31 - INFO - __main__ - Step 28191: {'lr': 0.00046235190519075564, 'samples': 5412672, 'steps': 28190, 'loss/train': 1.4072515964508057}} 11/07/2021 01:10:31 - INFO - __main__ - Step 28191: {'lr': 0.00046235190519075564, 'samples': 5412672, 'steps': 28190, 'loss/train': 1.4072515964508057}} 11/07/2021 01:10:34 - INFO - __main__ - Step 28198: {'lr': 0.00046233229887680517, 'samples': 5414016, 'steps': 28197, 'loss/train': 1.3359768390655518}} 11/07/2021 01:10:36 - INFO - __main__ - Step 28203: {'lr': 0.00046231829149660553, 'samples': 5414976, 'steps': 28202, 'loss/train': 1.7517107725143433}} 11/07/2021 01:10:36 - INFO - __main__ - Step 28203: {'lr': 0.00046231829149660553, 'samples': 5414976, 'steps': 28202, 'loss/train': 1.7517107725143433}} 11/07/2021 01:10:41 - INFO - __main__ - Step 28211: {'lr': 0.0004622958747136498, 'samples': 5416512, 'steps': 28210, 'loss/train': 1.35165536403656}33}} 11/07/2021 01:10:43 - INFO - __main__ - Step 28215: {'lr': 0.00046228466402635764, 'samples': 5417280, 'steps': 28214, 'loss/train': 1.9090080261230469}} 11/07/2021 01:10:44 - INFO - __main__ - Step 28219: {'lr': 0.0004622734518086304, 'samples': 5418048, 'steps': 28218, 'loss/train': 1.9097330570220947}}} 11/07/2021 01:10:46 - INFO - __main__ - Step 28223: {'lr': 0.0004622622380605489, 'samples': 5418816, 'steps': 28222, 'loss/train': 1.0220810174942017}}} 11/07/2021 01:10:49 - INFO - __main__ - Step 28228: {'lr': 0.0004622482187235094, 'samples': 5419776, 'steps': 28227, 'loss/train': 1.6979622840881348}}} 11/07/2021 01:10:51 - INFO - __main__ - Step 28232: {'lr': 0.0004622370015324264, 'samples': 5420544, 'steps': 28231, 'loss/train': 1.5277044773101807}}} 11/07/2021 01:10:53 - INFO - __main__ - Step 28236: {'lr': 0.00046222578281125194, 'samples': 5421312, 'steps': 28235, 'loss/train': 1.3534091711044312}} 11/07/2021 01:10:54 - INFO - __main__ - Step 28240: {'lr': 0.0004622145625600668, 'samples': 5422080, 'steps': 28239, 'loss/train': 1.3777828216552734}}} 11/07/2021 01:10:56 - INFO - __main__ - Step 28244: {'lr': 0.000462203340778952, 'samples': 5422848, 'steps': 28243, 'loss/train': 1.1842589378356934}}}} 11/07/2021 01:10:59 - INFO - __main__ - Step 28249: {'lr': 0.000462189311401218, 'samples': 5423808, 'steps': 28248, 'loss/train': 0.5545594096183777}}}} 11/07/2021 01:11:01 - INFO - __main__ - Step 28253: {'lr': 0.0004621780861780572, 'samples': 5424576, 'steps': 28252, 'loss/train': 1.4097659587860107}}} 11/07/2021 01:11:03 - INFO - __main__ - Step 28257: {'lr': 0.00046216685942522957, 'samples': 5425344, 'steps': 28256, 'loss/train': 1.0055487155914307}} 11/07/2021 01:11:04 - INFO - __main__ - Step 28261: {'lr': 0.00046215563114281613, 'samples': 5426112, 'steps': 28260, 'loss/train': 1.4386929273605347}} 11/07/2021 01:11:07 - INFO - __main__ - Step 28265: {'lr': 0.0004621444013308979, 'samples': 5426880, 'steps': 28264, 'loss/train': 1.032285451889038}7}} 11/07/2021 01:11:09 - INFO - __main__ - Step 28269: {'lr': 0.0004621331699895557, 'samples': 5427648, 'steps': 28268, 'loss/train': 1.922584056854248}7}} 11/07/2021 01:11:11 - INFO - __main__ - Step 28273: {'lr': 0.0004621219371188706, 'samples': 5428416, 'steps': 28272, 'loss/train': 1.6785149574279785}}} 11/07/2021 01:11:13 - INFO - __main__ - Step 28277: {'lr': 0.00046211070271892353, 'samples': 5429184, 'steps': 28276, 'loss/train': 1.8365651369094849}} 11/07/2021 01:11:14 - INFO - __main__ - Step 28281: {'lr': 0.0004620994667897955, 'samples': 5429952, 'steps': 28280, 'loss/train': 1.7610883712768555}}} 11/07/2021 01:11:16 - INFO - __main__ - Step 28285: {'lr': 0.00046208822933156756, 'samples': 5430720, 'steps': 28284, 'loss/train': 1.523167371749878}}} 11/07/2021 01:11:19 - INFO - __main__ - Step 28291: {'lr': 0.00046207137027734046, 'samples': 5431872, 'steps': 28290, 'loss/train': 1.3798640966415405}} 11/07/2021 01:11:21 - INFO - __main__ - Step 28295: {'lr': 0.00046206012899671715, 'samples': 5432640, 'steps': 28294, 'loss/train': 1.629433035850525}}} 11/07/2021 01:11:21 - INFO - __main__ - Step 28295: {'lr': 0.00046206012899671715, 'samples': 5432640, 'steps': 28294, 'loss/train': 1.629433035850525}}} 11/07/2021 01:11:24 - INFO - __main__ - Step 28301: {'lr': 0.0004620432642092768, 'samples': 5433792, 'steps': 28300, 'loss/train': 0.30110833048820496}} 11/07/2021 01:11:24 - INFO - __main__ - Step 28301: {'lr': 0.0004620432642092768, 'samples': 5433792, 'steps': 28300, 'loss/train': 0.30110833048820496}} 11/07/2021 01:11:29 - INFO - __main__ - Step 28309: {'lr': 0.0004620207724756386, 'samples': 5435328, 'steps': 28308, 'loss/train': 0.8543437719345093}}} 11/07/2021 01:11:29 - INFO - __main__ - Step 28309: {'lr': 0.0004620207724756386, 'samples': 5435328, 'steps': 28308, 'loss/train': 0.8543437719345093}}} 11/07/2021 01:11:32 - INFO - __main__ - Step 28316: {'lr': 0.00046200108719318537, 'samples': 5436672, 'steps': 28315, 'loss/train': 1.4547991752624512}} 11/07/2021 01:11:34 - INFO - __main__ - Step 28321: {'lr': 0.00046198702341138944, 'samples': 5437632, 'steps': 28320, 'loss/train': 1.7438561916351318}} 11/07/2021 01:11:37 - INFO - __main__ - Step 28326: {'lr': 0.0004619729572416415, 'samples': 5438592, 'steps': 28325, 'loss/train': 1.8733631372451782}}} 11/07/2021 01:11:39 - INFO - __main__ - Step 28330: {'lr': 0.0004619617025866242, 'samples': 5439360, 'steps': 28329, 'loss/train': 0.6419276595115662}}} 11/07/2021 01:11:39 - INFO - __main__ - Step 28330: {'lr': 0.0004619617025866242, 'samples': 5439360, 'steps': 28329, 'loss/train': 0.6419276595115662}}} 11/07/2021 01:11:42 - INFO - __main__ - Step 28337: {'lr': 0.0004619420032633857, 'samples': 5440704, 'steps': 28336, 'loss/train': 1.5093754529953003}}} 11/07/2021 01:11:44 - INFO - __main__ - Step 28342: {'lr': 0.0004619279294532561, 'samples': 5441664, 'steps': 28341, 'loss/train': 1.285418152809143}}}} 11/07/2021 01:11:47 - INFO - __main__ - Step 28346: {'lr': 0.0004619166686862987, 'samples': 5442432, 'steps': 28345, 'loss/train': 1.6994646787643433}}} 11/07/2021 01:11:47 - INFO - __main__ - Step 28346: {'lr': 0.0004619166686862987, 'samples': 5442432, 'steps': 28345, 'loss/train': 1.6994646787643433}}} 11/07/2021 01:11:50 - INFO - __main__ - Step 28353: {'lr': 0.00046189695866794635, 'samples': 5443776, 'steps': 28352, 'loss/train': 1.3247753381729126}} 11/07/2021 01:11:53 - INFO - __main__ - Step 28359: {'lr': 0.0004618800606428626, 'samples': 5444928, 'steps': 28358, 'loss/train': 1.3694254159927368}}} 11/07/2021 01:11:53 - INFO - __main__ - Step 28359: {'lr': 0.0004618800606428626, 'samples': 5444928, 'steps': 28358, 'loss/train': 1.3694254159927368}}} 11/07/2021 01:11:57 - INFO - __main__ - Step 28366: {'lr': 0.0004618603419364042, 'samples': 5446272, 'steps': 28365, 'loss/train': 1.3224831819534302}}} 11/07/2021 01:11:58 - INFO - __main__ - Step 28370: {'lr': 0.0004618490720039723, 'samples': 5447040, 'steps': 28369, 'loss/train': 1.5186891555786133}}} 11/07/2021 01:12:00 - INFO - __main__ - Step 28374: {'lr': 0.00046183780054424574, 'samples': 5447808, 'steps': 28373, 'loss/train': 1.1196340322494507}} 11/07/2021 01:12:03 - INFO - __main__ - Step 28379: {'lr': 0.00046182370907195294, 'samples': 5448768, 'steps': 28378, 'loss/train': 1.7771023511886597}} 11/07/2021 01:12:03 - INFO - __main__ - Step 28379: {'lr': 0.00046182370907195294, 'samples': 5448768, 'steps': 28378, 'loss/train': 1.7771023511886597}} 11/07/2021 01:12:06 - INFO - __main__ - Step 28386: {'lr': 0.00046180397700210985, 'samples': 5450112, 'steps': 28385, 'loss/train': 1.4372633695602417}} 11/07/2021 01:12:08 - INFO - __main__ - Step 28390: {'lr': 0.00046179269943401693, 'samples': 5450880, 'steps': 28389, 'loss/train': 1.8363102674484253}} 11/07/2021 01:12:11 - INFO - __main__ - Step 28395: {'lr': 0.0004617786003267235, 'samples': 5451840, 'steps': 28394, 'loss/train': 1.5543104410171509}}} 11/07/2021 01:12:11 - INFO - __main__ - Step 28395: {'lr': 0.0004617786003267235, 'samples': 5451840, 'steps': 28394, 'loss/train': 1.5543104410171509}}} 11/07/2021 01:12:15 - INFO - __main__ - Step 28403: {'lr': 0.00046175603679306324, 'samples': 5453376, 'steps': 28402, 'loss/train': 1.4348992109298706}} 11/07/2021 01:12:15 - INFO - __main__ - Step 28403: {'lr': 0.00046175603679306324, 'samples': 5453376, 'steps': 28402, 'loss/train': 1.4348992109298706}} 11/07/2021 01:12:18 - INFO - __main__ - Step 28410: {'lr': 0.0004617362886918531, 'samples': 5454720, 'steps': 28409, 'loss/train': 1.8714888095855713}}} 11/07/2021 01:12:21 - INFO - __main__ - Step 28416: {'lr': 0.0004617193580271433, 'samples': 5455872, 'steps': 28415, 'loss/train': 3.148294687271118}}}} 11/07/2021 01:12:23 - INFO - __main__ - Step 28420: {'lr': 0.0004617080690093701, 'samples': 5456640, 'steps': 28419, 'loss/train': 1.2667416334152222}}} 11/07/2021 01:12:23 - INFO - __main__ - Step 28420: {'lr': 0.0004617080690093701, 'samples': 5456640, 'steps': 28419, 'loss/train': 1.2667416334152222}}} 11/07/2021 01:12:26 - INFO - __main__ - Step 28427: {'lr': 0.0004616883095557092, 'samples': 5457984, 'steps': 28426, 'loss/train': 1.5702557563781738}}} 11/07/2021 01:12:28 - INFO - __main__ - Step 28431: {'lr': 0.0004616770163408669, 'samples': 5458752, 'steps': 28430, 'loss/train': 1.5220882892608643}}} 11/07/2021 01:12:28 - INFO - __main__ - Step 28431: {'lr': 0.0004616770163408669, 'samples': 5458752, 'steps': 28430, 'loss/train': 1.5220882892608643}}} 11/07/2021 01:12:33 - INFO - __main__ - Step 28440: {'lr': 0.00046165160102795943, 'samples': 5460480, 'steps': 28439, 'loss/train': 0.8910706639289856}} 11/07/2021 01:12:34 - INFO - __main__ - Step 28444: {'lr': 0.0004616403028537382, 'samples': 5461248, 'steps': 28443, 'loss/train': 1.9984899759292603}}} 11/07/2021 01:12:37 - INFO - __main__ - Step 28448: {'lr': 0.0004616290031537273, 'samples': 5462016, 'steps': 28447, 'loss/train': 1.434464454650879}}}} 11/07/2021 01:12:39 - INFO - __main__ - Step 28453: {'lr': 0.000461614876383196, 'samples': 5462976, 'steps': 28452, 'loss/train': 1.8666967153549194}}}} 11/07/2021 01:12:41 - INFO - __main__ - Step 28457: {'lr': 0.0004616035732504562, 'samples': 5463744, 'steps': 28456, 'loss/train': 1.5295157432556152}}} 11/07/2021 01:12:41 - INFO - __main__ - Step 28457: {'lr': 0.0004616035732504562, 'samples': 5463744, 'steps': 28456, 'loss/train': 1.5295157432556152}}} 11/07/2021 01:12:44 - INFO - __main__ - Step 28463: {'lr': 0.0004615866156910128, 'samples': 5464896, 'steps': 28462, 'loss/train': 1.6727573871612549}}} 11/07/2021 01:12:47 - INFO - __main__ - Step 28468: {'lr': 0.00046157248176967915, 'samples': 5465856, 'steps': 28467, 'loss/train': 1.5455939769744873}} 11/07/2021 01:12:49 - INFO - __main__ - Step 28473: {'lr': 0.0004615583454650632, 'samples': 5466816, 'steps': 28472, 'loss/train': 1.3618167638778687}}} 11/07/2021 01:12:49 - INFO - __main__ - Step 28473: {'lr': 0.0004615583454650632, 'samples': 5466816, 'steps': 28472, 'loss/train': 1.3618167638778687}}} 11/07/2021 01:12:53 - INFO - __main__ - Step 28481: {'lr': 0.00046153572242084776, 'samples': 5468352, 'steps': 28480, 'loss/train': 0.9786882400512695}} 11/07/2021 01:12:55 - INFO - __main__ - Step 28485: {'lr': 0.0004615244086111456, 'samples': 5469120, 'steps': 28484, 'loss/train': 1.76587975025177}95}} 11/07/2021 01:12:57 - INFO - __main__ - Step 28489: {'lr': 0.0004615130932764894, 'samples': 5469888, 'steps': 28488, 'loss/train': 1.659226655960083}5}} 11/07/2021 01:12:59 - INFO - __main__ - Step 28494: {'lr': 0.00046149894696382655, 'samples': 5470848, 'steps': 28493, 'loss/train': 1.8293430805206299}} 11/07/2021 01:12:59 - INFO - __main__ - Step 28494: {'lr': 0.00046149894696382655, 'samples': 5470848, 'steps': 28493, 'loss/train': 1.8293430805206299}} 11/07/2021 01:13:03 - INFO - __main__ - Step 28501: {'lr': 0.00046147913812361155, 'samples': 5472192, 'steps': 28500, 'loss/train': 1.4224140644073486}} 11/07/2021 01:13:04 - INFO - __main__ - Step 28505: {'lr': 0.00046146781668995456, 'samples': 5472960, 'steps': 28504, 'loss/train': 1.5621429681777954}} 11/07/2021 01:13:06 - INFO - __main__ - Step 28509: {'lr': 0.00046145649373175145, 'samples': 5473728, 'steps': 28508, 'loss/train': 1.8721892833709717}} 11/07/2021 01:13:09 - INFO - __main__ - Step 28514: {'lr': 0.0004614423378902289, 'samples': 5474688, 'steps': 28513, 'loss/train': 1.4837170839309692}}} 11/07/2021 01:13:09 - INFO - __main__ - Step 28514: {'lr': 0.0004614423378902289, 'samples': 5474688, 'steps': 28513, 'loss/train': 1.4837170839309692}}} 11/07/2021 01:13:12 - INFO - __main__ - Step 28520: {'lr': 0.0004614253477364182, 'samples': 5475840, 'steps': 28519, 'loss/train': 1.642098069190979}}}} 11/07/2021 01:13:15 - INFO - __main__ - Step 28525: {'lr': 0.0004614111866551101, 'samples': 5476800, 'steps': 28524, 'loss/train': 1.7257108688354492}}} 11/07/2021 01:13:17 - INFO - __main__ - Step 28529: {'lr': 0.00046139985607540087, 'samples': 5477568, 'steps': 28528, 'loss/train': 1.7291768789291382}} 11/07/2021 01:13:19 - INFO - __main__ - Step 28533: {'lr': 0.00046138852397163547, 'samples': 5478336, 'steps': 28532, 'loss/train': 1.5962907075881958}} 11/07/2021 01:13:20 - INFO - __main__ - Step 28537: {'lr': 0.0004613771903438955, 'samples': 5479104, 'steps': 28536, 'loss/train': 1.6306997537612915}}} 11/07/2021 01:13:20 - INFO - __main__ - Step 28537: {'lr': 0.0004613771903438955, 'samples': 5479104, 'steps': 28536, 'loss/train': 1.6306997537612915}}} 11/07/2021 01:13:25 - INFO - __main__ - Step 28544: {'lr': 0.00046135735282853263, 'samples': 5480448, 'steps': 28543, 'loss/train': 1.8351638317108154}} 11/07/2021 01:13:26 - INFO - __main__ - Step 28548: {'lr': 0.00046134601501028404, 'samples': 5481216, 'steps': 28547, 'loss/train': 1.278522253036499}}} 11/07/2021 01:13:29 - INFO - __main__ - Step 28552: {'lr': 0.0004613346756683675, 'samples': 5481984, 'steps': 28551, 'loss/train': 1.5659213066101074}}} 11/07/2021 01:13:29 - INFO - __main__ - Step 28552: {'lr': 0.0004613346756683675, 'samples': 5481984, 'steps': 28551, 'loss/train': 1.5659213066101074}}} 11/07/2021 01:13:33 - INFO - __main__ - Step 28560: {'lr': 0.00046131199241385726, 'samples': 5483520, 'steps': 28559, 'loss/train': 1.786787986755371}}} 11/07/2021 01:13:34 - INFO - __main__ - Step 28564: {'lr': 0.00046130064850142703, 'samples': 5484288, 'steps': 28563, 'loss/train': 1.3572795391082764}} 11/07/2021 01:13:36 - INFO - __main__ - Step 28568: {'lr': 0.0004612893030656559, 'samples': 5485056, 'steps': 28567, 'loss/train': 1.6528112888336182}}} 11/07/2021 01:13:39 - INFO - __main__ - Step 28573: {'lr': 0.0004612751191288682, 'samples': 5486016, 'steps': 28572, 'loss/train': 1.5951439142227173}}} 11/07/2021 01:13:41 - INFO - __main__ - Step 28577: {'lr': 0.00046126377026587897, 'samples': 5486784, 'steps': 28576, 'loss/train': 1.7694735527038574}} 11/07/2021 01:13:43 - INFO - __main__ - Step 28581: {'lr': 0.00046125241987981445, 'samples': 5487552, 'steps': 28580, 'loss/train': 1.4468891620635986}} 11/07/2021 01:13:45 - INFO - __main__ - Step 28585: {'lr': 0.00046124106797075683, 'samples': 5488320, 'steps': 28584, 'loss/train': 1.6580777168273926}} 11/07/2021 01:13:46 - INFO - __main__ - Step 28589: {'lr': 0.0004612297145387876, 'samples': 5489088, 'steps': 28588, 'loss/train': 2.1595003604888916}}} 11/07/2021 01:13:49 - INFO - __main__ - Step 28594: {'lr': 0.0004612155206073566, 'samples': 5490048, 'steps': 28593, 'loss/train': 1.3600523471832275}}} 11/07/2021 01:13:49 - INFO - __main__ - Step 28594: {'lr': 0.0004612155206073566, 'samples': 5490048, 'steps': 28593, 'loss/train': 1.3600523471832275}}} 11/07/2021 01:13:49 - INFO - __main__ - Step 28594: {'lr': 0.0004612155206073566, 'samples': 5490048, 'steps': 28593, 'loss/train': 1.3600523471832275}}} 11/07/2021 01:13:54 - INFO - __main__ - Step 28605: {'lr': 0.0004611842855834336, 'samples': 5492160, 'steps': 28604, 'loss/train': 1.6320688724517822}}} 11/07/2021 01:13:57 - INFO - __main__ - Step 28610: {'lr': 0.00046117008403892925, 'samples': 5493120, 'steps': 28609, 'loss/train': 1.485178828239441}}} 11/07/2021 01:13:57 - INFO - __main__ - Step 28610: {'lr': 0.00046117008403892925, 'samples': 5493120, 'steps': 28609, 'loss/train': 1.485178828239441}}} 11/07/2021 01:14:01 - INFO - __main__ - Step 28618: {'lr': 0.00046114735661998975, 'samples': 5494656, 'steps': 28617, 'loss/train': 0.9758543372154236}} 11/07/2021 01:14:03 - INFO - __main__ - Step 28622: {'lr': 0.0004611359906271253, 'samples': 5495424, 'steps': 28621, 'loss/train': 1.5294830799102783}}} 11/07/2021 01:14:05 - INFO - __main__ - Step 28626: {'lr': 0.0004611246231121069, 'samples': 5496192, 'steps': 28625, 'loss/train': 1.6356327533721924}}} 11/07/2021 01:14:07 - INFO - __main__ - Step 28631: {'lr': 0.00046111041157792987, 'samples': 5497152, 'steps': 28630, 'loss/train': 2.1358513832092285}} 11/07/2021 01:14:09 - INFO - __main__ - Step 28635: {'lr': 0.0004610990406383648, 'samples': 5497920, 'steps': 28634, 'loss/train': 1.223016619682312}5}} 11/07/2021 01:14:09 - INFO - __main__ - Step 28635: {'lr': 0.0004610990406383648, 'samples': 5497920, 'steps': 28634, 'loss/train': 1.223016619682312}5}} 11/07/2021 01:14:13 - INFO - __main__ - Step 28642: {'lr': 0.0004610791378321335, 'samples': 5499264, 'steps': 28641, 'loss/train': 1.575751543045044}5}} 11/07/2021 01:14:15 - INFO - __main__ - Step 28647: {'lr': 0.0004610649186886725, 'samples': 5500224, 'steps': 28646, 'loss/train': 1.5664775371551514}}} 11/07/2021 01:14:15 - INFO - __main__ - Step 28647: {'lr': 0.0004610649186886725, 'samples': 5500224, 'steps': 28646, 'loss/train': 1.5664775371551514}}} 11/07/2021 01:14:15 - INFO - __main__ - Step 28647: {'lr': 0.0004610649186886725, 'samples': 5500224, 'steps': 28646, 'loss/train': 1.5664775371551514}}} 11/07/2021 01:14:21 - INFO - __main__ - Step 28658: {'lr': 0.00046103362820425567, 'samples': 5502336, 'steps': 28657, 'loss/train': 1.311152696609497}}} 11/07/2021 01:14:23 - INFO - __main__ - Step 28663: {'lr': 0.000461019401453151, 'samples': 5503296, 'steps': 28662, 'loss/train': 2.106566905975342}7}}} 11/07/2021 01:14:23 - INFO - __main__ - Step 28663: {'lr': 0.000461019401453151, 'samples': 5503296, 'steps': 28662, 'loss/train': 2.106566905975342}7}}} 11/07/2021 01:14:28 - INFO - __main__ - Step 28671: {'lr': 0.0004609966337071819, 'samples': 5504832, 'steps': 28670, 'loss/train': 1.561232089996338}}}} 11/07/2021 01:14:29 - INFO - __main__ - Step 28675: {'lr': 0.00046098524755243246, 'samples': 5505600, 'steps': 28674, 'loss/train': 1.3095825910568237}} 11/07/2021 01:14:31 - INFO - __main__ - Step 28679: {'lr': 0.00046097385987661576, 'samples': 5506368, 'steps': 28678, 'loss/train': 0.7278133630752563}} 11/07/2021 01:14:31 - INFO - __main__ - Step 28679: {'lr': 0.00046097385987661576, 'samples': 5506368, 'steps': 28678, 'loss/train': 0.7278133630752563}} 11/07/2021 01:14:35 - INFO - __main__ - Step 28687: {'lr': 0.0004609510799621095, 'samples': 5507904, 'steps': 28686, 'loss/train': 1.7127150297164917}}} 11/07/2021 01:14:35 - INFO - __main__ - Step 28687: {'lr': 0.0004609510799621095, 'samples': 5507904, 'steps': 28686, 'loss/train': 1.7127150297164917}}} 11/07/2021 01:14:39 - INFO - __main__ - Step 28694: {'lr': 0.00046093114254670066, 'samples': 5509248, 'steps': 28693, 'loss/train': 0.881278395652771}}} 11/07/2021 01:14:41 - INFO - __main__ - Step 28700: {'lr': 0.000460914049626826, 'samples': 5510400, 'steps': 28699, 'loss/train': 1.46780526638031}71}}} 11/07/2021 01:14:43 - INFO - __main__ - Step 28704: {'lr': 0.0004609026524462002, 'samples': 5511168, 'steps': 28703, 'loss/train': 1.540532112121582}}}} 11/07/2021 01:14:45 - INFO - __main__ - Step 28708: {'lr': 0.0004608912537451027, 'samples': 5511936, 'steps': 28707, 'loss/train': 1.765599012374878}}}} 11/07/2021 01:14:47 - INFO - __main__ - Step 28712: {'lr': 0.0004608798535236156, 'samples': 5512704, 'steps': 28711, 'loss/train': 1.5437874794006348}}} 11/07/2021 01:14:49 - INFO - __main__ - Step 28716: {'lr': 0.00046086845178182123, 'samples': 5513472, 'steps': 28715, 'loss/train': 1.343458652496338}}} 11/07/2021 01:14:51 - INFO - __main__ - Step 28721: {'lr': 0.00046085419746677136, 'samples': 5514432, 'steps': 28720, 'loss/train': 1.8643708229064941}} 11/07/2021 01:14:51 - INFO - __main__ - Step 28721: {'lr': 0.00046085419746677136, 'samples': 5514432, 'steps': 28720, 'loss/train': 1.8643708229064941}} 11/07/2021 01:14:55 - INFO - __main__ - Step 28728: {'lr': 0.0004608342374354162, 'samples': 5515776, 'steps': 28727, 'loss/train': 1.8156870603561401}}} 11/07/2021 01:14:57 - INFO - __main__ - Step 28732: {'lr': 0.00046082282961321466, 'samples': 5516544, 'steps': 28731, 'loss/train': 1.5800282955169678}} 11/07/2021 01:14:59 - INFO - __main__ - Step 28737: {'lr': 0.0004608085676981182, 'samples': 5517504, 'steps': 28736, 'loss/train': 1.891564965248108}8}} 11/07/2021 01:14:59 - INFO - __main__ - Step 28737: {'lr': 0.0004608085676981182, 'samples': 5517504, 'steps': 28736, 'loss/train': 1.891564965248108}8}} 11/07/2021 01:14:59 - INFO - __main__ - Step 28737: {'lr': 0.0004608085676981182, 'samples': 5517504, 'steps': 28736, 'loss/train': 1.891564965248108}8}} 11/07/2021 01:15:05 - INFO - __main__ - Step 28747: {'lr': 0.00046078003674405457, 'samples': 5519424, 'steps': 28746, 'loss/train': 1.357011079788208}}} 11/07/2021 01:15:07 - INFO - __main__ - Step 28752: {'lr': 0.00046076576770540865, 'samples': 5520384, 'steps': 28751, 'loss/train': 1.7125825881958008}} 11/07/2021 01:15:07 - INFO - __main__ - Step 28752: {'lr': 0.00046076576770540865, 'samples': 5520384, 'steps': 28751, 'loss/train': 1.7125825881958008}} 11/07/2021 01:15:11 - INFO - __main__ - Step 28760: {'lr': 0.0004607429323053164, 'samples': 5521920, 'steps': 28759, 'loss/train': 1.9364488124847412}}} 11/07/2021 01:15:13 - INFO - __main__ - Step 28764: {'lr': 0.0004607315123262488, 'samples': 5522688, 'steps': 28763, 'loss/train': 1.6237186193466187}}} 11/07/2021 01:15:15 - INFO - __main__ - Step 28768: {'lr': 0.00046072009082794333, 'samples': 5523456, 'steps': 28767, 'loss/train': 1.846705436706543}}} 11/07/2021 01:15:17 - INFO - __main__ - Step 28773: {'lr': 0.0004607058118187586, 'samples': 5524416, 'steps': 28772, 'loss/train': 1.6379915475845337}}} 11/07/2021 01:15:20 - INFO - __main__ - Step 28778: {'lr': 0.0004606915304360542, 'samples': 5525376, 'steps': 28777, 'loss/train': 1.768597960472107}}}} 11/07/2021 01:15:20 - INFO - __main__ - Step 28778: {'lr': 0.0004606915304360542, 'samples': 5525376, 'steps': 28777, 'loss/train': 1.768597960472107}}}} 11/07/2021 01:15:23 - INFO - __main__ - Step 28785: {'lr': 0.00046067153251306127, 'samples': 5526720, 'steps': 28784, 'loss/train': 1.8561345338821411}} 11/07/2021 01:15:26 - INFO - __main__ - Step 28790: {'lr': 0.0004606572454345661, 'samples': 5527680, 'steps': 28789, 'loss/train': 1.6745975017547607}}} 11/07/2021 01:15:28 - INFO - __main__ - Step 28795: {'lr': 0.0004606429559830982, 'samples': 5528640, 'steps': 28794, 'loss/train': 1.8930948972702026}}} 11/07/2021 01:15:28 - INFO - __main__ - Step 28795: {'lr': 0.0004606429559830982, 'samples': 5528640, 'steps': 28794, 'loss/train': 1.8930948972702026}}} 11/07/2021 01:15:32 - INFO - __main__ - Step 28802: {'lr': 0.00046062294676475584, 'samples': 5529984, 'steps': 28801, 'loss/train': 1.5560965538024902}} 11/07/2021 01:15:33 - INFO - __main__ - Step 28806: {'lr': 0.00046061151083779886, 'samples': 5530752, 'steps': 28805, 'loss/train': 1.1285860538482666}} 11/07/2021 01:15:33 - INFO - __main__ - Step 28806: {'lr': 0.00046061151083779886, 'samples': 5530752, 'steps': 28805, 'loss/train': 1.1285860538482666}} 11/07/2021 01:15:38 - INFO - __main__ - Step 28814: {'lr': 0.0004605886344288489, 'samples': 5532288, 'steps': 28813, 'loss/train': 1.0264168977737427}}} 11/07/2021 01:15:40 - INFO - __main__ - Step 28818: {'lr': 0.00046057719394702103, 'samples': 5533056, 'steps': 28817, 'loss/train': 1.4326366186141968}} 11/07/2021 01:15:41 - INFO - __main__ - Step 28822: {'lr': 0.00046056575194706773, 'samples': 5533824, 'steps': 28821, 'loss/train': 1.3864659070968628}} 11/07/2021 01:15:44 - INFO - __main__ - Step 28826: {'lr': 0.0004605543084290716, 'samples': 5534592, 'steps': 28825, 'loss/train': 1.4224810600280762}}} 11/07/2021 01:15:44 - INFO - __main__ - Step 28826: {'lr': 0.0004605543084290716, 'samples': 5534592, 'steps': 28825, 'loss/train': 1.4224810600280762}}} 11/07/2021 01:15:48 - INFO - __main__ - Step 28834: {'lr': 0.0004605314168392809, 'samples': 5536128, 'steps': 28833, 'loss/train': 1.3571157455444336}}} 11/07/2021 01:15:49 - INFO - __main__ - Step 28838: {'lr': 0.0004605199687676512, 'samples': 5536896, 'steps': 28837, 'loss/train': 1.2500851154327393}}} 11/07/2021 01:15:51 - INFO - __main__ - Step 28842: {'lr': 0.00046050851917830884, 'samples': 5537664, 'steps': 28841, 'loss/train': 1.69772207736969}}}} 11/07/2021 01:15:54 - INFO - __main__ - Step 28847: {'lr': 0.00046049420505747294, 'samples': 5538624, 'steps': 28846, 'loss/train': 1.3030951023101807}} 11/07/2021 01:15:56 - INFO - __main__ - Step 28851: {'lr': 0.00046048275205357855, 'samples': 5539392, 'steps': 28850, 'loss/train': 0.1284380406141281}} 11/07/2021 01:15:56 - INFO - __main__ - Step 28851: {'lr': 0.00046048275205357855, 'samples': 5539392, 'steps': 28850, 'loss/train': 0.1284380406141281}} 11/07/2021 01:15:59 - INFO - __main__ - Step 28858: {'lr': 0.0004604627056454622, 'samples': 5540736, 'steps': 28857, 'loss/train': 1.6770565509796143}}} 11/07/2021 01:16:01 - INFO - __main__ - Step 28862: {'lr': 0.00046045124846879427, 'samples': 5541504, 'steps': 28861, 'loss/train': 1.5456140041351318}} 11/07/2021 01:16:04 - INFO - __main__ - Step 28868: {'lr': 0.00046043405985903555, 'samples': 5542656, 'steps': 28867, 'loss/train': 1.8655368089675903}} 11/07/2021 01:16:04 - INFO - __main__ - Step 28868: {'lr': 0.00046043405985903555, 'samples': 5542656, 'steps': 28867, 'loss/train': 1.8655368089675903}} 11/07/2021 01:16:04 - INFO - __main__ - Step 28868: {'lr': 0.00046043405985903555, 'samples': 5542656, 'steps': 28867, 'loss/train': 1.8655368089675903}} 11/07/2021 01:16:09 - INFO - __main__ - Step 28878: {'lr': 0.00046040540459077483, 'samples': 5544576, 'steps': 28877, 'loss/train': 1.636860966682434}}} 11/07/2021 01:16:11 - INFO - __main__ - Step 28883: {'lr': 0.00046039107340136023, 'samples': 5545536, 'steps': 28882, 'loss/train': 1.5530414581298828}} 11/07/2021 01:16:14 - INFO - __main__ - Step 28888: {'lr': 0.0004603767398419713, 'samples': 5546496, 'steps': 28887, 'loss/train': 1.016764760017395}8}} 11/07/2021 01:16:16 - INFO - __main__ - Step 28892: {'lr': 0.00046036527128818724, 'samples': 5547264, 'steps': 28891, 'loss/train': 1.4205087423324585}} 11/07/2021 01:16:18 - INFO - __main__ - Step 28896: {'lr': 0.00046035380121780563, 'samples': 5548032, 'steps': 28895, 'loss/train': 1.6449450254440308}} 11/07/2021 01:16:20 - INFO - __main__ - Step 28900: {'lr': 0.0004603423296309092, 'samples': 5548800, 'steps': 28899, 'loss/train': 1.0341877937316895}}} 11/07/2021 01:16:22 - INFO - __main__ - Step 28904: {'lr': 0.00046033085652758053, 'samples': 5549568, 'steps': 28903, 'loss/train': 1.4512176513671875}} 11/07/2021 01:16:24 - INFO - __main__ - Step 28908: {'lr': 0.00046031938190790254, 'samples': 5550336, 'steps': 28907, 'loss/train': 1.7647217512130737}} 11/07/2021 01:16:26 - INFO - __main__ - Step 28912: {'lr': 0.0004603079057719579, 'samples': 5551104, 'steps': 28911, 'loss/train': 1.697066068649292}7}} 11/07/2021 01:16:28 - INFO - __main__ - Step 28916: {'lr': 0.0004602964281198293, 'samples': 5551872, 'steps': 28915, 'loss/train': 1.4778679609298706}}} 11/07/2021 01:16:29 - INFO - __main__ - Step 28920: {'lr': 0.0004602849489515995, 'samples': 5552640, 'steps': 28919, 'loss/train': 1.5974217653274536}}} 11/07/2021 01:16:31 - INFO - __main__ - Step 28924: {'lr': 0.0004602734682673512, 'samples': 5553408, 'steps': 28923, 'loss/train': 1.2557610273361206}}} 11/07/2021 01:16:34 - INFO - __main__ - Step 28929: {'lr': 0.000460259115280266, 'samples': 5554368, 'steps': 28928, 'loss/train': 1.3246605396270752}}}} 11/07/2021 01:16:36 - INFO - __main__ - Step 28933: {'lr': 0.00046024763118527885, 'samples': 5555136, 'steps': 28932, 'loss/train': 1.3774343729019165}} 11/07/2021 01:16:36 - INFO - __main__ - Step 28933: {'lr': 0.00046024763118527885, 'samples': 5555136, 'steps': 28932, 'loss/train': 1.3774343729019165}} 11/07/2021 01:16:39 - INFO - __main__ - Step 28940: {'lr': 0.00046022753037182915, 'samples': 5556480, 'steps': 28939, 'loss/train': 1.280436635017395}}} 11/07/2021 01:16:42 - INFO - __main__ - Step 28945: {'lr': 0.0004602131698061521, 'samples': 5557440, 'steps': 28944, 'loss/train': 2.0704479217529297}}} 11/07/2021 01:16:42 - INFO - __main__ - Step 28945: {'lr': 0.0004602131698061521, 'samples': 5557440, 'steps': 28944, 'loss/train': 2.0704479217529297}}} 11/07/2021 01:16:46 - INFO - __main__ - Step 28951: {'lr': 0.0004601959340016333, 'samples': 5558592, 'steps': 28950, 'loss/train': 1.1154563426971436}}} 11/07/2021 01:16:46 - INFO - __main__ - Step 28951: {'lr': 0.0004601959340016333, 'samples': 5558592, 'steps': 28950, 'loss/train': 1.1154563426971436}}} 11/07/2021 01:16:50 - INFO - __main__ - Step 28960: {'lr': 0.0004601700739019469, 'samples': 5560320, 'steps': 28959, 'loss/train': 1.454060673713684}}}} 11/07/2021 01:16:52 - INFO - __main__ - Step 28964: {'lr': 0.00046015857806206816, 'samples': 5561088, 'steps': 28963, 'loss/train': 1.4131869077682495}} 11/07/2021 01:16:52 - INFO - __main__ - Step 28964: {'lr': 0.00046015857806206816, 'samples': 5561088, 'steps': 28963, 'loss/train': 1.4131869077682495}} 11/07/2021 01:16:57 - INFO - __main__ - Step 28972: {'lr': 0.0004601355818370714, 'samples': 5562624, 'steps': 28971, 'loss/train': 1.8072031736373901}}} 11/07/2021 01:16:58 - INFO - __main__ - Step 28976: {'lr': 0.0004601240814521192, 'samples': 5563392, 'steps': 28975, 'loss/train': 1.3847191333770752}}} 11/07/2021 01:17:00 - INFO - __main__ - Step 28980: {'lr': 0.00046011257955230826, 'samples': 5564160, 'steps': 28979, 'loss/train': 1.8839620351791382}} 11/07/2021 01:17:00 - INFO - __main__ - Step 28980: {'lr': 0.00046011257955230826, 'samples': 5564160, 'steps': 28979, 'loss/train': 1.8839620351791382}} 11/07/2021 01:17:04 - INFO - __main__ - Step 28987: {'lr': 0.00046009244758275986, 'samples': 5565504, 'steps': 28986, 'loss/train': 1.4445399045944214}} 11/07/2021 01:17:05 - INFO - __main__ - Step 28991: {'lr': 0.00046008094151751513, 'samples': 5566272, 'steps': 28990, 'loss/train': 0.9555196166038513}} 11/07/2021 01:17:08 - INFO - __main__ - Step 28995: {'lr': 0.00046006943393772274, 'samples': 5567040, 'steps': 28994, 'loss/train': 1.4568381309509277}} 11/07/2021 01:17:10 - INFO - __main__ - Step 29000: {'lr': 0.0004600550473332759, 'samples': 5568000, 'steps': 28999, 'loss/train': 1.7568962574005127}}} 11/07/2021 01:17:12 - INFO - __main__ - Step 29004: {'lr': 0.00046004353634605447, 'samples': 5568768, 'steps': 29003, 'loss/train': 1.6470844745635986}} 11/07/2021 01:17:14 - INFO - __main__ - Step 29008: {'lr': 0.00046003202384455505, 'samples': 5569536, 'steps': 29007, 'loss/train': 1.0364364385604858}} 11/07/2021 01:17:16 - INFO - __main__ - Step 29012: {'lr': 0.0004600205098288606, 'samples': 5570304, 'steps': 29011, 'loss/train': 1.7710977792739868}}} 11/07/2021 01:17:18 - INFO - __main__ - Step 29016: {'lr': 0.0004600089942990542, 'samples': 5571072, 'steps': 29015, 'loss/train': 1.5851895809173584}}} 11/07/2021 01:17:20 - INFO - __main__ - Step 29021: {'lr': 0.0004599945977577026, 'samples': 5572032, 'steps': 29020, 'loss/train': 1.4682435989379883}}} 11/07/2021 01:17:22 - INFO - __main__ - Step 29025: {'lr': 0.0004599830788214477, 'samples': 5572800, 'steps': 29024, 'loss/train': 0.9922259449958801}}} 11/07/2021 01:17:22 - INFO - __main__ - Step 29025: {'lr': 0.0004599830788214477, 'samples': 5572800, 'steps': 29024, 'loss/train': 0.9922259449958801}}} 11/07/2021 01:17:25 - INFO - __main__ - Step 29032: {'lr': 0.00045996291704036884, 'samples': 5574144, 'steps': 29031, 'loss/train': 1.6111900806427002}} 11/07/2021 01:17:28 - INFO - __main__ - Step 29036: {'lr': 0.00045995139394124784, 'samples': 5574912, 'steps': 29035, 'loss/train': 1.251185417175293}}} 11/07/2021 01:17:30 - INFO - __main__ - Step 29041: {'lr': 0.0004599369879388371, 'samples': 5575872, 'steps': 29040, 'loss/train': 0.8322492241859436}}} 11/07/2021 01:17:30 - INFO - __main__ - Step 29041: {'lr': 0.0004599369879388371, 'samples': 5575872, 'steps': 29040, 'loss/train': 0.8322492241859436}}} 11/07/2021 01:17:34 - INFO - __main__ - Step 29049: {'lr': 0.00045991393341614017, 'samples': 5577408, 'steps': 29048, 'loss/train': 2.1169443130493164}} 11/07/2021 01:17:36 - INFO - __main__ - Step 29053: {'lr': 0.0004599024038847347, 'samples': 5578176, 'steps': 29052, 'loss/train': 1.7626140117645264}}} 11/07/2021 01:17:38 - INFO - __main__ - Step 29057: {'lr': 0.00045989087284006863, 'samples': 5578944, 'steps': 29056, 'loss/train': 1.551805019378662}}} 11/07/2021 01:17:40 - INFO - __main__ - Step 29062: {'lr': 0.00045987645690634003, 'samples': 5579904, 'steps': 29061, 'loss/train': 1.555138349533081}}} 11/07/2021 01:17:43 - INFO - __main__ - Step 29067: {'lr': 0.0004598620386084342, 'samples': 5580864, 'steps': 29066, 'loss/train': 1.6981607675552368}}} 11/07/2021 01:17:43 - INFO - __main__ - Step 29067: {'lr': 0.0004598620386084342, 'samples': 5580864, 'steps': 29066, 'loss/train': 1.6981607675552368}}} 11/07/2021 01:17:46 - INFO - __main__ - Step 29074: {'lr': 0.00045984184901985735, 'samples': 5582208, 'steps': 29073, 'loss/train': 1.5174460411071777}} 11/07/2021 01:17:48 - INFO - __main__ - Step 29078: {'lr': 0.00045983031003193756, 'samples': 5582976, 'steps': 29077, 'loss/train': 1.7958552837371826}} 11/07/2021 01:17:51 - INFO - __main__ - Step 29084: {'lr': 0.00045981299871369484, 'samples': 5584128, 'steps': 29083, 'loss/train': 0.7894611358642578}} 11/07/2021 01:17:51 - INFO - __main__ - Step 29084: {'lr': 0.00045981299871369484, 'samples': 5584128, 'steps': 29083, 'loss/train': 0.7894611358642578}} 11/07/2021 01:17:51 - INFO - __main__ - Step 29084: {'lr': 0.00045981299871369484, 'samples': 5584128, 'steps': 29083, 'loss/train': 0.7894611358642578}} 11/07/2021 01:17:55 - INFO - __main__ - Step 29094: {'lr': 0.0004597841389536825, 'samples': 5586048, 'steps': 29093, 'loss/train': 1.0229157209396362}}} 11/07/2021 01:17:58 - INFO - __main__ - Step 29099: {'lr': 0.00045976970552888896, 'samples': 5587008, 'steps': 29098, 'loss/train': 1.1772266626358032}} 11/07/2021 01:18:01 - INFO - __main__ - Step 29105: {'lr': 0.0004597523823000243, 'samples': 5588160, 'steps': 29104, 'loss/train': 1.5525542497634888}}} 11/07/2021 01:18:03 - INFO - __main__ - Step 29109: {'lr': 0.00045974083159054, 'samples': 5588928, 'steps': 29108, 'loss/train': 1.43483567237854}888}}} 11/07/2021 01:18:03 - INFO - __main__ - Step 29109: {'lr': 0.00045974083159054, 'samples': 5588928, 'steps': 29108, 'loss/train': 1.43483567237854}888}}} 11/07/2021 01:18:06 - INFO - __main__ - Step 29115: {'lr': 0.0004597235026911603, 'samples': 5590080, 'steps': 29114, 'loss/train': 1.5206252336502075}}} 11/07/2021 01:18:08 - INFO - __main__ - Step 29120: {'lr': 0.00045970905934296537, 'samples': 5591040, 'steps': 29119, 'loss/train': 1.3336411714553833}} 11/07/2021 01:18:10 - INFO - __main__ - Step 29124: {'lr': 0.00045969750296355173, 'samples': 5591808, 'steps': 29123, 'loss/train': 1.3595398664474487}} 11/07/2021 01:18:12 - INFO - __main__ - Step 29128: {'lr': 0.00045968594507235467, 'samples': 5592576, 'steps': 29127, 'loss/train': 1.769572138786316}}} 11/07/2021 01:18:14 - INFO - __main__ - Step 29132: {'lr': 0.0004596743856694576, 'samples': 5593344, 'steps': 29131, 'loss/train': 0.9651497602462769}}} 11/07/2021 01:18:16 - INFO - __main__ - Step 29136: {'lr': 0.0004596628247549439, 'samples': 5594112, 'steps': 29135, 'loss/train': 1.4166126251220703}}} 11/07/2021 01:18:18 - INFO - __main__ - Step 29141: {'lr': 0.00045964837148621776, 'samples': 5595072, 'steps': 29140, 'loss/train': 1.4857066869735718}} 11/07/2021 01:18:20 - INFO - __main__ - Step 29145: {'lr': 0.00045963680717087124, 'samples': 5595840, 'steps': 29144, 'loss/train': 1.531014323234558}}} 11/07/2021 01:18:20 - INFO - __main__ - Step 29145: {'lr': 0.00045963680717087124, 'samples': 5595840, 'steps': 29144, 'loss/train': 1.531014323234558}}} 11/07/2021 01:18:24 - INFO - __main__ - Step 29152: {'lr': 0.00045961656598238925, 'samples': 5597184, 'steps': 29151, 'loss/train': 1.5725375413894653}} 11/07/2021 01:18:26 - INFO - __main__ - Step 29157: {'lr': 0.00045960210515709064, 'samples': 5598144, 'steps': 29156, 'loss/train': 1.6044692993164062}} 11/07/2021 01:18:29 - INFO - __main__ - Step 29162: {'lr': 0.0004595876419707052, 'samples': 5599104, 'steps': 29161, 'loss/train': 1.5310627222061157}}} 11/07/2021 01:18:29 - INFO - __main__ - Step 29162: {'lr': 0.0004595876419707052, 'samples': 5599104, 'steps': 29161, 'loss/train': 1.5310627222061157}}} 11/07/2021 01:18:32 - INFO - __main__ - Step 29169: {'lr': 0.0004595673895434498, 'samples': 5600448, 'steps': 29168, 'loss/train': 1.7352036237716675}}} 11/07/2021 01:18:34 - INFO - __main__ - Step 29173: {'lr': 0.0004595558146504344, 'samples': 5601216, 'steps': 29172, 'loss/train': 1.5260308980941772}}} 11/07/2021 01:18:36 - INFO - __main__ - Step 29177: {'lr': 0.00045954423824665704, 'samples': 5601984, 'steps': 29176, 'loss/train': 1.3339821100234985}} 11/07/2021 01:18:39 - INFO - __main__ - Step 29183: {'lr': 0.00045952687080849517, 'samples': 5603136, 'steps': 29182, 'loss/train': 1.6467374563217163}} 11/07/2021 01:18:39 - INFO - __main__ - Step 29183: {'lr': 0.00045952687080849517, 'samples': 5603136, 'steps': 29182, 'loss/train': 1.6467374563217163}} 11/07/2021 01:18:43 - INFO - __main__ - Step 29190: {'lr': 0.00045950660450169034, 'samples': 5604480, 'steps': 29189, 'loss/train': 1.7768203020095825}} 11/07/2021 01:18:44 - INFO - __main__ - Step 29194: {'lr': 0.0004594950216781063, 'samples': 5605248, 'steps': 29193, 'loss/train': 1.210199236869812}5}} 11/07/2021 01:18:46 - INFO - __main__ - Step 29198: {'lr': 0.00045948343734419873, 'samples': 5606016, 'steps': 29197, 'loss/train': 1.6112602949142456}} 11/07/2021 01:18:49 - INFO - __main__ - Step 29203: {'lr': 0.0004594689548030489, 'samples': 5606976, 'steps': 29202, 'loss/train': 1.0666319131851196}}} 11/07/2021 01:18:49 - INFO - __main__ - Step 29203: {'lr': 0.0004594689548030489, 'samples': 5606976, 'steps': 29202, 'loss/train': 1.0666319131851196}}} 11/07/2021 01:18:52 - INFO - __main__ - Step 29210: {'lr': 0.00045944867528136956, 'samples': 5608320, 'steps': 29209, 'loss/train': 1.8951489925384521}} 11/07/2021 01:18:54 - INFO - __main__ - Step 29214: {'lr': 0.0004594370849070029, 'samples': 5609088, 'steps': 29213, 'loss/train': 1.8933584690093994}}} 11/07/2021 01:18:57 - INFO - __main__ - Step 29219: {'lr': 0.0004594225948157492, 'samples': 5610048, 'steps': 29218, 'loss/train': 1.590041160583496}}}} 11/07/2021 01:18:57 - INFO - __main__ - Step 29219: {'lr': 0.0004594225948157492, 'samples': 5610048, 'steps': 29218, 'loss/train': 1.590041160583496}}}} 11/07/2021 01:19:01 - INFO - __main__ - Step 29227: {'lr': 0.0004593994057629565, 'samples': 5611584, 'steps': 29226, 'loss/train': 2.0218591690063477}}} 11/07/2021 01:19:02 - INFO - __main__ - Step 29231: {'lr': 0.00045938780897206686, 'samples': 5612352, 'steps': 29230, 'loss/train': 1.9293100833892822}} 11/07/2021 01:19:04 - INFO - __main__ - Step 29235: {'lr': 0.00045937621067162674, 'samples': 5613120, 'steps': 29234, 'loss/train': 1.460890293121338}}} 11/07/2021 01:19:07 - INFO - __main__ - Step 29240: {'lr': 0.00045936171067339826, 'samples': 5614080, 'steps': 29239, 'loss/train': 1.7664074897766113}} 11/07/2021 01:19:07 - INFO - __main__ - Step 29240: {'lr': 0.00045936171067339826, 'samples': 5614080, 'steps': 29239, 'loss/train': 1.7664074897766113}} 11/07/2021 01:19:10 - INFO - __main__ - Step 29247: {'lr': 0.0004593414067138385, 'samples': 5615424, 'steps': 29246, 'loss/train': 2.345348596572876}3}} 11/07/2021 01:19:13 - INFO - __main__ - Step 29251: {'lr': 0.00045932980237603196, 'samples': 5616192, 'steps': 29250, 'loss/train': 1.3415101766586304}} 11/07/2021 01:19:15 - INFO - __main__ - Step 29256: {'lr': 0.0004593152948315661, 'samples': 5617152, 'steps': 29255, 'loss/train': 1.6951172351837158}}} 11/07/2021 01:19:15 - INFO - __main__ - Step 29256: {'lr': 0.0004593152948315661, 'samples': 5617152, 'steps': 29255, 'loss/train': 1.6951172351837158}}} 11/07/2021 01:19:18 - INFO - __main__ - Step 29263: {'lr': 0.0004592949803081524, 'samples': 5618496, 'steps': 29262, 'loss/train': 1.6450902223587036}}} 11/07/2021 01:19:20 - INFO - __main__ - Step 29267: {'lr': 0.0004592833699343181, 'samples': 5619264, 'steps': 29266, 'loss/train': 1.5430980920791626}}} 11/07/2021 01:19:23 - INFO - __main__ - Step 29272: {'lr': 0.00045926885484528823, 'samples': 5620224, 'steps': 29271, 'loss/train': 1.9551517963409424}} 11/07/2021 01:19:25 - INFO - __main__ - Step 29276: {'lr': 0.0004592572410767768, 'samples': 5620992, 'steps': 29275, 'loss/train': 1.3488487005233765}}} 11/07/2021 01:19:25 - INFO - __main__ - Step 29276: {'lr': 0.0004592572410767768, 'samples': 5620992, 'steps': 29275, 'loss/train': 1.3488487005233765}}} 11/07/2021 01:19:28 - INFO - __main__ - Step 29283: {'lr': 0.000459236913351841, 'samples': 5622336, 'steps': 29282, 'loss/train': 2.1861953735351562}}}} 11/07/2021 01:19:31 - INFO - __main__ - Step 29288: {'lr': 0.0004592223907199215, 'samples': 5623296, 'steps': 29287, 'loss/train': 1.471245527267456}}}} 11/07/2021 01:19:33 - INFO - __main__ - Step 29292: {'lr': 0.0004592107709174752, 'samples': 5624064, 'steps': 29291, 'loss/train': 1.3144384622573853}}} 11/07/2021 01:19:33 - INFO - __main__ - Step 29292: {'lr': 0.0004592107709174752, 'samples': 5624064, 'steps': 29291, 'loss/train': 1.3144384622573853}}} 11/07/2021 01:19:37 - INFO - __main__ - Step 29299: {'lr': 0.00045919043263395953, 'samples': 5625408, 'steps': 29298, 'loss/train': 1.7972590923309326}} 11/07/2021 01:19:39 - INFO - __main__ - Step 29304: {'lr': 0.0004591759024608255, 'samples': 5626368, 'steps': 29303, 'loss/train': 1.6076900959014893}}} 11/07/2021 01:19:41 - INFO - __main__ - Step 29309: {'lr': 0.00045916136993140574, 'samples': 5627328, 'steps': 29308, 'loss/train': 2.0325093269348145}} 11/07/2021 01:19:41 - INFO - __main__ - Step 29309: {'lr': 0.00045916136993140574, 'samples': 5627328, 'steps': 29308, 'loss/train': 2.0325093269348145}} 11/07/2021 01:19:45 - INFO - __main__ - Step 29316: {'lr': 0.00045914102043196947, 'samples': 5628672, 'steps': 29315, 'loss/train': 0.8205442428588867}} 11/07/2021 01:19:47 - INFO - __main__ - Step 29320: {'lr': 0.00045912939007336273, 'samples': 5629440, 'steps': 29319, 'loss/train': 1.9441035985946655}} 11/07/2021 01:19:47 - INFO - __main__ - Step 29320: {'lr': 0.00045912939007336273, 'samples': 5629440, 'steps': 29319, 'loss/train': 1.9441035985946655}} 11/07/2021 01:19:51 - INFO - __main__ - Step 29328: {'lr': 0.00045910612483317025, 'samples': 5630976, 'steps': 29327, 'loss/train': 1.7314233779907227}} 11/07/2021 01:19:51 - INFO - __main__ - Step 29328: {'lr': 0.00045910612483317025, 'samples': 5630976, 'steps': 29327, 'loss/train': 1.7314233779907227}} 11/07/2021 01:19:54 - INFO - __main__ - Step 29335: {'lr': 0.00045908576280142925, 'samples': 5632320, 'steps': 29334, 'loss/train': 1.6490302085876465}} 11/07/2021 01:19:57 - INFO - __main__ - Step 29340: {'lr': 0.00045907121566669216, 'samples': 5633280, 'steps': 29339, 'loss/train': 1.7707066535949707}} 11/07/2021 01:19:57 - INFO - __main__ - Step 29340: {'lr': 0.00045907121566669216, 'samples': 5633280, 'steps': 29339, 'loss/train': 1.7707066535949707}} 11/07/2021 01:20:01 - INFO - __main__ - Step 29348: {'lr': 0.0004590479353525591, 'samples': 5634816, 'steps': 29347, 'loss/train': 1.354005217552185}7}} 11/07/2021 01:20:02 - INFO - __main__ - Step 29352: {'lr': 0.0004590362929348001, 'samples': 5635584, 'steps': 29351, 'loss/train': 1.6804929971694946}}} 11/07/2021 01:20:05 - INFO - __main__ - Step 29356: {'lr': 0.0004590246490100246, 'samples': 5636352, 'steps': 29355, 'loss/train': 0.7158932089805603}}} 11/07/2021 01:20:07 - INFO - __main__ - Step 29361: {'lr': 0.00045901009198494124, 'samples': 5637312, 'steps': 29360, 'loss/train': 1.6712108850479126}} 11/07/2021 01:20:09 - INFO - __main__ - Step 29365: {'lr': 0.00045899844466968574, 'samples': 5638080, 'steps': 29364, 'loss/train': 1.6341489553451538}} 11/07/2021 01:20:11 - INFO - __main__ - Step 29369: {'lr': 0.0004589867958476866, 'samples': 5638848, 'steps': 29368, 'loss/train': 1.667285680770874}8}} 11/07/2021 01:20:11 - INFO - __main__ - Step 29369: {'lr': 0.0004589867958476866, 'samples': 5638848, 'steps': 29368, 'loss/train': 1.667285680770874}8}} 11/07/2021 01:20:15 - INFO - __main__ - Step 29376: {'lr': 0.0004589664067838389, 'samples': 5640192, 'steps': 29375, 'loss/train': 1.7532117366790771}}} 11/07/2021 01:20:17 - INFO - __main__ - Step 29381: {'lr': 0.0004589518403420676, 'samples': 5641152, 'steps': 29380, 'loss/train': 1.6963664293289185}}} 11/07/2021 01:20:19 - INFO - __main__ - Step 29385: {'lr': 0.00045894018549393404, 'samples': 5641920, 'steps': 29384, 'loss/train': 1.575050711631775}}} 11/07/2021 01:20:21 - INFO - __main__ - Step 29389: {'lr': 0.0004589285291394769, 'samples': 5642688, 'steps': 29388, 'loss/train': 1.6082696914672852}}} 11/07/2021 01:20:21 - INFO - __main__ - Step 29389: {'lr': 0.0004589285291394769, 'samples': 5642688, 'steps': 29388, 'loss/train': 1.6082696914672852}}} 11/07/2021 01:20:24 - INFO - __main__ - Step 29396: {'lr': 0.0004589081268948386, 'samples': 5644032, 'steps': 29395, 'loss/train': 0.7713015675544739}}} 11/07/2021 01:20:27 - INFO - __main__ - Step 29401: {'lr': 0.0004588935510390045, 'samples': 5644992, 'steps': 29400, 'loss/train': 1.18025541305542}9}}} 11/07/2021 01:20:27 - INFO - __main__ - Step 29401: {'lr': 0.0004588935510390045, 'samples': 5644992, 'steps': 29400, 'loss/train': 1.18025541305542}9}}} 11/07/2021 01:20:31 - INFO - __main__ - Step 29409: {'lr': 0.00045887022477527923, 'samples': 5646528, 'steps': 29408, 'loss/train': 1.9197208881378174}} 11/07/2021 01:20:33 - INFO - __main__ - Step 29413: {'lr': 0.0004588585593846458, 'samples': 5647296, 'steps': 29412, 'loss/train': 0.8832883834838867}}} 11/07/2021 01:20:35 - INFO - __main__ - Step 29418: {'lr': 0.0004588439755289238, 'samples': 5648256, 'steps': 29417, 'loss/train': 1.3511896133422852}}} 11/07/2021 01:20:35 - INFO - __main__ - Step 29418: {'lr': 0.0004588439755289238, 'samples': 5648256, 'steps': 29417, 'loss/train': 1.3511896133422852}}} 11/07/2021 01:20:39 - INFO - __main__ - Step 29426: {'lr': 0.00045882063646653966, 'samples': 5649792, 'steps': 29425, 'loss/train': 1.5203857421875}2}}} 11/07/2021 01:20:42 - INFO - __main__ - Step 29430: {'lr': 0.000458808964677113, 'samples': 5650560, 'steps': 29429, 'loss/train': 1.303795576095581}2}}} 11/07/2021 01:20:43 - INFO - __main__ - Step 29434: {'lr': 0.0004587972913823087, 'samples': 5651328, 'steps': 29433, 'loss/train': 1.5912048816680908}}} 11/07/2021 01:20:45 - INFO - __main__ - Step 29439: {'lr': 0.0004587826976469944, 'samples': 5652288, 'steps': 29438, 'loss/train': 1.6085463762283325}}} 11/07/2021 01:20:47 - INFO - __main__ - Step 29443: {'lr': 0.0004587710209653984, 'samples': 5653056, 'steps': 29442, 'loss/train': 1.3992745876312256}}} 11/07/2021 01:20:47 - INFO - __main__ - Step 29443: {'lr': 0.0004587710209653984, 'samples': 5653056, 'steps': 29442, 'loss/train': 1.3992745876312256}}} 11/07/2021 01:20:51 - INFO - __main__ - Step 29450: {'lr': 0.0004587505831509994, 'samples': 5654400, 'steps': 29449, 'loss/train': 1.8191397190093994}}} 11/07/2021 01:20:53 - INFO - __main__ - Step 29455: {'lr': 0.00045873598189032295, 'samples': 5655360, 'steps': 29454, 'loss/train': 0.8377273082733154}} 11/07/2021 01:20:53 - INFO - __main__ - Step 29455: {'lr': 0.00045873598189032295, 'samples': 5655360, 'steps': 29454, 'loss/train': 0.8377273082733154}} 11/07/2021 01:20:57 - INFO - __main__ - Step 29463: {'lr': 0.000458712614982542, 'samples': 5656896, 'steps': 29462, 'loss/train': 1.614371418952942}54}} 11/07/2021 01:21:00 - INFO - __main__ - Step 29467: {'lr': 0.000458700929271585, 'samples': 5657664, 'steps': 29466, 'loss/train': 1.7337079048156738}4}} 11/07/2021 01:21:01 - INFO - __main__ - Step 29471: {'lr': 0.0004586892420560294, 'samples': 5658432, 'steps': 29470, 'loss/train': 1.1268757581710815}}} 11/07/2021 01:21:03 - INFO - __main__ - Step 29475: {'lr': 0.0004586775533359592, 'samples': 5659200, 'steps': 29474, 'loss/train': 2.014317750930786}}}} 11/07/2021 01:21:05 - INFO - __main__ - Step 29479: {'lr': 0.0004586658631114589, 'samples': 5659968, 'steps': 29478, 'loss/train': 1.5325437784194946}}} 11/07/2021 01:21:05 - INFO - __main__ - Step 29479: {'lr': 0.0004586658631114589, 'samples': 5659968, 'steps': 29478, 'loss/train': 1.5325437784194946}}} 11/07/2021 01:21:09 - INFO - __main__ - Step 29486: {'lr': 0.0004586454015988019, 'samples': 5661312, 'steps': 29485, 'loss/train': 1.2461416721343994}}} 11/07/2021 01:21:09 - INFO - __main__ - Step 29486: {'lr': 0.0004586454015988019, 'samples': 5661312, 'steps': 29485, 'loss/train': 1.2461416721343994}}} 11/07/2021 01:21:13 - INFO - __main__ - Step 29493: {'lr': 0.0004586249354795372, 'samples': 5662656, 'steps': 29492, 'loss/train': 1.8026632070541382}}} 11/07/2021 01:21:16 - INFO - __main__ - Step 29498: {'lr': 0.000458610314002798, 'samples': 5663616, 'steps': 29497, 'loss/train': 1.5161337852478027}}}} 11/07/2021 01:21:16 - INFO - __main__ - Step 29498: {'lr': 0.000458610314002798, 'samples': 5663616, 'steps': 29497, 'loss/train': 1.5161337852478027}}}} 11/07/2021 01:21:19 - INFO - __main__ - Step 29505: {'lr': 0.00045858983998754336, 'samples': 5664960, 'steps': 29504, 'loss/train': 1.9173896312713623}} 11/07/2021 01:21:21 - INFO - __main__ - Step 29509: {'lr': 0.0004585781384825039, 'samples': 5665728, 'steps': 29508, 'loss/train': 2.0159964561462402}}} 11/07/2021 01:21:23 - INFO - __main__ - Step 29514: {'lr': 0.0004585635094866175, 'samples': 5666688, 'steps': 29513, 'loss/train': 1.7652790546417236}}} 11/07/2021 01:21:23 - INFO - __main__ - Step 29514: {'lr': 0.0004585635094866175, 'samples': 5666688, 'steps': 29513, 'loss/train': 1.7652790546417236}}} 11/07/2021 01:21:27 - INFO - __main__ - Step 29521: {'lr': 0.0004585430249454425, 'samples': 5668032, 'steps': 29520, 'loss/train': 0.4509488046169281}}} 11/07/2021 01:21:29 - INFO - __main__ - Step 29525: {'lr': 0.00045853131742605563, 'samples': 5668800, 'steps': 29524, 'loss/train': 1.8235915899276733}} 11/07/2021 01:21:29 - INFO - __main__ - Step 29525: {'lr': 0.00045853131742605563, 'samples': 5668800, 'steps': 29524, 'loss/train': 1.8235915899276733}} 11/07/2021 01:21:33 - INFO - __main__ - Step 29533: {'lr': 0.0004585078978772385, 'samples': 5670336, 'steps': 29532, 'loss/train': 1.7644169330596924}}} 11/07/2021 01:21:35 - INFO - __main__ - Step 29537: {'lr': 0.00045849618584797717, 'samples': 5671104, 'steps': 29536, 'loss/train': 1.442642092704773}}} 11/07/2021 01:21:37 - INFO - __main__ - Step 29541: {'lr': 0.00045848447231559315, 'samples': 5671872, 'steps': 29540, 'loss/train': 1.6380871534347534}} 11/07/2021 01:21:39 - INFO - __main__ - Step 29545: {'lr': 0.000458472757280171, 'samples': 5672640, 'steps': 29544, 'loss/train': 1.7970337867736816}4}} 11/07/2021 01:21:41 - INFO - __main__ - Step 29550: {'lr': 0.00045845811137237445, 'samples': 5673600, 'steps': 29549, 'loss/train': 2.0732905864715576}} 11/07/2021 01:21:43 - INFO - __main__ - Step 29554: {'lr': 0.00045844639295542525, 'samples': 5674368, 'steps': 29553, 'loss/train': 1.4013590812683105}} 11/07/2021 01:21:43 - INFO - __main__ - Step 29554: {'lr': 0.00045844639295542525, 'samples': 5674368, 'steps': 29553, 'loss/train': 1.4013590812683105}} 11/07/2021 01:21:47 - INFO - __main__ - Step 29561: {'lr': 0.0004584258821097899, 'samples': 5675712, 'steps': 29560, 'loss/train': 1.5316812992095947}}} 11/07/2021 01:21:49 - INFO - __main__ - Step 29566: {'lr': 0.0004584112286883336, 'samples': 5676672, 'steps': 29565, 'loss/train': 1.372520089149475}}}} 11/07/2021 01:21:49 - INFO - __main__ - Step 29566: {'lr': 0.0004584112286883336, 'samples': 5676672, 'steps': 29565, 'loss/train': 1.372520089149475}}}} 11/07/2021 01:21:53 - INFO - __main__ - Step 29574: {'lr': 0.00045838777833091425, 'samples': 5678208, 'steps': 29573, 'loss/train': 1.731628656387329}}} 11/07/2021 01:21:55 - INFO - __main__ - Step 29578: {'lr': 0.0004583760508986508, 'samples': 5678976, 'steps': 29577, 'loss/train': 1.2868000268936157}}} 11/07/2021 01:21:57 - INFO - __main__ - Step 29582: {'lr': 0.0004583643219641307, 'samples': 5679744, 'steps': 29581, 'loss/train': 1.2891649007797241}}} 11/07/2021 01:21:59 - INFO - __main__ - Step 29587: {'lr': 0.0004583496586835612, 'samples': 5680704, 'steps': 29586, 'loss/train': 1.5053143501281738}}} 11/07/2021 01:21:59 - INFO - __main__ - Step 29587: {'lr': 0.0004583496586835612, 'samples': 5680704, 'steps': 29586, 'loss/train': 1.5053143501281738}}} 11/07/2021 01:22:03 - INFO - __main__ - Step 29593: {'lr': 0.0004583320596488807, 'samples': 5681856, 'steps': 29592, 'loss/train': 1.5070799589157104}}} 11/07/2021 01:22:05 - INFO - __main__ - Step 29597: {'lr': 0.0004583203250816518, 'samples': 5682624, 'steps': 29596, 'loss/train': 1.4190809726715088}}} 11/07/2021 01:22:07 - INFO - __main__ - Step 29602: {'lr': 0.0004583056547606424, 'samples': 5683584, 'steps': 29601, 'loss/train': 1.6327733993530273}}} 11/07/2021 01:22:10 - INFO - __main__ - Step 29606: {'lr': 0.00045829391681435926, 'samples': 5684352, 'steps': 29605, 'loss/train': 1.6640363931655884}} 11/07/2021 01:22:10 - INFO - __main__ - Step 29606: {'lr': 0.00045829391681435926, 'samples': 5684352, 'steps': 29605, 'loss/train': 1.6640363931655884}} 11/07/2021 01:22:13 - INFO - __main__ - Step 29613: {'lr': 0.0004582733717950347, 'samples': 5685696, 'steps': 29612, 'loss/train': 1.6518462896347046}}} 11/07/2021 01:22:15 - INFO - __main__ - Step 29618: {'lr': 0.000458258693965862, 'samples': 5686656, 'steps': 29617, 'loss/train': 1.9197696447372437}}}} 11/07/2021 01:22:15 - INFO - __main__ - Step 29618: {'lr': 0.000458258693965862, 'samples': 5686656, 'steps': 29617, 'loss/train': 1.9197696447372437}}}} 11/07/2021 01:22:15 - INFO - __main__ - Step 29618: {'lr': 0.000458258693965862, 'samples': 5686656, 'steps': 29617, 'loss/train': 1.9197696447372437}}}} 11/07/2021 01:22:21 - INFO - __main__ - Step 29629: {'lr': 0.00045822639448415736, 'samples': 5688768, 'steps': 29628, 'loss/train': 1.2518410682678223}} 11/07/2021 01:22:24 - INFO - __main__ - Step 29634: {'lr': 0.0004582117091485145, 'samples': 5689728, 'steps': 29633, 'loss/train': 1.304877758026123}3}} 11/07/2021 01:22:24 - INFO - __main__ - Step 29634: {'lr': 0.0004582117091485145, 'samples': 5689728, 'steps': 29633, 'loss/train': 1.304877758026123}3}} 11/07/2021 01:22:27 - INFO - __main__ - Step 29641: {'lr': 0.0004581911457383382, 'samples': 5691072, 'steps': 29640, 'loss/train': 1.6379077434539795}}} 11/07/2021 01:22:29 - INFO - __main__ - Step 29645: {'lr': 0.00045817939315443855, 'samples': 5691840, 'steps': 29644, 'loss/train': 1.4004114866256714}} 11/07/2021 01:22:31 - INFO - __main__ - Step 29650: {'lr': 0.00045816470031401945, 'samples': 5692800, 'steps': 29649, 'loss/train': 1.462112307548523}}} 11/07/2021 01:22:31 - INFO - __main__ - Step 29650: {'lr': 0.00045816470031401945, 'samples': 5692800, 'steps': 29649, 'loss/train': 1.462112307548523}}} 11/07/2021 01:22:35 - INFO - __main__ - Step 29657: {'lr': 0.0004581441263980461, 'samples': 5694144, 'steps': 29656, 'loss/train': 1.448478102684021}}}} 11/07/2021 01:22:37 - INFO - __main__ - Step 29661: {'lr': 0.00045813236781129996, 'samples': 5694912, 'steps': 29660, 'loss/train': 1.7136733531951904}} 11/07/2021 01:22:40 - INFO - __main__ - Step 29666: {'lr': 0.0004581176674677995, 'samples': 5695872, 'steps': 29665, 'loss/train': 1.7237344980239868}}} 11/07/2021 01:22:42 - INFO - __main__ - Step 29671: {'lr': 0.0004581029647799337, 'samples': 5696832, 'steps': 29670, 'loss/train': 1.8532898426055908}}} 11/07/2021 01:22:44 - INFO - __main__ - Step 29675: {'lr': 0.00045809120094180946, 'samples': 5697600, 'steps': 29674, 'loss/train': 1.7726764678955078}} 11/07/2021 01:22:44 - INFO - __main__ - Step 29675: {'lr': 0.00045809120094180946, 'samples': 5697600, 'steps': 29674, 'loss/train': 1.7726764678955078}} 11/07/2021 01:22:47 - INFO - __main__ - Step 29682: {'lr': 0.0004580706106152796, 'samples': 5698944, 'steps': 29681, 'loss/train': 1.5208772420883179}}} 11/07/2021 01:22:49 - INFO - __main__ - Step 29686: {'lr': 0.0004580588426518013, 'samples': 5699712, 'steps': 29685, 'loss/train': 1.3639895915985107}}} 11/07/2021 01:22:52 - INFO - __main__ - Step 29691: {'lr': 0.0004580441305881311, 'samples': 5700672, 'steps': 29690, 'loss/train': 1.4484436511993408}}} 11/07/2021 01:22:54 - INFO - __main__ - Step 29695: {'lr': 0.0004580323592498404, 'samples': 5701440, 'steps': 29694, 'loss/train': 1.6245121955871582}}} 11/07/2021 01:22:54 - INFO - __main__ - Step 29695: {'lr': 0.0004580323592498404, 'samples': 5701440, 'steps': 29694, 'loss/train': 1.6245121955871582}}} 11/07/2021 01:22:57 - INFO - __main__ - Step 29702: {'lr': 0.0004580117557990402, 'samples': 5702784, 'steps': 29701, 'loss/train': 1.9294558763504028}}} 11/07/2021 01:23:00 - INFO - __main__ - Step 29707: {'lr': 0.00045799703623663546, 'samples': 5703744, 'steps': 29706, 'loss/train': 1.4139878749847412}} 11/07/2021 01:23:02 - INFO - __main__ - Step 29713: {'lr': 0.00045797936966899595, 'samples': 5704896, 'steps': 29712, 'loss/train': 1.2416294813156128}} 11/07/2021 01:23:02 - INFO - __main__ - Step 29713: {'lr': 0.00045797936966899595, 'samples': 5704896, 'steps': 29712, 'loss/train': 1.2416294813156128}} 11/07/2021 01:23:05 - INFO - __main__ - Step 29720: {'lr': 0.00045795875440952726, 'samples': 5706240, 'steps': 29719, 'loss/train': 1.9400047063827515}} 11/07/2021 01:23:08 - INFO - __main__ - Step 29724: {'lr': 0.0004579469721997641, 'samples': 5707008, 'steps': 29723, 'loss/train': 1.326294183731079}5}} 11/07/2021 01:23:10 - INFO - __main__ - Step 29729: {'lr': 0.00045793224232937193, 'samples': 5707968, 'steps': 29728, 'loss/train': 1.545784831047058}}} 11/07/2021 01:23:10 - INFO - __main__ - Step 29729: {'lr': 0.00045793224232937193, 'samples': 5707968, 'steps': 29728, 'loss/train': 1.545784831047058}}} 11/07/2021 01:23:10 - INFO - __main__ - Step 29729: {'lr': 0.00045793224232937193, 'samples': 5707968, 'steps': 29728, 'loss/train': 1.545784831047058}}} 11/07/2021 01:23:15 - INFO - __main__ - Step 29740: {'lr': 0.0004578998283699296, 'samples': 5710080, 'steps': 29739, 'loss/train': 1.4859967231750488}}} 11/07/2021 01:23:18 - INFO - __main__ - Step 29745: {'lr': 0.0004578850910048369, 'samples': 5711040, 'steps': 29744, 'loss/train': 0.6552532315254211}}} 11/07/2021 01:23:20 - INFO - __main__ - Step 29749: {'lr': 0.00045787329942669803, 'samples': 5711808, 'steps': 29748, 'loss/train': 1.2877225875854492}} 11/07/2021 01:23:22 - INFO - __main__ - Step 29753: {'lr': 0.00045786150634992716, 'samples': 5712576, 'steps': 29752, 'loss/train': 1.583794355392456}}} 11/07/2021 01:23:23 - INFO - __main__ - Step 29757: {'lr': 0.0004578497117746094, 'samples': 5713344, 'steps': 29756, 'loss/train': 1.3814347982406616}}} 11/07/2021 01:23:26 - INFO - __main__ - Step 29761: {'lr': 0.00045783791570082956, 'samples': 5714112, 'steps': 29760, 'loss/train': 1.479193091392517}}} 11/07/2021 01:23:28 - INFO - __main__ - Step 29766: {'lr': 0.0004578231685015223, 'samples': 5715072, 'steps': 29765, 'loss/train': 1.6487336158752441}}} 11/07/2021 01:23:28 - INFO - __main__ - Step 29766: {'lr': 0.0004578231685015223, 'samples': 5715072, 'steps': 29765, 'loss/train': 1.6487336158752441}}} 11/07/2021 01:23:31 - INFO - __main__ - Step 29772: {'lr': 0.000457805468772185, 'samples': 5716224, 'steps': 29771, 'loss/train': 1.9433209896087646}}}} 11/07/2021 01:23:34 - INFO - __main__ - Step 29777: {'lr': 0.00045779071642279177, 'samples': 5717184, 'steps': 29776, 'loss/train': 1.4320186376571655}} 11/07/2021 01:23:36 - INFO - __main__ - Step 29781: {'lr': 0.000457778912857978, 'samples': 5717952, 'steps': 29780, 'loss/train': 1.4643261432647705}5}} 11/07/2021 01:23:38 - INFO - __main__ - Step 29785: {'lr': 0.0004577671077952127, 'samples': 5718720, 'steps': 29784, 'loss/train': 1.080474853515625}5}} 11/07/2021 01:23:40 - INFO - __main__ - Step 29789: {'lr': 0.00045775530123458096, 'samples': 5719488, 'steps': 29788, 'loss/train': 1.7799813747406006}} 11/07/2021 01:23:42 - INFO - __main__ - Step 29793: {'lr': 0.00045774349317616786, 'samples': 5720256, 'steps': 29792, 'loss/train': 1.5883327722549438}} 11/07/2021 01:23:44 - INFO - __main__ - Step 29797: {'lr': 0.0004577316836200586, 'samples': 5721024, 'steps': 29796, 'loss/train': 1.347123146057129}8}} 11/07/2021 01:23:44 - INFO - __main__ - Step 29797: {'lr': 0.0004577316836200586, 'samples': 5721024, 'steps': 29796, 'loss/train': 1.347123146057129}8}} 11/07/2021 01:23:48 - INFO - __main__ - Step 29803: {'lr': 0.00045771396647790053, 'samples': 5722176, 'steps': 29802, 'loss/train': 1.8053834438323975}} 11/07/2021 01:23:48 - INFO - __main__ - Step 29803: {'lr': 0.00045771396647790053, 'samples': 5722176, 'steps': 29802, 'loss/train': 1.8053834438323975}} 11/07/2021 01:23:52 - INFO - __main__ - Step 29812: {'lr': 0.0004576873844472455, 'samples': 5723904, 'steps': 29811, 'loss/train': 1.5087987184524536}}} 11/07/2021 01:23:53 - INFO - __main__ - Step 29816: {'lr': 0.00045767556777824217, 'samples': 5724672, 'steps': 29815, 'loss/train': 1.5217554569244385}} 11/07/2021 01:23:56 - INFO - __main__ - Step 29820: {'lr': 0.00045766374961203236, 'samples': 5725440, 'steps': 29819, 'loss/train': 1.401145100593567}}} 11/07/2021 01:23:58 - INFO - __main__ - Step 29825: {'lr': 0.00045764897479895315, 'samples': 5726400, 'steps': 29824, 'loss/train': 1.7080674171447754}} 11/07/2021 01:24:00 - INFO - __main__ - Step 29830: {'lr': 0.0004576341976467884, 'samples': 5727360, 'steps': 29829, 'loss/train': 1.267899990081787}4}} 11/07/2021 01:24:02 - INFO - __main__ - Step 29834: {'lr': 0.00045762237424102687, 'samples': 5728128, 'steps': 29833, 'loss/train': 1.6399792432785034}} 11/07/2021 01:24:05 - INFO - __main__ - Step 29838: {'lr': 0.0004576105493384423, 'samples': 5728896, 'steps': 29837, 'loss/train': 1.2299456596374512}}} 11/07/2021 01:24:05 - INFO - __main__ - Step 29838: {'lr': 0.0004576105493384423, 'samples': 5728896, 'steps': 29837, 'loss/train': 1.2299456596374512}}} 11/07/2021 01:24:08 - INFO - __main__ - Step 29845: {'lr': 0.00045758985215744536, 'samples': 5730240, 'steps': 29844, 'loss/train': 1.447424292564392}}} 11/07/2021 01:24:11 - INFO - __main__ - Step 29851: {'lr': 0.00045757210806863895, 'samples': 5731392, 'steps': 29850, 'loss/train': 1.5924853086471558}} 11/07/2021 01:24:11 - INFO - __main__ - Step 29851: {'lr': 0.00045757210806863895, 'samples': 5731392, 'steps': 29850, 'loss/train': 1.5924853086471558}} 11/07/2021 01:24:14 - INFO - __main__ - Step 29858: {'lr': 0.0004575514023761585, 'samples': 5732736, 'steps': 29857, 'loss/train': 1.5171469449996948}}} 11/07/2021 01:24:16 - INFO - __main__ - Step 29862: {'lr': 0.00045753956849442647, 'samples': 5733504, 'steps': 29861, 'loss/train': 1.9545060396194458}} 11/07/2021 01:24:18 - INFO - __main__ - Step 29866: {'lr': 0.00045752773311646846, 'samples': 5734272, 'steps': 29865, 'loss/train': 1.4891483783721924}} 11/07/2021 01:24:21 - INFO - __main__ - Step 29871: {'lr': 0.0004575129367900831, 'samples': 5735232, 'steps': 29870, 'loss/train': 1.8289682865142822}}} 11/07/2021 01:24:23 - INFO - __main__ - Step 29875: {'lr': 0.0004575010980459285, 'samples': 5736000, 'steps': 29874, 'loss/train': 1.251508355140686}}}} 11/07/2021 01:24:23 - INFO - __main__ - Step 29875: {'lr': 0.0004575010980459285, 'samples': 5736000, 'steps': 29874, 'loss/train': 1.251508355140686}}}} 11/07/2021 01:24:26 - INFO - __main__ - Step 29882: {'lr': 0.00045748037664408275, 'samples': 5737344, 'steps': 29881, 'loss/train': 1.6705900430679321}} 11/07/2021 01:24:28 - INFO - __main__ - Step 29887: {'lr': 0.000457465572838114, 'samples': 5738304, 'steps': 29886, 'loss/train': 0.6421449184417725}1}} 11/07/2021 01:24:30 - INFO - __main__ - Step 29891: {'lr': 0.00045745372811067687, 'samples': 5739072, 'steps': 29890, 'loss/train': 1.0584920644760132}} 11/07/2021 01:24:33 - INFO - __main__ - Step 29895: {'lr': 0.0004574418818876326, 'samples': 5739840, 'steps': 29894, 'loss/train': 1.4991633892059326}}} 11/07/2021 01:24:34 - INFO - __main__ - Step 29899: {'lr': 0.0004574300341690665, 'samples': 5740608, 'steps': 29898, 'loss/train': 1.7634446620941162}}} 11/07/2021 01:24:36 - INFO - __main__ - Step 29903: {'lr': 0.00045741818495506403, 'samples': 5741376, 'steps': 29902, 'loss/train': 1.6585086584091187}} 11/07/2021 01:24:39 - INFO - __main__ - Step 29908: {'lr': 0.00045740337133473374, 'samples': 5742336, 'steps': 29907, 'loss/train': 1.775071620941162}}} 11/07/2021 01:24:39 - INFO - __main__ - Step 29908: {'lr': 0.00045740337133473374, 'samples': 5742336, 'steps': 29907, 'loss/train': 1.775071620941162}}} 11/07/2021 01:24:42 - INFO - __main__ - Step 29915: {'lr': 0.00045738262834129283, 'samples': 5743680, 'steps': 29914, 'loss/train': 1.5828520059585571}} 11/07/2021 01:24:44 - INFO - __main__ - Step 29919: {'lr': 0.0004573707731463993, 'samples': 5744448, 'steps': 29918, 'loss/train': 1.1830955743789673}}} 11/07/2021 01:24:46 - INFO - __main__ - Step 29923: {'lr': 0.0004573589164564966, 'samples': 5745216, 'steps': 29922, 'loss/train': 1.684158205986023}}}} 11/07/2021 01:24:48 - INFO - __main__ - Step 29927: {'lr': 0.00045734705827167035, 'samples': 5745984, 'steps': 29926, 'loss/train': 1.7693537473678589}} 11/07/2021 01:24:50 - INFO - __main__ - Step 29931: {'lr': 0.0004573351985920059, 'samples': 5746752, 'steps': 29930, 'loss/train': 1.7663991451263428}}} 11/07/2021 01:24:52 - INFO - __main__ - Step 29935: {'lr': 0.0004573233374175888, 'samples': 5747520, 'steps': 29934, 'loss/train': 1.8239710330963135}}} 11/07/2021 01:24:54 - INFO - __main__ - Step 29939: {'lr': 0.0004573114747485045, 'samples': 5748288, 'steps': 29938, 'loss/train': 1.498435139656067}}}} 11/07/2021 01:24:56 - INFO - __main__ - Step 29944: {'lr': 0.0004572966443104038, 'samples': 5749248, 'steps': 29943, 'loss/train': 1.4378646612167358}}} 11/07/2021 01:24:59 - INFO - __main__ - Step 29949: {'lr': 0.0004572818115371864, 'samples': 5750208, 'steps': 29948, 'loss/train': 1.4937506914138794}}} 11/07/2021 01:24:59 - INFO - __main__ - Step 29949: {'lr': 0.0004572818115371864, 'samples': 5750208, 'steps': 29948, 'loss/train': 1.4937506914138794}}} 11/07/2021 01:25:02 - INFO - __main__ - Step 29956: {'lr': 0.000457261041732004, 'samples': 5751552, 'steps': 29955, 'loss/train': 1.3401647806167603}}}} 11/07/2021 01:25:04 - INFO - __main__ - Step 29960: {'lr': 0.00045724917121732055, 'samples': 5752320, 'steps': 29959, 'loss/train': 1.8313194513320923}} 11/07/2021 01:25:06 - INFO - __main__ - Step 29965: {'lr': 0.00045723433097285247, 'samples': 5753280, 'steps': 29964, 'loss/train': 1.1049710512161255}} 11/07/2021 01:25:08 - INFO - __main__ - Step 29969: {'lr': 0.0004572224570964915, 'samples': 5754048, 'steps': 29968, 'loss/train': 1.915236473083496}5}} 11/07/2021 01:25:11 - INFO - __main__ - Step 29973: {'lr': 0.00045721058172619043, 'samples': 5754816, 'steps': 29972, 'loss/train': 1.2458912134170532}} 11/07/2021 01:25:12 - INFO - __main__ - Step 29977: {'lr': 0.0004571987048620353, 'samples': 5755584, 'steps': 29976, 'loss/train': 1.5356825590133667}}} 11/07/2021 01:25:14 - INFO - __main__ - Step 29981: {'lr': 0.00045718682650411146, 'samples': 5756352, 'steps': 29980, 'loss/train': 1.6693494319915771}} 11/07/2021 01:25:16 - INFO - __main__ - Step 29985: {'lr': 0.0004571749466525046, 'samples': 5757120, 'steps': 29984, 'loss/train': 1.3214921951293945}}} 11/07/2021 01:25:19 - INFO - __main__ - Step 29989: {'lr': 0.00045716306530730043, 'samples': 5757888, 'steps': 29988, 'loss/train': 1.2930711507797241}} 11/07/2021 01:25:20 - INFO - __main__ - Step 29993: {'lr': 0.00045715118246858466, 'samples': 5758656, 'steps': 29992, 'loss/train': 1.5766459703445435}} 11/07/2021 01:25:22 - INFO - __main__ - Step 29997: {'lr': 0.00045713929813644274, 'samples': 5759424, 'steps': 29996, 'loss/train': 1.537301778793335}}} 11/07/2021 01:25:22 - INFO - __main__ - Step 29997: {'lr': 0.00045713929813644274, 'samples': 5759424, 'steps': 29996, 'loss/train': 1.537301778793335}}} 11/07/2021 01:25:22 - INFO - __main__ - Step 29997: {'lr': 0.00045713929813644274, 'samples': 5759424, 'steps': 29996, 'loss/train': 1.537301778793335}}} 11/07/2021 01:25:22 - INFO - __main__ - Step 29997: {'lr': 0.00045713929813644274, 'samples': 5759424, 'steps': 29996, 'loss/train': 1.537301778793335}}} huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks... To disable this warning, you can either: - Avoid using `tokenizers` before the fork if possible - Explicitly set the environment variable TOKENIZERS_PARALLELISM=(true | false) 11/07/2021 01:25:22 - INFO - __main__ - Step 29997: {'lr': 0.00045713929813644274, 'samples': 5759424, 'steps': 29996, 'loss/train': 1.537301778793335}}} huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks... To disable this warning, you can either: - Avoid using `tokenizers` before the fork if possible - Explicitly set the environment variable TOKENIZERS_PARALLELISM=(true | false) huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks... To disable this warning, you can either: - Avoid using `tokenizers` before the fork if possible - Explicitly set the environment variable TOKENIZERS_PARALLELISM=(true | false) huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks... To disable this warning, you can either: - Avoid using `tokenizers` before the fork if possible - Explicitly set the environment variable TOKENIZERS_PARALLELISM=(true | false) huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks... To disable this warning, you can either: - Avoid using `tokenizers` before the fork if possible - Explicitly set the environment variable TOKENIZERS_PARALLELISM=(true | false) huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks... To disable this warning, you can either: - Avoid using `tokenizers` before the fork if possible - Explicitly set the environment variable TOKENIZERS_PARALLELISM=(true | false) huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks... To disable this warning, you can either: - Avoid using `tokenizers` before the fork if possible - Explicitly set the environment variable TOKENIZERS_PARALLELISM=(true | false) 11/07/2021 01:25:22 - INFO - __main__ - Step 29997: {'lr': 0.00045713929813644274, 'samples': 5759424, 'steps': 29996, 'loss/train': 1.537301778793335}}} huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks... To disable this warning, you can either: - Avoid using `tokenizers` before the fork if possible - Explicitly set the environment variable TOKENIZERS_PARALLELISM=(true | false) huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks... To disable this warning, you can either: - Avoid using `tokenizers` before the fork if possible - Explicitly set the environment variable TOKENIZERS_PARALLELISM=(true | false) huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks... To disable this warning, you can either: - Avoid using `tokenizers` before the fork if possible - Explicitly set the environment variable TOKENIZERS_PARALLELISM=(true | false) huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks... To disable this warning, you can either: - Avoid using `tokenizers` before the fork if possible 11/07/2021 01:28:53 - WARNING - huggingface_hub.repository - The progress bars may be unreliable.9424, 'steps': 29996, 'loss/train': 1.537301778793335}}} 11/07/2021 01:28:53 - WARNING - huggingface_hub.repository - The progress bars may be unreliable.9424, 'steps': 29996, 'loss/train': 1.537301778793335}}} 11/07/2021 01:28:53 - WARNING - huggingface_hub.repository - The progress bars may be unreliable.9424, 'steps': 29996, 'loss/train': 1.537301778793335}}} 11/07/2021 01:28:53 - WARNING - huggingface_hub.repository - The progress bars may be unreliable.9424, 'steps': 29996, 'loss/train': 1.537301778793335}}} 11/07/2021 01:28:53 - WARNING - huggingface_hub.repository - The progress bars may be unreliable.9424, 'steps': 29996, 'loss/train': 1.537301778793335}}} 11/07/2021 01:28:53 - WARNING - huggingface_hub.repository - The progress bars may be unreliable.9424, 'steps': 29996, 'loss/train': 1.537301778793335}}} 11/07/2021 01:28:53 - WARNING - huggingface_hub.repository - The progress bars may be unreliable.9424, 'steps': 29996, 'loss/train': 1.537301778793335}}} 11/07/2021 01:28:53 - WARNING - huggingface_hub.repository - The progress bars may be unreliable.9424, 'steps': 29996, 'loss/train': 1.537301778793335}}} 11/07/2021 01:28:53 - WARNING - huggingface_hub.repository - The progress bars may be unreliable.9424, 'steps': 29996, 'loss/train': 1.537301778793335}}} 11/07/2021 01:28:53 - WARNING - huggingface_hub.repository - The progress bars may be unreliable.9424, 'steps': 29996, 'loss/train': 1.537301778793335}}} 11/07/2021 01:28:53 - WARNING - huggingface_hub.repository - The progress bars may be unreliable.9424, 'steps': 29996, 'loss/train': 1.537301778793335}}} 11/07/2021 01:28:53 - WARNING - huggingface_hub.repository - The progress bars may be unreliable.9424, 'steps': 29996, 'loss/train': 1.537301778793335}}} 11/07/2021 01:28:53 - WARNING - huggingface_hub.repository - The progress bars may be unreliable.9424, 'steps': 29996, 'loss/train': 1.537301778793335}}} 11/07/2021 01:29:19 - WARNING - huggingface_hub.repository - To https://huggingface.co/lvwerra/codeparrot-small 29996, 'loss/train': 1.537301778793335}}} 11/07/2021 01:29:19 - WARNING - huggingface_hub.repository - To https://huggingface.co/lvwerra/codeparrot-small 29996, 'loss/train': 1.537301778793335}}} huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks... To disable this warning, you can either: - Avoid using `tokenizers` before the fork if possible - Explicitly set the environment variable TOKENIZERS_PARALLELISM=(true | false) huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks... To disable this warning, you can either: - Avoid using `tokenizers` before the fork if possible 11/07/2021 01:29:21 - INFO - __main__ - Step 30001: {'lr': 0.00045712741231096054, 'samples': 5760192, 'steps': 30000, 'loss/train': 1.1647251844406128}} 11/07/2021 01:29:24 - INFO - __main__ - Step 30007: {'lr': 0.00045710958077291156, 'samples': 5761344, 'steps': 30006, 'loss/train': 1.6442089080810547}} 11/07/2021 01:29:24 - INFO - __main__ - Step 30007: {'lr': 0.00045710958077291156, 'samples': 5761344, 'steps': 30006, 'loss/train': 1.6442089080810547}} 11/07/2021 01:29:27 - INFO - __main__ - Step 30014: {'lr': 0.00045708877306579733, 'samples': 5762688, 'steps': 30013, 'loss/train': 1.8259247541427612}} 11/07/2021 01:29:29 - INFO - __main__ - Step 30018: {'lr': 0.0004570768808945748, 'samples': 5763456, 'steps': 30017, 'loss/train': 2.3335750102996826}}} 11/07/2021 01:29:29 - INFO - __main__ - Step 30018: {'lr': 0.0004570768808945748, 'samples': 5763456, 'steps': 30017, 'loss/train': 2.3335750102996826}}} 11/07/2021 01:29:29 - INFO - __main__ - Step 30018: {'lr': 0.0004570768808945748, 'samples': 5763456, 'steps': 30017, 'loss/train': 2.3335750102996826}}} 11/07/2021 01:29:35 - INFO - __main__ - Step 30030: {'lr': 0.000457041195423908, 'samples': 5765760, 'steps': 30029, 'loss/train': 0.26046043634414673}}} 11/07/2021 01:29:37 - INFO - __main__ - Step 30034: {'lr': 0.00045702929728163845, 'samples': 5766528, 'steps': 30033, 'loss/train': 1.5465935468673706}} 11/07/2021 01:29:40 - INFO - __main__ - Step 30039: {'lr': 0.0004570144225049171, 'samples': 5767488, 'steps': 30038, 'loss/train': 1.7716975212097168}}} 11/07/2021 01:29:42 - INFO - __main__ - Step 30043: {'lr': 0.0004570025210045368, 'samples': 5768256, 'steps': 30042, 'loss/train': 1.53825044631958}8}}} 11/07/2021 01:29:42 - INFO - __main__ - Step 30043: {'lr': 0.0004570025210045368, 'samples': 5768256, 'steps': 30042, 'loss/train': 1.53825044631958}8}}} 11/07/2021 01:29:45 - INFO - __main__ - Step 30050: {'lr': 0.00045698168978794553, 'samples': 5769600, 'steps': 30049, 'loss/train': 1.3786624670028687}} 11/07/2021 01:29:48 - INFO - __main__ - Step 30055: {'lr': 0.0004569668075496137, 'samples': 5770560, 'steps': 30054, 'loss/train': 0.4359086751937866}}} 11/07/2021 01:29:50 - INFO - __main__ - Step 30060: {'lr': 0.00045695192297988066, 'samples': 5771520, 'steps': 30059, 'loss/train': 1.5076161623001099}} 11/07/2021 01:29:52 - INFO - __main__ - Step 30064: {'lr': 0.00045694001364559797, 'samples': 5772288, 'steps': 30063, 'loss/train': 1.5570694208145142}} 11/07/2021 01:29:52 - INFO - __main__ - Step 30064: {'lr': 0.00045694001364559797, 'samples': 5772288, 'steps': 30063, 'loss/train': 1.5570694208145142}} 11/07/2021 01:29:56 - INFO - __main__ - Step 30070: {'lr': 0.0004569221468468815, 'samples': 5773440, 'steps': 30069, 'loss/train': 2.7942593097686768}}} 11/07/2021 01:29:56 - INFO - __main__ - Step 30070: {'lr': 0.0004569221468468815, 'samples': 5773440, 'steps': 30069, 'loss/train': 2.7942593097686768}}} 11/07/2021 01:30:00 - INFO - __main__ - Step 30079: {'lr': 0.0004568953403554723, 'samples': 5775168, 'steps': 30078, 'loss/train': 2.0979011058807373}}} 11/07/2021 01:30:02 - INFO - __main__ - Step 30083: {'lr': 0.00045688342393541227, 'samples': 5775936, 'steps': 30082, 'loss/train': 1.5688196420669556}} 11/07/2021 01:30:04 - INFO - __main__ - Step 30087: {'lr': 0.0004568715060238565, 'samples': 5776704, 'steps': 30086, 'loss/train': 1.9113773107528687}}} 11/07/2021 01:30:06 - INFO - __main__ - Step 30091: {'lr': 0.00045685958662089113, 'samples': 5777472, 'steps': 30090, 'loss/train': 2.05692195892334}}}} 11/07/2021 01:30:08 - INFO - __main__ - Step 30097: {'lr': 0.00045684170472023766, 'samples': 5778624, 'steps': 30096, 'loss/train': 1.069875955581665}}} 11/07/2021 01:30:08 - INFO - __main__ - Step 30097: {'lr': 0.00045684170472023766, 'samples': 5778624, 'steps': 30096, 'loss/train': 1.069875955581665}}} 11/07/2021 01:30:08 - INFO - __main__ - Step 30097: {'lr': 0.00045684170472023766, 'samples': 5778624, 'steps': 30096, 'loss/train': 1.069875955581665}}} 11/07/2021 01:30:13 - INFO - __main__ - Step 30107: {'lr': 0.00045681189409665083, 'samples': 5780544, 'steps': 30106, 'loss/train': 1.6340943574905396}} 11/07/2021 01:30:16 - INFO - __main__ - Step 30112: {'lr': 0.00045679698529028906, 'samples': 5781504, 'steps': 30111, 'loss/train': 1.3997163772583008}} 11/07/2021 01:30:16 - INFO - __main__ - Step 30112: {'lr': 0.00045679698529028906, 'samples': 5781504, 'steps': 30111, 'loss/train': 1.3997163772583008}} 11/07/2021 01:30:20 - INFO - __main__ - Step 30120: {'lr': 0.00045677312635484466, 'samples': 5783040, 'steps': 30119, 'loss/train': 1.8276495933532715}} 11/07/2021 01:30:22 - INFO - __main__ - Step 30124: {'lr': 0.0004567611946510287, 'samples': 5783808, 'steps': 30123, 'loss/train': 1.199395775794983}5}} 11/07/2021 01:30:22 - INFO - __main__ - Step 30124: {'lr': 0.0004567611946510287, 'samples': 5783808, 'steps': 30123, 'loss/train': 1.199395775794983}5}} 11/07/2021 01:30:25 - INFO - __main__ - Step 30130: {'lr': 0.0004567432943004296, 'samples': 5784960, 'steps': 30129, 'loss/train': 2.230543375015259}5}} 11/07/2021 01:30:27 - INFO - __main__ - Step 30134: {'lr': 0.00045673135887023874, 'samples': 5785728, 'steps': 30133, 'loss/train': 1.5163437128067017}} 11/07/2021 01:30:30 - INFO - __main__ - Step 30140: {'lr': 0.00045671345293048075, 'samples': 5786880, 'steps': 30139, 'loss/train': 2.0110180377960205}} 11/07/2021 01:30:30 - INFO - __main__ - Step 30140: {'lr': 0.00045671345293048075, 'samples': 5786880, 'steps': 30139, 'loss/train': 2.0110180377960205}} 11/07/2021 01:30:34 - INFO - __main__ - Step 30147: {'lr': 0.0004566925584294939, 'samples': 5788224, 'steps': 30146, 'loss/train': 1.2389037609100342}}} 11/07/2021 01:30:36 - INFO - __main__ - Step 30151: {'lr': 0.00045668061666570027, 'samples': 5788992, 'steps': 30150, 'loss/train': 1.859587550163269}}} 11/07/2021 01:30:38 - INFO - __main__ - Step 30156: {'lr': 0.00045666568736560853, 'samples': 5789952, 'steps': 30155, 'loss/train': 1.7113829851150513}} 11/07/2021 01:30:40 - INFO - __main__ - Step 30160: {'lr': 0.0004566537422493605, 'samples': 5790720, 'steps': 30159, 'loss/train': 1.5131640434265137}}} 11/07/2021 01:30:42 - INFO - __main__ - Step 30164: {'lr': 0.00045664179564327266, 'samples': 5791488, 'steps': 30163, 'loss/train': 1.4880365133285522}} 11/07/2021 01:30:42 - INFO - __main__ - Step 30164: {'lr': 0.00045664179564327266, 'samples': 5791488, 'steps': 30163, 'loss/train': 1.4880365133285522}} 11/07/2021 01:30:46 - INFO - __main__ - Step 30171: {'lr': 0.00045662088549795087, 'samples': 5792832, 'steps': 30170, 'loss/train': 1.545324444770813}}} 11/07/2021 01:30:48 - INFO - __main__ - Step 30176: {'lr': 0.00045660594688683154, 'samples': 5793792, 'steps': 30175, 'loss/train': 1.4284133911132812}} 11/07/2021 01:30:50 - INFO - __main__ - Step 30180: {'lr': 0.00045659399432224583, 'samples': 5794560, 'steps': 30179, 'loss/train': 1.3032660484313965}} 11/07/2021 01:30:50 - INFO - __main__ - Step 30180: {'lr': 0.00045659399432224583, 'samples': 5794560, 'steps': 30179, 'loss/train': 1.3032660484313965}} 11/07/2021 01:30:54 - INFO - __main__ - Step 30187: {'lr': 0.00045657307375038226, 'samples': 5795904, 'steps': 30186, 'loss/train': 1.1355433464050293}} 11/07/2021 01:30:56 - INFO - __main__ - Step 30192: {'lr': 0.00045655812769237927, 'samples': 5796864, 'steps': 30191, 'loss/train': 1.4939942359924316}} 11/07/2021 01:30:56 - INFO - __main__ - Step 30192: {'lr': 0.00045655812769237927, 'samples': 5796864, 'steps': 30191, 'loss/train': 1.4939942359924316}} 11/07/2021 01:31:00 - INFO - __main__ - Step 30199: {'lr': 0.0004565371993021927, 'samples': 5798208, 'steps': 30198, 'loss/train': 1.5073283910751343}}} 11/07/2021 01:31:02 - INFO - __main__ - Step 30203: {'lr': 0.0004565252381746821, 'samples': 5798976, 'steps': 30202, 'loss/train': 1.5877180099487305}}} 11/07/2021 01:31:04 - INFO - __main__ - Step 30208: {'lr': 0.0004565102846715195, 'samples': 5799936, 'steps': 30207, 'loss/train': 1.5977096557617188}}} 11/07/2021 01:31:04 - INFO - __main__ - Step 30208: {'lr': 0.0004565102846715195, 'samples': 5799936, 'steps': 30207, 'loss/train': 1.5977096557617188}}} 11/07/2021 01:31:08 - INFO - __main__ - Step 30216: {'lr': 0.0004564863542279113, 'samples': 5801472, 'steps': 30215, 'loss/train': 1.6348497867584229}}} 11/07/2021 01:31:10 - INFO - __main__ - Step 30220: {'lr': 0.0004564743867731145, 'samples': 5802240, 'steps': 30219, 'loss/train': 1.4747366905212402}}} 11/07/2021 01:31:12 - INFO - __main__ - Step 30224: {'lr': 0.000456462417829771, 'samples': 5803008, 'steps': 30223, 'loss/train': 1.4281342029571533}}}} 11/07/2021 01:31:12 - INFO - __main__ - Step 30224: {'lr': 0.000456462417829771, 'samples': 5803008, 'steps': 30223, 'loss/train': 1.4281342029571533}}}} 11/07/2021 01:31:16 - INFO - __main__ - Step 30232: {'lr': 0.0004564384754777888, 'samples': 5804544, 'steps': 30231, 'loss/train': 1.618294358253479}7}} 11/07/2021 01:31:17 - INFO - __main__ - Step 30236: {'lr': 0.0004564265020693227, 'samples': 5805312, 'steps': 30235, 'loss/train': 1.6616016626358032}}} 11/07/2021 01:31:20 - INFO - __main__ - Step 30240: {'lr': 0.00045641452717265507, 'samples': 5806080, 'steps': 30239, 'loss/train': 1.3447606563568115}} 11/07/2021 01:31:22 - INFO - __main__ - Step 30245: {'lr': 0.00045639955645916875, 'samples': 5807040, 'steps': 30244, 'loss/train': 0.9178106188774109}} 11/07/2021 01:31:24 - INFO - __main__ - Step 30249: {'lr': 0.0004563875782143633, 'samples': 5807808, 'steps': 30248, 'loss/train': 1.6026545763015747}}} 11/07/2021 01:31:26 - INFO - __main__ - Step 30253: {'lr': 0.000456375598481637, 'samples': 5808576, 'steps': 30252, 'loss/train': 1.7426419258117676}}}} 11/07/2021 01:31:28 - INFO - __main__ - Step 30257: {'lr': 0.0004563636172610761, 'samples': 5809344, 'steps': 30256, 'loss/train': 1.6551815271377563}}} 11/07/2021 01:31:30 - INFO - __main__ - Step 30261: {'lr': 0.00045635163455276707, 'samples': 5810112, 'steps': 30260, 'loss/train': 1.5694831609725952}} 11/07/2021 01:31:30 - INFO - __main__ - Step 30261: {'lr': 0.00045635163455276707, 'samples': 5810112, 'steps': 30260, 'loss/train': 1.5694831609725952}} 11/07/2021 01:31:30 - INFO - __main__ - Step 30261: {'lr': 0.00045635163455276707, 'samples': 5810112, 'steps': 30260, 'loss/train': 1.5694831609725952}} 11/07/2021 01:31:36 - INFO - __main__ - Step 30272: {'lr': 0.00045631867443442084, 'samples': 5812224, 'steps': 30271, 'loss/train': 1.8668856620788574}} 11/07/2021 01:31:39 - INFO - __main__ - Step 30278: {'lr': 0.0004563006914467709, 'samples': 5813376, 'steps': 30277, 'loss/train': 1.3554658889770508}}} 11/07/2021 01:31:39 - INFO - __main__ - Step 30278: {'lr': 0.0004563006914467709, 'samples': 5813376, 'steps': 30277, 'loss/train': 1.3554658889770508}}} 11/07/2021 01:31:42 - INFO - __main__ - Step 30284: {'lr': 0.0004562827051127082, 'samples': 5814528, 'steps': 30283, 'loss/train': 1.2854667901992798}}} 11/07/2021 01:31:44 - INFO - __main__ - Step 30288: {'lr': 0.00045627071236435896, 'samples': 5815296, 'steps': 30287, 'loss/train': 1.6247920989990234}} 11/07/2021 01:31:46 - INFO - __main__ - Step 30293: {'lr': 0.00045625571933772857, 'samples': 5816256, 'steps': 30292, 'loss/train': 1.7130277156829834}} 11/07/2021 01:31:48 - INFO - __main__ - Step 30297: {'lr': 0.00045624372324357457, 'samples': 5817024, 'steps': 30296, 'loss/train': 1.6591161489486694}} 11/07/2021 01:31:50 - INFO - __main__ - Step 30301: {'lr': 0.00045623172566253676, 'samples': 5817792, 'steps': 30300, 'loss/train': 1.6559518575668335}} 11/07/2021 01:31:52 - INFO - __main__ - Step 30305: {'lr': 0.00045621972659470156, 'samples': 5818560, 'steps': 30304, 'loss/train': 1.796301007270813}}} 11/07/2021 01:31:54 - INFO - __main__ - Step 30309: {'lr': 0.0004562077260401556, 'samples': 5819328, 'steps': 30308, 'loss/train': 1.4213775396347046}}} torch.nn.utils.clip_grad_norm_(parameters, max_norm, norm_type=norm_type)01556, 'samples': 5819328, 'steps': 30308, 'loss/train': 1.4213775396347046}}} 11/07/2021 01:31:59 - INFO - __main__ - Step 30318: {'lr': 0.0004561807193570888, 'samples': 5821056, 'steps': 30317, 'loss/train': 0.8450649976730347}}} 11/07/2021 01:31:59 - INFO - __main__ - Step 30318: {'lr': 0.0004561807193570888, 'samples': 5821056, 'steps': 30317, 'loss/train': 0.8450649976730347}}} 11/07/2021 01:32:02 - INFO - __main__ - Step 30325: {'lr': 0.00045615970895659393, 'samples': 5822400, 'steps': 30324, 'loss/train': 0.860467255115509}}} 11/07/2021 01:32:05 - INFO - __main__ - Step 30330: {'lr': 0.0004561446987408704, 'samples': 5823360, 'steps': 30329, 'loss/train': 1.852549433708191}}}} 11/07/2021 01:32:07 - INFO - __main__ - Step 30334: {'lr': 0.0004561326888963423, 'samples': 5824128, 'steps': 30333, 'loss/train': 1.3868197202682495}}} 11/07/2021 01:32:07 - INFO - __main__ - Step 30334: {'lr': 0.0004561326888963423, 'samples': 5824128, 'steps': 30333, 'loss/train': 1.3868197202682495}}} 11/07/2021 01:32:10 - INFO - __main__ - Step 30341: {'lr': 0.00045611166809258227, 'samples': 5825472, 'steps': 30340, 'loss/train': 1.75202214717865}}}} 11/07/2021 01:32:12 - INFO - __main__ - Step 30346: {'lr': 0.0004560966504466044, 'samples': 5826432, 'steps': 30345, 'loss/train': 1.844706416130066}}}} 11/07/2021 01:32:15 - INFO - __main__ - Step 30351: {'lr': 0.0004560816304790274, 'samples': 5827392, 'steps': 30350, 'loss/train': 1.9288541078567505}}} 11/07/2021 01:32:15 - INFO - __main__ - Step 30351: {'lr': 0.0004560816304790274, 'samples': 5827392, 'steps': 30350, 'loss/train': 1.9288541078567505}}} 11/07/2021 01:32:18 - INFO - __main__ - Step 30358: {'lr': 0.00045606059862445485, 'samples': 5828736, 'steps': 30357, 'loss/train': 0.9508589506149292}} 11/07/2021 01:32:21 - INFO - __main__ - Step 30362: {'lr': 0.00045604857837916224, 'samples': 5829504, 'steps': 30361, 'loss/train': 1.3667658567428589}} 11/07/2021 01:32:23 - INFO - __main__ - Step 30366: {'lr': 0.0004560365566483927, 'samples': 5830272, 'steps': 30365, 'loss/train': 1.4809709787368774}}} 11/07/2021 01:32:25 - INFO - __main__ - Step 30370: {'lr': 0.0004560245334322328, 'samples': 5831040, 'steps': 30369, 'loss/train': 1.7748409509658813}}} 11/07/2021 01:32:26 - INFO - __main__ - Step 30374: {'lr': 0.0004560125087307693, 'samples': 5831808, 'steps': 30373, 'loss/train': 1.8265717029571533}}} 11/07/2021 01:32:28 - INFO - __main__ - Step 30378: {'lr': 0.0004560004825440889, 'samples': 5832576, 'steps': 30377, 'loss/train': 1.6653149127960205}}} 11/07/2021 01:32:31 - INFO - __main__ - Step 30383: {'lr': 0.0004559854477222842, 'samples': 5833536, 'steps': 30382, 'loss/train': 1.8445994853973389}}} 11/07/2021 01:32:33 - INFO - __main__ - Step 30387: {'lr': 0.0004559734181941828, 'samples': 5834304, 'steps': 30386, 'loss/train': 2.4096007347106934}}} 11/07/2021 01:32:35 - INFO - __main__ - Step 30391: {'lr': 0.00045596138718114626, 'samples': 5835072, 'steps': 30390, 'loss/train': 1.5772721767425537}} 11/07/2021 01:32:36 - INFO - __main__ - Step 30395: {'lr': 0.00045594935468326137, 'samples': 5835840, 'steps': 30394, 'loss/train': 1.174822211265564}}} 11/07/2021 01:32:38 - INFO - __main__ - Step 30399: {'lr': 0.00045593732070061484, 'samples': 5836608, 'steps': 30398, 'loss/train': 1.6387768983840942}} 11/07/2021 01:32:41 - INFO - __main__ - Step 30404: {'lr': 0.0004559222761344928, 'samples': 5837568, 'steps': 30403, 'loss/train': 1.3113209009170532}}} 11/07/2021 01:32:41 - INFO - __main__ - Step 30404: {'lr': 0.0004559222761344928, 'samples': 5837568, 'steps': 30403, 'loss/train': 1.3113209009170532}}} 11/07/2021 01:32:44 - INFO - __main__ - Step 30411: {'lr': 0.0004559012098449732, 'samples': 5838912, 'steps': 30410, 'loss/train': 1.7170732021331787}}} 11/07/2021 01:32:46 - INFO - __main__ - Step 30415: {'lr': 0.00045588916992414784, 'samples': 5839680, 'steps': 30414, 'loss/train': 0.5767570734024048}} 11/07/2021 01:32:49 - INFO - __main__ - Step 30420: {'lr': 0.00045587411793579047, 'samples': 5840640, 'steps': 30419, 'loss/train': 1.4096364974975586}} 11/07/2021 01:32:51 - INFO - __main__ - Step 30425: {'lr': 0.00045585906362834063, 'samples': 5841600, 'steps': 30424, 'loss/train': 1.7328251600265503}} 11/07/2021 01:32:53 - INFO - __main__ - Step 30429: {'lr': 0.00045584701851274814, 'samples': 5842368, 'steps': 30428, 'loss/train': 1.8039371967315674}} 11/07/2021 01:32:53 - INFO - __main__ - Step 30429: {'lr': 0.00045584701851274814, 'samples': 5842368, 'steps': 30428, 'loss/train': 1.8039371967315674}} 11/07/2021 01:32:56 - INFO - __main__ - Step 30436: {'lr': 0.00045582593598958107, 'samples': 5843712, 'steps': 30435, 'loss/train': 1.0785267353057861}} 11/07/2021 01:32:58 - INFO - __main__ - Step 30440: {'lr': 0.00045581388679313194, 'samples': 5844480, 'steps': 30439, 'loss/train': 1.785849928855896}}} 11/07/2021 01:33:01 - INFO - __main__ - Step 30446: {'lr': 0.00045579581021638855, 'samples': 5845632, 'steps': 30445, 'loss/train': 0.5898078083992004}} 11/07/2021 01:33:03 - INFO - __main__ - Step 30450: {'lr': 0.0004557837573106399, 'samples': 5846400, 'steps': 30449, 'loss/train': 1.6386229991912842}}} 11/07/2021 01:33:03 - INFO - __main__ - Step 30450: {'lr': 0.0004557837573106399, 'samples': 5846400, 'steps': 30449, 'loss/train': 1.6386229991912842}}} 11/07/2021 01:33:06 - INFO - __main__ - Step 30456: {'lr': 0.0004557656751703544, 'samples': 5847552, 'steps': 30455, 'loss/train': 1.2597217559814453}}} 11/07/2021 01:33:09 - INFO - __main__ - Step 30461: {'lr': 0.00045575060417044614, 'samples': 5848512, 'steps': 30460, 'loss/train': 1.6543169021606445}} 11/07/2021 01:33:11 - INFO - __main__ - Step 30466: {'lr': 0.0004557355308528366, 'samples': 5849472, 'steps': 30465, 'loss/train': 1.7070040702819824}}} 11/07/2021 01:33:13 - INFO - __main__ - Step 30470: {'lr': 0.0004557234705301182, 'samples': 5850240, 'steps': 30469, 'loss/train': 1.3435560464859009}}} 11/07/2021 01:33:15 - INFO - __main__ - Step 30474: {'lr': 0.0004557114087242667, 'samples': 5851008, 'steps': 30473, 'loss/train': 1.2576820850372314}}} 11/07/2021 01:33:17 - INFO - __main__ - Step 30478: {'lr': 0.000455699345435369, 'samples': 5851776, 'steps': 30477, 'loss/train': 1.789986252784729}4}}} 11/07/2021 01:33:19 - INFO - __main__ - Step 30482: {'lr': 0.00045568728066351205, 'samples': 5852544, 'steps': 30481, 'loss/train': 1.6299755573272705}} 11/07/2021 01:33:19 - INFO - __main__ - Step 30482: {'lr': 0.00045568728066351205, 'samples': 5852544, 'steps': 30481, 'loss/train': 1.6299755573272705}} 11/07/2021 01:33:19 - INFO - __main__ - Step 30482: {'lr': 0.00045568728066351205, 'samples': 5852544, 'steps': 30481, 'loss/train': 1.6299755573272705}} 11/07/2021 01:33:25 - INFO - __main__ - Step 30493: {'lr': 0.0004556540948951073, 'samples': 5854656, 'steps': 30492, 'loss/train': 1.3986448049545288}}} 11/07/2021 01:33:27 - INFO - __main__ - Step 30498: {'lr': 0.00045563900674823205, 'samples': 5855616, 'steps': 30497, 'loss/train': 1.3164902925491333}} 11/07/2021 01:33:29 - INFO - __main__ - Step 30503: {'lr': 0.00045562391628491274, 'samples': 5856576, 'steps': 30502, 'loss/train': 1.3326011896133423}} 11/07/2021 01:33:29 - INFO - __main__ - Step 30503: {'lr': 0.00045562391628491274, 'samples': 5856576, 'steps': 30502, 'loss/train': 1.3326011896133423}} 11/07/2021 01:33:33 - INFO - __main__ - Step 30509: {'lr': 0.00045560580467146275, 'samples': 5857728, 'steps': 30508, 'loss/train': 1.6736948490142822}} 11/07/2021 01:33:35 - INFO - __main__ - Step 30514: {'lr': 0.00045559070911256486, 'samples': 5858688, 'steps': 30513, 'loss/train': 1.4619766473770142}} 11/07/2021 01:33:37 - INFO - __main__ - Step 30518: {'lr': 0.00045557863099799034, 'samples': 5859456, 'steps': 30517, 'loss/train': 1.5410176515579224}} 11/07/2021 01:33:39 - INFO - __main__ - Step 30522: {'lr': 0.00045556655140132696, 'samples': 5860224, 'steps': 30521, 'loss/train': 1.1270097494125366}} 11/07/2021 01:33:41 - INFO - __main__ - Step 30526: {'lr': 0.00045555447032266167, 'samples': 5860992, 'steps': 30525, 'loss/train': 1.9231667518615723}} 11/07/2021 01:33:43 - INFO - __main__ - Step 30530: {'lr': 0.0004555423877620817, 'samples': 5861760, 'steps': 30529, 'loss/train': 1.6974469423294067}}} 11/07/2021 01:33:45 - INFO - __main__ - Step 30535: {'lr': 0.00045552728247754673, 'samples': 5862720, 'steps': 30534, 'loss/train': 1.8246897459030151}} 11/07/2021 01:33:45 - INFO - __main__ - Step 30535: {'lr': 0.00045552728247754673, 'samples': 5862720, 'steps': 30534, 'loss/train': 1.8246897459030151}} 11/07/2021 01:33:49 - INFO - __main__ - Step 30542: {'lr': 0.0004555061311897241, 'samples': 5864064, 'steps': 30541, 'loss/train': 1.6158738136291504}}} 11/07/2021 01:33:51 - INFO - __main__ - Step 30546: {'lr': 0.0004554940427023562, 'samples': 5864832, 'steps': 30545, 'loss/train': 1.4866725206375122}}} 11/07/2021 01:33:51 - INFO - __main__ - Step 30546: {'lr': 0.0004554940427023562, 'samples': 5864832, 'steps': 30545, 'loss/train': 1.4866725206375122}}} 11/07/2021 01:33:55 - INFO - __main__ - Step 30553: {'lr': 0.00045547288428470574, 'samples': 5866176, 'steps': 30552, 'loss/train': 1.4973241090774536}} 11/07/2021 01:33:57 - INFO - __main__ - Step 30559: {'lr': 0.00045545474488739693, 'samples': 5867328, 'steps': 30558, 'loss/train': 1.0276840925216675}} 11/07/2021 01:33:57 - INFO - __main__ - Step 30559: {'lr': 0.00045545474488739693, 'samples': 5867328, 'steps': 30558, 'loss/train': 1.0276840925216675}} 11/07/2021 01:34:02 - INFO - __main__ - Step 30566: {'lr': 0.00045543357804507344, 'samples': 5868672, 'steps': 30565, 'loss/train': 1.806578516960144}}} 11/07/2021 01:34:03 - INFO - __main__ - Step 30570: {'lr': 0.0004554214806701384, 'samples': 5869440, 'steps': 30569, 'loss/train': 1.164947748184204}}}} 11/07/2021 01:34:03 - INFO - __main__ - Step 30570: {'lr': 0.0004554214806701384, 'samples': 5869440, 'steps': 30569, 'loss/train': 1.164947748184204}}}} 11/07/2021 01:34:06 - INFO - __main__ - Step 30575: {'lr': 0.0004554063568688857, 'samples': 5870400, 'steps': 30574, 'loss/train': 1.8389360904693604}}} 11/07/2021 01:34:09 - INFO - __main__ - Step 30581: {'lr': 0.0004553882052531504, 'samples': 5871552, 'steps': 30580, 'loss/train': 1.6969722509384155}}} 11/07/2021 01:34:09 - INFO - __main__ - Step 30581: {'lr': 0.0004553882052531504, 'samples': 5871552, 'steps': 30580, 'loss/train': 1.6969722509384155}}} 11/07/2021 01:34:12 - INFO - __main__ - Step 30587: {'lr': 0.000455370050305804, 'samples': 5872704, 'steps': 30586, 'loss/train': 1.578984022140503}5}}} 11/07/2021 01:34:15 - INFO - __main__ - Step 30593: {'lr': 0.0004553518920271408, 'samples': 5873856, 'steps': 30592, 'loss/train': 1.2298734188079834}}} 11/07/2021 01:34:17 - INFO - __main__ - Step 30597: {'lr': 0.000455339784657446, 'samples': 5874624, 'steps': 30596, 'loss/train': 1.2345815896987915}}}} 11/07/2021 01:34:17 - INFO - __main__ - Step 30597: {'lr': 0.000455339784657446, 'samples': 5874624, 'steps': 30596, 'loss/train': 1.2345815896987915}}}} 11/07/2021 01:34:20 - INFO - __main__ - Step 30604: {'lr': 0.0004553185931983994, 'samples': 5875968, 'steps': 30603, 'loss/train': 1.6972366571426392}}} 11/07/2021 01:34:22 - INFO - __main__ - Step 30608: {'lr': 0.0004553064817579053, 'samples': 5876736, 'steps': 30607, 'loss/train': 1.6638671159744263}}} 11/07/2021 01:34:25 - INFO - __main__ - Step 30613: {'lr': 0.0004552913403758695, 'samples': 5877696, 'steps': 30612, 'loss/train': 1.6245088577270508}}} 11/07/2021 01:34:27 - INFO - __main__ - Step 30618: {'lr': 0.0004552761966813059, 'samples': 5878656, 'steps': 30617, 'loss/train': 1.8219928741455078}}} 11/07/2021 01:34:29 - INFO - __main__ - Step 30622: {'lr': 0.00045526408006074973, 'samples': 5879424, 'steps': 30621, 'loss/train': 1.3899623155593872}} 11/07/2021 01:34:29 - INFO - __main__ - Step 30622: {'lr': 0.00045526408006074973, 'samples': 5879424, 'steps': 30621, 'loss/train': 1.3899623155593872}} 11/07/2021 01:34:33 - INFO - __main__ - Step 30629: {'lr': 0.0004552428724140091, 'samples': 5880768, 'steps': 30628, 'loss/train': 0.9583744406700134}}} 11/07/2021 01:34:33 - INFO - __main__ - Step 30629: {'lr': 0.0004552428724140091, 'samples': 5880768, 'steps': 30628, 'loss/train': 0.9583744406700134}}} 11/07/2021 01:34:37 - INFO - __main__ - Step 30635: {'lr': 0.0004552246908243792, 'samples': 5881920, 'steps': 30634, 'loss/train': 1.81877601146698}4}}} 11/07/2021 01:34:39 - INFO - __main__ - Step 30641: {'lr': 0.00045520650590579056, 'samples': 5883072, 'steps': 30640, 'loss/train': 1.666163444519043}}} 11/07/2021 01:34:42 - INFO - __main__ - Step 30645: {'lr': 0.00045519438077745543, 'samples': 5883840, 'steps': 30644, 'loss/train': 1.3501056432724}3}}} 11/07/2021 01:34:43 - INFO - __main__ - Step 30649: {'lr': 0.0004551822541698017, 'samples': 5884608, 'steps': 30648, 'loss/train': 1.6494215726852417}}} 11/07/2021 01:34:45 - INFO - __main__ - Step 30653: {'lr': 0.0004551701260829166, 'samples': 5885376, 'steps': 30652, 'loss/train': 1.7156943082809448}}} 11/07/2021 01:34:47 - INFO - __main__ - Step 30657: {'lr': 0.0004551579965168876, 'samples': 5886144, 'steps': 30656, 'loss/train': 1.431735634803772}}}} 11/07/2021 01:34:47 - INFO - __main__ - Step 30657: {'lr': 0.0004551579965168876, 'samples': 5886144, 'steps': 30656, 'loss/train': 1.431735634803772}}}} 11/07/2021 01:34:52 - INFO - __main__ - Step 30665: {'lr': 0.0004551337329477477, 'samples': 5887680, 'steps': 30664, 'loss/train': 1.2398808002471924}}} 11/07/2021 01:34:53 - INFO - __main__ - Step 30669: {'lr': 0.00045512159894481183, 'samples': 5888448, 'steps': 30668, 'loss/train': 1.554823637008667}}} 11/07/2021 01:34:53 - INFO - __main__ - Step 30669: {'lr': 0.00045512159894481183, 'samples': 5888448, 'steps': 30668, 'loss/train': 1.554823637008667}}} 11/07/2021 01:34:57 - INFO - __main__ - Step 30676: {'lr': 0.0004551003608813784, 'samples': 5889792, 'steps': 30675, 'loss/train': 1.331461787223816}}}} 11/07/2021 01:34:59 - INFO - __main__ - Step 30680: {'lr': 0.00045508822281196937, 'samples': 5890560, 'steps': 30679, 'loss/train': 1.4626752138137817}} 11/07/2021 01:35:01 - INFO - __main__ - Step 30685: {'lr': 0.0004550730481460027, 'samples': 5891520, 'steps': 30684, 'loss/train': 1.6096035242080688}}} 11/07/2021 01:35:03 - INFO - __main__ - Step 30689: {'lr': 0.00045506090674997157, 'samples': 5892288, 'steps': 30688, 'loss/train': 1.6416163444519043}} 11/07/2021 01:35:05 - INFO - __main__ - Step 30693: {'lr': 0.000455048763875584, 'samples': 5893056, 'steps': 30692, 'loss/train': 1.2560726404190063}3}} 11/07/2021 01:35:08 - INFO - __main__ - Step 30697: {'lr': 0.0004550366195229274, 'samples': 5893824, 'steps': 30696, 'loss/train': 1.283591628074646}3}} 11/07/2021 01:35:09 - INFO - __main__ - Step 30701: {'lr': 0.00045502447369208957, 'samples': 5894592, 'steps': 30700, 'loss/train': 0.1268743872642517}} 11/07/2021 01:35:11 - INFO - __main__ - Step 30705: {'lr': 0.0004550123263831578, 'samples': 5895360, 'steps': 30704, 'loss/train': 2.1430163383483887}}} 11/07/2021 01:35:13 - INFO - __main__ - Step 30710: {'lr': 0.00045499714016855705, 'samples': 5896320, 'steps': 30709, 'loss/train': 1.656386137008667}}} 11/07/2021 01:35:13 - INFO - __main__ - Step 30710: {'lr': 0.00045499714016855705, 'samples': 5896320, 'steps': 30709, 'loss/train': 1.656386137008667}}} 11/07/2021 01:35:17 - INFO - __main__ - Step 30718: {'lr': 0.00045497283742210263, 'samples': 5897856, 'steps': 30717, 'loss/train': 1.0657594203948975}} 11/07/2021 01:35:19 - INFO - __main__ - Step 30722: {'lr': 0.0004549606838322492, 'samples': 5898624, 'steps': 30721, 'loss/train': 1.3796584606170654}}} 11/07/2021 01:35:22 - INFO - __main__ - Step 30727: {'lr': 0.00045494548976702, 'samples': 5899584, 'steps': 30726, 'loss/train': 1.3677185773849487}4}}} 11/07/2021 01:35:24 - INFO - __main__ - Step 30731: {'lr': 0.0004549333328526135, 'samples': 5900352, 'steps': 30730, 'loss/train': 1.7263344526290894}}} 11/07/2021 01:35:24 - INFO - __main__ - Step 30731: {'lr': 0.0004549333328526135, 'samples': 5900352, 'steps': 30730, 'loss/train': 1.7263344526290894}}} 11/07/2021 01:35:27 - INFO - __main__ - Step 30738: {'lr': 0.00045491205469737263, 'samples': 5901696, 'steps': 30737, 'loss/train': 1.1181199550628662}} 11/07/2021 01:35:29 - INFO - __main__ - Step 30742: {'lr': 0.000454899893720226, 'samples': 5902464, 'steps': 30741, 'loss/train': 1.7650607824325562}2}} 11/07/2021 01:35:32 - INFO - __main__ - Step 30747: {'lr': 0.0004548846904214964, 'samples': 5903424, 'steps': 30746, 'loss/train': 1.4247400760650635}}} 11/07/2021 01:35:34 - INFO - __main__ - Step 30752: {'lr': 0.0004548694848148199, 'samples': 5904384, 'steps': 30751, 'loss/train': 1.7409619092941284}}} 11/07/2021 01:35:34 - INFO - __main__ - Step 30752: {'lr': 0.0004548694848148199, 'samples': 5904384, 'steps': 30751, 'loss/train': 1.7409619092941284}}} 11/07/2021 01:35:37 - INFO - __main__ - Step 30758: {'lr': 0.0004548512350405593, 'samples': 5905536, 'steps': 30757, 'loss/train': 2.074098587036133}}}} 11/07/2021 01:35:39 - INFO - __main__ - Step 30763: {'lr': 0.00045483602435700233, 'samples': 5906496, 'steps': 30762, 'loss/train': 1.6819427013397217}} 11/07/2021 01:35:39 - INFO - __main__ - Step 30763: {'lr': 0.00045483602435700233, 'samples': 5906496, 'steps': 30762, 'loss/train': 1.6819427013397217}} 11/07/2021 01:35:43 - INFO - __main__ - Step 30771: {'lr': 0.0004548116824639931, 'samples': 5908032, 'steps': 30770, 'loss/train': 1.5290954113006592}}} 11/07/2021 01:35:45 - INFO - __main__ - Step 30775: {'lr': 0.00045479950930260495, 'samples': 5908800, 'steps': 30774, 'loss/train': 1.0466399192810059}} 11/07/2021 01:35:47 - INFO - __main__ - Step 30779: {'lr': 0.00045478733466474487, 'samples': 5909568, 'steps': 30778, 'loss/train': 1.6353005170822144}} 11/07/2021 01:35:49 - INFO - __main__ - Step 30784: {'lr': 0.0004547721142912647, 'samples': 5910528, 'steps': 30783, 'loss/train': 1.5956065654754639}}} 11/07/2021 01:35:49 - INFO - __main__ - Step 30784: {'lr': 0.0004547721142912647, 'samples': 5910528, 'steps': 30783, 'loss/train': 1.5956065654754639}}} 11/07/2021 01:35:53 - INFO - __main__ - Step 30792: {'lr': 0.00045474775689587576, 'samples': 5912064, 'steps': 30791, 'loss/train': 1.1766566038131714}} 11/07/2021 01:35:55 - INFO - __main__ - Step 30796: {'lr': 0.0004547355759839891, 'samples': 5912832, 'steps': 30795, 'loss/train': 1.5019656419754028}}} 11/07/2021 01:35:57 - INFO - __main__ - Step 30800: {'lr': 0.0004547233935960914, 'samples': 5913600, 'steps': 30799, 'loss/train': 1.9017813205718994}}} 11/07/2021 01:35:59 - INFO - __main__ - Step 30805: {'lr': 0.00045470816353571244, 'samples': 5914560, 'steps': 30804, 'loss/train': 1.770569920539856}}} 11/07/2021 01:35:59 - INFO - __main__ - Step 30805: {'lr': 0.00045470816353571244, 'samples': 5914560, 'steps': 30804, 'loss/train': 1.770569920539856}}} 11/07/2021 01:36:04 - INFO - __main__ - Step 30813: {'lr': 0.0004546837906427839, 'samples': 5916096, 'steps': 30812, 'loss/train': 1.618916392326355}}}} 11/07/2021 01:36:05 - INFO - __main__ - Step 30817: {'lr': 0.0004546716019828191, 'samples': 5916864, 'steps': 30816, 'loss/train': 1.5973073244094849}}} 11/07/2021 01:36:07 - INFO - __main__ - Step 30821: {'lr': 0.0004546594118473044, 'samples': 5917632, 'steps': 30820, 'loss/train': 1.6678979396820068}}} 11/07/2021 01:36:09 - INFO - __main__ - Step 30826: {'lr': 0.00045464417210305303, 'samples': 5918592, 'steps': 30825, 'loss/train': 0.868805468082428}}} 11/07/2021 01:36:12 - INFO - __main__ - Step 30830: {'lr': 0.0004546319786478726, 'samples': 5919360, 'steps': 30829, 'loss/train': 1.4833358526229858}}} 11/07/2021 01:36:14 - INFO - __main__ - Step 30834: {'lr': 0.00045461978371742794, 'samples': 5920128, 'steps': 30833, 'loss/train': 2.1018025875091553}} 11/07/2021 01:36:15 - INFO - __main__ - Step 30838: {'lr': 0.000454607587311807, 'samples': 5920896, 'steps': 30837, 'loss/train': 1.4543534517288208}3}} 11/07/2021 01:36:17 - INFO - __main__ - Step 30842: {'lr': 0.00045459538943109774, 'samples': 5921664, 'steps': 30841, 'loss/train': 1.5130547285079956}} 11/07/2021 01:36:20 - INFO - __main__ - Step 30847: {'lr': 0.00045458014000600213, 'samples': 5922624, 'steps': 30846, 'loss/train': 1.7849777936935425}} 11/07/2021 01:36:20 - INFO - __main__ - Step 30847: {'lr': 0.00045458014000600213, 'samples': 5922624, 'steps': 30846, 'loss/train': 1.7849777936935425}} 11/07/2021 01:36:23 - INFO - __main__ - Step 30854: {'lr': 0.0004545587869393193, 'samples': 5923968, 'steps': 30853, 'loss/train': 0.9640048742294312}}} 11/07/2021 01:36:25 - INFO - __main__ - Step 30858: {'lr': 0.00045454658315913617, 'samples': 5924736, 'steps': 30857, 'loss/train': 1.4153854846954346}} 11/07/2021 01:36:28 - INFO - __main__ - Step 30863: {'lr': 0.000454531326360193, 'samples': 5925696, 'steps': 30862, 'loss/train': 1.8231418132781982}6}} 11/07/2021 01:36:30 - INFO - __main__ - Step 30868: {'lr': 0.00045451606725728337, 'samples': 5926656, 'steps': 30867, 'loss/train': 1.2515405416488647}} 11/07/2021 01:36:32 - INFO - __main__ - Step 30872: {'lr': 0.00045450385831621534, 'samples': 5927424, 'steps': 30871, 'loss/train': 1.7874201536178589}} 11/07/2021 01:36:32 - INFO - __main__ - Step 30872: {'lr': 0.00045450385831621534, 'samples': 5927424, 'steps': 30871, 'loss/train': 1.7874201536178589}} 11/07/2021 01:36:35 - INFO - __main__ - Step 30879: {'lr': 0.00045448248912176726, 'samples': 5928768, 'steps': 30878, 'loss/train': 1.218980312347412}}} 11/07/2021 01:36:38 - INFO - __main__ - Step 30884: {'lr': 0.0004544672226473201, 'samples': 5929728, 'steps': 30883, 'loss/train': 1.5864168405532837}}} 11/07/2021 01:36:40 - INFO - __main__ - Step 30888: {'lr': 0.0004544550078094182, 'samples': 5930496, 'steps': 30887, 'loss/train': 1.7455068826675415}}} 11/07/2021 01:36:42 - INFO - __main__ - Step 30892: {'lr': 0.0004544427914975279, 'samples': 5931264, 'steps': 30891, 'loss/train': 1.3994450569152832}}} 11/07/2021 01:36:43 - INFO - __main__ - Step 30896: {'lr': 0.00045443057371173727, 'samples': 5932032, 'steps': 30895, 'loss/train': 1.4993561506271362}} 11/07/2021 01:36:45 - INFO - __main__ - Step 30900: {'lr': 0.0004544183544521345, 'samples': 5932800, 'steps': 30899, 'loss/train': 2.022730827331543}2}} 11/07/2021 01:36:48 - INFO - __main__ - Step 30905: {'lr': 0.0004544030783052169, 'samples': 5933760, 'steps': 30904, 'loss/train': 1.6024492979049683}}} 11/07/2021 01:36:50 - INFO - __main__ - Step 30909: {'lr': 0.0004543908557298588, 'samples': 5934528, 'steps': 30908, 'loss/train': 1.8101469278335571}}} 11/07/2021 01:36:52 - INFO - __main__ - Step 30913: {'lr': 0.0004543786316809749, 'samples': 5935296, 'steps': 30912, 'loss/train': 1.6242061853408813}}} 11/07/2021 01:36:54 - INFO - __main__ - Step 30917: {'lr': 0.0004543664061586532, 'samples': 5936064, 'steps': 30916, 'loss/train': 1.454323649406433}}}} 11/07/2021 01:36:56 - INFO - __main__ - Step 30921: {'lr': 0.00045435417916298205, 'samples': 5936832, 'steps': 30920, 'loss/train': 1.4937081336975098}} 11/07/2021 01:36:58 - INFO - __main__ - Step 30925: {'lr': 0.0004543419506940494, 'samples': 5937600, 'steps': 30924, 'loss/train': 1.425700306892395}8}} 11/07/2021 01:37:00 - INFO - __main__ - Step 30930: {'lr': 0.0004543266630362439, 'samples': 5938560, 'steps': 30929, 'loss/train': 1.2183094024658203}}} 11/07/2021 01:37:00 - INFO - __main__ - Step 30930: {'lr': 0.0004543266630362439, 'samples': 5938560, 'steps': 30929, 'loss/train': 1.2183094024658203}}} 11/07/2021 01:37:03 - INFO - __main__ - Step 30937: {'lr': 0.0004543052564485644, 'samples': 5939904, 'steps': 30936, 'loss/train': 1.327895164489746}}}} 11/07/2021 01:37:06 - INFO - __main__ - Step 30941: {'lr': 0.0004542930220874677, 'samples': 5940672, 'steps': 30940, 'loss/train': 1.8650132417678833}}} 11/07/2021 01:37:08 - INFO - __main__ - Step 30946: {'lr': 0.0004542777270649533, 'samples': 5941632, 'steps': 30945, 'loss/train': 1.518599033355713}}}} 11/07/2021 01:37:08 - INFO - __main__ - Step 30946: {'lr': 0.0004542777270649533, 'samples': 5941632, 'steps': 30945, 'loss/train': 1.518599033355713}}}} 11/07/2021 01:37:12 - INFO - __main__ - Step 30954: {'lr': 0.0004542532502426935, 'samples': 5943168, 'steps': 30953, 'loss/train': 1.6398005485534668}}} 11/07/2021 01:37:13 - INFO - __main__ - Step 30958: {'lr': 0.00045424100962271883, 'samples': 5943936, 'steps': 30957, 'loss/train': 1.5568513870239258}} 11/07/2021 01:37:16 - INFO - __main__ - Step 30962: {'lr': 0.00045422876753029853, 'samples': 5944704, 'steps': 30961, 'loss/train': 1.60807466506958}8}} 11/07/2021 01:37:18 - INFO - __main__ - Step 30966: {'lr': 0.00045421652396552094, 'samples': 5945472, 'steps': 30965, 'loss/train': 1.6757169961929321}} 11/07/2021 01:37:20 - INFO - __main__ - Step 30970: {'lr': 0.0004542042789284744, 'samples': 5946240, 'steps': 30969, 'loss/train': 1.701396107673645}1}} 11/07/2021 01:37:22 - INFO - __main__ - Step 30974: {'lr': 0.00045419203241924705, 'samples': 5947008, 'steps': 30973, 'loss/train': 1.4163742065429688}} 11/07/2021 01:37:23 - INFO - __main__ - Step 30978: {'lr': 0.0004541797844379273, 'samples': 5947776, 'steps': 30977, 'loss/train': 1.4600822925567627}}} 11/07/2021 01:37:25 - INFO - __main__ - Step 30982: {'lr': 0.0004541675349846033, 'samples': 5948544, 'steps': 30981, 'loss/train': 1.3606353998184204}}} 11/07/2021 01:37:28 - INFO - __main__ - Step 30987: {'lr': 0.000454152221098077, 'samples': 5949504, 'steps': 30986, 'loss/train': 1.2379319667816162}}}} 11/07/2021 01:37:30 - INFO - __main__ - Step 30991: {'lr': 0.0004541399683330666, 'samples': 5950272, 'steps': 30990, 'loss/train': 1.38321053981781}}}}} 11/07/2021 01:37:32 - INFO - __main__ - Step 30995: {'lr': 0.00045412771409633905, 'samples': 5951040, 'steps': 30994, 'loss/train': 1.0903595685958862}} 11/07/2021 01:37:33 - INFO - __main__ - Step 30999: {'lr': 0.00045411545838798273, 'samples': 5951808, 'steps': 30998, 'loss/train': 1.1896541118621826}} 11/07/2021 01:37:35 - INFO - __main__ - Step 31003: {'lr': 0.000454103201208086, 'samples': 5952576, 'steps': 31002, 'loss/train': 1.9771652221679688}6}} 11/07/2021 01:37:38 - INFO - __main__ - Step 31008: {'lr': 0.00045408787766399605, 'samples': 5953536, 'steps': 31007, 'loss/train': 1.7356077432632446}} 11/07/2021 01:37:40 - INFO - __main__ - Step 31012: {'lr': 0.0004540756171734565, 'samples': 5954304, 'steps': 31011, 'loss/train': 1.0137343406677246}}} 11/07/2021 01:37:42 - INFO - __main__ - Step 31016: {'lr': 0.0004540633552116638, 'samples': 5955072, 'steps': 31015, 'loss/train': 1.128483772277832}}}} 11/07/2021 01:37:44 - INFO - __main__ - Step 31020: {'lr': 0.0004540510917787063, 'samples': 5955840, 'steps': 31019, 'loss/train': 1.4873820543289185}}} 11/07/2021 01:37:46 - INFO - __main__ - Step 31024: {'lr': 0.0004540388268746724, 'samples': 5956608, 'steps': 31023, 'loss/train': 1.7353435754776}85}}} 11/07/2021 01:37:48 - INFO - __main__ - Step 31029: {'lr': 0.0004540234936760636, 'samples': 5957568, 'steps': 31028, 'loss/train': 1.6700613498687744}}} 11/07/2021 01:37:50 - INFO - __main__ - Step 31033: {'lr': 0.0004540112254624312, 'samples': 5958336, 'steps': 31032, 'loss/train': 1.4836091995239258}}} 11/07/2021 01:37:50 - INFO - __main__ - Step 31033: {'lr': 0.0004540112254624312, 'samples': 5958336, 'steps': 31032, 'loss/train': 1.4836091995239258}}} 11/07/2021 01:37:53 - INFO - __main__ - Step 31040: {'lr': 0.0004539897525495418, 'samples': 5959680, 'steps': 31039, 'loss/train': 1.6444486379623413}}} 11/07/2021 01:37:56 - INFO - __main__ - Step 31045: {'lr': 0.00045397441199715406, 'samples': 5960640, 'steps': 31044, 'loss/train': 1.871448040008545}}} 11/07/2021 01:37:58 - INFO - __main__ - Step 31050: {'lr': 0.0004539590691470733, 'samples': 5961600, 'steps': 31049, 'loss/train': 1.3571999073028564}}} 11/07/2021 01:37:58 - INFO - __main__ - Step 31050: {'lr': 0.0004539590691470733, 'samples': 5961600, 'steps': 31049, 'loss/train': 1.3571999073028564}}} 11/07/2021 01:38:02 - INFO - __main__ - Step 31057: {'lr': 0.00045393758529716497, 'samples': 5962944, 'steps': 31056, 'loss/train': 1.5746870040893555}} 11/07/2021 01:38:03 - INFO - __main__ - Step 31061: {'lr': 0.00045392530678986775, 'samples': 5963712, 'steps': 31060, 'loss/train': 1.517366647720337}}} 11/07/2021 01:38:06 - INFO - __main__ - Step 31066: {'lr': 0.0004539099565883308, 'samples': 5964672, 'steps': 31065, 'loss/train': 1.4995030164718628}}} 11/07/2021 01:38:06 - INFO - __main__ - Step 31066: {'lr': 0.0004539099565883308, 'samples': 5964672, 'steps': 31065, 'loss/train': 1.4995030164718628}}} 11/07/2021 01:38:10 - INFO - __main__ - Step 31074: {'lr': 0.00045388539148825214, 'samples': 5966208, 'steps': 31073, 'loss/train': 1.272490382194519}}} 11/07/2021 01:38:12 - INFO - __main__ - Step 31078: {'lr': 0.0004538731067333459, 'samples': 5966976, 'steps': 31077, 'loss/train': 1.8784071207046509}}} 11/07/2021 01:38:14 - INFO - __main__ - Step 31082: {'lr': 0.0004538608205086464, 'samples': 5967744, 'steps': 31081, 'loss/train': 1.457392692565918}}}} 11/07/2021 01:38:16 - INFO - __main__ - Step 31087: {'lr': 0.0004538454606610103, 'samples': 5968704, 'steps': 31086, 'loss/train': 1.311660647392273}}}} 11/07/2021 01:38:19 - INFO - __main__ - Step 31091: {'lr': 0.00045383317112959997, 'samples': 5969472, 'steps': 31090, 'loss/train': 0.956122636795044}}} 11/07/2021 01:38:21 - INFO - __main__ - Step 31095: {'lr': 0.0004538208801286843, 'samples': 5970240, 'steps': 31094, 'loss/train': 1.6640911102294922}}} 11/07/2021 01:38:22 - INFO - __main__ - Step 31099: {'lr': 0.000453808587658352, 'samples': 5971008, 'steps': 31098, 'loss/train': 1.3138790130615234}}}} 11/07/2021 01:38:24 - INFO - __main__ - Step 31103: {'lr': 0.0004537962937186916, 'samples': 5971776, 'steps': 31102, 'loss/train': 1.065061092376709}}}} 11/07/2021 01:38:24 - INFO - __main__ - Step 31103: {'lr': 0.0004537962937186916, 'samples': 5971776, 'steps': 31102, 'loss/train': 1.065061092376709}}}} 11/07/2021 01:38:28 - INFO - __main__ - Step 31111: {'lr': 0.0004537717014317411, 'samples': 5973312, 'steps': 31110, 'loss/train': 1.3271428346633911}}} 11/07/2021 01:38:30 - INFO - __main__ - Step 31115: {'lr': 0.00045375940308462826, 'samples': 5974080, 'steps': 31114, 'loss/train': 1.6915082931518555}} 11/07/2021 01:38:32 - INFO - __main__ - Step 31119: {'lr': 0.00045374710326854194, 'samples': 5974848, 'steps': 31118, 'loss/train': 1.8023872375488281}} 11/07/2021 01:38:34 - INFO - __main__ - Step 31123: {'lr': 0.0004537348019835709, 'samples': 5975616, 'steps': 31122, 'loss/train': 1.7239537239074707}}} 11/07/2021 01:38:36 - INFO - __main__ - Step 31128: {'lr': 0.00045371942331187286, 'samples': 5976576, 'steps': 31127, 'loss/train': 1.8973110914230347}} 11/07/2021 01:38:36 - INFO - __main__ - Step 31128: {'lr': 0.00045371942331187286, 'samples': 5976576, 'steps': 31127, 'loss/train': 1.8973110914230347}} 11/07/2021 01:38:40 - INFO - __main__ - Step 31134: {'lr': 0.00045370096587668714, 'samples': 5977728, 'steps': 31133, 'loss/train': 1.3561971187591553}} 11/07/2021 01:38:40 - INFO - __main__ - Step 31134: {'lr': 0.00045370096587668714, 'samples': 5977728, 'steps': 31133, 'loss/train': 1.3561971187591553}} 11/07/2021 01:38:44 - INFO - __main__ - Step 31143: {'lr': 0.0004536732735285476, 'samples': 5979456, 'steps': 31142, 'loss/train': 1.359701156616211}3}} 11/07/2021 01:38:44 - INFO - __main__ - Step 31143: {'lr': 0.0004536732735285476, 'samples': 5979456, 'steps': 31142, 'loss/train': 1.359701156616211}3}} 11/07/2021 01:38:48 - INFO - __main__ - Step 31150: {'lr': 0.0004536517298962645, 'samples': 5980800, 'steps': 31149, 'loss/train': 2.274333953857422}3}} 11/07/2021 01:38:50 - INFO - __main__ - Step 31154: {'lr': 0.00045363941723044386, 'samples': 5981568, 'steps': 31153, 'loss/train': 1.6942732334136963}} 11/07/2021 01:38:52 - INFO - __main__ - Step 31158: {'lr': 0.0004536271030965148, 'samples': 5982336, 'steps': 31157, 'loss/train': 1.5780308246612549}}} 11/07/2021 01:38:54 - INFO - __main__ - Step 31162: {'lr': 0.00045361478749456595, 'samples': 5983104, 'steps': 31161, 'loss/train': 1.8032037019729614}} 11/07/2021 01:38:56 - INFO - __main__ - Step 31168: {'lr': 0.00045359631133930016, 'samples': 5984256, 'steps': 31167, 'loss/train': 1.7573238611221313}} 11/07/2021 01:38:56 - INFO - __main__ - Step 31168: {'lr': 0.00045359631133930016, 'samples': 5984256, 'steps': 31167, 'loss/train': 1.7573238611221313}} 11/07/2021 01:39:01 - INFO - __main__ - Step 31175: {'lr': 0.0004535747516507947, 'samples': 5985600, 'steps': 31174, 'loss/train': 1.1287404298782349}}} 11/07/2021 01:39:02 - INFO - __main__ - Step 31179: {'lr': 0.0004535624298107529, 'samples': 5986368, 'steps': 31178, 'loss/train': 1.567328929901123}}}} 11/07/2021 01:39:04 - INFO - __main__ - Step 31183: {'lr': 0.0004535501065031577, 'samples': 5987136, 'steps': 31182, 'loss/train': 1.4410059452056885}}} 11/07/2021 01:39:04 - INFO - __main__ - Step 31183: {'lr': 0.0004535501065031577, 'samples': 5987136, 'steps': 31182, 'loss/train': 1.4410059452056885}}} 11/07/2021 01:39:08 - INFO - __main__ - Step 31192: {'lr': 0.00045352237369578643, 'samples': 5988864, 'steps': 31191, 'loss/train': 1.604499340057373}}} 11/07/2021 01:39:11 - INFO - __main__ - Step 31196: {'lr': 0.0004535100456192562, 'samples': 5989632, 'steps': 31195, 'loss/train': 1.4107824563980103}}} 11/07/2021 01:39:12 - INFO - __main__ - Step 31200: {'lr': 0.00045349771607555017, 'samples': 5990400, 'steps': 31199, 'loss/train': 1.4114199876785278}} 11/07/2021 01:39:14 - INFO - __main__ - Step 31204: {'lr': 0.0004534853850647572, 'samples': 5991168, 'steps': 31203, 'loss/train': 0.8451942205429077}}} 11/07/2021 01:39:17 - INFO - __main__ - Step 31209: {'lr': 0.0004534699692383106, 'samples': 5992128, 'steps': 31208, 'loss/train': 2.1556811332702637}}} 11/07/2021 01:39:19 - INFO - __main__ - Step 31213: {'lr': 0.0004534576349268973, 'samples': 5992896, 'steps': 31212, 'loss/train': 1.6039478778839111}}} 11/07/2021 01:39:21 - INFO - __main__ - Step 31217: {'lr': 0.00045344529914868593, 'samples': 5993664, 'steps': 31216, 'loss/train': 1.2807214260101318}} 11/07/2021 01:39:22 - INFO - __main__ - Step 31221: {'lr': 0.00045343296190376566, 'samples': 5994432, 'steps': 31220, 'loss/train': 1.400227665901184}}} 11/07/2021 01:39:24 - INFO - __main__ - Step 31225: {'lr': 0.0004534206231922253, 'samples': 5995200, 'steps': 31224, 'loss/train': 1.1981773376464844}}} 11/07/2021 01:39:27 - INFO - __main__ - Step 31230: {'lr': 0.00045340519774050093, 'samples': 5996160, 'steps': 31229, 'loss/train': 1.1463749408721924}} 11/07/2021 01:39:27 - INFO - __main__ - Step 31230: {'lr': 0.00045340519774050093, 'samples': 5996160, 'steps': 31229, 'loss/train': 1.1463749408721924}} 11/07/2021 01:39:30 - INFO - __main__ - Step 31236: {'lr': 0.00045338668417395595, 'samples': 5997312, 'steps': 31235, 'loss/train': 0.497263103723526}}} 11/07/2021 01:39:33 - INFO - __main__ - Step 31241: {'lr': 0.0004533712536816426, 'samples': 5998272, 'steps': 31240, 'loss/train': 1.227736234664917}}}} 11/07/2021 01:39:35 - INFO - __main__ - Step 31245: {'lr': 0.00045335890763833646, 'samples': 5999040, 'steps': 31244, 'loss/train': 2.0331690311431885}} 11/07/2021 01:39:37 - INFO - __main__ - Step 31249: {'lr': 0.00045334656012894424, 'samples': 5999808, 'steps': 31248, 'loss/train': 1.0691889524459839}} 11/07/2021 01:39:39 - INFO - __main__ - Step 31253: {'lr': 0.00045333421115355477, 'samples': 6000576, 'steps': 31252, 'loss/train': 1.006943702697754}}} 11/07/2021 01:39:40 - INFO - __main__ - Step 31257: {'lr': 0.00045332186071225724, 'samples': 6001344, 'steps': 31256, 'loss/train': 1.2591632604599}4}}} 11/07/2021 01:39:42 - INFO - __main__ - Step 31261: {'lr': 0.00045330950880514065, 'samples': 6002112, 'steps': 31260, 'loss/train': 1.3240946531295776}} 11/07/2021 01:39:45 - INFO - __main__ - Step 31266: {'lr': 0.0004532940668600724, 'samples': 6003072, 'steps': 31265, 'loss/train': 0.8703139424324036}}} 11/07/2021 01:39:47 - INFO - __main__ - Step 31270: {'lr': 0.0004532817116551884, 'samples': 6003840, 'steps': 31269, 'loss/train': 1.169777512550354}}}} 11/07/2021 01:39:49 - INFO - __main__ - Step 31274: {'lr': 0.00045326935498477477, 'samples': 6004608, 'steps': 31273, 'loss/train': 1.14182448387146}}}} 11/07/2021 01:39:50 - INFO - __main__ - Step 31278: {'lr': 0.00045325699684892065, 'samples': 6005376, 'steps': 31277, 'loss/train': 1.491533875465393}}} 11/07/2021 01:39:52 - INFO - __main__ - Step 31282: {'lr': 0.000453244637247715, 'samples': 6006144, 'steps': 31281, 'loss/train': 0.903304934501648}3}}} 11/07/2021 01:39:55 - INFO - __main__ - Step 31287: {'lr': 0.00045322918568569315, 'samples': 6007104, 'steps': 31286, 'loss/train': 1.453641414642334}}} 11/07/2021 01:39:57 - INFO - __main__ - Step 31291: {'lr': 0.00045321682278777253, 'samples': 6007872, 'steps': 31290, 'loss/train': 1.9191278219223022}} 11/07/2021 01:39:59 - INFO - __main__ - Step 31295: {'lr': 0.0004532044584247901, 'samples': 6008640, 'steps': 31294, 'loss/train': 1.4246947765350342}}} 11/07/2021 01:40:00 - INFO - __main__ - Step 31299: {'lr': 0.00045319209259683503, 'samples': 6009408, 'steps': 31298, 'loss/train': 1.696137547492981}}} 11/07/2021 01:40:03 - INFO - __main__ - Step 31303: {'lr': 0.00045317972530399634, 'samples': 6010176, 'steps': 31302, 'loss/train': 1.545304536819458}}} 11/07/2021 01:40:03 - INFO - __main__ - Step 31303: {'lr': 0.00045317972530399634, 'samples': 6010176, 'steps': 31302, 'loss/train': 1.545304536819458}}} 11/07/2021 01:40:07 - INFO - __main__ - Step 31311: {'lr': 0.00045315498632402494, 'samples': 6011712, 'steps': 31310, 'loss/train': 1.181226372718811}}} 11/07/2021 01:40:09 - INFO - __main__ - Step 31315: {'lr': 0.00045314261463707064, 'samples': 6012480, 'steps': 31314, 'loss/train': 0.8304257392883301}} 11/07/2021 01:40:11 - INFO - __main__ - Step 31319: {'lr': 0.0004531302414855895, 'samples': 6013248, 'steps': 31318, 'loss/train': 1.498241662979126}1}} 11/07/2021 01:40:11 - INFO - __main__ - Step 31319: {'lr': 0.0004531302414855895, 'samples': 6013248, 'steps': 31318, 'loss/train': 1.498241662979126}1}} 11/07/2021 01:40:11 - INFO - __main__ - Step 31319: {'lr': 0.0004531302414855895, 'samples': 6013248, 'steps': 31318, 'loss/train': 1.498241662979126}1}} 11/07/2021 01:40:16 - INFO - __main__ - Step 31330: {'lr': 0.00045309620776827817, 'samples': 6015360, 'steps': 31329, 'loss/train': 1.5829106569290161}} 11/07/2021 01:40:18 - INFO - __main__ - Step 31334: {'lr': 0.0004530838291256159, 'samples': 6016128, 'steps': 31333, 'loss/train': 1.8291977643966675}}} 11/07/2021 01:40:21 - INFO - __main__ - Step 31339: {'lr': 0.00045306835376340366, 'samples': 6017088, 'steps': 31338, 'loss/train': 1.6746957302093506}} 11/07/2021 01:40:23 - INFO - __main__ - Step 31343: {'lr': 0.0004530559718266351, 'samples': 6017856, 'steps': 31342, 'loss/train': 1.5785162448883057}}} 11/07/2021 01:40:25 - INFO - __main__ - Step 31347: {'lr': 0.0004530435884259644, 'samples': 6018624, 'steps': 31346, 'loss/train': 1.60947847366333}7}}} 11/07/2021 01:40:26 - INFO - __main__ - Step 31351: {'lr': 0.00045303120356148067, 'samples': 6019392, 'steps': 31350, 'loss/train': 1.539341688156128}}} 11/07/2021 01:40:28 - INFO - __main__ - Step 31355: {'lr': 0.0004530188172332733, 'samples': 6020160, 'steps': 31354, 'loss/train': 1.6850532293319702}}} 11/07/2021 01:40:31 - INFO - __main__ - Step 31360: {'lr': 0.00045300333226478887, 'samples': 6021120, 'steps': 31359, 'loss/train': 1.4599359035491943}} 11/07/2021 01:40:33 - INFO - __main__ - Step 31364: {'lr': 0.00045299094264352987, 'samples': 6021888, 'steps': 31363, 'loss/train': 1.5297051668167114}} 11/07/2021 01:40:33 - INFO - __main__ - Step 31364: {'lr': 0.00045299094264352987, 'samples': 6021888, 'steps': 31363, 'loss/train': 1.5297051668167114}} 11/07/2021 01:40:36 - INFO - __main__ - Step 31371: {'lr': 0.0004529692572849938, 'samples': 6023232, 'steps': 31370, 'loss/train': 1.7392520904541016}}} 11/07/2021 01:40:38 - INFO - __main__ - Step 31376: {'lr': 0.0004529537649995099, 'samples': 6024192, 'steps': 31375, 'loss/train': 1.51448392868042}6}}} 11/07/2021 01:40:41 - INFO - __main__ - Step 31380: {'lr': 0.00045294136952505346, 'samples': 6024960, 'steps': 31379, 'loss/train': 1.8567157983779907}} 11/07/2021 01:40:43 - INFO - __main__ - Step 31384: {'lr': 0.00045292897258752095, 'samples': 6025728, 'steps': 31383, 'loss/train': 5.561633110046387}}} 11/07/2021 01:40:44 - INFO - __main__ - Step 31388: {'lr': 0.0004529165741870018, 'samples': 6026496, 'steps': 31387, 'loss/train': 1.2687602043151855}}} 11/07/2021 01:40:46 - INFO - __main__ - Step 31392: {'lr': 0.00045290417432358553, 'samples': 6027264, 'steps': 31391, 'loss/train': 1.7551089525222778}} 11/07/2021 01:40:49 - INFO - __main__ - Step 31397: {'lr': 0.00045288867243725207, 'samples': 6028224, 'steps': 31396, 'loss/train': 0.6855195760726929}} 11/07/2021 01:40:51 - INFO - __main__ - Step 31401: {'lr': 0.0004528762692826439, 'samples': 6028992, 'steps': 31400, 'loss/train': 1.1373130083084106}}} 11/07/2021 01:40:53 - INFO - __main__ - Step 31405: {'lr': 0.00045286386466542896, 'samples': 6029760, 'steps': 31404, 'loss/train': 0.9506607055664062}} 11/07/2021 01:40:54 - INFO - __main__ - Step 31409: {'lr': 0.0004528514585856968, 'samples': 6030528, 'steps': 31408, 'loss/train': 1.5592893362045288}}} 11/07/2021 01:40:56 - INFO - __main__ - Step 31413: {'lr': 0.0004528390510435368, 'samples': 6031296, 'steps': 31412, 'loss/train': 1.2986992597579956}}} 11/07/2021 01:40:59 - INFO - __main__ - Step 31418: {'lr': 0.00045282353955943417, 'samples': 6032256, 'steps': 31417, 'loss/train': 1.755852222442627}}} 11/07/2021 01:41:01 - INFO - __main__ - Step 31422: {'lr': 0.0004528111287271388, 'samples': 6033024, 'steps': 31421, 'loss/train': 1.262176752090454}}}} 11/07/2021 01:41:03 - INFO - __main__ - Step 31426: {'lr': 0.0004527987164327063, 'samples': 6033792, 'steps': 31425, 'loss/train': 1.0587403774261475}}} 11/07/2021 01:41:03 - INFO - __main__ - Step 31426: {'lr': 0.0004527987164327063, 'samples': 6033792, 'steps': 31425, 'loss/train': 1.0587403774261475}}} 11/07/2021 01:41:06 - INFO - __main__ - Step 31433: {'lr': 0.0004527769913994515, 'samples': 6035136, 'steps': 31432, 'loss/train': 1.1822314262390137}}} 11/07/2021 01:41:09 - INFO - __main__ - Step 31439: {'lr': 0.0004527583663789986, 'samples': 6036288, 'steps': 31438, 'loss/train': 0.918387770652771}}}} 11/07/2021 01:41:11 - INFO - __main__ - Step 31444: {'lr': 0.00045274284301621414, 'samples': 6037248, 'steps': 31443, 'loss/train': 1.4954262971878052}} 11/07/2021 01:41:13 - INFO - __main__ - Step 31448: {'lr': 0.0004527304226816278, 'samples': 6038016, 'steps': 31447, 'loss/train': 1.6347882747650146}}} 11/07/2021 01:41:13 - INFO - __main__ - Step 31448: {'lr': 0.0004527304226816278, 'samples': 6038016, 'steps': 31447, 'loss/train': 1.6347882747650146}}} 11/07/2021 01:41:17 - INFO - __main__ - Step 31455: {'lr': 0.0004527086835792884, 'samples': 6039360, 'steps': 31454, 'loss/train': 1.391819715499878}}}} 11/07/2021 01:41:17 - INFO - __main__ - Step 31455: {'lr': 0.0004527086835792884, 'samples': 6039360, 'steps': 31454, 'loss/train': 1.391819715499878}}}} 11/07/2021 01:41:21 - INFO - __main__ - Step 31463: {'lr': 0.0004526838334106842, 'samples': 6040896, 'steps': 31462, 'loss/train': 1.9305704832077026}}} 11/07/2021 01:41:21 - INFO - __main__ - Step 31463: {'lr': 0.0004526838334106842, 'samples': 6040896, 'steps': 31462, 'loss/train': 1.9305704832077026}}} 11/07/2021 01:41:24 - INFO - __main__ - Step 31471: {'lr': 0.00045265897739720277, 'samples': 6042432, 'steps': 31470, 'loss/train': 1.2458423376083374}} 11/07/2021 01:41:26 - INFO - __main__ - Step 31475: {'lr': 0.000452646547198857, 'samples': 6043200, 'steps': 31474, 'loss/train': 1.3974919319152832}4}} 11/07/2021 01:41:29 - INFO - __main__ - Step 31480: {'lr': 0.00045263100739647373, 'samples': 6044160, 'steps': 31479, 'loss/train': 1.4764176607131958}} 11/07/2021 01:41:31 - INFO - __main__ - Step 31485: {'lr': 0.0004526154653115303, 'samples': 6045120, 'steps': 31484, 'loss/train': 1.9318288564682007}}} 11/07/2021 01:41:33 - INFO - __main__ - Step 31489: {'lr': 0.00045260303000024994, 'samples': 6045888, 'steps': 31488, 'loss/train': 1.7852832078933716}} 11/07/2021 01:41:33 - INFO - __main__ - Step 31489: {'lr': 0.00045260303000024994, 'samples': 6045888, 'steps': 31488, 'loss/train': 1.7852832078933716}} 11/07/2021 01:41:36 - INFO - __main__ - Step 31496: {'lr': 0.0004525812646909059, 'samples': 6047232, 'steps': 31495, 'loss/train': 1.5362430810928345}}} 11/07/2021 01:41:39 - INFO - __main__ - Step 31501: {'lr': 0.00045256571530294664, 'samples': 6048192, 'steps': 31500, 'loss/train': 1.1621021032333374}} 11/07/2021 01:41:41 - INFO - __main__ - Step 31506: {'lr': 0.0004525501636331628, 'samples': 6049152, 'steps': 31505, 'loss/train': 1.6992638111114502}}} 11/07/2021 01:41:41 - INFO - __main__ - Step 31506: {'lr': 0.0004525501636331628, 'samples': 6049152, 'steps': 31505, 'loss/train': 1.6992638111114502}}} 11/07/2021 01:41:45 - INFO - __main__ - Step 31513: {'lr': 0.0004525283874623336, 'samples': 6050496, 'steps': 31512, 'loss/train': 1.5510832071304321}}} 11/07/2021 01:41:46 - INFO - __main__ - Step 31517: {'lr': 0.000452515941928479, 'samples': 6051264, 'steps': 31516, 'loss/train': 1.4280192852020264}}}} 11/07/2021 01:41:48 - INFO - __main__ - Step 31521: {'lr': 0.0004525034949346155, 'samples': 6052032, 'steps': 31520, 'loss/train': 1.63162100315094}}}}} 11/07/2021 01:41:51 - INFO - __main__ - Step 31527: {'lr': 0.0004524848217064997, 'samples': 6053184, 'steps': 31526, 'loss/train': 1.5448814630508423}}} 11/07/2021 01:41:53 - INFO - __main__ - Step 31531: {'lr': 0.0004524723710630064, 'samples': 6053952, 'steps': 31530, 'loss/train': 1.370402455329895}}}} 11/07/2021 01:41:55 - INFO - __main__ - Step 31535: {'lr': 0.0004524599189598183, 'samples': 6054720, 'steps': 31534, 'loss/train': 2.3842573165893555}}} 11/07/2021 01:41:55 - INFO - __main__ - Step 31535: {'lr': 0.0004524599189598183, 'samples': 6054720, 'steps': 31534, 'loss/train': 2.3842573165893555}}} 11/07/2021 01:41:59 - INFO - __main__ - Step 31542: {'lr': 0.00045243812426711856, 'samples': 6056064, 'steps': 31541, 'loss/train': 1.4651298522949219}} 11/07/2021 01:42:01 - INFO - __main__ - Step 31547: {'lr': 0.0004524225538929829, 'samples': 6057024, 'steps': 31546, 'loss/train': 1.714564323425293}9}} 11/07/2021 01:42:01 - INFO - __main__ - Step 31547: {'lr': 0.0004524225538929829, 'samples': 6057024, 'steps': 31546, 'loss/train': 1.714564323425293}9}} 11/07/2021 01:42:05 - INFO - __main__ - Step 31554: {'lr': 0.00045240075153847625, 'samples': 6058368, 'steps': 31553, 'loss/train': 1.6579509973526}}9}} 11/07/2021 01:42:07 - INFO - __main__ - Step 31558: {'lr': 0.00045238829104378545, 'samples': 6059136, 'steps': 31557, 'loss/train': 1.4942617416381836}} 11/07/2021 01:42:09 - INFO - __main__ - Step 31563: {'lr': 0.00045237271337358897, 'samples': 6060096, 'steps': 31562, 'loss/train': 1.5823599100112915}} 11/07/2021 01:42:11 - INFO - __main__ - Step 31567: {'lr': 0.00045236024959607505, 'samples': 6060864, 'steps': 31566, 'loss/train': 1.4078350067138672}} 11/07/2021 01:42:13 - INFO - __main__ - Step 31571: {'lr': 0.0004523477843596746, 'samples': 6061632, 'steps': 31570, 'loss/train': 1.3582419157028198}}} 11/07/2021 01:42:15 - INFO - __main__ - Step 31575: {'lr': 0.00045233531766447757, 'samples': 6062400, 'steps': 31574, 'loss/train': 1.8818330764770508}} 11/07/2021 01:42:17 - INFO - __main__ - Step 31579: {'lr': 0.00045232284951057366, 'samples': 6063168, 'steps': 31578, 'loss/train': 1.141363263130188}}} 11/07/2021 01:42:19 - INFO - __main__ - Step 31584: {'lr': 0.00045230726226702444, 'samples': 6064128, 'steps': 31583, 'loss/train': 1.5771918296813965}} 11/07/2021 01:42:21 - INFO - __main__ - Step 31588: {'lr': 0.00045229479083135917, 'samples': 6064896, 'steps': 31587, 'loss/train': 2.3265585899353027}} 11/07/2021 01:42:21 - INFO - __main__ - Step 31588: {'lr': 0.00045229479083135917, 'samples': 6064896, 'steps': 31587, 'loss/train': 2.3265585899353027}} 11/07/2021 01:42:25 - INFO - __main__ - Step 31594: {'lr': 0.0004522760809433619, 'samples': 6066048, 'steps': 31593, 'loss/train': 5.449794292449951}7}} 11/07/2021 01:42:27 - INFO - __main__ - Step 31600: {'lr': 0.0004522573677742353, 'samples': 6067200, 'steps': 31599, 'loss/train': 1.8566700220108032}}} 11/07/2021 01:42:30 - INFO - __main__ - Step 31604: {'lr': 0.00045224489050545125, 'samples': 6067968, 'steps': 31603, 'loss/train': 1.3735251426696777}} 11/07/2021 01:42:30 - INFO - __main__ - Step 31604: {'lr': 0.00045224489050545125, 'samples': 6067968, 'steps': 31603, 'loss/train': 1.3735251426696777}} 11/07/2021 01:42:33 - INFO - __main__ - Step 31611: {'lr': 0.00045222305177668875, 'samples': 6069312, 'steps': 31610, 'loss/train': 0.9692068696022034}} 11/07/2021 01:42:35 - INFO - __main__ - Step 31616: {'lr': 0.0004522074499511299, 'samples': 6070272, 'steps': 31615, 'loss/train': 1.5862935781478882}}} 11/07/2021 01:42:35 - INFO - __main__ - Step 31616: {'lr': 0.0004522074499511299, 'samples': 6070272, 'steps': 31615, 'loss/train': 1.5862935781478882}}} 11/07/2021 01:42:39 - INFO - __main__ - Step 31624: {'lr': 0.0004521824822925078, 'samples': 6071808, 'steps': 31623, 'loss/train': 1.0066297054290771}}} 11/07/2021 01:42:41 - INFO - __main__ - Step 31628: {'lr': 0.00045216999627674436, 'samples': 6072576, 'steps': 31627, 'loss/train': 1.5675814151763916}} 11/07/2021 01:42:43 - INFO - __main__ - Step 31632: {'lr': 0.00045215750880346617, 'samples': 6073344, 'steps': 31631, 'loss/train': 1.265383243560791}}} 11/07/2021 01:42:45 - INFO - __main__ - Step 31637: {'lr': 0.0004521418974123751, 'samples': 6074304, 'steps': 31636, 'loss/train': 1.620078682899475}}}} 11/07/2021 01:42:45 - INFO - __main__ - Step 31637: {'lr': 0.0004521418974123751, 'samples': 6074304, 'steps': 31636, 'loss/train': 1.620078682899475}}}} 11/07/2021 01:42:49 - INFO - __main__ - Step 31643: {'lr': 0.0004521231607373747, 'samples': 6075456, 'steps': 31642, 'loss/train': 1.7724577188491821}}} 11/07/2021 01:42:49 - INFO - __main__ - Step 31643: {'lr': 0.0004521231607373747, 'samples': 6075456, 'steps': 31642, 'loss/train': 1.7724577188491821}}} 11/07/2021 01:42:52 - INFO - __main__ - Step 31651: {'lr': 0.0004520981734039731, 'samples': 6076992, 'steps': 31650, 'loss/train': 1.3422218561172485}}} 11/07/2021 01:42:54 - INFO - __main__ - Step 31655: {'lr': 0.0004520856775517316, 'samples': 6077760, 'steps': 31654, 'loss/train': 1.7040534019470215}}} 11/07/2021 01:42:57 - INFO - __main__ - Step 31660: {'lr': 0.0004520700556876648, 'samples': 6078720, 'steps': 31659, 'loss/train': 1.4073563814163208}}} 11/07/2021 01:42:57 - INFO - __main__ - Step 31660: {'lr': 0.0004520700556876648, 'samples': 6078720, 'steps': 31659, 'loss/train': 1.4073563814163208}}} 11/07/2021 01:42:57 - INFO - __main__ - Step 31660: {'lr': 0.0004520700556876648, 'samples': 6078720, 'steps': 31659, 'loss/train': 1.4073563814163208}}} 11/07/2021 01:43:03 - INFO - __main__ - Step 31671: {'lr': 0.00045203567957459657, 'samples': 6080832, 'steps': 31670, 'loss/train': 1.2069718837738037}} 11/07/2021 01:43:05 - INFO - __main__ - Step 31676: {'lr': 0.00045202005042717743, 'samples': 6081792, 'steps': 31675, 'loss/train': 1.475319266319275}}} 11/07/2021 01:43:07 - INFO - __main__ - Step 31681: {'lr': 0.0004520044190040804, 'samples': 6082752, 'steps': 31680, 'loss/train': 1.6463545560836792}}} 11/07/2021 01:43:07 - INFO - __main__ - Step 31681: {'lr': 0.0004520044190040804, 'samples': 6082752, 'steps': 31680, 'loss/train': 1.6463545560836792}}} 11/07/2021 01:43:11 - INFO - __main__ - Step 31688: {'lr': 0.00045198253118894084, 'samples': 6084096, 'steps': 31687, 'loss/train': 1.5900429487228394}} 11/07/2021 01:43:13 - INFO - __main__ - Step 31692: {'lr': 0.0004519700218637482, 'samples': 6084864, 'steps': 31691, 'loss/train': 1.5056148767471313}}} 11/07/2021 01:43:15 - INFO - __main__ - Step 31697: {'lr': 0.0004519543831596652, 'samples': 6085824, 'steps': 31696, 'loss/train': 1.0026839971542358}}} 11/07/2021 01:43:17 - INFO - __main__ - Step 31701: {'lr': 0.0004519418705584348, 'samples': 6086592, 'steps': 31700, 'loss/train': 1.449439525604248}}}} 11/07/2021 01:43:17 - INFO - __main__ - Step 31701: {'lr': 0.0004519418705584348, 'samples': 6086592, 'steps': 31700, 'loss/train': 1.449439525604248}}}} 11/07/2021 01:43:21 - INFO - __main__ - Step 31707: {'lr': 0.0004519230989268606, 'samples': 6087744, 'steps': 31706, 'loss/train': 1.0818159580230713}}} 11/07/2021 01:43:21 - INFO - __main__ - Step 31707: {'lr': 0.0004519230989268606, 'samples': 6087744, 'steps': 31706, 'loss/train': 1.0818159580230713}}} 11/07/2021 01:43:21 - INFO - __main__ - Step 31707: {'lr': 0.0004519230989268606, 'samples': 6087744, 'steps': 31706, 'loss/train': 1.0818159580230713}}} 11/07/2021 01:43:27 - INFO - __main__ - Step 31718: {'lr': 0.0004518886757622435, 'samples': 6089856, 'steps': 31717, 'loss/train': 1.869469165802002}}}} 11/07/2021 01:43:29 - INFO - __main__ - Step 31724: {'lr': 0.00045186989485115014, 'samples': 6091008, 'steps': 31723, 'loss/train': 1.8566874265670776}} 11/07/2021 01:43:32 - INFO - __main__ - Step 31728: {'lr': 0.0004518573724245467, 'samples': 6091776, 'steps': 31727, 'loss/train': 1.43443763256073}76}} 11/07/2021 01:43:33 - INFO - __main__ - Step 31732: {'lr': 0.00045184484854268216, 'samples': 6092544, 'steps': 31731, 'loss/train': 2.2386746406555176}} 11/07/2021 01:43:35 - INFO - __main__ - Step 31736: {'lr': 0.0004518323232056468, 'samples': 6093312, 'steps': 31735, 'loss/train': 1.3299400806427002}}} 11/07/2021 01:43:37 - INFO - __main__ - Step 31741: {'lr': 0.0004518166644881563, 'samples': 6094272, 'steps': 31740, 'loss/train': 1.8045982122421265}}} 11/07/2021 01:43:40 - INFO - __main__ - Step 31745: {'lr': 0.0004518041358773168, 'samples': 6095040, 'steps': 31744, 'loss/train': 1.9787827730178833}}} 11/07/2021 01:43:40 - INFO - __main__ - Step 31745: {'lr': 0.0004518041358773168, 'samples': 6095040, 'steps': 31744, 'loss/train': 1.9787827730178833}}} 11/07/2021 01:43:43 - INFO - __main__ - Step 31752: {'lr': 0.00045178220730760367, 'samples': 6096384, 'steps': 31751, 'loss/train': 1.2757604122161865}} 11/07/2021 01:43:45 - INFO - __main__ - Step 31757: {'lr': 0.00045176654131589617, 'samples': 6097344, 'steps': 31756, 'loss/train': 1.2734031677246094}} 11/07/2021 01:43:47 - INFO - __main__ - Step 31761: {'lr': 0.0004517540068860897, 'samples': 6098112, 'steps': 31760, 'loss/train': 1.867502212524414}4}} 11/07/2021 01:43:49 - INFO - __main__ - Step 31765: {'lr': 0.00045174147100176734, 'samples': 6098880, 'steps': 31764, 'loss/train': 1.377150535583496}}} 11/07/2021 01:43:51 - INFO - __main__ - Step 31769: {'lr': 0.0004517289336630195, 'samples': 6099648, 'steps': 31768, 'loss/train': 1.407793641090393}}}} 11/07/2021 01:43:53 - INFO - __main__ - Step 31773: {'lr': 0.0004517163948699365, 'samples': 6100416, 'steps': 31772, 'loss/train': 1.575675129890442}}}} 11/07/2021 01:43:55 - INFO - __main__ - Step 31778: {'lr': 0.0004517007193335617, 'samples': 6101376, 'steps': 31777, 'loss/train': 1.0152171850204468}}} 11/07/2021 01:43:55 - INFO - __main__ - Step 31778: {'lr': 0.0004517007193335617, 'samples': 6101376, 'steps': 31777, 'loss/train': 1.0152171850204468}}} 11/07/2021 01:44:00 - INFO - __main__ - Step 31786: {'lr': 0.0004516756337495075, 'samples': 6102912, 'steps': 31785, 'loss/train': 1.448751449584961}}}} 11/07/2021 01:44:01 - INFO - __main__ - Step 31790: {'lr': 0.0004516630887765089, 'samples': 6103680, 'steps': 31789, 'loss/train': 1.7036118507385254}}} 11/07/2021 01:44:03 - INFO - __main__ - Step 31794: {'lr': 0.00045165054234964984, 'samples': 6104448, 'steps': 31793, 'loss/train': 1.4671131372451782}} 11/07/2021 01:44:05 - INFO - __main__ - Step 31799: {'lr': 0.0004516348572717227, 'samples': 6105408, 'steps': 31798, 'loss/train': 1.4903631210327148}}} 11/07/2021 01:44:05 - INFO - __main__ - Step 31799: {'lr': 0.0004516348572717227, 'samples': 6105408, 'steps': 31798, 'loss/train': 1.4903631210327148}}} 11/07/2021 01:44:10 - INFO - __main__ - Step 31807: {'lr': 0.00045160975642272795, 'samples': 6106944, 'steps': 31806, 'loss/train': 1.7411733865737915}} 11/07/2021 01:44:12 - INFO - __main__ - Step 31811: {'lr': 0.0004515972038179714, 'samples': 6107712, 'steps': 31810, 'loss/train': 1.3670642375946045}}} 11/07/2021 01:44:13 - INFO - __main__ - Step 31815: {'lr': 0.0004515846497598294, 'samples': 6108480, 'steps': 31814, 'loss/train': 1.7123581171035767}}} 11/07/2021 01:44:15 - INFO - __main__ - Step 31819: {'lr': 0.00045157209424839253, 'samples': 6109248, 'steps': 31818, 'loss/train': 1.529592752456665}}} 11/07/2021 01:44:18 - INFO - __main__ - Step 31824: {'lr': 0.00045155639781553825, 'samples': 6110208, 'steps': 31823, 'loss/train': 1.6955679655075073}} 11/07/2021 01:44:20 - INFO - __main__ - Step 31828: {'lr': 0.0004515438390345188, 'samples': 6110976, 'steps': 31827, 'loss/train': 1.3006891012191772}}} 11/07/2021 01:44:22 - INFO - __main__ - Step 31832: {'lr': 0.0004515312788004986, 'samples': 6111744, 'steps': 31831, 'loss/train': 1.4673216342926025}}} 11/07/2021 01:44:23 - INFO - __main__ - Step 31836: {'lr': 0.00045151871711356827, 'samples': 6112512, 'steps': 31835, 'loss/train': 0.9932732582092285}} 11/07/2021 01:44:26 - INFO - __main__ - Step 31840: {'lr': 0.00045150615397381835, 'samples': 6113280, 'steps': 31839, 'loss/train': 1.0138678550720215}} 11/07/2021 01:44:28 - INFO - __main__ - Step 31845: {'lr': 0.00045149044800624135, 'samples': 6114240, 'steps': 31844, 'loss/train': 1.5553712844848633}} 11/07/2021 01:44:30 - INFO - __main__ - Step 31849: {'lr': 0.0004514778815979785, 'samples': 6115008, 'steps': 31848, 'loss/train': 1.764130711555481}3}} 11/07/2021 01:44:30 - INFO - __main__ - Step 31849: {'lr': 0.0004514778815979785, 'samples': 6115008, 'steps': 31848, 'loss/train': 1.764130711555481}3}} 11/07/2021 01:44:33 - INFO - __main__ - Step 31856: {'lr': 0.0004514558868884343, 'samples': 6116352, 'steps': 31855, 'loss/train': 1.3141847848892212}}} 11/07/2021 01:44:36 - INFO - __main__ - Step 31861: {'lr': 0.0004514401736584013, 'samples': 6117312, 'steps': 31860, 'loss/train': 1.288212537765503}}}} 11/07/2021 01:44:38 - INFO - __main__ - Step 31866: {'lr': 0.00045142445815922244, 'samples': 6118272, 'steps': 31865, 'loss/train': 1.5117849111557007}} 11/07/2021 01:44:40 - INFO - __main__ - Step 31870: {'lr': 0.0004514118841262133, 'samples': 6119040, 'steps': 31869, 'loss/train': 1.6058647632598877}}} 11/07/2021 01:44:40 - INFO - __main__ - Step 31870: {'lr': 0.0004514118841262133, 'samples': 6119040, 'steps': 31869, 'loss/train': 1.6058647632598877}}} 11/07/2021 01:44:44 - INFO - __main__ - Step 31877: {'lr': 0.00045138987607450803, 'samples': 6120384, 'steps': 31876, 'loss/train': 2.0093131065368652}} 11/07/2021 01:44:44 - INFO - __main__ - Step 31877: {'lr': 0.00045138987607450803, 'samples': 6120384, 'steps': 31876, 'loss/train': 2.0093131065368652}} 11/07/2021 01:44:48 - INFO - __main__ - Step 31885: {'lr': 0.00045136471857085435, 'samples': 6121920, 'steps': 31884, 'loss/train': 1.7443677186965942}} 11/07/2021 01:44:50 - INFO - __main__ - Step 31889: {'lr': 0.00045135213764141814, 'samples': 6122688, 'steps': 31888, 'loss/train': 1.505753993988037}}} 11/07/2021 01:44:52 - INFO - __main__ - Step 31893: {'lr': 0.0004513395552603633, 'samples': 6123456, 'steps': 31892, 'loss/train': 1.3275775909423828}}} 11/07/2021 01:44:54 - INFO - __main__ - Step 31898: {'lr': 0.0004513238252428442, 'samples': 6124416, 'steps': 31897, 'loss/train': 1.728275179862976}}}} 11/07/2021 01:44:56 - INFO - __main__ - Step 31902: {'lr': 0.00045131123959597905, 'samples': 6125184, 'steps': 31901, 'loss/train': 1.482000708580017}}} 11/07/2021 01:44:58 - INFO - __main__ - Step 31906: {'lr': 0.00045129865249779, 'samples': 6125952, 'steps': 31905, 'loss/train': 1.4602670669555664}7}}} 11/07/2021 01:45:00 - INFO - __main__ - Step 31910: {'lr': 0.00045128606394836805, 'samples': 6126720, 'steps': 31909, 'loss/train': 1.6861112117767334}} 11/07/2021 01:45:02 - INFO - __main__ - Step 31914: {'lr': 0.00045127347394780367, 'samples': 6127488, 'steps': 31913, 'loss/train': 1.6989363431930542}} 11/07/2021 01:45:04 - INFO - __main__ - Step 31919: {'lr': 0.00045125773440656756, 'samples': 6128448, 'steps': 31918, 'loss/train': 1.2627248764038086}} 11/07/2021 01:45:06 - INFO - __main__ - Step 31923: {'lr': 0.00045124514114126493, 'samples': 6129216, 'steps': 31922, 'loss/train': 1.5508592128753662}} 11/07/2021 01:45:06 - INFO - __main__ - Step 31923: {'lr': 0.00045124514114126493, 'samples': 6129216, 'steps': 31922, 'loss/train': 1.5508592128753662}} 11/07/2021 01:45:09 - INFO - __main__ - Step 31929: {'lr': 0.0004512262485230007, 'samples': 6130368, 'steps': 31928, 'loss/train': 1.0703651905059814}}} 11/07/2021 01:45:12 - INFO - __main__ - Step 31934: {'lr': 0.0004512105021810244, 'samples': 6131328, 'steps': 31933, 'loss/train': 1.6673365831375122}}} 11/07/2021 01:45:15 - INFO - __main__ - Step 31940: {'lr': 0.00045119160357881105, 'samples': 6132480, 'steps': 31939, 'loss/train': 1.8165699243545532}} 11/07/2021 01:45:15 - INFO - __main__ - Step 31940: {'lr': 0.00045119160357881105, 'samples': 6132480, 'steps': 31939, 'loss/train': 1.8165699243545532}} 11/07/2021 01:45:17 - INFO - __main__ - Step 31946: {'lr': 0.0004511727017130598, 'samples': 6133632, 'steps': 31945, 'loss/train': 1.8258341550827026}}} 11/07/2021 01:45:19 - INFO - __main__ - Step 31950: {'lr': 0.00045116009865630034, 'samples': 6134400, 'steps': 31949, 'loss/train': 1.8072712421417236}} 11/07/2021 01:45:22 - INFO - __main__ - Step 31955: {'lr': 0.00045114434279596994, 'samples': 6135360, 'steps': 31954, 'loss/train': 1.5438740253448486}} 11/07/2021 01:45:22 - INFO - __main__ - Step 31955: {'lr': 0.00045114434279596994, 'samples': 6135360, 'steps': 31954, 'loss/train': 1.5438740253448486}} 11/07/2021 01:45:26 - INFO - __main__ - Step 31963: {'lr': 0.0004511191287066232, 'samples': 6136896, 'steps': 31962, 'loss/train': 1.9909965991973877}}} 11/07/2021 01:45:28 - INFO - __main__ - Step 31967: {'lr': 0.0004511065194869961, 'samples': 6137664, 'steps': 31966, 'loss/train': 1.316340446472168}}}} 11/07/2021 01:45:30 - INFO - __main__ - Step 31971: {'lr': 0.0004510939088175211, 'samples': 6138432, 'steps': 31970, 'loss/train': 1.5126852989196777}}} 11/07/2021 01:45:30 - INFO - __main__ - Step 31971: {'lr': 0.0004510939088175211, 'samples': 6138432, 'steps': 31970, 'loss/train': 1.5126852989196777}}} 11/07/2021 01:45:34 - INFO - __main__ - Step 31979: {'lr': 0.00045106868312939116, 'samples': 6139968, 'steps': 31978, 'loss/train': 0.6979764103889465}} 11/07/2021 01:45:35 - INFO - __main__ - Step 31983: {'lr': 0.0004510560681109179, 'samples': 6140736, 'steps': 31982, 'loss/train': 1.8358553647994995}}} 11/07/2021 01:45:38 - INFO - __main__ - Step 31987: {'lr': 0.0004510434516429606, 'samples': 6141504, 'steps': 31986, 'loss/train': 3.6436564922332764}}} 11/07/2021 01:45:40 - INFO - __main__ - Step 31992: {'lr': 0.0004510276790198153, 'samples': 6142464, 'steps': 31991, 'loss/train': 1.7565659284591675}}} 11/07/2021 01:45:42 - INFO - __main__ - Step 31996: {'lr': 0.0004510150592908511, 'samples': 6143232, 'steps': 31995, 'loss/train': 0.27138158679008484}} 11/07/2021 01:45:44 - INFO - __main__ - Step 32000: {'lr': 0.00045100243811269834, 'samples': 6144000, 'steps': 31999, 'loss/train': 1.2587133646011353}} 11/07/2021 01:45:46 - INFO - __main__ - Step 32004: {'lr': 0.0004509898154854481, 'samples': 6144768, 'steps': 32003, 'loss/train': 1.133880615234375}3}} 11/07/2021 01:45:48 - INFO - __main__ - Step 32008: {'lr': 0.00045097719140919126, 'samples': 6145536, 'steps': 32007, 'loss/train': 1.929354190826416}}} 11/07/2021 01:45:50 - INFO - __main__ - Step 32013: {'lr': 0.0004509614092763434, 'samples': 6146496, 'steps': 32012, 'loss/train': 1.2444761991500854}}} 11/07/2021 01:45:50 - INFO - __main__ - Step 32013: {'lr': 0.0004509614092763434, 'samples': 6146496, 'steps': 32012, 'loss/train': 1.2444761991500854}}} 11/07/2021 01:45:54 - INFO - __main__ - Step 32020: {'lr': 0.00045093931048729156, 'samples': 6147840, 'steps': 32019, 'loss/train': 1.719070315361023}}} 11/07/2021 01:45:56 - INFO - __main__ - Step 32024: {'lr': 0.00045092668061591875, 'samples': 6148608, 'steps': 32023, 'loss/train': 1.3829171657562256}} 11/07/2021 01:45:58 - INFO - __main__ - Step 32029: {'lr': 0.00045091089123968796, 'samples': 6149568, 'steps': 32028, 'loss/train': 1.7335764169692993}} 11/07/2021 01:46:00 - INFO - __main__ - Step 32033: {'lr': 0.0004508982581092026, 'samples': 6150336, 'steps': 32032, 'loss/train': 0.9741845726966858}}} 11/07/2021 01:46:02 - INFO - __main__ - Step 32037: {'lr': 0.00045088562353037077, 'samples': 6151104, 'steps': 32036, 'loss/train': 1.8636360168457031}} 11/07/2021 01:46:02 - INFO - __main__ - Step 32037: {'lr': 0.00045088562353037077, 'samples': 6151104, 'steps': 32036, 'loss/train': 1.8636360168457031}} 11/07/2021 01:46:05 - INFO - __main__ - Step 32044: {'lr': 0.00045086350953260526, 'samples': 6152448, 'steps': 32043, 'loss/train': 1.8391896486282349}} 11/07/2021 01:46:08 - INFO - __main__ - Step 32049: {'lr': 0.00045084771110470717, 'samples': 6153408, 'steps': 32048, 'loss/train': 1.6490238904953003}} 11/07/2021 01:46:08 - INFO - __main__ - Step 32049: {'lr': 0.00045084771110470717, 'samples': 6153408, 'steps': 32048, 'loss/train': 1.6490238904953003}} 11/07/2021 01:46:12 - INFO - __main__ - Step 32057: {'lr': 0.0004508224289142026, 'samples': 6154944, 'steps': 32056, 'loss/train': 1.6833237409591675}}} 11/07/2021 01:46:14 - INFO - __main__ - Step 32061: {'lr': 0.00045080978564720505, 'samples': 6155712, 'steps': 32060, 'loss/train': 1.436179757118225}}} 11/07/2021 01:46:16 - INFO - __main__ - Step 32065: {'lr': 0.00045079714093249887, 'samples': 6156480, 'steps': 32064, 'loss/train': 1.9286205768585205}} 11/07/2021 01:46:18 - INFO - __main__ - Step 32070: {'lr': 0.0004507813330034147, 'samples': 6157440, 'steps': 32069, 'loss/train': 1.153349757194519}5}} 11/07/2021 01:46:20 - INFO - __main__ - Step 32074: {'lr': 0.0004507686850316973, 'samples': 6158208, 'steps': 32073, 'loss/train': 1.6692560911178589}}} 11/07/2021 01:46:22 - INFO - __main__ - Step 32078: {'lr': 0.0004507560356125676, 'samples': 6158976, 'steps': 32077, 'loss/train': 1.8561049699783325}}} 11/07/2021 01:46:24 - INFO - __main__ - Step 32082: {'lr': 0.00045074338474611683, 'samples': 6159744, 'steps': 32081, 'loss/train': 5.802849769592285}}} 11/07/2021 01:46:26 - INFO - __main__ - Step 32086: {'lr': 0.00045073073243243603, 'samples': 6160512, 'steps': 32085, 'loss/train': 1.0529669523239136}} 11/07/2021 01:46:28 - INFO - __main__ - Step 32091: {'lr': 0.00045071491500530694, 'samples': 6161472, 'steps': 32090, 'loss/train': 0.4279614984989166}} 11/07/2021 01:46:28 - INFO - __main__ - Step 32091: {'lr': 0.00045071491500530694, 'samples': 6161472, 'steps': 32090, 'loss/train': 0.4279614984989166}} 11/07/2021 01:46:33 - INFO - __main__ - Step 32098: {'lr': 0.00045069276680892624, 'samples': 6162816, 'steps': 32097, 'loss/train': 1.6708344221115112}} 11/07/2021 01:46:34 - INFO - __main__ - Step 32102: {'lr': 0.00045068010870723783, 'samples': 6163584, 'steps': 32101, 'loss/train': 1.115096092224121}}} 11/07/2021 01:46:36 - INFO - __main__ - Step 32107: {'lr': 0.0004506642840456126, 'samples': 6164544, 'steps': 32106, 'loss/train': 1.7105660438537598}}} 11/07/2021 01:46:38 - INFO - __main__ - Step 32111: {'lr': 0.00045065162268881164, 'samples': 6165312, 'steps': 32110, 'loss/train': 1.431797981262207}}} 11/07/2021 01:46:40 - INFO - __main__ - Step 32115: {'lr': 0.00045063895988544235, 'samples': 6166080, 'steps': 32114, 'loss/train': 1.5505790710449219}} 11/07/2021 01:46:40 - INFO - __main__ - Step 32115: {'lr': 0.00045063895988544235, 'samples': 6166080, 'steps': 32114, 'loss/train': 1.5505790710449219}} 11/07/2021 01:46:44 - INFO - __main__ - Step 32122: {'lr': 0.00045061679649901543, 'samples': 6167424, 'steps': 32121, 'loss/train': 1.6725994348526}19}} 11/07/2021 01:46:46 - INFO - __main__ - Step 32127: {'lr': 0.00045060096279683694, 'samples': 6168384, 'steps': 32126, 'loss/train': 1.4542152881622314}} 11/07/2021 01:46:48 - INFO - __main__ - Step 32132: {'lr': 0.00045058512683496607, 'samples': 6169344, 'steps': 32131, 'loss/train': 1.505821943283081}}} 11/07/2021 01:46:51 - INFO - __main__ - Step 32136: {'lr': 0.0004505724564386106, 'samples': 6170112, 'steps': 32135, 'loss/train': 1.8550945520401}81}}} 11/07/2021 01:46:53 - INFO - __main__ - Step 32140: {'lr': 0.0004505597845962575, 'samples': 6170880, 'steps': 32139, 'loss/train': 1.7972909212112427}}} 11/07/2021 01:46:55 - INFO - __main__ - Step 32144: {'lr': 0.00045054711130799806, 'samples': 6171648, 'steps': 32143, 'loss/train': 1.3463897705078125}} 11/07/2021 01:46:56 - INFO - __main__ - Step 32148: {'lr': 0.0004505344365739238, 'samples': 6172416, 'steps': 32147, 'loss/train': 1.6588155031204224}}} 11/07/2021 01:46:58 - INFO - __main__ - Step 32152: {'lr': 0.00045052176039412587, 'samples': 6173184, 'steps': 32151, 'loss/train': 1.3574669361114502}} 11/07/2021 01:47:01 - INFO - __main__ - Step 32157: {'lr': 0.0004505059131364689, 'samples': 6174144, 'steps': 32156, 'loss/train': 1.6518559455871582}}} 11/07/2021 01:47:03 - INFO - __main__ - Step 32161: {'lr': 0.00045049323370412723, 'samples': 6174912, 'steps': 32160, 'loss/train': 1.4618717432022095}} 11/07/2021 01:47:05 - INFO - __main__ - Step 32165: {'lr': 0.0004504805528263589, 'samples': 6175680, 'steps': 32164, 'loss/train': 1.6876972913742065}}} 11/07/2021 01:47:06 - INFO - __main__ - Step 32169: {'lr': 0.00045046787050325555, 'samples': 6176448, 'steps': 32168, 'loss/train': 0.7906864285469055}} 11/07/2021 01:47:08 - INFO - __main__ - Step 32173: {'lr': 0.0004504551867349085, 'samples': 6177216, 'steps': 32172, 'loss/train': 1.5836315155029297}}} 11/07/2021 01:47:10 - INFO - __main__ - Step 32177: {'lr': 0.0004504425015214092, 'samples': 6177984, 'steps': 32176, 'loss/train': 1.4530431032180786}}} 11/07/2021 01:47:13 - INFO - __main__ - Step 32181: {'lr': 0.0004504298148628492, 'samples': 6178752, 'steps': 32180, 'loss/train': 0.48151570558547974}} 11/07/2021 01:47:14 - INFO - __main__ - Step 32185: {'lr': 0.00045041712675931983, 'samples': 6179520, 'steps': 32184, 'loss/train': 1.6092675924301147}} 11/07/2021 01:47:16 - INFO - __main__ - Step 32189: {'lr': 0.00045040443721091266, 'samples': 6180288, 'steps': 32188, 'loss/train': 1.3511254787445068}} 11/07/2021 01:47:18 - INFO - __main__ - Step 32194: {'lr': 0.00045038857324368367, 'samples': 6181248, 'steps': 32193, 'loss/train': 1.5134190320968628}} 11/07/2021 01:47:18 - INFO - __main__ - Step 32194: {'lr': 0.00045038857324368367, 'samples': 6181248, 'steps': 32193, 'loss/train': 1.5134190320968628}} 11/07/2021 01:47:22 - INFO - __main__ - Step 32201: {'lr': 0.00045036635989733904, 'samples': 6182592, 'steps': 32200, 'loss/train': 0.16695256531238556} 11/07/2021 01:47:24 - INFO - __main__ - Step 32206: {'lr': 0.00045035049051289037, 'samples': 6183552, 'steps': 32205, 'loss/train': 1.389227032661438}6} 11/07/2021 01:47:24 - INFO - __main__ - Step 32206: {'lr': 0.00045035049051289037, 'samples': 6183552, 'steps': 32205, 'loss/train': 1.389227032661438}6} 11/07/2021 01:47:28 - INFO - __main__ - Step 32213: {'lr': 0.0004503282695831589, 'samples': 6184896, 'steps': 32212, 'loss/train': 2.376875400543213}}6} 11/07/2021 01:47:30 - INFO - __main__ - Step 32217: {'lr': 0.000450315569923169, 'samples': 6185664, 'steps': 32216, 'loss/train': 1.9647176265716553}}6} 11/07/2021 01:47:32 - INFO - __main__ - Step 32222: {'lr': 0.00045029969331736254, 'samples': 6186624, 'steps': 32221, 'loss/train': 1.9964864253997803}} 11/07/2021 01:47:34 - INFO - __main__ - Step 32226: {'lr': 0.0004502869904081736, 'samples': 6187392, 'steps': 32225, 'loss/train': 1.5956135988235474}}} 11/07/2021 01:47:36 - INFO - __main__ - Step 32230: {'lr': 0.00045027428605504507, 'samples': 6188160, 'steps': 32229, 'loss/train': 1.6560455560684204}} 11/07/2021 01:47:38 - INFO - __main__ - Step 32234: {'lr': 0.0004502615802580685, 'samples': 6188928, 'steps': 32233, 'loss/train': 1.5005950927734375}}} 11/07/2021 01:47:40 - INFO - __main__ - Step 32238: {'lr': 0.00045024887301733555, 'samples': 6189696, 'steps': 32237, 'loss/train': 1.6385005712509155}} 11/07/2021 01:47:42 - INFO - __main__ - Step 32242: {'lr': 0.00045023616433293763, 'samples': 6190464, 'steps': 32241, 'loss/train': 1.55082106590271}5}} 11/07/2021 01:47:44 - INFO - __main__ - Step 32247: {'lr': 0.00045022027644742624, 'samples': 6191424, 'steps': 32246, 'loss/train': 1.9687552452087402}} 11/07/2021 01:47:47 - INFO - __main__ - Step 32251: {'lr': 0.0004502075645151175, 'samples': 6192192, 'steps': 32250, 'loss/train': 1.9542632102966309}}} 11/07/2021 01:47:48 - INFO - __main__ - Step 32255: {'lr': 0.0004501948511394417, 'samples': 6192960, 'steps': 32254, 'loss/train': 1.3751741647720337}}} 11/07/2021 01:47:50 - INFO - __main__ - Step 32259: {'lr': 0.0004501821363204906, 'samples': 6193728, 'steps': 32258, 'loss/train': 1.2249864339828491}}} 11/07/2021 01:47:53 - INFO - __main__ - Step 32264: {'lr': 0.0004501662407673354, 'samples': 6194688, 'steps': 32263, 'loss/train': 1.338904857635498}}}} 11/07/2021 01:47:55 - INFO - __main__ - Step 32268: {'lr': 0.0004501535227013498, 'samples': 6195456, 'steps': 32267, 'loss/train': 1.4296706914901733}}} 11/07/2021 01:47:57 - INFO - __main__ - Step 32272: {'lr': 0.00045014080319238686, 'samples': 6196224, 'steps': 32271, 'loss/train': 1.12027907371521}}}} 11/07/2021 01:47:59 - INFO - __main__ - Step 32276: {'lr': 0.0004501280822405382, 'samples': 6196992, 'steps': 32275, 'loss/train': 1.6742898225784302}}} 11/07/2021 01:48:00 - INFO - __main__ - Step 32280: {'lr': 0.00045011535984589544, 'samples': 6197760, 'steps': 32279, 'loss/train': 1.3071985244750977}} 11/07/2021 01:48:02 - INFO - __main__ - Step 32284: {'lr': 0.0004501026360085505, 'samples': 6198528, 'steps': 32283, 'loss/train': 1.030112624168396}7}} 11/07/2021 01:48:02 - INFO - __main__ - Step 32284: {'lr': 0.0004501026360085505, 'samples': 6198528, 'steps': 32283, 'loss/train': 1.030112624168396}7}} 11/07/2021 01:48:07 - INFO - __main__ - Step 32292: {'lr': 0.0004500771840061206, 'samples': 6200064, 'steps': 32291, 'loss/train': 1.9112358093261719}}} 11/07/2021 01:48:08 - INFO - __main__ - Step 32296: {'lr': 0.00045006445584121923, 'samples': 6200832, 'steps': 32295, 'loss/train': 1.9976251125335693}} 11/07/2021 01:48:10 - INFO - __main__ - Step 32300: {'lr': 0.0004500517262339825, 'samples': 6201600, 'steps': 32299, 'loss/train': 1.860845685005188}3}} 11/07/2021 01:48:13 - INFO - __main__ - Step 32305: {'lr': 0.00045003581219679235, 'samples': 6202560, 'steps': 32304, 'loss/train': 1.3765199184417725}} 11/07/2021 01:48:13 - INFO - __main__ - Step 32305: {'lr': 0.00045003581219679235, 'samples': 6202560, 'steps': 32304, 'loss/train': 1.3765199184417725}} 11/07/2021 01:48:17 - INFO - __main__ - Step 32313: {'lr': 0.00045001034505044415, 'samples': 6204096, 'steps': 32312, 'loss/train': 1.686047077178955}}} 11/07/2021 01:48:19 - INFO - __main__ - Step 32317: {'lr': 0.0004499976093143063, 'samples': 6204864, 'steps': 32316, 'loss/train': 1.5840280055999756}}} 11/07/2021 01:48:20 - INFO - __main__ - Step 32321: {'lr': 0.0004499848721363151, 'samples': 6205632, 'steps': 32320, 'loss/train': 2.0209126472473145}}} 11/07/2021 01:48:23 - INFO - __main__ - Step 32326: {'lr': 0.00044996894863635965, 'samples': 6206592, 'steps': 32325, 'loss/train': 1.345014214515686}}} 11/07/2021 01:48:25 - INFO - __main__ - Step 32330: {'lr': 0.00044995620821453416, 'samples': 6207360, 'steps': 32329, 'loss/train': 1.9484862089157104}} 11/07/2021 01:48:25 - INFO - __main__ - Step 32330: {'lr': 0.00044995620821453416, 'samples': 6207360, 'steps': 32329, 'loss/train': 1.9484862089157104}} 11/07/2021 01:48:28 - INFO - __main__ - Step 32337: {'lr': 0.0004499339090076532, 'samples': 6208704, 'steps': 32336, 'loss/train': 2.068150520324707}4}} 11/07/2021 01:48:31 - INFO - __main__ - Step 32342: {'lr': 0.00044991797830009543, 'samples': 6209664, 'steps': 32341, 'loss/train': 2.685377597808838}}} 11/07/2021 01:48:33 - INFO - __main__ - Step 32347: {'lr': 0.0004499020453405388, 'samples': 6210624, 'steps': 32346, 'loss/train': 1.9387210607528687}}} 11/07/2021 01:48:35 - INFO - __main__ - Step 32351: {'lr': 0.000449889297351575, 'samples': 6211392, 'steps': 32350, 'loss/train': 1.0773181915283203}}}} 11/07/2021 01:48:37 - INFO - __main__ - Step 32355: {'lr': 0.00044987654792153853, 'samples': 6212160, 'steps': 32354, 'loss/train': 1.4395281076431274}} 11/07/2021 01:48:39 - INFO - __main__ - Step 32359: {'lr': 0.0004498637970505215, 'samples': 6212928, 'steps': 32358, 'loss/train': 1.9765815734863281}}} 11/07/2021 01:48:41 - INFO - __main__ - Step 32363: {'lr': 0.00044985104473861583, 'samples': 6213696, 'steps': 32362, 'loss/train': 1.6762796640396118}} 11/07/2021 01:48:43 - INFO - __main__ - Step 32368: {'lr': 0.00044983510232262405, 'samples': 6214656, 'steps': 32367, 'loss/train': 1.2581473588943481}} 11/07/2021 01:48:45 - INFO - __main__ - Step 32372: {'lr': 0.0004498223467690549, 'samples': 6215424, 'steps': 32371, 'loss/train': 1.257175087928772}1}} 11/07/2021 01:48:47 - INFO - __main__ - Step 32376: {'lr': 0.00044980958977489593, 'samples': 6216192, 'steps': 32375, 'loss/train': 1.2944344282150269}} 11/07/2021 01:48:47 - INFO - __main__ - Step 32376: {'lr': 0.00044980958977489593, 'samples': 6216192, 'steps': 32375, 'loss/train': 1.2944344282150269}} 11/07/2021 01:48:51 - INFO - __main__ - Step 32383: {'lr': 0.0004497872615689751, 'samples': 6217536, 'steps': 32382, 'loss/train': 0.8333058953285217}}} 11/07/2021 01:48:53 - INFO - __main__ - Step 32388: {'lr': 0.00044977131014979974, 'samples': 6218496, 'steps': 32387, 'loss/train': 1.2199071645736694}} 11/07/2021 01:48:53 - INFO - __main__ - Step 32388: {'lr': 0.00044977131014979974, 'samples': 6218496, 'steps': 32387, 'loss/train': 1.2199071645736694}} 11/07/2021 01:48:57 - INFO - __main__ - Step 32396: {'lr': 0.0004497457831984727, 'samples': 6220032, 'steps': 32395, 'loss/train': 1.5898518562316895}}} 11/07/2021 01:48:59 - INFO - __main__ - Step 32400: {'lr': 0.00044973301756270635, 'samples': 6220800, 'steps': 32399, 'loss/train': 1.4757723808288574}} 11/07/2021 01:49:01 - INFO - __main__ - Step 32404: {'lr': 0.0004497202504869941, 'samples': 6221568, 'steps': 32403, 'loss/train': 1.795836091041565}4}} 11/07/2021 01:49:03 - INFO - __main__ - Step 32409: {'lr': 0.00044970428961757026, 'samples': 6222528, 'steps': 32408, 'loss/train': 1.5618425607681274}} 11/07/2021 01:49:06 - INFO - __main__ - Step 32414: {'lr': 0.00044968832649855455, 'samples': 6223488, 'steps': 32413, 'loss/train': 1.6851338148117065}} 11/07/2021 01:49:06 - INFO - __main__ - Step 32414: {'lr': 0.00044968832649855455, 'samples': 6223488, 'steps': 32413, 'loss/train': 1.6851338148117065}} 11/07/2021 01:49:09 - INFO - __main__ - Step 32421: {'lr': 0.0004496659743529608, 'samples': 6224832, 'steps': 32420, 'loss/train': 1.5266687870025635}}} 11/07/2021 01:49:11 - INFO - __main__ - Step 32425: {'lr': 0.0004496531997190432, 'samples': 6225600, 'steps': 32424, 'loss/train': 1.8037725687026978}}} 11/07/2021 01:49:11 - INFO - __main__ - Step 32425: {'lr': 0.0004496531997190432, 'samples': 6225600, 'steps': 32424, 'loss/train': 1.8037725687026978}}} 11/07/2021 01:49:15 - INFO - __main__ - Step 32432: {'lr': 0.00044963084064625775, 'samples': 6226944, 'steps': 32431, 'loss/train': 1.677183747291565}}} 11/07/2021 01:49:16 - INFO - __main__ - Step 32436: {'lr': 0.0004496180620542931, 'samples': 6227712, 'steps': 32435, 'loss/train': 1.2223323583602905}}} 11/07/2021 01:49:18 - INFO - __main__ - Step 32440: {'lr': 0.00044960528202321143, 'samples': 6228480, 'steps': 32439, 'loss/train': 1.3973376750946045}} 11/07/2021 01:49:18 - INFO - __main__ - Step 32440: {'lr': 0.00044960528202321143, 'samples': 6228480, 'steps': 32439, 'loss/train': 1.3973376750946045}} 11/07/2021 01:49:23 - INFO - __main__ - Step 32449: {'lr': 0.000449576521691983, 'samples': 6230208, 'steps': 32448, 'loss/train': 1.449952244758606}45}} 11/07/2021 01:49:25 - INFO - __main__ - Step 32453: {'lr': 0.0004495637369844071, 'samples': 6230976, 'steps': 32452, 'loss/train': 1.0895562171936035}}} 11/07/2021 01:49:26 - INFO - __main__ - Step 32457: {'lr': 0.0004495509508381058, 'samples': 6231744, 'steps': 32456, 'loss/train': 1.1795073747634888}}} 11/07/2021 01:49:28 - INFO - __main__ - Step 32461: {'lr': 0.00044953816325317116, 'samples': 6232512, 'steps': 32460, 'loss/train': 1.4539257287979126}} 11/07/2021 01:49:31 - INFO - __main__ - Step 32466: {'lr': 0.0004495221767490653, 'samples': 6233472, 'steps': 32465, 'loss/train': 1.3486028909683228}}} 11/07/2021 01:49:31 - INFO - __main__ - Step 32466: {'lr': 0.0004495221767490653, 'samples': 6233472, 'steps': 32465, 'loss/train': 1.3486028909683228}}} 11/07/2021 01:49:35 - INFO - __main__ - Step 32474: {'lr': 0.00044949659366768697, 'samples': 6235008, 'steps': 32473, 'loss/train': 1.6390635967254639}} 11/07/2021 01:49:37 - INFO - __main__ - Step 32478: {'lr': 0.00044948379996958963, 'samples': 6235776, 'steps': 32477, 'loss/train': 1.706484317779541}}} 11/07/2021 01:49:39 - INFO - __main__ - Step 32482: {'lr': 0.00044947100483334315, 'samples': 6236544, 'steps': 32481, 'loss/train': 1.956660509109497}}} 11/07/2021 01:49:41 - INFO - __main__ - Step 32487: {'lr': 0.0004494550088907783, 'samples': 6237504, 'steps': 32486, 'loss/train': 1.5964031219482422}}} 11/07/2021 01:49:41 - INFO - __main__ - Step 32487: {'lr': 0.0004494550088907783, 'samples': 6237504, 'steps': 32486, 'loss/train': 1.5964031219482422}}} 11/07/2021 01:49:44 - INFO - __main__ - Step 32494: {'lr': 0.0004494326107966311, 'samples': 6238848, 'steps': 32493, 'loss/train': 1.3394577503204346}}} 11/07/2021 01:49:47 - INFO - __main__ - Step 32498: {'lr': 0.0004494198099087106, 'samples': 6239616, 'steps': 32497, 'loss/train': 1.2754056453704834}}} 11/07/2021 01:49:49 - INFO - __main__ - Step 32503: {'lr': 0.00044940380677707214, 'samples': 6240576, 'steps': 32502, 'loss/train': 0.6593044400215149}} 11/07/2021 01:49:51 - INFO - __main__ - Step 32508: {'lr': 0.0004493878013992268, 'samples': 6241536, 'steps': 32507, 'loss/train': 1.458335518836975}9}} 11/07/2021 01:49:53 - INFO - __main__ - Step 32512: {'lr': 0.00044937499547980265, 'samples': 6242304, 'steps': 32511, 'loss/train': 1.2921862602233887}} 11/07/2021 01:49:55 - INFO - __main__ - Step 32516: {'lr': 0.0004493621881230138, 'samples': 6243072, 'steps': 32515, 'loss/train': 1.0435876846313477}}} 11/07/2021 01:49:55 - INFO - __main__ - Step 32516: {'lr': 0.0004493621881230138, 'samples': 6243072, 'steps': 32515, 'loss/train': 1.0435876846313477}}} 11/07/2021 01:49:59 - INFO - __main__ - Step 32523: {'lr': 0.0004493397717902521, 'samples': 6244416, 'steps': 32522, 'loss/train': 1.3674840927124023}}} 11/07/2021 01:50:01 - INFO - __main__ - Step 32528: {'lr': 0.000449323757429382, 'samples': 6245376, 'steps': 32527, 'loss/train': 1.4659003019332886}}}} 11/07/2021 01:50:01 - INFO - __main__ - Step 32528: {'lr': 0.000449323757429382, 'samples': 6245376, 'steps': 32527, 'loss/train': 1.4659003019332886}}}} 11/07/2021 01:50:01 - INFO - __main__ - Step 32528: {'lr': 0.000449323757429382, 'samples': 6245376, 'steps': 32527, 'loss/train': 1.4659003019332886}}}} 11/07/2021 01:50:07 - INFO - __main__ - Step 32539: {'lr': 0.00044928851793224765, 'samples': 6247488, 'steps': 32538, 'loss/train': 1.605699896812439}}} 11/07/2021 01:50:07 - INFO - __main__ - Step 32539: {'lr': 0.00044928851793224765, 'samples': 6247488, 'steps': 32538, 'loss/train': 1.605699896812439}}} 11/07/2021 01:50:11 - INFO - __main__ - Step 32546: {'lr': 0.00044926608714041763, 'samples': 6248832, 'steps': 32545, 'loss/train': 1.8412197828292847}} 11/07/2021 01:50:13 - INFO - __main__ - Step 32551: {'lr': 0.00044925006245263757, 'samples': 6249792, 'steps': 32550, 'loss/train': 1.6064367294311523}} 11/07/2021 01:50:13 - INFO - __main__ - Step 32551: {'lr': 0.00044925006245263757, 'samples': 6249792, 'steps': 32550, 'loss/train': 1.6064367294311523}} 11/07/2021 01:50:17 - INFO - __main__ - Step 32559: {'lr': 0.0004492244182837565, 'samples': 6251328, 'steps': 32558, 'loss/train': 1.5362626314163208}}} 11/07/2021 01:50:19 - INFO - __main__ - Step 32563: {'lr': 0.000449211594044851, 'samples': 6252096, 'steps': 32562, 'loss/train': 1.7514045238494873}}}} 11/07/2021 01:50:21 - INFO - __main__ - Step 32567: {'lr': 0.00044919876836975876, 'samples': 6252864, 'steps': 32566, 'loss/train': 1.298283338546753}}} 11/07/2021 01:50:23 - INFO - __main__ - Step 32572: {'lr': 0.0004491827342563968, 'samples': 6253824, 'steps': 32571, 'loss/train': 1.350450038909912}}}} 11/07/2021 01:50:25 - INFO - __main__ - Step 32576: {'lr': 0.00044916990535022244, 'samples': 6254592, 'steps': 32575, 'loss/train': 1.6362158060073853}} 11/07/2021 01:50:28 - INFO - __main__ - Step 32580: {'lr': 0.00044915707500816206, 'samples': 6255360, 'steps': 32579, 'loss/train': 1.317042589187622}}} 11/07/2021 01:50:29 - INFO - __main__ - Step 32584: {'lr': 0.0004491442432303079, 'samples': 6256128, 'steps': 32583, 'loss/train': 1.6612629890441895}}} 11/07/2021 01:50:31 - INFO - __main__ - Step 32588: {'lr': 0.0004491314100167526, 'samples': 6256896, 'steps': 32587, 'loss/train': 1.5734999179840088}}} 11/07/2021 01:50:34 - INFO - __main__ - Step 32593: {'lr': 0.0004491153664809947, 'samples': 6257856, 'steps': 32592, 'loss/train': 2.1112542152404785}}} 11/07/2021 01:50:34 - INFO - __main__ - Step 32593: {'lr': 0.0004491153664809947, 'samples': 6257856, 'steps': 32592, 'loss/train': 2.1112542152404785}}} 11/07/2021 01:50:38 - INFO - __main__ - Step 32600: {'lr': 0.00044909290176280495, 'samples': 6259200, 'steps': 32599, 'loss/train': 0.8739935159683228}} 11/07/2021 01:50:39 - INFO - __main__ - Step 32604: {'lr': 0.0004490800628073703, 'samples': 6259968, 'steps': 32603, 'loss/train': 0.8565096855163574}}} 11/07/2021 01:50:41 - INFO - __main__ - Step 32608: {'lr': 0.0004490672224166972, 'samples': 6260736, 'steps': 32607, 'loss/train': 1.7255065441131592}}} 11/07/2021 01:50:44 - INFO - __main__ - Step 32613: {'lr': 0.00044905116991019264, 'samples': 6261696, 'steps': 32612, 'loss/train': 1.5295449495315552}} 11/07/2021 01:50:46 - INFO - __main__ - Step 32617: {'lr': 0.0004490383262905714, 'samples': 6262464, 'steps': 32616, 'loss/train': 1.5342575311660767}}} 11/07/2021 01:50:46 - INFO - __main__ - Step 32617: {'lr': 0.0004490383262905714, 'samples': 6262464, 'steps': 32616, 'loss/train': 1.5342575311660767}}} 11/07/2021 01:50:49 - INFO - __main__ - Step 32624: {'lr': 0.00044901584650347147, 'samples': 6263808, 'steps': 32623, 'loss/train': 1.515797734260559}}} 11/07/2021 01:50:51 - INFO - __main__ - Step 32629: {'lr': 0.0004489997868224528, 'samples': 6264768, 'steps': 32628, 'loss/train': 1.539214015007019}}}} 11/07/2021 01:50:51 - INFO - __main__ - Step 32629: {'lr': 0.0004489997868224528, 'samples': 6264768, 'steps': 32628, 'loss/train': 1.539214015007019}}}} 11/07/2021 01:50:56 - INFO - __main__ - Step 32637: {'lr': 0.00044897408667025397, 'samples': 6266304, 'steps': 32636, 'loss/train': 1.4235544204711914}} 11/07/2021 01:50:56 - INFO - __main__ - Step 32637: {'lr': 0.00044897408667025397, 'samples': 6266304, 'steps': 32636, 'loss/train': 1.4235544204711914}} 11/07/2021 01:50:59 - INFO - __main__ - Step 32644: {'lr': 0.0004489515943301854, 'samples': 6267648, 'steps': 32643, 'loss/train': 1.586187720298767}4}} 11/07/2021 01:51:02 - INFO - __main__ - Step 32649: {'lr': 0.00044893552568362903, 'samples': 6268608, 'steps': 32648, 'loss/train': 1.0430372953414917}} 11/07/2021 01:51:02 - INFO - __main__ - Step 32649: {'lr': 0.00044893552568362903, 'samples': 6268608, 'steps': 32648, 'loss/train': 1.0430372953414917}} 11/07/2021 01:51:06 - INFO - __main__ - Step 32657: {'lr': 0.00044890981118807585, 'samples': 6270144, 'steps': 32656, 'loss/train': 2.038282871246338}}} 11/07/2021 01:51:07 - INFO - __main__ - Step 32661: {'lr': 0.0004488969517892363, 'samples': 6270912, 'steps': 32660, 'loss/train': 0.7900300025939941}}} 11/07/2021 01:51:09 - INFO - __main__ - Step 32665: {'lr': 0.00044888409095647833, 'samples': 6271680, 'steps': 32664, 'loss/train': 1.3088997602462769}} 11/07/2021 01:51:12 - INFO - __main__ - Step 32670: {'lr': 0.0004488680128992244, 'samples': 6272640, 'steps': 32669, 'loss/train': 1.103938341140747}9}} 11/07/2021 01:51:12 - INFO - __main__ - Step 32670: {'lr': 0.0004488680128992244, 'samples': 6272640, 'steps': 32669, 'loss/train': 1.103938341140747}9}} 11/07/2021 01:51:16 - INFO - __main__ - Step 32677: {'lr': 0.00044884549985562165, 'samples': 6273984, 'steps': 32676, 'loss/train': 1.7669689655303955}} 11/07/2021 01:51:17 - INFO - __main__ - Step 32681: {'lr': 0.0004488326332881175, 'samples': 6274752, 'steps': 32680, 'loss/train': 1.354612946510315}5}} 11/07/2021 01:51:20 - INFO - __main__ - Step 32686: {'lr': 0.0004488165480629527, 'samples': 6275712, 'steps': 32685, 'loss/train': 1.4111723899841309}}} 11/07/2021 01:51:22 - INFO - __main__ - Step 32691: {'lr': 0.00044880046059819615, 'samples': 6276672, 'steps': 32690, 'loss/train': 1.3003720045089722}} 11/07/2021 01:51:24 - INFO - __main__ - Step 32695: {'lr': 0.00044878758901400665, 'samples': 6277440, 'steps': 32694, 'loss/train': 1.5347445011138916}} 11/07/2021 01:51:24 - INFO - __main__ - Step 32695: {'lr': 0.00044878758901400665, 'samples': 6277440, 'steps': 32694, 'loss/train': 1.5347445011138916}} 11/07/2021 01:51:27 - INFO - __main__ - Step 32702: {'lr': 0.0004487650602932619, 'samples': 6278784, 'steps': 32701, 'loss/train': 1.208509087562561}6}} 11/07/2021 01:51:29 - INFO - __main__ - Step 32706: {'lr': 0.00044875218476818845, 'samples': 6279552, 'steps': 32705, 'loss/train': 0.7631714940071106}} 11/07/2021 01:51:32 - INFO - __main__ - Step 32711: {'lr': 0.00044873608834687754, 'samples': 6280512, 'steps': 32710, 'loss/train': 0.9771917462348938}} 11/07/2021 01:51:32 - INFO - __main__ - Step 32711: {'lr': 0.00044873608834687754, 'samples': 6280512, 'steps': 32710, 'loss/train': 0.9771917462348938}} 11/07/2021 01:51:35 - INFO - __main__ - Step 32718: {'lr': 0.00044871354959609135, 'samples': 6281856, 'steps': 32717, 'loss/train': 0.7198702096939087}} 11/07/2021 01:51:38 - INFO - __main__ - Step 32723: {'lr': 0.0004486974478022402, 'samples': 6282816, 'steps': 32722, 'loss/train': 1.418270230293274}7}} 11/07/2021 01:51:40 - INFO - __main__ - Step 32728: {'lr': 0.0004486813437701389, 'samples': 6283776, 'steps': 32727, 'loss/train': 1.4692602157592773}}} 11/07/2021 01:51:40 - INFO - __main__ - Step 32728: {'lr': 0.0004486813437701389, 'samples': 6283776, 'steps': 32727, 'loss/train': 1.4692602157592773}}} 11/07/2021 01:51:43 - INFO - __main__ - Step 32735: {'lr': 0.0004486587943652823, 'samples': 6285120, 'steps': 32734, 'loss/train': 0.731379508972168}}}} 11/07/2021 01:51:45 - INFO - __main__ - Step 32739: {'lr': 0.00044864590702176977, 'samples': 6285888, 'steps': 32738, 'loss/train': 2.4802517890930176}} 11/07/2021 01:51:48 - INFO - __main__ - Step 32744: {'lr': 0.0004486297958284874, 'samples': 6286848, 'steps': 32743, 'loss/train': 1.754269003868103}6}} 11/07/2021 01:51:50 - INFO - __main__ - Step 32749: {'lr': 0.00044861368239771694, 'samples': 6287808, 'steps': 32748, 'loss/train': 1.5400633811950684}} 11/07/2021 01:51:50 - INFO - __main__ - Step 32749: {'lr': 0.00044861368239771694, 'samples': 6287808, 'steps': 32748, 'loss/train': 1.5400633811950684}} 11/07/2021 01:51:53 - INFO - __main__ - Step 32756: {'lr': 0.0004485911198360041, 'samples': 6289152, 'steps': 32755, 'loss/train': 1.0894807577133179}}} 11/07/2021 01:51:56 - INFO - __main__ - Step 32760: {'lr': 0.0004485782249749587, 'samples': 6289920, 'steps': 32759, 'loss/train': 1.6369692087173462}}} 11/07/2021 01:51:58 - INFO - __main__ - Step 32764: {'lr': 0.0004485653286822927, 'samples': 6290688, 'steps': 32763, 'loss/train': 1.508261799812317}}}} 11/07/2021 01:51:58 - INFO - __main__ - Step 32764: {'lr': 0.0004485653286822927, 'samples': 6290688, 'steps': 32763, 'loss/train': 1.508261799812317}}}} 11/07/2021 01:52:02 - INFO - __main__ - Step 32771: {'lr': 0.0004485427567255701, 'samples': 6292032, 'steps': 32770, 'loss/train': 1.6293174028396606}}} 11/07/2021 01:52:02 - INFO - __main__ - Step 32771: {'lr': 0.0004485427567255701, 'samples': 6292032, 'steps': 32770, 'loss/train': 1.6293174028396606}}} 11/07/2021 01:52:02 - INFO - __main__ - Step 32771: {'lr': 0.0004485427567255701, 'samples': 6292032, 'steps': 32770, 'loss/train': 1.6293174028396606}}} 11/07/2021 01:52:08 - INFO - __main__ - Step 32781: {'lr': 0.00044851050346910706, 'samples': 6293952, 'steps': 32780, 'loss/train': 1.8087482452392578}} 11/07/2021 01:52:10 - INFO - __main__ - Step 32785: {'lr': 0.0004484975996619589, 'samples': 6294720, 'steps': 32784, 'loss/train': 1.4865195751190186}}} 11/07/2021 01:52:12 - INFO - __main__ - Step 32789: {'lr': 0.0004484846944237714, 'samples': 6295488, 'steps': 32788, 'loss/train': 1.3773404359817505}}} 11/07/2021 01:52:13 - INFO - __main__ - Step 32793: {'lr': 0.0004484717877546377, 'samples': 6296256, 'steps': 32792, 'loss/train': 1.530842900276184}}}} 11/07/2021 01:52:15 - INFO - __main__ - Step 32797: {'lr': 0.00044845887965465076, 'samples': 6297024, 'steps': 32796, 'loss/train': 0.6192265748977661}} 11/07/2021 01:52:18 - INFO - __main__ - Step 32803: {'lr': 0.0004484395148220243, 'samples': 6298176, 'steps': 32802, 'loss/train': 1.4404723644256592}}} 11/07/2021 01:52:20 - INFO - __main__ - Step 32807: {'lr': 0.00044842660314531145, 'samples': 6298944, 'steps': 32806, 'loss/train': 1.6565542221069336}} 11/07/2021 01:52:22 - INFO - __main__ - Step 32811: {'lr': 0.0004484136900380713, 'samples': 6299712, 'steps': 32810, 'loss/train': 1.570265531539917}6}} 11/07/2021 01:52:22 - INFO - __main__ - Step 32811: {'lr': 0.0004484136900380713, 'samples': 6299712, 'steps': 32810, 'loss/train': 1.570265531539917}6}} 11/07/2021 01:52:25 - INFO - __main__ - Step 32818: {'lr': 0.0004483910886584743, 'samples': 6301056, 'steps': 32817, 'loss/train': 1.8153525590896606}}} 11/07/2021 01:52:28 - INFO - __main__ - Step 32823: {'lr': 0.000448374942134117, 'samples': 6302016, 'steps': 32822, 'loss/train': 1.758912205696106}6}}} 11/07/2021 01:52:30 - INFO - __main__ - Step 32828: {'lr': 0.00044835879337514254, 'samples': 6302976, 'steps': 32827, 'loss/train': 1.0733113288879395}} 11/07/2021 01:52:30 - INFO - __main__ - Step 32828: {'lr': 0.00044835879337514254, 'samples': 6302976, 'steps': 32827, 'loss/train': 1.0733113288879395}} 11/07/2021 01:52:33 - INFO - __main__ - Step 32834: {'lr': 0.00044833941191493463, 'samples': 6304128, 'steps': 32833, 'loss/train': 1.464808464050293}}} 11/07/2021 01:52:36 - INFO - __main__ - Step 32839: {'lr': 0.00044832325824044274, 'samples': 6305088, 'steps': 32838, 'loss/train': 1.810701847076416}}} 11/07/2021 01:52:38 - INFO - __main__ - Step 32844: {'lr': 0.00044830710233191573, 'samples': 6306048, 'steps': 32843, 'loss/train': 1.419494867324829}}} 11/07/2021 01:52:38 - INFO - __main__ - Step 32844: {'lr': 0.00044830710233191573, 'samples': 6306048, 'steps': 32843, 'loss/train': 1.419494867324829}}} 11/07/2021 01:52:38 - INFO - __main__ - Step 32844: {'lr': 0.00044830710233191573, 'samples': 6306048, 'steps': 32843, 'loss/train': 1.419494867324829}}} 11/07/2021 01:52:44 - INFO - __main__ - Step 32854: {'lr': 0.00044827478381348495, 'samples': 6307968, 'steps': 32853, 'loss/train': 0.8628458976745605}} 11/07/2021 01:52:45 - INFO - __main__ - Step 32858: {'lr': 0.0004482618539045234, 'samples': 6308736, 'steps': 32857, 'loss/train': 1.3535361289978027}}} 11/07/2021 01:52:47 - INFO - __main__ - Step 32862: {'lr': 0.0004482489225662222, 'samples': 6309504, 'steps': 32861, 'loss/train': 1.0501551628112793}}} 11/07/2021 01:52:50 - INFO - __main__ - Step 32867: {'lr': 0.0004482327563834787, 'samples': 6310464, 'steps': 32866, 'loss/train': 2.1392643451690674}}} 11/07/2021 01:52:52 - INFO - __main__ - Step 32872: {'lr': 0.0004482165879677197, 'samples': 6311424, 'steps': 32871, 'loss/train': 1.16120183467865}4}}} 11/07/2021 01:52:54 - INFO - __main__ - Step 32876: {'lr': 0.00044820365162746373, 'samples': 6312192, 'steps': 32875, 'loss/train': 1.523007869720459}}} 11/07/2021 01:52:54 - INFO - __main__ - Step 32876: {'lr': 0.00044820365162746373, 'samples': 6312192, 'steps': 32875, 'loss/train': 1.523007869720459}}} 11/07/2021 01:52:58 - INFO - __main__ - Step 32883: {'lr': 0.0004481810095937329, 'samples': 6313536, 'steps': 32882, 'loss/train': 1.4392449855804443}}} 11/07/2021 01:53:00 - INFO - __main__ - Step 32888: {'lr': 0.0004481648340335482, 'samples': 6314496, 'steps': 32887, 'loss/train': 1.4408528804779053}}} 11/07/2021 01:53:00 - INFO - __main__ - Step 32888: {'lr': 0.0004481648340335482, 'samples': 6314496, 'steps': 32887, 'loss/train': 1.4408528804779053}}} 11/07/2021 01:53:04 - INFO - __main__ - Step 32896: {'lr': 0.00044813894849424777, 'samples': 6316032, 'steps': 32895, 'loss/train': 1.8859457969665527}} 11/07/2021 01:53:06 - INFO - __main__ - Step 32900: {'lr': 0.0004481260035818704, 'samples': 6316800, 'steps': 32899, 'loss/train': 0.8094185590744019}}} 11/07/2021 01:53:08 - INFO - __main__ - Step 32904: {'lr': 0.0004481130572411327, 'samples': 6317568, 'steps': 32903, 'loss/train': 1.5772401094436646}}} 11/07/2021 01:53:10 - INFO - __main__ - Step 32909: {'lr': 0.00044809687230672115, 'samples': 6318528, 'steps': 32908, 'loss/train': 1.201430320739746}}} 11/07/2021 01:53:10 - INFO - __main__ - Step 32909: {'lr': 0.00044809687230672115, 'samples': 6318528, 'steps': 32908, 'loss/train': 1.201430320739746}}} 11/07/2021 01:53:14 - INFO - __main__ - Step 32916: {'lr': 0.00044807420964969113, 'samples': 6319872, 'steps': 32915, 'loss/train': 1.5196776390075684}} 11/07/2021 01:53:16 - INFO - __main__ - Step 32920: {'lr': 0.00044806125759644567, 'samples': 6320640, 'steps': 32919, 'loss/train': 1.3492869138717651}} 11/07/2021 01:53:18 - INFO - __main__ - Step 32924: {'lr': 0.0004480483041153066, 'samples': 6321408, 'steps': 32923, 'loss/train': 1.350351095199585}1}} 11/07/2021 01:53:20 - INFO - __main__ - Step 32929: {'lr': 0.00044803211025604985, 'samples': 6322368, 'steps': 32928, 'loss/train': 1.494863748550415}}} 11/07/2021 01:53:22 - INFO - __main__ - Step 32933: {'lr': 0.0004480191535624918, 'samples': 6323136, 'steps': 32932, 'loss/train': 1.4989677667617798}}} 11/07/2021 01:53:24 - INFO - __main__ - Step 32937: {'lr': 0.00044800619544134375, 'samples': 6323904, 'steps': 32936, 'loss/train': 1.7458767890930176}} 11/07/2021 01:53:24 - INFO - __main__ - Step 32937: {'lr': 0.00044800619544134375, 'samples': 6323904, 'steps': 32936, 'loss/train': 1.7458767890930176}} 11/07/2021 01:53:28 - INFO - __main__ - Step 32945: {'lr': 0.00044798027491665135, 'samples': 6325440, 'steps': 32944, 'loss/train': 1.762140154838562}}} 11/07/2021 01:53:30 - INFO - __main__ - Step 32950: {'lr': 0.0004479640716894483, 'samples': 6326400, 'steps': 32949, 'loss/train': 1.559309720993042}}}} 11/07/2021 01:53:33 - INFO - __main__ - Step 32955: {'lr': 0.00044794786623225636, 'samples': 6327360, 'steps': 32954, 'loss/train': 1.3424718379974365}} 11/07/2021 01:53:33 - INFO - __main__ - Step 32955: {'lr': 0.00044794786623225636, 'samples': 6327360, 'steps': 32954, 'loss/train': 1.3424718379974365}} 11/07/2021 01:53:36 - INFO - __main__ - Step 32962: {'lr': 0.00044792517484615384, 'samples': 6328704, 'steps': 32961, 'loss/train': 1.1192171573638916}} 11/07/2021 01:53:38 - INFO - __main__ - Step 32966: {'lr': 0.0004479122063777728, 'samples': 6329472, 'steps': 32965, 'loss/train': 1.8037034273147583}}} 11/07/2021 01:53:41 - INFO - __main__ - Step 32971: {'lr': 0.00044789599378584324, 'samples': 6330432, 'steps': 32970, 'loss/train': 1.7470932006835938}} 11/07/2021 01:53:41 - INFO - __main__ - Step 32971: {'lr': 0.00044789599378584324, 'samples': 6330432, 'steps': 32970, 'loss/train': 1.7470932006835938}} 11/07/2021 01:53:43 - INFO - __main__ - Step 32977: {'lr': 0.0004478765357329708, 'samples': 6331584, 'steps': 32976, 'loss/train': 1.7699060440063477}}} 11/07/2021 01:53:46 - INFO - __main__ - Step 32982: {'lr': 0.00044786031823699384, 'samples': 6332544, 'steps': 32981, 'loss/train': 1.4322315454483032}} 11/07/2021 01:53:46 - INFO - __main__ - Step 32982: {'lr': 0.00044786031823699384, 'samples': 6332544, 'steps': 32981, 'loss/train': 1.4322315454483032}} 11/07/2021 01:53:50 - INFO - __main__ - Step 32990: {'lr': 0.00044783436560756086, 'samples': 6334080, 'steps': 32989, 'loss/train': 1.7610799074172974}} 11/07/2021 01:53:52 - INFO - __main__ - Step 32994: {'lr': 0.00044782138715341094, 'samples': 6334848, 'steps': 32993, 'loss/train': 1.7642078399658203}} 11/07/2021 01:53:54 - INFO - __main__ - Step 32998: {'lr': 0.00044780840727309676, 'samples': 6335616, 'steps': 32997, 'loss/train': 1.4560761451721191}} 11/07/2021 01:53:56 - INFO - __main__ - Step 33003: {'lr': 0.00044779218041730314, 'samples': 6336576, 'steps': 33002, 'loss/train': 1.16958487033844}1}} 11/07/2021 01:53:56 - INFO - __main__ - Step 33003: {'lr': 0.00044779218041730314, 'samples': 6336576, 'steps': 33002, 'loss/train': 1.16958487033844}1}} 11/07/2021 01:53:56 - INFO - __main__ - Step 33003: {'lr': 0.00044779218041730314, 'samples': 6336576, 'steps': 33002, 'loss/train': 1.16958487033844}1}} 11/07/2021 01:54:01 - INFO - __main__ - Step 33014: {'lr': 0.0004477564734920694, 'samples': 6338688, 'steps': 33013, 'loss/train': 1.2043906450271606}}} 11/07/2021 01:54:03 - INFO - __main__ - Step 33018: {'lr': 0.0004477434864823379, 'samples': 6339456, 'steps': 33017, 'loss/train': 1.5662287473678589}}} 11/07/2021 01:54:06 - INFO - __main__ - Step 33023: {'lr': 0.0004477272507154308, 'samples': 6340416, 'steps': 33022, 'loss/train': 1.5063848495483398}}} 11/07/2021 01:54:08 - INFO - __main__ - Step 33028: {'lr': 0.0004477110127212025, 'samples': 6341376, 'steps': 33027, 'loss/train': 1.0178325176239014}}} 11/07/2021 01:54:10 - INFO - __main__ - Step 33032: {'lr': 0.0004476980207222716, 'samples': 6342144, 'steps': 33031, 'loss/train': 1.3227752447128296}}} 11/07/2021 01:54:10 - INFO - __main__ - Step 33032: {'lr': 0.0004476980207222716, 'samples': 6342144, 'steps': 33031, 'loss/train': 1.3227752447128296}}} 11/07/2021 01:54:14 - INFO - __main__ - Step 33039: {'lr': 0.0004476752812946312, 'samples': 6343488, 'steps': 33038, 'loss/train': 1.3962721824645996}}} 11/07/2021 01:54:16 - INFO - __main__ - Step 33044: {'lr': 0.00044765903617420436, 'samples': 6344448, 'steps': 33043, 'loss/train': 1.1298271417617798}} 11/07/2021 01:54:18 - INFO - __main__ - Step 33049: {'lr': 0.0004476427888272248, 'samples': 6345408, 'steps': 33048, 'loss/train': 1.2218657732009888}}} 11/07/2021 01:54:18 - INFO - __main__ - Step 33049: {'lr': 0.0004476427888272248, 'samples': 6345408, 'steps': 33048, 'loss/train': 1.2218657732009888}}} 11/07/2021 01:54:22 - INFO - __main__ - Step 33056: {'lr': 0.0004476200388011932, 'samples': 6346752, 'steps': 33055, 'loss/train': 1.2742310762405396}}} 11/07/2021 01:54:24 - INFO - __main__ - Step 33060: {'lr': 0.00044760703682730584, 'samples': 6347520, 'steps': 33059, 'loss/train': 3.064441680908203}}} 11/07/2021 01:54:26 - INFO - __main__ - Step 33064: {'lr': 0.0004475940334287996, 'samples': 6348288, 'steps': 33063, 'loss/train': 1.500831961631775}}}} 11/07/2021 01:54:28 - INFO - __main__ - Step 33069: {'lr': 0.0004475777771774393, 'samples': 6349248, 'steps': 33068, 'loss/train': 1.1545443534851074}}} 11/07/2021 01:54:30 - INFO - __main__ - Step 33073: {'lr': 0.00044756477057388336, 'samples': 6350016, 'steps': 33072, 'loss/train': 1.4639098644256592}} 11/07/2021 01:54:30 - INFO - __main__ - Step 33073: {'lr': 0.00044756477057388336, 'samples': 6350016, 'steps': 33072, 'loss/train': 1.4639098644256592}} 11/07/2021 01:54:34 - INFO - __main__ - Step 33080: {'lr': 0.00044754200559046076, 'samples': 6351360, 'steps': 33079, 'loss/train': 1.2222187519073486}} 11/07/2021 01:54:36 - INFO - __main__ - Step 33085: {'lr': 0.00044752574221770537, 'samples': 6352320, 'steps': 33084, 'loss/train': 1.0361472368240356}} 11/07/2021 01:54:36 - INFO - __main__ - Step 33085: {'lr': 0.00044752574221770537, 'samples': 6352320, 'steps': 33084, 'loss/train': 1.0361472368240356}} 11/07/2021 01:54:40 - INFO - __main__ - Step 33093: {'lr': 0.00044749971619326633, 'samples': 6353856, 'steps': 33092, 'loss/train': 1.8795900344848633}} 11/07/2021 01:54:42 - INFO - __main__ - Step 33097: {'lr': 0.0004474867010452321, 'samples': 6354624, 'steps': 33096, 'loss/train': 1.044918179512024}3}} 11/07/2021 01:54:44 - INFO - __main__ - Step 33102: {'lr': 0.00044747043010805, 'samples': 6355584, 'steps': 33101, 'loss/train': 1.7007176876068115}}3}} 11/07/2021 01:54:47 - INFO - __main__ - Step 33106: {'lr': 0.0004474574117567072, 'samples': 6356352, 'steps': 33105, 'loss/train': 1.1456880569458008}}} 11/07/2021 01:54:49 - INFO - __main__ - Step 33110: {'lr': 0.0004474443919818241, 'samples': 6357120, 'steps': 33109, 'loss/train': 1.4442769289016724}}} 11/07/2021 01:54:51 - INFO - __main__ - Step 33114: {'lr': 0.0004474313707834947, 'samples': 6357888, 'steps': 33113, 'loss/train': 4.484914779663086}}}} 11/07/2021 01:54:52 - INFO - __main__ - Step 33118: {'lr': 0.0004474183481618129, 'samples': 6358656, 'steps': 33117, 'loss/train': 1.3420227766036987}}} 11/07/2021 01:54:52 - INFO - __main__ - Step 33118: {'lr': 0.0004474183481618129, 'samples': 6358656, 'steps': 33117, 'loss/train': 1.3420227766036987}}} 11/07/2021 01:54:56 - INFO - __main__ - Step 33126: {'lr': 0.0004473922986487674, 'samples': 6360192, 'steps': 33125, 'loss/train': 1.3780709505081177}}} 11/07/2021 01:54:58 - INFO - __main__ - Step 33130: {'lr': 0.0004473792717575915, 'samples': 6360960, 'steps': 33129, 'loss/train': 1.9495084285736084}}} 11/07/2021 01:55:00 - INFO - __main__ - Step 33134: {'lr': 0.0004473662434434388, 'samples': 6361728, 'steps': 33133, 'loss/train': 1.762020230293274}}}} 11/07/2021 01:55:02 - INFO - __main__ - Step 33139: {'lr': 0.00044734995604982973, 'samples': 6362688, 'steps': 33138, 'loss/train': 1.522831916809082}}} 11/07/2021 01:55:02 - INFO - __main__ - Step 33139: {'lr': 0.00044734995604982973, 'samples': 6362688, 'steps': 33138, 'loss/train': 1.522831916809082}}} 11/07/2021 01:55:02 - INFO - __main__ - Step 33139: {'lr': 0.00044734995604982973, 'samples': 6362688, 'steps': 33138, 'loss/train': 1.522831916809082}}} 11/07/2021 01:55:08 - INFO - __main__ - Step 33149: {'lr': 0.0004473173745935818, 'samples': 6364608, 'steps': 33148, 'loss/train': 1.4953867197036743}}} 11/07/2021 01:55:10 - INFO - __main__ - Step 33154: {'lr': 0.00044730108053130986, 'samples': 6365568, 'steps': 33153, 'loss/train': 1.3361729383468628}} 11/07/2021 01:55:13 - INFO - __main__ - Step 33159: {'lr': 0.00044728478424651744, 'samples': 6366528, 'steps': 33158, 'loss/train': 0.8608505129814148}} 11/07/2021 01:55:13 - INFO - __main__ - Step 33159: {'lr': 0.00044728478424651744, 'samples': 6366528, 'steps': 33158, 'loss/train': 0.8608505129814148}} 11/07/2021 01:55:16 - INFO - __main__ - Step 33166: {'lr': 0.0004472619657143229, 'samples': 6367872, 'steps': 33165, 'loss/train': 1.4132847785949707}}} 11/07/2021 01:55:18 - INFO - __main__ - Step 33170: {'lr': 0.0004472489245976063, 'samples': 6368640, 'steps': 33169, 'loss/train': 1.5054649114608765}}} 11/07/2021 01:55:20 - INFO - __main__ - Step 33175: {'lr': 0.00044723262120198177, 'samples': 6369600, 'steps': 33174, 'loss/train': 1.5991913080215454}} 11/07/2021 01:55:20 - INFO - __main__ - Step 33175: {'lr': 0.00044723262120198177, 'samples': 6369600, 'steps': 33174, 'loss/train': 1.5991913080215454}} 11/07/2021 01:55:24 - INFO - __main__ - Step 33182: {'lr': 0.00044720979271560963, 'samples': 6370944, 'steps': 33181, 'loss/train': 1.4953434467315674}} 11/07/2021 01:55:26 - INFO - __main__ - Step 33186: {'lr': 0.0004471967459113086, 'samples': 6371712, 'steps': 33185, 'loss/train': 0.953913688659668}4}} 11/07/2021 01:55:28 - INFO - __main__ - Step 33191: {'lr': 0.00044718043540673257, 'samples': 6372672, 'steps': 33190, 'loss/train': 1.7337013483047485}} 11/07/2021 01:55:28 - INFO - __main__ - Step 33191: {'lr': 0.00044718043540673257, 'samples': 6372672, 'steps': 33190, 'loss/train': 1.7337013483047485}} 11/07/2021 01:55:32 - INFO - __main__ - Step 33199: {'lr': 0.0004471543339794715, 'samples': 6374208, 'steps': 33198, 'loss/train': 1.2976843118667603}}} 11/07/2021 01:55:34 - INFO - __main__ - Step 33203: {'lr': 0.0004471412811337611, 'samples': 6374976, 'steps': 33202, 'loss/train': 1.6096456050872803}}} 11/07/2021 01:55:36 - INFO - __main__ - Step 33207: {'lr': 0.00044712822686678955, 'samples': 6375744, 'steps': 33206, 'loss/train': 1.668056607246399}}} 11/07/2021 01:55:38 - INFO - __main__ - Step 33212: {'lr': 0.00044711190703457005, 'samples': 6376704, 'steps': 33211, 'loss/train': 1.2352567911148071}} 11/07/2021 01:55:40 - INFO - __main__ - Step 33216: {'lr': 0.0004470988495701052, 'samples': 6377472, 'steps': 33215, 'loss/train': 1.591020107269287}1}} 11/07/2021 01:55:42 - INFO - __main__ - Step 33220: {'lr': 0.00044708579068468505, 'samples': 6378240, 'steps': 33219, 'loss/train': 1.5764384269714355}} 11/07/2021 01:55:44 - INFO - __main__ - Step 33224: {'lr': 0.0004470727303784039, 'samples': 6379008, 'steps': 33223, 'loss/train': 1.5153917074203491}}} 11/07/2021 01:55:46 - INFO - __main__ - Step 33228: {'lr': 0.00044705966865135583, 'samples': 6379776, 'steps': 33227, 'loss/train': 2.1757476329803467}} 11/07/2021 01:55:48 - INFO - __main__ - Step 33232: {'lr': 0.00044704660550363507, 'samples': 6380544, 'steps': 33231, 'loss/train': 1.9485957622528076}} 11/07/2021 01:55:50 - INFO - __main__ - Step 33237: {'lr': 0.0004470302745713065, 'samples': 6381504, 'steps': 33236, 'loss/train': 1.4370431900024414}}} 11/07/2021 01:55:50 - INFO - __main__ - Step 33237: {'lr': 0.0004470302745713065, 'samples': 6381504, 'steps': 33236, 'loss/train': 1.4370431900024414}}} 11/07/2021 01:55:55 - INFO - __main__ - Step 33245: {'lr': 0.0004470041404631597, 'samples': 6383040, 'steps': 33244, 'loss/train': 0.9164215922355652}}} 11/07/2021 01:55:56 - INFO - __main__ - Step 33249: {'lr': 0.00044699107127863056, 'samples': 6383808, 'steps': 33248, 'loss/train': 1.3086721897125244}} 11/07/2021 01:55:59 - INFO - __main__ - Step 33253: {'lr': 0.00044697800067392327, 'samples': 6384576, 'steps': 33252, 'loss/train': 1.573535680770874}}} 11/07/2021 01:55:59 - INFO - __main__ - Step 33253: {'lr': 0.00044697800067392327, 'samples': 6384576, 'steps': 33252, 'loss/train': 1.573535680770874}}} 11/07/2021 01:56:02 - INFO - __main__ - Step 33260: {'lr': 0.0004469551236986651, 'samples': 6385920, 'steps': 33259, 'loss/train': 1.6525382995605469}}} 11/07/2021 01:56:04 - INFO - __main__ - Step 33264: {'lr': 0.00044694204918895367, 'samples': 6386688, 'steps': 33263, 'loss/train': 1.9074795246124268}} 11/07/2021 01:56:06 - INFO - __main__ - Step 33269: {'lr': 0.00044692570405519683, 'samples': 6387648, 'steps': 33268, 'loss/train': 1.416410207748413}}} 11/07/2021 01:56:06 - INFO - __main__ - Step 33269: {'lr': 0.00044692570405519683, 'samples': 6387648, 'steps': 33268, 'loss/train': 1.416410207748413}}} 11/07/2021 01:56:11 - INFO - __main__ - Step 33277: {'lr': 0.00044689954722721494, 'samples': 6389184, 'steps': 33276, 'loss/train': 1.537747859954834}}} 11/07/2021 01:56:12 - INFO - __main__ - Step 33281: {'lr': 0.00044688646668389933, 'samples': 6389952, 'steps': 33280, 'loss/train': 1.5683271884918213}} 11/07/2021 01:56:14 - INFO - __main__ - Step 33285: {'lr': 0.00044687338472115964, 'samples': 6390720, 'steps': 33284, 'loss/train': 1.4872294664382935}} 11/07/2021 01:56:17 - INFO - __main__ - Step 33290: {'lr': 0.00044685703027181364, 'samples': 6391680, 'steps': 33289, 'loss/train': 1.8223915100097656}} 11/07/2021 01:56:19 - INFO - __main__ - Step 33295: {'lr': 0.00044684067360494905, 'samples': 6392640, 'steps': 33294, 'loss/train': 1.848178744316101}}} 11/07/2021 01:56:19 - INFO - __main__ - Step 33295: {'lr': 0.00044684067360494905, 'samples': 6392640, 'steps': 33294, 'loss/train': 1.848178744316101}}} 11/07/2021 01:56:23 - INFO - __main__ - Step 33302: {'lr': 0.0004468177705462585, 'samples': 6393984, 'steps': 33301, 'loss/train': 1.3737915754318237}}} 11/07/2021 01:56:24 - INFO - __main__ - Step 33306: {'lr': 0.00044680468113309006, 'samples': 6394752, 'steps': 33305, 'loss/train': 1.0961544513702393}} 11/07/2021 01:56:27 - INFO - __main__ - Step 33311: {'lr': 0.0004467883173714047, 'samples': 6395712, 'steps': 33310, 'loss/train': 1.6529051065444946}}} 11/07/2021 01:56:29 - INFO - __main__ - Step 33316: {'lr': 0.00044677195139297476, 'samples': 6396672, 'steps': 33315, 'loss/train': 1.5994012355804443}} 11/07/2021 01:56:31 - INFO - __main__ - Step 33320: {'lr': 0.00044675885701429873, 'samples': 6397440, 'steps': 33319, 'loss/train': 1.3172155618667603}} 11/07/2021 01:56:31 - INFO - __main__ - Step 33320: {'lr': 0.00044675885701429873, 'samples': 6397440, 'steps': 33319, 'loss/train': 1.3172155618667603}} 11/07/2021 01:56:34 - INFO - __main__ - Step 33327: {'lr': 0.000446735938438397, 'samples': 6398784, 'steps': 33326, 'loss/train': 0.787378191947937}03}} 11/07/2021 01:56:37 - INFO - __main__ - Step 33331: {'lr': 0.0004467228401590619, 'samples': 6399552, 'steps': 33330, 'loss/train': 1.2289947271347046}}} 11/07/2021 01:56:37 - INFO - __main__ - Step 33331: {'lr': 0.0004467228401590619, 'samples': 6399552, 'steps': 33330, 'loss/train': 1.2289947271347046}}} 11/07/2021 01:56:41 - INFO - __main__ - Step 33338: {'lr': 0.00044669991475763173, 'samples': 6400896, 'steps': 33337, 'loss/train': 1.0887442827224731}} 11/07/2021 01:56:42 - INFO - __main__ - Step 33342: {'lr': 0.00044668681257835173, 'samples': 6401664, 'steps': 33341, 'loss/train': 1.473926305770874}}} 11/07/2021 01:56:44 - INFO - __main__ - Step 33346: {'lr': 0.0004466737089810871, 'samples': 6402432, 'steps': 33345, 'loss/train': 1.721977710723877}}}} 11/07/2021 01:56:47 - INFO - __main__ - Step 33351: {'lr': 0.0004466573274906092, 'samples': 6403392, 'steps': 33350, 'loss/train': 1.2804734706878662}}} 11/07/2021 01:56:48 - INFO - __main__ - Step 33355: {'lr': 0.0004466442207032244, 'samples': 6404160, 'steps': 33354, 'loss/train': 1.2369035482406616}}} 11/07/2021 01:56:51 - INFO - __main__ - Step 33360: {'lr': 0.0004466278352253954, 'samples': 6405120, 'steps': 33359, 'loss/train': 1.5842740535736084}}} 11/07/2021 01:56:51 - INFO - __main__ - Step 33360: {'lr': 0.0004466278352253954, 'samples': 6405120, 'steps': 33359, 'loss/train': 1.5842740535736084}}} 11/07/2021 01:56:54 - INFO - __main__ - Step 33367: {'lr': 0.00044660489183538237, 'samples': 6406464, 'steps': 33366, 'loss/train': 1.8919562101364136}} 11/07/2021 01:56:57 - INFO - __main__ - Step 33372: {'lr': 0.0004465885010420154, 'samples': 6407424, 'steps': 33371, 'loss/train': 1.249595046043396}6}} 11/07/2021 01:56:57 - INFO - __main__ - Step 33372: {'lr': 0.0004465885010420154, 'samples': 6407424, 'steps': 33371, 'loss/train': 1.249595046043396}6}} 11/07/2021 01:57:01 - INFO - __main__ - Step 33380: {'lr': 0.00044656227116655824, 'samples': 6408960, 'steps': 33379, 'loss/train': 1.671711802482605}}} 11/07/2021 01:57:01 - INFO - __main__ - Step 33380: {'lr': 0.00044656227116655824, 'samples': 6408960, 'steps': 33379, 'loss/train': 1.671711802482605}}} 11/07/2021 01:57:05 - INFO - __main__ - Step 33387: {'lr': 0.00044653931537569125, 'samples': 6410304, 'steps': 33386, 'loss/train': 7.1502532958984375}} 11/07/2021 01:57:06 - INFO - __main__ - Step 33391: {'lr': 0.0004465261958326108, 'samples': 6411072, 'steps': 33390, 'loss/train': 2.0901994705200195}}} 11/07/2021 01:57:08 - INFO - __main__ - Step 33395: {'lr': 0.0004465130748727036, 'samples': 6411840, 'steps': 33394, 'loss/train': 1.4100768566131592}}} 11/07/2021 01:57:11 - INFO - __main__ - Step 33400: {'lr': 0.0004464966716805511, 'samples': 6412800, 'steps': 33399, 'loss/train': 1.7051008939743042}}} 11/07/2021 01:57:13 - INFO - __main__ - Step 33404: {'lr': 0.0004464835475331296, 'samples': 6413568, 'steps': 33403, 'loss/train': 1.8330544233322144}}} 11/07/2021 01:57:13 - INFO - __main__ - Step 33404: {'lr': 0.0004464835475331296, 'samples': 6413568, 'steps': 33403, 'loss/train': 1.8330544233322144}}} 11/07/2021 01:57:16 - INFO - __main__ - Step 33411: {'lr': 0.0004464605768666995, 'samples': 6414912, 'steps': 33410, 'loss/train': 1.5178637504577637}}} 11/07/2021 01:57:19 - INFO - __main__ - Step 33416: {'lr': 0.00044644416659212806, 'samples': 6415872, 'steps': 33415, 'loss/train': 1.6313021183013916}} 11/07/2021 01:57:19 - INFO - __main__ - Step 33416: {'lr': 0.00044644416659212806, 'samples': 6415872, 'steps': 33415, 'loss/train': 1.6313021183013916}} 11/07/2021 01:57:23 - INFO - __main__ - Step 33424: {'lr': 0.0004464179055501258, 'samples': 6417408, 'steps': 33423, 'loss/train': 1.4190086126327515}}} 11/07/2021 01:57:25 - INFO - __main__ - Step 33428: {'lr': 0.00044640477290500824, 'samples': 6418176, 'steps': 33427, 'loss/train': 1.526247501373291}}} 11/07/2021 01:57:26 - INFO - __main__ - Step 33432: {'lr': 0.0004463916388439394, 'samples': 6418944, 'steps': 33431, 'loss/train': 1.5159145593643188}}} 11/07/2021 01:57:29 - INFO - __main__ - Step 33437: {'lr': 0.000446375219276566, 'samples': 6419904, 'steps': 33436, 'loss/train': 1.4341305494308472}}}} 11/07/2021 01:57:31 - INFO - __main__ - Step 33441: {'lr': 0.00044636208202995277, 'samples': 6420672, 'steps': 33440, 'loss/train': 1.4302234649658203}} 11/07/2021 01:57:33 - INFO - __main__ - Step 33445: {'lr': 0.0004463489433676959, 'samples': 6421440, 'steps': 33444, 'loss/train': 2.250821352005005}3}} 11/07/2021 01:57:35 - INFO - __main__ - Step 33449: {'lr': 0.0004463358032898903, 'samples': 6422208, 'steps': 33448, 'loss/train': 2.1877307891845703}}} 11/07/2021 01:57:36 - INFO - __main__ - Step 33453: {'lr': 0.0004463226617966305, 'samples': 6422976, 'steps': 33452, 'loss/train': 1.6182461977005005}}} 11/07/2021 01:57:38 - INFO - __main__ - Step 33457: {'lr': 0.0004463095188880113, 'samples': 6423744, 'steps': 33456, 'loss/train': 1.511284589767456}}}} 11/07/2021 01:57:41 - INFO - __main__ - Step 33462: {'lr': 0.0004462930882620325, 'samples': 6424704, 'steps': 33461, 'loss/train': 1.2532994747161865}}} 11/07/2021 01:57:43 - INFO - __main__ - Step 33466: {'lr': 0.0004462799421692012, 'samples': 6425472, 'steps': 33465, 'loss/train': 1.49006986618042}5}}} 11/07/2021 01:57:45 - INFO - __main__ - Step 33470: {'lr': 0.0004462667946613184, 'samples': 6426240, 'steps': 33469, 'loss/train': 1.6399989128112793}}} 11/07/2021 01:57:47 - INFO - __main__ - Step 33474: {'lr': 0.00044625364573847904, 'samples': 6427008, 'steps': 33473, 'loss/train': 0.8817580342292786}} 11/07/2021 01:57:49 - INFO - __main__ - Step 33478: {'lr': 0.00044624049540077784, 'samples': 6427776, 'steps': 33477, 'loss/train': 1.6491341590881348}} 11/07/2021 01:57:51 - INFO - __main__ - Step 33482: {'lr': 0.0004462273436483095, 'samples': 6428544, 'steps': 33481, 'loss/train': 2.0158817768096924}}} 11/07/2021 01:57:53 - INFO - __main__ - Step 33486: {'lr': 0.0004462141904811691, 'samples': 6429312, 'steps': 33485, 'loss/train': 1.271095633506775}}}} 11/07/2021 01:57:55 - INFO - __main__ - Step 33490: {'lr': 0.0004462010358994513, 'samples': 6430080, 'steps': 33489, 'loss/train': 1.6507304906845093}}} 11/07/2021 01:57:57 - INFO - __main__ - Step 33494: {'lr': 0.00044618787990325086, 'samples': 6430848, 'steps': 33493, 'loss/train': 1.9450844526290894}} 11/07/2021 01:57:59 - INFO - __main__ - Step 33498: {'lr': 0.0004461747224926628, 'samples': 6431616, 'steps': 33497, 'loss/train': 1.6501895189285278}}} 11/07/2021 01:58:01 - INFO - __main__ - Step 33503: {'lr': 0.0004461582737405895, 'samples': 6432576, 'steps': 33502, 'loss/train': 1.302112340927124}}}} 11/07/2021 01:58:03 - INFO - __main__ - Step 33507: {'lr': 0.0004461451131479759, 'samples': 6433344, 'steps': 33506, 'loss/train': 1.5932680368423462}}} 11/07/2021 01:58:05 - INFO - __main__ - Step 33511: {'lr': 0.0004461319511412829, 'samples': 6434112, 'steps': 33510, 'loss/train': 1.8179429769515991}}} 11/07/2021 01:58:07 - INFO - __main__ - Step 33515: {'lr': 0.0004461187877206055, 'samples': 6434880, 'steps': 33514, 'loss/train': 1.1957716941833496}}} 11/07/2021 01:58:09 - INFO - __main__ - Step 33519: {'lr': 0.00044610562288603846, 'samples': 6435648, 'steps': 33518, 'loss/train': 1.0899434089660645}} 11/07/2021 01:58:11 - INFO - __main__ - Step 33524: {'lr': 0.00044608916485469195, 'samples': 6436608, 'steps': 33523, 'loss/train': 1.9318363666534424}} 11/07/2021 01:58:13 - INFO - __main__ - Step 33528: {'lr': 0.0004460759968392204, 'samples': 6437376, 'steps': 33527, 'loss/train': 1.5011610984802246}}} 11/07/2021 01:58:15 - INFO - __main__ - Step 33532: {'lr': 0.0004460628274101677, 'samples': 6438144, 'steps': 33531, 'loss/train': 1.682828664779663}}}} 11/07/2021 01:58:17 - INFO - __main__ - Step 33536: {'lr': 0.00044604965656762884, 'samples': 6438912, 'steps': 33535, 'loss/train': 1.7080373764038086}} 11/07/2021 01:58:19 - INFO - __main__ - Step 33540: {'lr': 0.00044603648431169884, 'samples': 6439680, 'steps': 33539, 'loss/train': 1.581534504890442}}} 11/07/2021 01:58:21 - INFO - __main__ - Step 33545: {'lr': 0.00044602001700434963, 'samples': 6440640, 'steps': 33544, 'loss/train': 0.7786942720413208}} 11/07/2021 01:58:23 - INFO - __main__ - Step 33549: {'lr': 0.0004460068415686366, 'samples': 6441408, 'steps': 33548, 'loss/train': 1.798831582069397}8}} 11/07/2021 01:58:25 - INFO - __main__ - Step 33553: {'lr': 0.000445993664719841, 'samples': 6442176, 'steps': 33552, 'loss/train': 1.5944808721542358}8}} 11/07/2021 01:58:27 - INFO - __main__ - Step 33557: {'lr': 0.000445980486458058, 'samples': 6442944, 'steps': 33556, 'loss/train': 1.659221887588501}}8}} 11/07/2021 01:58:29 - INFO - __main__ - Step 33561: {'lr': 0.00044596730678338236, 'samples': 6443712, 'steps': 33560, 'loss/train': 1.454308032989502}}} 11/07/2021 01:58:31 - INFO - __main__ - Step 33565: {'lr': 0.00044595412569590934, 'samples': 6444480, 'steps': 33564, 'loss/train': 1.0743993520736694}} 11/07/2021 01:58:33 - INFO - __main__ - Step 33570: {'lr': 0.00044593764734996615, 'samples': 6445440, 'steps': 33569, 'loss/train': 1.4638475179672241}} 11/07/2021 01:58:35 - INFO - __main__ - Step 33574: {'lr': 0.0004459244630840461, 'samples': 6446208, 'steps': 33573, 'loss/train': 0.9683477878570557}}} 11/07/2021 01:58:38 - INFO - __main__ - Step 33578: {'lr': 0.0004459112774056374, 'samples': 6446976, 'steps': 33577, 'loss/train': 1.8841018676757812}}} 11/07/2021 01:58:40 - INFO - __main__ - Step 33582: {'lr': 0.00044589809031483517, 'samples': 6447744, 'steps': 33581, 'loss/train': 1.8601634502410889}} 11/07/2021 01:58:41 - INFO - __main__ - Step 33586: {'lr': 0.00044588490181173435, 'samples': 6448512, 'steps': 33585, 'loss/train': 1.6263537406921387}} 11/07/2021 01:58:43 - INFO - __main__ - Step 33591: {'lr': 0.0004458684141969585, 'samples': 6449472, 'steps': 33590, 'loss/train': 1.5999499559402466}}} 11/07/2021 01:58:43 - INFO - __main__ - Step 33591: {'lr': 0.0004458684141969585, 'samples': 6449472, 'steps': 33590, 'loss/train': 1.5999499559402466}}} 11/07/2021 01:58:48 - INFO - __main__ - Step 33599: {'lr': 0.00044584202942411956, 'samples': 6451008, 'steps': 33598, 'loss/train': 1.7060731649398804}} 11/07/2021 01:58:50 - INFO - __main__ - Step 33603: {'lr': 0.00044582883491981097, 'samples': 6451776, 'steps': 33602, 'loss/train': 1.7198821306228638}} 11/07/2021 01:58:51 - INFO - __main__ - Step 33607: {'lr': 0.00044581563900370326, 'samples': 6452544, 'steps': 33606, 'loss/train': 1.5778969526290894}} 11/07/2021 01:58:53 - INFO - __main__ - Step 33611: {'lr': 0.00044580244167589136, 'samples': 6453312, 'steps': 33610, 'loss/train': 1.672492265701294}}} 11/07/2021 01:58:56 - INFO - __main__ - Step 33616: {'lr': 0.00044578594303106266, 'samples': 6454272, 'steps': 33615, 'loss/train': 1.8838460445404053}} 11/07/2021 01:58:56 - INFO - __main__ - Step 33616: {'lr': 0.00044578594303106266, 'samples': 6454272, 'steps': 33615, 'loss/train': 1.8838460445404053}} 11/07/2021 01:58:59 - INFO - __main__ - Step 33623: {'lr': 0.0004457628412231828, 'samples': 6455616, 'steps': 33622, 'loss/train': 1.6696631908416748}}} 11/07/2021 01:59:01 - INFO - __main__ - Step 33627: {'lr': 0.0004457496382495062, 'samples': 6456384, 'steps': 33626, 'loss/train': 1.0140951871871948}}} 11/07/2021 01:59:03 - INFO - __main__ - Step 33632: {'lr': 0.00044573313254788176, 'samples': 6457344, 'steps': 33631, 'loss/train': 1.1901419162750244}} 11/07/2021 01:59:06 - INFO - __main__ - Step 33637: {'lr': 0.0004457166246413992, 'samples': 6458304, 'steps': 33636, 'loss/train': 1.040553092956543}4}} 11/07/2021 01:59:08 - INFO - __main__ - Step 33641: {'lr': 0.00044570341672884006, 'samples': 6459072, 'steps': 33640, 'loss/train': 1.2570524215698242}} 11/07/2021 01:59:08 - INFO - __main__ - Step 33641: {'lr': 0.00044570341672884006, 'samples': 6459072, 'steps': 33640, 'loss/train': 1.2570524215698242}} 11/07/2021 01:59:11 - INFO - __main__ - Step 33648: {'lr': 0.00044568029948695287, 'samples': 6460416, 'steps': 33647, 'loss/train': 1.5932193994522095}} 11/07/2021 01:59:14 - INFO - __main__ - Step 33653: {'lr': 0.00044566378452617363, 'samples': 6461376, 'steps': 33652, 'loss/train': 1.5953766107559204}} 11/07/2021 01:59:16 - INFO - __main__ - Step 33658: {'lr': 0.0004456472673613174, 'samples': 6462336, 'steps': 33657, 'loss/train': 1.685717225074768}4}} 11/07/2021 01:59:18 - INFO - __main__ - Step 33662: {'lr': 0.000445634052042622, 'samples': 6463104, 'steps': 33661, 'loss/train': 1.6241892576217651}4}} 11/07/2021 01:59:18 - INFO - __main__ - Step 33662: {'lr': 0.000445634052042622, 'samples': 6463104, 'steps': 33661, 'loss/train': 1.6241892576217651}4}} 11/07/2021 01:59:21 - INFO - __main__ - Step 33669: {'lr': 0.00044561092184119933, 'samples': 6464448, 'steps': 33668, 'loss/train': 1.6479049921035767}} 11/07/2021 01:59:23 - INFO - __main__ - Step 33673: {'lr': 0.000445597702644147, 'samples': 6465216, 'steps': 33672, 'loss/train': 1.821217656135559}67}} 11/07/2021 01:59:26 - INFO - __main__ - Step 33678: {'lr': 0.0004455811766648434, 'samples': 6466176, 'steps': 33677, 'loss/train': 1.6165025234222412}}} 11/07/2021 01:59:28 - INFO - __main__ - Step 33683: {'lr': 0.0004455646484823933, 'samples': 6467136, 'steps': 33682, 'loss/train': 2.1249895095825195}}} 11/07/2021 01:59:28 - INFO - __main__ - Step 33683: {'lr': 0.0004455646484823933, 'samples': 6467136, 'steps': 33682, 'loss/train': 2.1249895095825195}}} 11/07/2021 01:59:31 - INFO - __main__ - Step 33690: {'lr': 0.00044554150532603154, 'samples': 6468480, 'steps': 33689, 'loss/train': 1.949639081954956}}} 11/07/2021 01:59:34 - INFO - __main__ - Step 33694: {'lr': 0.00044552827872684493, 'samples': 6469248, 'steps': 33693, 'loss/train': 1.9272459745407104}} 11/07/2021 01:59:36 - INFO - __main__ - Step 33699: {'lr': 0.00044551174349557733, 'samples': 6470208, 'steps': 33698, 'loss/train': 1.4626320600509644}} 11/07/2021 01:59:36 - INFO - __main__ - Step 33699: {'lr': 0.00044551174349557733, 'samples': 6470208, 'steps': 33698, 'loss/train': 1.4626320600509644}} 11/07/2021 01:59:40 - INFO - __main__ - Step 33707: {'lr': 0.0004454852825447087, 'samples': 6471744, 'steps': 33706, 'loss/train': 1.672659158706665}4}} 11/07/2021 01:59:42 - INFO - __main__ - Step 33711: {'lr': 0.00044547204995524305, 'samples': 6472512, 'steps': 33710, 'loss/train': 1.8348017930984497}} 11/07/2021 01:59:44 - INFO - __main__ - Step 33715: {'lr': 0.00044545881595655035, 'samples': 6473280, 'steps': 33714, 'loss/train': 1.4596387147903442}} 11/07/2021 01:59:46 - INFO - __main__ - Step 33719: {'lr': 0.0004454455805487261, 'samples': 6474048, 'steps': 33718, 'loss/train': 1.7880730628967285}}} 11/07/2021 01:59:48 - INFO - __main__ - Step 33723: {'lr': 0.00044543234373186556, 'samples': 6474816, 'steps': 33722, 'loss/train': 1.5587968826293945}} 11/07/2021 01:59:48 - INFO - __main__ - Step 33723: {'lr': 0.00044543234373186556, 'samples': 6474816, 'steps': 33722, 'loss/train': 1.5587968826293945}} 11/07/2021 01:59:51 - INFO - __main__ - Step 33730: {'lr': 0.00044540917591215335, 'samples': 6476160, 'steps': 33729, 'loss/train': 1.6598936319351196}} 11/07/2021 01:59:54 - INFO - __main__ - Step 33736: {'lr': 0.0004453893143470661, 'samples': 6477312, 'steps': 33735, 'loss/train': 1.5552427768707275}}} 11/07/2021 01:59:54 - INFO - __main__ - Step 33736: {'lr': 0.0004453893143470661, 'samples': 6477312, 'steps': 33735, 'loss/train': 1.5552427768707275}}} 11/07/2021 01:59:58 - INFO - __main__ - Step 33743: {'lr': 0.0004453661385153604, 'samples': 6478656, 'steps': 33742, 'loss/train': 1.8474880456924438}}} 11/07/2021 01:59:59 - INFO - __main__ - Step 33747: {'lr': 0.00044535289324628704, 'samples': 6479424, 'steps': 33746, 'loss/train': 1.3100298643112183}} 11/07/2021 02:00:01 - INFO - __main__ - Step 33751: {'lr': 0.0004453396465688457, 'samples': 6480192, 'steps': 33750, 'loss/train': 1.4245951175689697}}} 11/07/2021 02:00:04 - INFO - __main__ - Step 33756: {'lr': 0.0004453230862416721, 'samples': 6481152, 'steps': 33755, 'loss/train': 1.6372071504592896}}} 11/07/2021 02:00:06 - INFO - __main__ - Step 33760: {'lr': 0.00044530983639575193, 'samples': 6481920, 'steps': 33759, 'loss/train': 1.6232199668884277}} 11/07/2021 02:00:08 - INFO - __main__ - Step 33764: {'lr': 0.0004452965851417743, 'samples': 6482688, 'steps': 33763, 'loss/train': 1.709043025970459}7}} 11/07/2021 02:00:10 - INFO - __main__ - Step 33768: {'lr': 0.00044528333247983456, 'samples': 6483456, 'steps': 33767, 'loss/train': 2.097458600997925}}} 11/07/2021 02:00:12 - INFO - __main__ - Step 33773: {'lr': 0.0004452667646726088, 'samples': 6484416, 'steps': 33772, 'loss/train': 1.6249957084655762}}} 11/07/2021 02:00:14 - INFO - __main__ - Step 33777: {'lr': 0.0004452535088431038, 'samples': 6485184, 'steps': 33776, 'loss/train': 1.4473791122436523}}} 11/07/2021 02:00:14 - INFO - __main__ - Step 33777: {'lr': 0.0004452535088431038, 'samples': 6485184, 'steps': 33776, 'loss/train': 1.4473791122436523}}} 11/07/2021 02:00:17 - INFO - __main__ - Step 33784: {'lr': 0.00044523030775436617, 'samples': 6486528, 'steps': 33783, 'loss/train': 1.2860418558120728}} 11/07/2021 02:00:20 - INFO - __main__ - Step 33789: {'lr': 0.0004452137329090622, 'samples': 6487488, 'steps': 33788, 'loss/train': 1.3245213031768799}}} 11/07/2021 02:00:22 - INFO - __main__ - Step 33794: {'lr': 0.00044519715586475083, 'samples': 6488448, 'steps': 33793, 'loss/train': 1.6966907978057861}} 11/07/2021 02:00:22 - INFO - __main__ - Step 33794: {'lr': 0.00044519715586475083, 'samples': 6488448, 'steps': 33793, 'loss/train': 1.6966907978057861}} 11/07/2021 02:00:26 - INFO - __main__ - Step 33801: {'lr': 0.0004451739443087381, 'samples': 6489792, 'steps': 33800, 'loss/train': 1.3621511459350586}}} 11/07/2021 02:00:28 - INFO - __main__ - Step 33805: {'lr': 0.00044516067862768015, 'samples': 6490560, 'steps': 33804, 'loss/train': 1.5042489767074585}} 11/07/2021 02:00:30 - INFO - __main__ - Step 33809: {'lr': 0.00044514741153964, 'samples': 6491328, 'steps': 33808, 'loss/train': 1.6844371557235718}85}} 11/07/2021 02:00:32 - INFO - __main__ - Step 33815: {'lr': 0.00044512750826969724, 'samples': 6492480, 'steps': 33814, 'loss/train': 1.7158915996551514}} 11/07/2021 02:00:34 - INFO - __main__ - Step 33819: {'lr': 0.0004451142376646199, 'samples': 6493248, 'steps': 33818, 'loss/train': 1.484433889389038}4}} 11/07/2021 02:00:34 - INFO - __main__ - Step 33819: {'lr': 0.0004451142376646199, 'samples': 6493248, 'steps': 33818, 'loss/train': 1.484433889389038}4}} 11/07/2021 02:00:38 - INFO - __main__ - Step 33826: {'lr': 0.0004450910107210467, 'samples': 6494592, 'steps': 33825, 'loss/train': 1.1198339462280273}}} 11/07/2021 02:00:40 - INFO - __main__ - Step 33830: {'lr': 0.0004450777362479192, 'samples': 6495360, 'steps': 33829, 'loss/train': 1.4964271783828735}}} 11/07/2021 02:00:42 - INFO - __main__ - Step 33836: {'lr': 0.0004450578219012873, 'samples': 6496512, 'steps': 33835, 'loss/train': 1.7628679275512695}}} 11/07/2021 02:00:45 - INFO - __main__ - Step 33840: {'lr': 0.0004450445439123785, 'samples': 6497280, 'steps': 33839, 'loss/train': 1.5153329372406006}}} 11/07/2021 02:00:47 - INFO - __main__ - Step 33844: {'lr': 0.00044503126451732474, 'samples': 6498048, 'steps': 33843, 'loss/train': 1.7846750020980835}} 11/07/2021 02:00:47 - INFO - __main__ - Step 33844: {'lr': 0.00044503126451732474, 'samples': 6498048, 'steps': 33843, 'loss/train': 1.7846750020980835}} 11/07/2021 02:00:50 - INFO - __main__ - Step 33851: {'lr': 0.00044500802219273224, 'samples': 6499392, 'steps': 33850, 'loss/train': 1.3523143529891968}} 11/07/2021 02:00:53 - INFO - __main__ - Step 33856: {'lr': 0.00044499141789625086, 'samples': 6500352, 'steps': 33855, 'loss/train': 1.5748056173324585}} 11/07/2021 02:00:53 - INFO - __main__ - Step 33856: {'lr': 0.00044499141789625086, 'samples': 6500352, 'steps': 33855, 'loss/train': 1.5748056173324585}} 11/07/2021 02:00:56 - INFO - __main__ - Step 33862: {'lr': 0.00044497148984110567, 'samples': 6501504, 'steps': 33861, 'loss/train': 1.399930715560913}}} 11/07/2021 02:00:59 - INFO - __main__ - Step 33868: {'lr': 0.0004449515586233193, 'samples': 6502656, 'steps': 33867, 'loss/train': 0.7612836956977844}}} 11/07/2021 02:01:01 - INFO - __main__ - Step 33872: {'lr': 0.0004449382693879318, 'samples': 6503424, 'steps': 33871, 'loss/train': 1.233699083328247}}}} 11/07/2021 02:01:03 - INFO - __main__ - Step 33876: {'lr': 0.0004449249787471655, 'samples': 6504192, 'steps': 33875, 'loss/train': 1.0890296697616577}}} 11/07/2021 02:01:05 - INFO - __main__ - Step 33880: {'lr': 0.00044491168670111615, 'samples': 6504960, 'steps': 33879, 'loss/train': 1.1301995515823364}} 11/07/2021 02:01:06 - INFO - __main__ - Step 33884: {'lr': 0.0004448983932498797, 'samples': 6505728, 'steps': 33883, 'loss/train': 1.7393323183059692}}} 11/07/2021 02:01:09 - INFO - __main__ - Step 33889: {'lr': 0.00044488177445993563, 'samples': 6506688, 'steps': 33888, 'loss/train': 1.2288641929626465}} 11/07/2021 02:01:09 - INFO - __main__ - Step 33889: {'lr': 0.00044488177445993563, 'samples': 6506688, 'steps': 33888, 'loss/train': 1.2288641929626465}} 11/07/2021 02:01:13 - INFO - __main__ - Step 33896: {'lr': 0.0004448585044660055, 'samples': 6508032, 'steps': 33895, 'loss/train': 0.20409615337848663}} 11/07/2021 02:01:14 - INFO - __main__ - Step 33900: {'lr': 0.0004448452053949789, 'samples': 6508800, 'steps': 33899, 'loss/train': 1.3028892278671265}}} 11/07/2021 02:01:14 - INFO - __main__ - Step 33900: {'lr': 0.0004448452053949789, 'samples': 6508800, 'steps': 33899, 'loss/train': 1.3028892278671265}}} 11/07/2021 02:01:18 - INFO - __main__ - Step 33908: {'lr': 0.00044481860303889766, 'samples': 6510336, 'steps': 33907, 'loss/train': 1.1449620723724365}} 11/07/2021 02:01:18 - INFO - __main__ - Step 33908: {'lr': 0.00044481860303889766, 'samples': 6510336, 'steps': 33907, 'loss/train': 1.1449620723724365}} 11/07/2021 02:01:22 - INFO - __main__ - Step 33915: {'lr': 0.0004447953213687319, 'samples': 6511680, 'steps': 33914, 'loss/train': 1.4334322214126587}}} 11/07/2021 02:01:24 - INFO - __main__ - Step 33920: {'lr': 0.0004447786889711449, 'samples': 6512640, 'steps': 33919, 'loss/train': 1.3927388191223145}}} 11/07/2021 02:01:24 - INFO - __main__ - Step 33920: {'lr': 0.0004447786889711449, 'samples': 6512640, 'steps': 33919, 'loss/train': 1.3927388191223145}}} 11/07/2021 02:01:28 - INFO - __main__ - Step 33928: {'lr': 0.00044475207257134143, 'samples': 6514176, 'steps': 33927, 'loss/train': 1.7115229368209839}} 11/07/2021 02:01:30 - INFO - __main__ - Step 33932: {'lr': 0.00044473876226533703, 'samples': 6514944, 'steps': 33931, 'loss/train': 1.2794572114944458}} 11/07/2021 02:01:32 - INFO - __main__ - Step 33937: {'lr': 0.00044472212240855155, 'samples': 6515904, 'steps': 33936, 'loss/train': 1.6173049211502075}} 11/07/2021 02:01:34 - INFO - __main__ - Step 33941: {'lr': 0.000444708808943816, 'samples': 6516672, 'steps': 33940, 'loss/train': 1.5928783416748047}5}} 11/07/2021 02:01:36 - INFO - __main__ - Step 33945: {'lr': 0.00044469549407535593, 'samples': 6517440, 'steps': 33944, 'loss/train': 1.3439016342163086}} 11/07/2021 02:01:38 - INFO - __main__ - Step 33949: {'lr': 0.00044468217780326724, 'samples': 6518208, 'steps': 33948, 'loss/train': 1.5032635927200317}} 11/07/2021 02:01:40 - INFO - __main__ - Step 33953: {'lr': 0.00044466886012764603, 'samples': 6518976, 'steps': 33952, 'loss/train': 1.2247635126113892}} 11/07/2021 02:01:42 - INFO - __main__ - Step 33957: {'lr': 0.00044465554104858817, 'samples': 6519744, 'steps': 33956, 'loss/train': 1.220327377319336}}} 11/07/2021 02:01:44 - INFO - __main__ - Step 33962: {'lr': 0.00044463889022632963, 'samples': 6520704, 'steps': 33961, 'loss/train': 1.528019666671753}}} 11/07/2021 02:01:46 - INFO - __main__ - Step 33966: {'lr': 0.0004446255679898907, 'samples': 6521472, 'steps': 33965, 'loss/train': 1.6411830186843872}}} 11/07/2021 02:01:46 - INFO - __main__ - Step 33966: {'lr': 0.0004446255679898907, 'samples': 6521472, 'steps': 33965, 'loss/train': 1.6411830186843872}}} 11/07/2021 02:01:50 - INFO - __main__ - Step 33973: {'lr': 0.0004446022506999122, 'samples': 6522816, 'steps': 33972, 'loss/train': 1.216179370880127}}}} 11/07/2021 02:01:52 - INFO - __main__ - Step 33977: {'lr': 0.00044458892460511225, 'samples': 6523584, 'steps': 33976, 'loss/train': 1.4237685203552246}} 11/07/2021 02:01:54 - INFO - __main__ - Step 33982: {'lr': 0.0004445722650138512, 'samples': 6524544, 'steps': 33981, 'loss/train': 1.6285544633865356}}} 11/07/2021 02:01:54 - INFO - __main__ - Step 33982: {'lr': 0.0004445722650138512, 'samples': 6524544, 'steps': 33981, 'loss/train': 1.6285544633865356}}} 11/07/2021 02:01:58 - INFO - __main__ - Step 33990: {'lr': 0.0004445456051090062, 'samples': 6526080, 'steps': 33989, 'loss/train': 1.3475840091705322}}} 11/07/2021 02:02:00 - INFO - __main__ - Step 33994: {'lr': 0.0004445322730527137, 'samples': 6526848, 'steps': 33993, 'loss/train': 1.6258047819137573}}} 11/07/2021 02:02:02 - INFO - __main__ - Step 33998: {'lr': 0.0004445189395939694, 'samples': 6527616, 'steps': 33997, 'loss/train': 1.4667303562164307}}} 11/07/2021 02:02:05 - INFO - __main__ - Step 34003: {'lr': 0.0004445022707984874, 'samples': 6528576, 'steps': 34002, 'loss/train': 0.221288800239563}}}} 11/07/2021 02:02:05 - INFO - __main__ - Step 34003: {'lr': 0.0004445022707984874, 'samples': 6528576, 'steps': 34002, 'loss/train': 0.221288800239563}}}} 11/07/2021 02:02:08 - INFO - __main__ - Step 34010: {'lr': 0.0004444789308039865, 'samples': 6529920, 'steps': 34009, 'loss/train': 2.2077717781066895}}} 11/07/2021 02:02:10 - INFO - __main__ - Step 34014: {'lr': 0.0004444655917363961, 'samples': 6530688, 'steps': 34013, 'loss/train': 1.3438079357147217}}} 11/07/2021 02:02:13 - INFO - __main__ - Step 34019: {'lr': 0.0004444489159303976, 'samples': 6531648, 'steps': 34018, 'loss/train': 1.5133620500564575}}} 11/07/2021 02:02:15 - INFO - __main__ - Step 34023: {'lr': 0.00044443557370850743, 'samples': 6532416, 'steps': 34022, 'loss/train': 1.3057926893234253}} 11/07/2021 02:02:17 - INFO - __main__ - Step 34027: {'lr': 0.0004444222300848626, 'samples': 6533184, 'steps': 34026, 'loss/train': 1.056647777557373}3}} 11/07/2021 02:02:19 - INFO - __main__ - Step 34031: {'lr': 0.00044440888505955926, 'samples': 6533952, 'steps': 34030, 'loss/train': 2.1320152282714844}} 11/07/2021 02:02:20 - INFO - __main__ - Step 34035: {'lr': 0.00044439553863269356, 'samples': 6534720, 'steps': 34034, 'loss/train': 1.7785248756408691}} 11/07/2021 02:02:22 - INFO - __main__ - Step 34039: {'lr': 0.00044438219080436184, 'samples': 6535488, 'steps': 34038, 'loss/train': 1.7232627868652344}} 11/07/2021 02:02:25 - INFO - __main__ - Step 34044: {'lr': 0.00044436550404828207, 'samples': 6536448, 'steps': 34043, 'loss/train': 1.2998487949371338}} 11/07/2021 02:02:27 - INFO - __main__ - Step 34048: {'lr': 0.0004443521530670035, 'samples': 6537216, 'steps': 34047, 'loss/train': 1.7646113634109497}}} 11/07/2021 02:02:29 - INFO - __main__ - Step 34052: {'lr': 0.00044433880068457166, 'samples': 6537984, 'steps': 34051, 'loss/train': 1.6336530447006226}} 11/07/2021 02:02:30 - INFO - __main__ - Step 34056: {'lr': 0.0004443254469010828, 'samples': 6538752, 'steps': 34055, 'loss/train': 1.014756441116333}6}} 11/07/2021 02:02:32 - INFO - __main__ - Step 34060: {'lr': 0.00044431209171663313, 'samples': 6539520, 'steps': 34059, 'loss/train': 0.7567717432975769}} 11/07/2021 02:02:35 - INFO - __main__ - Step 34065: {'lr': 0.00044429539576611664, 'samples': 6540480, 'steps': 34064, 'loss/train': 1.0325992107391357}} 11/07/2021 02:02:35 - INFO - __main__ - Step 34065: {'lr': 0.00044429539576611664, 'samples': 6540480, 'steps': 34064, 'loss/train': 1.0325992107391357}} 11/07/2021 02:02:38 - INFO - __main__ - Step 34072: {'lr': 0.00044427201775848246, 'samples': 6541824, 'steps': 34071, 'loss/train': 1.4158926010131836}} 11/07/2021 02:02:40 - INFO - __main__ - Step 34077: {'lr': 0.00044425531655549157, 'samples': 6542784, 'steps': 34076, 'loss/train': 0.8294751048088074}} 11/07/2021 02:02:43 - INFO - __main__ - Step 34081: {'lr': 0.0004442419540175778, 'samples': 6543552, 'steps': 34080, 'loss/train': 1.5299001932144165}}} 11/07/2021 02:02:43 - INFO - __main__ - Step 34081: {'lr': 0.0004442419540175778, 'samples': 6543552, 'steps': 34080, 'loss/train': 1.5299001932144165}}} 11/07/2021 02:02:46 - INFO - __main__ - Step 34088: {'lr': 0.0004442185662066731, 'samples': 6544896, 'steps': 34087, 'loss/train': 1.6649442911148071}}} 11/07/2021 02:02:48 - INFO - __main__ - Step 34092: {'lr': 0.00044420519981800446, 'samples': 6545664, 'steps': 34091, 'loss/train': 1.9090162515640259}} 11/07/2021 02:02:48 - INFO - __main__ - Step 34092: {'lr': 0.00044420519981800446, 'samples': 6545664, 'steps': 34091, 'loss/train': 1.9090162515640259}} 11/07/2021 02:02:52 - INFO - __main__ - Step 34100: {'lr': 0.0004441784628404817, 'samples': 6547200, 'steps': 34099, 'loss/train': 1.303661823272705}9}} 11/07/2021 02:02:55 - INFO - __main__ - Step 34104: {'lr': 0.00044416509225182044, 'samples': 6547968, 'steps': 34103, 'loss/train': 1.9034819602966309}} 11/07/2021 02:02:56 - INFO - __main__ - Step 34108: {'lr': 0.0004441517202633546, 'samples': 6548736, 'steps': 34107, 'loss/train': 1.857909083366394}9}} 11/07/2021 02:02:56 - INFO - __main__ - Step 34108: {'lr': 0.0004441517202633546, 'samples': 6548736, 'steps': 34107, 'loss/train': 1.857909083366394}9}} 11/07/2021 02:02:56 - INFO - __main__ - Step 34108: {'lr': 0.0004441517202633546, 'samples': 6548736, 'steps': 34107, 'loss/train': 1.857909083366394}9}} 11/07/2021 02:03:02 - INFO - __main__ - Step 34118: {'lr': 0.00044411828416867684, 'samples': 6550656, 'steps': 34117, 'loss/train': 1.4765671491622925}} 11/07/2021 02:03:05 - INFO - __main__ - Step 34124: {'lr': 0.0004440982183133721, 'samples': 6551808, 'steps': 34123, 'loss/train': 1.508685827255249}5}} 11/07/2021 02:03:07 - INFO - __main__ - Step 34128: {'lr': 0.00044408483932732886, 'samples': 6552576, 'steps': 34127, 'loss/train': 1.5073697566986084}} 11/07/2021 02:03:07 - INFO - __main__ - Step 34128: {'lr': 0.00044408483932732886, 'samples': 6552576, 'steps': 34127, 'loss/train': 1.5073697566986084}} 11/07/2021 02:03:10 - INFO - __main__ - Step 34135: {'lr': 0.00044406142273492334, 'samples': 6553920, 'steps': 34134, 'loss/train': 1.5329359769821167}} 11/07/2021 02:03:13 - INFO - __main__ - Step 34140: {'lr': 0.00044404469397422823, 'samples': 6554880, 'steps': 34139, 'loss/train': 0.6725926399230957}} 11/07/2021 02:03:15 - INFO - __main__ - Step 34144: {'lr': 0.0004440313093918593, 'samples': 6555648, 'steps': 34143, 'loss/train': 1.4875729084014893}}} 11/07/2021 02:03:15 - INFO - __main__ - Step 34144: {'lr': 0.0004440313093918593, 'samples': 6555648, 'steps': 34143, 'loss/train': 1.4875729084014893}}} 11/07/2021 02:03:18 - INFO - __main__ - Step 34151: {'lr': 0.0004440078830068125, 'samples': 6556992, 'steps': 34150, 'loss/train': 1.9597066640853882}}} 11/07/2021 02:03:21 - INFO - __main__ - Step 34156: {'lr': 0.0004439911472520972, 'samples': 6557952, 'steps': 34155, 'loss/train': 1.247413992881775}}}} 11/07/2021 02:03:21 - INFO - __main__ - Step 34156: {'lr': 0.0004439911472520972, 'samples': 6557952, 'steps': 34155, 'loss/train': 1.247413992881775}}}} 11/07/2021 02:03:25 - INFO - __main__ - Step 34164: {'lr': 0.00044396436549934155, 'samples': 6559488, 'steps': 34163, 'loss/train': 1.2462340593338013}} 11/07/2021 02:03:26 - INFO - __main__ - Step 34168: {'lr': 0.00044395097252537905, 'samples': 6560256, 'steps': 34167, 'loss/train': 1.355995535850525}}} 11/07/2021 02:03:28 - INFO - __main__ - Step 34172: {'lr': 0.0004439375781531555, 'samples': 6561024, 'steps': 34171, 'loss/train': 1.5006424188613892}}} 11/07/2021 02:03:31 - INFO - __main__ - Step 34177: {'lr': 0.0004439208332217186, 'samples': 6561984, 'steps': 34176, 'loss/train': 1.02590012550354}2}}} 11/07/2021 02:03:33 - INFO - __main__ - Step 34182: {'lr': 0.0004439040861058383, 'samples': 6562944, 'steps': 34181, 'loss/train': 1.599212408065796}}}} 11/07/2021 02:03:33 - INFO - __main__ - Step 34182: {'lr': 0.0004439040861058383, 'samples': 6562944, 'steps': 34181, 'loss/train': 1.599212408065796}}}} 11/07/2021 02:03:36 - INFO - __main__ - Step 34189: {'lr': 0.00044388063647410016, 'samples': 6564288, 'steps': 34188, 'loss/train': 1.8877049684524536}} 11/07/2021 02:03:38 - INFO - __main__ - Step 34193: {'lr': 0.0004438672347625907, 'samples': 6565056, 'steps': 34192, 'loss/train': 1.344968557357788}6}} 11/07/2021 02:03:41 - INFO - __main__ - Step 34198: {'lr': 0.0004438504806577594, 'samples': 6566016, 'steps': 34197, 'loss/train': 1.6417779922485352}}} 11/07/2021 02:03:43 - INFO - __main__ - Step 34203: {'lr': 0.00044383372436927727, 'samples': 6566976, 'steps': 34202, 'loss/train': 1.6074336767196655}} 11/07/2021 02:03:43 - INFO - __main__ - Step 34203: {'lr': 0.00044383372436927727, 'samples': 6566976, 'steps': 34202, 'loss/train': 1.6074336767196655}} 11/07/2021 02:03:47 - INFO - __main__ - Step 34210: {'lr': 0.00044381026189722824, 'samples': 6568320, 'steps': 34209, 'loss/train': 1.6742136478424072}} 11/07/2021 02:03:49 - INFO - __main__ - Step 34214: {'lr': 0.00044379685284909575, 'samples': 6569088, 'steps': 34213, 'loss/train': 1.9888619184494019}} 11/07/2021 02:03:51 - INFO - __main__ - Step 34218: {'lr': 0.0004437834424038133, 'samples': 6569856, 'steps': 34217, 'loss/train': 0.8595290780067444}}} 11/07/2021 02:03:53 - INFO - __main__ - Step 34222: {'lr': 0.00044377003056147757, 'samples': 6570624, 'steps': 34221, 'loss/train': 1.8908228874206543}} 11/07/2021 02:03:55 - INFO - __main__ - Step 34226: {'lr': 0.0004437566173221853, 'samples': 6571392, 'steps': 34225, 'loss/train': 1.329896330833435}3}} 11/07/2021 02:03:56 - INFO - __main__ - Step 34230: {'lr': 0.0004437432026860332, 'samples': 6572160, 'steps': 34229, 'loss/train': 1.3589770793914795}}} 11/07/2021 02:03:58 - INFO - __main__ - Step 34234: {'lr': 0.0004437297866531179, 'samples': 6572928, 'steps': 34233, 'loss/train': 1.6355782747268677}}} 11/07/2021 02:04:00 - INFO - __main__ - Step 34239: {'lr': 0.0004437130146479229, 'samples': 6573888, 'steps': 34238, 'loss/train': 1.646666169166565}}}} 11/07/2021 02:04:00 - INFO - __main__ - Step 34239: {'lr': 0.0004437130146479229, 'samples': 6573888, 'steps': 34238, 'loss/train': 1.646666169166565}}}} 11/07/2021 02:04:05 - INFO - __main__ - Step 34247: {'lr': 0.00044368617490091655, 'samples': 6575424, 'steps': 34246, 'loss/train': 1.436553955078125}}} 11/07/2021 02:04:06 - INFO - __main__ - Step 34251: {'lr': 0.00044367275293283705, 'samples': 6576192, 'steps': 34250, 'loss/train': 1.1485662460327148}} 11/07/2021 02:04:08 - INFO - __main__ - Step 34255: {'lr': 0.0004436593295685022, 'samples': 6576960, 'steps': 34254, 'loss/train': 1.3940390348434448}}} 11/07/2021 02:04:11 - INFO - __main__ - Step 34260: {'lr': 0.00044364254839974717, 'samples': 6577920, 'steps': 34259, 'loss/train': 1.560340404510498}}} 11/07/2021 02:04:13 - INFO - __main__ - Step 34265: {'lr': 0.00044362576504968344, 'samples': 6578880, 'steps': 34264, 'loss/train': 0.7693606019020081}} 11/07/2021 02:04:13 - INFO - __main__ - Step 34265: {'lr': 0.00044362576504968344, 'samples': 6578880, 'steps': 34264, 'loss/train': 0.7693606019020081}} 11/07/2021 02:04:17 - INFO - __main__ - Step 34272: {'lr': 0.00044360226469535583, 'samples': 6580224, 'steps': 34271, 'loss/train': 1.5781022310256958}} 11/07/2021 02:04:18 - INFO - __main__ - Step 34276: {'lr': 0.0004435888340022688, 'samples': 6580992, 'steps': 34275, 'loss/train': 1.8282495737075806}}} 11/07/2021 02:04:21 - INFO - __main__ - Step 34281: {'lr': 0.0004435720436732882, 'samples': 6581952, 'steps': 34280, 'loss/train': 1.3213019371032715}}} 11/07/2021 02:04:23 - INFO - __main__ - Step 34285: {'lr': 0.0004435586098401243, 'samples': 6582720, 'steps': 34284, 'loss/train': 1.3020122051239014}}} 11/07/2021 02:04:25 - INFO - __main__ - Step 34289: {'lr': 0.000443545174611528, 'samples': 6583488, 'steps': 34288, 'loss/train': 1.1464428901672363}}}} 11/07/2021 02:04:27 - INFO - __main__ - Step 34293: {'lr': 0.00044353173798759616, 'samples': 6584256, 'steps': 34292, 'loss/train': 1.7356139421463013}} 11/07/2021 02:04:29 - INFO - __main__ - Step 34297: {'lr': 0.00044351829996842575, 'samples': 6585024, 'steps': 34296, 'loss/train': 0.8310204148292542}} 11/07/2021 02:04:31 - INFO - __main__ - Step 34302: {'lr': 0.000443501500482556, 'samples': 6585984, 'steps': 34301, 'loss/train': 0.19430121779441833}}} 11/07/2021 02:04:33 - INFO - __main__ - Step 34306: {'lr': 0.0004434880593244528, 'samples': 6586752, 'steps': 34305, 'loss/train': 1.5231199264526367}}} 11/07/2021 02:04:35 - INFO - __main__ - Step 34310: {'lr': 0.000443474616771426, 'samples': 6587520, 'steps': 34309, 'loss/train': 1.5393397808074951}}}} 11/07/2021 02:04:37 - INFO - __main__ - Step 34314: {'lr': 0.0004434611728235722, 'samples': 6588288, 'steps': 34313, 'loss/train': 1.2123658657073975}}} 11/07/2021 02:04:39 - INFO - __main__ - Step 34318: {'lr': 0.00044344772748098867, 'samples': 6589056, 'steps': 34317, 'loss/train': 1.4679759740829468}} 11/07/2021 02:04:39 - INFO - __main__ - Step 34318: {'lr': 0.00044344772748098867, 'samples': 6589056, 'steps': 34317, 'loss/train': 1.4679759740829468}} 11/07/2021 02:04:39 - INFO - __main__ - Step 34318: {'lr': 0.00044344772748098867, 'samples': 6589056, 'steps': 34317, 'loss/train': 1.4679759740829468}} 11/07/2021 02:04:45 - INFO - __main__ - Step 34329: {'lr': 0.00044341074559809903, 'samples': 6591168, 'steps': 34328, 'loss/train': 1.5705645084381104}} 11/07/2021 02:04:47 - INFO - __main__ - Step 34334: {'lr': 0.00044339393216529394, 'samples': 6592128, 'steps': 34333, 'loss/train': 1.4728405475616455}} 11/07/2021 02:04:49 - INFO - __main__ - Step 34339: {'lr': 0.00044337711655398083, 'samples': 6593088, 'steps': 34338, 'loss/train': 1.6327975988388062}} 11/07/2021 02:04:51 - INFO - __main__ - Step 34343: {'lr': 0.0004433636624965318, 'samples': 6593856, 'steps': 34342, 'loss/train': 2.237250328063965}2}} 11/07/2021 02:04:51 - INFO - __main__ - Step 34343: {'lr': 0.0004433636624965318, 'samples': 6593856, 'steps': 34342, 'loss/train': 2.237250328063965}2}} 11/07/2021 02:04:55 - INFO - __main__ - Step 34350: {'lr': 0.0004433401145416771, 'samples': 6595200, 'steps': 34349, 'loss/train': 1.1282519102096558}}} 11/07/2021 02:04:57 - INFO - __main__ - Step 34355: {'lr': 0.00044332329196041133, 'samples': 6596160, 'steps': 34354, 'loss/train': 1.3930827379226685}} 11/07/2021 02:04:57 - INFO - __main__ - Step 34355: {'lr': 0.00044332329196041133, 'samples': 6596160, 'steps': 34354, 'loss/train': 1.3930827379226685}} 11/07/2021 02:05:01 - INFO - __main__ - Step 34363: {'lr': 0.00044329637130082324, 'samples': 6597696, 'steps': 34362, 'loss/train': 1.4699153900146484}} 11/07/2021 02:05:03 - INFO - __main__ - Step 34367: {'lr': 0.000443282908880668, 'samples': 6598464, 'steps': 34366, 'loss/train': 1.5377881526947021}4}} 11/07/2021 02:05:05 - INFO - __main__ - Step 34371: {'lr': 0.000443269445067068, 'samples': 6599232, 'steps': 34370, 'loss/train': 1.4963665008544922}4}} 11/07/2021 02:05:05 - INFO - __main__ - Step 34371: {'lr': 0.000443269445067068, 'samples': 6599232, 'steps': 34370, 'loss/train': 1.4963665008544922}4}} 11/07/2021 02:05:09 - INFO - __main__ - Step 34379: {'lr': 0.00044324251325992214, 'samples': 6600768, 'steps': 34378, 'loss/train': 1.768172264099121}}} 11/07/2021 02:05:11 - INFO - __main__ - Step 34383: {'lr': 0.0004432290452665704, 'samples': 6601536, 'steps': 34382, 'loss/train': 1.7808390855789185}}} 11/07/2021 02:05:13 - INFO - __main__ - Step 34387: {'lr': 0.00044321557588016214, 'samples': 6602304, 'steps': 34386, 'loss/train': 1.2353405952453613}} 11/07/2021 02:05:15 - INFO - __main__ - Step 34391: {'lr': 0.0004432021051007946, 'samples': 6603072, 'steps': 34390, 'loss/train': 5.095420837402344}3}} 11/07/2021 02:05:17 - INFO - __main__ - Step 34396: {'lr': 0.0004431852646678842, 'samples': 6604032, 'steps': 34395, 'loss/train': 1.4153156280517578}}} 11/07/2021 02:05:17 - INFO - __main__ - Step 34396: {'lr': 0.0004431852646678842, 'samples': 6604032, 'steps': 34395, 'loss/train': 1.4153156280517578}}} 11/07/2021 02:05:21 - INFO - __main__ - Step 34403: {'lr': 0.00044316168440590757, 'samples': 6605376, 'steps': 34402, 'loss/train': 1.8850730657577515}} 11/07/2021 02:05:23 - INFO - __main__ - Step 34407: {'lr': 0.000443148208055674, 'samples': 6606144, 'steps': 34406, 'loss/train': 1.199692726135254}15}} 11/07/2021 02:05:25 - INFO - __main__ - Step 34413: {'lr': 0.0004431279909194661, 'samples': 6607296, 'steps': 34412, 'loss/train': 2.042417526245117}5}} 11/07/2021 02:05:25 - INFO - __main__ - Step 34413: {'lr': 0.0004431279909194661, 'samples': 6607296, 'steps': 34412, 'loss/train': 2.042417526245117}5}} 11/07/2021 02:05:25 - INFO - __main__ - Step 34413: {'lr': 0.0004431279909194661, 'samples': 6607296, 'steps': 34412, 'loss/train': 2.042417526245117}5}} 11/07/2021 02:05:31 - INFO - __main__ - Step 34423: {'lr': 0.0004430942887309755, 'samples': 6609216, 'steps': 34422, 'loss/train': 1.5944414138793945}}} 11/07/2021 02:05:33 - INFO - __main__ - Step 34428: {'lr': 0.00044307743437393623, 'samples': 6610176, 'steps': 34427, 'loss/train': 0.5758892893791199}} 11/07/2021 02:05:35 - INFO - __main__ - Step 34432: {'lr': 0.00044306394932233694, 'samples': 6610944, 'steps': 34431, 'loss/train': 1.181859016418457}}} 11/07/2021 02:05:37 - INFO - __main__ - Step 34436: {'lr': 0.0004430504628788714, 'samples': 6611712, 'steps': 34435, 'loss/train': 1.7379834651947021}}} 11/07/2021 02:05:39 - INFO - __main__ - Step 34440: {'lr': 0.0004430369750436369, 'samples': 6612480, 'steps': 34439, 'loss/train': 1.3736330270767212}}} 11/07/2021 02:05:41 - INFO - __main__ - Step 34444: {'lr': 0.0004430234858167308, 'samples': 6613248, 'steps': 34443, 'loss/train': 1.2908469438552856}}} 11/07/2021 02:05:43 - INFO - __main__ - Step 34449: {'lr': 0.00044300662232620784, 'samples': 6614208, 'steps': 34448, 'loss/train': 1.7782011032104492}} 11/07/2021 02:05:43 - INFO - __main__ - Step 34449: {'lr': 0.00044300662232620784, 'samples': 6614208, 'steps': 34448, 'loss/train': 1.7782011032104492}} 11/07/2021 02:05:48 - INFO - __main__ - Step 34457: {'lr': 0.0004429796362192283, 'samples': 6615744, 'steps': 34456, 'loss/train': 1.631561279296875}2}} 11/07/2021 02:05:49 - INFO - __main__ - Step 34461: {'lr': 0.0004429661410788024, 'samples': 6616512, 'steps': 34460, 'loss/train': 1.6277323961257935}}} 11/07/2021 02:05:51 - INFO - __main__ - Step 34465: {'lr': 0.00044295264454721544, 'samples': 6617280, 'steps': 34464, 'loss/train': 1.2352428436279297}} 11/07/2021 02:05:53 - INFO - __main__ - Step 34469: {'lr': 0.00044293914662456475, 'samples': 6618048, 'steps': 34468, 'loss/train': 1.8215768337249756}} 11/07/2021 02:05:55 - INFO - __main__ - Step 34473: {'lr': 0.0004429256473109476, 'samples': 6618816, 'steps': 34472, 'loss/train': 2.1451096534729004}}} 11/07/2021 02:05:57 - INFO - __main__ - Step 34477: {'lr': 0.0004429121466064614, 'samples': 6619584, 'steps': 34476, 'loss/train': 1.6861153841018677}}} 11/07/2021 02:05:59 - INFO - __main__ - Step 34481: {'lr': 0.0004428986445112033, 'samples': 6620352, 'steps': 34480, 'loss/train': 1.5794472694396973}}} 11/07/2021 02:06:01 - INFO - __main__ - Step 34485: {'lr': 0.0004428851410252709, 'samples': 6621120, 'steps': 34484, 'loss/train': 1.702553629875183}}}} 11/07/2021 02:06:03 - INFO - __main__ - Step 34490: {'lr': 0.0004428682597123677, 'samples': 6622080, 'steps': 34489, 'loss/train': 1.1206878423690796}}} 11/07/2021 02:06:05 - INFO - __main__ - Step 34494: {'lr': 0.0004428547530977736, 'samples': 6622848, 'steps': 34493, 'loss/train': 2.4662539958953857}}} 11/07/2021 02:06:07 - INFO - __main__ - Step 34498: {'lr': 0.0004428412450928216, 'samples': 6623616, 'steps': 34497, 'loss/train': 2.3211231231689453}}} 11/07/2021 02:06:09 - INFO - __main__ - Step 34502: {'lr': 0.0004428277356976089, 'samples': 6624384, 'steps': 34501, 'loss/train': 1.5118417739868164}}} 11/07/2021 02:06:11 - INFO - __main__ - Step 34506: {'lr': 0.0004428142249122331, 'samples': 6625152, 'steps': 34505, 'loss/train': 1.3635393381118774}}} 11/07/2021 02:06:13 - INFO - __main__ - Step 34511: {'lr': 0.00044279733447574456, 'samples': 6626112, 'steps': 34510, 'loss/train': 1.3775317668914795}} 11/07/2021 02:06:15 - INFO - __main__ - Step 34515: {'lr': 0.0004427838205628575, 'samples': 6626880, 'steps': 34514, 'loss/train': 1.34334397315979}95}} 11/07/2021 02:06:15 - INFO - __main__ - Step 34515: {'lr': 0.0004427838205628575, 'samples': 6626880, 'steps': 34514, 'loss/train': 1.34334397315979}95}} 11/07/2021 02:06:19 - INFO - __main__ - Step 34522: {'lr': 0.00044276016787104535, 'samples': 6628224, 'steps': 34521, 'loss/train': 1.5454742908477783}} 11/07/2021 02:06:21 - INFO - __main__ - Step 34527: {'lr': 0.0004427432704855064, 'samples': 6629184, 'steps': 34526, 'loss/train': 2.385096311569214}3}} 11/07/2021 02:06:23 - INFO - __main__ - Step 34532: {'lr': 0.0004427263709287889, 'samples': 6630144, 'steps': 34531, 'loss/train': 1.4572937488555908}}} 11/07/2021 02:06:23 - INFO - __main__ - Step 34532: {'lr': 0.0004427263709287889, 'samples': 6630144, 'steps': 34531, 'loss/train': 1.4572937488555908}}} 11/07/2021 02:06:27 - INFO - __main__ - Step 34539: {'lr': 0.0004427027079021667, 'samples': 6631488, 'steps': 34538, 'loss/train': 1.4280678033828735}}} 11/07/2021 02:06:29 - INFO - __main__ - Step 34543: {'lr': 0.0004426891842623998, 'samples': 6632256, 'steps': 34542, 'loss/train': 1.1412307024002075}}} 11/07/2021 02:06:31 - INFO - __main__ - Step 34548: {'lr': 0.0004426722777591902, 'samples': 6633216, 'steps': 34547, 'loss/train': 1.6598010063171387}}} 11/07/2021 02:06:34 - INFO - __main__ - Step 34553: {'lr': 0.0004426553690856016, 'samples': 6634176, 'steps': 34552, 'loss/train': 0.9244617223739624}}} 11/07/2021 02:06:34 - INFO - __main__ - Step 34553: {'lr': 0.0004426553690856016, 'samples': 6634176, 'steps': 34552, 'loss/train': 0.9244617223739624}}} 11/07/2021 02:06:37 - INFO - __main__ - Step 34560: {'lr': 0.0004426316932967038, 'samples': 6635520, 'steps': 34559, 'loss/train': 1.3592138290405273}}} 11/07/2021 02:06:39 - INFO - __main__ - Step 34564: {'lr': 0.00044261816236491186, 'samples': 6636288, 'steps': 34563, 'loss/train': 1.6028127670288086}} 11/07/2021 02:06:42 - INFO - __main__ - Step 34569: {'lr': 0.000442601246747391, 'samples': 6637248, 'steps': 34568, 'loss/train': 0.9717766046524048}6}} 11/07/2021 02:06:44 - INFO - __main__ - Step 34573: {'lr': 0.0004425877126912685, 'samples': 6638016, 'steps': 34572, 'loss/train': 1.4090310335159302}}} 11/07/2021 02:06:46 - INFO - __main__ - Step 34577: {'lr': 0.0004425741772467131, 'samples': 6638784, 'steps': 34576, 'loss/train': 1.5985610485076904}}} 11/07/2021 02:06:46 - INFO - __main__ - Step 34577: {'lr': 0.0004425741772467131, 'samples': 6638784, 'steps': 34576, 'loss/train': 1.5985610485076904}}} 11/07/2021 02:06:49 - INFO - __main__ - Step 34584: {'lr': 0.0004425504868781183, 'samples': 6640128, 'steps': 34583, 'loss/train': 1.4580005407333374}}} 11/07/2021 02:06:52 - INFO - __main__ - Step 34589: {'lr': 0.000442533562583426, 'samples': 6641088, 'steps': 34588, 'loss/train': 1.6986876726150513}}}} 11/07/2021 02:06:54 - INFO - __main__ - Step 34593: {'lr': 0.0004425200215861153, 'samples': 6641856, 'steps': 34592, 'loss/train': 1.6354613304138184}}} 11/07/2021 02:06:56 - INFO - __main__ - Step 34597: {'lr': 0.0004425064792008597, 'samples': 6642624, 'steps': 34596, 'loss/train': 2.3666765689849854}}} 11/07/2021 02:06:57 - INFO - __main__ - Step 34601: {'lr': 0.000442492935427757, 'samples': 6643392, 'steps': 34600, 'loss/train': 1.5505502223968506}}}} 11/07/2021 02:06:59 - INFO - __main__ - Step 34605: {'lr': 0.00044247939026690475, 'samples': 6644160, 'steps': 34604, 'loss/train': 1.9255297183990479}} 11/07/2021 02:07:02 - INFO - __main__ - Step 34610: {'lr': 0.0004424624568644654, 'samples': 6645120, 'steps': 34609, 'loss/train': 1.5061067342758179}}} 11/07/2021 02:07:02 - INFO - __main__ - Step 34610: {'lr': 0.0004424624568644654, 'samples': 6645120, 'steps': 34609, 'loss/train': 1.5061067342758179}}} 11/07/2021 02:07:06 - INFO - __main__ - Step 34617: {'lr': 0.00044243874645882733, 'samples': 6646464, 'steps': 34616, 'loss/train': 1.3017340898513794}} 11/07/2021 02:07:07 - INFO - __main__ - Step 34621: {'lr': 0.00044242519574795347, 'samples': 6647232, 'steps': 34620, 'loss/train': 1.5387800931930542}} 11/07/2021 02:07:09 - INFO - __main__ - Step 34625: {'lr': 0.0004424116436498185, 'samples': 6648000, 'steps': 34624, 'loss/train': 1.0495797395706177}}} 11/07/2021 02:07:11 - INFO - __main__ - Step 34629: {'lr': 0.00044239809016452, 'samples': 6648768, 'steps': 34628, 'loss/train': 1.4800750017166138}7}}} 11/07/2021 02:07:13 - INFO - __main__ - Step 34633: {'lr': 0.00044238453529215575, 'samples': 6649536, 'steps': 34632, 'loss/train': 1.3253827095031738}} 11/07/2021 02:07:15 - INFO - __main__ - Step 34637: {'lr': 0.0004423709790328235, 'samples': 6650304, 'steps': 34636, 'loss/train': 1.3128234148025513}}} 11/07/2021 02:07:17 - INFO - __main__ - Step 34641: {'lr': 0.00044235742138662085, 'samples': 6651072, 'steps': 34640, 'loss/train': 1.697799563407898}}} 11/07/2021 02:07:19 - INFO - __main__ - Step 34646: {'lr': 0.0004423404723787301, 'samples': 6652032, 'steps': 34645, 'loss/train': 0.7880754470825195}}} 11/07/2021 02:07:22 - INFO - __main__ - Step 34651: {'lr': 0.0004423235212041982, 'samples': 6652992, 'steps': 34650, 'loss/train': 1.5523748397827148}}} 11/07/2021 02:07:22 - INFO - __main__ - Step 34651: {'lr': 0.0004423235212041982, 'samples': 6652992, 'steps': 34650, 'loss/train': 1.5523748397827148}}} 11/07/2021 02:07:25 - INFO - __main__ - Step 34658: {'lr': 0.00044229978592025975, 'samples': 6654336, 'steps': 34657, 'loss/train': 1.4368984699249268}} 11/07/2021 02:07:27 - INFO - __main__ - Step 34662: {'lr': 0.00044228622099459183, 'samples': 6655104, 'steps': 34661, 'loss/train': 1.4233191013336182}} 11/07/2021 02:07:30 - INFO - __main__ - Step 34667: {'lr': 0.0004422692628880913, 'samples': 6656064, 'steps': 34666, 'loss/train': 1.0020115375518799}}} 11/07/2021 02:07:32 - INFO - __main__ - Step 34672: {'lr': 0.00044225230261575165, 'samples': 6657024, 'steps': 34671, 'loss/train': 1.550666332244873}}} 11/07/2021 02:07:34 - INFO - __main__ - Step 34676: {'lr': 0.0004422387328386042, 'samples': 6657792, 'steps': 34675, 'loss/train': 1.2925069332122803}}} 11/07/2021 02:07:34 - INFO - __main__ - Step 34676: {'lr': 0.0004422387328386042, 'samples': 6657792, 'steps': 34675, 'loss/train': 1.2925069332122803}}} 11/07/2021 02:07:37 - INFO - __main__ - Step 34682: {'lr': 0.00044221837557431945, 'samples': 6658944, 'steps': 34681, 'loss/train': 1.5414425134658813}} 11/07/2021 02:07:40 - INFO - __main__ - Step 34688: {'lr': 0.0004421980151920518, 'samples': 6660096, 'steps': 34687, 'loss/train': 1.3367774486541748}}} 11/07/2021 02:07:42 - INFO - __main__ - Step 34692: {'lr': 0.00044218443987182384, 'samples': 6660864, 'steps': 34691, 'loss/train': 1.4428645372390747}} 11/07/2021 02:07:42 - INFO - __main__ - Step 34692: {'lr': 0.00044218443987182384, 'samples': 6660864, 'steps': 34691, 'loss/train': 1.4428645372390747}} 11/07/2021 02:07:45 - INFO - __main__ - Step 34699: {'lr': 0.000442160679727563, 'samples': 6662208, 'steps': 34698, 'loss/train': 0.9124966859817505}7}} 11/07/2021 02:07:48 - INFO - __main__ - Step 34704: {'lr': 0.0004421437055983785, 'samples': 6663168, 'steps': 34703, 'loss/train': 1.1789891719818115}}} 11/07/2021 02:07:50 - INFO - __main__ - Step 34709: {'lr': 0.0004421267293047692, 'samples': 6664128, 'steps': 34708, 'loss/train': 1.248964786529541}}}} 11/07/2021 02:07:50 - INFO - __main__ - Step 34709: {'lr': 0.0004421267293047692, 'samples': 6664128, 'steps': 34708, 'loss/train': 1.248964786529541}}}} 11/07/2021 02:07:53 - INFO - __main__ - Step 34716: {'lr': 0.0004421029588578468, 'samples': 6665472, 'steps': 34715, 'loss/train': 1.2604187726974487}}} 11/07/2021 02:07:56 - INFO - __main__ - Step 34720: {'lr': 0.00044208937384099614, 'samples': 6666240, 'steps': 34719, 'loss/train': 1.3792513608932495}} 11/07/2021 02:07:58 - INFO - __main__ - Step 34725: {'lr': 0.00044207239062251297, 'samples': 6667200, 'steps': 34724, 'loss/train': 1.4305295944213867}} 11/07/2021 02:08:00 - INFO - __main__ - Step 34729: {'lr': 0.0004420588024899098, 'samples': 6667968, 'steps': 34728, 'loss/train': 1.555402159690857}7}} 11/07/2021 02:08:02 - INFO - __main__ - Step 34733: {'lr': 0.000442045212972687, 'samples': 6668736, 'steps': 34732, 'loss/train': 1.4219645261764526}7}} 11/07/2021 02:08:03 - INFO - __main__ - Step 34737: {'lr': 0.0004420316220709424, 'samples': 6669504, 'steps': 34736, 'loss/train': 1.3695883750915527}}} 11/07/2021 02:08:06 - INFO - __main__ - Step 34741: {'lr': 0.000442018029784774, 'samples': 6670272, 'steps': 34740, 'loss/train': 1.6417057514190674}}}} 11/07/2021 02:08:08 - INFO - __main__ - Step 34746: {'lr': 0.000442001037480367, 'samples': 6671232, 'steps': 34745, 'loss/train': 1.3379449844360352}}}} 11/07/2021 02:08:08 - INFO - __main__ - Step 34746: {'lr': 0.000442001037480367, 'samples': 6671232, 'steps': 34745, 'loss/train': 1.3379449844360352}}}} 11/07/2021 02:08:12 - INFO - __main__ - Step 34754: {'lr': 0.0004419738452947346, 'samples': 6672768, 'steps': 34753, 'loss/train': 1.601043462753296}}}} 11/07/2021 02:08:14 - INFO - __main__ - Step 34758: {'lr': 0.00044196024712585854, 'samples': 6673536, 'steps': 34757, 'loss/train': 1.2186495065689087}} 11/07/2021 02:08:16 - INFO - __main__ - Step 34762: {'lr': 0.0004419466475730732, 'samples': 6674304, 'steps': 34761, 'loss/train': 2.0209977626800537}}} 11/07/2021 02:08:18 - INFO - __main__ - Step 34766: {'lr': 0.00044193304663647684, 'samples': 6675072, 'steps': 34765, 'loss/train': 0.5651389360427856}} 11/07/2021 02:08:20 - INFO - __main__ - Step 34771: {'lr': 0.0004419160435198963, 'samples': 6676032, 'steps': 34770, 'loss/train': 1.1811498403549194}}} 11/07/2021 02:08:20 - INFO - __main__ - Step 34771: {'lr': 0.0004419160435198963, 'samples': 6676032, 'steps': 34770, 'loss/train': 1.1811498403549194}}} 11/07/2021 02:08:24 - INFO - __main__ - Step 34779: {'lr': 0.00044188883403677783, 'samples': 6677568, 'steps': 34778, 'loss/train': 1.1535234451293945}} 11/07/2021 02:08:26 - INFO - __main__ - Step 34783: {'lr': 0.000441875227220078, 'samples': 6678336, 'steps': 34782, 'loss/train': 1.3227585554122925}5}} 11/07/2021 02:08:28 - INFO - __main__ - Step 34787: {'lr': 0.00044186161902008193, 'samples': 6679104, 'steps': 34786, 'loss/train': 0.9370352625846863}} 11/07/2021 02:08:28 - INFO - __main__ - Step 34787: {'lr': 0.00044186161902008193, 'samples': 6679104, 'steps': 34786, 'loss/train': 0.9370352625846863}} 11/07/2021 02:08:32 - INFO - __main__ - Step 34795: {'lr': 0.0004418343984705935, 'samples': 6680640, 'steps': 34794, 'loss/train': 1.2904330492019653}}} 11/07/2021 02:08:34 - INFO - __main__ - Step 34799: {'lr': 0.0004418207861212973, 'samples': 6681408, 'steps': 34798, 'loss/train': 1.2074306011199951}}} 11/07/2021 02:08:36 - INFO - __main__ - Step 34803: {'lr': 0.0004418071723890973, 'samples': 6682176, 'steps': 34802, 'loss/train': 1.9970906972885132}}} 11/07/2021 02:08:38 - INFO - __main__ - Step 34808: {'lr': 0.00044179015327928847, 'samples': 6683136, 'steps': 34807, 'loss/train': 1.496474266052246}}} 11/07/2021 02:08:38 - INFO - __main__ - Step 34808: {'lr': 0.00044179015327928847, 'samples': 6683136, 'steps': 34807, 'loss/train': 1.496474266052246}}} 11/07/2021 02:08:42 - INFO - __main__ - Step 34815: {'lr': 0.0004417663228960562, 'samples': 6684480, 'steps': 34814, 'loss/train': 1.3058736324310303}}} 11/07/2021 02:08:43 - INFO - __main__ - Step 34819: {'lr': 0.0004417527036332227, 'samples': 6685248, 'steps': 34818, 'loss/train': 1.158371090888977}}}} 11/07/2021 02:08:45 - INFO - __main__ - Step 34823: {'lr': 0.00044173908298797627, 'samples': 6686016, 'steps': 34822, 'loss/train': 1.4924274682998657}} 11/07/2021 02:08:48 - INFO - __main__ - Step 34828: {'lr': 0.0004417220552375496, 'samples': 6686976, 'steps': 34827, 'loss/train': 1.4486048221588135}}} 11/07/2021 02:08:50 - INFO - __main__ - Step 34832: {'lr': 0.00044170843148223305, 'samples': 6687744, 'steps': 34831, 'loss/train': 1.9806476831436157}} 11/07/2021 02:08:52 - INFO - __main__ - Step 34836: {'lr': 0.00044169480634482274, 'samples': 6688512, 'steps': 34835, 'loss/train': 1.677834391593933}}} 11/07/2021 02:08:54 - INFO - __main__ - Step 34840: {'lr': 0.0004416811798254168, 'samples': 6689280, 'steps': 34839, 'loss/train': 1.6745431423187256}}} 11/07/2021 02:08:56 - INFO - __main__ - Step 34844: {'lr': 0.00044166755192411364, 'samples': 6690048, 'steps': 34843, 'loss/train': 2.0981333255767822}} 11/07/2021 02:08:58 - INFO - __main__ - Step 34849: {'lr': 0.0004416505151043412, 'samples': 6691008, 'steps': 34848, 'loss/train': 1.796020269393921}2}} 11/07/2021 02:09:00 - INFO - __main__ - Step 34853: {'lr': 0.00044163688409412833, 'samples': 6691776, 'steps': 34852, 'loss/train': 1.48081636428833}2}} 11/07/2021 02:09:00 - INFO - __main__ - Step 34853: {'lr': 0.00044163688409412833, 'samples': 6691776, 'steps': 34852, 'loss/train': 1.48081636428833}2}} 11/07/2021 02:09:04 - INFO - __main__ - Step 34860: {'lr': 0.00044161302650189295, 'samples': 6693120, 'steps': 34859, 'loss/train': 1.5468337535858154}} 11/07/2021 02:09:06 - INFO - __main__ - Step 34865: {'lr': 0.000441595982774415, 'samples': 6694080, 'steps': 34864, 'loss/train': 1.3882853984832764}4}} 11/07/2021 02:09:08 - INFO - __main__ - Step 34870: {'lr': 0.00044157893688868223, 'samples': 6695040, 'steps': 34869, 'loss/train': 1.635990023612976}}} 11/07/2021 02:09:08 - INFO - __main__ - Step 34870: {'lr': 0.00044157893688868223, 'samples': 6695040, 'steps': 34869, 'loss/train': 1.635990023612976}}} 11/07/2021 02:09:12 - INFO - __main__ - Step 34877: {'lr': 0.0004415550690231539, 'samples': 6696384, 'steps': 34876, 'loss/train': 1.5939267873764038}}} 11/07/2021 02:09:14 - INFO - __main__ - Step 34881: {'lr': 0.00044154142834395947, 'samples': 6697152, 'steps': 34880, 'loss/train': 1.8461229801177979}} 11/07/2021 02:09:16 - INFO - __main__ - Step 34886: {'lr': 0.00044152437555310174, 'samples': 6698112, 'steps': 34885, 'loss/train': 1.097977638244629}}} 11/07/2021 02:09:19 - INFO - __main__ - Step 34891: {'lr': 0.0004415073206047958, 'samples': 6699072, 'steps': 34890, 'loss/train': 1.6461631059646606}}} 11/07/2021 02:09:19 - INFO - __main__ - Step 34891: {'lr': 0.0004415073206047958, 'samples': 6699072, 'steps': 34890, 'loss/train': 1.6461631059646606}}} 11/07/2021 02:09:22 - INFO - __main__ - Step 34898: {'lr': 0.0004414834400530203, 'samples': 6700416, 'steps': 34897, 'loss/train': 1.4663927555084229}}} 11/07/2021 02:09:24 - INFO - __main__ - Step 34902: {'lr': 0.00044146979212525184, 'samples': 6701184, 'steps': 34901, 'loss/train': 1.720201015472412}}} 11/07/2021 02:09:26 - INFO - __main__ - Step 34906: {'lr': 0.00044145614281711, 'samples': 6701952, 'steps': 34905, 'loss/train': 1.3921641111373901}2}}} 11/07/2021 02:09:28 - INFO - __main__ - Step 34911: {'lr': 0.0004414390792409326, 'samples': 6702912, 'steps': 34910, 'loss/train': 1.1971884965896606}}} 11/07/2021 02:09:30 - INFO - __main__ - Step 34915: {'lr': 0.0004414254268273107, 'samples': 6703680, 'steps': 34914, 'loss/train': 1.2348486185073853}}} 11/07/2021 02:09:32 - INFO - __main__ - Step 34919: {'lr': 0.0004414117730336351, 'samples': 6704448, 'steps': 34918, 'loss/train': 1.4219346046447754}}} 11/07/2021 02:09:34 - INFO - __main__ - Step 34923: {'lr': 0.0004413981178600046, 'samples': 6705216, 'steps': 34922, 'loss/train': 1.5527942180633545}}} 11/07/2021 02:09:36 - INFO - __main__ - Step 34928: {'lr': 0.00044138104695255455, 'samples': 6706176, 'steps': 34927, 'loss/train': 1.990412712097168}}} 11/07/2021 02:09:36 - INFO - __main__ - Step 34928: {'lr': 0.00044138104695255455, 'samples': 6706176, 'steps': 34927, 'loss/train': 1.990412712097168}}} 11/07/2021 02:09:41 - INFO - __main__ - Step 34936: {'lr': 0.00044135372901658046, 'samples': 6707712, 'steps': 34935, 'loss/train': 1.6198914051055908}} 11/07/2021 02:09:43 - INFO - __main__ - Step 34940: {'lr': 0.0004413400679792393, 'samples': 6708480, 'steps': 34939, 'loss/train': 1.1660223007202148}}} 11/07/2021 02:09:44 - INFO - __main__ - Step 34944: {'lr': 0.00044132640556246, 'samples': 6709248, 'steps': 34943, 'loss/train': 1.5002176761627197}8}}} 11/07/2021 02:09:46 - INFO - __main__ - Step 34948: {'lr': 0.00044131274176634113, 'samples': 6710016, 'steps': 34947, 'loss/train': 1.4638006687164307}} 11/07/2021 02:09:49 - INFO - __main__ - Step 34953: {'lr': 0.0004412956600816462, 'samples': 6710976, 'steps': 34952, 'loss/train': 1.6466397047042847}}} 11/07/2021 02:09:51 - INFO - __main__ - Step 34957: {'lr': 0.0004412819931823734, 'samples': 6711744, 'steps': 34956, 'loss/train': 1.5563851594924927}}} 11/07/2021 02:09:51 - INFO - __main__ - Step 34957: {'lr': 0.0004412819931823734, 'samples': 6711744, 'steps': 34956, 'loss/train': 1.5563851594924927}}} 11/07/2021 02:09:54 - INFO - __main__ - Step 34964: {'lr': 0.0004412580727904396, 'samples': 6713088, 'steps': 34963, 'loss/train': 1.7027897834777832}}} 11/07/2021 02:09:56 - INFO - __main__ - Step 34968: {'lr': 0.0004412444020991004, 'samples': 6713856, 'steps': 34967, 'loss/train': 1.6180024147033691}}} 11/07/2021 02:09:58 - INFO - __main__ - Step 34972: {'lr': 0.00044123073002901286, 'samples': 6714624, 'steps': 34971, 'loss/train': 0.8073616027832031}} 11/07/2021 02:10:00 - INFO - __main__ - Step 34976: {'lr': 0.00044121705658027545, 'samples': 6715392, 'steps': 34975, 'loss/train': 1.497333288192749}}} 11/07/2021 02:10:02 - INFO - __main__ - Step 34980: {'lr': 0.0004412033817529867, 'samples': 6716160, 'steps': 34979, 'loss/train': 1.5079094171524048}}} 11/07/2021 02:10:04 - INFO - __main__ - Step 34985: {'lr': 0.0004411862862804382, 'samples': 6717120, 'steps': 34984, 'loss/train': 1.4085384607315063}}} 11/07/2021 02:10:04 - INFO - __main__ - Step 34985: {'lr': 0.0004411862862804382, 'samples': 6717120, 'steps': 34984, 'loss/train': 1.4085384607315063}}} 11/07/2021 02:10:08 - INFO - __main__ - Step 34993: {'lr': 0.0004411589290448701, 'samples': 6718656, 'steps': 34992, 'loss/train': 1.3170068264007568}}} 11/07/2021 02:10:10 - INFO - __main__ - Step 34997: {'lr': 0.00044114524835983844, 'samples': 6719424, 'steps': 34996, 'loss/train': 1.6504180431365967}} 11/07/2021 02:10:12 - INFO - __main__ - Step 35001: {'lr': 0.00044113156629677313, 'samples': 6720192, 'steps': 35000, 'loss/train': 1.2026489973068237}} 11/07/2021 02:10:14 - INFO - __main__ - Step 35006: {'lr': 0.00044111446178023205, 'samples': 6721152, 'steps': 35005, 'loss/train': 1.3469408750534058}} 11/07/2021 02:10:16 - INFO - __main__ - Step 35010: {'lr': 0.00044110077661695194, 'samples': 6721920, 'steps': 35009, 'loss/train': 1.402260661125183}}} 11/07/2021 02:10:18 - INFO - __main__ - Step 35014: {'lr': 0.0004410870900759587, 'samples': 6722688, 'steps': 35013, 'loss/train': 1.4623432159423828}}} 11/07/2021 02:10:20 - INFO - __main__ - Step 35018: {'lr': 0.00044107340215735125, 'samples': 6723456, 'steps': 35017, 'loss/train': 1.3238352537155151}} 11/07/2021 02:10:22 - INFO - __main__ - Step 35022: {'lr': 0.00044105971286122816, 'samples': 6724224, 'steps': 35021, 'loss/train': 1.7440218925476074}} 11/07/2021 02:10:24 - INFO - __main__ - Step 35027: {'lr': 0.0004410425993040933, 'samples': 6725184, 'steps': 35026, 'loss/train': 1.0795031785964966}}} 11/07/2021 02:10:27 - INFO - __main__ - Step 35032: {'lr': 0.0004410254835949372, 'samples': 6726144, 'steps': 35031, 'loss/train': 2.850522041320801}}}} 11/07/2021 02:10:27 - INFO - __main__ - Step 35032: {'lr': 0.0004410254835949372, 'samples': 6726144, 'steps': 35031, 'loss/train': 2.850522041320801}}}} 11/07/2021 02:10:30 - INFO - __main__ - Step 35039: {'lr': 0.0004410015179870903, 'samples': 6727488, 'steps': 35038, 'loss/train': 1.1722732782363892}}} 11/07/2021 02:10:32 - INFO - __main__ - Step 35043: {'lr': 0.00044098782146062955, 'samples': 6728256, 'steps': 35042, 'loss/train': 1.5331951379776}2}}} 11/07/2021 02:10:34 - INFO - __main__ - Step 35047: {'lr': 0.0004409741235572701, 'samples': 6729024, 'steps': 35046, 'loss/train': 1.4944453239440918}}} 11/07/2021 02:10:36 - INFO - __main__ - Step 35052: {'lr': 0.0004409569992419576, 'samples': 6729984, 'steps': 35051, 'loss/train': 1.4920984506607056}}} 11/07/2021 02:10:39 - INFO - __main__ - Step 35057: {'lr': 0.0004409398727755882, 'samples': 6730944, 'steps': 35056, 'loss/train': 1.7569228410720825}}} 11/07/2021 02:10:41 - INFO - __main__ - Step 35061: {'lr': 0.00044092617005386125, 'samples': 6731712, 'steps': 35060, 'loss/train': 1.2886829376220703}} 11/07/2021 02:10:41 - INFO - __main__ - Step 35061: {'lr': 0.00044092617005386125, 'samples': 6731712, 'steps': 35060, 'loss/train': 1.2886829376220703}} 11/07/2021 02:10:44 - INFO - __main__ - Step 35068: {'lr': 0.00044090218697880577, 'samples': 6733056, 'steps': 35067, 'loss/train': 1.7040773630142212}} 11/07/2021 02:10:47 - INFO - __main__ - Step 35073: {'lr': 0.0004408850536303507, 'samples': 6734016, 'steps': 35072, 'loss/train': 1.6482864618301392}}} 11/07/2021 02:10:47 - INFO - __main__ - Step 35073: {'lr': 0.0004408850536303507, 'samples': 6734016, 'steps': 35072, 'loss/train': 1.6482864618301392}}} 11/07/2021 02:10:50 - INFO - __main__ - Step 35080: {'lr': 0.0004408610633301428, 'samples': 6735360, 'steps': 35079, 'loss/train': 1.6156529188156128}}} 11/07/2021 02:10:52 - INFO - __main__ - Step 35084: {'lr': 0.00044084735269515375, 'samples': 6736128, 'steps': 35083, 'loss/train': 1.6152665615081787}} 11/07/2021 02:10:55 - INFO - __main__ - Step 35089: {'lr': 0.0004408302124665894, 'samples': 6737088, 'steps': 35088, 'loss/train': 1.307666540145874}7}} 11/07/2021 02:10:57 - INFO - __main__ - Step 35094: {'lr': 0.0004408130700883964, 'samples': 6738048, 'steps': 35093, 'loss/train': 1.5962964296340942}}} 11/07/2021 02:10:57 - INFO - __main__ - Step 35094: {'lr': 0.0004408130700883964, 'samples': 6738048, 'steps': 35093, 'loss/train': 1.5962964296340942}}} 11/07/2021 02:11:00 - INFO - __main__ - Step 35101: {'lr': 0.00044078906714791757, 'samples': 6739392, 'steps': 35100, 'loss/train': 1.0961785316467285}} 11/07/2021 02:11:02 - INFO - __main__ - Step 35105: {'lr': 0.00044077534929063024, 'samples': 6740160, 'steps': 35104, 'loss/train': 0.9975321292877197}} 11/07/2021 02:11:05 - INFO - __main__ - Step 35110: {'lr': 0.00044075820003492295, 'samples': 6741120, 'steps': 35109, 'loss/train': 1.5062674283981323}} 11/07/2021 02:11:08 - INFO - __main__ - Step 35115: {'lr': 0.0004407410486303983, 'samples': 6742080, 'steps': 35114, 'loss/train': 1.7748589515686035}}} 11/07/2021 02:11:11 - INFO - __main__ - Step 35121: {'lr': 0.00044072046410880143, 'samples': 6743232, 'steps': 35120, 'loss/train': 1.271795392036438}}} 11/07/2021 02:11:11 - INFO - __main__ - Step 35121: {'lr': 0.00044072046410880143, 'samples': 6743232, 'steps': 35120, 'loss/train': 1.271795392036438}}} 11/07/2021 02:11:14 - INFO - __main__ - Step 35128: {'lr': 0.0004406964449235544, 'samples': 6744576, 'steps': 35127, 'loss/train': 1.5889697074890137}}} 11/07/2021 02:11:14 - INFO - __main__ - Step 35128: {'lr': 0.0004406964449235544, 'samples': 6744576, 'steps': 35127, 'loss/train': 1.5889697074890137}}} 11/07/2021 02:11:17 - INFO - __main__ - Step 35133: {'lr': 0.00044067928578488645, 'samples': 6745536, 'steps': 35132, 'loss/train': 1.795264720916748}}} 11/07/2021 02:11:20 - INFO - __main__ - Step 35139: {'lr': 0.00044065869198323614, 'samples': 6746688, 'steps': 35138, 'loss/train': 1.7210921049118042}} 11/07/2021 02:11:22 - INFO - __main__ - Step 35144: {'lr': 0.0004406415281193805, 'samples': 6747648, 'steps': 35143, 'loss/train': 2.018486738204956}2}} 11/07/2021 02:11:25 - INFO - __main__ - Step 35149: {'lr': 0.0004406243621080216, 'samples': 6748608, 'steps': 35148, 'loss/train': 1.8648744821548462}}} 11/07/2021 02:11:27 - INFO - __main__ - Step 35153: {'lr': 0.000440610627752862, 'samples': 6749376, 'steps': 35152, 'loss/train': 1.7012907266616821}}}} 11/07/2021 02:11:27 - INFO - __main__ - Step 35153: {'lr': 0.000440610627752862, 'samples': 6749376, 'steps': 35152, 'loss/train': 1.7012907266616821}}}} 11/07/2021 02:11:30 - INFO - __main__ - Step 35160: {'lr': 0.00044058658932477336, 'samples': 6750720, 'steps': 35159, 'loss/train': 2.0084025859832764}} 11/07/2021 02:11:32 - INFO - __main__ - Step 35164: {'lr': 0.00044057285119085887, 'samples': 6751488, 'steps': 35163, 'loss/train': 1.3808073997497559}} 11/07/2021 02:11:35 - INFO - __main__ - Step 35169: {'lr': 0.00044055567659142083, 'samples': 6752448, 'steps': 35168, 'loss/train': 0.897270679473877}}} 11/07/2021 02:11:35 - INFO - __main__ - Step 35169: {'lr': 0.00044055567659142083, 'samples': 6752448, 'steps': 35168, 'loss/train': 0.897270679473877}}} 11/07/2021 02:11:39 - INFO - __main__ - Step 35177: {'lr': 0.0004405281927676051, 'samples': 6753984, 'steps': 35176, 'loss/train': 1.600562572479248}}}} 11/07/2021 02:11:40 - INFO - __main__ - Step 35181: {'lr': 0.00044051444879527013, 'samples': 6754752, 'steps': 35180, 'loss/train': 1.525868535041809}}} 11/07/2021 02:11:42 - INFO - __main__ - Step 35185: {'lr': 0.0004405007034494494, 'samples': 6755520, 'steps': 35184, 'loss/train': 1.5869981050491333}}} 11/07/2021 02:11:45 - INFO - __main__ - Step 35190: {'lr': 0.00044048351983585966, 'samples': 6756480, 'steps': 35189, 'loss/train': 1.7315864562988281}} 11/07/2021 02:11:47 - INFO - __main__ - Step 35194: {'lr': 0.00044046977140005774, 'samples': 6757248, 'steps': 35193, 'loss/train': 1.190617561340332}}} 11/07/2021 02:11:49 - INFO - __main__ - Step 35198: {'lr': 0.00044045602159109207, 'samples': 6758016, 'steps': 35197, 'loss/train': 1.4819884300231934}} 11/07/2021 02:11:49 - INFO - __main__ - Step 35198: {'lr': 0.00044045602159109207, 'samples': 6758016, 'steps': 35197, 'loss/train': 1.4819884300231934}} 11/07/2021 02:11:52 - INFO - __main__ - Step 35205: {'lr': 0.00044043195612152475, 'samples': 6759360, 'steps': 35204, 'loss/train': 1.4423195123672485}} 11/07/2021 02:11:55 - INFO - __main__ - Step 35211: {'lr': 0.00044041132522973885, 'samples': 6760512, 'steps': 35210, 'loss/train': 1.3795652389526367}} 11/07/2021 02:11:57 - INFO - __main__ - Step 35215: {'lr': 0.00044039756958593287, 'samples': 6761280, 'steps': 35214, 'loss/train': 1.8432458639144897}} 11/07/2021 02:11:59 - INFO - __main__ - Step 35219: {'lr': 0.00044038381256948357, 'samples': 6762048, 'steps': 35218, 'loss/train': 1.3994035720825195}} 11/07/2021 02:11:59 - INFO - __main__ - Step 35219: {'lr': 0.00044038381256948357, 'samples': 6762048, 'steps': 35218, 'loss/train': 1.3994035720825195}} 11/07/2021 02:12:02 - INFO - __main__ - Step 35226: {'lr': 0.00044035973448807266, 'samples': 6763392, 'steps': 35225, 'loss/train': 1.6354080438613892}} 11/07/2021 02:12:05 - INFO - __main__ - Step 35231: {'lr': 0.00044034253328526765, 'samples': 6764352, 'steps': 35230, 'loss/train': 1.5945069789886475}} 11/07/2021 02:12:07 - INFO - __main__ - Step 35236: {'lr': 0.0004403253299383274, 'samples': 6765312, 'steps': 35235, 'loss/train': 1.3000872135162354}}} 11/07/2021 02:12:07 - INFO - __main__ - Step 35236: {'lr': 0.0004403253299383274, 'samples': 6765312, 'steps': 35235, 'loss/train': 1.3000872135162354}}} 11/07/2021 02:12:10 - INFO - __main__ - Step 35243: {'lr': 0.0004403012416508329, 'samples': 6766656, 'steps': 35242, 'loss/train': 1.65208899974823}4}}} 11/07/2021 02:12:12 - INFO - __main__ - Step 35247: {'lr': 0.00044028747502865794, 'samples': 6767424, 'steps': 35246, 'loss/train': 1.4407219886779785}} 11/07/2021 02:12:12 - INFO - __main__ - Step 35247: {'lr': 0.00044028747502865794, 'samples': 6767424, 'steps': 35246, 'loss/train': 1.4407219886779785}} 11/07/2021 02:12:17 - INFO - __main__ - Step 35255: {'lr': 0.00044025993766885866, 'samples': 6768960, 'steps': 35254, 'loss/train': 1.382339358329773}}} 11/07/2021 02:12:18 - INFO - __main__ - Step 35259: {'lr': 0.0004402461669314327, 'samples': 6769728, 'steps': 35258, 'loss/train': 1.6906169652938843}}} 11/07/2021 02:12:18 - INFO - __main__ - Step 35259: {'lr': 0.0004402461669314327, 'samples': 6769728, 'steps': 35258, 'loss/train': 1.6906169652938843}}} 11/07/2021 02:12:22 - INFO - __main__ - Step 35265: {'lr': 0.00044022550825366526, 'samples': 6770880, 'steps': 35264, 'loss/train': 1.8454724550247192}} 11/07/2021 02:12:25 - INFO - __main__ - Step 35270: {'lr': 0.00044020829033174615, 'samples': 6771840, 'steps': 35269, 'loss/train': 1.5778212547302246}} 11/07/2021 02:12:27 - INFO - __main__ - Step 35274: {'lr': 0.00044019451445151305, 'samples': 6772608, 'steps': 35273, 'loss/train': 1.4892654418945312}} 11/07/2021 02:12:29 - INFO - __main__ - Step 35278: {'lr': 0.0004401807372001004, 'samples': 6773376, 'steps': 35277, 'loss/train': 1.63253915309906}12}} 11/07/2021 02:12:30 - INFO - __main__ - Step 35282: {'lr': 0.0004401669585776078, 'samples': 6774144, 'steps': 35281, 'loss/train': 1.5396783351898193}}} 11/07/2021 02:12:32 - INFO - __main__ - Step 35286: {'lr': 0.0004401531785841344, 'samples': 6774912, 'steps': 35285, 'loss/train': 1.7355401515960693}}} 11/07/2021 02:12:35 - INFO - __main__ - Step 35291: {'lr': 0.0004401359516645023, 'samples': 6775872, 'steps': 35290, 'loss/train': 1.8301565647125244}}} 11/07/2021 02:12:35 - INFO - __main__ - Step 35291: {'lr': 0.0004401359516645023, 'samples': 6775872, 'steps': 35290, 'loss/train': 1.8301565647125244}}} 11/07/2021 02:12:39 - INFO - __main__ - Step 35299: {'lr': 0.00044010838413821075, 'samples': 6777408, 'steps': 35298, 'loss/train': 1.658327341079712}}} 11/07/2021 02:12:40 - INFO - __main__ - Step 35303: {'lr': 0.00044009459831917755, 'samples': 6778176, 'steps': 35302, 'loss/train': 1.5483982563018799}} 11/07/2021 02:12:43 - INFO - __main__ - Step 35307: {'lr': 0.00044008081112968537, 'samples': 6778944, 'steps': 35306, 'loss/train': 1.7320812940597534}} 11/07/2021 02:12:45 - INFO - __main__ - Step 35312: {'lr': 0.00044006357521576334, 'samples': 6779904, 'steps': 35311, 'loss/train': 0.9323607087135315}} 11/07/2021 02:12:45 - INFO - __main__ - Step 35312: {'lr': 0.00044006357521576334, 'samples': 6779904, 'steps': 35311, 'loss/train': 0.9323607087135315}} 11/07/2021 02:12:49 - INFO - __main__ - Step 35320: {'lr': 0.00044003599330030385, 'samples': 6781440, 'steps': 35319, 'loss/train': 1.5260698795318604}} 11/07/2021 02:12:51 - INFO - __main__ - Step 35324: {'lr': 0.0004400222002874695, 'samples': 6782208, 'steps': 35323, 'loss/train': 1.541028380393982}4}} 11/07/2021 02:12:53 - INFO - __main__ - Step 35328: {'lr': 0.000440008405904698, 'samples': 6782976, 'steps': 35327, 'loss/train': 1.5916683673858643}4}} 11/07/2021 02:12:55 - INFO - __main__ - Step 35332: {'lr': 0.0004399946101520889, 'samples': 6783744, 'steps': 35331, 'loss/train': 1.4010270833969116}}} 11/07/2021 02:12:57 - INFO - __main__ - Step 35336: {'lr': 0.0004399808130297415, 'samples': 6784512, 'steps': 35335, 'loss/train': 1.3630006313323975}}} 11/07/2021 02:12:59 - INFO - __main__ - Step 35340: {'lr': 0.00043996701453775526, 'samples': 6785280, 'steps': 35339, 'loss/train': 1.4155287742614746}} 11/07/2021 02:13:01 - INFO - __main__ - Step 35344: {'lr': 0.00043995321467622984, 'samples': 6786048, 'steps': 35343, 'loss/train': 1.4926680326461792}} 11/07/2021 02:13:03 - INFO - __main__ - Step 35348: {'lr': 0.00043993941344526455, 'samples': 6786816, 'steps': 35347, 'loss/train': 1.232430100440979}}} 11/07/2021 02:13:03 - INFO - __main__ - Step 35348: {'lr': 0.00043993941344526455, 'samples': 6786816, 'steps': 35347, 'loss/train': 1.232430100440979}}} 11/07/2021 02:13:07 - INFO - __main__ - Step 35356: {'lr': 0.0004399118068754127, 'samples': 6788352, 'steps': 35355, 'loss/train': 1.4081023931503296}}} 11/07/2021 02:13:09 - INFO - __main__ - Step 35360: {'lr': 0.0004398980015367251, 'samples': 6789120, 'steps': 35359, 'loss/train': 5.794185638427734}}}} 11/07/2021 02:13:11 - INFO - __main__ - Step 35364: {'lr': 0.0004398841948289958, 'samples': 6789888, 'steps': 35363, 'loss/train': 0.6771053075790405}}} 11/07/2021 02:13:13 - INFO - __main__ - Step 35369: {'lr': 0.00043986693451927074, 'samples': 6790848, 'steps': 35368, 'loss/train': 0.9575570225715637}} 11/07/2021 02:13:13 - INFO - __main__ - Step 35369: {'lr': 0.00043986693451927074, 'samples': 6790848, 'steps': 35368, 'loss/train': 0.9575570225715637}} 11/07/2021 02:13:17 - INFO - __main__ - Step 35377: {'lr': 0.0004398393135751338, 'samples': 6792384, 'steps': 35376, 'loss/train': 0.8111445903778076}}} 11/07/2021 02:13:19 - INFO - __main__ - Step 35381: {'lr': 0.0004398255010500877, 'samples': 6793152, 'steps': 35380, 'loss/train': 1.5714269876480103}}} 11/07/2021 02:13:21 - INFO - __main__ - Step 35385: {'lr': 0.0004398116871565224, 'samples': 6793920, 'steps': 35384, 'loss/train': 1.3924572467803955}}} 11/07/2021 02:13:21 - INFO - __main__ - Step 35385: {'lr': 0.0004398116871565224, 'samples': 6793920, 'steps': 35384, 'loss/train': 1.3924572467803955}}} 11/07/2021 02:13:21 - INFO - __main__ - Step 35385: {'lr': 0.0004398116871565224, 'samples': 6793920, 'steps': 35384, 'loss/train': 1.3924572467803955}}} 11/07/2021 02:13:27 - INFO - __main__ - Step 35396: {'lr': 0.0004397736918936046, 'samples': 6796032, 'steps': 35395, 'loss/train': 1.4847302436828613}}} 11/07/2021 02:13:29 - INFO - __main__ - Step 35401: {'lr': 0.0004397564178990626, 'samples': 6796992, 'steps': 35400, 'loss/train': 1.856229305267334}}}} 11/07/2021 02:13:32 - INFO - __main__ - Step 35406: {'lr': 0.0004397391417669878, 'samples': 6797952, 'steps': 35405, 'loss/train': 1.601456880569458}}}} 11/07/2021 02:13:34 - INFO - __main__ - Step 35410: {'lr': 0.00043972531932243516, 'samples': 6798720, 'steps': 35409, 'loss/train': 1.706610918045044}}} 11/07/2021 02:13:34 - INFO - __main__ - Step 35410: {'lr': 0.00043972531932243516, 'samples': 6798720, 'steps': 35409, 'loss/train': 1.706610918045044}}} 11/07/2021 02:13:37 - INFO - __main__ - Step 35417: {'lr': 0.0004397011267532668, 'samples': 6800064, 'steps': 35416, 'loss/train': 1.8843176364898682}}} 11/07/2021 02:13:39 - INFO - __main__ - Step 35422: {'lr': 0.00043968384378239477, 'samples': 6801024, 'steps': 35421, 'loss/train': 1.3193522691726685}} 11/07/2021 02:13:42 - INFO - __main__ - Step 35427: {'lr': 0.0004396665586748075, 'samples': 6801984, 'steps': 35426, 'loss/train': 1.7397141456604004}}} 11/07/2021 02:13:44 - INFO - __main__ - Step 35431: {'lr': 0.0004396527290504334, 'samples': 6802752, 'steps': 35430, 'loss/train': 3.315039873123169}}}} 11/07/2021 02:13:46 - INFO - __main__ - Step 35435: {'lr': 0.0004396388980587859, 'samples': 6803520, 'steps': 35434, 'loss/train': 1.7204598188400269}}} 11/07/2021 02:13:47 - INFO - __main__ - Step 35439: {'lr': 0.0004396250656999646, 'samples': 6804288, 'steps': 35438, 'loss/train': 1.060101866722107}}}} 11/07/2021 02:13:49 - INFO - __main__ - Step 35443: {'lr': 0.0004396112319740692, 'samples': 6805056, 'steps': 35442, 'loss/train': 1.7257599830627441}}} 11/07/2021 02:13:49 - INFO - __main__ - Step 35443: {'lr': 0.0004396112319740692, 'samples': 6805056, 'steps': 35442, 'loss/train': 1.7257599830627441}}} 11/07/2021 02:13:52 - INFO - __main__ - Step 35450: {'lr': 0.00043958701966453033, 'samples': 6806400, 'steps': 35449, 'loss/train': 1.6128551959991455}} 11/07/2021 02:13:54 - INFO - __main__ - Step 35454: {'lr': 0.0004395731821796956, 'samples': 6807168, 'steps': 35453, 'loss/train': 1.5661472082138062}}} 11/07/2021 02:13:57 - INFO - __main__ - Step 35459: {'lr': 0.00043955588340174195, 'samples': 6808128, 'steps': 35458, 'loss/train': 2.366567373275757}}} 11/07/2021 02:14:00 - INFO - __main__ - Step 35465: {'lr': 0.0004395351220496532, 'samples': 6809280, 'steps': 35464, 'loss/train': 1.655928611755371}}}} 11/07/2021 02:14:00 - INFO - __main__ - Step 35465: {'lr': 0.0004395351220496532, 'samples': 6809280, 'steps': 35464, 'loss/train': 1.655928611755371}}}} 11/07/2021 02:14:03 - INFO - __main__ - Step 35471: {'lr': 0.00043951435762310686, 'samples': 6810432, 'steps': 35470, 'loss/train': 1.4175527095794678}} 11/07/2021 02:14:05 - INFO - __main__ - Step 35475: {'lr': 0.00043950051296421023, 'samples': 6811200, 'steps': 35474, 'loss/train': 1.6213352680206299}} 11/07/2021 02:14:07 - INFO - __main__ - Step 35480: {'lr': 0.00043948320521941596, 'samples': 6812160, 'steps': 35479, 'loss/train': 1.7159295082092285}} 11/07/2021 02:14:07 - INFO - __main__ - Step 35480: {'lr': 0.00043948320521941596, 'samples': 6812160, 'steps': 35479, 'loss/train': 1.7159295082092285}} 11/07/2021 02:14:12 - INFO - __main__ - Step 35488: {'lr': 0.00043945550838815953, 'samples': 6813696, 'steps': 35487, 'loss/train': 1.4050376415252686}} 11/07/2021 02:14:13 - INFO - __main__ - Step 35492: {'lr': 0.00043944165792370385, 'samples': 6814464, 'steps': 35491, 'loss/train': 1.8067296743392944}} 11/07/2021 02:14:15 - INFO - __main__ - Step 35496: {'lr': 0.00043942780609349636, 'samples': 6815232, 'steps': 35495, 'loss/train': 1.5365619659423828}} 11/07/2021 02:14:18 - INFO - __main__ - Step 35501: {'lr': 0.0004394104893853007, 'samples': 6816192, 'steps': 35500, 'loss/train': 0.7865248918533325}}} 11/07/2021 02:14:20 - INFO - __main__ - Step 35505: {'lr': 0.0004393966344825168, 'samples': 6816960, 'steps': 35504, 'loss/train': 2.8831872940063477}}} 11/07/2021 02:14:20 - INFO - __main__ - Step 35505: {'lr': 0.0004393966344825168, 'samples': 6816960, 'steps': 35504, 'loss/train': 2.8831872940063477}}} 11/07/2021 02:14:23 - INFO - __main__ - Step 35512: {'lr': 0.0004393723851171459, 'samples': 6818304, 'steps': 35511, 'loss/train': 2.5056517124176025}}} 11/07/2021 02:14:25 - INFO - __main__ - Step 35517: {'lr': 0.00043935506158200143, 'samples': 6819264, 'steps': 35516, 'loss/train': 1.2624260187149048}} 11/07/2021 02:14:28 - INFO - __main__ - Step 35522: {'lr': 0.0004393377359138454, 'samples': 6820224, 'steps': 35521, 'loss/train': 1.1675945520401}048}} 11/07/2021 02:14:28 - INFO - __main__ - Step 35522: {'lr': 0.0004393377359138454, 'samples': 6820224, 'steps': 35521, 'loss/train': 1.1675945520401}048}} 11/07/2021 02:14:31 - INFO - __main__ - Step 35529: {'lr': 0.0004393134763953387, 'samples': 6821568, 'steps': 35528, 'loss/train': 1.5105087757110596}}} 11/07/2021 02:14:33 - INFO - __main__ - Step 35533: {'lr': 0.00043929961193666246, 'samples': 6822336, 'steps': 35532, 'loss/train': 1.3283123970031738}} 11/07/2021 02:14:35 - INFO - __main__ - Step 35537: {'lr': 0.00043928574611325845, 'samples': 6823104, 'steps': 35536, 'loss/train': 2.5706093311309814}} 11/07/2021 02:14:35 - INFO - __main__ - Step 35537: {'lr': 0.00043928574611325845, 'samples': 6823104, 'steps': 35536, 'loss/train': 2.5706093311309814}} 11/07/2021 02:14:39 - INFO - __main__ - Step 35546: {'lr': 0.0004392545430213315, 'samples': 6824832, 'steps': 35545, 'loss/train': 1.476622462272644}4}} 11/07/2021 02:14:39 - INFO - __main__ - Step 35546: {'lr': 0.0004392545430213315, 'samples': 6824832, 'steps': 35545, 'loss/train': 1.476622462272644}4}} 11/07/2021 02:14:43 - INFO - __main__ - Step 35554: {'lr': 0.0004392268011408712, 'samples': 6826368, 'steps': 35553, 'loss/train': 1.6246516704559326}}} 11/07/2021 02:14:45 - INFO - __main__ - Step 35558: {'lr': 0.0004392129281542868, 'samples': 6827136, 'steps': 35557, 'loss/train': 1.6843079328536987}}} 11/07/2021 02:14:48 - INFO - __main__ - Step 35563: {'lr': 0.00043919558500279845, 'samples': 6828096, 'steps': 35562, 'loss/train': 1.2379157543182373}} 11/07/2021 02:14:48 - INFO - __main__ - Step 35563: {'lr': 0.00043919558500279845, 'samples': 6828096, 'steps': 35562, 'loss/train': 1.2379157543182373}} 11/07/2021 02:14:52 - INFO - __main__ - Step 35571: {'lr': 0.0004391678315275706, 'samples': 6829632, 'steps': 35570, 'loss/train': 1.577763557434082}3}} 11/07/2021 02:14:53 - INFO - __main__ - Step 35575: {'lr': 0.0004391539527442401, 'samples': 6830400, 'steps': 35574, 'loss/train': 1.359189510345459}3}} 11/07/2021 02:14:56 - INFO - __main__ - Step 35579: {'lr': 0.00043914007259723196, 'samples': 6831168, 'steps': 35578, 'loss/train': 0.13724112510681152} 11/07/2021 02:14:58 - INFO - __main__ - Step 35583: {'lr': 0.0004391261910866463, 'samples': 6831936, 'steps': 35582, 'loss/train': 1.4237700700759888}2} 11/07/2021 02:15:00 - INFO - __main__ - Step 35587: {'lr': 0.00043911230821258313, 'samples': 6832704, 'steps': 35586, 'loss/train': 1.623017430305481}2} 11/07/2021 02:15:00 - INFO - __main__ - Step 35587: {'lr': 0.00043911230821258313, 'samples': 6832704, 'steps': 35586, 'loss/train': 1.623017430305481}2} 11/07/2021 02:15:03 - INFO - __main__ - Step 35594: {'lr': 0.0004390880099024059, 'samples': 6834048, 'steps': 35593, 'loss/train': 0.9020565748214722}2} 11/07/2021 02:15:06 - INFO - __main__ - Step 35599: {'lr': 0.00043907065141052953, 'samples': 6835008, 'steps': 35598, 'loss/train': 0.11254219710826874} 11/07/2021 02:15:06 - INFO - __main__ - Step 35599: {'lr': 0.00043907065141052953, 'samples': 6835008, 'steps': 35598, 'loss/train': 0.11254219710826874} 11/07/2021 02:15:10 - INFO - __main__ - Step 35607: {'lr': 0.0004390428733936082, 'samples': 6836544, 'steps': 35606, 'loss/train': 0.6637961268424988}4} 11/07/2021 02:15:11 - INFO - __main__ - Step 35611: {'lr': 0.00043902898234078223, 'samples': 6837312, 'steps': 35610, 'loss/train': 1.6454213857650757}} 11/07/2021 02:15:14 - INFO - __main__ - Step 35615: {'lr': 0.00043901508992517956, 'samples': 6838080, 'steps': 35614, 'loss/train': 0.969472348690033}}} 11/07/2021 02:15:14 - INFO - __main__ - Step 35615: {'lr': 0.00043901508992517956, 'samples': 6838080, 'steps': 35614, 'loss/train': 0.969472348690033}}} 11/07/2021 02:15:18 - INFO - __main__ - Step 35623: {'lr': 0.0004389873010060449, 'samples': 6839616, 'steps': 35622, 'loss/train': 0.8303018808364868}}} 11/07/2021 02:15:19 - INFO - __main__ - Step 35627: {'lr': 0.00043897340450271317, 'samples': 6840384, 'steps': 35626, 'loss/train': 1.2990198135375977}} 11/07/2021 02:15:21 - INFO - __main__ - Step 35631: {'lr': 0.00043895950663700546, 'samples': 6841152, 'steps': 35630, 'loss/train': 1.5904690027236938}} 11/07/2021 02:15:24 - INFO - __main__ - Step 35636: {'lr': 0.0004389421323891822, 'samples': 6842112, 'steps': 35635, 'loss/train': 1.6761291027069092}}} 11/07/2021 02:15:26 - INFO - __main__ - Step 35641: {'lr': 0.0004389247560129987, 'samples': 6843072, 'steps': 35640, 'loss/train': 1.627358078956604}}}} 11/07/2021 02:15:26 - INFO - __main__ - Step 35641: {'lr': 0.0004389247560129987, 'samples': 6843072, 'steps': 35640, 'loss/train': 1.627358078956604}}}} 11/07/2021 02:15:29 - INFO - __main__ - Step 35648: {'lr': 0.0004389004255110693, 'samples': 6844416, 'steps': 35647, 'loss/train': 1.8229610919952393}}} 11/07/2021 02:15:31 - INFO - __main__ - Step 35652: {'lr': 0.00043888652049453163, 'samples': 6845184, 'steps': 35651, 'loss/train': 1.656050443649292}}} 11/07/2021 02:15:33 - INFO - __main__ - Step 35656: {'lr': 0.00043887261411624433, 'samples': 6845952, 'steps': 35655, 'loss/train': 1.5127336978912354}} 11/07/2021 02:15:36 - INFO - __main__ - Step 35660: {'lr': 0.00043885870637630763, 'samples': 6846720, 'steps': 35659, 'loss/train': 1.3242474794387817}} 11/07/2021 02:15:37 - INFO - __main__ - Step 35664: {'lr': 0.00043884479727482193, 'samples': 6847488, 'steps': 35663, 'loss/train': 1.5567348003387451}} 11/07/2021 02:15:39 - INFO - __main__ - Step 35668: {'lr': 0.0004388308868118873, 'samples': 6848256, 'steps': 35667, 'loss/train': 0.6823694109916687}}} 11/07/2021 02:15:42 - INFO - __main__ - Step 35673: {'lr': 0.0004388134968188344, 'samples': 6849216, 'steps': 35672, 'loss/train': 1.7342904806137085}}} 11/07/2021 02:15:44 - INFO - __main__ - Step 35677: {'lr': 0.0004387995832930067, 'samples': 6849984, 'steps': 35676, 'loss/train': 1.7262928485870361}}} 11/07/2021 02:15:46 - INFO - __main__ - Step 35681: {'lr': 0.00043878566840605606, 'samples': 6850752, 'steps': 35680, 'loss/train': 0.8708986639976501}} 11/07/2021 02:15:46 - INFO - __main__ - Step 35681: {'lr': 0.00043878566840605606, 'samples': 6850752, 'steps': 35680, 'loss/train': 0.8708986639976501}} 11/07/2021 02:15:49 - INFO - __main__ - Step 35688: {'lr': 0.00043876131407899233, 'samples': 6852096, 'steps': 35687, 'loss/train': 0.9426607489585876}} 11/07/2021 02:15:52 - INFO - __main__ - Step 35694: {'lr': 0.0004387404356244243, 'samples': 6853248, 'steps': 35693, 'loss/train': 1.674772024154663}6}} 11/07/2021 02:15:54 - INFO - __main__ - Step 35698: {'lr': 0.00043872651495382076, 'samples': 6854016, 'steps': 35697, 'loss/train': 1.7205495834350586}} 11/07/2021 02:15:54 - INFO - __main__ - Step 35698: {'lr': 0.00043872651495382076, 'samples': 6854016, 'steps': 35697, 'loss/train': 1.7205495834350586}} 11/07/2021 02:15:57 - INFO - __main__ - Step 35705: {'lr': 0.00043870215050639073, 'samples': 6855360, 'steps': 35704, 'loss/train': 5.773858070373535}}} 11/07/2021 02:15:59 - INFO - __main__ - Step 35710: {'lr': 0.00043868474477883523, 'samples': 6856320, 'steps': 35709, 'loss/train': 1.6956952810287476}} 11/07/2021 02:15:59 - INFO - __main__ - Step 35710: {'lr': 0.00043868474477883523, 'samples': 6856320, 'steps': 35709, 'loss/train': 1.6956952810287476}} 11/07/2021 02:16:04 - INFO - __main__ - Step 35717: {'lr': 0.00043866037318952735, 'samples': 6857664, 'steps': 35716, 'loss/train': 2.144253730773926}}} 11/07/2021 02:16:05 - INFO - __main__ - Step 35721: {'lr': 0.00043864644469686717, 'samples': 6858432, 'steps': 35720, 'loss/train': 0.9276940226554871}} 11/07/2021 02:16:08 - INFO - __main__ - Step 35725: {'lr': 0.0004386325148441882, 'samples': 6859200, 'steps': 35724, 'loss/train': 1.640588402748108}1}} 11/07/2021 02:16:10 - INFO - __main__ - Step 35730: {'lr': 0.0004386151006159659, 'samples': 6860160, 'steps': 35729, 'loss/train': 1.2281548976898193}}} 11/07/2021 02:16:12 - INFO - __main__ - Step 35734: {'lr': 0.0004386011677036118, 'samples': 6860928, 'steps': 35733, 'loss/train': 1.756344199180603}}}} 11/07/2021 02:16:13 - INFO - __main__ - Step 35738: {'lr': 0.00043858723343156514, 'samples': 6861696, 'steps': 35737, 'loss/train': 1.6566646099090576}} 11/07/2021 02:16:16 - INFO - __main__ - Step 35742: {'lr': 0.0004385732977999266, 'samples': 6862464, 'steps': 35741, 'loss/train': 1.0279544591903687}}} 11/07/2021 02:16:18 - INFO - __main__ - Step 35747: {'lr': 0.0004385558763486053, 'samples': 6863424, 'steps': 35746, 'loss/train': 1.5872620344161987}}} 11/07/2021 02:16:18 - INFO - __main__ - Step 35747: {'lr': 0.0004385558763486053, 'samples': 6863424, 'steps': 35746, 'loss/train': 1.5872620344161987}}} 11/07/2021 02:16:21 - INFO - __main__ - Step 35754: {'lr': 0.000438531482748464, 'samples': 6864768, 'steps': 35753, 'loss/train': 1.4319709539413452}}}} 11/07/2021 02:16:23 - INFO - __main__ - Step 35758: {'lr': 0.00043851754167946244, 'samples': 6865536, 'steps': 35757, 'loss/train': 1.4238637685775757}} 11/07/2021 02:16:26 - INFO - __main__ - Step 35763: {'lr': 0.0004385001134320026, 'samples': 6866496, 'steps': 35762, 'loss/train': 1.5388737916946411}}} 11/07/2021 02:16:26 - INFO - __main__ - Step 35763: {'lr': 0.0004385001134320026, 'samples': 6866496, 'steps': 35762, 'loss/train': 1.5388737916946411}}} 11/07/2021 02:16:30 - INFO - __main__ - Step 35771: {'lr': 0.0004384722238195159, 'samples': 6868032, 'steps': 35770, 'loss/train': 0.987594723701477}}}} 11/07/2021 02:16:32 - INFO - __main__ - Step 35775: {'lr': 0.000438458276975078, 'samples': 6868800, 'steps': 35774, 'loss/train': 1.3052928447723389}}}} 11/07/2021 02:16:33 - INFO - __main__ - Step 35779: {'lr': 0.0004384443287719779, 'samples': 6869568, 'steps': 35778, 'loss/train': 1.264674186706543}}}} 11/07/2021 02:16:36 - INFO - __main__ - Step 35783: {'lr': 0.00043843037921031616, 'samples': 6870336, 'steps': 35782, 'loss/train': 1.6619880199432373}} 11/07/2021 02:16:38 - INFO - __main__ - Step 35788: {'lr': 0.00043841294034791466, 'samples': 6871296, 'steps': 35787, 'loss/train': 1.5242619514465332}} 11/07/2021 02:16:38 - INFO - __main__ - Step 35788: {'lr': 0.00043841294034791466, 'samples': 6871296, 'steps': 35787, 'loss/train': 1.5242619514465332}} 11/07/2021 02:16:42 - INFO - __main__ - Step 35796: {'lr': 0.000438385033753564, 'samples': 6872832, 'steps': 35795, 'loss/train': 1.5358773469924927}2}} 11/07/2021 02:16:43 - INFO - __main__ - Step 35800: {'lr': 0.000438371078419137, 'samples': 6873600, 'steps': 35799, 'loss/train': 1.3921492099761963}2}} 11/07/2021 02:16:45 - INFO - __main__ - Step 35804: {'lr': 0.00043835712172667643, 'samples': 6874368, 'steps': 35803, 'loss/train': 1.0208276510238647}} 11/07/2021 02:16:48 - INFO - __main__ - Step 35809: {'lr': 0.0004383396739515192, 'samples': 6875328, 'steps': 35808, 'loss/train': 1.5520555973052979}}} 11/07/2021 02:16:50 - INFO - __main__ - Step 35814: {'lr': 0.0004383222240547882, 'samples': 6876288, 'steps': 35813, 'loss/train': 0.9822989702224731}}} 11/07/2021 02:16:50 - INFO - __main__ - Step 35814: {'lr': 0.0004383222240547882, 'samples': 6876288, 'steps': 35813, 'loss/train': 0.9822989702224731}}} 11/07/2021 02:16:54 - INFO - __main__ - Step 35821: {'lr': 0.00043829779063549515, 'samples': 6877632, 'steps': 35820, 'loss/train': 1.6605607271194458}} 11/07/2021 02:16:54 - INFO - __main__ - Step 35821: {'lr': 0.00043829779063549515, 'samples': 6877632, 'steps': 35820, 'loss/train': 1.6605607271194458}} 11/07/2021 02:16:58 - INFO - __main__ - Step 35829: {'lr': 0.00043826986163711835, 'samples': 6879168, 'steps': 35828, 'loss/train': 1.2509788274765015}} 11/07/2021 02:17:00 - INFO - __main__ - Step 35834: {'lr': 0.0004382524032560582, 'samples': 6880128, 'steps': 35833, 'loss/train': 1.533860683441162}5}} 11/07/2021 02:17:02 - INFO - __main__ - Step 35839: {'lr': 0.00043823494275440733, 'samples': 6881088, 'steps': 35838, 'loss/train': 1.4565870761871338}} 11/07/2021 02:17:04 - INFO - __main__ - Step 35843: {'lr': 0.0004382209728263935, 'samples': 6881856, 'steps': 35842, 'loss/train': 1.3423986434936523}}} 11/07/2021 02:17:07 - INFO - __main__ - Step 35847: {'lr': 0.00043820700154142825, 'samples': 6882624, 'steps': 35846, 'loss/train': 1.7680792808532715}} 11/07/2021 02:17:08 - INFO - __main__ - Step 35851: {'lr': 0.0004381930288996122, 'samples': 6883392, 'steps': 35850, 'loss/train': 1.5950745344161987}}} 11/07/2021 02:17:10 - INFO - __main__ - Step 35855: {'lr': 0.00043817905490104613, 'samples': 6884160, 'steps': 35854, 'loss/train': 1.4157003164291382}} 11/07/2021 02:17:12 - INFO - __main__ - Step 35860: {'lr': 0.0004381615854950625, 'samples': 6885120, 'steps': 35859, 'loss/train': 1.409956455230713}2}} 11/07/2021 02:17:12 - INFO - __main__ - Step 35860: {'lr': 0.0004381615854950625, 'samples': 6885120, 'steps': 35859, 'loss/train': 1.409956455230713}2}} 11/07/2021 02:17:15 - INFO - __main__ - Step 35866: {'lr': 0.00043814061941007, 'samples': 6886272, 'steps': 35865, 'loss/train': 1.0974922180175781}}2}} 11/07/2021 02:17:17 - INFO - __main__ - Step 35871: {'lr': 0.00043812314534129716, 'samples': 6887232, 'steps': 35870, 'loss/train': 1.0092236995697021}} 11/07/2021 02:17:20 - INFO - __main__ - Step 35875: {'lr': 0.00043810916456049257, 'samples': 6888000, 'steps': 35874, 'loss/train': 0.8110805153846741}} 11/07/2021 02:17:20 - INFO - __main__ - Step 35875: {'lr': 0.00043810916456049257, 'samples': 6888000, 'steps': 35874, 'loss/train': 0.8110805153846741}} 11/07/2021 02:17:24 - INFO - __main__ - Step 35883: {'lr': 0.00043808119893054787, 'samples': 6889536, 'steps': 35882, 'loss/train': 1.728338599205017}}} 11/07/2021 02:17:25 - INFO - __main__ - Step 35887: {'lr': 0.0004380672140816095, 'samples': 6890304, 'steps': 35886, 'loss/train': 1.350689172744751}}}} 11/07/2021 02:17:27 - INFO - __main__ - Step 35891: {'lr': 0.0004380532278768282, 'samples': 6891072, 'steps': 35890, 'loss/train': 1.6306557655334473}}} 11/07/2021 02:17:27 - INFO - __main__ - Step 35891: {'lr': 0.0004380532278768282, 'samples': 6891072, 'steps': 35890, 'loss/train': 1.6306557655334473}}} 11/07/2021 02:17:32 - INFO - __main__ - Step 35900: {'lr': 0.00043802175395929156, 'samples': 6892800, 'steps': 35899, 'loss/train': 1.3293200731277466}} 11/07/2021 02:17:32 - INFO - __main__ - Step 35900: {'lr': 0.00043802175395929156, 'samples': 6892800, 'steps': 35899, 'loss/train': 1.3293200731277466}} 11/07/2021 02:17:35 - INFO - __main__ - Step 35907: {'lr': 0.00043799726950128997, 'samples': 6894144, 'steps': 35906, 'loss/train': 1.206464171409607}}} 11/07/2021 02:17:38 - INFO - __main__ - Step 35912: {'lr': 0.00043797977806142585, 'samples': 6895104, 'steps': 35911, 'loss/train': 1.616759181022644}}} 11/07/2021 02:17:40 - INFO - __main__ - Step 35917: {'lr': 0.000437962284504042, 'samples': 6896064, 'steps': 35916, 'loss/train': 1.4445488452911377}}}} 11/07/2021 02:17:40 - INFO - __main__ - Step 35917: {'lr': 0.000437962284504042, 'samples': 6896064, 'steps': 35916, 'loss/train': 1.4445488452911377}}}} 11/07/2021 02:17:43 - INFO - __main__ - Step 35924: {'lr': 0.0004379377899666468, 'samples': 6897408, 'steps': 35923, 'loss/train': 1.3089522123336792}}} 11/07/2021 02:17:45 - INFO - __main__ - Step 35928: {'lr': 0.0004379237912250994, 'samples': 6898176, 'steps': 35927, 'loss/train': 1.6177557706832886}}} 11/07/2021 02:17:48 - INFO - __main__ - Step 35933: {'lr': 0.000437906290892977, 'samples': 6899136, 'steps': 35932, 'loss/train': 1.3447669744491577}}}} 11/07/2021 02:17:48 - INFO - __main__ - Step 35933: {'lr': 0.000437906290892977, 'samples': 6899136, 'steps': 35932, 'loss/train': 1.3447669744491577}}}} 11/07/2021 02:17:52 - INFO - __main__ - Step 35941: {'lr': 0.0004378782859589439, 'samples': 6900672, 'steps': 35940, 'loss/train': 1.4257400035858154}}} 11/07/2021 02:17:52 - INFO - __main__ - Step 35941: {'lr': 0.0004378782859589439, 'samples': 6900672, 'steps': 35940, 'loss/train': 1.4257400035858154}}} 11/07/2021 02:17:56 - INFO - __main__ - Step 35949: {'lr': 0.0004378502756069873, 'samples': 6902208, 'steps': 35948, 'loss/train': 0.6523028612136841}}} 11/07/2021 02:17:58 - INFO - __main__ - Step 35954: {'lr': 0.0004378327663860839, 'samples': 6903168, 'steps': 35953, 'loss/train': 1.661206841468811}}}} 11/07/2021 02:18:00 - INFO - __main__ - Step 35959: {'lr': 0.000437815255049317, 'samples': 6904128, 'steps': 35958, 'loss/train': 1.3309332132339478}}}} 11/07/2021 02:18:02 - INFO - __main__ - Step 35963: {'lr': 0.00043780124445661416, 'samples': 6904896, 'steps': 35962, 'loss/train': 1.721779704093933}}} 11/07/2021 02:18:04 - INFO - __main__ - Step 35967: {'lr': 0.0004377872325099858, 'samples': 6905664, 'steps': 35966, 'loss/train': 1.4563828706741333}}} 11/07/2021 02:18:07 - INFO - __main__ - Step 35971: {'lr': 0.000437773219209533, 'samples': 6906432, 'steps': 35970, 'loss/train': 1.6383625268936157}}}} 11/07/2021 02:18:08 - INFO - __main__ - Step 35975: {'lr': 0.0004377592045553568, 'samples': 6907200, 'steps': 35974, 'loss/train': 1.6243177652359009}}} 11/07/2021 02:18:10 - INFO - __main__ - Step 35979: {'lr': 0.0004377451885475581, 'samples': 6907968, 'steps': 35978, 'loss/train': 0.8041215538978577}}} 11/07/2021 02:18:13 - INFO - __main__ - Step 35984: {'lr': 0.0004377276666344322, 'samples': 6908928, 'steps': 35983, 'loss/train': 1.7502566576004028}}} 11/07/2021 02:18:13 - INFO - __main__ - Step 35984: {'lr': 0.0004377276666344322, 'samples': 6908928, 'steps': 35983, 'loss/train': 1.7502566576004028}}} 11/07/2021 02:18:16 - INFO - __main__ - Step 35990: {'lr': 0.0004377066375473213, 'samples': 6910080, 'steps': 35989, 'loss/train': 0.25427624583244324}} 11/07/2021 02:18:18 - INFO - __main__ - Step 35995: {'lr': 0.0004376891109821606, 'samples': 6911040, 'steps': 35994, 'loss/train': 1.4026904106140137}}} 11/07/2021 02:18:21 - INFO - __main__ - Step 36000: {'lr': 0.0004376715823027544, 'samples': 6912000, 'steps': 35999, 'loss/train': 1.4603431224822998}}} 11/07/2021 02:18:21 - INFO - __main__ - Step 36000: {'lr': 0.0004376715823027544, 'samples': 6912000, 'steps': 35999, 'loss/train': 1.4603431224822998}}} 11/07/2021 02:18:24 - INFO - __main__ - Step 36007: {'lr': 0.0004376470386000294, 'samples': 6913344, 'steps': 36006, 'loss/train': 1.5615692138671875}}} 11/07/2021 02:18:26 - INFO - __main__ - Step 36011: {'lr': 0.00043763301176689, 'samples': 6914112, 'steps': 36010, 'loss/train': 2.8755180835723877}5}}} 11/07/2021 02:18:29 - INFO - __main__ - Step 36016: {'lr': 0.0004376154763232255, 'samples': 6915072, 'steps': 36015, 'loss/train': 1.160670518875122}}}} 11/07/2021 02:18:29 - INFO - __main__ - Step 36016: {'lr': 0.0004376154763232255, 'samples': 6915072, 'steps': 36015, 'loss/train': 1.160670518875122}}}} 11/07/2021 02:18:32 - INFO - __main__ - Step 36023: {'lr': 0.00043759092315160064, 'samples': 6916416, 'steps': 36022, 'loss/train': 1.4155391454696655}} 11/07/2021 02:18:34 - INFO - __main__ - Step 36027: {'lr': 0.0004375768909082175, 'samples': 6917184, 'steps': 36026, 'loss/train': 1.2582416534423828}}} 11/07/2021 02:18:37 - INFO - __main__ - Step 36032: {'lr': 0.0004375593487023174, 'samples': 6918144, 'steps': 36031, 'loss/train': 1.5267812013626099}}} 11/07/2021 02:18:39 - INFO - __main__ - Step 36036: {'lr': 0.00043754531341638346, 'samples': 6918912, 'steps': 36035, 'loss/train': 2.0117814540863037}} 11/07/2021 02:18:41 - INFO - __main__ - Step 36040: {'lr': 0.00043753127677836917, 'samples': 6919680, 'steps': 36039, 'loss/train': 0.44519782066345215} 11/07/2021 02:18:43 - INFO - __main__ - Step 36044: {'lr': 0.0004375172387883757, 'samples': 6920448, 'steps': 36043, 'loss/train': 1.5872255563735962}5} 11/07/2021 02:18:44 - INFO - __main__ - Step 36048: {'lr': 0.0004375031994465042, 'samples': 6921216, 'steps': 36047, 'loss/train': 1.1131813526153564}5} 11/07/2021 02:18:46 - INFO - __main__ - Step 36052: {'lr': 0.000437489158752856, 'samples': 6921984, 'steps': 36051, 'loss/train': 0.20430973172187805}5} 11/07/2021 02:18:46 - INFO - __main__ - Step 36052: {'lr': 0.000437489158752856, 'samples': 6921984, 'steps': 36051, 'loss/train': 0.20430973172187805}5} 11/07/2021 02:18:50 - INFO - __main__ - Step 36059: {'lr': 0.00043746458428656324, 'samples': 6923328, 'steps': 36058, 'loss/train': 1.5204285383224487}} 11/07/2021 02:18:52 - INFO - __main__ - Step 36063: {'lr': 0.00043745053987605075, 'samples': 6924096, 'steps': 36062, 'loss/train': 1.710706353187561}}} 11/07/2021 02:18:54 - INFO - __main__ - Step 36068: {'lr': 0.00043743298246251994, 'samples': 6925056, 'steps': 36067, 'loss/train': 1.7695902585983276}} 11/07/2021 02:18:54 - INFO - __main__ - Step 36068: {'lr': 0.00043743298246251994, 'samples': 6925056, 'steps': 36067, 'loss/train': 1.7695902585983276}} 11/07/2021 02:18:58 - INFO - __main__ - Step 36076: {'lr': 0.0004374048862093236, 'samples': 6926592, 'steps': 36075, 'loss/train': 1.6173049211502075}}} 11/07/2021 02:19:00 - INFO - __main__ - Step 36080: {'lr': 0.00043739083605607275, 'samples': 6927360, 'steps': 36079, 'loss/train': 1.165663242340088}}} 11/07/2021 02:19:02 - INFO - __main__ - Step 36084: {'lr': 0.00043737678455185524, 'samples': 6928128, 'steps': 36083, 'loss/train': 1.3838664293289185}} 11/07/2021 02:19:04 - INFO - __main__ - Step 36089: {'lr': 0.0004373592182719408, 'samples': 6929088, 'steps': 36088, 'loss/train': 1.0795824527740479}}} 11/07/2021 02:19:06 - INFO - __main__ - Step 36093: {'lr': 0.0004373451637284186, 'samples': 6929856, 'steps': 36092, 'loss/train': 2.1793367862701416}}} 11/07/2021 02:19:08 - INFO - __main__ - Step 36097: {'lr': 0.00043733110783425894, 'samples': 6930624, 'steps': 36096, 'loss/train': 1.4539893865585327}} 11/07/2021 02:19:11 - INFO - __main__ - Step 36101: {'lr': 0.0004373170505895632, 'samples': 6931392, 'steps': 36100, 'loss/train': 1.4807472229003906}}} 11/07/2021 02:19:12 - INFO - __main__ - Step 36105: {'lr': 0.0004373029919944327, 'samples': 6932160, 'steps': 36104, 'loss/train': 0.14362464845180511}} 11/07/2021 02:19:14 - INFO - __main__ - Step 36109: {'lr': 0.0004372889320489688, 'samples': 6932928, 'steps': 36108, 'loss/train': 1.5239927768707275}}} 11/07/2021 02:19:16 - INFO - __main__ - Step 36114: {'lr': 0.00043727135521838697, 'samples': 6933888, 'steps': 36113, 'loss/train': 1.4923559427261353}} 11/07/2021 02:19:16 - INFO - __main__ - Step 36114: {'lr': 0.00043727135521838697, 'samples': 6933888, 'steps': 36113, 'loss/train': 1.4923559427261353}} 11/07/2021 02:19:20 - INFO - __main__ - Step 36121: {'lr': 0.0004372467441115903, 'samples': 6935232, 'steps': 36120, 'loss/train': 1.1723154783248901}}} 11/07/2021 02:19:22 - INFO - __main__ - Step 36125: {'lr': 0.0004372326787658065, 'samples': 6936000, 'steps': 36124, 'loss/train': 0.9541419148445129}}} 11/07/2021 02:19:24 - INFO - __main__ - Step 36130: {'lr': 0.00043721509518539507, 'samples': 6936960, 'steps': 36129, 'loss/train': 1.2257826328277588}} 11/07/2021 02:19:27 - INFO - __main__ - Step 36135: {'lr': 0.0004371975094960778, 'samples': 6937920, 'steps': 36134, 'loss/train': 1.6989877223968506}}} 11/07/2021 02:19:27 - INFO - __main__ - Step 36135: {'lr': 0.0004371975094960778, 'samples': 6937920, 'steps': 36134, 'loss/train': 1.6989877223968506}}} 11/07/2021 02:19:30 - INFO - __main__ - Step 36142: {'lr': 0.00043717288598844916, 'samples': 6939264, 'steps': 36141, 'loss/train': 1.166265606880188}}} 11/07/2021 02:19:32 - INFO - __main__ - Step 36146: {'lr': 0.00043715881355720776, 'samples': 6940032, 'steps': 36145, 'loss/train': 1.328829288482666}}} 11/07/2021 02:19:34 - INFO - __main__ - Step 36150: {'lr': 0.0004371447397766724, 'samples': 6940800, 'steps': 36149, 'loss/train': 1.4469424486160278}}} 11/07/2021 02:19:37 - INFO - __main__ - Step 36156: {'lr': 0.0004371236265761651, 'samples': 6941952, 'steps': 36155, 'loss/train': 1.5506919622421265}}} 11/07/2021 02:19:39 - INFO - __main__ - Step 36160: {'lr': 0.00043710954942283875, 'samples': 6942720, 'steps': 36159, 'loss/train': 1.5076546669006348}} 11/07/2021 02:19:39 - INFO - __main__ - Step 36160: {'lr': 0.00043710954942283875, 'samples': 6942720, 'steps': 36159, 'loss/train': 1.5076546669006348}} 11/07/2021 02:19:42 - INFO - __main__ - Step 36167: {'lr': 0.0004370849111586946, 'samples': 6944064, 'steps': 36166, 'loss/train': 1.7870904207229614}}} 11/07/2021 02:19:44 - INFO - __main__ - Step 36171: {'lr': 0.0004370708302960307, 'samples': 6944832, 'steps': 36170, 'loss/train': 1.1478573083877563}}} 11/07/2021 02:19:47 - INFO - __main__ - Step 36176: {'lr': 0.00043705322732116007, 'samples': 6945792, 'steps': 36175, 'loss/train': 1.362337350845337}}} 11/07/2021 02:19:49 - INFO - __main__ - Step 36180: {'lr': 0.00043703914342415473, 'samples': 6946560, 'steps': 36179, 'loss/train': 1.6111247539520264}} 11/07/2021 02:19:51 - INFO - __main__ - Step 36184: {'lr': 0.0004370250581787181, 'samples': 6947328, 'steps': 36183, 'loss/train': 0.8045414686203003}}} 11/07/2021 02:19:53 - INFO - __main__ - Step 36188: {'lr': 0.00043701097158495186, 'samples': 6948096, 'steps': 36187, 'loss/train': 1.0144716501235962}} 11/07/2021 02:19:54 - INFO - __main__ - Step 36192: {'lr': 0.0004369968836429574, 'samples': 6948864, 'steps': 36191, 'loss/train': 1.3015068769454956}}} 11/07/2021 02:19:56 - INFO - __main__ - Step 36196: {'lr': 0.00043698279435283637, 'samples': 6949632, 'steps': 36195, 'loss/train': 1.662702202796936}}} 11/07/2021 02:19:59 - INFO - __main__ - Step 36201: {'lr': 0.000436965180844537, 'samples': 6950592, 'steps': 36200, 'loss/train': 1.7721482515335083}}}} 11/07/2021 02:20:01 - INFO - __main__ - Step 36205: {'lr': 0.0004369510885215026, 'samples': 6951360, 'steps': 36204, 'loss/train': 1.2825634479522705}}} 11/07/2021 02:20:03 - INFO - __main__ - Step 36209: {'lr': 0.00043693699485067186, 'samples': 6952128, 'steps': 36208, 'loss/train': 1.2388901710510254}} 11/07/2021 02:20:04 - INFO - __main__ - Step 36213: {'lr': 0.00043692289983214626, 'samples': 6952896, 'steps': 36212, 'loss/train': 1.5022273063659668}} 11/07/2021 02:20:06 - INFO - __main__ - Step 36217: {'lr': 0.00043690880346602755, 'samples': 6953664, 'steps': 36216, 'loss/train': 1.4669190645217896}} 11/07/2021 02:20:09 - INFO - __main__ - Step 36222: {'lr': 0.00043689118111348105, 'samples': 6954624, 'steps': 36221, 'loss/train': 1.7597016096115112}} 11/07/2021 02:20:09 - INFO - __main__ - Step 36222: {'lr': 0.00043689118111348105, 'samples': 6954624, 'steps': 36221, 'loss/train': 1.7597016096115112}} 11/07/2021 02:20:13 - INFO - __main__ - Step 36230: {'lr': 0.00043686298097055456, 'samples': 6956160, 'steps': 36229, 'loss/train': 1.7766621112823486}} 11/07/2021 02:20:15 - INFO - __main__ - Step 36234: {'lr': 0.00043684887887829863, 'samples': 6956928, 'steps': 36233, 'loss/train': 1.5048789978027344}} 11/07/2021 02:20:16 - INFO - __main__ - Step 36238: {'lr': 0.00043683477543898314, 'samples': 6957696, 'steps': 36237, 'loss/train': 0.8398938775062561}} 11/07/2021 02:20:18 - INFO - __main__ - Step 36242: {'lr': 0.0004368206706527098, 'samples': 6958464, 'steps': 36241, 'loss/train': 1.5783356428146362}}} 11/07/2021 02:20:21 - INFO - __main__ - Step 36248: {'lr': 0.0004367995109479763, 'samples': 6959616, 'steps': 36247, 'loss/train': 1.4469062089920044}}} 11/07/2021 02:20:23 - INFO - __main__ - Step 36252: {'lr': 0.00043678540279475314, 'samples': 6960384, 'steps': 36251, 'loss/train': 1.4475023746490479}} 11/07/2021 02:20:23 - INFO - __main__ - Step 36252: {'lr': 0.00043678540279475314, 'samples': 6960384, 'steps': 36251, 'loss/train': 1.4475023746490479}} 11/07/2021 02:20:26 - INFO - __main__ - Step 36258: {'lr': 0.0004367642380400717, 'samples': 6961536, 'steps': 36257, 'loss/train': 1.542945146560669}9}} 11/07/2021 02:20:28 - INFO - __main__ - Step 36263: {'lr': 0.0004367465984302794, 'samples': 6962496, 'steps': 36262, 'loss/train': 1.3904285430908203}}} 11/07/2021 02:20:31 - INFO - __main__ - Step 36268: {'lr': 0.0004367289567168588, 'samples': 6963456, 'steps': 36267, 'loss/train': 1.3548353910446167}}} 11/07/2021 02:20:33 - INFO - __main__ - Step 36272: {'lr': 0.0004367148418316434, 'samples': 6964224, 'steps': 36271, 'loss/train': 1.1501420736312866}}} 11/07/2021 02:20:33 - INFO - __main__ - Step 36272: {'lr': 0.0004367148418316434, 'samples': 6964224, 'steps': 36271, 'loss/train': 1.1501420736312866}}} 11/07/2021 02:20:36 - INFO - __main__ - Step 36279: {'lr': 0.0004366901375435408, 'samples': 6965568, 'steps': 36278, 'loss/train': 1.6254451274871826}}} 11/07/2021 02:20:38 - INFO - __main__ - Step 36283: {'lr': 0.000436676018956814, 'samples': 6966336, 'steps': 36282, 'loss/train': 1.441980004310608}6}}} 11/07/2021 02:20:41 - INFO - __main__ - Step 36288: {'lr': 0.00043665836883086725, 'samples': 6967296, 'steps': 36287, 'loss/train': 1.3476370573043823}} 11/07/2021 02:20:43 - INFO - __main__ - Step 36293: {'lr': 0.00043664071660228605, 'samples': 6968256, 'steps': 36292, 'loss/train': 1.2941185235977173}} 11/07/2021 02:20:43 - INFO - __main__ - Step 36293: {'lr': 0.00043664071660228605, 'samples': 6968256, 'steps': 36292, 'loss/train': 1.2941185235977173}} 11/07/2021 02:20:47 - INFO - __main__ - Step 36301: {'lr': 0.0004366124686635727, 'samples': 6969792, 'steps': 36300, 'loss/train': 1.804154396057129}3}} 11/07/2021 02:20:49 - INFO - __main__ - Step 36305: {'lr': 0.00043659834267613227, 'samples': 6970560, 'steps': 36304, 'loss/train': 1.4944956302642822}} 11/07/2021 02:20:51 - INFO - __main__ - Step 36309: {'lr': 0.00043658421534343856, 'samples': 6971328, 'steps': 36308, 'loss/train': 1.585021734237671}}} 11/07/2021 02:20:51 - INFO - __main__ - Step 36309: {'lr': 0.00043658421534343856, 'samples': 6971328, 'steps': 36308, 'loss/train': 1.585021734237671}}} 11/07/2021 02:20:55 - INFO - __main__ - Step 36317: {'lr': 0.0004365559566426985, 'samples': 6972864, 'steps': 36316, 'loss/train': 1.3983964920043945}}} 11/07/2021 02:20:56 - INFO - __main__ - Step 36321: {'lr': 0.0004365418252748559, 'samples': 6973632, 'steps': 36320, 'loss/train': 1.5906174182891846}}} 11/07/2021 02:20:58 - INFO - __main__ - Step 36325: {'lr': 0.0004365276925621674, 'samples': 6974400, 'steps': 36324, 'loss/train': 1.2448979616165161}}} 11/07/2021 02:21:01 - INFO - __main__ - Step 36330: {'lr': 0.0004365100247802725, 'samples': 6975360, 'steps': 36329, 'loss/train': 1.696042776107788}}}} 11/07/2021 02:21:01 - INFO - __main__ - Step 36330: {'lr': 0.0004365100247802725, 'samples': 6975360, 'steps': 36329, 'loss/train': 1.696042776107788}}}} 11/07/2021 02:21:04 - INFO - __main__ - Step 36336: {'lr': 0.0004364888206687443, 'samples': 6976512, 'steps': 36335, 'loss/train': 2.372626304626465}}}} 11/07/2021 02:21:07 - INFO - __main__ - Step 36341: {'lr': 0.0004364711482649925, 'samples': 6977472, 'steps': 36340, 'loss/train': 1.3252372741699219}}} 11/07/2021 02:21:09 - INFO - __main__ - Step 36346: {'lr': 0.00043645347376071507, 'samples': 6978432, 'steps': 36345, 'loss/train': 1.7303555011749268}} 11/07/2021 02:21:09 - INFO - __main__ - Step 36346: {'lr': 0.00043645347376071507, 'samples': 6978432, 'steps': 36345, 'loss/train': 1.7303555011749268}} 11/07/2021 02:21:12 - INFO - __main__ - Step 36353: {'lr': 0.00043642872592622293, 'samples': 6979776, 'steps': 36352, 'loss/train': 1.3439143896102905}} 11/07/2021 02:21:14 - INFO - __main__ - Step 36357: {'lr': 0.0004364145824584361, 'samples': 6980544, 'steps': 36356, 'loss/train': 1.4082454442977905}}} 11/07/2021 02:21:17 - INFO - __main__ - Step 36362: {'lr': 0.00043639690123381503, 'samples': 6981504, 'steps': 36361, 'loss/train': 0.9121442437171936}} 11/07/2021 02:21:19 - INFO - __main__ - Step 36366: {'lr': 0.0004363827547423324, 'samples': 6982272, 'steps': 36365, 'loss/train': 4.262648582458496}6}} 11/07/2021 02:21:21 - INFO - __main__ - Step 36370: {'lr': 0.00043636860690715064, 'samples': 6983040, 'steps': 36369, 'loss/train': 1.4799058437347412}} 11/07/2021 02:21:22 - INFO - __main__ - Step 36374: {'lr': 0.0004363544577283718, 'samples': 6983808, 'steps': 36373, 'loss/train': 1.4895075559616089}}} 11/07/2021 02:21:25 - INFO - __main__ - Step 36378: {'lr': 0.000436340307206098, 'samples': 6984576, 'steps': 36377, 'loss/train': 1.4843051433563232}}}} 11/07/2021 02:21:27 - INFO - __main__ - Step 36383: {'lr': 0.00043632261716412097, 'samples': 6985536, 'steps': 36382, 'loss/train': 0.906044065952301}}} 11/07/2021 02:21:27 - INFO - __main__ - Step 36383: {'lr': 0.00043632261716412097, 'samples': 6985536, 'steps': 36382, 'loss/train': 0.906044065952301}}} 11/07/2021 02:21:31 - INFO - __main__ - Step 36391: {'lr': 0.00043629430873142773, 'samples': 6987072, 'steps': 36390, 'loss/train': 0.39275264739990234} 11/07/2021 02:21:33 - INFO - __main__ - Step 36395: {'lr': 0.00043628015250043794, 'samples': 6987840, 'steps': 36394, 'loss/train': 1.107160210609436}4} 11/07/2021 02:21:35 - INFO - __main__ - Step 36399: {'lr': 0.00043626599492648877, 'samples': 6988608, 'steps': 36398, 'loss/train': 1.4821674823760986}} 11/07/2021 02:21:37 - INFO - __main__ - Step 36403: {'lr': 0.00043625183600968224, 'samples': 6989376, 'steps': 36402, 'loss/train': 1.764100193977356}}} 11/07/2021 02:21:39 - INFO - __main__ - Step 36408: {'lr': 0.00043623413547543645, 'samples': 6990336, 'steps': 36407, 'loss/train': 1.0938903093338013}} 11/07/2021 02:21:41 - INFO - __main__ - Step 36412: {'lr': 0.0004362199735375742, 'samples': 6991104, 'steps': 36411, 'loss/train': 1.1211869716644287}}} 11/07/2021 02:21:41 - INFO - __main__ - Step 36412: {'lr': 0.0004362199735375742, 'samples': 6991104, 'steps': 36411, 'loss/train': 1.1211869716644287}}} 11/07/2021 02:21:45 - INFO - __main__ - Step 36419: {'lr': 0.00043619518691592453, 'samples': 6992448, 'steps': 36418, 'loss/train': 0.7464272975921631}} 11/07/2021 02:21:47 - INFO - __main__ - Step 36424: {'lr': 0.0004361774796692425, 'samples': 6993408, 'steps': 36423, 'loss/train': 1.7541043758392334}}} 11/07/2021 02:21:49 - INFO - __main__ - Step 36428: {'lr': 0.0004361633123618908, 'samples': 6994176, 'steps': 36427, 'loss/train': 1.10480797290802}4}}} 11/07/2021 02:21:49 - INFO - __main__ - Step 36428: {'lr': 0.0004361633123618908, 'samples': 6994176, 'steps': 36427, 'loss/train': 1.10480797290802}4}}} 11/07/2021 02:21:53 - INFO - __main__ - Step 36435: {'lr': 0.00043613851634461743, 'samples': 6995520, 'steps': 36434, 'loss/train': 1.5099252462387085}} 11/07/2021 02:21:54 - INFO - __main__ - Step 36439: {'lr': 0.0004361243453466896, 'samples': 6996288, 'steps': 36438, 'loss/train': 1.8450305461883545}}} 11/07/2021 02:21:57 - INFO - __main__ - Step 36443: {'lr': 0.0004361101730069256, 'samples': 6997056, 'steps': 36442, 'loss/train': 0.9707183837890625}}} 11/07/2021 02:21:59 - INFO - __main__ - Step 36448: {'lr': 0.00043609245569541924, 'samples': 6998016, 'steps': 36447, 'loss/train': 1.5884859561920166}} 11/07/2021 02:22:01 - INFO - __main__ - Step 36453: {'lr': 0.000436074736287653, 'samples': 6998976, 'steps': 36452, 'loss/train': 1.6394306421279907}6}} 11/07/2021 02:22:01 - INFO - __main__ - Step 36453: {'lr': 0.000436074736287653, 'samples': 6998976, 'steps': 36452, 'loss/train': 1.6394306421279907}6}} 11/07/2021 02:22:05 - INFO - __main__ - Step 36460: {'lr': 0.0004360499255954442, 'samples': 7000320, 'steps': 36459, 'loss/train': 1.176275372505188}6}} 11/07/2021 02:22:07 - INFO - __main__ - Step 36464: {'lr': 0.0004360357462127171, 'samples': 7001088, 'steps': 36463, 'loss/train': 1.5522664785385132}}} 11/07/2021 02:22:09 - INFO - __main__ - Step 36469: {'lr': 0.0004360180200982613, 'samples': 7002048, 'steps': 36468, 'loss/train': 1.7998279333114624}}} 11/07/2021 02:22:09 - INFO - __main__ - Step 36469: {'lr': 0.0004360180200982613, 'samples': 7002048, 'steps': 36468, 'loss/train': 1.7998279333114624}}} 11/07/2021 02:22:13 - INFO - __main__ - Step 36477: {'lr': 0.00043598965395673893, 'samples': 7003584, 'steps': 36476, 'loss/train': 1.5747464895248413}} 11/07/2021 02:22:15 - INFO - __main__ - Step 36481: {'lr': 0.000435975468874629, 'samples': 7004352, 'steps': 36480, 'loss/train': 1.4760842323303223}3}} 11/07/2021 02:22:17 - INFO - __main__ - Step 36485: {'lr': 0.0004359612824517563, 'samples': 7005120, 'steps': 36484, 'loss/train': 1.6380879878997803}}} 11/07/2021 02:22:19 - INFO - __main__ - Step 36489: {'lr': 0.000435947094688223, 'samples': 7005888, 'steps': 36488, 'loss/train': 1.4367139339447021}}}} 11/07/2021 02:22:21 - INFO - __main__ - Step 36494: {'lr': 0.0004359293580986583, 'samples': 7006848, 'steps': 36493, 'loss/train': 1.8951630592346191}}} 11/07/2021 02:22:21 - INFO - __main__ - Step 36494: {'lr': 0.0004359293580986583, 'samples': 7006848, 'steps': 36493, 'loss/train': 1.8951630592346191}}} 11/07/2021 02:22:25 - INFO - __main__ - Step 36501: {'lr': 0.00043590452335468265, 'samples': 7008192, 'steps': 36500, 'loss/train': 0.6577228307723999}} 11/07/2021 02:22:27 - INFO - __main__ - Step 36505: {'lr': 0.0004358903302295301, 'samples': 7008960, 'steps': 36504, 'loss/train': 1.8605214357376099}}} 11/07/2021 02:22:29 - INFO - __main__ - Step 36510: {'lr': 0.00043587258693851685, 'samples': 7009920, 'steps': 36509, 'loss/train': 1.5353094339370728}} 11/07/2021 02:22:31 - INFO - __main__ - Step 36514: {'lr': 0.0004358583907981729, 'samples': 7010688, 'steps': 36513, 'loss/train': 1.7945281267166138}}} 11/07/2021 02:22:31 - INFO - __main__ - Step 36514: {'lr': 0.0004358583907981729, 'samples': 7010688, 'steps': 36513, 'loss/train': 1.7945281267166138}}} 11/07/2021 02:22:35 - INFO - __main__ - Step 36521: {'lr': 0.000435833544328453, 'samples': 7012032, 'steps': 36520, 'loss/train': 1.0032232999801636}}}} 11/07/2021 02:22:37 - INFO - __main__ - Step 36526: {'lr': 0.0004358157943380379, 'samples': 7012992, 'steps': 36525, 'loss/train': 1.2418889999389648}}} 11/07/2021 02:22:37 - INFO - __main__ - Step 36526: {'lr': 0.0004358157943380379, 'samples': 7012992, 'steps': 36525, 'loss/train': 1.2418889999389648}}} 11/07/2021 02:22:42 - INFO - __main__ - Step 36534: {'lr': 0.00043578738999971886, 'samples': 7014528, 'steps': 36533, 'loss/train': 1.2998322248458862}} 11/07/2021 02:22:43 - INFO - __main__ - Step 36538: {'lr': 0.0004357731858213978, 'samples': 7015296, 'steps': 36537, 'loss/train': 1.26777982711792}62}} 11/07/2021 02:22:45 - INFO - __main__ - Step 36542: {'lr': 0.00043575898030377225, 'samples': 7016064, 'steps': 36541, 'loss/train': 1.8516111373901367}} 11/07/2021 02:22:47 - INFO - __main__ - Step 36547: {'lr': 0.0004357412215234994, 'samples': 7017024, 'steps': 36546, 'loss/train': 1.2742689847946167}}} 11/07/2021 02:22:50 - INFO - __main__ - Step 36551: {'lr': 0.00043572701299281327, 'samples': 7017792, 'steps': 36550, 'loss/train': 1.5491282939910889}} 11/07/2021 02:22:51 - INFO - __main__ - Step 36555: {'lr': 0.00043571280312315543, 'samples': 7018560, 'steps': 36554, 'loss/train': 0.42579716444015503} 11/07/2021 02:22:54 - INFO - __main__ - Step 36559: {'lr': 0.00043569859191462847, 'samples': 7019328, 'steps': 36558, 'loss/train': 1.3331043720245361}} 11/07/2021 02:22:55 - INFO - __main__ - Step 36563: {'lr': 0.00043568437936733473, 'samples': 7020096, 'steps': 36562, 'loss/train': 1.5390021800994873}} 11/07/2021 02:22:57 - INFO - __main__ - Step 36567: {'lr': 0.00043567016548137685, 'samples': 7020864, 'steps': 36566, 'loss/train': 1.0905752182006836}} 11/07/2021 02:22:57 - INFO - __main__ - Step 36567: {'lr': 0.00043567016548137685, 'samples': 7020864, 'steps': 36566, 'loss/train': 1.0905752182006836}} 11/07/2021 02:22:57 - INFO - __main__ - Step 36567: {'lr': 0.00043567016548137685, 'samples': 7020864, 'steps': 36566, 'loss/train': 1.0905752182006836}} 11/07/2021 02:23:03 - INFO - __main__ - Step 36578: {'lr': 0.0004356310703933415, 'samples': 7022976, 'steps': 36577, 'loss/train': 0.35052648186683655}} 11/07/2021 02:23:06 - INFO - __main__ - Step 36583: {'lr': 0.000435613296552952, 'samples': 7023936, 'steps': 36582, 'loss/train': 1.6587493419647217}5}} 11/07/2021 02:23:06 - INFO - __main__ - Step 36583: {'lr': 0.000435613296552952, 'samples': 7023936, 'steps': 36582, 'loss/train': 1.6587493419647217}5}} 11/07/2021 02:23:06 - INFO - __main__ - Step 36583: {'lr': 0.000435613296552952, 'samples': 7023936, 'steps': 36582, 'loss/train': 1.6587493419647217}5}} 11/07/2021 02:23:11 - INFO - __main__ - Step 36594: {'lr': 0.0004355741867445423, 'samples': 7026048, 'steps': 36593, 'loss/train': 1.5897024869918823}}} 11/07/2021 02:23:13 - INFO - __main__ - Step 36598: {'lr': 0.00043555996248741157, 'samples': 7026816, 'steps': 36597, 'loss/train': 1.5443997383117676}} 11/07/2021 02:23:13 - INFO - __main__ - Step 36598: {'lr': 0.00043555996248741157, 'samples': 7026816, 'steps': 36597, 'loss/train': 1.5443997383117676}} 11/07/2021 02:23:18 - INFO - __main__ - Step 36607: {'lr': 0.000435527953017812, 'samples': 7028544, 'steps': 36606, 'loss/train': 2.162864923477173}76}} 11/07/2021 02:23:18 - INFO - __main__ - Step 36607: {'lr': 0.000435527953017812, 'samples': 7028544, 'steps': 36606, 'loss/train': 2.162864923477173}76}} 11/07/2021 02:23:21 - INFO - __main__ - Step 36614: {'lr': 0.0004355030520822414, 'samples': 7029888, 'steps': 36613, 'loss/train': 1.3485804796218872}}} 11/07/2021 02:23:24 - INFO - __main__ - Step 36620: {'lr': 0.0004354817051633523, 'samples': 7031040, 'steps': 36619, 'loss/train': 1.7353585958480835}}} 11/07/2021 02:23:24 - INFO - __main__ - Step 36620: {'lr': 0.0004354817051633523, 'samples': 7031040, 'steps': 36619, 'loss/train': 1.7353585958480835}}} 11/07/2021 02:23:28 - INFO - __main__ - Step 36627: {'lr': 0.0004354567966220013, 'samples': 7032384, 'steps': 36626, 'loss/train': 0.542655348777771}}}} 11/07/2021 02:23:28 - INFO - __main__ - Step 36627: {'lr': 0.0004354567966220013, 'samples': 7032384, 'steps': 36626, 'loss/train': 0.542655348777771}}}} 11/07/2021 02:23:31 - INFO - __main__ - Step 36635: {'lr': 0.00043542832470379415, 'samples': 7033920, 'steps': 36634, 'loss/train': 0.9428682923316956}} 11/07/2021 02:23:34 - INFO - __main__ - Step 36640: {'lr': 0.00043541052703945034, 'samples': 7034880, 'steps': 36639, 'loss/train': 1.4707204103469849}} 11/07/2021 02:23:36 - INFO - __main__ - Step 36645: {'lr': 0.0004353927272865285, 'samples': 7035840, 'steps': 36644, 'loss/train': 1.5697284936904907}}} 11/07/2021 02:23:36 - INFO - __main__ - Step 36645: {'lr': 0.0004353927272865285, 'samples': 7035840, 'steps': 36644, 'loss/train': 1.5697284936904907}}} 11/07/2021 02:23:40 - INFO - __main__ - Step 36652: {'lr': 0.00043536780412400857, 'samples': 7037184, 'steps': 36651, 'loss/train': 1.5300203561782837}} 11/07/2021 02:23:41 - INFO - __main__ - Step 36656: {'lr': 0.00043535356047929387, 'samples': 7037952, 'steps': 36655, 'loss/train': 1.262406826019287}}} 11/07/2021 02:23:44 - INFO - __main__ - Step 36661: {'lr': 0.00043533575404426986, 'samples': 7038912, 'steps': 36660, 'loss/train': 1.4423415660858154}} 11/07/2021 02:23:46 - INFO - __main__ - Step 36665: {'lr': 0.0004353215073930712, 'samples': 7039680, 'steps': 36664, 'loss/train': 1.543150544166565}4}} 11/07/2021 02:23:48 - INFO - __main__ - Step 36669: {'lr': 0.0004353072594058243, 'samples': 7040448, 'steps': 36668, 'loss/train': 1.6925253868103027}}} 11/07/2021 02:23:50 - INFO - __main__ - Step 36673: {'lr': 0.000435293010082632, 'samples': 7041216, 'steps': 36672, 'loss/train': 1.4857127666473389}}}} 11/07/2021 02:23:51 - INFO - __main__ - Step 36677: {'lr': 0.00043527875942359697, 'samples': 7041984, 'steps': 36676, 'loss/train': 1.895818829536438}}} 11/07/2021 02:23:54 - INFO - __main__ - Step 36682: {'lr': 0.0004352609442214309, 'samples': 7042944, 'steps': 36681, 'loss/train': 1.5160088539123535}}} 11/07/2021 02:23:56 - INFO - __main__ - Step 36687: {'lr': 0.00043524312693237166, 'samples': 7043904, 'steps': 36686, 'loss/train': 1.4041990041732788}} 11/07/2021 02:23:56 - INFO - __main__ - Step 36687: {'lr': 0.00043524312693237166, 'samples': 7043904, 'steps': 36686, 'loss/train': 1.4041990041732788}} 11/07/2021 02:24:00 - INFO - __main__ - Step 36694: {'lr': 0.00043521817922209064, 'samples': 7045248, 'steps': 36693, 'loss/train': 1.047343134880066}}} 11/07/2021 02:24:02 - INFO - __main__ - Step 36698: {'lr': 0.00043520392155156694, 'samples': 7046016, 'steps': 36697, 'loss/train': 1.2285186052322388}} 11/07/2021 02:24:02 - INFO - __main__ - Step 36698: {'lr': 0.00043520392155156694, 'samples': 7046016, 'steps': 36697, 'loss/train': 1.2285186052322388}} 11/07/2021 02:24:06 - INFO - __main__ - Step 36705: {'lr': 0.00043517896741538634, 'samples': 7047360, 'steps': 36704, 'loss/train': 1.3965660333633423}} 11/07/2021 02:24:08 - INFO - __main__ - Step 36709: {'lr': 0.0004351647060733088, 'samples': 7048128, 'steps': 36708, 'loss/train': 1.5138906240463257}}} 11/07/2021 02:24:10 - INFO - __main__ - Step 36714: {'lr': 0.0004351468775184959, 'samples': 7049088, 'steps': 36713, 'loss/train': 1.2794389724731445}}} 11/07/2021 02:24:10 - INFO - __main__ - Step 36714: {'lr': 0.0004351468775184959, 'samples': 7049088, 'steps': 36713, 'loss/train': 1.2794389724731445}}} 11/07/2021 02:24:13 - INFO - __main__ - Step 36721: {'lr': 0.00043512191403798095, 'samples': 7050432, 'steps': 36720, 'loss/train': 1.3332635164260864}} 11/07/2021 02:24:15 - INFO - __main__ - Step 36725: {'lr': 0.00043510764735684945, 'samples': 7051200, 'steps': 36724, 'loss/train': 1.5401238203048706}} 11/07/2021 02:24:18 - INFO - __main__ - Step 36730: {'lr': 0.00043508981212879737, 'samples': 7052160, 'steps': 36729, 'loss/train': 1.5463894605636597}} 11/07/2021 02:24:20 - INFO - __main__ - Step 36735: {'lr': 0.0004350719748157801, 'samples': 7053120, 'steps': 36734, 'loss/train': 1.727805733680725}7}} 11/07/2021 02:24:20 - INFO - __main__ - Step 36735: {'lr': 0.0004350719748157801, 'samples': 7053120, 'steps': 36734, 'loss/train': 1.727805733680725}7}} 11/07/2021 02:24:23 - INFO - __main__ - Step 36742: {'lr': 0.0004350469990751966, 'samples': 7054464, 'steps': 36741, 'loss/train': 1.233171820640564}7}} 11/07/2021 02:24:25 - INFO - __main__ - Step 36746: {'lr': 0.00043503272538905423, 'samples': 7055232, 'steps': 36745, 'loss/train': 1.5790698528289795}} 11/07/2021 02:24:28 - INFO - __main__ - Step 36751: {'lr': 0.00043501488140549824, 'samples': 7056192, 'steps': 36750, 'loss/train': 1.3899540901184082}} 11/07/2021 02:24:28 - INFO - __main__ - Step 36751: {'lr': 0.00043501488140549824, 'samples': 7056192, 'steps': 36750, 'loss/train': 1.3899540901184082}} 11/07/2021 02:24:32 - INFO - __main__ - Step 36759: {'lr': 0.00043498632669692, 'samples': 7057728, 'steps': 36758, 'loss/train': 1.6395902633666992}82}} 11/07/2021 02:24:33 - INFO - __main__ - Step 36763: {'lr': 0.0004349720473421318, 'samples': 7058496, 'steps': 36762, 'loss/train': 1.6168584823608398}}} 11/07/2021 02:24:36 - INFO - __main__ - Step 36767: {'lr': 0.0004349577666538148, 'samples': 7059264, 'steps': 36766, 'loss/train': 1.339677333831787}}}} 11/07/2021 02:24:38 - INFO - __main__ - Step 36772: {'lr': 0.0004349399139183005, 'samples': 7060224, 'steps': 36771, 'loss/train': 1.70767080783844}}}}} 11/07/2021 02:24:38 - INFO - __main__ - Step 36772: {'lr': 0.0004349399139183005, 'samples': 7060224, 'steps': 36771, 'loss/train': 1.70767080783844}}}}} 11/07/2021 02:24:42 - INFO - __main__ - Step 36780: {'lr': 0.0004349113452083456, 'samples': 7061760, 'steps': 36779, 'loss/train': 0.10330658406019211}} 11/07/2021 02:24:44 - INFO - __main__ - Step 36784: {'lr': 0.00043489705885367986, 'samples': 7062528, 'steps': 36783, 'loss/train': 1.6908950805664062}} 11/07/2021 02:24:46 - INFO - __main__ - Step 36788: {'lr': 0.000434882771166026, 'samples': 7063296, 'steps': 36787, 'loss/train': 1.6882917881011963}2}} 11/07/2021 02:24:48 - INFO - __main__ - Step 36792: {'lr': 0.00043486848214548693, 'samples': 7064064, 'steps': 36791, 'loss/train': 1.5211457014083862}} 11/07/2021 02:24:48 - INFO - __main__ - Step 36792: {'lr': 0.00043486848214548693, 'samples': 7064064, 'steps': 36791, 'loss/train': 1.5211457014083862}} 11/07/2021 02:24:52 - INFO - __main__ - Step 36799: {'lr': 0.0004348434731525984, 'samples': 7065408, 'steps': 36798, 'loss/train': 1.167148232460022}2}} 11/07/2021 02:24:54 - INFO - __main__ - Step 36804: {'lr': 0.00043482560708758876, 'samples': 7066368, 'steps': 36803, 'loss/train': 1.4844629764556885}} 11/07/2021 02:24:54 - INFO - __main__ - Step 36804: {'lr': 0.00043482560708758876, 'samples': 7066368, 'steps': 36803, 'loss/train': 1.4844629764556885}} 11/07/2021 02:24:58 - INFO - __main__ - Step 36812: {'lr': 0.0004347970170531197, 'samples': 7067904, 'steps': 36811, 'loss/train': 0.178208589553833}5}} 11/07/2021 02:25:00 - INFO - __main__ - Step 36816: {'lr': 0.00043478272003743315, 'samples': 7068672, 'steps': 36815, 'loss/train': 1.3449026346206665}} 11/07/2021 02:25:02 - INFO - __main__ - Step 36820: {'lr': 0.00043476842168958276, 'samples': 7069440, 'steps': 36819, 'loss/train': 1.308180332183838}}} 11/07/2021 02:25:05 - INFO - __main__ - Step 36825: {'lr': 0.00043475054688157136, 'samples': 7070400, 'steps': 36824, 'loss/train': 1.386857509613037}}} 11/07/2021 02:25:07 - INFO - __main__ - Step 36829: {'lr': 0.0004347362455367292, 'samples': 7071168, 'steps': 36828, 'loss/train': 1.2976011037826538}}} 11/07/2021 02:25:07 - INFO - __main__ - Step 36829: {'lr': 0.0004347362455367292, 'samples': 7071168, 'steps': 36828, 'loss/train': 1.2976011037826538}}} 11/07/2021 02:25:10 - INFO - __main__ - Step 36836: {'lr': 0.0004347112149786042, 'samples': 7072512, 'steps': 36835, 'loss/train': 1.5071020126342773}}} 11/07/2021 02:25:12 - INFO - __main__ - Step 36840: {'lr': 0.00043469690997148086, 'samples': 7073280, 'steps': 36839, 'loss/train': 2.168046474456787}}} 11/07/2021 02:25:14 - INFO - __main__ - Step 36845: {'lr': 0.0004346790268401033, 'samples': 7074240, 'steps': 36844, 'loss/train': 2.1799943447113037}}} 11/07/2021 02:25:14 - INFO - __main__ - Step 36845: {'lr': 0.0004346790268401033, 'samples': 7074240, 'steps': 36844, 'loss/train': 2.1799943447113037}}} 11/07/2021 02:25:19 - INFO - __main__ - Step 36853: {'lr': 0.0004346504095028799, 'samples': 7075776, 'steps': 36852, 'loss/train': 0.6996008157730103}}} 11/07/2021 02:25:20 - INFO - __main__ - Step 36857: {'lr': 0.0004346360988374016, 'samples': 7076544, 'steps': 36856, 'loss/train': 1.0569288730621338}}} 11/07/2021 02:25:22 - INFO - __main__ - Step 36861: {'lr': 0.00043462178684081657, 'samples': 7077312, 'steps': 36860, 'loss/train': 1.1887201070785522}} 11/07/2021 02:25:25 - INFO - __main__ - Step 36866: {'lr': 0.0004346038949733734, 'samples': 7078272, 'steps': 36865, 'loss/train': 1.4717979431152344}}} 11/07/2021 02:25:27 - INFO - __main__ - Step 36870: {'lr': 0.00043458957998217517, 'samples': 7079040, 'steps': 36869, 'loss/train': 0.5438855290412903}} 11/07/2021 02:25:27 - INFO - __main__ - Step 36870: {'lr': 0.00043458957998217517, 'samples': 7079040, 'steps': 36869, 'loss/train': 0.5438855290412903}} 11/07/2021 02:25:30 - INFO - __main__ - Step 36877: {'lr': 0.00043456452554547153, 'samples': 7080384, 'steps': 36876, 'loss/train': 1.5580357313156128}} 11/07/2021 02:25:32 - INFO - __main__ - Step 36882: {'lr': 0.0004345466270243646, 'samples': 7081344, 'steps': 36881, 'loss/train': 1.5124123096466064}}} 11/07/2021 02:25:35 - INFO - __main__ - Step 36887: {'lr': 0.00043452872642441124, 'samples': 7082304, 'steps': 36886, 'loss/train': 1.3987716436386108}} 11/07/2021 02:25:37 - INFO - __main__ - Step 36891: {'lr': 0.0004345144044478144, 'samples': 7083072, 'steps': 36890, 'loss/train': 1.3742188215255737}}} 11/07/2021 02:25:39 - INFO - __main__ - Step 36895: {'lr': 0.0004345000811409881, 'samples': 7083840, 'steps': 36894, 'loss/train': 1.6039793491363525}}} 11/07/2021 02:25:40 - INFO - __main__ - Step 36899: {'lr': 0.00043448575650403555, 'samples': 7084608, 'steps': 36898, 'loss/train': 1.2332217693328857}} 11/07/2021 02:25:43 - INFO - __main__ - Step 36903: {'lr': 0.00043447143053706007, 'samples': 7085376, 'steps': 36902, 'loss/train': 0.938480794429779}}} 11/07/2021 02:25:45 - INFO - __main__ - Step 36908: {'lr': 0.0004344535212081533, 'samples': 7086336, 'steps': 36907, 'loss/train': 1.481315016746521}}}} 11/07/2021 02:25:47 - INFO - __main__ - Step 36912: {'lr': 0.0004344391922490037, 'samples': 7087104, 'steps': 36911, 'loss/train': 1.4719501733779907}}} 11/07/2021 02:25:47 - INFO - __main__ - Step 36912: {'lr': 0.0004344391922490037, 'samples': 7087104, 'steps': 36911, 'loss/train': 1.4719501733779907}}} 11/07/2021 02:25:50 - INFO - __main__ - Step 36919: {'lr': 0.0004344141133709943, 'samples': 7088448, 'steps': 36918, 'loss/train': 1.3123325109481812}}} 11/07/2021 02:25:52 - INFO - __main__ - Step 36923: {'lr': 0.00043439978075545337, 'samples': 7089216, 'steps': 36922, 'loss/train': 1.4996906518936157}} 11/07/2021 02:25:55 - INFO - __main__ - Step 36928: {'lr': 0.00043438186311656624, 'samples': 7090176, 'steps': 36927, 'loss/train': 1.6770905256271362}} 11/07/2021 02:25:57 - INFO - __main__ - Step 36933: {'lr': 0.0004343639434006885, 'samples': 7091136, 'steps': 36932, 'loss/train': 1.7856611013412476}}} 11/07/2021 02:25:57 - INFO - __main__ - Step 36933: {'lr': 0.0004343639434006885, 'samples': 7091136, 'steps': 36932, 'loss/train': 1.7856611013412476}}} 11/07/2021 02:26:00 - INFO - __main__ - Step 36940: {'lr': 0.0004343388523095, 'samples': 7092480, 'steps': 36939, 'loss/train': 0.7859086394309998}76}}} 11/07/2021 02:26:02 - INFO - __main__ - Step 36944: {'lr': 0.0004343245127157456, 'samples': 7093248, 'steps': 36943, 'loss/train': 1.1533764600753784}}} 11/07/2021 02:26:05 - INFO - __main__ - Step 36949: {'lr': 0.0004343065863548548, 'samples': 7094208, 'steps': 36948, 'loss/train': 1.6003363132476807}}} 11/07/2021 02:26:07 - INFO - __main__ - Step 36953: {'lr': 0.00043429224377130964, 'samples': 7094976, 'steps': 36952, 'loss/train': 1.239585041999817}}} 11/07/2021 02:26:09 - INFO - __main__ - Step 36957: {'lr': 0.00043427789985913675, 'samples': 7095744, 'steps': 36956, 'loss/train': 1.3179750442504883}} 11/07/2021 02:26:11 - INFO - __main__ - Step 36961: {'lr': 0.00043426355461843934, 'samples': 7096512, 'steps': 36960, 'loss/train': 1.260001540184021}}} 11/07/2021 02:26:13 - INFO - __main__ - Step 36965: {'lr': 0.000434249208049321, 'samples': 7097280, 'steps': 36964, 'loss/train': 1.5644744634628296}}}} 11/07/2021 02:26:15 - INFO - __main__ - Step 36970: {'lr': 0.00043423127296998845, 'samples': 7098240, 'steps': 36969, 'loss/train': 1.783840298652649}}} 11/07/2021 02:26:17 - INFO - __main__ - Step 36974: {'lr': 0.0004342169234123009, 'samples': 7099008, 'steps': 36973, 'loss/train': 1.6480128765106201}}} 11/07/2021 02:26:19 - INFO - __main__ - Step 36978: {'lr': 0.0004342025725265285, 'samples': 7099776, 'steps': 36977, 'loss/train': 1.5415979623794556}}} 11/07/2021 02:26:19 - INFO - __main__ - Step 36978: {'lr': 0.0004342025725265285, 'samples': 7099776, 'steps': 36977, 'loss/train': 1.5415979623794556}}} 11/07/2021 02:26:22 - INFO - __main__ - Step 36985: {'lr': 0.0004341774552810339, 'samples': 7101120, 'steps': 36984, 'loss/train': 0.09527873247861862}} 11/07/2021 02:26:25 - INFO - __main__ - Step 36991: {'lr': 0.00043415592297693276, 'samples': 7102272, 'steps': 36990, 'loss/train': 1.5589598417282104}} 11/07/2021 02:26:27 - INFO - __main__ - Step 36995: {'lr': 0.0004341415664479541, 'samples': 7103040, 'steps': 36994, 'loss/train': 1.4736157655715942}}} 11/07/2021 02:26:27 - INFO - __main__ - Step 36995: {'lr': 0.0004341415664479541, 'samples': 7103040, 'steps': 36994, 'loss/train': 1.4736157655715942}}} 11/07/2021 02:26:31 - INFO - __main__ - Step 37002: {'lr': 0.00043411643932790686, 'samples': 7104384, 'steps': 37001, 'loss/train': 0.5130859613418579}} 11/07/2021 02:26:33 - INFO - __main__ - Step 37006: {'lr': 0.000434102079148438, 'samples': 7105152, 'steps': 37005, 'loss/train': 1.5979087352752686}9}} 11/07/2021 02:26:35 - INFO - __main__ - Step 37012: {'lr': 0.0004340805363906603, 'samples': 7106304, 'steps': 37011, 'loss/train': 1.3931164741516113}}} 11/07/2021 02:26:35 - INFO - __main__ - Step 37012: {'lr': 0.0004340805363906603, 'samples': 7106304, 'steps': 37011, 'loss/train': 1.3931164741516113}}} 11/07/2021 02:26:39 - INFO - __main__ - Step 37019: {'lr': 0.0004340553993993325, 'samples': 7107648, 'steps': 37018, 'loss/train': 1.8460217714309692}}} 11/07/2021 02:26:41 - INFO - __main__ - Step 37023: {'lr': 0.00043404103357973684, 'samples': 7108416, 'steps': 37022, 'loss/train': 1.1846529245376587}} 11/07/2021 02:26:43 - INFO - __main__ - Step 37027: {'lr': 0.00043402666643332444, 'samples': 7109184, 'steps': 37026, 'loss/train': 1.3075718879699707}} 11/07/2021 02:26:45 - INFO - __main__ - Step 37031: {'lr': 0.0004340122979601989, 'samples': 7109952, 'steps': 37030, 'loss/train': 1.4986871480941772}}} 11/07/2021 02:26:45 - INFO - __main__ - Step 37031: {'lr': 0.0004340122979601989, 'samples': 7109952, 'steps': 37030, 'loss/train': 1.4986871480941772}}} 11/07/2021 02:26:48 - INFO - __main__ - Step 37038: {'lr': 0.00043398714994013696, 'samples': 7111296, 'steps': 37037, 'loss/train': 1.315832257270813}}} 11/07/2021 02:26:50 - INFO - __main__ - Step 37042: {'lr': 0.0004339727778190842, 'samples': 7112064, 'steps': 37041, 'loss/train': 1.5488569736480713}}} 11/07/2021 02:26:53 - INFO - __main__ - Step 37047: {'lr': 0.00043395481080263614, 'samples': 7113024, 'steps': 37046, 'loss/train': 1.3028538227081299}} 11/07/2021 02:26:55 - INFO - __main__ - Step 37051: {'lr': 0.00043394043569749843, 'samples': 7113792, 'steps': 37050, 'loss/train': 1.555881381034851}}} 11/07/2021 02:26:55 - INFO - __main__ - Step 37051: {'lr': 0.00043394043569749843, 'samples': 7113792, 'steps': 37050, 'loss/train': 1.555881381034851}}} 11/07/2021 02:26:59 - INFO - __main__ - Step 37058: {'lr': 0.000433915276072662, 'samples': 7115136, 'steps': 37057, 'loss/train': 1.7411218881607056}}}} 11/07/2021 02:26:59 - INFO - __main__ - Step 37058: {'lr': 0.000433915276072662, 'samples': 7115136, 'steps': 37057, 'loss/train': 1.7411218881607056}}}} 11/07/2021 02:27:03 - INFO - __main__ - Step 37066: {'lr': 0.0004338865172435754, 'samples': 7116672, 'steps': 37065, 'loss/train': 1.5191227197647095}}} 11/07/2021 02:27:04 - INFO - __main__ - Step 37070: {'lr': 0.000433872135840426, 'samples': 7117440, 'steps': 37069, 'loss/train': 0.9483161568641663}}}} 11/07/2021 02:27:06 - INFO - __main__ - Step 37074: {'lr': 0.00043385775311167746, 'samples': 7118208, 'steps': 37073, 'loss/train': 1.103857398033142}}} 11/07/2021 02:27:09 - INFO - __main__ - Step 37079: {'lr': 0.0004338397728367759, 'samples': 7119168, 'steps': 37078, 'loss/train': 1.6730190515518188}}} 11/07/2021 02:27:09 - INFO - __main__ - Step 37079: {'lr': 0.0004338397728367759, 'samples': 7119168, 'steps': 37078, 'loss/train': 1.6730190515518188}}} 11/07/2021 02:27:13 - INFO - __main__ - Step 37087: {'lr': 0.0004338110000895787, 'samples': 7120704, 'steps': 37086, 'loss/train': 1.9725539684295654}}} 11/07/2021 02:27:14 - INFO - __main__ - Step 37091: {'lr': 0.00043379661172819075, 'samples': 7121472, 'steps': 37090, 'loss/train': 1.3050607442855835}} 11/07/2021 02:27:17 - INFO - __main__ - Step 37095: {'lr': 0.00043378222204174807, 'samples': 7122240, 'steps': 37094, 'loss/train': 1.6674373149871826}} 11/07/2021 02:27:19 - INFO - __main__ - Step 37100: {'lr': 0.00043376423307049455, 'samples': 7123200, 'steps': 37099, 'loss/train': 0.8144013285636902}} 11/07/2021 02:27:21 - INFO - __main__ - Step 37105: {'lr': 0.00043374624202920786, 'samples': 7124160, 'steps': 37104, 'loss/train': 1.4684909582138062}} 11/07/2021 02:27:21 - INFO - __main__ - Step 37105: {'lr': 0.00043374624202920786, 'samples': 7124160, 'steps': 37104, 'loss/train': 1.4684909582138062}} 11/07/2021 02:27:24 - INFO - __main__ - Step 37112: {'lr': 0.0004337210510941366, 'samples': 7125504, 'steps': 37111, 'loss/train': 1.1224045753479004}}} 11/07/2021 02:27:27 - INFO - __main__ - Step 37116: {'lr': 0.0004337066544528591, 'samples': 7126272, 'steps': 37115, 'loss/train': 0.6512836217880249}}} 11/07/2021 02:27:29 - INFO - __main__ - Step 37121: {'lr': 0.00043368865678882824, 'samples': 7127232, 'steps': 37120, 'loss/train': 1.4385783672332764}} 11/07/2021 02:27:29 - INFO - __main__ - Step 37121: {'lr': 0.00043368865678882824, 'samples': 7127232, 'steps': 37120, 'loss/train': 1.4385783672332764}} 11/07/2021 02:27:29 - INFO - __main__ - Step 37121: {'lr': 0.00043368865678882824, 'samples': 7127232, 'steps': 37120, 'loss/train': 1.4385783672332764}} 11/07/2021 02:27:34 - INFO - __main__ - Step 37132: {'lr': 0.00043364905464472563, 'samples': 7129344, 'steps': 37131, 'loss/train': 1.628318428993225}}} 11/07/2021 02:27:37 - INFO - __main__ - Step 37137: {'lr': 0.0004336310503600266, 'samples': 7130304, 'steps': 37136, 'loss/train': 1.0943663120269775}}} 11/07/2021 02:27:37 - INFO - __main__ - Step 37137: {'lr': 0.0004336310503600266, 'samples': 7130304, 'steps': 37136, 'loss/train': 1.0943663120269775}}} 11/07/2021 02:27:41 - INFO - __main__ - Step 37145: {'lr': 0.0004336022392020439, 'samples': 7131840, 'steps': 37144, 'loss/train': 1.4522278308868408}}} 11/07/2021 02:27:43 - INFO - __main__ - Step 37149: {'lr': 0.0004335878316375206, 'samples': 7132608, 'steps': 37148, 'loss/train': 1.4561712741851807}}} 11/07/2021 02:27:45 - INFO - __main__ - Step 37153: {'lr': 0.0004335734227494478, 'samples': 7133376, 'steps': 37152, 'loss/train': 1.6425676345825195}}} 11/07/2021 02:27:47 - INFO - __main__ - Step 37157: {'lr': 0.0004335590125379293, 'samples': 7134144, 'steps': 37156, 'loss/train': 1.66996169090271}5}}} 11/07/2021 02:27:49 - INFO - __main__ - Step 37162: {'lr': 0.00043354099791259414, 'samples': 7135104, 'steps': 37161, 'loss/train': 0.9119004011154175}} 11/07/2021 02:27:51 - INFO - __main__ - Step 37166: {'lr': 0.00043352658472370294, 'samples': 7135872, 'steps': 37165, 'loss/train': 1.2007473707199097}} 11/07/2021 02:27:53 - INFO - __main__ - Step 37170: {'lr': 0.0004335121702117038, 'samples': 7136640, 'steps': 37169, 'loss/train': 1.431115746498108}7}} 11/07/2021 02:27:55 - INFO - __main__ - Step 37174: {'lr': 0.00043349775437670046, 'samples': 7137408, 'steps': 37173, 'loss/train': 0.5817450881004333}} 11/07/2021 02:27:57 - INFO - __main__ - Step 37178: {'lr': 0.0004334833372187972, 'samples': 7138176, 'steps': 37177, 'loss/train': 1.3863235712051392}}} 11/07/2021 02:27:59 - INFO - __main__ - Step 37183: {'lr': 0.0004334653139112481, 'samples': 7139136, 'steps': 37182, 'loss/train': 1.3545410633087158}}} 11/07/2021 02:27:59 - INFO - __main__ - Step 37183: {'lr': 0.0004334653139112481, 'samples': 7139136, 'steps': 37182, 'loss/train': 1.3545410633087158}}} 11/07/2021 02:28:03 - INFO - __main__ - Step 37190: {'lr': 0.000433440077808726, 'samples': 7140480, 'steps': 37189, 'loss/train': 1.7172496318817139}}}} 11/07/2021 02:28:05 - INFO - __main__ - Step 37195: {'lr': 0.00043342204954151963, 'samples': 7141440, 'steps': 37194, 'loss/train': 1.5363059043884277}} 11/07/2021 02:28:08 - INFO - __main__ - Step 37200: {'lr': 0.0004334040192081347, 'samples': 7142400, 'steps': 37199, 'loss/train': 1.625350832939148}7}} 11/07/2021 02:28:10 - INFO - __main__ - Step 37204: {'lr': 0.0004333895934539146, 'samples': 7143168, 'steps': 37203, 'loss/train': 1.299027442932129}7}} 11/07/2021 02:28:12 - INFO - __main__ - Step 37208: {'lr': 0.00043337516637757416, 'samples': 7143936, 'steps': 37207, 'loss/train': 1.1694921255111694}} 11/07/2021 02:28:12 - INFO - __main__ - Step 37208: {'lr': 0.00043337516637757416, 'samples': 7143936, 'steps': 37207, 'loss/train': 1.1694921255111694}} 11/07/2021 02:28:15 - INFO - __main__ - Step 37215: {'lr': 0.00043334991581293924, 'samples': 7145280, 'steps': 37214, 'loss/train': 1.2101293802261353}} 11/07/2021 02:28:17 - INFO - __main__ - Step 37220: {'lr': 0.00043333187721687104, 'samples': 7146240, 'steps': 37219, 'loss/train': 1.8787128925323486}} 11/07/2021 02:28:20 - INFO - __main__ - Step 37225: {'lr': 0.00043331383655564003, 'samples': 7147200, 'steps': 37224, 'loss/train': 1.7596057653427124}} 11/07/2021 02:28:20 - INFO - __main__ - Step 37225: {'lr': 0.00043331383655564003, 'samples': 7147200, 'steps': 37224, 'loss/train': 1.7596057653427124}} 11/07/2021 02:28:23 - INFO - __main__ - Step 37232: {'lr': 0.00043328857616082986, 'samples': 7148544, 'steps': 37231, 'loss/train': 1.354921579360962}}} 11/07/2021 02:28:25 - INFO - __main__ - Step 37236: {'lr': 0.0004332741398325599, 'samples': 7149312, 'steps': 37235, 'loss/train': 1.7749427556991577}}} 11/07/2021 02:28:27 - INFO - __main__ - Step 37240: {'lr': 0.000433259702183002, 'samples': 7150080, 'steps': 37239, 'loss/train': 1.4372882843017578}}}} 11/07/2021 02:28:29 - INFO - __main__ - Step 37244: {'lr': 0.0004332452632122601, 'samples': 7150848, 'steps': 37243, 'loss/train': 1.967123031616211}}}} 11/07/2021 02:28:31 - INFO - __main__ - Step 37248: {'lr': 0.0004332308229204385, 'samples': 7151616, 'steps': 37247, 'loss/train': 1.775602102279663}}}} 11/07/2021 02:28:33 - INFO - __main__ - Step 37252: {'lr': 0.00043321638130764116, 'samples': 7152384, 'steps': 37251, 'loss/train': 1.5859137773513794}} 11/07/2021 02:28:35 - INFO - __main__ - Step 37257: {'lr': 0.000433198327434181, 'samples': 7153344, 'steps': 37256, 'loss/train': 1.7939229011535645}4}} 11/07/2021 02:28:38 - INFO - __main__ - Step 37262: {'lr': 0.0004331802714970624, 'samples': 7154304, 'steps': 37261, 'loss/train': 1.487119197845459}4}} 11/07/2021 02:28:40 - INFO - __main__ - Step 37266: {'lr': 0.00043316582526167004, 'samples': 7155072, 'steps': 37265, 'loss/train': 1.634678602218628}}} 11/07/2021 02:28:42 - INFO - __main__ - Step 37270: {'lr': 0.0004331513777057706, 'samples': 7155840, 'steps': 37269, 'loss/train': 1.3365075588226318}}} 11/07/2021 02:28:44 - INFO - __main__ - Step 37274: {'lr': 0.0004331369288294681, 'samples': 7156608, 'steps': 37273, 'loss/train': 1.0709071159362793}}} 11/07/2021 02:28:45 - INFO - __main__ - Step 37278: {'lr': 0.000433122478632867, 'samples': 7157376, 'steps': 37277, 'loss/train': 0.8798840641975403}}}} 11/07/2021 02:28:45 - INFO - __main__ - Step 37278: {'lr': 0.000433122478632867, 'samples': 7157376, 'steps': 37277, 'loss/train': 0.8798840641975403}}}} 11/07/2021 02:28:50 - INFO - __main__ - Step 37286: {'lr': 0.0004330935742791849, 'samples': 7158912, 'steps': 37285, 'loss/train': 1.4151886701583862}}} 11/07/2021 02:28:50 - INFO - __main__ - Step 37286: {'lr': 0.0004330935742791849, 'samples': 7158912, 'steps': 37285, 'loss/train': 1.4151886701583862}}} 11/07/2021 02:28:53 - INFO - __main__ - Step 37293: {'lr': 0.00043306827863847985, 'samples': 7160256, 'steps': 37292, 'loss/train': 1.3688923120498657}} 11/07/2021 02:28:55 - INFO - __main__ - Step 37298: {'lr': 0.0004330502078490258, 'samples': 7161216, 'steps': 37297, 'loss/train': 1.2880827188491821}}} 11/07/2021 02:28:55 - INFO - __main__ - Step 37298: {'lr': 0.0004330502078490258, 'samples': 7161216, 'steps': 37297, 'loss/train': 1.2880827188491821}}} 11/07/2021 02:28:59 - INFO - __main__ - Step 37306: {'lr': 0.0004330212902970447, 'samples': 7162752, 'steps': 37305, 'loss/train': 1.4084619283676147}}} 11/07/2021 02:29:01 - INFO - __main__ - Step 37310: {'lr': 0.0004330068295418044, 'samples': 7163520, 'steps': 37309, 'loss/train': 1.5030561685562134}}} 11/07/2021 02:29:03 - INFO - __main__ - Step 37314: {'lr': 0.0004329923674672032, 'samples': 7164288, 'steps': 37313, 'loss/train': 1.4974441528320312}}} 11/07/2021 02:29:05 - INFO - __main__ - Step 37319: {'lr': 0.0004329742880187594, 'samples': 7165248, 'steps': 37318, 'loss/train': 1.1523969173431396}}} 11/07/2021 02:29:05 - INFO - __main__ - Step 37319: {'lr': 0.0004329742880187594, 'samples': 7165248, 'steps': 37318, 'loss/train': 1.1523969173431396}}} 11/07/2021 02:29:09 - INFO - __main__ - Step 37327: {'lr': 0.0004329453566141737, 'samples': 7166784, 'steps': 37326, 'loss/train': 1.637865424156189}}}} 11/07/2021 02:29:11 - INFO - __main__ - Step 37331: {'lr': 0.0004329308889334522, 'samples': 7167552, 'steps': 37330, 'loss/train': 1.1413021087646484}}} 11/07/2021 02:29:13 - INFO - __main__ - Step 37335: {'lr': 0.00043291641993391727, 'samples': 7168320, 'steps': 37334, 'loss/train': 1.5530239343643188}} 11/07/2021 02:29:15 - INFO - __main__ - Step 37340: {'lr': 0.0004328983318300763, 'samples': 7169280, 'steps': 37339, 'loss/train': 1.6685435771942139}}} 11/07/2021 02:29:17 - INFO - __main__ - Step 37344: {'lr': 0.00043288385986359266, 'samples': 7170048, 'steps': 37343, 'loss/train': 1.6947556734085083}} 11/07/2021 02:29:20 - INFO - __main__ - Step 37348: {'lr': 0.00043286938657863483, 'samples': 7170816, 'steps': 37347, 'loss/train': 1.6188071966171265}} 11/07/2021 02:29:21 - INFO - __main__ - Step 37352: {'lr': 0.00043285491197530694, 'samples': 7171584, 'steps': 37351, 'loss/train': 0.8640336394309998}} 11/07/2021 02:29:23 - INFO - __main__ - Step 37356: {'lr': 0.00043284043605371346, 'samples': 7172352, 'steps': 37355, 'loss/train': 1.0829873085021973}} 11/07/2021 02:29:25 - INFO - __main__ - Step 37361: {'lr': 0.0004328223392980696, 'samples': 7173312, 'steps': 37360, 'loss/train': 1.577709436416626}3}} 11/07/2021 02:29:28 - INFO - __main__ - Step 37365: {'lr': 0.00043280786041076006, 'samples': 7174080, 'steps': 37364, 'loss/train': 0.9891050457954407}} 11/07/2021 02:29:30 - INFO - __main__ - Step 37369: {'lr': 0.0004327933802055241, 'samples': 7174848, 'steps': 37368, 'loss/train': 1.428175449371338}7}} 11/07/2021 02:29:32 - INFO - __main__ - Step 37373: {'lr': 0.00043277889868246605, 'samples': 7175616, 'steps': 37372, 'loss/train': 1.78399658203125}7}} 11/07/2021 02:29:33 - INFO - __main__ - Step 37377: {'lr': 0.0004327644158416905, 'samples': 7176384, 'steps': 37376, 'loss/train': 1.8409719467163086}}} 11/07/2021 02:29:35 - INFO - __main__ - Step 37381: {'lr': 0.0004327499316833016, 'samples': 7177152, 'steps': 37380, 'loss/train': 1.8813824653625488}}} 11/07/2021 02:29:38 - INFO - __main__ - Step 37386: {'lr': 0.0004327318246325811, 'samples': 7178112, 'steps': 37385, 'loss/train': 1.6646647453308105}}} 11/07/2021 02:29:38 - INFO - __main__ - Step 37386: {'lr': 0.0004327318246325811, 'samples': 7178112, 'steps': 37385, 'loss/train': 1.6646647453308105}}} 11/07/2021 02:29:41 - INFO - __main__ - Step 37393: {'lr': 0.0004327064713035002, 'samples': 7179456, 'steps': 37392, 'loss/train': 1.3795374631881714}}} 11/07/2021 02:29:43 - INFO - __main__ - Step 37397: {'lr': 0.0004326919818757028, 'samples': 7180224, 'steps': 37396, 'loss/train': 0.8401727676391602}}} 11/07/2021 02:29:45 - INFO - __main__ - Step 37402: {'lr': 0.00043267386823880904, 'samples': 7181184, 'steps': 37401, 'loss/train': 1.6912891864776611}} 11/07/2021 02:29:45 - INFO - __main__ - Step 37402: {'lr': 0.00043267386823880904, 'samples': 7181184, 'steps': 37401, 'loss/train': 1.6912891864776611}} 11/07/2021 02:29:45 - INFO - __main__ - Step 37402: {'lr': 0.00043267386823880904, 'samples': 7181184, 'steps': 37401, 'loss/train': 1.6912891864776611}} 11/07/2021 02:29:51 - INFO - __main__ - Step 37413: {'lr': 0.00043263401099464805, 'samples': 7183296, 'steps': 37412, 'loss/train': 3.1627933979034424}} 11/07/2021 02:29:54 - INFO - __main__ - Step 37418: {'lr': 0.0004326158907736706, 'samples': 7184256, 'steps': 37417, 'loss/train': 1.6083804368972778}}} 11/07/2021 02:29:56 - INFO - __main__ - Step 37422: {'lr': 0.00043260139311576863, 'samples': 7185024, 'steps': 37421, 'loss/train': 2.185137987136841}}} 11/07/2021 02:29:56 - INFO - __main__ - Step 37422: {'lr': 0.00043260139311576863, 'samples': 7185024, 'steps': 37421, 'loss/train': 2.185137987136841}}} 11/07/2021 02:29:59 - INFO - __main__ - Step 37429: {'lr': 0.0004325760190468243, 'samples': 7186368, 'steps': 37428, 'loss/train': 1.5231657028198242}}} 11/07/2021 02:30:02 - INFO - __main__ - Step 37435: {'lr': 0.0004325542666364793, 'samples': 7187520, 'steps': 37434, 'loss/train': 1.3707133531570435}}} 11/07/2021 02:30:04 - INFO - __main__ - Step 37439: {'lr': 0.00043253976338443814, 'samples': 7188288, 'steps': 37438, 'loss/train': 0.7496334314346313}} 11/07/2021 02:30:06 - INFO - __main__ - Step 37443: {'lr': 0.0004325252588164033, 'samples': 7189056, 'steps': 37442, 'loss/train': 1.2332797050476074}}} 11/07/2021 02:30:06 - INFO - __main__ - Step 37443: {'lr': 0.0004325252588164033, 'samples': 7189056, 'steps': 37442, 'loss/train': 1.2332797050476074}}} 11/07/2021 02:30:09 - INFO - __main__ - Step 37450: {'lr': 0.0004324998726560473, 'samples': 7190400, 'steps': 37449, 'loss/train': 1.1127818822860718}}} 11/07/2021 02:30:12 - INFO - __main__ - Step 37456: {'lr': 0.0004324781098829732, 'samples': 7191552, 'steps': 37455, 'loss/train': 1.2282586097717285}}} 11/07/2021 02:30:12 - INFO - __main__ - Step 37456: {'lr': 0.0004324781098829732, 'samples': 7191552, 'steps': 37455, 'loss/train': 1.2282586097717285}}} 11/07/2021 02:30:16 - INFO - __main__ - Step 37463: {'lr': 0.0004324527162399854, 'samples': 7192896, 'steps': 37462, 'loss/train': 1.5770810842514038}}} 11/07/2021 02:30:17 - INFO - __main__ - Step 37467: {'lr': 0.00043243820377818524, 'samples': 7193664, 'steps': 37466, 'loss/train': 1.4279179573059082}} 11/07/2021 02:30:20 - INFO - __main__ - Step 37471: {'lr': 0.00043242369000112365, 'samples': 7194432, 'steps': 37470, 'loss/train': 1.961127519607544}}} 11/07/2021 02:30:22 - INFO - __main__ - Step 37475: {'lr': 0.0004324091749089052, 'samples': 7195200, 'steps': 37474, 'loss/train': 1.4887938499450684}}} 11/07/2021 02:30:23 - INFO - __main__ - Step 37479: {'lr': 0.0004323946585016347, 'samples': 7195968, 'steps': 37478, 'loss/train': 1.6027249097824097}}} 11/07/2021 02:30:25 - INFO - __main__ - Step 37483: {'lr': 0.00043238014077941656, 'samples': 7196736, 'steps': 37482, 'loss/train': 1.6039621829986572}} 11/07/2021 02:30:28 - INFO - __main__ - Step 37488: {'lr': 0.00043236199177765856, 'samples': 7197696, 'steps': 37487, 'loss/train': 1.7081737518310547}} 11/07/2021 02:30:30 - INFO - __main__ - Step 37493: {'lr': 0.0004323438407216631, 'samples': 7198656, 'steps': 37492, 'loss/train': 1.6030843257904053}}} 11/07/2021 02:30:30 - INFO - __main__ - Step 37493: {'lr': 0.0004323438407216631, 'samples': 7198656, 'steps': 37492, 'loss/train': 1.6030843257904053}}} 11/07/2021 02:30:34 - INFO - __main__ - Step 37500: {'lr': 0.0004323184257925397, 'samples': 7200000, 'steps': 37499, 'loss/train': 1.5604526996612549}}} 11/07/2021 02:30:35 - INFO - __main__ - Step 37504: {'lr': 0.00043230390116856467, 'samples': 7200768, 'steps': 37503, 'loss/train': 0.5808529257774353}} 11/07/2021 02:30:38 - INFO - __main__ - Step 37509: {'lr': 0.00043228574354038326, 'samples': 7201728, 'steps': 37508, 'loss/train': 1.5501351356506348}} 11/07/2021 02:30:40 - INFO - __main__ - Step 37514: {'lr': 0.0004322675838588234, 'samples': 7202688, 'steps': 37513, 'loss/train': 1.200168490409851}8}} 11/07/2021 02:30:42 - INFO - __main__ - Step 37518: {'lr': 0.0004322530546352803, 'samples': 7203456, 'steps': 37517, 'loss/train': 1.74091374874115}}8}} 11/07/2021 02:30:42 - INFO - __main__ - Step 37518: {'lr': 0.0004322530546352803, 'samples': 7203456, 'steps': 37517, 'loss/train': 1.74091374874115}}8}} 11/07/2021 02:30:46 - INFO - __main__ - Step 37525: {'lr': 0.000432227625332507, 'samples': 7204800, 'steps': 37524, 'loss/train': 0.7475195527076721}8}} 11/07/2021 02:30:48 - INFO - __main__ - Step 37529: {'lr': 0.0004322130924959178, 'samples': 7205568, 'steps': 37528, 'loss/train': 1.5095049142837524}}} 11/07/2021 02:30:48 - INFO - __main__ - Step 37529: {'lr': 0.0004322130924959178, 'samples': 7205568, 'steps': 37528, 'loss/train': 1.5095049142837524}}} 11/07/2021 02:30:52 - INFO - __main__ - Step 37537: {'lr': 0.0004321840228819286, 'samples': 7207104, 'steps': 37536, 'loss/train': 1.0783336162567139}}} 11/07/2021 02:30:54 - INFO - __main__ - Step 37541: {'lr': 0.00043216948610473816, 'samples': 7207872, 'steps': 37540, 'loss/train': 1.5212979316711426}} 11/07/2021 02:30:56 - INFO - __main__ - Step 37546: {'lr': 0.0004321513132864003, 'samples': 7208832, 'steps': 37545, 'loss/train': 1.4994314908981323}}} 11/07/2021 02:30:58 - INFO - __main__ - Step 37550: {'lr': 0.00043213677355437795, 'samples': 7209600, 'steps': 37549, 'loss/train': 1.087836742401123}}} 11/07/2021 02:30:58 - INFO - __main__ - Step 37550: {'lr': 0.00043213677355437795, 'samples': 7209600, 'steps': 37549, 'loss/train': 1.087836742401123}}} 11/07/2021 02:31:02 - INFO - __main__ - Step 37557: {'lr': 0.0004321113258637832, 'samples': 7210944, 'steps': 37556, 'loss/train': 1.3894625902175903}}} 11/07/2021 02:31:04 - INFO - __main__ - Step 37562: {'lr': 0.00043209314648020035, 'samples': 7211904, 'steps': 37561, 'loss/train': 1.5488879680633545}} 11/07/2021 02:31:04 - INFO - __main__ - Step 37562: {'lr': 0.00043209314648020035, 'samples': 7211904, 'steps': 37561, 'loss/train': 1.5488879680633545}} 11/07/2021 02:31:08 - INFO - __main__ - Step 37570: {'lr': 0.00043206405520003824, 'samples': 7213440, 'steps': 37569, 'loss/train': 1.5961387157440186}} 11/07/2021 02:31:10 - INFO - __main__ - Step 37574: {'lr': 0.00043204950759105865, 'samples': 7214208, 'steps': 37573, 'loss/train': 0.9354721903800964}} 11/07/2021 02:31:12 - INFO - __main__ - Step 37578: {'lr': 0.00043203495866961996, 'samples': 7214976, 'steps': 37577, 'loss/train': 1.2092102766036987}} 11/07/2021 02:31:14 - INFO - __main__ - Step 37583: {'lr': 0.00043201677067233554, 'samples': 7215936, 'steps': 37582, 'loss/train': 2.1765105724334717}} 11/07/2021 02:31:16 - INFO - __main__ - Step 37587: {'lr': 0.00043200221879824706, 'samples': 7216704, 'steps': 37586, 'loss/train': 1.7899430990219116}} 11/07/2021 02:31:18 - INFO - __main__ - Step 37591: {'lr': 0.00043198766561204047, 'samples': 7217472, 'steps': 37590, 'loss/train': 1.7535258531570435}} 11/07/2021 02:31:20 - INFO - __main__ - Step 37595: {'lr': 0.00043197311111382045, 'samples': 7218240, 'steps': 37594, 'loss/train': 1.6020832061767578}} 11/07/2021 02:31:22 - INFO - __main__ - Step 37599: {'lr': 0.000431958555303692, 'samples': 7219008, 'steps': 37598, 'loss/train': 1.4375860691070557}8}} 11/07/2021 02:31:22 - INFO - __main__ - Step 37599: {'lr': 0.000431958555303692, 'samples': 7219008, 'steps': 37598, 'loss/train': 1.4375860691070557}8}} 11/07/2021 02:31:22 - INFO - __main__ - Step 37599: {'lr': 0.000431958555303692, 'samples': 7219008, 'steps': 37598, 'loss/train': 1.4375860691070557}8}} 11/07/2021 02:31:28 - INFO - __main__ - Step 37610: {'lr': 0.0004319185200621678, 'samples': 7221120, 'steps': 37609, 'loss/train': 0.6669997572898865}}} 11/07/2021 02:31:30 - INFO - __main__ - Step 37615: {'lr': 0.00043190031894619306, 'samples': 7222080, 'steps': 37614, 'loss/train': 1.7094345092773438}} 11/07/2021 02:31:33 - INFO - __main__ - Step 37619: {'lr': 0.00043188575657809685, 'samples': 7222848, 'steps': 37618, 'loss/train': 1.844686508178711}}} 11/07/2021 02:31:33 - INFO - __main__ - Step 37619: {'lr': 0.00043188575657809685, 'samples': 7222848, 'steps': 37618, 'loss/train': 1.844686508178711}}} 11/07/2021 02:31:36 - INFO - __main__ - Step 37626: {'lr': 0.00043186026927872736, 'samples': 7224192, 'steps': 37625, 'loss/train': 1.1585700511932373}} 11/07/2021 02:31:39 - INFO - __main__ - Step 37632: {'lr': 0.0004318384198263099, 'samples': 7225344, 'steps': 37631, 'loss/train': 1.4363213777542114}}} 11/07/2021 02:31:39 - INFO - __main__ - Step 37632: {'lr': 0.0004318384198263099, 'samples': 7225344, 'steps': 37631, 'loss/train': 1.4363213777542114}}} 11/07/2021 02:31:42 - INFO - __main__ - Step 37639: {'lr': 0.0004318129250705361, 'samples': 7226688, 'steps': 37638, 'loss/train': 1.8374136686325073}}} 11/07/2021 02:31:44 - INFO - __main__ - Step 37643: {'lr': 0.0004317983548363431, 'samples': 7227456, 'steps': 37642, 'loss/train': 1.6383424997329712}}} 11/07/2021 02:31:46 - INFO - __main__ - Step 37647: {'lr': 0.0004317837832915016, 'samples': 7228224, 'steps': 37646, 'loss/train': 1.8266150951385498}}} 11/07/2021 02:31:48 - INFO - __main__ - Step 37652: {'lr': 0.0004317655670175102, 'samples': 7229184, 'steps': 37651, 'loss/train': 1.1823043823242188}}} 11/07/2021 02:31:48 - INFO - __main__ - Step 37652: {'lr': 0.0004317655670175102, 'samples': 7229184, 'steps': 37651, 'loss/train': 1.1823043823242188}}} 11/07/2021 02:31:51 - INFO - __main__ - Step 37659: {'lr': 0.0004317400607941364, 'samples': 7230528, 'steps': 37658, 'loss/train': 1.5288509130477905}}} 11/07/2021 02:31:54 - INFO - __main__ - Step 37663: {'lr': 0.0004317254840077514, 'samples': 7231296, 'steps': 37662, 'loss/train': 1.6886277198791504}}} 11/07/2021 02:31:56 - INFO - __main__ - Step 37668: {'lr': 0.00043170726118242164, 'samples': 7232256, 'steps': 37667, 'loss/train': 2.937180280685425}}} 11/07/2021 02:31:58 - INFO - __main__ - Step 37673: {'lr': 0.0004316890363102298, 'samples': 7233216, 'steps': 37672, 'loss/train': 1.3016736507415771}}} 11/07/2021 02:31:58 - INFO - __main__ - Step 37673: {'lr': 0.0004316890363102298, 'samples': 7233216, 'steps': 37672, 'loss/train': 1.3016736507415771}}} 11/07/2021 02:32:02 - INFO - __main__ - Step 37680: {'lr': 0.0004316635180508235, 'samples': 7234560, 'steps': 37679, 'loss/train': 1.6097488403320312}}} 11/07/2021 02:32:04 - INFO - __main__ - Step 37684: {'lr': 0.0004316489343874644, 'samples': 7235328, 'steps': 37683, 'loss/train': 1.3470348119735718}}} 11/07/2021 02:32:06 - INFO - __main__ - Step 37689: {'lr': 0.00043163070296669317, 'samples': 7236288, 'steps': 37688, 'loss/train': 1.1718648672103882}} 11/07/2021 02:32:08 - INFO - __main__ - Step 37694: {'lr': 0.0004316124694999222, 'samples': 7237248, 'steps': 37693, 'loss/train': 1.347544550895691}2}} 11/07/2021 02:32:10 - INFO - __main__ - Step 37698: {'lr': 0.00043159788125352353, 'samples': 7238016, 'steps': 37697, 'loss/train': 1.798108458518982}}} 11/07/2021 02:32:10 - INFO - __main__ - Step 37698: {'lr': 0.00043159788125352353, 'samples': 7238016, 'steps': 37697, 'loss/train': 1.798108458518982}}} 11/07/2021 02:32:14 - INFO - __main__ - Step 37705: {'lr': 0.0004315723486721188, 'samples': 7239360, 'steps': 37704, 'loss/train': 0.9939437508583069}}} 11/07/2021 02:32:16 - INFO - __main__ - Step 37710: {'lr': 0.0004315541086595288, 'samples': 7240320, 'steps': 37709, 'loss/train': 1.5813043117523193}}} 11/07/2021 02:32:18 - INFO - __main__ - Step 37714: {'lr': 0.00043153951517694824, 'samples': 7241088, 'steps': 37713, 'loss/train': 1.300570011138916}}} 11/07/2021 02:32:20 - INFO - __main__ - Step 37718: {'lr': 0.00043152492038558526, 'samples': 7241856, 'steps': 37717, 'loss/train': 1.6024607419967651}} 11/07/2021 02:32:20 - INFO - __main__ - Step 37718: {'lr': 0.00043152492038558526, 'samples': 7241856, 'steps': 37717, 'loss/train': 1.6024607419967651}} 11/07/2021 02:32:24 - INFO - __main__ - Step 37725: {'lr': 0.00043149937635175874, 'samples': 7243200, 'steps': 37724, 'loss/train': 1.3453290462493896}} 11/07/2021 02:32:27 - INFO - __main__ - Step 37731: {'lr': 0.0004314774782761484, 'samples': 7244352, 'steps': 37730, 'loss/train': 1.2930500507354736}}} 11/07/2021 02:32:29 - INFO - __main__ - Step 37735: {'lr': 0.0004314628779236339, 'samples': 7245120, 'steps': 37734, 'loss/train': 1.8617697954177856}}} 11/07/2021 02:32:31 - INFO - __main__ - Step 37739: {'lr': 0.00043144827626288943, 'samples': 7245888, 'steps': 37738, 'loss/train': 1.5663368701934814}} 11/07/2021 02:32:32 - INFO - __main__ - Step 37743: {'lr': 0.0004314336732940202, 'samples': 7246656, 'steps': 37742, 'loss/train': 0.8476759195327759}}} 11/07/2021 02:32:34 - INFO - __main__ - Step 37747: {'lr': 0.0004314190690171317, 'samples': 7247424, 'steps': 37746, 'loss/train': 1.3477909564971924}}} 11/07/2021 02:32:37 - INFO - __main__ - Step 37752: {'lr': 0.000431400811831779, 'samples': 7248384, 'steps': 37751, 'loss/train': 1.698687195777893}4}}} 11/07/2021 02:32:37 - INFO - __main__ - Step 37752: {'lr': 0.000431400811831779, 'samples': 7248384, 'steps': 37751, 'loss/train': 1.698687195777893}4}}} 11/07/2021 02:32:40 - INFO - __main__ - Step 37759: {'lr': 0.00043137524833940233, 'samples': 7249728, 'steps': 37758, 'loss/train': 1.6888084411621094}} 11/07/2021 02:32:42 - INFO - __main__ - Step 37763: {'lr': 0.00043136063883148905, 'samples': 7250496, 'steps': 37762, 'loss/train': 1.5382124185562134}} 11/07/2021 02:32:44 - INFO - __main__ - Step 37767: {'lr': 0.00043134602801608293, 'samples': 7251264, 'steps': 37766, 'loss/train': 1.1817660331726074}} 11/07/2021 02:32:47 - INFO - __main__ - Step 37773: {'lr': 0.000431324109341655, 'samples': 7252416, 'steps': 37772, 'loss/train': 1.9840986728668213}4}} 11/07/2021 02:32:49 - INFO - __main__ - Step 37777: {'lr': 0.0004313094952579775, 'samples': 7253184, 'steps': 37776, 'loss/train': 1.7181931734085083}}} 11/07/2021 02:32:49 - INFO - __main__ - Step 37777: {'lr': 0.0004313094952579775, 'samples': 7253184, 'steps': 37776, 'loss/train': 1.7181931734085083}}} 11/07/2021 02:32:52 - INFO - __main__ - Step 37784: {'lr': 0.0004312839174663377, 'samples': 7254528, 'steps': 37783, 'loss/train': 1.327996850013733}}}} 11/07/2021 02:32:54 - INFO - __main__ - Step 37788: {'lr': 0.00043126929978832217, 'samples': 7255296, 'steps': 37787, 'loss/train': 1.6394915580749512}} 11/07/2021 02:32:56 - INFO - __main__ - Step 37793: {'lr': 0.0004312510258530794, 'samples': 7256256, 'steps': 37792, 'loss/train': 1.8957017660140991}}} 11/07/2021 02:32:56 - INFO - __main__ - Step 37793: {'lr': 0.0004312510258530794, 'samples': 7256256, 'steps': 37792, 'loss/train': 1.8957017660140991}}} 11/07/2021 02:33:00 - INFO - __main__ - Step 37800: {'lr': 0.0004312254389136911, 'samples': 7257600, 'steps': 37799, 'loss/train': 1.2227293252944946}}} 11/07/2021 02:33:02 - INFO - __main__ - Step 37804: {'lr': 0.0004312108160089706, 'samples': 7258368, 'steps': 37803, 'loss/train': 1.2928982973098755}}} 11/07/2021 02:33:05 - INFO - __main__ - Step 37809: {'lr': 0.0004311925355409393, 'samples': 7259328, 'steps': 37808, 'loss/train': 1.3604049682617188}}} 11/07/2021 02:33:05 - INFO - __main__ - Step 37809: {'lr': 0.0004311925355409393, 'samples': 7259328, 'steps': 37808, 'loss/train': 1.3604049682617188}}} 11/07/2021 02:33:05 - INFO - __main__ - Step 37809: {'lr': 0.0004311925355409393, 'samples': 7259328, 'steps': 37808, 'loss/train': 1.3604049682617188}}} 11/07/2021 02:33:11 - INFO - __main__ - Step 37819: {'lr': 0.0004311559684818905, 'samples': 7261248, 'steps': 37818, 'loss/train': 1.7777564525604248}}} 11/07/2021 02:33:12 - INFO - __main__ - Step 37823: {'lr': 0.00043114133937264843, 'samples': 7262016, 'steps': 37822, 'loss/train': 1.4681305885314941}} 11/07/2021 02:33:14 - INFO - __main__ - Step 37827: {'lr': 0.0004311267089574944, 'samples': 7262784, 'steps': 37826, 'loss/train': 1.7053385972976685}}} 11/07/2021 02:33:17 - INFO - __main__ - Step 37832: {'lr': 0.0004311084191022741, 'samples': 7263744, 'steps': 37831, 'loss/train': 1.5728517770767212}}} 11/07/2021 02:33:19 - INFO - __main__ - Step 37836: {'lr': 0.000431093785749204, 'samples': 7264512, 'steps': 37835, 'loss/train': 1.6409319639205933}}}} 11/07/2021 02:33:19 - INFO - __main__ - Step 37836: {'lr': 0.000431093785749204, 'samples': 7264512, 'steps': 37835, 'loss/train': 1.6409319639205933}}}} 11/07/2021 02:33:22 - INFO - __main__ - Step 37843: {'lr': 0.00043106817423986933, 'samples': 7265856, 'steps': 37842, 'loss/train': 1.6606569290161133}} 11/07/2021 02:33:25 - INFO - __main__ - Step 37848: {'lr': 0.0004310498778570016, 'samples': 7266816, 'steps': 37847, 'loss/train': 1.5267208814620972}}} 11/07/2021 02:33:25 - INFO - __main__ - Step 37848: {'lr': 0.0004310498778570016, 'samples': 7266816, 'steps': 37847, 'loss/train': 1.5267208814620972}}} 11/07/2021 02:33:29 - INFO - __main__ - Step 37856: {'lr': 0.00043102059940242825, 'samples': 7268352, 'steps': 37855, 'loss/train': 1.6300129890441895}} 11/07/2021 02:33:31 - INFO - __main__ - Step 37860: {'lr': 0.00043100595821752674, 'samples': 7269120, 'steps': 37859, 'loss/train': 1.3307468891143799}} 11/07/2021 02:33:32 - INFO - __main__ - Step 37864: {'lr': 0.00043099131572768936, 'samples': 7269888, 'steps': 37863, 'loss/train': 1.4234338998794556}} 11/07/2021 02:33:35 - INFO - __main__ - Step 37869: {'lr': 0.00043097301078048736, 'samples': 7270848, 'steps': 37868, 'loss/train': 2.2756893634796143}} 11/07/2021 02:33:37 - INFO - __main__ - Step 37874: {'lr': 0.0004309547037946941, 'samples': 7271808, 'steps': 37873, 'loss/train': 1.276893138885498}3}} 11/07/2021 02:33:37 - INFO - __main__ - Step 37874: {'lr': 0.0004309547037946941, 'samples': 7271808, 'steps': 37873, 'loss/train': 1.276893138885498}3}} 11/07/2021 02:33:41 - INFO - __main__ - Step 37881: {'lr': 0.00043092907059014325, 'samples': 7273152, 'steps': 37880, 'loss/train': 1.5659189224243164}} 11/07/2021 02:33:42 - INFO - __main__ - Step 37885: {'lr': 0.0004309144212511246, 'samples': 7273920, 'steps': 37884, 'loss/train': 2.258639097213745}4}} 11/07/2021 02:33:45 - INFO - __main__ - Step 37890: {'lr': 0.00043089610774322575, 'samples': 7274880, 'steps': 37889, 'loss/train': 1.6858359575271606}} 11/07/2021 02:33:47 - INFO - __main__ - Step 37894: {'lr': 0.0004308814554697348, 'samples': 7275648, 'steps': 37893, 'loss/train': 1.5547536611557007}}} 11/07/2021 02:33:47 - INFO - __main__ - Step 37894: {'lr': 0.0004308814554697348, 'samples': 7275648, 'steps': 37893, 'loss/train': 1.5547536611557007}}} 11/07/2021 02:33:50 - INFO - __main__ - Step 37900: {'lr': 0.0004308594746144596, 'samples': 7276800, 'steps': 37899, 'loss/train': 1.532272219657898}}}} 11/07/2021 02:33:53 - INFO - __main__ - Step 37905: {'lr': 0.00043084115499400505, 'samples': 7277760, 'steps': 37904, 'loss/train': 1.2803417444229126}} 11/07/2021 02:33:53 - INFO - __main__ - Step 37905: {'lr': 0.00043084115499400505, 'samples': 7277760, 'steps': 37904, 'loss/train': 1.2803417444229126}} 11/07/2021 02:33:53 - INFO - __main__ - Step 37905: {'lr': 0.00043084115499400505, 'samples': 7277760, 'steps': 37904, 'loss/train': 1.2803417444229126}} 11/07/2021 02:33:58 - INFO - __main__ - Step 37916: {'lr': 0.00043080084465868307, 'samples': 7279872, 'steps': 37915, 'loss/train': 1.9730298519134521}} 11/07/2021 02:34:01 - INFO - __main__ - Step 37921: {'lr': 0.00043078251852021634, 'samples': 7280832, 'steps': 37920, 'loss/train': 1.6662101745605469}} 11/07/2021 02:34:03 - INFO - __main__ - Step 37925: {'lr': 0.00043076785614319234, 'samples': 7281600, 'steps': 37924, 'loss/train': 1.733856201171875}}} 11/07/2021 02:34:03 - INFO - __main__ - Step 37925: {'lr': 0.00043076785614319234, 'samples': 7281600, 'steps': 37924, 'loss/train': 1.733856201171875}}} 11/07/2021 02:34:06 - INFO - __main__ - Step 37933: {'lr': 0.0004307385274795923, 'samples': 7283136, 'steps': 37932, 'loss/train': 1.5373013019561768}}} 11/07/2021 02:34:08 - INFO - __main__ - Step 37937: {'lr': 0.0004307238611932276, 'samples': 7283904, 'steps': 37936, 'loss/train': 1.5360777378082275}}} 11/07/2021 02:34:08 - INFO - __main__ - Step 37937: {'lr': 0.0004307238611932276, 'samples': 7283904, 'steps': 37936, 'loss/train': 1.5360777378082275}}} 11/07/2021 02:34:12 - INFO - __main__ - Step 37944: {'lr': 0.0004306981920570447, 'samples': 7285248, 'steps': 37943, 'loss/train': 1.4249544143676758}}} 11/07/2021 02:34:15 - INFO - __main__ - Step 37949: {'lr': 0.00043067985451714373, 'samples': 7286208, 'steps': 37948, 'loss/train': 1.5056431293487549}} 11/07/2021 02:34:15 - INFO - __main__ - Step 37949: {'lr': 0.00043067985451714373, 'samples': 7286208, 'steps': 37948, 'loss/train': 1.5056431293487549}} 11/07/2021 02:34:18 - INFO - __main__ - Step 37956: {'lr': 0.00043065417854204333, 'samples': 7287552, 'steps': 37955, 'loss/train': 1.7715234756469727}} 11/07/2021 02:34:20 - INFO - __main__ - Step 37960: {'lr': 0.00043063950476543563, 'samples': 7288320, 'steps': 37959, 'loss/train': 2.0494678020477295}} 11/07/2021 02:34:23 - INFO - __main__ - Step 37965: {'lr': 0.00043062116071333745, 'samples': 7289280, 'steps': 37964, 'loss/train': 1.1325451135635376}} 11/07/2021 02:34:23 - INFO - __main__ - Step 37965: {'lr': 0.00043062116071333745, 'samples': 7289280, 'steps': 37964, 'loss/train': 1.1325451135635376}} 11/07/2021 02:34:27 - INFO - __main__ - Step 37972: {'lr': 0.00043059547562227185, 'samples': 7290624, 'steps': 37971, 'loss/train': 1.5636862516403198}} 11/07/2021 02:34:28 - INFO - __main__ - Step 37976: {'lr': 0.00043058079663712304, 'samples': 7291392, 'steps': 37975, 'loss/train': 1.4831064939498901}} 11/07/2021 02:34:28 - INFO - __main__ - Step 37976: {'lr': 0.00043058079663712304, 'samples': 7291392, 'steps': 37975, 'loss/train': 1.4831064939498901}} 11/07/2021 02:34:32 - INFO - __main__ - Step 37983: {'lr': 0.0004305551052805499, 'samples': 7292736, 'steps': 37982, 'loss/train': 1.0468441247940063}}} 11/07/2021 02:34:35 - INFO - __main__ - Step 37987: {'lr': 0.0004305404227155113, 'samples': 7293504, 'steps': 37986, 'loss/train': 1.0172924995422363}}} 11/07/2021 02:34:35 - INFO - __main__ - Step 37987: {'lr': 0.0004305404227155113, 'samples': 7293504, 'steps': 37986, 'loss/train': 1.0172924995422363}}} 11/07/2021 02:34:39 - INFO - __main__ - Step 37995: {'lr': 0.00043051105368080103, 'samples': 7295040, 'steps': 37994, 'loss/train': 1.5717216730117798}} 11/07/2021 02:34:40 - INFO - __main__ - Step 37999: {'lr': 0.000430496367211341, 'samples': 7295808, 'steps': 37998, 'loss/train': 1.7385200262069702}8}} 11/07/2021 02:34:42 - INFO - __main__ - Step 38003: {'lr': 0.000430481679440619, 'samples': 7296576, 'steps': 38002, 'loss/train': 0.9746781587600708}8}} 11/07/2021 02:34:42 - INFO - __main__ - Step 38003: {'lr': 0.000430481679440619, 'samples': 7296576, 'steps': 38002, 'loss/train': 0.9746781587600708}8}} 11/07/2021 02:34:47 - INFO - __main__ - Step 38011: {'lr': 0.0004304522999958124, 'samples': 7298112, 'steps': 38010, 'loss/train': 2.6189193725585938}}} 11/07/2021 02:34:48 - INFO - __main__ - Step 38015: {'lr': 0.0004304376083219396, 'samples': 7298880, 'steps': 38014, 'loss/train': 1.4761950969696045}}} 11/07/2021 02:34:50 - INFO - __main__ - Step 38019: {'lr': 0.0004304229153472283, 'samples': 7299648, 'steps': 38018, 'loss/train': 1.200123906135559}}}} 11/07/2021 02:34:53 - INFO - __main__ - Step 38024: {'lr': 0.0004304045472996966, 'samples': 7300608, 'steps': 38023, 'loss/train': 1.5964939594268799}}} 11/07/2021 02:34:55 - INFO - __main__ - Step 38029: {'lr': 0.0004303861772199773, 'samples': 7301568, 'steps': 38028, 'loss/train': 1.200292706489563}}}} 11/07/2021 02:34:57 - INFO - __main__ - Step 38033: {'lr': 0.0004303714796931658, 'samples': 7302336, 'steps': 38032, 'loss/train': 1.8147969245910645}}} 11/07/2021 02:34:59 - INFO - __main__ - Step 38037: {'lr': 0.00043035678086599265, 'samples': 7303104, 'steps': 38036, 'loss/train': 1.323421597480774}}} 11/07/2021 02:35:01 - INFO - __main__ - Step 38041: {'lr': 0.00043034208073856374, 'samples': 7303872, 'steps': 38040, 'loss/train': 1.8999801874160767}} 11/07/2021 02:35:03 - INFO - __main__ - Step 38045: {'lr': 0.00043032737931098517, 'samples': 7304640, 'steps': 38044, 'loss/train': 1.5681344270706177}} 11/07/2021 02:35:05 - INFO - __main__ - Step 38050: {'lr': 0.00043030900069833774, 'samples': 7305600, 'steps': 38049, 'loss/train': 1.630570888519287}}} 11/07/2021 02:35:05 - INFO - __main__ - Step 38050: {'lr': 0.00043030900069833774, 'samples': 7305600, 'steps': 38049, 'loss/train': 1.630570888519287}}} 11/07/2021 02:35:08 - INFO - __main__ - Step 38057: {'lr': 0.00043028326722841073, 'samples': 7306944, 'steps': 38056, 'loss/train': 1.8193788528442383}} 11/07/2021 02:35:10 - INFO - __main__ - Step 38061: {'lr': 0.00043026856060129307, 'samples': 7307712, 'steps': 38060, 'loss/train': 1.5397465229034424}} 11/07/2021 02:35:13 - INFO - __main__ - Step 38066: {'lr': 0.0004302501754898183, 'samples': 7308672, 'steps': 38065, 'loss/train': 1.7628917694091797}}} 11/07/2021 02:35:15 - INFO - __main__ - Step 38071: {'lr': 0.00043023178834789477, 'samples': 7309632, 'steps': 38070, 'loss/train': 1.4251023530960083}} 11/07/2021 02:35:17 - INFO - __main__ - Step 38075: {'lr': 0.0004302170771725721, 'samples': 7310400, 'steps': 38074, 'loss/train': 1.5847716331481934}}} 11/07/2021 02:35:19 - INFO - __main__ - Step 38079: {'lr': 0.0004302023646980009, 'samples': 7311168, 'steps': 38078, 'loss/train': 1.669752597808838}}}} 11/07/2021 02:35:19 - INFO - __main__ - Step 38079: {'lr': 0.0004302023646980009, 'samples': 7311168, 'steps': 38078, 'loss/train': 1.669752597808838}}}} 11/07/2021 02:35:22 - INFO - __main__ - Step 38086: {'lr': 0.00043017661474150347, 'samples': 7312512, 'steps': 38085, 'loss/train': 1.5643198490142822}} 11/07/2021 02:35:25 - INFO - __main__ - Step 38091: {'lr': 0.0004301582194798567, 'samples': 7313472, 'steps': 38090, 'loss/train': 1.1627254486083984}}} 11/07/2021 02:35:27 - INFO - __main__ - Step 38096: {'lr': 0.0004301398221887971, 'samples': 7314432, 'steps': 38095, 'loss/train': 2.0996835231781006}}} 11/07/2021 02:35:29 - INFO - __main__ - Step 38100: {'lr': 0.0004301251028949114, 'samples': 7315200, 'steps': 38099, 'loss/train': 1.506402850151062}}}} 11/07/2021 02:35:29 - INFO - __main__ - Step 38100: {'lr': 0.0004301251028949114, 'samples': 7315200, 'steps': 38099, 'loss/train': 1.506402850151062}}}} 11/07/2021 02:35:33 - INFO - __main__ - Step 38107: {'lr': 0.00043009934100595403, 'samples': 7316544, 'steps': 38106, 'loss/train': 1.5879276990890503}} 11/07/2021 02:35:35 - INFO - __main__ - Step 38112: {'lr': 0.00043008093722216603, 'samples': 7317504, 'steps': 38111, 'loss/train': 1.3977137804031372}} 11/07/2021 02:35:37 - INFO - __main__ - Step 38116: {'lr': 0.00043006621273457523, 'samples': 7318272, 'steps': 38115, 'loss/train': 1.7254838943481445}} 11/07/2021 02:35:39 - INFO - __main__ - Step 38120: {'lr': 0.0004300514869488236, 'samples': 7319040, 'steps': 38119, 'loss/train': 1.5329667329788208}}} 11/07/2021 02:35:41 - INFO - __main__ - Step 38124: {'lr': 0.00043003675986501717, 'samples': 7319808, 'steps': 38123, 'loss/train': 1.5655076503753662}} 11/07/2021 02:35:43 - INFO - __main__ - Step 38128: {'lr': 0.00043002203148326213, 'samples': 7320576, 'steps': 38127, 'loss/train': 1.8233729600906372}} 11/07/2021 02:35:45 - INFO - __main__ - Step 38132: {'lr': 0.0004300073018036648, 'samples': 7321344, 'steps': 38131, 'loss/train': 1.5343341827392578}}} 11/07/2021 02:35:47 - INFO - __main__ - Step 38136: {'lr': 0.0004299925708263312, 'samples': 7322112, 'steps': 38135, 'loss/train': 1.7947341203689575}}} 11/07/2021 02:35:49 - INFO - __main__ - Step 38140: {'lr': 0.0004299778385513676, 'samples': 7322880, 'steps': 38139, 'loss/train': 1.8368099927902222}}} 11/07/2021 02:35:51 - INFO - __main__ - Step 38144: {'lr': 0.00042996310497888025, 'samples': 7323648, 'steps': 38143, 'loss/train': 1.9385180473327637}} 11/07/2021 02:35:53 - INFO - __main__ - Step 38149: {'lr': 0.00042994468618879, 'samples': 7324608, 'steps': 38148, 'loss/train': 1.6510372161865234}37}} 11/07/2021 02:35:53 - INFO - __main__ - Step 38149: {'lr': 0.00042994468618879, 'samples': 7324608, 'steps': 38148, 'loss/train': 1.6510372161865234}37}} 11/07/2021 02:35:57 - INFO - __main__ - Step 38157: {'lr': 0.0004299152119085564, 'samples': 7326144, 'steps': 38156, 'loss/train': 1.5574463605880737}}} 11/07/2021 02:35:59 - INFO - __main__ - Step 38161: {'lr': 0.0004299004728227781, 'samples': 7326912, 'steps': 38160, 'loss/train': 1.3330954313278198}}} 11/07/2021 02:36:01 - INFO - __main__ - Step 38165: {'lr': 0.0004298857324400337, 'samples': 7327680, 'steps': 38164, 'loss/train': 1.8268216848373413}}} 11/07/2021 02:36:03 - INFO - __main__ - Step 38169: {'lr': 0.0004298709907604296, 'samples': 7328448, 'steps': 38168, 'loss/train': 1.229674220085144}}}} 11/07/2021 02:36:05 - INFO - __main__ - Step 38174: {'lr': 0.00042985256183737723, 'samples': 7329408, 'steps': 38173, 'loss/train': 0.975612998008728}}} 11/07/2021 02:36:05 - INFO - __main__ - Step 38174: {'lr': 0.00042985256183737723, 'samples': 7329408, 'steps': 38173, 'loss/train': 0.975612998008728}}} 11/07/2021 02:36:09 - INFO - __main__ - Step 38181: {'lr': 0.00042982675794152135, 'samples': 7330752, 'steps': 38180, 'loss/train': 2.0016207695007324}} 11/07/2021 02:36:11 - INFO - __main__ - Step 38185: {'lr': 0.000429812011075541, 'samples': 7331520, 'steps': 38184, 'loss/train': 1.148552417755127}24}} 11/07/2021 02:36:13 - INFO - __main__ - Step 38190: {'lr': 0.00042979357567011643, 'samples': 7332480, 'steps': 38189, 'loss/train': 1.3136870861053467}} 11/07/2021 02:36:13 - INFO - __main__ - Step 38190: {'lr': 0.00042979357567011643, 'samples': 7332480, 'steps': 38189, 'loss/train': 1.3136870861053467}} 11/07/2021 02:36:17 - INFO - __main__ - Step 38198: {'lr': 0.0004297640748088886, 'samples': 7334016, 'steps': 38197, 'loss/train': 1.6281474828720093}}} 11/07/2021 02:36:19 - INFO - __main__ - Step 38202: {'lr': 0.00042974932243424743, 'samples': 7334784, 'steps': 38201, 'loss/train': 2.0515167713165283}} 11/07/2021 02:36:21 - INFO - __main__ - Step 38206: {'lr': 0.0004297345687637299, 'samples': 7335552, 'steps': 38205, 'loss/train': 2.14150333404541}83}} 11/07/2021 02:36:23 - INFO - __main__ - Step 38211: {'lr': 0.00042971612485341896, 'samples': 7336512, 'steps': 38210, 'loss/train': 0.9586760997772217}} 11/07/2021 02:36:26 - INFO - __main__ - Step 38216: {'lr': 0.0004296976789186753, 'samples': 7337472, 'steps': 38215, 'loss/train': 1.534716010093689}7}} 11/07/2021 02:36:28 - INFO - __main__ - Step 38220: {'lr': 0.0004296829207134283, 'samples': 7338240, 'steps': 38219, 'loss/train': 1.2776870727539062}}} 11/07/2021 02:36:28 - INFO - __main__ - Step 38220: {'lr': 0.0004296829207134283, 'samples': 7338240, 'steps': 38219, 'loss/train': 1.2776870727539062}}} 11/07/2021 02:36:31 - INFO - __main__ - Step 38227: {'lr': 0.00042965709073725957, 'samples': 7339584, 'steps': 38226, 'loss/train': 1.9048470258712769}} 11/07/2021 02:36:31 - INFO - __main__ - Step 38227: {'lr': 0.00042965709073725957, 'samples': 7339584, 'steps': 38226, 'loss/train': 1.9048470258712769}} 11/07/2021 02:36:35 - INFO - __main__ - Step 38235: {'lr': 0.0004296275659074858, 'samples': 7341120, 'steps': 38234, 'loss/train': 1.2817461490631104}}} 11/07/2021 02:36:37 - INFO - __main__ - Step 38239: {'lr': 0.00042961280155004786, 'samples': 7341888, 'steps': 38238, 'loss/train': 0.6085673570632935}} 11/07/2021 02:36:39 - INFO - __main__ - Step 38243: {'lr': 0.0004295980358977178, 'samples': 7342656, 'steps': 38242, 'loss/train': 1.4010951519012451}}} 11/07/2021 02:36:41 - INFO - __main__ - Step 38247: {'lr': 0.00042958326895060206, 'samples': 7343424, 'steps': 38246, 'loss/train': 1.1405773162841797}} 11/07/2021 02:36:43 - INFO - __main__ - Step 38252: {'lr': 0.00042956480844607734, 'samples': 7344384, 'steps': 38251, 'loss/train': 0.6693893074989319}} 11/07/2021 02:36:45 - INFO - __main__ - Step 38256: {'lr': 0.0004295500385860832, 'samples': 7345152, 'steps': 38255, 'loss/train': 2.0637753009796143}}} 11/07/2021 02:36:45 - INFO - __main__ - Step 38256: {'lr': 0.0004295500385860832, 'samples': 7345152, 'steps': 38255, 'loss/train': 2.0637753009796143}}} 11/07/2021 02:36:49 - INFO - __main__ - Step 38263: {'lr': 0.0004295241882164121, 'samples': 7346496, 'steps': 38262, 'loss/train': 1.4554109573364258}}} 11/07/2021 02:36:51 - INFO - __main__ - Step 38268: {'lr': 0.0004295057212398889, 'samples': 7347456, 'steps': 38267, 'loss/train': 1.5185699462890625}}} 11/07/2021 02:36:51 - INFO - __main__ - Step 38268: {'lr': 0.0004295057212398889, 'samples': 7347456, 'steps': 38267, 'loss/train': 1.5185699462890625}}} 11/07/2021 02:36:55 - INFO - __main__ - Step 38276: {'lr': 0.00042947616987164787, 'samples': 7348992, 'steps': 38275, 'loss/train': 1.8463332653045654}} 11/07/2021 02:36:57 - INFO - __main__ - Step 38280: {'lr': 0.0004294613922466135, 'samples': 7349760, 'steps': 38279, 'loss/train': 1.591142177581787}4}} 11/07/2021 02:36:59 - INFO - __main__ - Step 38284: {'lr': 0.0004294466133277786, 'samples': 7350528, 'steps': 38283, 'loss/train': 1.229430079460144}4}} 11/07/2021 02:36:59 - INFO - __main__ - Step 38284: {'lr': 0.0004294466133277786, 'samples': 7350528, 'steps': 38283, 'loss/train': 1.229430079460144}4}} 11/07/2021 02:37:03 - INFO - __main__ - Step 38292: {'lr': 0.0004294170516091332, 'samples': 7352064, 'steps': 38291, 'loss/train': 1.6080865859985352}}} 11/07/2021 02:37:05 - INFO - __main__ - Step 38296: {'lr': 0.00042940226880953605, 'samples': 7352832, 'steps': 38295, 'loss/train': 1.732701301574707}}} 11/07/2021 02:37:07 - INFO - __main__ - Step 38301: {'lr': 0.0004293837884912444, 'samples': 7353792, 'steps': 38300, 'loss/train': 2.025219678878784}}}} 11/07/2021 02:37:07 - INFO - __main__ - Step 38301: {'lr': 0.0004293837884912444, 'samples': 7353792, 'steps': 38300, 'loss/train': 2.025219678878784}}}} 11/07/2021 02:37:11 - INFO - __main__ - Step 38309: {'lr': 0.0004293542157790308, 'samples': 7355328, 'steps': 38308, 'loss/train': 1.512961983680725}}}} 11/07/2021 02:37:13 - INFO - __main__ - Step 38313: {'lr': 0.0004293394274833289, 'samples': 7356096, 'steps': 38312, 'loss/train': 1.530188798904419}}}} 11/07/2021 02:37:15 - INFO - __main__ - Step 38317: {'lr': 0.0004293246378947058, 'samples': 7356864, 'steps': 38316, 'loss/train': 1.9374382495880127}}} 11/07/2021 02:37:17 - INFO - __main__ - Step 38322: {'lr': 0.0004293061490909187, 'samples': 7357824, 'steps': 38321, 'loss/train': 1.3759467601776123}}} 11/07/2021 02:37:20 - INFO - __main__ - Step 38327: {'lr': 0.0004292876582673171, 'samples': 7358784, 'steps': 38326, 'loss/train': 0.9394339323043823}}} 11/07/2021 02:37:20 - INFO - __main__ - Step 38327: {'lr': 0.0004292876582673171, 'samples': 7358784, 'steps': 38326, 'loss/train': 0.9394339323043823}}} 11/07/2021 02:37:23 - INFO - __main__ - Step 38334: {'lr': 0.00042926176772138295, 'samples': 7360128, 'steps': 38333, 'loss/train': 1.29563307762146}}}} 11/07/2021 02:37:25 - INFO - __main__ - Step 38338: {'lr': 0.0004292469713466727, 'samples': 7360896, 'steps': 38337, 'loss/train': 1.8090503215789795}}} 11/07/2021 02:37:28 - INFO - __main__ - Step 38343: {'lr': 0.0004292284740610642, 'samples': 7361856, 'steps': 38342, 'loss/train': 1.0337316989898682}}} 11/07/2021 02:37:30 - INFO - __main__ - Step 38347: {'lr': 0.0004292136747789309, 'samples': 7362624, 'steps': 38346, 'loss/train': 1.6750733852386475}}} 11/07/2021 02:37:32 - INFO - __main__ - Step 38351: {'lr': 0.0004291988742047829, 'samples': 7363392, 'steps': 38350, 'loss/train': 1.636959433555603}}}} 11/07/2021 02:37:33 - INFO - __main__ - Step 38355: {'lr': 0.0004291840723387269, 'samples': 7364160, 'steps': 38354, 'loss/train': 1.572365164756775}}}} 11/07/2021 02:37:36 - INFO - __main__ - Step 38359: {'lr': 0.00042916926918086973, 'samples': 7364928, 'steps': 38358, 'loss/train': 0.26775655150413513} 11/07/2021 02:37:38 - INFO - __main__ - Step 38363: {'lr': 0.00042915446473131805, 'samples': 7365696, 'steps': 38362, 'loss/train': 1.0548052787780762}} 11/07/2021 02:37:40 - INFO - __main__ - Step 38367: {'lr': 0.00042913965899017855, 'samples': 7366464, 'steps': 38366, 'loss/train': 1.445980191230774}}} 11/07/2021 02:37:41 - INFO - __main__ - Step 38371: {'lr': 0.000429124851957558, 'samples': 7367232, 'steps': 38370, 'loss/train': 1.5229068994522095}}}} 11/07/2021 02:37:43 - INFO - __main__ - Step 38375: {'lr': 0.0004291100436335631, 'samples': 7368000, 'steps': 38374, 'loss/train': 1.2032063007354736}}} 11/07/2021 02:37:46 - INFO - __main__ - Step 38380: {'lr': 0.00042909153141273705, 'samples': 7368960, 'steps': 38379, 'loss/train': 1.791010856628418}}} 11/07/2021 02:37:48 - INFO - __main__ - Step 38384: {'lr': 0.00042907672018354027, 'samples': 7369728, 'steps': 38383, 'loss/train': 1.2648224830627441}} 11/07/2021 02:37:48 - INFO - __main__ - Step 38384: {'lr': 0.00042907672018354027, 'samples': 7369728, 'steps': 38383, 'loss/train': 1.2648224830627441}} 11/07/2021 02:37:51 - INFO - __main__ - Step 38391: {'lr': 0.0004290507974259759, 'samples': 7371072, 'steps': 38390, 'loss/train': 1.5398764610290527}}} 11/07/2021 02:37:54 - INFO - __main__ - Step 38396: {'lr': 0.0004290322787502135, 'samples': 7372032, 'steps': 38395, 'loss/train': 1.1739457845687866}}} 11/07/2021 02:37:56 - INFO - __main__ - Step 38401: {'lr': 0.0004290137580577216, 'samples': 7372992, 'steps': 38400, 'loss/train': 1.6494174003601074}}} 11/07/2021 02:37:56 - INFO - __main__ - Step 38401: {'lr': 0.0004290137580577216, 'samples': 7372992, 'steps': 38400, 'loss/train': 1.6494174003601074}}} 11/07/2021 02:37:59 - INFO - __main__ - Step 38408: {'lr': 0.00042898782570052453, 'samples': 7374336, 'steps': 38407, 'loss/train': 1.9118536710739136}} 11/07/2021 02:38:01 - INFO - __main__ - Step 38412: {'lr': 0.0004289730054363795, 'samples': 7375104, 'steps': 38411, 'loss/train': 1.6180638074874878}}} 11/07/2021 02:38:04 - INFO - __main__ - Step 38417: {'lr': 0.00042895447829175516, 'samples': 7376064, 'steps': 38416, 'loss/train': 1.3981465101242065}} 11/07/2021 02:38:06 - INFO - __main__ - Step 38421: {'lr': 0.0004289396551246313, 'samples': 7376832, 'steps': 38420, 'loss/train': 1.5289490222930908}}} 11/07/2021 02:38:08 - INFO - __main__ - Step 38425: {'lr': 0.00042892483066746836, 'samples': 7377600, 'steps': 38424, 'loss/train': 1.5883793830871582}} 11/07/2021 02:38:08 - INFO - __main__ - Step 38425: {'lr': 0.00042892483066746836, 'samples': 7377600, 'steps': 38424, 'loss/train': 1.5883793830871582}} 11/07/2021 02:38:08 - INFO - __main__ - Step 38425: {'lr': 0.00042892483066746836, 'samples': 7377600, 'steps': 38424, 'loss/train': 1.5883793830871582}} 11/07/2021 02:38:13 - INFO - __main__ - Step 38434: {'lr': 0.00042889147092269964, 'samples': 7379328, 'steps': 38433, 'loss/train': 2.4124293327331543}} 11/07/2021 02:38:13 - INFO - __main__ - Step 38434: {'lr': 0.00042889147092269964, 'samples': 7379328, 'steps': 38433, 'loss/train': 2.4124293327331543}} 11/07/2021 02:38:13 - INFO - __main__ - Step 38434: {'lr': 0.00042889147092269964, 'samples': 7379328, 'steps': 38433, 'loss/train': 2.4124293327331543}} 11/07/2021 02:38:19 - INFO - __main__ - Step 38445: {'lr': 0.00042885068903480717, 'samples': 7381440, 'steps': 38444, 'loss/train': 0.9094494581222534}} 11/07/2021 02:38:22 - INFO - __main__ - Step 38451: {'lr': 0.0004288284402585866, 'samples': 7382592, 'steps': 38450, 'loss/train': 1.5371888875961304}}} 11/07/2021 02:38:22 - INFO - __main__ - Step 38451: {'lr': 0.0004288284402585866, 'samples': 7382592, 'steps': 38450, 'loss/train': 1.5371888875961304}}} 11/07/2021 02:38:26 - INFO - __main__ - Step 38458: {'lr': 0.00042880247968675255, 'samples': 7383936, 'steps': 38457, 'loss/train': 1.191347360610962}}} 11/07/2021 02:38:27 - INFO - __main__ - Step 38462: {'lr': 0.0004287876433017951, 'samples': 7384704, 'steps': 38461, 'loss/train': 1.1448413133621216}}} 11/07/2021 02:38:29 - INFO - __main__ - Step 38466: {'lr': 0.0004287728056278944, 'samples': 7385472, 'steps': 38465, 'loss/train': 1.7644872665405273}}} 11/07/2021 02:38:32 - INFO - __main__ - Step 38471: {'lr': 0.00042875425672310506, 'samples': 7386432, 'steps': 38470, 'loss/train': 1.6783102750778198}} 11/07/2021 02:38:34 - INFO - __main__ - Step 38475: {'lr': 0.0004287394161494733, 'samples': 7387200, 'steps': 38474, 'loss/train': 1.8952447175979614}}} 11/07/2021 02:38:36 - INFO - __main__ - Step 38479: {'lr': 0.00042872457428724586, 'samples': 7387968, 'steps': 38478, 'loss/train': 1.7590001821517944}} 11/07/2021 02:38:37 - INFO - __main__ - Step 38483: {'lr': 0.0004287097311365299, 'samples': 7388736, 'steps': 38482, 'loss/train': 1.6136687994003296}}} 11/07/2021 02:38:39 - INFO - __main__ - Step 38487: {'lr': 0.0004286948866974323, 'samples': 7389504, 'steps': 38486, 'loss/train': 1.2108365297317505}}} 11/07/2021 02:38:42 - INFO - __main__ - Step 38492: {'lr': 0.0004286763293369369, 'samples': 7390464, 'steps': 38491, 'loss/train': 1.2377177476882935}}} 11/07/2021 02:38:44 - INFO - __main__ - Step 38496: {'lr': 0.00042866148199937216, 'samples': 7391232, 'steps': 38495, 'loss/train': 1.4726425409317017}} 11/07/2021 02:38:46 - INFO - __main__ - Step 38501: {'lr': 0.00042864292101613133, 'samples': 7392192, 'steps': 38500, 'loss/train': 1.4850726127624512}} 11/07/2021 02:38:46 - INFO - __main__ - Step 38501: {'lr': 0.00042864292101613133, 'samples': 7392192, 'steps': 38500, 'loss/train': 1.4850726127624512}} 11/07/2021 02:38:49 - INFO - __main__ - Step 38508: {'lr': 0.00042861693225890385, 'samples': 7393536, 'steps': 38507, 'loss/train': 1.6027082204818726}} 11/07/2021 02:38:52 - INFO - __main__ - Step 38513: {'lr': 0.00042859836644638976, 'samples': 7394496, 'steps': 38512, 'loss/train': 3.527266502380371}}} 11/07/2021 02:38:52 - INFO - __main__ - Step 38513: {'lr': 0.00042859836644638976, 'samples': 7394496, 'steps': 38512, 'loss/train': 3.527266502380371}}} 11/07/2021 02:38:56 - INFO - __main__ - Step 38521: {'lr': 0.0004285686569618235, 'samples': 7396032, 'steps': 38520, 'loss/train': 1.7476186752319336}}} 11/07/2021 02:38:57 - INFO - __main__ - Step 38525: {'lr': 0.00042855380028844004, 'samples': 7396800, 'steps': 38524, 'loss/train': 1.3176720142364502}} 11/07/2021 02:38:59 - INFO - __main__ - Step 38529: {'lr': 0.00042853894232779924, 'samples': 7397568, 'steps': 38528, 'loss/train': 1.7133264541625977}} 11/07/2021 02:39:02 - INFO - __main__ - Step 38534: {'lr': 0.00042852036806695565, 'samples': 7398528, 'steps': 38533, 'loss/train': 1.6784566640853882}} 11/07/2021 02:39:02 - INFO - __main__ - Step 38534: {'lr': 0.00042852036806695565, 'samples': 7398528, 'steps': 38533, 'loss/train': 1.6784566640853882}} 11/07/2021 02:39:02 - INFO - __main__ - Step 38534: {'lr': 0.00042852036806695565, 'samples': 7398528, 'steps': 38533, 'loss/train': 1.6784566640853882}} 11/07/2021 02:39:08 - INFO - __main__ - Step 38545: {'lr': 0.0004284794976148044, 'samples': 7400640, 'steps': 38544, 'loss/train': 0.7191889882087708}}} 11/07/2021 02:39:08 - INFO - __main__ - Step 38545: {'lr': 0.0004284794976148044, 'samples': 7400640, 'steps': 38544, 'loss/train': 0.7191889882087708}}} 11/07/2021 02:39:12 - INFO - __main__ - Step 38553: {'lr': 0.0004284497675375482, 'samples': 7402176, 'steps': 38552, 'loss/train': 1.730039358139038}}}} 11/07/2021 02:39:13 - INFO - __main__ - Step 38557: {'lr': 0.00042843490056910534, 'samples': 7402944, 'steps': 38556, 'loss/train': 1.7708011865615845}} 11/07/2021 02:39:15 - INFO - __main__ - Step 38561: {'lr': 0.0004284200323142623, 'samples': 7403712, 'steps': 38560, 'loss/train': 2.0764431953430176}}} 11/07/2021 02:39:18 - INFO - __main__ - Step 38566: {'lr': 0.0004284014451868716, 'samples': 7404672, 'steps': 38565, 'loss/train': 1.763478398323059}}}} 11/07/2021 02:39:20 - INFO - __main__ - Step 38571: {'lr': 0.0004283828560498574, 'samples': 7405632, 'steps': 38570, 'loss/train': 0.9588892459869385}}} 11/07/2021 02:39:20 - INFO - __main__ - Step 38571: {'lr': 0.0004283828560498574, 'samples': 7405632, 'steps': 38570, 'loss/train': 0.9588892459869385}}} 11/07/2021 02:39:24 - INFO - __main__ - Step 38578: {'lr': 0.0004283568278822688, 'samples': 7406976, 'steps': 38577, 'loss/train': 0.8108773827552795}}} 11/07/2021 02:39:26 - INFO - __main__ - Step 38582: {'lr': 0.00042834195287558356, 'samples': 7407744, 'steps': 38581, 'loss/train': 1.4471834897994995}} 11/07/2021 02:39:28 - INFO - __main__ - Step 38587: {'lr': 0.00042832335730918147, 'samples': 7408704, 'steps': 38586, 'loss/train': 1.4644055366516113}} 11/07/2021 02:39:30 - INFO - __main__ - Step 38591: {'lr': 0.0004283084794097543, 'samples': 7409472, 'steps': 38590, 'loss/train': 1.6175768375396729}}} 11/07/2021 02:39:32 - INFO - __main__ - Step 38595: {'lr': 0.0004282936002248383, 'samples': 7410240, 'steps': 38594, 'loss/train': 1.3758429288864136}}} 11/07/2021 02:39:34 - INFO - __main__ - Step 38599: {'lr': 0.0004282787197545408, 'samples': 7411008, 'steps': 38598, 'loss/train': 1.0509960651397705}}} 11/07/2021 02:39:36 - INFO - __main__ - Step 38603: {'lr': 0.00042826383799896906, 'samples': 7411776, 'steps': 38602, 'loss/train': 1.4507431983947754}} 11/07/2021 02:39:38 - INFO - __main__ - Step 38608: {'lr': 0.0004282452339972509, 'samples': 7412736, 'steps': 38607, 'loss/train': 0.5122175216674805}}} 11/07/2021 02:39:41 - INFO - __main__ - Step 38613: {'lr': 0.000428226627987669, 'samples': 7413696, 'steps': 38612, 'loss/train': 1.3209978342056274}}}} 11/07/2021 02:39:41 - INFO - __main__ - Step 38613: {'lr': 0.000428226627987669, 'samples': 7413696, 'steps': 38612, 'loss/train': 1.3209978342056274}}}} 11/07/2021 02:39:44 - INFO - __main__ - Step 38620: {'lr': 0.00042820057620144214, 'samples': 7415040, 'steps': 38619, 'loss/train': 1.3826243877410889}} 11/07/2021 02:39:46 - INFO - __main__ - Step 38624: {'lr': 0.00042818568769994103, 'samples': 7415808, 'steps': 38623, 'loss/train': 0.15500399470329285} 11/07/2021 02:39:48 - INFO - __main__ - Step 38628: {'lr': 0.00042817079791383636, 'samples': 7416576, 'steps': 38627, 'loss/train': 1.77236807346344}85} 11/07/2021 02:39:50 - INFO - __main__ - Step 38633: {'lr': 0.00042815218387489535, 'samples': 7417536, 'steps': 38632, 'loss/train': 1.3826674222946167}} 11/07/2021 02:39:50 - INFO - __main__ - Step 38633: {'lr': 0.00042815218387489535, 'samples': 7417536, 'steps': 38632, 'loss/train': 1.3826674222946167}} 11/07/2021 02:39:53 - INFO - __main__ - Step 38640: {'lr': 0.0004281261208489747, 'samples': 7418880, 'steps': 38639, 'loss/train': 1.198464274406433}7}} 11/07/2021 02:39:56 - INFO - __main__ - Step 38644: {'lr': 0.00042811122592552943, 'samples': 7419648, 'steps': 38643, 'loss/train': 1.4061126708984375}} 11/07/2021 02:39:58 - INFO - __main__ - Step 38649: {'lr': 0.0004280926054655165, 'samples': 7420608, 'steps': 38648, 'loss/train': 1.7997180223464966}}} 11/07/2021 02:40:00 - INFO - __main__ - Step 38653: {'lr': 0.00042807770765307217, 'samples': 7421376, 'steps': 38652, 'loss/train': 0.7365889549255371}} 11/07/2021 02:40:02 - INFO - __main__ - Step 38657: {'lr': 0.0004280628085568028, 'samples': 7422144, 'steps': 38656, 'loss/train': 1.452724814414978}1}} 11/07/2021 02:40:04 - INFO - __main__ - Step 38661: {'lr': 0.00042804790817681574, 'samples': 7422912, 'steps': 38660, 'loss/train': 1.398537039756775}}} 11/07/2021 02:40:04 - INFO - __main__ - Step 38661: {'lr': 0.00042804790817681574, 'samples': 7422912, 'steps': 38660, 'loss/train': 1.398537039756775}}} 11/07/2021 02:40:08 - INFO - __main__ - Step 38668: {'lr': 0.00042802182942321576, 'samples': 7424256, 'steps': 38667, 'loss/train': 1.372496485710144}}} 11/07/2021 02:40:10 - INFO - __main__ - Step 38672: {'lr': 0.000428006925513559, 'samples': 7425024, 'steps': 38671, 'loss/train': 1.8654686212539673}}}} 11/07/2021 02:40:12 - INFO - __main__ - Step 38676: {'lr': 0.0004279920203205875, 'samples': 7425792, 'steps': 38675, 'loss/train': 1.5036089420318604}}} 11/07/2021 02:40:14 - INFO - __main__ - Step 38681: {'lr': 0.0004279733870248754, 'samples': 7426752, 'steps': 38680, 'loss/train': 1.6720571517944336}}} 11/07/2021 02:40:16 - INFO - __main__ - Step 38685: {'lr': 0.0004279584789448385, 'samples': 7427520, 'steps': 38684, 'loss/train': 1.6397520303726196}}} 11/07/2021 02:40:19 - INFO - __main__ - Step 38689: {'lr': 0.0004279435695818361, 'samples': 7428288, 'steps': 38688, 'loss/train': 1.785889983177185}}}} 11/07/2021 02:40:20 - INFO - __main__ - Step 38693: {'lr': 0.0004279286589359757, 'samples': 7429056, 'steps': 38692, 'loss/train': 1.4303791522979736}}} 11/07/2021 02:40:22 - INFO - __main__ - Step 38697: {'lr': 0.0004279137470073648, 'samples': 7429824, 'steps': 38696, 'loss/train': 1.5088738203048706}}} 11/07/2021 02:40:24 - INFO - __main__ - Step 38702: {'lr': 0.000427895105292897, 'samples': 7430784, 'steps': 38701, 'loss/train': 1.3261916637420654}}}} 11/07/2021 02:40:27 - INFO - __main__ - Step 38706: {'lr': 0.0004278801904784904, 'samples': 7431552, 'steps': 38705, 'loss/train': 1.5037397146224976}}} 11/07/2021 02:40:29 - INFO - __main__ - Step 38710: {'lr': 0.0004278652743816828, 'samples': 7432320, 'steps': 38709, 'loss/train': 1.3880391120910645}}} 11/07/2021 02:40:30 - INFO - __main__ - Step 38714: {'lr': 0.0004278503570025816, 'samples': 7433088, 'steps': 38713, 'loss/train': 1.1154862642288208}}} 11/07/2021 02:40:32 - INFO - __main__ - Step 38718: {'lr': 0.0004278354383412943, 'samples': 7433856, 'steps': 38717, 'loss/train': 1.4579813480377197}}} 11/07/2021 02:40:35 - INFO - __main__ - Step 38723: {'lr': 0.000427816788211775, 'samples': 7434816, 'steps': 38722, 'loss/train': 1.6787524223327637}}}} 11/07/2021 02:40:35 - INFO - __main__ - Step 38723: {'lr': 0.000427816788211775, 'samples': 7434816, 'steps': 38722, 'loss/train': 1.6787524223327637}}}} 11/07/2021 02:40:38 - INFO - __main__ - Step 38730: {'lr': 0.000427790674665392, 'samples': 7436160, 'steps': 38729, 'loss/train': 1.3973135948181152}}}} 11/07/2021 02:40:40 - INFO - __main__ - Step 38734: {'lr': 0.0004277757508764363, 'samples': 7436928, 'steps': 38733, 'loss/train': 1.2723112106323242}}} 11/07/2021 02:40:42 - INFO - __main__ - Step 38739: {'lr': 0.00042775709433793657, 'samples': 7437888, 'steps': 38738, 'loss/train': 1.4090418815612793}} 11/07/2021 02:40:45 - INFO - __main__ - Step 38744: {'lr': 0.0004277384357970717, 'samples': 7438848, 'steps': 38743, 'loss/train': 1.289526104927063}3}} 11/07/2021 02:40:47 - INFO - __main__ - Step 38748: {'lr': 0.00042772350752281823, 'samples': 7439616, 'steps': 38747, 'loss/train': 1.8541260957717896}} 11/07/2021 02:40:47 - INFO - __main__ - Step 38748: {'lr': 0.00042772350752281823, 'samples': 7439616, 'steps': 38747, 'loss/train': 1.8541260957717896}} 11/07/2021 02:40:50 - INFO - __main__ - Step 38755: {'lr': 0.0004276973799598798, 'samples': 7440960, 'steps': 38754, 'loss/train': 1.5014312267303467}}} 11/07/2021 02:40:52 - INFO - __main__ - Step 38760: {'lr': 0.00042767871501285916, 'samples': 7441920, 'steps': 38759, 'loss/train': 1.5829507112503052}} 11/07/2021 02:40:55 - INFO - __main__ - Step 38765: {'lr': 0.00042766004806435643, 'samples': 7442880, 'steps': 38764, 'loss/train': 1.5389777421951294}} 11/07/2021 02:40:57 - INFO - __main__ - Step 38769: {'lr': 0.0004276451130646283, 'samples': 7443648, 'steps': 38768, 'loss/train': 1.5693845748901367}}} 11/07/2021 02:40:59 - INFO - __main__ - Step 38773: {'lr': 0.0004276301767841939, 'samples': 7444416, 'steps': 38772, 'loss/train': 1.301182746887207}}}} 11/07/2021 02:40:59 - INFO - __main__ - Step 38773: {'lr': 0.0004276301767841939, 'samples': 7444416, 'steps': 38772, 'loss/train': 1.301182746887207}}}} 11/07/2021 02:41:02 - INFO - __main__ - Step 38780: {'lr': 0.0004276040352120578, 'samples': 7445760, 'steps': 38779, 'loss/train': 1.2119277715682983}}} 11/07/2021 02:41:05 - INFO - __main__ - Step 38785: {'lr': 0.0004275853602597294, 'samples': 7446720, 'steps': 38784, 'loss/train': 0.8429235219955444}}} 11/07/2021 02:41:07 - INFO - __main__ - Step 38790: {'lr': 0.00042756668330697024, 'samples': 7447680, 'steps': 38789, 'loss/train': 1.5498528480529785}} 11/07/2021 02:41:07 - INFO - __main__ - Step 38790: {'lr': 0.00042756668330697024, 'samples': 7447680, 'steps': 38789, 'loss/train': 1.5498528480529785}} 11/07/2021 02:41:10 - INFO - __main__ - Step 38797: {'lr': 0.00042754053221278476, 'samples': 7449024, 'steps': 38796, 'loss/train': 1.9527474641799927}} 11/07/2021 02:41:12 - INFO - __main__ - Step 38801: {'lr': 0.00042752558697042143, 'samples': 7449792, 'steps': 38800, 'loss/train': 1.6765644550323486}} 11/07/2021 02:41:15 - INFO - __main__ - Step 38806: {'lr': 0.0004275069036176985, 'samples': 7450752, 'steps': 38805, 'loss/train': 1.706905722618103}6}} 11/07/2021 02:41:15 - INFO - __main__ - Step 38806: {'lr': 0.0004275069036176985, 'samples': 7450752, 'steps': 38805, 'loss/train': 1.706905722618103}6}} 11/07/2021 02:41:18 - INFO - __main__ - Step 38813: {'lr': 0.0004274807435646948, 'samples': 7452096, 'steps': 38812, 'loss/train': 1.7396658658981323}}} 11/07/2021 02:41:20 - INFO - __main__ - Step 38817: {'lr': 0.00042746579320359956, 'samples': 7452864, 'steps': 38816, 'loss/train': 1.5527185201644897}} 11/07/2021 02:41:23 - INFO - __main__ - Step 38822: {'lr': 0.00042744710345306774, 'samples': 7453824, 'steps': 38821, 'loss/train': 1.780123233795166}}} 11/07/2021 02:41:25 - INFO - __main__ - Step 38826: {'lr': 0.0004274321502134435, 'samples': 7454592, 'steps': 38825, 'loss/train': 1.3130568265914917}}} 11/07/2021 02:41:27 - INFO - __main__ - Step 38830: {'lr': 0.00042741719569464834, 'samples': 7455360, 'steps': 38829, 'loss/train': 1.382586121559143}}} 11/07/2021 02:41:29 - INFO - __main__ - Step 38834: {'lr': 0.00042740223989678984, 'samples': 7456128, 'steps': 38833, 'loss/train': 1.2758821249008179}} 11/07/2021 02:41:31 - INFO - __main__ - Step 38839: {'lr': 0.0004273835433509484, 'samples': 7457088, 'steps': 38838, 'loss/train': 1.5911930799484253}}} 11/07/2021 02:41:33 - INFO - __main__ - Step 38843: {'lr': 0.000427368584675592, 'samples': 7457856, 'steps': 38842, 'loss/train': 1.8798352479934692}}}} 11/07/2021 02:41:33 - INFO - __main__ - Step 38843: {'lr': 0.000427368584675592, 'samples': 7457856, 'steps': 38842, 'loss/train': 1.8798352479934692}}}} 11/07/2021 02:41:36 - INFO - __main__ - Step 38850: {'lr': 0.0004273424039168805, 'samples': 7459200, 'steps': 38849, 'loss/train': 1.733851671218872}}}} 11/07/2021 02:41:38 - INFO - __main__ - Step 38854: {'lr': 0.0004273274417253235, 'samples': 7459968, 'steps': 38853, 'loss/train': 1.4414904117584229}}} 11/07/2021 02:41:41 - INFO - __main__ - Step 38860: {'lr': 0.0004273049960409915, 'samples': 7461120, 'steps': 38859, 'loss/train': 1.8021929264068604}}} 11/07/2021 02:41:41 - INFO - __main__ - Step 38860: {'lr': 0.0004273049960409915, 'samples': 7461120, 'steps': 38859, 'loss/train': 1.8021929264068604}}} 11/07/2021 02:41:44 - INFO - __main__ - Step 38865: {'lr': 0.00042728628910703305, 'samples': 7462080, 'steps': 38864, 'loss/train': 2.429159164428711}}} 11/07/2021 02:41:46 - INFO - __main__ - Step 38870: {'lr': 0.00042726758017601297, 'samples': 7463040, 'steps': 38869, 'loss/train': 1.5700926780700684}} 11/07/2021 02:41:46 - INFO - __main__ - Step 38870: {'lr': 0.00042726758017601297, 'samples': 7463040, 'steps': 38869, 'loss/train': 1.5700926780700684}} 11/07/2021 02:41:50 - INFO - __main__ - Step 38877: {'lr': 0.00042724138431792245, 'samples': 7464384, 'steps': 38876, 'loss/train': 0.17262661457061768} 11/07/2021 02:41:52 - INFO - __main__ - Step 38882: {'lr': 0.0004272226705948143, 'samples': 7465344, 'steps': 38881, 'loss/train': 0.8323983550071716}8} 11/07/2021 02:41:55 - INFO - __main__ - Step 38887: {'lr': 0.00042720395487536115, 'samples': 7466304, 'steps': 38886, 'loss/train': 2.7242424488067627}} 11/07/2021 02:41:55 - INFO - __main__ - Step 38887: {'lr': 0.00042720395487536115, 'samples': 7466304, 'steps': 38886, 'loss/train': 2.7242424488067627}} 11/07/2021 02:41:59 - INFO - __main__ - Step 38894: {'lr': 0.0004271777495146685, 'samples': 7467648, 'steps': 38893, 'loss/train': 1.3811542987823486}}} 11/07/2021 02:42:00 - INFO - __main__ - Step 38898: {'lr': 0.0004271627732664687, 'samples': 7468416, 'steps': 38897, 'loss/train': 0.915793776512146}}}} 11/07/2021 02:42:02 - INFO - __main__ - Step 38902: {'lr': 0.0004271477957410399, 'samples': 7469184, 'steps': 38901, 'loss/train': 1.4027563333511353}}} 11/07/2021 02:42:02 - INFO - __main__ - Step 38902: {'lr': 0.0004271477957410399, 'samples': 7469184, 'steps': 38901, 'loss/train': 1.4027563333511353}}} 11/07/2021 02:42:02 - INFO - __main__ - Step 38902: {'lr': 0.0004271477957410399, 'samples': 7469184, 'steps': 38901, 'loss/train': 1.4027563333511353}}} 11/07/2021 02:42:08 - INFO - __main__ - Step 38913: {'lr': 0.0004271066009612804, 'samples': 7471296, 'steps': 38912, 'loss/train': 1.6177656650543213}}} 11/07/2021 02:42:11 - INFO - __main__ - Step 38918: {'lr': 0.0004270878728691946, 'samples': 7472256, 'steps': 38917, 'loss/train': 1.196390151977539}}}} 11/07/2021 02:42:11 - INFO - __main__ - Step 38918: {'lr': 0.0004270878728691946, 'samples': 7472256, 'steps': 38917, 'loss/train': 1.196390151977539}}}} 11/07/2021 02:42:14 - INFO - __main__ - Step 38924: {'lr': 0.0004270653965255391, 'samples': 7473408, 'steps': 38923, 'loss/train': 1.0960659980773926}}} 11/07/2021 02:42:17 - INFO - __main__ - Step 38930: {'lr': 0.000427042917309698, 'samples': 7474560, 'steps': 38929, 'loss/train': 1.7741930484771729}}}} 11/07/2021 02:42:19 - INFO - __main__ - Step 38934: {'lr': 0.0004270279295703253, 'samples': 7475328, 'steps': 38933, 'loss/train': 1.5284372568130493}}} 11/07/2021 02:42:19 - INFO - __main__ - Step 38934: {'lr': 0.0004270279295703253, 'samples': 7475328, 'steps': 38933, 'loss/train': 1.5284372568130493}}} 11/07/2021 02:42:22 - INFO - __main__ - Step 38941: {'lr': 0.00042700169795549504, 'samples': 7476672, 'steps': 38940, 'loss/train': 1.759921669960022}}} 11/07/2021 02:42:25 - INFO - __main__ - Step 38946: {'lr': 0.00042698295869509836, 'samples': 7477632, 'steps': 38945, 'loss/train': 1.5602725744247437}} 11/07/2021 02:42:27 - INFO - __main__ - Step 38950: {'lr': 0.0004269679658513466, 'samples': 7478400, 'steps': 38949, 'loss/train': 1.495678186416626}7}} 11/07/2021 02:42:27 - INFO - __main__ - Step 38950: {'lr': 0.0004269679658513466, 'samples': 7478400, 'steps': 38949, 'loss/train': 1.495678186416626}7}} 11/07/2021 02:42:30 - INFO - __main__ - Step 38957: {'lr': 0.00042694172530489326, 'samples': 7479744, 'steps': 38956, 'loss/train': 1.171259880065918}}} 11/07/2021 02:42:33 - INFO - __main__ - Step 38962: {'lr': 0.00042692297966557657, 'samples': 7480704, 'steps': 38961, 'loss/train': 1.3303264379501343}} 11/07/2021 02:42:35 - INFO - __main__ - Step 38967: {'lr': 0.00042690423203329067, 'samples': 7481664, 'steps': 38966, 'loss/train': 1.5373287200927734}} 11/07/2021 02:42:35 - INFO - __main__ - Step 38967: {'lr': 0.00042690423203329067, 'samples': 7481664, 'steps': 38966, 'loss/train': 1.5373287200927734}} 11/07/2021 02:42:39 - INFO - __main__ - Step 38974: {'lr': 0.00042687798200030446, 'samples': 7483008, 'steps': 38973, 'loss/train': 1.1193890571594238}} 11/07/2021 02:42:40 - INFO - __main__ - Step 38978: {'lr': 0.00042686298022805126, 'samples': 7483776, 'steps': 38977, 'loss/train': 1.4833357334136963}} 11/07/2021 02:42:42 - INFO - __main__ - Step 38982: {'lr': 0.0004268479771807303, 'samples': 7484544, 'steps': 38981, 'loss/train': 1.8099292516708374}}} 11/07/2021 02:42:45 - INFO - __main__ - Step 38988: {'lr': 0.0004268254702192337, 'samples': 7485696, 'steps': 38987, 'loss/train': 1.7074424028396606}}} 11/07/2021 02:42:47 - INFO - __main__ - Step 38992: {'lr': 0.00042681046398471693, 'samples': 7486464, 'steps': 38991, 'loss/train': 1.6774481534957886}} 11/07/2021 02:42:49 - INFO - __main__ - Step 38996: {'lr': 0.000426795456475511, 'samples': 7487232, 'steps': 38995, 'loss/train': 1.4442554712295532}6}} 11/07/2021 02:42:49 - INFO - __main__ - Step 38996: {'lr': 0.000426795456475511, 'samples': 7487232, 'steps': 38995, 'loss/train': 1.4442554712295532}6}} 11/07/2021 02:42:53 - INFO - __main__ - Step 39003: {'lr': 0.0004267691902675055, 'samples': 7488576, 'steps': 39002, 'loss/train': 1.7540159225463867}}} 11/07/2021 02:42:53 - INFO - __main__ - Step 39003: {'lr': 0.0004267691902675055, 'samples': 7488576, 'steps': 39002, 'loss/train': 1.7540159225463867}}} 11/07/2021 02:42:53 - INFO - __main__ - Step 39003: {'lr': 0.0004267691902675055, 'samples': 7488576, 'steps': 39002, 'loss/train': 1.7540159225463867}}} 11/07/2021 02:42:59 - INFO - __main__ - Step 39015: {'lr': 0.00042672415340263507, 'samples': 7490880, 'steps': 39014, 'loss/train': 1.77121102809906}}}} 11/07/2021 02:43:01 - INFO - __main__ - Step 39020: {'lr': 0.0004267053846578646, 'samples': 7491840, 'steps': 39019, 'loss/train': 1.799487590789795}}}} 11/07/2021 02:43:01 - INFO - __main__ - Step 39020: {'lr': 0.0004267053846578646, 'samples': 7491840, 'steps': 39019, 'loss/train': 1.799487590789795}}}} 11/07/2021 02:43:05 - INFO - __main__ - Step 39028: {'lr': 0.0004266753505260425, 'samples': 7493376, 'steps': 39027, 'loss/train': 0.9015293717384338}}} 11/07/2021 02:43:07 - INFO - __main__ - Step 39032: {'lr': 0.00042666033154950485, 'samples': 7494144, 'steps': 39031, 'loss/train': 1.570765733718872}}} 11/07/2021 02:43:09 - INFO - __main__ - Step 39036: {'lr': 0.00042664531129936044, 'samples': 7494912, 'steps': 39035, 'loss/train': 1.7049858570098877}} 11/07/2021 02:43:11 - INFO - __main__ - Step 39040: {'lr': 0.00042663028977571774, 'samples': 7495680, 'steps': 39039, 'loss/train': 1.2940502166748047}} 11/07/2021 02:43:13 - INFO - __main__ - Step 39045: {'lr': 0.000426611511080472, 'samples': 7496640, 'steps': 39044, 'loss/train': 1.5129510164260864}7}} 11/07/2021 02:43:15 - INFO - __main__ - Step 39049: {'lr': 0.00042659648669185376, 'samples': 7497408, 'steps': 39048, 'loss/train': 1.8148497343063354}} 11/07/2021 02:43:17 - INFO - __main__ - Step 39053: {'lr': 0.00042658146103008904, 'samples': 7498176, 'steps': 39052, 'loss/train': 1.286407470703125}}} 11/07/2021 02:43:19 - INFO - __main__ - Step 39057: {'lr': 0.0004265664340952862, 'samples': 7498944, 'steps': 39056, 'loss/train': 1.225213646888733}}}} 11/07/2021 02:43:21 - INFO - __main__ - Step 39061: {'lr': 0.00042655140588755366, 'samples': 7499712, 'steps': 39060, 'loss/train': 1.3078956604003906}} 11/07/2021 02:43:23 - INFO - __main__ - Step 39065: {'lr': 0.0004265363764069997, 'samples': 7500480, 'steps': 39064, 'loss/train': 1.4725383520126343}}} 11/07/2021 02:43:25 - INFO - __main__ - Step 39069: {'lr': 0.0004265213456537326, 'samples': 7501248, 'steps': 39068, 'loss/train': 1.6109968423843384}}} 11/07/2021 02:43:27 - INFO - __main__ - Step 39073: {'lr': 0.0004265063136278608, 'samples': 7502016, 'steps': 39072, 'loss/train': 1.0011496543884277}}} 11/07/2021 02:43:29 - INFO - __main__ - Step 39077: {'lr': 0.0004264912803294926, 'samples': 7502784, 'steps': 39076, 'loss/train': 1.2915695905685425}}} 11/07/2021 02:43:31 - INFO - __main__ - Step 39082: {'lr': 0.0004264724869172496, 'samples': 7503744, 'steps': 39081, 'loss/train': 1.7730166912078857}}} 11/07/2021 02:43:33 - INFO - __main__ - Step 39086: {'lr': 0.00042645745075616106, 'samples': 7504512, 'steps': 39085, 'loss/train': 0.18499447405338287} 11/07/2021 02:43:33 - INFO - __main__ - Step 39086: {'lr': 0.00042645745075616106, 'samples': 7504512, 'steps': 39085, 'loss/train': 0.18499447405338287} 11/07/2021 02:43:37 - INFO - __main__ - Step 39093: {'lr': 0.0004264311344132245, 'samples': 7505856, 'steps': 39092, 'loss/train': 1.8912596702575684}7} 11/07/2021 02:43:39 - INFO - __main__ - Step 39097: {'lr': 0.00042641609475400054, 'samples': 7506624, 'steps': 39096, 'loss/train': 2.056940793991089}7} 11/07/2021 02:43:41 - INFO - __main__ - Step 39102: {'lr': 0.00042639729339145004, 'samples': 7507584, 'steps': 39101, 'loss/train': 0.1855313628911972}} 11/07/2021 02:43:43 - INFO - __main__ - Step 39106: {'lr': 0.00042638225087072523, 'samples': 7508352, 'steps': 39105, 'loss/train': 1.9047843217849731}} 11/07/2021 02:43:45 - INFO - __main__ - Step 39110: {'lr': 0.0004263672070783986, 'samples': 7509120, 'steps': 39109, 'loss/train': 2.0347707271575928}}} 11/07/2021 02:43:47 - INFO - __main__ - Step 39114: {'lr': 0.00042635216201457836, 'samples': 7509888, 'steps': 39113, 'loss/train': 1.5635451078414917}} 11/07/2021 02:43:49 - INFO - __main__ - Step 39118: {'lr': 0.00042633711567937325, 'samples': 7510656, 'steps': 39117, 'loss/train': 1.6775025129318237}} 11/07/2021 02:43:49 - INFO - __main__ - Step 39118: {'lr': 0.00042633711567937325, 'samples': 7510656, 'steps': 39117, 'loss/train': 1.6775025129318237}} 11/07/2021 02:43:49 - INFO - __main__ - Step 39118: {'lr': 0.00042633711567937325, 'samples': 7510656, 'steps': 39117, 'loss/train': 1.6775025129318237}} 11/07/2021 02:43:55 - INFO - __main__ - Step 39129: {'lr': 0.0004262957317028657, 'samples': 7512768, 'steps': 39128, 'loss/train': 1.9064688682556152}}} 11/07/2021 02:43:57 - INFO - __main__ - Step 39133: {'lr': 0.00042628068060093294, 'samples': 7513536, 'steps': 39132, 'loss/train': 1.417256474494934}}} 11/07/2021 02:43:59 - INFO - __main__ - Step 39137: {'lr': 0.0004262656282281305, 'samples': 7514304, 'steps': 39136, 'loss/train': 1.5228852033615112}}} 11/07/2021 02:44:01 - INFO - __main__ - Step 39142: {'lr': 0.0004262468109751323, 'samples': 7515264, 'steps': 39141, 'loss/train': 0.9578734636306763}}} 11/07/2021 02:44:03 - INFO - __main__ - Step 39146: {'lr': 0.0004262317557432699, 'samples': 7516032, 'steps': 39145, 'loss/train': 1.3861796855926514}}} 11/07/2021 02:44:06 - INFO - __main__ - Step 39150: {'lr': 0.00042621669924089044, 'samples': 7516800, 'steps': 39149, 'loss/train': 1.1236180067062378}} 11/07/2021 02:44:06 - INFO - __main__ - Step 39150: {'lr': 0.00042621669924089044, 'samples': 7516800, 'steps': 39149, 'loss/train': 1.1236180067062378}} 11/07/2021 02:44:09 - INFO - __main__ - Step 39157: {'lr': 0.00042619034730487167, 'samples': 7518144, 'steps': 39156, 'loss/train': 1.2965528964996338}} 11/07/2021 02:44:09 - INFO - __main__ - Step 39157: {'lr': 0.00042619034730487167, 'samples': 7518144, 'steps': 39156, 'loss/train': 1.2965528964996338}} 11/07/2021 02:44:13 - INFO - __main__ - Step 39166: {'lr': 0.0004261564605283745, 'samples': 7519872, 'steps': 39165, 'loss/train': 1.8770747184753418}}} 11/07/2021 02:44:15 - INFO - __main__ - Step 39170: {'lr': 0.0004261413976750388, 'samples': 7520640, 'steps': 39169, 'loss/train': 1.223093032836914}}}} 11/07/2021 02:44:17 - INFO - __main__ - Step 39174: {'lr': 0.0004261263335518375, 'samples': 7521408, 'steps': 39173, 'loss/train': 1.765062689781189}}}} 11/07/2021 02:44:19 - INFO - __main__ - Step 39178: {'lr': 0.0004261112681588793, 'samples': 7522176, 'steps': 39177, 'loss/train': 1.85269033908844}}}}} 11/07/2021 02:44:19 - INFO - __main__ - Step 39178: {'lr': 0.0004261112681588793, 'samples': 7522176, 'steps': 39177, 'loss/train': 1.85269033908844}}}}} 11/07/2021 02:44:24 - INFO - __main__ - Step 39187: {'lr': 0.0004260773663827372, 'samples': 7523904, 'steps': 39186, 'loss/train': 1.276141881942749}}}} 11/07/2021 02:44:24 - INFO - __main__ - Step 39187: {'lr': 0.0004260773663827372, 'samples': 7523904, 'steps': 39186, 'loss/train': 1.276141881942749}}}} 11/07/2021 02:44:27 - INFO - __main__ - Step 39194: {'lr': 0.00042605099389164957, 'samples': 7525248, 'steps': 39193, 'loss/train': 1.0232011079788208}} 11/07/2021 02:44:29 - INFO - __main__ - Step 39199: {'lr': 0.0004260321540182057, 'samples': 7526208, 'steps': 39198, 'loss/train': 1.3887532949447632}}} 11/07/2021 02:44:32 - INFO - __main__ - Step 39204: {'lr': 0.0004260133121618276, 'samples': 7527168, 'steps': 39203, 'loss/train': 1.5005837678909302}}} 11/07/2021 02:44:34 - INFO - __main__ - Step 39208: {'lr': 0.0004259982372491551, 'samples': 7527936, 'steps': 39207, 'loss/train': 1.186719536781311}}}} 11/07/2021 02:44:34 - INFO - __main__ - Step 39208: {'lr': 0.0004259982372491551, 'samples': 7527936, 'steps': 39207, 'loss/train': 1.186719536781311}}}} 11/07/2021 02:44:37 - INFO - __main__ - Step 39215: {'lr': 0.00042597185309891305, 'samples': 7529280, 'steps': 39214, 'loss/train': 1.9070416688919067}} 11/07/2021 02:44:37 - INFO - __main__ - Step 39215: {'lr': 0.00042597185309891305, 'samples': 7529280, 'steps': 39214, 'loss/train': 1.9070416688919067}} 11/07/2021 02:44:41 - INFO - __main__ - Step 39222: {'lr': 0.00042594546506345124, 'samples': 7530624, 'steps': 39221, 'loss/train': 0.7826408743858337}} 11/07/2021 02:44:43 - INFO - __main__ - Step 39227: {'lr': 0.00042592661408830937, 'samples': 7531584, 'steps': 39226, 'loss/train': 1.7915257215499878}} 11/07/2021 02:44:46 - INFO - __main__ - Step 39232: {'lr': 0.00042590776113142216, 'samples': 7532544, 'steps': 39231, 'loss/train': 1.5731817483901978}} 11/07/2021 02:44:46 - INFO - __main__ - Step 39232: {'lr': 0.00042590776113142216, 'samples': 7532544, 'steps': 39231, 'loss/train': 1.5731817483901978}} 11/07/2021 02:44:49 - INFO - __main__ - Step 39237: {'lr': 0.0004258889061930018, 'samples': 7533504, 'steps': 39236, 'loss/train': 1.6044750213623047}}} 11/07/2021 02:44:52 - INFO - __main__ - Step 39243: {'lr': 0.0004258662776515728, 'samples': 7534656, 'steps': 39242, 'loss/train': 1.8259460926055908}}} 11/07/2021 02:44:52 - INFO - __main__ - Step 39243: {'lr': 0.0004258662776515728, 'samples': 7534656, 'steps': 39242, 'loss/train': 1.8259460926055908}}} 11/07/2021 02:44:55 - INFO - __main__ - Step 39250: {'lr': 0.0004258398740810584, 'samples': 7536000, 'steps': 39249, 'loss/train': 1.7944103479385376}}} 11/07/2021 02:44:57 - INFO - __main__ - Step 39255: {'lr': 0.00042582101201087786, 'samples': 7536960, 'steps': 39254, 'loss/train': 1.1267701387405396}} 11/07/2021 02:45:00 - INFO - __main__ - Step 39260: {'lr': 0.0004258021479601414, 'samples': 7537920, 'steps': 39259, 'loss/train': 1.3875758647918701}}} 11/07/2021 02:45:00 - INFO - __main__ - Step 39260: {'lr': 0.0004258021479601414, 'samples': 7537920, 'steps': 39259, 'loss/train': 1.3875758647918701}}} 11/07/2021 02:45:03 - INFO - __main__ - Step 39267: {'lr': 0.0004257757349621811, 'samples': 7539264, 'steps': 39266, 'loss/train': 1.6467303037643433}}} 11/07/2021 02:45:05 - INFO - __main__ - Step 39271: {'lr': 0.0004257606400780117, 'samples': 7540032, 'steps': 39270, 'loss/train': 1.5877642631530762}}} 11/07/2021 02:45:07 - INFO - __main__ - Step 39275: {'lr': 0.0004257455439267218, 'samples': 7540800, 'steps': 39274, 'loss/train': 1.438101053237915}}}} 11/07/2021 02:45:09 - INFO - __main__ - Step 39279: {'lr': 0.0004257304465084203, 'samples': 7541568, 'steps': 39278, 'loss/train': 1.7904508113861084}}} 11/07/2021 02:45:11 - INFO - __main__ - Step 39283: {'lr': 0.00042571534782321593, 'samples': 7542336, 'steps': 39282, 'loss/train': 1.4070467948913574}} 11/07/2021 02:45:13 - INFO - __main__ - Step 39287: {'lr': 0.0004257002478712175, 'samples': 7543104, 'steps': 39286, 'loss/train': 1.1086269617080688}}} 11/07/2021 02:45:16 - INFO - __main__ - Step 39292: {'lr': 0.00042568137114995633, 'samples': 7544064, 'steps': 39291, 'loss/train': 1.7251018285751343}} 11/07/2021 02:45:18 - INFO - __main__ - Step 39296: {'lr': 0.0004256662683480695, 'samples': 7544832, 'steps': 39295, 'loss/train': 1.9215407371520996}}} 11/07/2021 02:45:20 - INFO - __main__ - Step 39300: {'lr': 0.0004256511642797426, 'samples': 7545600, 'steps': 39299, 'loss/train': 1.5300726890563965}}} 11/07/2021 02:45:22 - INFO - __main__ - Step 39304: {'lr': 0.00042563605894508434, 'samples': 7546368, 'steps': 39303, 'loss/train': 1.3211116790771484}} 11/07/2021 02:45:23 - INFO - __main__ - Step 39308: {'lr': 0.00042562095234420375, 'samples': 7547136, 'steps': 39307, 'loss/train': 1.361107349395752}}} 11/07/2021 02:45:25 - INFO - __main__ - Step 39312: {'lr': 0.0004256058444772097, 'samples': 7547904, 'steps': 39311, 'loss/train': 1.536171317100525}}}} 11/07/2021 02:45:28 - INFO - __main__ - Step 39317: {'lr': 0.00042558695786316106, 'samples': 7548864, 'steps': 39316, 'loss/train': 1.6797736883163452}} 11/07/2021 02:45:28 - INFO - __main__ - Step 39317: {'lr': 0.00042558695786316106, 'samples': 7548864, 'steps': 39316, 'loss/train': 1.6797736883163452}} 11/07/2021 02:45:31 - INFO - __main__ - Step 39324: {'lr': 0.000425560513280636, 'samples': 7550208, 'steps': 39323, 'loss/train': 1.1001230478286743}2}} 11/07/2021 02:45:33 - INFO - __main__ - Step 39328: {'lr': 0.0004255454003502774, 'samples': 7550976, 'steps': 39327, 'loss/train': 1.4416260719299316}}} 11/07/2021 02:45:35 - INFO - __main__ - Step 39333: {'lr': 0.0004255265074076358, 'samples': 7551936, 'steps': 39332, 'loss/train': 0.8778195381164551}}} 11/07/2021 02:45:35 - INFO - __main__ - Step 39333: {'lr': 0.0004255265074076358, 'samples': 7551936, 'steps': 39332, 'loss/train': 0.8778195381164551}}} 11/07/2021 02:45:39 - INFO - __main__ - Step 39340: {'lr': 0.0004255000539662247, 'samples': 7553280, 'steps': 39339, 'loss/train': 1.684509038925171}}}} 11/07/2021 02:45:41 - INFO - __main__ - Step 39344: {'lr': 0.0004254849359742449, 'samples': 7554048, 'steps': 39343, 'loss/train': 1.0768413543701172}}} 11/07/2021 02:45:41 - INFO - __main__ - Step 39344: {'lr': 0.0004254849359742449, 'samples': 7554048, 'steps': 39343, 'loss/train': 1.0768413543701172}}} 11/07/2021 02:45:46 - INFO - __main__ - Step 39353: {'lr': 0.00042545091586681404, 'samples': 7555776, 'steps': 39352, 'loss/train': 1.6890608072280884}} 11/07/2021 02:45:48 - INFO - __main__ - Step 39357: {'lr': 0.0004254357937635509, 'samples': 7556544, 'steps': 39356, 'loss/train': 1.8868955373764038}}} 11/07/2021 02:45:49 - INFO - __main__ - Step 39361: {'lr': 0.00042542067039550916, 'samples': 7557312, 'steps': 39360, 'loss/train': 1.9013553857803345}} 11/07/2021 02:45:51 - INFO - __main__ - Step 39365: {'lr': 0.00042540554576279776, 'samples': 7558080, 'steps': 39364, 'loss/train': 1.6894145011901855}} 11/07/2021 02:45:54 - INFO - __main__ - Step 39370: {'lr': 0.00042538663819363323, 'samples': 7559040, 'steps': 39369, 'loss/train': 1.4790570735931396}} 11/07/2021 02:45:56 - INFO - __main__ - Step 39375: {'lr': 0.0004253677286488058, 'samples': 7560000, 'steps': 39374, 'loss/train': 1.004982352256775}6}} 11/07/2021 02:45:58 - INFO - __main__ - Step 39379: {'lr': 0.0004253525995906098, 'samples': 7560768, 'steps': 39378, 'loss/train': 1.5641216039657593}}} 11/07/2021 02:45:58 - INFO - __main__ - Step 39379: {'lr': 0.0004253525995906098, 'samples': 7560768, 'steps': 39378, 'loss/train': 1.5641216039657593}}} 11/07/2021 02:46:01 - INFO - __main__ - Step 39386: {'lr': 0.00042532612069690214, 'samples': 7562112, 'steps': 39385, 'loss/train': 1.331007719039917}}} 11/07/2021 02:46:04 - INFO - __main__ - Step 39391: {'lr': 0.00042530720483138524, 'samples': 7563072, 'steps': 39390, 'loss/train': 0.7283228635787964}} 11/07/2021 02:46:04 - INFO - __main__ - Step 39391: {'lr': 0.00042530720483138524, 'samples': 7563072, 'steps': 39390, 'loss/train': 0.7283228635787964}} 11/07/2021 02:46:08 - INFO - __main__ - Step 39399: {'lr': 0.0004252769353391294, 'samples': 7564608, 'steps': 39398, 'loss/train': 0.9206255078315735}}} 11/07/2021 02:46:09 - INFO - __main__ - Step 39403: {'lr': 0.0004252617986974969, 'samples': 7565376, 'steps': 39402, 'loss/train': 0.9733449816703796}}} 11/07/2021 02:46:11 - INFO - __main__ - Step 39407: {'lr': 0.0004252466607923402, 'samples': 7566144, 'steps': 39406, 'loss/train': 1.5809663534164429}}} 11/07/2021 02:46:14 - INFO - __main__ - Step 39412: {'lr': 0.00042522773663422977, 'samples': 7567104, 'steps': 39411, 'loss/train': 1.7798511981964111}} 11/07/2021 02:46:16 - INFO - __main__ - Step 39416: {'lr': 0.00042521259588654264, 'samples': 7567872, 'steps': 39415, 'loss/train': 1.2348101139068604}} 11/07/2021 02:46:16 - INFO - __main__ - Step 39416: {'lr': 0.00042521259588654264, 'samples': 7567872, 'steps': 39415, 'loss/train': 1.2348101139068604}} 11/07/2021 02:46:19 - INFO - __main__ - Step 39423: {'lr': 0.00042518609653865444, 'samples': 7569216, 'steps': 39422, 'loss/train': 1.822353720664978}}} 11/07/2021 02:46:22 - INFO - __main__ - Step 39428: {'lr': 0.0004251671660649013, 'samples': 7570176, 'steps': 39427, 'loss/train': 1.613086223602295}}}} 11/07/2021 02:46:24 - INFO - __main__ - Step 39433: {'lr': 0.00042514823361795764, 'samples': 7571136, 'steps': 39432, 'loss/train': 0.7617422938346863}} 11/07/2021 02:46:24 - INFO - __main__ - Step 39433: {'lr': 0.00042514823361795764, 'samples': 7571136, 'steps': 39432, 'loss/train': 0.7617422938346863}} 11/07/2021 02:46:28 - INFO - __main__ - Step 39440: {'lr': 0.00042512172487768244, 'samples': 7572480, 'steps': 39439, 'loss/train': 1.3813639879226685}} 11/07/2021 02:46:30 - INFO - __main__ - Step 39444: {'lr': 0.0004251065752901018, 'samples': 7573248, 'steps': 39443, 'loss/train': 1.537869930267334}5}} 11/07/2021 02:46:32 - INFO - __main__ - Step 39449: {'lr': 0.00042508763653038167, 'samples': 7574208, 'steps': 39448, 'loss/train': 1.785377025604248}}} 11/07/2021 02:46:34 - INFO - __main__ - Step 39453: {'lr': 0.00042507248410254307, 'samples': 7574976, 'steps': 39452, 'loss/train': 1.2439583539962769}} 11/07/2021 02:46:36 - INFO - __main__ - Step 39457: {'lr': 0.00042505733041254526, 'samples': 7575744, 'steps': 39456, 'loss/train': 1.4447280168533325}} 11/07/2021 02:46:36 - INFO - __main__ - Step 39457: {'lr': 0.00042505733041254526, 'samples': 7575744, 'steps': 39456, 'loss/train': 1.4447280168533325}} 11/07/2021 02:46:39 - INFO - __main__ - Step 39464: {'lr': 0.00042503080841830654, 'samples': 7577088, 'steps': 39463, 'loss/train': 1.5276356935501099}} 11/07/2021 02:46:42 - INFO - __main__ - Step 39469: {'lr': 0.0004250118617706879, 'samples': 7578048, 'steps': 39468, 'loss/train': 1.4779454469680786}}} 11/07/2021 02:46:42 - INFO - __main__ - Step 39469: {'lr': 0.0004250118617706879, 'samples': 7578048, 'steps': 39468, 'loss/train': 1.4779454469680786}}} 11/07/2021 02:46:46 - INFO - __main__ - Step 39477: {'lr': 0.0004249815430339894, 'samples': 7579584, 'steps': 39476, 'loss/train': 0.690558135509491}}}} 11/07/2021 02:46:46 - INFO - __main__ - Step 39477: {'lr': 0.0004249815430339894, 'samples': 7579584, 'steps': 39476, 'loss/train': 0.690558135509491}}}} 11/07/2021 02:46:50 - INFO - __main__ - Step 39485: {'lr': 0.0004249512192512759, 'samples': 7581120, 'steps': 39484, 'loss/train': 1.6692142486572266}}} 11/07/2021 02:46:52 - INFO - __main__ - Step 39490: {'lr': 0.00042493226432503917, 'samples': 7582080, 'steps': 39489, 'loss/train': 1.0378392934799194}} 11/07/2021 02:46:52 - INFO - __main__ - Step 39490: {'lr': 0.00042493226432503917, 'samples': 7582080, 'steps': 39489, 'loss/train': 1.0378392934799194}} 11/07/2021 02:46:52 - INFO - __main__ - Step 39490: {'lr': 0.00042493226432503917, 'samples': 7582080, 'steps': 39489, 'loss/train': 1.0378392934799194}} 11/07/2021 02:46:58 - INFO - __main__ - Step 39500: {'lr': 0.00042489434856114565, 'samples': 7584000, 'steps': 39499, 'loss/train': 1.4942436218261719}} 11/07/2021 02:47:00 - INFO - __main__ - Step 39504: {'lr': 0.00042487918004896117, 'samples': 7584768, 'steps': 39503, 'loss/train': 1.7786004543304443}} 11/07/2021 02:47:02 - INFO - __main__ - Step 39508: {'lr': 0.00042486401027601084, 'samples': 7585536, 'steps': 39507, 'loss/train': 1.7957890033721924}} 11/07/2021 02:47:04 - INFO - __main__ - Step 39513: {'lr': 0.0004248450462870378, 'samples': 7586496, 'steps': 39512, 'loss/train': 1.3140740394592285}}} 11/07/2021 02:47:06 - INFO - __main__ - Step 39518: {'lr': 0.00042482608032850275, 'samples': 7587456, 'steps': 39517, 'loss/train': 1.4334967136383057}} 11/07/2021 02:47:09 - INFO - __main__ - Step 39522: {'lr': 0.00042481090614373364, 'samples': 7588224, 'steps': 39521, 'loss/train': 1.578730583190918}}} 11/07/2021 02:47:11 - INFO - __main__ - Step 39526: {'lr': 0.00042479573069869095, 'samples': 7588992, 'steps': 39525, 'loss/train': 1.9888464212417603}} 11/07/2021 02:47:12 - INFO - __main__ - Step 39530: {'lr': 0.00042478055399348415, 'samples': 7589760, 'steps': 39529, 'loss/train': 1.7055436372756958}} 11/07/2021 02:47:14 - INFO - __main__ - Step 39534: {'lr': 0.0004247653760282225, 'samples': 7590528, 'steps': 39533, 'loss/train': 1.2379640340805054}}} 11/07/2021 02:47:16 - INFO - __main__ - Step 39538: {'lr': 0.0004247501968030157, 'samples': 7591296, 'steps': 39537, 'loss/train': 1.1878610849380493}}} 11/07/2021 02:47:16 - INFO - __main__ - Step 39538: {'lr': 0.0004247501968030157, 'samples': 7591296, 'steps': 39537, 'loss/train': 1.1878610849380493}}} 11/07/2021 02:47:21 - INFO - __main__ - Step 39546: {'lr': 0.00042471983457320384, 'samples': 7592832, 'steps': 39545, 'loss/train': 0.9633825421333313}} 11/07/2021 02:47:22 - INFO - __main__ - Step 39550: {'lr': 0.00042470465156881765, 'samples': 7593600, 'steps': 39549, 'loss/train': 1.7505109310150146}} 11/07/2021 02:47:24 - INFO - __main__ - Step 39554: {'lr': 0.00042468946730492404, 'samples': 7594368, 'steps': 39553, 'loss/train': 1.6657792329788208}} 11/07/2021 02:47:27 - INFO - __main__ - Step 39559: {'lr': 0.00042467048520404126, 'samples': 7595328, 'steps': 39558, 'loss/train': 1.1775028705596924}} 11/07/2021 02:47:27 - INFO - __main__ - Step 39559: {'lr': 0.00042467048520404126, 'samples': 7595328, 'steps': 39558, 'loss/train': 1.1775028705596924}} 11/07/2021 02:47:30 - INFO - __main__ - Step 39566: {'lr': 0.0004246439069572926, 'samples': 7596672, 'steps': 39565, 'loss/train': 0.4988980293273926}}} 11/07/2021 02:47:32 - INFO - __main__ - Step 39570: {'lr': 0.0004246287176564637, 'samples': 7597440, 'steps': 39569, 'loss/train': 1.8463135957717896}}} 11/07/2021 02:47:35 - INFO - __main__ - Step 39576: {'lr': 0.00042460593134470426, 'samples': 7598592, 'steps': 39575, 'loss/train': 1.1359695196151733}} 11/07/2021 02:47:37 - INFO - __main__ - Step 39580: {'lr': 0.0004245907388966804, 'samples': 7599360, 'steps': 39579, 'loss/train': 1.2286467552185059}}} 11/07/2021 02:47:39 - INFO - __main__ - Step 39584: {'lr': 0.0004245755451899703, 'samples': 7600128, 'steps': 39583, 'loss/train': 2.1289525032043457}}} 11/07/2021 02:47:41 - INFO - __main__ - Step 39588: {'lr': 0.00042456035022468344, 'samples': 7600896, 'steps': 39587, 'loss/train': 1.5494352579116821}} 11/07/2021 02:47:42 - INFO - __main__ - Step 39592: {'lr': 0.00042454515400092944, 'samples': 7601664, 'steps': 39591, 'loss/train': 1.5542364120483398}} 11/07/2021 02:47:45 - INFO - __main__ - Step 39596: {'lr': 0.00042452995651881764, 'samples': 7602432, 'steps': 39595, 'loss/train': 1.5693578720092773}} 11/07/2021 02:47:47 - INFO - __main__ - Step 39601: {'lr': 0.00042451095789677943, 'samples': 7603392, 'steps': 39600, 'loss/train': 0.8249521255493164}} 11/07/2021 02:47:47 - INFO - __main__ - Step 39601: {'lr': 0.00042451095789677943, 'samples': 7603392, 'steps': 39600, 'loss/train': 0.8249521255493164}} 11/07/2021 02:47:50 - INFO - __main__ - Step 39608: {'lr': 0.00042448435652343223, 'samples': 7604736, 'steps': 39607, 'loss/train': 1.5574475526809692}} 11/07/2021 02:47:52 - INFO - __main__ - Step 39612: {'lr': 0.00042446915400898565, 'samples': 7605504, 'steps': 39611, 'loss/train': 1.1934031248092651}} 11/07/2021 02:47:55 - INFO - __main__ - Step 39617: {'lr': 0.0004244501490971454, 'samples': 7606464, 'steps': 39616, 'loss/train': 0.39124223589897156}} 11/07/2021 02:47:57 - INFO - __main__ - Step 39621: {'lr': 0.000424434943752781, 'samples': 7607232, 'steps': 39620, 'loss/train': 0.9253093004226685}6}} 11/07/2021 02:47:59 - INFO - __main__ - Step 39625: {'lr': 0.0004244197371508536, 'samples': 7608000, 'steps': 39624, 'loss/train': 1.5457366704940796}}} 11/07/2021 02:48:01 - INFO - __main__ - Step 39629: {'lr': 0.0004244045292914726, 'samples': 7608768, 'steps': 39628, 'loss/train': 1.6657663583755493}}} 11/07/2021 02:48:02 - INFO - __main__ - Step 39633: {'lr': 0.00042438932017474783, 'samples': 7609536, 'steps': 39632, 'loss/train': 1.414711594581604}}} 11/07/2021 02:48:04 - INFO - __main__ - Step 39637: {'lr': 0.00042437410980078894, 'samples': 7610304, 'steps': 39636, 'loss/train': 1.1887015104293823}} 11/07/2021 02:48:07 - INFO - __main__ - Step 39642: {'lr': 0.0004243550950655217, 'samples': 7611264, 'steps': 39641, 'loss/train': 0.8495831489562988}}} 11/07/2021 02:48:09 - INFO - __main__ - Step 39646: {'lr': 0.0004243398818631868, 'samples': 7612032, 'steps': 39645, 'loss/train': 1.6688364744186401}}} 11/07/2021 02:48:09 - INFO - __main__ - Step 39646: {'lr': 0.0004243398818631868, 'samples': 7612032, 'steps': 39645, 'loss/train': 1.6688364744186401}}} 11/07/2021 02:48:12 - INFO - __main__ - Step 39653: {'lr': 0.0004243132557348045, 'samples': 7613376, 'steps': 39652, 'loss/train': 0.45958590507507324}} 11/07/2021 02:48:15 - INFO - __main__ - Step 39658: {'lr': 0.0004242942347153542, 'samples': 7614336, 'steps': 39657, 'loss/train': 1.1589655876159668}}} 11/07/2021 02:48:15 - INFO - __main__ - Step 39658: {'lr': 0.0004242942347153542, 'samples': 7614336, 'steps': 39657, 'loss/train': 1.1589655876159668}}} 11/07/2021 02:48:18 - INFO - __main__ - Step 39665: {'lr': 0.0004242676019897314, 'samples': 7615680, 'steps': 39664, 'loss/train': 1.6339011192321777}}} 11/07/2021 02:48:20 - INFO - __main__ - Step 39669: {'lr': 0.0004242523815618473, 'samples': 7616448, 'steps': 39668, 'loss/train': 1.4685289859771729}}} 11/07/2021 02:48:23 - INFO - __main__ - Step 39674: {'lr': 0.0004242333542604079, 'samples': 7617408, 'steps': 39673, 'loss/train': 1.5013810396194458}}} 11/07/2021 02:48:23 - INFO - __main__ - Step 39674: {'lr': 0.0004242333542604079, 'samples': 7617408, 'steps': 39673, 'loss/train': 1.5013810396194458}}} 11/07/2021 02:48:27 - INFO - __main__ - Step 39682: {'lr': 0.0004242029064958372, 'samples': 7618944, 'steps': 39681, 'loss/train': 1.0614537000656128}}} 11/07/2021 02:48:29 - INFO - __main__ - Step 39686: {'lr': 0.00042418768072966163, 'samples': 7619712, 'steps': 39685, 'loss/train': 1.621649980545044}}} 11/07/2021 02:48:30 - INFO - __main__ - Step 39690: {'lr': 0.00042417245370770547, 'samples': 7620480, 'steps': 39689, 'loss/train': 1.6081323623657227}} 11/07/2021 02:48:32 - INFO - __main__ - Step 39694: {'lr': 0.0004241572254300786, 'samples': 7621248, 'steps': 39693, 'loss/train': 0.6049160957336426}}} 11/07/2021 02:48:34 - INFO - __main__ - Step 39698: {'lr': 0.00042414199589689084, 'samples': 7622016, 'steps': 39697, 'loss/train': 1.3225457668304443}} 11/07/2021 02:48:34 - INFO - __main__ - Step 39698: {'lr': 0.00042414199589689084, 'samples': 7622016, 'steps': 39697, 'loss/train': 1.3225457668304443}} 11/07/2021 02:48:38 - INFO - __main__ - Step 39706: {'lr': 0.0004241115330642717, 'samples': 7623552, 'steps': 39705, 'loss/train': 1.345481514930725}3}} 11/07/2021 02:48:40 - INFO - __main__ - Step 39710: {'lr': 0.00042409629976505994, 'samples': 7624320, 'steps': 39709, 'loss/train': 1.191588044166565}}} 11/07/2021 02:48:43 - INFO - __main__ - Step 39715: {'lr': 0.0004240772563760432, 'samples': 7625280, 'steps': 39714, 'loss/train': 1.3761696815490723}}} 11/07/2021 02:48:43 - INFO - __main__ - Step 39715: {'lr': 0.0004240772563760432, 'samples': 7625280, 'steps': 39714, 'loss/train': 1.3761696815490723}}} 11/07/2021 02:48:47 - INFO - __main__ - Step 39723: {'lr': 0.0004240467828750064, 'samples': 7626816, 'steps': 39722, 'loss/train': 1.5825248956680298}}} 11/07/2021 02:48:48 - INFO - __main__ - Step 39727: {'lr': 0.00042403154424228596, 'samples': 7627584, 'steps': 39726, 'loss/train': 0.6849703192710876}} 11/07/2021 02:48:50 - INFO - __main__ - Step 39731: {'lr': 0.00042401630435491073, 'samples': 7628352, 'steps': 39730, 'loss/train': 1.7260664701461792}} 11/07/2021 02:48:53 - INFO - __main__ - Step 39736: {'lr': 0.00042399725273150056, 'samples': 7629312, 'steps': 39735, 'loss/train': 1.8496798276901245}} 11/07/2021 02:48:55 - INFO - __main__ - Step 39740: {'lr': 0.0004239820100215537, 'samples': 7630080, 'steps': 39739, 'loss/train': 1.3782191276550293}}} 11/07/2021 02:48:57 - INFO - __main__ - Step 39744: {'lr': 0.000423966766057309, 'samples': 7630848, 'steps': 39743, 'loss/train': 1.6397227048873901}}}} 11/07/2021 02:48:58 - INFO - __main__ - Step 39748: {'lr': 0.0004239515208388764, 'samples': 7631616, 'steps': 39747, 'loss/train': 1.706516981124878}}}} 11/07/2021 02:49:01 - INFO - __main__ - Step 39752: {'lr': 0.00042393627436636597, 'samples': 7632384, 'steps': 39751, 'loss/train': 1.8266533613204956}} 11/07/2021 02:49:03 - INFO - __main__ - Step 39757: {'lr': 0.0004239172145123481, 'samples': 7633344, 'steps': 39756, 'loss/train': 1.6792773008346558}}} 11/07/2021 02:49:03 - INFO - __main__ - Step 39757: {'lr': 0.0004239172145123481, 'samples': 7633344, 'steps': 39756, 'loss/train': 1.6792773008346558}}} 11/07/2021 02:49:06 - INFO - __main__ - Step 39764: {'lr': 0.0004238905274254661, 'samples': 7634688, 'steps': 39763, 'loss/train': 1.3256386518478394}}} 11/07/2021 02:49:08 - INFO - __main__ - Step 39768: {'lr': 0.0004238752759377431, 'samples': 7635456, 'steps': 39767, 'loss/train': 1.544019103050232}}}} 11/07/2021 02:49:11 - INFO - __main__ - Step 39773: {'lr': 0.0004238562098153281, 'samples': 7636416, 'steps': 39772, 'loss/train': 1.5938646793365479}}} 11/07/2021 02:49:13 - INFO - __main__ - Step 39778: {'lr': 0.00042383714173449007, 'samples': 7637376, 'steps': 39777, 'loss/train': 1.3706766366958618}} 11/07/2021 02:49:13 - INFO - __main__ - Step 39778: {'lr': 0.00042383714173449007, 'samples': 7637376, 'steps': 39777, 'loss/train': 1.3706766366958618}} 11/07/2021 02:49:16 - INFO - __main__ - Step 39785: {'lr': 0.0004238104431315749, 'samples': 7638720, 'steps': 39784, 'loss/train': 1.6826781034469604}}} 11/07/2021 02:49:19 - INFO - __main__ - Step 39789: {'lr': 0.0004237951850640555, 'samples': 7639488, 'steps': 39788, 'loss/train': 1.75411057472229}4}}} 11/07/2021 02:49:21 - INFO - __main__ - Step 39794: {'lr': 0.0004237761107177068, 'samples': 7640448, 'steps': 39793, 'loss/train': 1.2133203744888306}}} 11/07/2021 02:49:21 - INFO - __main__ - Step 39794: {'lr': 0.0004237761107177068, 'samples': 7640448, 'steps': 39793, 'loss/train': 1.2133203744888306}}} 11/07/2021 02:49:24 - INFO - __main__ - Step 39801: {'lr': 0.00042374940334423194, 'samples': 7641792, 'steps': 39800, 'loss/train': 1.038809895515442}}} 11/07/2021 02:49:26 - INFO - __main__ - Step 39805: {'lr': 0.0004237341402655692, 'samples': 7642560, 'steps': 39804, 'loss/train': 1.4463545083999634}}} 11/07/2021 02:49:29 - INFO - __main__ - Step 39810: {'lr': 0.0004237150596559103, 'samples': 7643520, 'steps': 39809, 'loss/train': 1.6761316061019897}}} 11/07/2021 02:49:29 - INFO - __main__ - Step 39810: {'lr': 0.0004237150596559103, 'samples': 7643520, 'steps': 39809, 'loss/train': 1.6761316061019897}}} 11/07/2021 02:49:33 - INFO - __main__ - Step 39818: {'lr': 0.0004236845266103327, 'samples': 7645056, 'steps': 39817, 'loss/train': 1.579480528831482}}}} 11/07/2021 02:49:35 - INFO - __main__ - Step 39822: {'lr': 0.00042366925820925915, 'samples': 7645824, 'steps': 39821, 'loss/train': 1.3526073694229126}} 11/07/2021 02:49:37 - INFO - __main__ - Step 39826: {'lr': 0.0004236539885561427, 'samples': 7646592, 'steps': 39825, 'loss/train': 1.4713175296783447}}} 11/07/2021 02:49:39 - INFO - __main__ - Step 39830: {'lr': 0.0004236387176510933, 'samples': 7647360, 'steps': 39829, 'loss/train': 1.5187652111053467}}} 11/07/2021 02:49:41 - INFO - __main__ - Step 39835: {'lr': 0.0004236196272594186, 'samples': 7648320, 'steps': 39834, 'loss/train': 1.5928826332092285}}} 11/07/2021 02:49:43 - INFO - __main__ - Step 39840: {'lr': 0.0004236005349119858, 'samples': 7649280, 'steps': 39839, 'loss/train': 1.6861557960510254}}} 11/07/2021 02:49:45 - INFO - __main__ - Step 39844: {'lr': 0.0004235852596260382, 'samples': 7650048, 'steps': 39843, 'loss/train': 1.425424575805664}}}} 11/07/2021 02:49:47 - INFO - __main__ - Step 39848: {'lr': 0.00042356998308865323, 'samples': 7650816, 'steps': 39847, 'loss/train': 2.0486652851104736}} 11/07/2021 02:49:49 - INFO - __main__ - Step 39852: {'lr': 0.0004235547052999409, 'samples': 7651584, 'steps': 39851, 'loss/train': 1.3674862384796143}}} 11/07/2021 02:49:51 - INFO - __main__ - Step 39856: {'lr': 0.0004235394262600114, 'samples': 7652352, 'steps': 39855, 'loss/train': 1.801565170288086}}}} 11/07/2021 02:49:53 - INFO - __main__ - Step 39861: {'lr': 0.00042352032570074327, 'samples': 7653312, 'steps': 39860, 'loss/train': 1.4000568389892578}} 11/07/2021 02:49:53 - INFO - __main__ - Step 39861: {'lr': 0.00042352032570074327, 'samples': 7653312, 'steps': 39860, 'loss/train': 1.4000568389892578}} 11/07/2021 02:49:57 - INFO - __main__ - Step 39868: {'lr': 0.00042349358163402175, 'samples': 7654656, 'steps': 39867, 'loss/train': 2.038564920425415}}} 11/07/2021 02:49:59 - INFO - __main__ - Step 39872: {'lr': 0.0004234782975903253, 'samples': 7655424, 'steps': 39871, 'loss/train': 1.6400891542434692}}} 11/07/2021 02:50:01 - INFO - __main__ - Step 39877: {'lr': 0.0004234591907769681, 'samples': 7656384, 'steps': 39876, 'loss/train': 1.3981540203094482}}} 11/07/2021 02:50:04 - INFO - __main__ - Step 39882: {'lr': 0.0004234400820096601, 'samples': 7657344, 'steps': 39881, 'loss/train': 1.6279277801513672}}} 11/07/2021 02:50:04 - INFO - __main__ - Step 39882: {'lr': 0.0004234400820096601, 'samples': 7657344, 'steps': 39881, 'loss/train': 1.6279277801513672}}} 11/07/2021 02:50:07 - INFO - __main__ - Step 39889: {'lr': 0.00042341332645320126, 'samples': 7658688, 'steps': 39888, 'loss/train': 1.503919243812561}}} 11/07/2021 02:50:09 - INFO - __main__ - Step 39893: {'lr': 0.00042339803584473626, 'samples': 7659456, 'steps': 39892, 'loss/train': 1.4663480520248413}} 11/07/2021 02:50:11 - INFO - __main__ - Step 39898: {'lr': 0.000423378920826232, 'samples': 7660416, 'steps': 39897, 'loss/train': 3.15498423576355}413}} 11/07/2021 02:50:14 - INFO - __main__ - Step 39903: {'lr': 0.0004233598038546812, 'samples': 7661376, 'steps': 39902, 'loss/train': 1.5171977281570435}}} 11/07/2021 02:50:14 - INFO - __main__ - Step 39903: {'lr': 0.0004233598038546812, 'samples': 7661376, 'steps': 39902, 'loss/train': 1.5171977281570435}}} 11/07/2021 02:50:18 - INFO - __main__ - Step 39910: {'lr': 0.00042333303681380165, 'samples': 7662720, 'steps': 39909, 'loss/train': 1.294662594795227}}} 11/07/2021 02:50:19 - INFO - __main__ - Step 39914: {'lr': 0.0004233177396436064, 'samples': 7663488, 'steps': 39913, 'loss/train': 1.651315689086914}}}} 11/07/2021 02:50:21 - INFO - __main__ - Step 39919: {'lr': 0.00042329861642375347, 'samples': 7664448, 'steps': 39918, 'loss/train': 1.5415047407150269}} 11/07/2021 02:50:24 - INFO - __main__ - Step 39924: {'lr': 0.00042327949125175844, 'samples': 7665408, 'steps': 39923, 'loss/train': 1.445731282234192}}} 11/07/2021 02:50:26 - INFO - __main__ - Step 39928: {'lr': 0.000423264189708765, 'samples': 7666176, 'steps': 39927, 'loss/train': 1.2989650964736938}}}} 11/07/2021 02:50:28 - INFO - __main__ - Step 39932: {'lr': 0.0004232488869166488, 'samples': 7666944, 'steps': 39931, 'loss/train': 1.7143346071243286}}} 11/07/2021 02:50:28 - INFO - __main__ - Step 39932: {'lr': 0.0004232488869166488, 'samples': 7666944, 'steps': 39931, 'loss/train': 1.7143346071243286}}} 11/07/2021 02:50:31 - INFO - __main__ - Step 39939: {'lr': 0.0004232221040250758, 'samples': 7668288, 'steps': 39938, 'loss/train': 1.211232304573059}}}} 11/07/2021 02:50:33 - INFO - __main__ - Step 39944: {'lr': 0.0004232029710466671, 'samples': 7669248, 'steps': 39943, 'loss/train': 1.740206003189087}}}} 11/07/2021 02:50:33 - INFO - __main__ - Step 39944: {'lr': 0.0004232029710466671, 'samples': 7669248, 'steps': 39943, 'loss/train': 1.740206003189087}}}} 11/07/2021 02:50:38 - INFO - __main__ - Step 39952: {'lr': 0.0004231723542230885, 'samples': 7670784, 'steps': 39951, 'loss/train': 1.4010276794433594}}} 11/07/2021 02:50:40 - INFO - __main__ - Step 39956: {'lr': 0.0004231570439385531, 'samples': 7671552, 'steps': 39955, 'loss/train': 2.2703516483306885}}} 11/07/2021 02:50:41 - INFO - __main__ - Step 39960: {'lr': 0.0004231417324056674, 'samples': 7672320, 'steps': 39959, 'loss/train': 1.5038126707077026}}} 11/07/2021 02:50:44 - INFO - __main__ - Step 39965: {'lr': 0.00042312259123423584, 'samples': 7673280, 'steps': 39964, 'loss/train': 1.3081835508346558}} 11/07/2021 02:50:46 - INFO - __main__ - Step 39969: {'lr': 0.00042310727689296563, 'samples': 7674048, 'steps': 39968, 'loss/train': 0.4142704904079437}} 11/07/2021 02:50:48 - INFO - __main__ - Step 39973: {'lr': 0.00042309196130370396, 'samples': 7674816, 'steps': 39972, 'loss/train': 1.4492918252944946}} 11/07/2021 02:50:50 - INFO - __main__ - Step 39977: {'lr': 0.00042307664446656116, 'samples': 7675584, 'steps': 39976, 'loss/train': 1.8707165718078613}} 11/07/2021 02:50:51 - INFO - __main__ - Step 39981: {'lr': 0.0004230613263816478, 'samples': 7676352, 'steps': 39980, 'loss/train': 1.6573978662490845}}} 11/07/2021 02:50:53 - INFO - __main__ - Step 39985: {'lr': 0.00042304600704907416, 'samples': 7677120, 'steps': 39984, 'loss/train': 1.2177273035049438}} 11/07/2021 02:50:53 - INFO - __main__ - Step 39985: {'lr': 0.00042304600704907416, 'samples': 7677120, 'steps': 39984, 'loss/train': 1.2177273035049438}} 11/07/2021 02:50:57 - INFO - __main__ - Step 39993: {'lr': 0.0004230153646413881, 'samples': 7678656, 'steps': 39992, 'loss/train': 1.6313539743423462}}} 11/07/2021 02:50:59 - INFO - __main__ - Step 39997: {'lr': 0.00042300004156649654, 'samples': 7679424, 'steps': 39996, 'loss/train': 1.631992220878601}}} 11/07/2021 02:51:01 - INFO - __main__ - Step 40001: {'lr': 0.0004229847172443866, 'samples': 7680192, 'steps': 40000, 'loss/train': 1.474384069442749}}}} 11/07/2021 02:51:04 - INFO - __main__ - Step 40006: {'lr': 0.00042296556008801663, 'samples': 7681152, 'steps': 40005, 'loss/train': 1.5770862102508545}} 11/07/2021 02:51:06 - INFO - __main__ - Step 40010: {'lr': 0.0004229502329600692, 'samples': 7681920, 'steps': 40009, 'loss/train': 1.3491777181625366}}} 11/07/2021 02:51:06 - INFO - __main__ - Step 40010: {'lr': 0.0004229502329600692, 'samples': 7681920, 'steps': 40009, 'loss/train': 1.3491777181625366}}} 11/07/2021 02:51:09 - INFO - __main__ - Step 40017: {'lr': 0.0004229234074859726, 'samples': 7683264, 'steps': 40016, 'loss/train': 1.6694526672363281}}} 11/07/2021 02:51:12 - INFO - __main__ - Step 40023: {'lr': 0.0004229004111836907, 'samples': 7684416, 'steps': 40022, 'loss/train': 1.340185523033142}}}} 11/07/2021 02:51:14 - INFO - __main__ - Step 40027: {'lr': 0.00042288507875735455, 'samples': 7685184, 'steps': 40026, 'loss/train': 1.5051108598709106}} 11/07/2021 02:51:14 - INFO - __main__ - Step 40027: {'lr': 0.00042288507875735455, 'samples': 7685184, 'steps': 40026, 'loss/train': 1.5051108598709106}} 11/07/2021 02:51:17 - INFO - __main__ - Step 40033: {'lr': 0.00042286207778090447, 'samples': 7686336, 'steps': 40032, 'loss/train': 1.6635338068008423}} 11/07/2021 02:51:20 - INFO - __main__ - Step 40038: {'lr': 0.0004228429081585664, 'samples': 7687296, 'steps': 40037, 'loss/train': 1.2760931253433228}}} 11/07/2021 02:51:22 - INFO - __main__ - Step 40042: {'lr': 0.0004228275710588394, 'samples': 7688064, 'steps': 40041, 'loss/train': 1.1090223789215088}}} 11/07/2021 02:51:23 - INFO - __main__ - Step 40046: {'lr': 0.00042281223271313734, 'samples': 7688832, 'steps': 40045, 'loss/train': 1.2158195972442627}} 11/07/2021 02:51:26 - INFO - __main__ - Step 40051: {'lr': 0.000422793058029026, 'samples': 7689792, 'steps': 40050, 'loss/train': 1.6261519193649292}7}} 11/07/2021 02:51:26 - INFO - __main__ - Step 40051: {'lr': 0.000422793058029026, 'samples': 7689792, 'steps': 40050, 'loss/train': 1.6261519193649292}7}} 11/07/2021 02:51:30 - INFO - __main__ - Step 40059: {'lr': 0.0004227623744859276, 'samples': 7691328, 'steps': 40058, 'loss/train': 1.6112170219421387}}} 11/07/2021 02:51:32 - INFO - __main__ - Step 40063: {'lr': 0.0004227470308460657, 'samples': 7692096, 'steps': 40062, 'loss/train': 1.5447250604629517}}} 11/07/2021 02:51:33 - INFO - __main__ - Step 40067: {'lr': 0.00042273168596080934, 'samples': 7692864, 'steps': 40066, 'loss/train': 1.5752909183502197}} 11/07/2021 02:51:35 - INFO - __main__ - Step 40071: {'lr': 0.00042271633983026935, 'samples': 7693632, 'steps': 40070, 'loss/train': 1.406765341758728}}} 11/07/2021 02:51:38 - INFO - __main__ - Step 40076: {'lr': 0.00042269715541608265, 'samples': 7694592, 'steps': 40075, 'loss/train': 0.6684977412223816}} 11/07/2021 02:51:40 - INFO - __main__ - Step 40080: {'lr': 0.00042268180648405884, 'samples': 7695360, 'steps': 40079, 'loss/train': 1.6849303245544434}} 11/07/2021 02:51:42 - INFO - __main__ - Step 40084: {'lr': 0.0004226664563071109, 'samples': 7696128, 'steps': 40083, 'loss/train': 0.9506791234016418}}} 11/07/2021 02:51:44 - INFO - __main__ - Step 40088: {'lr': 0.0004226511048853495, 'samples': 7696896, 'steps': 40087, 'loss/train': 1.7920674085617065}}} 11/07/2021 02:51:45 - INFO - __main__ - Step 40092: {'lr': 0.0004226357522188853, 'samples': 7697664, 'steps': 40091, 'loss/train': 1.5058112144470215}}} 11/07/2021 02:51:48 - INFO - __main__ - Step 40097: {'lr': 0.00042261655963561043, 'samples': 7698624, 'steps': 40096, 'loss/train': 1.6845612525939941}} 11/07/2021 02:51:50 - INFO - __main__ - Step 40101: {'lr': 0.00042260120416896975, 'samples': 7699392, 'steps': 40100, 'loss/train': 1.2876131534576416}} 11/07/2021 02:51:52 - INFO - __main__ - Step 40105: {'lr': 0.00042258584745798595, 'samples': 7700160, 'steps': 40104, 'loss/train': 1.4063727855682373}} 11/07/2021 02:51:54 - INFO - __main__ - Step 40109: {'lr': 0.0004225704895027699, 'samples': 7700928, 'steps': 40108, 'loss/train': 1.637282133102417}3}} 11/07/2021 02:51:55 - INFO - __main__ - Step 40113: {'lr': 0.0004225551303034322, 'samples': 7701696, 'steps': 40112, 'loss/train': 1.6477985382080078}}} 11/07/2021 02:51:57 - INFO - __main__ - Step 40117: {'lr': 0.0004225397698600837, 'samples': 7702464, 'steps': 40116, 'loss/train': 1.5630342960357666}}} 11/07/2021 02:52:00 - INFO - __main__ - Step 40123: {'lr': 0.00042251672686278275, 'samples': 7703616, 'steps': 40122, 'loss/train': 1.510049819946289}}} 11/07/2021 02:52:00 - INFO - __main__ - Step 40123: {'lr': 0.00042251672686278275, 'samples': 7703616, 'steps': 40122, 'loss/train': 1.510049819946289}}} 11/07/2021 02:52:04 - INFO - __main__ - Step 40130: {'lr': 0.0004224898398290893, 'samples': 7704960, 'steps': 40129, 'loss/train': 1.2647972106933594}}} 11/07/2021 02:52:06 - INFO - __main__ - Step 40134: {'lr': 0.0004224744740999302, 'samples': 7705728, 'steps': 40133, 'loss/train': 1.0452232360839844}}} 11/07/2021 02:52:08 - INFO - __main__ - Step 40138: {'lr': 0.0004224591071273416, 'samples': 7706496, 'steps': 40137, 'loss/train': 1.529614806175232}}}} 11/07/2021 02:52:10 - INFO - __main__ - Step 40142: {'lr': 0.00042244373891143453, 'samples': 7707264, 'steps': 40141, 'loss/train': 3.466099739074707}}} 11/07/2021 02:52:10 - INFO - __main__ - Step 40142: {'lr': 0.00042244373891143453, 'samples': 7707264, 'steps': 40141, 'loss/train': 3.466099739074707}}} 11/07/2021 02:52:13 - INFO - __main__ - Step 40149: {'lr': 0.0004224168415421948, 'samples': 7708608, 'steps': 40148, 'loss/train': 1.2203428745269775}}} 11/07/2021 02:52:16 - INFO - __main__ - Step 40154: {'lr': 0.00042239762680490944, 'samples': 7709568, 'steps': 40153, 'loss/train': 1.6479051113128662}} 11/07/2021 02:52:18 - INFO - __main__ - Step 40158: {'lr': 0.00042238225361683593, 'samples': 7710336, 'steps': 40157, 'loss/train': 1.1780622005462646}} 11/07/2021 02:52:20 - INFO - __main__ - Step 40162: {'lr': 0.0004223668791859979, 'samples': 7711104, 'steps': 40161, 'loss/train': 1.98379385471344}46}} 11/07/2021 02:52:21 - INFO - __main__ - Step 40166: {'lr': 0.00042235150351250617, 'samples': 7711872, 'steps': 40165, 'loss/train': 1.8517236709594727}} 11/07/2021 02:52:23 - INFO - __main__ - Step 40170: {'lr': 0.0004223361265964716, 'samples': 7712640, 'steps': 40169, 'loss/train': 1.5532838106155396}}} 11/07/2021 02:52:26 - INFO - __main__ - Step 40175: {'lr': 0.00042231690370427135, 'samples': 7713600, 'steps': 40174, 'loss/train': 1.6584599018096924}} 11/07/2021 02:52:28 - INFO - __main__ - Step 40179: {'lr': 0.00042230152399292065, 'samples': 7714368, 'steps': 40178, 'loss/train': 1.0819612741470337}} 11/07/2021 02:52:30 - INFO - __main__ - Step 40183: {'lr': 0.0004222861430393875, 'samples': 7715136, 'steps': 40182, 'loss/train': 1.5716710090637207}}} 11/07/2021 02:52:31 - INFO - __main__ - Step 40187: {'lr': 0.0004222707608437827, 'samples': 7715904, 'steps': 40186, 'loss/train': 1.881104826927185}}}} 11/07/2021 02:52:33 - INFO - __main__ - Step 40191: {'lr': 0.00042225537740621713, 'samples': 7716672, 'steps': 40190, 'loss/train': 1.8040450811386108}} 11/07/2021 02:52:36 - INFO - __main__ - Step 40196: {'lr': 0.0004222361463629218, 'samples': 7717632, 'steps': 40195, 'loss/train': 1.5748728513717651}}} 11/07/2021 02:52:36 - INFO - __main__ - Step 40196: {'lr': 0.0004222361463629218, 'samples': 7717632, 'steps': 40195, 'loss/train': 1.5748728513717651}}} 11/07/2021 02:52:40 - INFO - __main__ - Step 40204: {'lr': 0.0004222053726581782, 'samples': 7719168, 'steps': 40203, 'loss/train': 1.7543646097183228}}} 11/07/2021 02:52:42 - INFO - __main__ - Step 40208: {'lr': 0.00042218998394351684, 'samples': 7719936, 'steps': 40207, 'loss/train': 1.4205678701400757}} 11/07/2021 02:52:44 - INFO - __main__ - Step 40212: {'lr': 0.00042217459398747703, 'samples': 7720704, 'steps': 40211, 'loss/train': 1.9883413314819336}} 11/07/2021 02:52:46 - INFO - __main__ - Step 40216: {'lr': 0.00042215920279016993, 'samples': 7721472, 'steps': 40215, 'loss/train': 1.5851691961288452}} 11/07/2021 02:52:46 - INFO - __main__ - Step 40216: {'lr': 0.00042215920279016993, 'samples': 7721472, 'steps': 40215, 'loss/train': 1.5851691961288452}} 11/07/2021 02:52:50 - INFO - __main__ - Step 40224: {'lr': 0.0004221284166721971, 'samples': 7723008, 'steps': 40223, 'loss/train': 1.2773767709732056}}} 11/07/2021 02:52:52 - INFO - __main__ - Step 40228: {'lr': 0.00042211302175175334, 'samples': 7723776, 'steps': 40227, 'loss/train': 1.2917256355285645}} 11/07/2021 02:52:54 - INFO - __main__ - Step 40232: {'lr': 0.0004220976255904861, 'samples': 7724544, 'steps': 40231, 'loss/train': 1.606628656387329}5}} 11/07/2021 02:52:56 - INFO - __main__ - Step 40236: {'lr': 0.00042208222818850634, 'samples': 7725312, 'steps': 40235, 'loss/train': 1.2168179750442505}} 11/07/2021 02:52:56 - INFO - __main__ - Step 40236: {'lr': 0.00042208222818850634, 'samples': 7725312, 'steps': 40235, 'loss/train': 1.2168179750442505}} 11/07/2021 02:52:56 - INFO - __main__ - Step 40236: {'lr': 0.00042208222818850634, 'samples': 7725312, 'steps': 40235, 'loss/train': 1.2168179750442505}} 11/07/2021 02:53:02 - INFO - __main__ - Step 40248: {'lr': 0.0004220360285394017, 'samples': 7727616, 'steps': 40247, 'loss/train': 1.0947076082229614}}} 11/07/2021 02:53:04 - INFO - __main__ - Step 40253: {'lr': 0.00042201677539097294, 'samples': 7728576, 'steps': 40252, 'loss/train': 1.3090541362762451}} 11/07/2021 02:53:04 - INFO - __main__ - Step 40253: {'lr': 0.00042201677539097294, 'samples': 7728576, 'steps': 40252, 'loss/train': 1.3090541362762451}} 11/07/2021 02:53:08 - INFO - __main__ - Step 40261: {'lr': 0.00042198596632315576, 'samples': 7730112, 'steps': 40260, 'loss/train': 5.88883113861084}1}} 11/07/2021 02:53:10 - INFO - __main__ - Step 40265: {'lr': 0.0004219705599293303, 'samples': 7730880, 'steps': 40264, 'loss/train': 1.0776889324188232}}} 11/07/2021 02:53:12 - INFO - __main__ - Step 40269: {'lr': 0.00042195515229570833, 'samples': 7731648, 'steps': 40268, 'loss/train': 1.5870234966278076}} 11/07/2021 02:53:14 - INFO - __main__ - Step 40274: {'lr': 0.0004219358910103862, 'samples': 7732608, 'steps': 40273, 'loss/train': 1.3778915405273438}}} 11/07/2021 02:53:14 - INFO - __main__ - Step 40274: {'lr': 0.0004219358910103862, 'samples': 7732608, 'steps': 40273, 'loss/train': 1.3778915405273438}}} 11/07/2021 02:53:19 - INFO - __main__ - Step 40282: {'lr': 0.000421905068925435, 'samples': 7734144, 'steps': 40281, 'loss/train': 1.269433856010437}8}}} 11/07/2021 02:53:20 - INFO - __main__ - Step 40286: {'lr': 0.00042188965602391726, 'samples': 7734912, 'steps': 40285, 'loss/train': 1.637909173965454}}} 11/07/2021 02:53:22 - INFO - __main__ - Step 40290: {'lr': 0.0004218742418831863, 'samples': 7735680, 'steps': 40289, 'loss/train': 1.6381466388702393}}} 11/07/2021 02:53:22 - INFO - __main__ - Step 40290: {'lr': 0.0004218742418831863, 'samples': 7735680, 'steps': 40289, 'loss/train': 1.6381466388702393}}} 11/07/2021 02:53:26 - INFO - __main__ - Step 40298: {'lr': 0.00042184340988452924, 'samples': 7737216, 'steps': 40297, 'loss/train': 1.3267289400100708}} 11/07/2021 02:53:28 - INFO - __main__ - Step 40302: {'lr': 0.00042182799202682543, 'samples': 7737984, 'steps': 40301, 'loss/train': 1.7396727800369263}} 11/07/2021 02:53:30 - INFO - __main__ - Step 40306: {'lr': 0.00042181257293035293, 'samples': 7738752, 'steps': 40305, 'loss/train': 0.8169730305671692}} 11/07/2021 02:53:33 - INFO - __main__ - Step 40311: {'lr': 0.00042179329731791324, 'samples': 7739712, 'steps': 40310, 'loss/train': 1.418582558631897}}} 11/07/2021 02:53:33 - INFO - __main__ - Step 40311: {'lr': 0.00042179329731791324, 'samples': 7739712, 'steps': 40310, 'loss/train': 1.418582558631897}}} 11/07/2021 02:53:36 - INFO - __main__ - Step 40318: {'lr': 0.00042176630820943515, 'samples': 7741056, 'steps': 40317, 'loss/train': 1.3726780414581299}} 11/07/2021 02:53:38 - INFO - __main__ - Step 40323: {'lr': 0.00042174702795291574, 'samples': 7742016, 'steps': 40322, 'loss/train': 1.3882884979248047}} 11/07/2021 02:53:41 - INFO - __main__ - Step 40328: {'lr': 0.00042172774576173226, 'samples': 7742976, 'steps': 40327, 'loss/train': 2.0860137939453125}} 11/07/2021 02:53:43 - INFO - __main__ - Step 40332: {'lr': 0.0004217123186159735, 'samples': 7743744, 'steps': 40331, 'loss/train': 1.5076559782028198}}} 11/07/2021 02:53:45 - INFO - __main__ - Step 40336: {'lr': 0.00042169689023227987, 'samples': 7744512, 'steps': 40335, 'loss/train': 1.7182577848434448}} 11/07/2021 02:53:46 - INFO - __main__ - Step 40340: {'lr': 0.0004216814606107627, 'samples': 7745280, 'steps': 40339, 'loss/train': 1.6288928985595703}}} 11/07/2021 02:53:48 - INFO - __main__ - Step 40344: {'lr': 0.00042166602975153333, 'samples': 7746048, 'steps': 40343, 'loss/train': 1.4574334621429443}} 11/07/2021 02:53:48 - INFO - __main__ - Step 40344: {'lr': 0.00042166602975153333, 'samples': 7746048, 'steps': 40343, 'loss/train': 1.4574334621429443}} 11/07/2021 02:53:52 - INFO - __main__ - Step 40352: {'lr': 0.0004216351643203828, 'samples': 7747584, 'steps': 40351, 'loss/train': 1.381427526473999}}}} 11/07/2021 02:53:55 - INFO - __main__ - Step 40356: {'lr': 0.00042161972974868415, 'samples': 7748352, 'steps': 40355, 'loss/train': 1.5978648662567139}} 11/07/2021 02:53:56 - INFO - __main__ - Step 40360: {'lr': 0.0004216042939397182, 'samples': 7749120, 'steps': 40359, 'loss/train': 1.4361519813537598}}} 11/07/2021 02:53:59 - INFO - __main__ - Step 40365: {'lr': 0.0004215849974387733, 'samples': 7750080, 'steps': 40364, 'loss/train': 1.1762034893035889}}} 11/07/2021 02:54:01 - INFO - __main__ - Step 40369: {'lr': 0.00042156955884636307, 'samples': 7750848, 'steps': 40368, 'loss/train': 1.3229519128799438}} 11/07/2021 02:54:03 - INFO - __main__ - Step 40373: {'lr': 0.00042155411901704723, 'samples': 7751616, 'steps': 40372, 'loss/train': 1.645806074142456}}} 11/07/2021 02:54:05 - INFO - __main__ - Step 40377: {'lr': 0.00042153867795093714, 'samples': 7752384, 'steps': 40376, 'loss/train': 1.8111679553985596}} 11/07/2021 02:54:06 - INFO - __main__ - Step 40381: {'lr': 0.0004215232356481442, 'samples': 7753152, 'steps': 40380, 'loss/train': 1.472200632095337}6}} 11/07/2021 02:54:09 - INFO - __main__ - Step 40386: {'lr': 0.00042150393103073736, 'samples': 7754112, 'steps': 40385, 'loss/train': 1.0497719049453735}} 11/07/2021 02:54:09 - INFO - __main__ - Step 40386: {'lr': 0.00042150393103073736, 'samples': 7754112, 'steps': 40385, 'loss/train': 1.0497719049453735}} 11/07/2021 02:54:12 - INFO - __main__ - Step 40393: {'lr': 0.00042147690132078136, 'samples': 7755456, 'steps': 40392, 'loss/train': 1.670060396194458}}} 11/07/2021 02:54:12 - INFO - __main__ - Step 40393: {'lr': 0.00042147690132078136, 'samples': 7755456, 'steps': 40392, 'loss/train': 1.670060396194458}}} 11/07/2021 02:54:16 - INFO - __main__ - Step 40401: {'lr': 0.00042144600558783284, 'samples': 7756992, 'steps': 40400, 'loss/train': 1.6899981498718262}} 11/07/2021 02:54:16 - INFO - __main__ - Step 40401: {'lr': 0.00042144600558783284, 'samples': 7756992, 'steps': 40400, 'loss/train': 1.6899981498718262}} 11/07/2021 02:54:20 - INFO - __main__ - Step 40409: {'lr': 0.0004214151049108252, 'samples': 7758528, 'steps': 40408, 'loss/train': 1.743821144104004}2}} 11/07/2021 02:54:22 - INFO - __main__ - Step 40413: {'lr': 0.00042139965271857774, 'samples': 7759296, 'steps': 40412, 'loss/train': 1.7358777523040771}} 11/07/2021 02:54:24 - INFO - __main__ - Step 40418: {'lr': 0.0004213803357406055, 'samples': 7760256, 'steps': 40417, 'loss/train': 1.7742410898208618}}} 11/07/2021 02:54:27 - INFO - __main__ - Step 40422: {'lr': 0.0004213648807682332, 'samples': 7761024, 'steps': 40421, 'loss/train': 1.5297152996063232}}} 11/07/2021 02:54:29 - INFO - __main__ - Step 40426: {'lr': 0.00042134942456043104, 'samples': 7761792, 'steps': 40425, 'loss/train': 1.590003490447998}}} 11/07/2021 02:54:30 - INFO - __main__ - Step 40430: {'lr': 0.0004213339671173103, 'samples': 7762560, 'steps': 40429, 'loss/train': 1.6742734909057617}}} 11/07/2021 02:54:32 - INFO - __main__ - Step 40434: {'lr': 0.00042131850843898255, 'samples': 7763328, 'steps': 40433, 'loss/train': 1.5195480585098267}} 11/07/2021 02:54:32 - INFO - __main__ - Step 40434: {'lr': 0.00042131850843898255, 'samples': 7763328, 'steps': 40433, 'loss/train': 1.5195480585098267}} 11/07/2021 02:54:32 - INFO - __main__ - Step 40434: {'lr': 0.00042131850843898255, 'samples': 7763328, 'steps': 40433, 'loss/train': 1.5195480585098267}} 11/07/2021 02:54:38 - INFO - __main__ - Step 40445: {'lr': 0.0004212759907054546, 'samples': 7765440, 'steps': 40444, 'loss/train': 1.7988765239715576}}} 11/07/2021 02:54:41 - INFO - __main__ - Step 40450: {'lr': 0.0004212566613758299, 'samples': 7766400, 'steps': 40449, 'loss/train': 1.6221541166305542}}} 11/07/2021 02:54:43 - INFO - __main__ - Step 40455: {'lr': 0.0004212373301170649, 'samples': 7767360, 'steps': 40454, 'loss/train': 1.7538222074508667}}} 11/07/2021 02:54:43 - INFO - __main__ - Step 40455: {'lr': 0.0004212373301170649, 'samples': 7767360, 'steps': 40454, 'loss/train': 1.7538222074508667}}} 11/07/2021 02:54:47 - INFO - __main__ - Step 40462: {'lr': 0.000421210263114253, 'samples': 7768704, 'steps': 40461, 'loss/train': 1.7173820734024048}}}} 11/07/2021 02:54:48 - INFO - __main__ - Step 40466: {'lr': 0.00042119479455828153, 'samples': 7769472, 'steps': 40465, 'loss/train': 1.2135385274887085}} 11/07/2021 02:54:51 - INFO - __main__ - Step 40471: {'lr': 0.0004211754571277313, 'samples': 7770432, 'steps': 40470, 'loss/train': 1.7979527711868286}}} 11/07/2021 02:54:53 - INFO - __main__ - Step 40475: {'lr': 0.0004211599857949583, 'samples': 7771200, 'steps': 40474, 'loss/train': 1.609128713607788}}}} 11/07/2021 02:54:53 - INFO - __main__ - Step 40475: {'lr': 0.0004211599857949583, 'samples': 7771200, 'steps': 40474, 'loss/train': 1.609128713607788}}}} 11/07/2021 02:54:56 - INFO - __main__ - Step 40482: {'lr': 0.00042113290799347376, 'samples': 7772544, 'steps': 40481, 'loss/train': 1.779977560043335}}} 11/07/2021 02:54:59 - INFO - __main__ - Step 40487: {'lr': 0.00042111356439336877, 'samples': 7773504, 'steps': 40486, 'loss/train': 1.8205519914627075}} 11/07/2021 02:55:01 - INFO - __main__ - Step 40492: {'lr': 0.0004210942188657356, 'samples': 7774464, 'steps': 40491, 'loss/train': 1.0204192399978638}}} 11/07/2021 02:55:01 - INFO - __main__ - Step 40492: {'lr': 0.0004210942188657356, 'samples': 7774464, 'steps': 40491, 'loss/train': 1.0204192399978638}}} 11/07/2021 02:55:04 - INFO - __main__ - Step 40499: {'lr': 0.00042106713188921647, 'samples': 7775808, 'steps': 40498, 'loss/train': 1.7913703918457031}} 11/07/2021 02:55:07 - INFO - __main__ - Step 40503: {'lr': 0.00042105165192111684, 'samples': 7776576, 'steps': 40502, 'loss/train': 1.5947679281234741}} 11/07/2021 02:55:09 - INFO - __main__ - Step 40507: {'lr': 0.00042103617071984544, 'samples': 7777344, 'steps': 40506, 'loss/train': 1.195376992225647}}} 11/07/2021 02:55:11 - INFO - __main__ - Step 40511: {'lr': 0.000421020688285514, 'samples': 7778112, 'steps': 40510, 'loss/train': 1.4846168756484985}}}} 11/07/2021 02:55:13 - INFO - __main__ - Step 40515: {'lr': 0.0004210052046182339, 'samples': 7778880, 'steps': 40514, 'loss/train': 1.8797236680984497}}} 11/07/2021 02:55:15 - INFO - __main__ - Step 40519: {'lr': 0.00042098971971811695, 'samples': 7779648, 'steps': 40518, 'loss/train': 1.330376148223877}}} 11/07/2021 02:55:17 - INFO - __main__ - Step 40523: {'lr': 0.0004209742335852747, 'samples': 7780416, 'steps': 40522, 'loss/train': 1.586439609527588}}}} 11/07/2021 02:55:19 - INFO - __main__ - Step 40528: {'lr': 0.0004209548741858721, 'samples': 7781376, 'steps': 40527, 'loss/train': 1.5555566549301147}}} 11/07/2021 02:55:21 - INFO - __main__ - Step 40532: {'lr': 0.0004209393852798062, 'samples': 7782144, 'steps': 40531, 'loss/train': 1.7007616758346558}}} 11/07/2021 02:55:21 - INFO - __main__ - Step 40532: {'lr': 0.0004209393852798062, 'samples': 7782144, 'steps': 40531, 'loss/train': 1.7007616758346558}}} 11/07/2021 02:55:25 - INFO - __main__ - Step 40539: {'lr': 0.00042091227672888624, 'samples': 7783488, 'steps': 40538, 'loss/train': 1.7245628833770752}} 11/07/2021 02:55:27 - INFO - __main__ - Step 40544: {'lr': 0.0004208929111678811, 'samples': 7784448, 'steps': 40543, 'loss/train': 1.6190506219863892}}} 11/07/2021 02:55:29 - INFO - __main__ - Step 40548: {'lr': 0.00042087741733303575, 'samples': 7785216, 'steps': 40547, 'loss/train': 1.1815096139907837}} 11/07/2021 02:55:29 - INFO - __main__ - Step 40548: {'lr': 0.00042087741733303575, 'samples': 7785216, 'steps': 40547, 'loss/train': 1.1815096139907837}} 11/07/2021 02:55:32 - INFO - __main__ - Step 40555: {'lr': 0.0004208503001578266, 'samples': 7786560, 'steps': 40554, 'loss/train': 1.30928373336792}37}} 11/07/2021 02:55:35 - INFO - __main__ - Step 40560: {'lr': 0.00042083092843745275, 'samples': 7787520, 'steps': 40559, 'loss/train': 1.7562403678894043}} 11/07/2021 02:55:37 - INFO - __main__ - Step 40565: {'lr': 0.0004208115547927345, 'samples': 7788480, 'steps': 40564, 'loss/train': 1.2707369327545166}}} 11/07/2021 02:55:37 - INFO - __main__ - Step 40565: {'lr': 0.0004208115547927345, 'samples': 7788480, 'steps': 40564, 'loss/train': 1.2707369327545166}}} 11/07/2021 02:55:41 - INFO - __main__ - Step 40572: {'lr': 0.0004207844284576455, 'samples': 7789824, 'steps': 40571, 'loss/train': 1.7188644409179688}}} 11/07/2021 02:55:43 - INFO - __main__ - Step 40576: {'lr': 0.0004207689260017369, 'samples': 7790592, 'steps': 40575, 'loss/train': 1.788718581199646}}}} 11/07/2021 02:55:45 - INFO - __main__ - Step 40581: {'lr': 0.0004207495462005828, 'samples': 7791552, 'steps': 40580, 'loss/train': 1.9620747566223145}}} 11/07/2021 02:55:45 - INFO - __main__ - Step 40581: {'lr': 0.0004207495462005828, 'samples': 7791552, 'steps': 40580, 'loss/train': 1.9620747566223145}}} 11/07/2021 02:55:49 - INFO - __main__ - Step 40588: {'lr': 0.0004207224112476573, 'samples': 7792896, 'steps': 40587, 'loss/train': 1.7950011491775513}}} 11/07/2021 02:55:51 - INFO - __main__ - Step 40592: {'lr': 0.00042070690386788545, 'samples': 7793664, 'steps': 40591, 'loss/train': 1.5835875272750854}} 11/07/2021 02:55:53 - INFO - __main__ - Step 40596: {'lr': 0.00042069139525742727, 'samples': 7794432, 'steps': 40595, 'loss/train': 1.6492644548416138}} 11/07/2021 02:55:55 - INFO - __main__ - Step 40601: {'lr': 0.00042067200776387215, 'samples': 7795392, 'steps': 40600, 'loss/train': 1.5179119110107422}} 11/07/2021 02:55:55 - INFO - __main__ - Step 40601: {'lr': 0.00042067200776387215, 'samples': 7795392, 'steps': 40600, 'loss/train': 1.5179119110107422}} 11/07/2021 02:56:00 - INFO - __main__ - Step 40609: {'lr': 0.0004206409837753618, 'samples': 7796928, 'steps': 40608, 'loss/train': 1.832965612411499}2}} 11/07/2021 02:56:01 - INFO - __main__ - Step 40613: {'lr': 0.0004206254699357341, 'samples': 7797696, 'steps': 40612, 'loss/train': 1.1263415813446045}}} 11/07/2021 02:56:03 - INFO - __main__ - Step 40617: {'lr': 0.0004206099548660071, 'samples': 7798464, 'steps': 40616, 'loss/train': 1.47562575340271}5}}} 11/07/2021 02:56:06 - INFO - __main__ - Step 40622: {'lr': 0.00042059055929919163, 'samples': 7799424, 'steps': 40621, 'loss/train': 1.4647282361984253}} 11/07/2021 02:56:08 - INFO - __main__ - Step 40626: {'lr': 0.0004205750414621503, 'samples': 7800192, 'steps': 40625, 'loss/train': 1.3805701732635498}}} 11/07/2021 02:56:08 - INFO - __main__ - Step 40626: {'lr': 0.0004205750414621503, 'samples': 7800192, 'steps': 40625, 'loss/train': 1.3805701732635498}}} 11/07/2021 02:56:11 - INFO - __main__ - Step 40633: {'lr': 0.00042054788228834374, 'samples': 7801536, 'steps': 40632, 'loss/train': 1.3163074254989624}} 11/07/2021 02:56:14 - INFO - __main__ - Step 40639: {'lr': 0.00042052459999948323, 'samples': 7802688, 'steps': 40638, 'loss/train': 1.222219467163086}}} 11/07/2021 02:56:14 - INFO - __main__ - Step 40639: {'lr': 0.00042052459999948323, 'samples': 7802688, 'steps': 40638, 'loss/train': 1.222219467163086}}} 11/07/2021 02:56:18 - INFO - __main__ - Step 40646: {'lr': 0.00042049743383314577, 'samples': 7804032, 'steps': 40645, 'loss/train': 1.396903395652771}}} 11/07/2021 02:56:19 - INFO - __main__ - Step 40650: {'lr': 0.00042048190861936866, 'samples': 7804800, 'steps': 40649, 'loss/train': 1.5080392360687256}} 11/07/2021 02:56:21 - INFO - __main__ - Step 40654: {'lr': 0.00042046638217652717, 'samples': 7805568, 'steps': 40653, 'loss/train': 1.3786760568618774}} 11/07/2021 02:56:24 - INFO - __main__ - Step 40660: {'lr': 0.00042044309020801434, 'samples': 7806720, 'steps': 40659, 'loss/train': 1.272469162940979}}} 11/07/2021 02:56:24 - INFO - __main__ - Step 40660: {'lr': 0.00042044309020801434, 'samples': 7806720, 'steps': 40659, 'loss/train': 1.272469162940979}}} 11/07/2021 02:56:28 - INFO - __main__ - Step 40667: {'lr': 0.0004204159127504202, 'samples': 7808064, 'steps': 40666, 'loss/train': 1.802840232849121}}}} 11/07/2021 02:56:29 - INFO - __main__ - Step 40671: {'lr': 0.0004204003810853045, 'samples': 7808832, 'steps': 40670, 'loss/train': 1.3344838619232178}}} 11/07/2021 02:56:31 - INFO - __main__ - Step 40675: {'lr': 0.0004203848481917122, 'samples': 7809600, 'steps': 40674, 'loss/train': 1.5235605239868164}}} 11/07/2021 02:56:34 - INFO - __main__ - Step 40680: {'lr': 0.0004203654303473474, 'samples': 7810560, 'steps': 40679, 'loss/train': 1.413865327835083}}}} 11/07/2021 02:56:36 - INFO - __main__ - Step 40684: {'lr': 0.00042034989469009245, 'samples': 7811328, 'steps': 40683, 'loss/train': 1.418968915939331}}} 11/07/2021 02:56:38 - INFO - __main__ - Step 40688: {'lr': 0.00042033435780472494, 'samples': 7812096, 'steps': 40687, 'loss/train': 1.5045456886291504}} 11/07/2021 02:56:39 - INFO - __main__ - Step 40692: {'lr': 0.000420318819691357, 'samples': 7812864, 'steps': 40691, 'loss/train': 1.6695582866668701}4}} 11/07/2021 02:56:41 - INFO - __main__ - Step 40696: {'lr': 0.00042030328035010047, 'samples': 7813632, 'steps': 40695, 'loss/train': 1.2691519260406494}} 11/07/2021 02:56:44 - INFO - __main__ - Step 40701: {'lr': 0.0004202838544469822, 'samples': 7814592, 'steps': 40700, 'loss/train': 1.5229068994522095}}} 11/07/2021 02:56:46 - INFO - __main__ - Step 40705: {'lr': 0.00042026831234338614, 'samples': 7815360, 'steps': 40704, 'loss/train': 1.6805357933044434}} 11/07/2021 02:56:46 - INFO - __main__ - Step 40705: {'lr': 0.00042026831234338614, 'samples': 7815360, 'steps': 40704, 'loss/train': 1.6805357933044434}} 11/07/2021 02:56:49 - INFO - __main__ - Step 40711: {'lr': 0.0004202449968864188, 'samples': 7816512, 'steps': 40710, 'loss/train': 2.121345043182373}4}} 11/07/2021 02:56:52 - INFO - __main__ - Step 40716: {'lr': 0.0004202255652294114, 'samples': 7817472, 'steps': 40715, 'loss/train': 1.2422605752944946}}} 11/07/2021 02:56:54 - INFO - __main__ - Step 40720: {'lr': 0.0004202100185231767, 'samples': 7818240, 'steps': 40719, 'loss/train': 1.5049687623977661}}} 11/07/2021 02:56:54 - INFO - __main__ - Step 40720: {'lr': 0.0004202100185231767, 'samples': 7818240, 'steps': 40719, 'loss/train': 1.5049687623977661}}} 11/07/2021 02:56:57 - INFO - __main__ - Step 40727: {'lr': 0.00042018280883461415, 'samples': 7819584, 'steps': 40726, 'loss/train': 1.242191195487976}}} 11/07/2021 02:56:59 - INFO - __main__ - Step 40732: {'lr': 0.0004201633710422962, 'samples': 7820544, 'steps': 40731, 'loss/train': 1.4807544946670532}}} 11/07/2021 02:57:02 - INFO - __main__ - Step 40737: {'lr': 0.00042014393133315366, 'samples': 7821504, 'steps': 40736, 'loss/train': 1.5821160078048706}} 11/07/2021 02:57:02 - INFO - __main__ - Step 40737: {'lr': 0.00042014393133315366, 'samples': 7821504, 'steps': 40736, 'loss/train': 1.5821160078048706}} 11/07/2021 02:57:06 - INFO - __main__ - Step 40745: {'lr': 0.0004201128238120766, 'samples': 7823040, 'steps': 40744, 'loss/train': 1.352439045906067}6}} 11/07/2021 02:57:08 - INFO - __main__ - Step 40749: {'lr': 0.0004200972682118769, 'samples': 7823808, 'steps': 40748, 'loss/train': 1.8409411907196045}}} 11/07/2021 02:57:09 - INFO - __main__ - Step 40753: {'lr': 0.000420081711385386, 'samples': 7824576, 'steps': 40752, 'loss/train': 1.7906991243362427}}}} 11/07/2021 02:57:11 - INFO - __main__ - Step 40757: {'lr': 0.00042006615333271585, 'samples': 7825344, 'steps': 40756, 'loss/train': 1.609025478363037}}} 11/07/2021 02:57:11 - INFO - __main__ - Step 40757: {'lr': 0.00042006615333271585, 'samples': 7825344, 'steps': 40756, 'loss/train': 1.609025478363037}}} 11/07/2021 02:57:11 - INFO - __main__ - Step 40757: {'lr': 0.00042006615333271585, 'samples': 7825344, 'steps': 40756, 'loss/train': 1.609025478363037}}} 11/07/2021 02:57:17 - INFO - __main__ - Step 40769: {'lr': 0.0004200194718187527, 'samples': 7827648, 'steps': 40768, 'loss/train': 1.1773542165756226}}} 11/07/2021 02:57:19 - INFO - __main__ - Step 40773: {'lr': 0.00042000390886248783, 'samples': 7828416, 'steps': 40772, 'loss/train': 5.857418537139893}}} 11/07/2021 02:57:21 - INFO - __main__ - Step 40778: {'lr': 0.0004199844534436443, 'samples': 7829376, 'steps': 40777, 'loss/train': 2.341153621673584}}}} 11/07/2021 02:57:21 - INFO - __main__ - Step 40778: {'lr': 0.0004199844534436443, 'samples': 7829376, 'steps': 40777, 'loss/train': 2.341153621673584}}}} 11/07/2021 02:57:25 - INFO - __main__ - Step 40786: {'lr': 0.0004199533207907827, 'samples': 7830912, 'steps': 40785, 'loss/train': 1.3264210224151611}}} 11/07/2021 02:57:25 - INFO - __main__ - Step 40786: {'lr': 0.0004199533207907827, 'samples': 7830912, 'steps': 40785, 'loss/train': 1.3264210224151611}}} 11/07/2021 02:57:29 - INFO - __main__ - Step 40793: {'lr': 0.000419926075699135, 'samples': 7832256, 'steps': 40792, 'loss/train': 1.2800946235656738}}}} 11/07/2021 02:57:32 - INFO - __main__ - Step 40799: {'lr': 0.0004199027197773375, 'samples': 7833408, 'steps': 40798, 'loss/train': 2.1304287910461426}}} 11/07/2021 02:57:32 - INFO - __main__ - Step 40799: {'lr': 0.0004199027197773375, 'samples': 7833408, 'steps': 40798, 'loss/train': 2.1304287910461426}}} 11/07/2021 02:57:36 - INFO - __main__ - Step 40806: {'lr': 0.0004198754677186565, 'samples': 7834752, 'steps': 40805, 'loss/train': 1.5161305665969849}}} 11/07/2021 02:57:37 - INFO - __main__ - Step 40810: {'lr': 0.0004198598934297055, 'samples': 7835520, 'steps': 40809, 'loss/train': 1.3566529750823975}}} 11/07/2021 02:57:37 - INFO - __main__ - Step 40810: {'lr': 0.0004198598934297055, 'samples': 7835520, 'steps': 40809, 'loss/train': 1.3566529750823975}}} 11/07/2021 02:57:42 - INFO - __main__ - Step 40818: {'lr': 0.00041982874117817593, 'samples': 7837056, 'steps': 40817, 'loss/train': 1.5131193399429321}} 11/07/2021 02:57:43 - INFO - __main__ - Step 40822: {'lr': 0.000419813163215822, 'samples': 7837824, 'steps': 40821, 'loss/train': 1.3809610605239868}1}} 11/07/2021 02:57:45 - INFO - __main__ - Step 40826: {'lr': 0.00041979758402922496, 'samples': 7838592, 'steps': 40825, 'loss/train': 1.2652628421783447}} 11/07/2021 02:57:48 - INFO - __main__ - Step 40831: {'lr': 0.000419778108324558, 'samples': 7839552, 'steps': 40830, 'loss/train': 1.4193148612976074}7}} 11/07/2021 02:57:48 - INFO - __main__ - Step 40831: {'lr': 0.000419778108324558, 'samples': 7839552, 'steps': 40830, 'loss/train': 1.4193148612976074}7}} 11/07/2021 02:57:52 - INFO - __main__ - Step 40838: {'lr': 0.0004197508391250988, 'samples': 7840896, 'steps': 40837, 'loss/train': 1.9184430837631226}}} 11/07/2021 02:57:53 - INFO - __main__ - Step 40842: {'lr': 0.0004197352550426528, 'samples': 7841664, 'steps': 40841, 'loss/train': 1.584807276725769}}}} 11/07/2021 02:57:55 - INFO - __main__ - Step 40846: {'lr': 0.00041971966973652545, 'samples': 7842432, 'steps': 40845, 'loss/train': 1.2680137157440186}} 11/07/2021 02:57:58 - INFO - __main__ - Step 40851: {'lr': 0.00041970018638323546, 'samples': 7843392, 'steps': 40850, 'loss/train': 1.5288792848587036}} 11/07/2021 02:58:00 - INFO - __main__ - Step 40855: {'lr': 0.0004196845983242358, 'samples': 7844160, 'steps': 40854, 'loss/train': 1.6631810665130615}}} 11/07/2021 02:58:00 - INFO - __main__ - Step 40855: {'lr': 0.0004196845983242358, 'samples': 7844160, 'steps': 40854, 'loss/train': 1.6631810665130615}}} 11/07/2021 02:58:03 - INFO - __main__ - Step 40862: {'lr': 0.0004196573162774494, 'samples': 7845504, 'steps': 40861, 'loss/train': 0.9696474671363831}}} 11/07/2021 02:58:06 - INFO - __main__ - Step 40868: {'lr': 0.00041963392868454163, 'samples': 7846656, 'steps': 40867, 'loss/train': 1.4458591938018799}} 11/07/2021 02:58:08 - INFO - __main__ - Step 40872: {'lr': 0.0004196183354272244, 'samples': 7847424, 'steps': 40871, 'loss/train': 1.2333451509475708}}} 11/07/2021 02:58:08 - INFO - __main__ - Step 40872: {'lr': 0.0004196183354272244, 'samples': 7847424, 'steps': 40871, 'loss/train': 1.2333451509475708}}} 11/07/2021 02:58:11 - INFO - __main__ - Step 40879: {'lr': 0.00041959104428453175, 'samples': 7848768, 'steps': 40878, 'loss/train': 1.2749711275100708}} 11/07/2021 02:58:13 - INFO - __main__ - Step 40883: {'lr': 0.0004195754476646793, 'samples': 7849536, 'steps': 40882, 'loss/train': 1.359736442565918}8}} 11/07/2021 02:58:16 - INFO - __main__ - Step 40888: {'lr': 0.0004195559501706951, 'samples': 7850496, 'steps': 40887, 'loss/train': 1.023625373840332}8}} 11/07/2021 02:58:18 - INFO - __main__ - Step 40892: {'lr': 0.00041954035080030985, 'samples': 7851264, 'steps': 40891, 'loss/train': 1.0924385786056519}} 11/07/2021 02:58:20 - INFO - __main__ - Step 40896: {'lr': 0.00041952475020764834, 'samples': 7852032, 'steps': 40895, 'loss/train': 1.1508278846740723}} 11/07/2021 02:58:21 - INFO - __main__ - Step 40900: {'lr': 0.0004195091483928231, 'samples': 7852800, 'steps': 40899, 'loss/train': 1.6984916925430298}}} 11/07/2021 02:58:23 - INFO - __main__ - Step 40904: {'lr': 0.00041949354535594655, 'samples': 7853568, 'steps': 40903, 'loss/train': 1.4739363193511963}} 11/07/2021 02:58:26 - INFO - __main__ - Step 40909: {'lr': 0.0004194740398415125, 'samples': 7854528, 'steps': 40908, 'loss/train': 1.4221503734588623}}} 11/07/2021 02:58:26 - INFO - __main__ - Step 40909: {'lr': 0.0004194740398415125, 'samples': 7854528, 'steps': 40908, 'loss/train': 1.4221503734588623}}} 11/07/2021 02:58:29 - INFO - __main__ - Step 40916: {'lr': 0.0004194467289141339, 'samples': 7855872, 'steps': 40915, 'loss/train': 0.865330696105957}}}} 11/07/2021 02:58:31 - INFO - __main__ - Step 40920: {'lr': 0.000419431120990177, 'samples': 7856640, 'steps': 40919, 'loss/train': 1.536737084388733}}}}} 11/07/2021 02:58:34 - INFO - __main__ - Step 40925: {'lr': 0.0004194116093675256, 'samples': 7857600, 'steps': 40924, 'loss/train': 1.5521842241287231}}} 11/07/2021 02:58:36 - INFO - __main__ - Step 40930: {'lr': 0.00041939209583651774, 'samples': 7858560, 'steps': 40929, 'loss/train': 1.7363842725753784}} 11/07/2021 02:58:38 - INFO - __main__ - Step 40934: {'lr': 0.0004193764836378425, 'samples': 7859328, 'steps': 40933, 'loss/train': 1.413246989250183}4}} 11/07/2021 02:58:38 - INFO - __main__ - Step 40934: {'lr': 0.0004193764836378425, 'samples': 7859328, 'steps': 40933, 'loss/train': 1.413246989250183}4}} 11/07/2021 02:58:41 - INFO - __main__ - Step 40941: {'lr': 0.000419349159351969, 'samples': 7860672, 'steps': 40940, 'loss/train': 1.786657691001892}}4}} 11/07/2021 02:58:43 - INFO - __main__ - Step 40945: {'lr': 0.00041933354379555376, 'samples': 7861440, 'steps': 40944, 'loss/train': 1.540553331375122}}} 11/07/2021 02:58:46 - INFO - __main__ - Step 40950: {'lr': 0.00041931402263331856, 'samples': 7862400, 'steps': 40949, 'loss/train': 1.102845549583435}}} 11/07/2021 02:58:48 - INFO - __main__ - Step 40955: {'lr': 0.00041929449956382625, 'samples': 7863360, 'steps': 40954, 'loss/train': 1.2445733547210693}} 11/07/2021 02:58:50 - INFO - __main__ - Step 40959: {'lr': 0.00041927887973515493, 'samples': 7864128, 'steps': 40958, 'loss/train': 1.483497977256775}}} 11/07/2021 02:58:52 - INFO - __main__ - Step 40963: {'lr': 0.00041926325868609247, 'samples': 7864896, 'steps': 40962, 'loss/train': 1.5598963499069214}} 11/07/2021 02:58:54 - INFO - __main__ - Step 40967: {'lr': 0.0004192476364167514, 'samples': 7865664, 'steps': 40966, 'loss/train': 1.754093885421753}4}} 11/07/2021 02:58:56 - INFO - __main__ - Step 40971: {'lr': 0.00041923201292724436, 'samples': 7866432, 'steps': 40970, 'loss/train': 1.8128414154052734}} 11/07/2021 02:58:56 - INFO - __main__ - Step 40971: {'lr': 0.00041923201292724436, 'samples': 7866432, 'steps': 40970, 'loss/train': 1.8128414154052734}} 11/07/2021 02:59:00 - INFO - __main__ - Step 40979: {'lr': 0.00041920076228818293, 'samples': 7867968, 'steps': 40978, 'loss/train': 1.7132076025009155}} 11/07/2021 02:59:02 - INFO - __main__ - Step 40983: {'lr': 0.0004191851351388538, 'samples': 7868736, 'steps': 40982, 'loss/train': 1.521274209022522}5}} 11/07/2021 02:59:04 - INFO - __main__ - Step 40987: {'lr': 0.00041916950676980933, 'samples': 7869504, 'steps': 40986, 'loss/train': 1.4859052896499634}} 11/07/2021 02:59:06 - INFO - __main__ - Step 40992: {'lr': 0.00041914996959345057, 'samples': 7870464, 'steps': 40991, 'loss/train': 1.3491690158843994}} 11/07/2021 02:59:06 - INFO - __main__ - Step 40992: {'lr': 0.00041914996959345057, 'samples': 7870464, 'steps': 40991, 'loss/train': 1.3491690158843994}} 11/07/2021 02:59:06 - INFO - __main__ - Step 40992: {'lr': 0.00041914996959345057, 'samples': 7870464, 'steps': 40991, 'loss/train': 1.3491690158843994}} 11/07/2021 02:59:11 - INFO - __main__ - Step 41002: {'lr': 0.0004191108895247258, 'samples': 7872384, 'steps': 41001, 'loss/train': 1.3515865802764893}}} 11/07/2021 02:59:14 - INFO - __main__ - Step 41007: {'lr': 0.0004190913466327999, 'samples': 7873344, 'steps': 41006, 'loss/train': 1.8696014881134033}}} 11/07/2021 02:59:16 - INFO - __main__ - Step 41012: {'lr': 0.00041907180183612525, 'samples': 7874304, 'steps': 41011, 'loss/train': 1.446285605430603}}} 11/07/2021 02:59:18 - INFO - __main__ - Step 41016: {'lr': 0.0004190561646275144, 'samples': 7875072, 'steps': 41015, 'loss/train': 1.9799628257751465}}} 11/07/2021 02:59:18 - INFO - __main__ - Step 41016: {'lr': 0.0004190561646275144, 'samples': 7875072, 'steps': 41015, 'loss/train': 1.9799628257751465}}} 11/07/2021 02:59:21 - INFO - __main__ - Step 41023: {'lr': 0.00041902879657981036, 'samples': 7876416, 'steps': 41022, 'loss/train': 1.0762028694152832}} 11/07/2021 02:59:24 - INFO - __main__ - Step 41028: {'lr': 0.00041900924568941925, 'samples': 7877376, 'steps': 41027, 'loss/train': 0.515064537525177}}} 11/07/2021 02:59:26 - INFO - __main__ - Step 41033: {'lr': 0.0004189896928952041, 'samples': 7878336, 'steps': 41032, 'loss/train': 1.352460265159607}}}} 11/07/2021 02:59:26 - INFO - __main__ - Step 41033: {'lr': 0.0004189896928952041, 'samples': 7878336, 'steps': 41032, 'loss/train': 1.352460265159607}}}} 11/07/2021 02:59:29 - INFO - __main__ - Step 41040: {'lr': 0.0004189623157852981, 'samples': 7879680, 'steps': 41039, 'loss/train': 1.8899928331375122}}} 11/07/2021 02:59:31 - INFO - __main__ - Step 41044: {'lr': 0.000418946670047556, 'samples': 7880448, 'steps': 41043, 'loss/train': 1.392247200012207}2}}} 11/07/2021 02:59:34 - INFO - __main__ - Step 41049: {'lr': 0.00041892711116258454, 'samples': 7881408, 'steps': 41048, 'loss/train': 1.2349627017974854}} 11/07/2021 02:59:34 - INFO - __main__ - Step 41049: {'lr': 0.00041892711116258454, 'samples': 7881408, 'steps': 41048, 'loss/train': 1.2349627017974854}} 11/07/2021 02:59:37 - INFO - __main__ - Step 41056: {'lr': 0.00041889972552680387, 'samples': 7882752, 'steps': 41055, 'loss/train': 1.6331814527511597}} 11/07/2021 02:59:39 - INFO - __main__ - Step 41060: {'lr': 0.0004188840749177538, 'samples': 7883520, 'steps': 41059, 'loss/train': 1.6948528289794922}}} 11/07/2021 02:59:42 - INFO - __main__ - Step 41065: {'lr': 0.00041886450994428197, 'samples': 7884480, 'steps': 41064, 'loss/train': 1.3526166677474976}} 11/07/2021 02:59:44 - INFO - __main__ - Step 41070: {'lr': 0.0004188449430686166, 'samples': 7885440, 'steps': 41069, 'loss/train': 1.4126471281051636}}} 11/07/2021 02:59:46 - INFO - __main__ - Step 41074: {'lr': 0.000418829288198653, 'samples': 7886208, 'steps': 41073, 'loss/train': 1.5374236106872559}}}} 11/07/2021 02:59:46 - INFO - __main__ - Step 41074: {'lr': 0.000418829288198653, 'samples': 7886208, 'steps': 41073, 'loss/train': 1.5374236106872559}}}} 11/07/2021 02:59:49 - INFO - __main__ - Step 41081: {'lr': 0.0004188018892475176, 'samples': 7887552, 'steps': 41080, 'loss/train': 1.715844988822937}}}} 11/07/2021 02:59:51 - INFO - __main__ - Step 41085: {'lr': 0.0004187862310306633, 'samples': 7888320, 'steps': 41084, 'loss/train': 1.9033116102218628}}} 11/07/2021 02:59:54 - INFO - __main__ - Step 41090: {'lr': 0.0004187666565484279, 'samples': 7889280, 'steps': 41089, 'loss/train': 1.3745850324630737}}} 11/07/2021 02:59:54 - INFO - __main__ - Step 41090: {'lr': 0.0004187666565484279, 'samples': 7889280, 'steps': 41089, 'loss/train': 1.3745850324630737}}} 11/07/2021 02:59:58 - INFO - __main__ - Step 41098: {'lr': 0.00041873533342267336, 'samples': 7890816, 'steps': 41097, 'loss/train': 1.4721550941467285}} 11/07/2021 02:59:59 - INFO - __main__ - Step 41102: {'lr': 0.00041871967003503073, 'samples': 7891584, 'steps': 41101, 'loss/train': 1.177298903465271}}} 11/07/2021 03:00:02 - INFO - __main__ - Step 41106: {'lr': 0.0004187040054310284, 'samples': 7892352, 'steps': 41105, 'loss/train': 1.490696907043457}}}} 11/07/2021 03:00:04 - INFO - __main__ - Step 41111: {'lr': 0.0004186844229656917, 'samples': 7893312, 'steps': 41110, 'loss/train': 1.5238968133926392}}} 11/07/2021 03:00:06 - INFO - __main__ - Step 41115: {'lr': 0.00041866875562529305, 'samples': 7894080, 'steps': 41114, 'loss/train': 1.6523754596710205}} 11/07/2021 03:00:06 - INFO - __main__ - Step 41115: {'lr': 0.00041866875562529305, 'samples': 7894080, 'steps': 41114, 'loss/train': 1.6523754596710205}} 11/07/2021 03:00:09 - INFO - __main__ - Step 41122: {'lr': 0.00041864133485368106, 'samples': 7895424, 'steps': 41121, 'loss/train': 1.0865099430084229}} 11/07/2021 03:00:12 - INFO - __main__ - Step 41127: {'lr': 0.00041862174630859315, 'samples': 7896384, 'steps': 41126, 'loss/train': 1.4096827507019043}} 11/07/2021 03:00:12 - INFO - __main__ - Step 41127: {'lr': 0.00041862174630859315, 'samples': 7896384, 'steps': 41126, 'loss/train': 1.4096827507019043}} 11/07/2021 03:00:16 - INFO - __main__ - Step 41135: {'lr': 0.0004185904006856697, 'samples': 7897920, 'steps': 41134, 'loss/train': 1.6345583200454712}}} 11/07/2021 03:00:18 - INFO - __main__ - Step 41139: {'lr': 0.0004185747260510099, 'samples': 7898688, 'steps': 41138, 'loss/train': 1.5812559127807617}}} 11/07/2021 03:00:20 - INFO - __main__ - Step 41143: {'lr': 0.00041855905020103543, 'samples': 7899456, 'steps': 41142, 'loss/train': 1.4493659734725952}} 11/07/2021 03:00:22 - INFO - __main__ - Step 41147: {'lr': 0.00041854337313585913, 'samples': 7900224, 'steps': 41146, 'loss/train': 1.5544965267181396}} 11/07/2021 03:00:24 - INFO - __main__ - Step 41152: {'lr': 0.0004185237750956836, 'samples': 7901184, 'steps': 41151, 'loss/train': 1.4736276865005493}}} 11/07/2021 03:00:24 - INFO - __main__ - Step 41152: {'lr': 0.0004185237750956836, 'samples': 7901184, 'steps': 41151, 'loss/train': 1.4736276865005493}}} 11/07/2021 03:00:28 - INFO - __main__ - Step 41159: {'lr': 0.0004184963346502504, 'samples': 7902528, 'steps': 41158, 'loss/train': 1.931903600692749}}}} 11/07/2021 03:00:30 - INFO - __main__ - Step 41163: {'lr': 0.00041848065272539765, 'samples': 7903296, 'steps': 41162, 'loss/train': 1.4716529846191406}} 11/07/2021 03:00:32 - INFO - __main__ - Step 41167: {'lr': 0.0004184649695859083, 'samples': 7904064, 'steps': 41166, 'loss/train': 1.6647261381149292}}} 11/07/2021 03:00:34 - INFO - __main__ - Step 41172: {'lr': 0.00041844536395363636, 'samples': 7905024, 'steps': 41171, 'loss/train': 1.4426734447479248}} 11/07/2021 03:00:36 - INFO - __main__ - Step 41176: {'lr': 0.00041842967808162834, 'samples': 7905792, 'steps': 41175, 'loss/train': 1.224771499633789}}} 11/07/2021 03:00:39 - INFO - __main__ - Step 41180: {'lr': 0.0004184139909953513, 'samples': 7906560, 'steps': 41179, 'loss/train': 1.5802867412567139}}} 11/07/2021 03:00:40 - INFO - __main__ - Step 41184: {'lr': 0.00041839830269491823, 'samples': 7907328, 'steps': 41183, 'loss/train': 1.391185998916626}}} 11/07/2021 03:00:42 - INFO - __main__ - Step 41188: {'lr': 0.0004183826131804424, 'samples': 7908096, 'steps': 41187, 'loss/train': 1.4859697818756104}}} 11/07/2021 03:00:44 - INFO - __main__ - Step 41193: {'lr': 0.00041836299958027226, 'samples': 7909056, 'steps': 41192, 'loss/train': 1.2868040800094604}} 11/07/2021 03:00:44 - INFO - __main__ - Step 41193: {'lr': 0.00041836299958027226, 'samples': 7909056, 'steps': 41192, 'loss/train': 1.2868040800094604}} 11/07/2021 03:00:48 - INFO - __main__ - Step 41200: {'lr': 0.0004183355373538892, 'samples': 7910400, 'steps': 41199, 'loss/train': 0.5584649443626404}}} 11/07/2021 03:00:50 - INFO - __main__ - Step 41204: {'lr': 0.0004183198429843732, 'samples': 7911168, 'steps': 41203, 'loss/train': 1.5909727811813354}}} 11/07/2021 03:00:52 - INFO - __main__ - Step 41209: {'lr': 0.00041830022331603925, 'samples': 7912128, 'steps': 41208, 'loss/train': 1.482258915901184}}} 11/07/2021 03:00:52 - INFO - __main__ - Step 41209: {'lr': 0.00041830022331603925, 'samples': 7912128, 'steps': 41208, 'loss/train': 1.482258915901184}}} 11/07/2021 03:00:52 - INFO - __main__ - Step 41209: {'lr': 0.00041830022331603925, 'samples': 7912128, 'steps': 41208, 'loss/train': 1.482258915901184}}} 11/07/2021 03:00:58 - INFO - __main__ - Step 41219: {'lr': 0.0004182609782920812, 'samples': 7914048, 'steps': 41218, 'loss/train': 1.9552432298660278}}} 11/07/2021 03:01:01 - INFO - __main__ - Step 41225: {'lr': 0.0004182374276384347, 'samples': 7915200, 'steps': 41224, 'loss/train': 1.4558976888656616}}} 11/07/2021 03:01:01 - INFO - __main__ - Step 41225: {'lr': 0.0004182374276384347, 'samples': 7915200, 'steps': 41224, 'loss/train': 1.4558976888656616}}} 11/07/2021 03:01:04 - INFO - __main__ - Step 41232: {'lr': 0.00041820994842673787, 'samples': 7916544, 'steps': 41231, 'loss/train': 1.6607749462127686}} 11/07/2021 03:01:06 - INFO - __main__ - Step 41236: {'lr': 0.0004181942443525734, 'samples': 7917312, 'steps': 41235, 'loss/train': 1.290693759918213}6}} 11/07/2021 03:01:06 - INFO - __main__ - Step 41236: {'lr': 0.0004181942443525734, 'samples': 7917312, 'steps': 41235, 'loss/train': 1.290693759918213}6}} 11/07/2021 03:01:10 - INFO - __main__ - Step 41244: {'lr': 0.0004181628325666424, 'samples': 7918848, 'steps': 41243, 'loss/train': 1.7734460830688477}}} 11/07/2021 03:01:12 - INFO - __main__ - Step 41248: {'lr': 0.00041814712485510245, 'samples': 7919616, 'steps': 41247, 'loss/train': 0.9829598665237427}} 11/07/2021 03:01:15 - INFO - __main__ - Step 41253: {'lr': 0.0004181274885109895, 'samples': 7920576, 'steps': 41252, 'loss/train': 0.991035521030426}7}} 11/07/2021 03:01:17 - INFO - __main__ - Step 41257: {'lr': 0.0004181117780720868, 'samples': 7921344, 'steps': 41256, 'loss/train': 1.1258245706558228}}} 11/07/2021 03:01:19 - INFO - __main__ - Step 41261: {'lr': 0.0004180960664212069, 'samples': 7922112, 'steps': 41260, 'loss/train': 1.564186453819275}}}} 11/07/2021 03:01:20 - INFO - __main__ - Step 41265: {'lr': 0.0004180803535584632, 'samples': 7922880, 'steps': 41264, 'loss/train': 1.7812724113464355}}} 11/07/2021 03:01:22 - INFO - __main__ - Step 41269: {'lr': 0.00041806463948396876, 'samples': 7923648, 'steps': 41268, 'loss/train': 1.508406639099121}}} 11/07/2021 03:01:22 - INFO - __main__ - Step 41269: {'lr': 0.00041806463948396876, 'samples': 7923648, 'steps': 41268, 'loss/train': 1.508406639099121}}} 11/07/2021 03:01:26 - INFO - __main__ - Step 41277: {'lr': 0.0004180332077001814, 'samples': 7925184, 'steps': 41276, 'loss/train': 1.587908148765564}}}} 11/07/2021 03:01:28 - INFO - __main__ - Step 41281: {'lr': 0.00041801748999111487, 'samples': 7925952, 'steps': 41280, 'loss/train': 2.370021343231201}}} 11/07/2021 03:01:30 - INFO - __main__ - Step 41285: {'lr': 0.000418001771070751, 'samples': 7926720, 'steps': 41284, 'loss/train': 1.1700141429901123}}}} 11/07/2021 03:01:32 - INFO - __main__ - Step 41289: {'lr': 0.00041798605093920307, 'samples': 7927488, 'steps': 41288, 'loss/train': 1.71012544631958}}}} 11/07/2021 03:01:32 - INFO - __main__ - Step 41289: {'lr': 0.00041798605093920307, 'samples': 7927488, 'steps': 41288, 'loss/train': 1.71012544631958}}}} 11/07/2021 03:01:37 - INFO - __main__ - Step 41298: {'lr': 0.0004179506762154153, 'samples': 7929216, 'steps': 41297, 'loss/train': 1.2087020874023438}}} 11/07/2021 03:01:39 - INFO - __main__ - Step 41302: {'lr': 0.0004179349521483018, 'samples': 7929984, 'steps': 41301, 'loss/train': 1.1674959659576416}}} 11/07/2021 03:01:40 - INFO - __main__ - Step 41306: {'lr': 0.0004179192268704859, 'samples': 7930752, 'steps': 41305, 'loss/train': 1.4723738431930542}}} 11/07/2021 03:01:42 - INFO - __main__ - Step 41310: {'lr': 0.000417903500382081, 'samples': 7931520, 'steps': 41309, 'loss/train': 1.3963290452957153}}}} 11/07/2021 03:01:45 - INFO - __main__ - Step 41315: {'lr': 0.00041788384056935693, 'samples': 7932480, 'steps': 41314, 'loss/train': 1.4484103918075562}} 11/07/2021 03:01:45 - INFO - __main__ - Step 41315: {'lr': 0.00041788384056935693, 'samples': 7932480, 'steps': 41314, 'loss/train': 1.4484103918075562}} 11/07/2021 03:01:48 - INFO - __main__ - Step 41322: {'lr': 0.0004178563136544662, 'samples': 7933824, 'steps': 41321, 'loss/train': 1.5325267314910889}}} 11/07/2021 03:01:50 - INFO - __main__ - Step 41326: {'lr': 0.0004178405823248392, 'samples': 7934592, 'steps': 41325, 'loss/train': 1.7020225524902344}}} 11/07/2021 03:01:53 - INFO - __main__ - Step 41331: {'lr': 0.00041782091646122533, 'samples': 7935552, 'steps': 41330, 'loss/train': 1.3062204122543335}} 11/07/2021 03:01:53 - INFO - __main__ - Step 41331: {'lr': 0.00041782091646122533, 'samples': 7935552, 'steps': 41330, 'loss/train': 1.3062204122543335}} 11/07/2021 03:01:56 - INFO - __main__ - Step 41338: {'lr': 0.0004177933810762797, 'samples': 7936896, 'steps': 41337, 'loss/train': 1.3531194925308228}}} 11/07/2021 03:01:56 - INFO - __main__ - Step 41338: {'lr': 0.0004177933810762797, 'samples': 7936896, 'steps': 41337, 'loss/train': 1.3531194925308228}}} 11/07/2021 03:02:00 - INFO - __main__ - Step 41344: {'lr': 0.00041776977636913274, 'samples': 7938048, 'steps': 41343, 'loss/train': 1.6654342412948608}} 11/07/2021 03:02:00 - INFO - __main__ - Step 41344: {'lr': 0.00041776977636913274, 'samples': 7938048, 'steps': 41343, 'loss/train': 1.6654342412948608}} 11/07/2021 03:02:05 - INFO - __main__ - Step 41353: {'lr': 0.000417734364205905, 'samples': 7939776, 'steps': 41352, 'loss/train': 1.5426164865493774}8}} 11/07/2021 03:02:06 - INFO - __main__ - Step 41357: {'lr': 0.0004177186235015744, 'samples': 7940544, 'steps': 41356, 'loss/train': 1.4589163064956665}}} 11/07/2021 03:02:08 - INFO - __main__ - Step 41361: {'lr': 0.0004177028815881011, 'samples': 7941312, 'steps': 41360, 'loss/train': 1.4144057035446167}}} 11/07/2021 03:02:11 - INFO - __main__ - Step 41366: {'lr': 0.00041768320249607527, 'samples': 7942272, 'steps': 41365, 'loss/train': 0.5804887413978577}} 11/07/2021 03:02:13 - INFO - __main__ - Step 41370: {'lr': 0.00041766745786244564, 'samples': 7943040, 'steps': 41369, 'loss/train': 1.856892704963684}}} 11/07/2021 03:02:13 - INFO - __main__ - Step 41370: {'lr': 0.00041766745786244564, 'samples': 7943040, 'steps': 41369, 'loss/train': 1.856892704963684}}} 11/07/2021 03:02:16 - INFO - __main__ - Step 41378: {'lr': 0.00041763596496897817, 'samples': 7944576, 'steps': 41377, 'loss/train': 1.489722728729248}}} 11/07/2021 03:02:18 - INFO - __main__ - Step 41382: {'lr': 0.00041762021670936736, 'samples': 7945344, 'steps': 41381, 'loss/train': 1.8188769817352295}} 11/07/2021 03:02:21 - INFO - __main__ - Step 41387: {'lr': 0.00041760052968550776, 'samples': 7946304, 'steps': 41386, 'loss/train': 1.1613948345184326}} 11/07/2021 03:02:23 - INFO - __main__ - Step 41392: {'lr': 0.0004175808407736929, 'samples': 7947264, 'steps': 41391, 'loss/train': 0.9999271035194397}}} 11/07/2021 03:02:23 - INFO - __main__ - Step 41392: {'lr': 0.0004175808407736929, 'samples': 7947264, 'steps': 41391, 'loss/train': 0.9999271035194397}}} 11/07/2021 03:02:27 - INFO - __main__ - Step 41399: {'lr': 0.00041755327312580944, 'samples': 7948608, 'steps': 41398, 'loss/train': 1.602047324180603}}} 11/07/2021 03:02:28 - INFO - __main__ - Step 41403: {'lr': 0.0004175375185231904, 'samples': 7949376, 'steps': 41402, 'loss/train': 1.2870851755142212}}} 11/07/2021 03:02:30 - INFO - __main__ - Step 41407: {'lr': 0.0004175217627127344, 'samples': 7950144, 'steps': 41406, 'loss/train': 1.4150969982147217}}} 11/07/2021 03:02:32 - INFO - __main__ - Step 41411: {'lr': 0.00041750600569455474, 'samples': 7950912, 'steps': 41410, 'loss/train': 1.4357165098190308}} 11/07/2021 03:02:35 - INFO - __main__ - Step 41415: {'lr': 0.00041749024746876517, 'samples': 7951680, 'steps': 41414, 'loss/train': 1.9053404331207275}} 11/07/2021 03:02:36 - INFO - __main__ - Step 41419: {'lr': 0.00041747448803547925, 'samples': 7952448, 'steps': 41418, 'loss/train': 1.7027661800384521}} 11/07/2021 03:02:38 - INFO - __main__ - Step 41423: {'lr': 0.0004174587273948106, 'samples': 7953216, 'steps': 41422, 'loss/train': 1.418365716934204}1}} 11/07/2021 03:02:41 - INFO - __main__ - Step 41428: {'lr': 0.00041743902489626606, 'samples': 7954176, 'steps': 41427, 'loss/train': 1.5032762289047241}} 11/07/2021 03:02:43 - INFO - __main__ - Step 41432: {'lr': 0.0004174232615394018, 'samples': 7954944, 'steps': 41431, 'loss/train': 1.4779268503189087}}} 11/07/2021 03:02:45 - INFO - __main__ - Step 41436: {'lr': 0.00041740749697552406, 'samples': 7955712, 'steps': 41435, 'loss/train': 1.1720250844955444}} 11/07/2021 03:02:46 - INFO - __main__ - Step 41440: {'lr': 0.00041739173120474663, 'samples': 7956480, 'steps': 41439, 'loss/train': 1.266028642654419}}} 11/07/2021 03:02:48 - INFO - __main__ - Step 41444: {'lr': 0.00041737596422718306, 'samples': 7957248, 'steps': 41443, 'loss/train': 1.0524725914001465}} 11/07/2021 03:02:51 - INFO - __main__ - Step 41449: {'lr': 0.00041735625380835884, 'samples': 7958208, 'steps': 41448, 'loss/train': 1.2277780771255493}} 11/07/2021 03:02:53 - INFO - __main__ - Step 41453: {'lr': 0.00041734048411594214, 'samples': 7958976, 'steps': 41452, 'loss/train': 1.5704346895217896}} 11/07/2021 03:02:55 - INFO - __main__ - Step 41457: {'lr': 0.00041732471321710886, 'samples': 7959744, 'steps': 41456, 'loss/train': 1.1334835290908813}} 11/07/2021 03:02:56 - INFO - __main__ - Step 41461: {'lr': 0.00041730894111197266, 'samples': 7960512, 'steps': 41460, 'loss/train': 1.2933282852172852}} 11/07/2021 03:02:58 - INFO - __main__ - Step 41465: {'lr': 0.0004172931678006472, 'samples': 7961280, 'steps': 41464, 'loss/train': 1.5170831680297852}}} 11/07/2021 03:02:58 - INFO - __main__ - Step 41465: {'lr': 0.0004172931678006472, 'samples': 7961280, 'steps': 41464, 'loss/train': 1.5170831680297852}}} 11/07/2021 03:03:02 - INFO - __main__ - Step 41473: {'lr': 0.0004172616175598835, 'samples': 7962816, 'steps': 41472, 'loss/train': 1.6682500839233398}}} 11/07/2021 03:03:04 - INFO - __main__ - Step 41477: {'lr': 0.0004172458406306726, 'samples': 7963584, 'steps': 41476, 'loss/train': 1.0533301830291748}}} 11/07/2021 03:03:04 - INFO - __main__ - Step 41477: {'lr': 0.0004172458406306726, 'samples': 7963584, 'steps': 41476, 'loss/train': 1.0533301830291748}}} 11/07/2021 03:03:09 - INFO - __main__ - Step 41485: {'lr': 0.00041721428315516176, 'samples': 7965120, 'steps': 41484, 'loss/train': 1.4794907569885254}} 11/07/2021 03:03:09 - INFO - __main__ - Step 41485: {'lr': 0.00041721428315516176, 'samples': 7965120, 'steps': 41484, 'loss/train': 1.4794907569885254}} 11/07/2021 03:03:13 - INFO - __main__ - Step 41492: {'lr': 0.00041718666640848937, 'samples': 7966464, 'steps': 41491, 'loss/train': 1.8231583833694458}} 11/07/2021 03:03:15 - INFO - __main__ - Step 41496: {'lr': 0.00041717088375305367, 'samples': 7967232, 'steps': 41495, 'loss/train': 0.9370419979095459}} 11/07/2021 03:03:16 - INFO - __main__ - Step 41500: {'lr': 0.0004171550998924241, 'samples': 7968000, 'steps': 41499, 'loss/train': 1.9808309078216553}}} 11/07/2021 03:03:18 - INFO - __main__ - Step 41504: {'lr': 0.00041713931482671425, 'samples': 7968768, 'steps': 41503, 'loss/train': 1.7438743114471436}} 11/07/2021 03:03:21 - INFO - __main__ - Step 41509: {'lr': 0.00041711958180010644, 'samples': 7969728, 'steps': 41508, 'loss/train': 1.4612095355987549}} 11/07/2021 03:03:21 - INFO - __main__ - Step 41509: {'lr': 0.00041711958180010644, 'samples': 7969728, 'steps': 41508, 'loss/train': 1.4612095355987549}} 11/07/2021 03:03:21 - INFO - __main__ - Step 41509: {'lr': 0.00041711958180010644, 'samples': 7969728, 'steps': 41508, 'loss/train': 1.4612095355987549}} 11/07/2021 03:03:26 - INFO - __main__ - Step 41520: {'lr': 0.00041707616251535, 'samples': 7971840, 'steps': 41519, 'loss/train': 1.5904089212417603}49}} 11/07/2021 03:03:29 - INFO - __main__ - Step 41525: {'lr': 0.00041705642346540436, 'samples': 7972800, 'steps': 41524, 'loss/train': 1.8854976892471313}} 11/07/2021 03:03:31 - INFO - __main__ - Step 41530: {'lr': 0.0004170366825336326, 'samples': 7973760, 'steps': 41529, 'loss/train': 1.6385127305984497}}} 11/07/2021 03:03:31 - INFO - __main__ - Step 41530: {'lr': 0.0004170366825336326, 'samples': 7973760, 'steps': 41529, 'loss/train': 1.6385127305984497}}} 11/07/2021 03:03:35 - INFO - __main__ - Step 41537: {'lr': 0.00041700904206810755, 'samples': 7975104, 'steps': 41536, 'loss/train': 1.848340392112732}}} 11/07/2021 03:03:37 - INFO - __main__ - Step 41541: {'lr': 0.0004169932458608025, 'samples': 7975872, 'steps': 41540, 'loss/train': 1.6828755140304565}}} 11/07/2021 03:03:37 - INFO - __main__ - Step 41541: {'lr': 0.0004169932458608025, 'samples': 7975872, 'steps': 41540, 'loss/train': 1.6828755140304565}}} 11/07/2021 03:03:37 - INFO - __main__ - Step 41541: {'lr': 0.0004169932458608025, 'samples': 7975872, 'steps': 41540, 'loss/train': 1.6828755140304565}}} 11/07/2021 03:03:42 - INFO - __main__ - Step 41552: {'lr': 0.00041694980008337825, 'samples': 7977984, 'steps': 41551, 'loss/train': 1.5691694021224976}} 11/07/2021 03:03:45 - INFO - __main__ - Step 41557: {'lr': 0.0004169300489935884, 'samples': 7978944, 'steps': 41556, 'loss/train': 1.5837068557739258}}} 11/07/2021 03:03:47 - INFO - __main__ - Step 41561: {'lr': 0.00041691424676785593, 'samples': 7979712, 'steps': 41560, 'loss/train': 1.891399621963501}}} 11/07/2021 03:03:47 - INFO - __main__ - Step 41561: {'lr': 0.00041691424676785593, 'samples': 7979712, 'steps': 41560, 'loss/train': 1.891399621963501}}} 11/07/2021 03:03:50 - INFO - __main__ - Step 41568: {'lr': 0.00041688658997734675, 'samples': 7981056, 'steps': 41567, 'loss/train': 1.6336743831634521}} 11/07/2021 03:03:52 - INFO - __main__ - Step 41572: {'lr': 0.00041687078444269316, 'samples': 7981824, 'steps': 41571, 'loss/train': 1.8877520561218262}} 11/07/2021 03:03:55 - INFO - __main__ - Step 41577: {'lr': 0.000416851025832628, 'samples': 7982784, 'steps': 41576, 'loss/train': 0.6884801387786865}2}} 11/07/2021 03:03:55 - INFO - __main__ - Step 41577: {'lr': 0.000416851025832628, 'samples': 7982784, 'steps': 41576, 'loss/train': 0.6884801387786865}2}} 11/07/2021 03:03:59 - INFO - __main__ - Step 41585: {'lr': 0.0004168194081472305, 'samples': 7984320, 'steps': 41584, 'loss/train': 1.1452103853225708}}} 11/07/2021 03:04:00 - INFO - __main__ - Step 41589: {'lr': 0.0004168035975004847, 'samples': 7985088, 'steps': 41588, 'loss/train': 0.9681318402290344}}} 11/07/2021 03:04:02 - INFO - __main__ - Step 41593: {'lr': 0.0004167877856511929, 'samples': 7985856, 'steps': 41592, 'loss/train': 1.6857103109359741}}} 11/07/2021 03:04:05 - INFO - __main__ - Step 41598: {'lr': 0.00041676801914867145, 'samples': 7986816, 'steps': 41597, 'loss/train': 1.030824065208435}}} 11/07/2021 03:04:07 - INFO - __main__ - Step 41603: {'lr': 0.0004167482507675726, 'samples': 7987776, 'steps': 41602, 'loss/train': 1.5415982007980347}}} 11/07/2021 03:04:07 - INFO - __main__ - Step 41603: {'lr': 0.0004167482507675726, 'samples': 7987776, 'steps': 41602, 'loss/train': 1.5415982007980347}}} 11/07/2021 03:04:10 - INFO - __main__ - Step 41610: {'lr': 0.0004167205718784481, 'samples': 7989120, 'steps': 41609, 'loss/train': 1.6693180799484253}}} 11/07/2021 03:04:12 - INFO - __main__ - Step 41614: {'lr': 0.00041670475371766, 'samples': 7989888, 'steps': 41613, 'loss/train': 1.4963102340698242}3}}} 11/07/2021 03:04:15 - INFO - __main__ - Step 41619: {'lr': 0.00041668497932661005, 'samples': 7990848, 'steps': 41618, 'loss/train': 1.4335687160491943}} 11/07/2021 03:04:17 - INFO - __main__ - Step 41623: {'lr': 0.0004166691584618572, 'samples': 7991616, 'steps': 41622, 'loss/train': 1.4614523649215698}}} 11/07/2021 03:04:17 - INFO - __main__ - Step 41623: {'lr': 0.0004166691584618572, 'samples': 7991616, 'steps': 41622, 'loss/train': 1.4614523649215698}}} 11/07/2021 03:04:20 - INFO - __main__ - Step 41631: {'lr': 0.0004166375131277349, 'samples': 7993152, 'steps': 41630, 'loss/train': 1.4986904859542847}}} 11/07/2021 03:04:22 - INFO - __main__ - Step 41635: {'lr': 0.00041662168865859374, 'samples': 7993920, 'steps': 41634, 'loss/train': 1.5390034914016724}} 11/07/2021 03:04:25 - INFO - __main__ - Step 41640: {'lr': 0.00041660190638294456, 'samples': 7994880, 'steps': 41639, 'loss/train': 1.7146142721176147}} 11/07/2021 03:04:25 - INFO - __main__ - Step 41640: {'lr': 0.00041660190638294456, 'samples': 7994880, 'steps': 41639, 'loss/train': 1.7146142721176147}} 11/07/2021 03:04:29 - INFO - __main__ - Step 41648: {'lr': 0.00041657025083844957, 'samples': 7996416, 'steps': 41647, 'loss/train': 1.5819388628005981}} 11/07/2021 03:04:31 - INFO - __main__ - Step 41652: {'lr': 0.0004165544212648494, 'samples': 7997184, 'steps': 41651, 'loss/train': 1.6275389194488525}}} 11/07/2021 03:04:33 - INFO - __main__ - Step 41656: {'lr': 0.00041653859049049964, 'samples': 7997952, 'steps': 41655, 'loss/train': 1.5604771375656128}} 11/07/2021 03:04:35 - INFO - __main__ - Step 41660: {'lr': 0.00041652275851551435, 'samples': 7998720, 'steps': 41659, 'loss/train': 1.145478367805481}}} 11/07/2021 03:04:37 - INFO - __main__ - Step 41665: {'lr': 0.0004165029668585629, 'samples': 7999680, 'steps': 41664, 'loss/train': 1.5130399465560913}}} 11/07/2021 03:04:39 - INFO - __main__ - Step 41670: {'lr': 0.0004164831733260198, 'samples': 8000640, 'steps': 41669, 'loss/train': 1.4005441665649414}}} 11/07/2021 03:04:39 - INFO - __main__ - Step 41670: {'lr': 0.0004164831733260198, 'samples': 8000640, 'steps': 41669, 'loss/train': 1.4005441665649414}}} 11/07/2021 03:04:43 - INFO - __main__ - Step 41677: {'lr': 0.00041645545922989, 'samples': 8001984, 'steps': 41676, 'loss/train': 1.1119292974472046}4}}} 11/07/2021 03:04:45 - INFO - __main__ - Step 41681: {'lr': 0.00041643962095344107, 'samples': 8002752, 'steps': 41680, 'loss/train': 1.4981598854064941}} 11/07/2021 03:04:47 - INFO - __main__ - Step 41686: {'lr': 0.00041641982142050297, 'samples': 8003712, 'steps': 41685, 'loss/train': 1.5093369483947754}} 11/07/2021 03:04:49 - INFO - __main__ - Step 41690: {'lr': 0.0004164039804443902, 'samples': 8004480, 'steps': 41689, 'loss/train': 1.4913170337677002}}} 11/07/2021 03:04:49 - INFO - __main__ - Step 41690: {'lr': 0.0004164039804443902, 'samples': 8004480, 'steps': 41689, 'loss/train': 1.4913170337677002}}} 11/07/2021 03:04:53 - INFO - __main__ - Step 41697: {'lr': 0.0004163762558495674, 'samples': 8005824, 'steps': 41696, 'loss/train': 1.8762617111206055}}} 11/07/2021 03:04:53 - INFO - __main__ - Step 41697: {'lr': 0.0004163762558495674, 'samples': 8005824, 'steps': 41696, 'loss/train': 1.8762617111206055}}} 11/07/2021 03:04:57 - INFO - __main__ - Step 41705: {'lr': 0.0004163445661003827, 'samples': 8007360, 'steps': 41704, 'loss/train': 2.0427615642547607}}} 11/07/2021 03:04:59 - INFO - __main__ - Step 41709: {'lr': 0.00041632871942687814, 'samples': 8008128, 'steps': 41708, 'loss/train': 1.4548799991607666}} 11/07/2021 03:05:01 - INFO - __main__ - Step 41714: {'lr': 0.0004163089093987449, 'samples': 8009088, 'steps': 41713, 'loss/train': 1.3990758657455444}}} 11/07/2021 03:05:03 - INFO - __main__ - Step 41718: {'lr': 0.0004162930600273754, 'samples': 8009856, 'steps': 41717, 'loss/train': 1.3864885568618774}}} 11/07/2021 03:05:05 - INFO - __main__ - Step 41722: {'lr': 0.00041627720945714065, 'samples': 8010624, 'steps': 41721, 'loss/train': 1.3326258659362793}} 11/07/2021 03:05:07 - INFO - __main__ - Step 41726: {'lr': 0.00041626135768815467, 'samples': 8011392, 'steps': 41725, 'loss/train': 1.537654161453247}}} 11/07/2021 03:05:09 - INFO - __main__ - Step 41730: {'lr': 0.0004162455047205319, 'samples': 8012160, 'steps': 41729, 'loss/train': 0.8903040885925293}}} 11/07/2021 03:05:11 - INFO - __main__ - Step 41734: {'lr': 0.0004162296505543867, 'samples': 8012928, 'steps': 41733, 'loss/train': 1.5739892721176147}}} 11/07/2021 03:05:11 - INFO - __main__ - Step 41734: {'lr': 0.0004162296505543867, 'samples': 8012928, 'steps': 41733, 'loss/train': 1.5739892721176147}}} 11/07/2021 03:05:15 - INFO - __main__ - Step 41742: {'lr': 0.00041619793862698553, 'samples': 8014464, 'steps': 41741, 'loss/train': 1.555660367012024}}} 11/07/2021 03:05:17 - INFO - __main__ - Step 41746: {'lr': 0.00041618208086595843, 'samples': 8015232, 'steps': 41745, 'loss/train': 1.339052677154541}}} 11/07/2021 03:05:20 - INFO - __main__ - Step 41751: {'lr': 0.0004161622569799086, 'samples': 8016192, 'steps': 41750, 'loss/train': 1.49894118309021}1}}} 11/07/2021 03:05:22 - INFO - __main__ - Step 41755: {'lr': 0.00041614639652339533, 'samples': 8016960, 'steps': 41754, 'loss/train': 0.987359344959259}}} 11/07/2021 03:05:24 - INFO - __main__ - Step 41759: {'lr': 0.00041613053486907396, 'samples': 8017728, 'steps': 41758, 'loss/train': 1.616061806678772}}} 11/07/2021 03:05:25 - INFO - __main__ - Step 41763: {'lr': 0.000416114672017059, 'samples': 8018496, 'steps': 41762, 'loss/train': 1.7405064105987549}}}} 11/07/2021 03:05:27 - INFO - __main__ - Step 41767: {'lr': 0.00041609880796746463, 'samples': 8019264, 'steps': 41766, 'loss/train': 1.4200246334075928}} 11/07/2021 03:05:30 - INFO - __main__ - Step 41772: {'lr': 0.00041607897622155006, 'samples': 8020224, 'steps': 41771, 'loss/train': 1.3142369985580444}} 11/07/2021 03:05:32 - INFO - __main__ - Step 41776: {'lr': 0.00041606310947782046, 'samples': 8020992, 'steps': 41775, 'loss/train': 1.5007060766220093}} 11/07/2021 03:05:32 - INFO - __main__ - Step 41776: {'lr': 0.00041606310947782046, 'samples': 8020992, 'steps': 41775, 'loss/train': 1.5007060766220093}} 11/07/2021 03:05:35 - INFO - __main__ - Step 41782: {'lr': 0.0004160393071174975, 'samples': 8022144, 'steps': 41781, 'loss/train': 1.4717530012130737}}} 11/07/2021 03:05:38 - INFO - __main__ - Step 41788: {'lr': 0.0004160155020638436, 'samples': 8023296, 'steps': 41787, 'loss/train': 1.6691575050354004}}} 11/07/2021 03:05:38 - INFO - __main__ - Step 41788: {'lr': 0.0004160155020638436, 'samples': 8023296, 'steps': 41787, 'loss/train': 1.6691575050354004}}} 11/07/2021 03:05:41 - INFO - __main__ - Step 41795: {'lr': 0.0004159877260976914, 'samples': 8024640, 'steps': 41794, 'loss/train': 1.1664894819259644}}} 11/07/2021 03:05:43 - INFO - __main__ - Step 41800: {'lr': 0.0004159678838780874, 'samples': 8025600, 'steps': 41799, 'loss/train': 1.3497828245162964}}} 11/07/2021 03:05:46 - INFO - __main__ - Step 41805: {'lr': 0.00041594803978891925, 'samples': 8026560, 'steps': 41804, 'loss/train': 1.3734766244888306}} 11/07/2021 03:05:46 - INFO - __main__ - Step 41805: {'lr': 0.00041594803978891925, 'samples': 8026560, 'steps': 41804, 'loss/train': 1.3734766244888306}} 11/07/2021 03:05:50 - INFO - __main__ - Step 41812: {'lr': 0.0004159202549236416, 'samples': 8027904, 'steps': 41811, 'loss/train': 1.2156720161437988}}} 11/07/2021 03:05:51 - INFO - __main__ - Step 41816: {'lr': 0.000415904376212985, 'samples': 8028672, 'steps': 41815, 'loss/train': 1.4413472414016724}}}} 11/07/2021 03:05:53 - INFO - __main__ - Step 41820: {'lr': 0.00041588849630626513, 'samples': 8029440, 'steps': 41819, 'loss/train': 1.7191028594970703}} 11/07/2021 03:05:56 - INFO - __main__ - Step 41825: {'lr': 0.00041586864474107575, 'samples': 8030400, 'steps': 41824, 'loss/train': 1.1881881952285767}} 11/07/2021 03:05:58 - INFO - __main__ - Step 41829: {'lr': 0.0004158527621436322, 'samples': 8031168, 'steps': 41828, 'loss/train': 1.5067369937896729}}} 11/07/2021 03:05:58 - INFO - __main__ - Step 41829: {'lr': 0.0004158527621436322, 'samples': 8031168, 'steps': 41828, 'loss/train': 1.5067369937896729}}} 11/07/2021 03:06:01 - INFO - __main__ - Step 41836: {'lr': 0.00041582496472104314, 'samples': 8032512, 'steps': 41835, 'loss/train': 2.1492388248443604}} 11/07/2021 03:06:03 - INFO - __main__ - Step 41841: {'lr': 0.0004158051071776129, 'samples': 8033472, 'steps': 41840, 'loss/train': 1.3763138055801392}}} 11/07/2021 03:06:03 - INFO - __main__ - Step 41841: {'lr': 0.0004158051071776129, 'samples': 8033472, 'steps': 41840, 'loss/train': 1.3763138055801392}}} 11/07/2021 03:06:03 - INFO - __main__ - Step 41841: {'lr': 0.0004158051071776129, 'samples': 8033472, 'steps': 41840, 'loss/train': 1.3763138055801392}}} 11/07/2021 03:06:09 - INFO - __main__ - Step 41852: {'lr': 0.00041576141400796984, 'samples': 8035584, 'steps': 41851, 'loss/train': 1.2623769044876099}} 11/07/2021 03:06:11 - INFO - __main__ - Step 41857: {'lr': 0.0004157415504885893, 'samples': 8036544, 'steps': 41856, 'loss/train': 1.363741159439087}9}} 11/07/2021 03:06:14 - INFO - __main__ - Step 41862: {'lr': 0.0004157216851021941, 'samples': 8037504, 'steps': 41861, 'loss/train': 2.5971829891204834}}} 11/07/2021 03:06:16 - INFO - __main__ - Step 41866: {'lr': 0.0004157057914489778, 'samples': 8038272, 'steps': 41865, 'loss/train': 1.4821267127990723}}} 11/07/2021 03:06:18 - INFO - __main__ - Step 41870: {'lr': 0.0004156898966011299, 'samples': 8039040, 'steps': 41869, 'loss/train': 1.2954154014587402}}} 11/07/2021 03:06:18 - INFO - __main__ - Step 41870: {'lr': 0.0004156898966011299, 'samples': 8039040, 'steps': 41869, 'loss/train': 1.2954154014587402}}} 11/07/2021 03:06:21 - INFO - __main__ - Step 41877: {'lr': 0.00041566207774315866, 'samples': 8040384, 'steps': 41876, 'loss/train': 1.8460909128189087}} 11/07/2021 03:06:21 - INFO - __main__ - Step 41877: {'lr': 0.00041566207774315866, 'samples': 8040384, 'steps': 41876, 'loss/train': 1.8460909128189087}} 11/07/2021 03:06:25 - INFO - __main__ - Step 41886: {'lr': 0.0004156263052657148, 'samples': 8042112, 'steps': 41885, 'loss/train': 1.515254259109497}7}} 11/07/2021 03:06:28 - INFO - __main__ - Step 41890: {'lr': 0.0004156104044464282, 'samples': 8042880, 'steps': 41889, 'loss/train': 1.5599640607833862}}} 11/07/2021 03:06:29 - INFO - __main__ - Step 41894: {'lr': 0.0004155945024331976, 'samples': 8043648, 'steps': 41893, 'loss/train': 1.2952686548233032}}} 11/07/2021 03:06:31 - INFO - __main__ - Step 41898: {'lr': 0.00041557859922613795, 'samples': 8044416, 'steps': 41897, 'loss/train': 1.8280117511749268}} 11/07/2021 03:06:34 - INFO - __main__ - Step 41903: {'lr': 0.00041555871853866553, 'samples': 8045376, 'steps': 41902, 'loss/train': 1.4974288940429688}} 11/07/2021 03:06:36 - INFO - __main__ - Step 41907: {'lr': 0.0004155428126459092, 'samples': 8046144, 'steps': 41906, 'loss/train': 1.0993465185165405}}} 11/07/2021 03:06:36 - INFO - __main__ - Step 41907: {'lr': 0.0004155428126459092, 'samples': 8046144, 'steps': 41906, 'loss/train': 1.0993465185165405}}} 11/07/2021 03:06:39 - INFO - __main__ - Step 41914: {'lr': 0.0004155149744618997, 'samples': 8047488, 'steps': 41913, 'loss/train': 1.6639630794525146}}} 11/07/2021 03:06:42 - INFO - __main__ - Step 41920: {'lr': 0.0004154911102527356, 'samples': 8048640, 'steps': 41919, 'loss/train': 1.6904728412628174}}} 11/07/2021 03:06:44 - INFO - __main__ - Step 41924: {'lr': 0.0004154751992885808, 'samples': 8049408, 'steps': 41923, 'loss/train': 0.46176427602767944}} 11/07/2021 03:06:44 - INFO - __main__ - Step 41924: {'lr': 0.0004154751992885808, 'samples': 8049408, 'steps': 41923, 'loss/train': 0.46176427602767944}} 11/07/2021 03:06:47 - INFO - __main__ - Step 41931: {'lr': 0.00041544735223079693, 'samples': 8050752, 'steps': 41930, 'loss/train': 1.6106220483779907}} 11/07/2021 03:06:49 - INFO - __main__ - Step 41935: {'lr': 0.000415431437986253, 'samples': 8051520, 'steps': 41934, 'loss/train': 1.0677088499069214}7}} 11/07/2021 03:06:52 - INFO - __main__ - Step 41940: {'lr': 0.0004154115435034175, 'samples': 8052480, 'steps': 41939, 'loss/train': 1.5125808715820312}}} 11/07/2021 03:06:54 - INFO - __main__ - Step 41944: {'lr': 0.0004153956265755642, 'samples': 8053248, 'steps': 41943, 'loss/train': 1.6083602905273438}}} 11/07/2021 03:06:54 - INFO - __main__ - Step 41944: {'lr': 0.0004153956265755642, 'samples': 8053248, 'steps': 41943, 'loss/train': 1.6083602905273438}}} 11/07/2021 03:06:57 - INFO - __main__ - Step 41951: {'lr': 0.00041536776908268847, 'samples': 8054592, 'steps': 41950, 'loss/train': 1.2215300798416138}} 11/07/2021 03:06:57 - INFO - __main__ - Step 41951: {'lr': 0.00041536776908268847, 'samples': 8054592, 'steps': 41950, 'loss/train': 1.2215300798416138}} 11/07/2021 03:07:01 - INFO - __main__ - Step 41958: {'lr': 0.0004153399079387167, 'samples': 8055936, 'steps': 41957, 'loss/train': 1.5228304862976074}}} 11/07/2021 03:07:04 - INFO - __main__ - Step 41964: {'lr': 0.0004153160240526612, 'samples': 8057088, 'steps': 41963, 'loss/train': 1.6114858388900757}}} 11/07/2021 03:07:06 - INFO - __main__ - Step 41968: {'lr': 0.00041530009997215665, 'samples': 8057856, 'steps': 41967, 'loss/train': 1.4030784368515015}} 11/07/2021 03:07:08 - INFO - __main__ - Step 41972: {'lr': 0.0004152841746999454, 'samples': 8058624, 'steps': 41971, 'loss/train': 1.5982258319854736}}} 11/07/2021 03:07:09 - INFO - __main__ - Step 41976: {'lr': 0.0004152682482361422, 'samples': 8059392, 'steps': 41975, 'loss/train': 1.662643313407898}}}} 11/07/2021 03:07:09 - INFO - __main__ - Step 41976: {'lr': 0.0004152682482361422, 'samples': 8059392, 'steps': 41975, 'loss/train': 1.662643313407898}}}} 11/07/2021 03:07:09 - INFO - __main__ - Step 41976: {'lr': 0.0004152682482361422, 'samples': 8059392, 'steps': 41975, 'loss/train': 1.662643313407898}}}} 11/07/2021 03:07:15 - INFO - __main__ - Step 41986: {'lr': 0.00041522842686417255, 'samples': 8061312, 'steps': 41985, 'loss/train': 1.1234190464019775}} 11/07/2021 03:07:18 - INFO - __main__ - Step 41992: {'lr': 0.0004152045304673058, 'samples': 8062464, 'steps': 41991, 'loss/train': 1.167265772819519}5}} 11/07/2021 03:07:20 - INFO - __main__ - Step 41996: {'lr': 0.00041518859804726507, 'samples': 8063232, 'steps': 41995, 'loss/train': 1.2416760921478271}} 11/07/2021 03:07:20 - INFO - __main__ - Step 41996: {'lr': 0.00041518859804726507, 'samples': 8063232, 'steps': 41995, 'loss/train': 1.2416760921478271}} 11/07/2021 03:07:24 - INFO - __main__ - Step 42003: {'lr': 0.00041516071344665275, 'samples': 8064576, 'steps': 42002, 'loss/train': 1.8126933574676514}} 11/07/2021 03:07:25 - INFO - __main__ - Step 42007: {'lr': 0.0004151447777519054, 'samples': 8065344, 'steps': 42006, 'loss/train': 1.7512965202331543}}} 11/07/2021 03:07:27 - INFO - __main__ - Step 42011: {'lr': 0.000415128840866571, 'samples': 8066112, 'steps': 42010, 'loss/train': 5.797067642211914}3}}} 11/07/2021 03:07:30 - INFO - __main__ - Step 42016: {'lr': 0.0004151089180858151, 'samples': 8067072, 'steps': 42015, 'loss/train': 1.1027518510818481}}} 11/07/2021 03:07:32 - INFO - __main__ - Step 42020: {'lr': 0.00041509297852208003, 'samples': 8067840, 'steps': 42019, 'loss/train': 1.8538402318954468}} 11/07/2021 03:07:32 - INFO - __main__ - Step 42020: {'lr': 0.00041509297852208003, 'samples': 8067840, 'steps': 42019, 'loss/train': 1.8538402318954468}} 11/07/2021 03:07:35 - INFO - __main__ - Step 42027: {'lr': 0.0004150650814216614, 'samples': 8069184, 'steps': 42026, 'loss/train': 2.0050127506256104}}} 11/07/2021 03:07:38 - INFO - __main__ - Step 42033: {'lr': 0.00041504116672062385, 'samples': 8070336, 'steps': 42032, 'loss/train': 1.820695400238037}}} 11/07/2021 03:07:38 - INFO - __main__ - Step 42033: {'lr': 0.00041504116672062385, 'samples': 8070336, 'steps': 42032, 'loss/train': 1.820695400238037}}} 11/07/2021 03:07:41 - INFO - __main__ - Step 42040: {'lr': 0.00041501326285249963, 'samples': 8071680, 'steps': 42039, 'loss/train': 1.6893484592437744}} 11/07/2021 03:07:43 - INFO - __main__ - Step 42044: {'lr': 0.0004149973161492072, 'samples': 8072448, 'steps': 42043, 'loss/train': 1.4257150888442993}}} 11/07/2021 03:07:46 - INFO - __main__ - Step 42050: {'lr': 0.000414973393863947, 'samples': 8073600, 'steps': 42049, 'loss/train': 1.5420819520950317}}}} 11/07/2021 03:07:46 - INFO - __main__ - Step 42050: {'lr': 0.000414973393863947, 'samples': 8073600, 'steps': 42049, 'loss/train': 1.5420819520950317}}}} 11/07/2021 03:07:46 - INFO - __main__ - Step 42050: {'lr': 0.000414973393863947, 'samples': 8073600, 'steps': 42049, 'loss/train': 1.5420819520950317}}}} 11/07/2021 03:07:51 - INFO - __main__ - Step 42060: {'lr': 0.0004149335174419478, 'samples': 8075520, 'steps': 42059, 'loss/train': 1.5596524477005005}}} 11/07/2021 03:07:54 - INFO - __main__ - Step 42065: {'lr': 0.0004149135764439672, 'samples': 8076480, 'steps': 42064, 'loss/train': 1.273585557937622}}}} 11/07/2021 03:07:56 - INFO - __main__ - Step 42070: {'lr': 0.00041489363358829885, 'samples': 8077440, 'steps': 42069, 'loss/train': 1.3521915674209595}} 11/07/2021 03:07:58 - INFO - __main__ - Step 42074: {'lr': 0.0004148776779663799, 'samples': 8078208, 'steps': 42073, 'loss/train': 1.590319275856018}5}} 11/07/2021 03:07:58 - INFO - __main__ - Step 42074: {'lr': 0.0004148776779663799, 'samples': 8078208, 'steps': 42073, 'loss/train': 1.590319275856018}5}} 11/07/2021 03:08:01 - INFO - __main__ - Step 42081: {'lr': 0.00041484975276787436, 'samples': 8079552, 'steps': 42080, 'loss/train': 1.5335677862167358}} 11/07/2021 03:08:04 - INFO - __main__ - Step 42086: {'lr': 0.00041482980396911467, 'samples': 8080512, 'steps': 42085, 'loss/train': 0.7727944850921631}} 11/07/2021 03:08:06 - INFO - __main__ - Step 42090: {'lr': 0.0004148138435932404, 'samples': 8081280, 'steps': 42089, 'loss/train': 1.4468871355056763}}} 11/07/2021 03:08:08 - INFO - __main__ - Step 42094: {'lr': 0.00041479788202916483, 'samples': 8082048, 'steps': 42093, 'loss/train': 1.8482271432876587}} 11/07/2021 03:08:10 - INFO - __main__ - Step 42098: {'lr': 0.0004147819192770033, 'samples': 8082816, 'steps': 42097, 'loss/train': 1.5658690929412842}}} 11/07/2021 03:08:12 - INFO - __main__ - Step 42102: {'lr': 0.0004147659553368706, 'samples': 8083584, 'steps': 42101, 'loss/train': 2.541309118270874}}}} 11/07/2021 03:08:12 - INFO - __main__ - Step 42102: {'lr': 0.0004147659553368706, 'samples': 8083584, 'steps': 42101, 'loss/train': 2.541309118270874}}}} 11/07/2021 03:08:16 - INFO - __main__ - Step 42110: {'lr': 0.0004147340238931525, 'samples': 8085120, 'steps': 42109, 'loss/train': 1.515739917755127}}}} 11/07/2021 03:08:18 - INFO - __main__ - Step 42114: {'lr': 0.0004147180563897972, 'samples': 8085888, 'steps': 42113, 'loss/train': 2.0268678665161133}}} 11/07/2021 03:08:19 - INFO - __main__ - Step 42118: {'lr': 0.00041470208769893137, 'samples': 8086656, 'steps': 42117, 'loss/train': 1.7440546751022339}} 11/07/2021 03:08:21 - INFO - __main__ - Step 42122: {'lr': 0.00041468611782067, 'samples': 8087424, 'steps': 42121, 'loss/train': 1.2551136016845703}39}} 11/07/2021 03:08:24 - INFO - __main__ - Step 42127: {'lr': 0.0004146661538032438, 'samples': 8088384, 'steps': 42126, 'loss/train': 1.5557634830474854}}} 11/07/2021 03:08:26 - INFO - __main__ - Step 42131: {'lr': 0.00041465018125376354, 'samples': 8089152, 'steps': 42130, 'loss/train': 1.5250442028045654}} 11/07/2021 03:08:26 - INFO - __main__ - Step 42131: {'lr': 0.00041465018125376354, 'samples': 8089152, 'steps': 42130, 'loss/train': 1.5250442028045654}} 11/07/2021 03:08:29 - INFO - __main__ - Step 42138: {'lr': 0.00041462222643597236, 'samples': 8090496, 'steps': 42137, 'loss/train': 1.4060335159301758}} 11/07/2021 03:08:31 - INFO - __main__ - Step 42143: {'lr': 0.0004146022564836556, 'samples': 8091456, 'steps': 42142, 'loss/train': 1.6729769706726074}}} 11/07/2021 03:08:34 - INFO - __main__ - Step 42148: {'lr': 0.00041458228467715786, 'samples': 8092416, 'steps': 42147, 'loss/train': 1.452275037765503}}} 11/07/2021 03:08:34 - INFO - __main__ - Step 42148: {'lr': 0.00041458228467715786, 'samples': 8092416, 'steps': 42147, 'loss/train': 1.452275037765503}}} 11/07/2021 03:08:37 - INFO - __main__ - Step 42155: {'lr': 0.0004145543210334656, 'samples': 8093760, 'steps': 42154, 'loss/train': 1.334218144416809}}}} 11/07/2021 03:08:39 - INFO - __main__ - Step 42159: {'lr': 0.0004145383401772549, 'samples': 8094528, 'steps': 42158, 'loss/train': 1.6665518283843994}}} 11/07/2021 03:08:42 - INFO - __main__ - Step 42164: {'lr': 0.00041451836243889027, 'samples': 8095488, 'steps': 42163, 'loss/train': 1.2692837715148926}} 11/07/2021 03:08:44 - INFO - __main__ - Step 42169: {'lr': 0.00041449838284728964, 'samples': 8096448, 'steps': 42168, 'loss/train': 1.386372685432434}}} 11/07/2021 03:08:46 - INFO - __main__ - Step 42173: {'lr': 0.0004144823978398306, 'samples': 8097216, 'steps': 42172, 'loss/train': 1.8618935346603394}}} 11/07/2021 03:08:46 - INFO - __main__ - Step 42173: {'lr': 0.0004144823978398306, 'samples': 8097216, 'steps': 42172, 'loss/train': 1.8618935346603394}}} 11/07/2021 03:08:49 - INFO - __main__ - Step 42180: {'lr': 0.00041445442122348727, 'samples': 8098560, 'steps': 42179, 'loss/train': 0.6647708415985107}} 11/07/2021 03:08:52 - INFO - __main__ - Step 42186: {'lr': 0.0004144304383766737, 'samples': 8099712, 'steps': 42185, 'loss/train': 1.4202641248703003}}} 11/07/2021 03:08:52 - INFO - __main__ - Step 42186: {'lr': 0.0004144304383766737, 'samples': 8099712, 'steps': 42185, 'loss/train': 1.4202641248703003}}} 11/07/2021 03:08:56 - INFO - __main__ - Step 42193: {'lr': 0.0004144024550176653, 'samples': 8101056, 'steps': 42192, 'loss/train': 1.3551652431488037}}} 11/07/2021 03:08:58 - INFO - __main__ - Step 42197: {'lr': 0.000414386462897065, 'samples': 8101824, 'steps': 42196, 'loss/train': 1.359559416770935}7}}} 11/07/2021 03:09:00 - INFO - __main__ - Step 42201: {'lr': 0.0004143704695913447, 'samples': 8102592, 'steps': 42200, 'loss/train': 1.3718774318695068}}} 11/07/2021 03:09:02 - INFO - __main__ - Step 42205: {'lr': 0.0004143544751006197, 'samples': 8103360, 'steps': 42204, 'loss/train': 1.5881863832473755}}} 11/07/2021 03:09:04 - INFO - __main__ - Step 42209: {'lr': 0.00041433847942500516, 'samples': 8104128, 'steps': 42208, 'loss/train': 1.2702810764312744}} 11/07/2021 03:09:06 - INFO - __main__ - Step 42213: {'lr': 0.0004143224825646166, 'samples': 8104896, 'steps': 42212, 'loss/train': 1.3781150579452515}}} 11/07/2021 03:09:07 - INFO - __main__ - Step 42217: {'lr': 0.00041430648451956913, 'samples': 8105664, 'steps': 42216, 'loss/train': 1.4321280717849731}} 11/07/2021 03:09:10 - INFO - __main__ - Step 42222: {'lr': 0.0004142864852975092, 'samples': 8106624, 'steps': 42221, 'loss/train': 1.4351541996002197}}} 11/07/2021 03:09:10 - INFO - __main__ - Step 42222: {'lr': 0.0004142864852975092, 'samples': 8106624, 'steps': 42221, 'loss/train': 1.4351541996002197}}} 11/07/2021 03:09:14 - INFO - __main__ - Step 42230: {'lr': 0.00041425448269300923, 'samples': 8108160, 'steps': 42229, 'loss/train': 1.63296377658844}7}} 11/07/2021 03:09:16 - INFO - __main__ - Step 42234: {'lr': 0.00041423847961444873, 'samples': 8108928, 'steps': 42233, 'loss/train': 1.8132482767105103}} 11/07/2021 03:09:17 - INFO - __main__ - Step 42238: {'lr': 0.0004142224753518351, 'samples': 8109696, 'steps': 42237, 'loss/train': 1.3605972528457642}}} 11/07/2021 03:09:20 - INFO - __main__ - Step 42242: {'lr': 0.00041420646990528355, 'samples': 8110464, 'steps': 42241, 'loss/train': 1.4256129264831543}} 11/07/2021 03:09:22 - INFO - __main__ - Step 42247: {'lr': 0.00041418646143235737, 'samples': 8111424, 'steps': 42246, 'loss/train': 2.096343517303467}}} 11/07/2021 03:09:24 - INFO - __main__ - Step 42251: {'lr': 0.00041417045332236776, 'samples': 8112192, 'steps': 42250, 'loss/train': 1.1745171546936035}} 11/07/2021 03:09:26 - INFO - __main__ - Step 42255: {'lr': 0.0004141544440288153, 'samples': 8112960, 'steps': 42254, 'loss/train': 2.242783546447754}5}} 11/07/2021 03:09:28 - INFO - __main__ - Step 42259: {'lr': 0.0004141384335518155, 'samples': 8113728, 'steps': 42258, 'loss/train': 1.6466920375823975}}} 11/07/2021 03:09:30 - INFO - __main__ - Step 42263: {'lr': 0.00041412242189148383, 'samples': 8114496, 'steps': 42262, 'loss/train': 1.5122243165969849}} 11/07/2021 03:09:32 - INFO - __main__ - Step 42268: {'lr': 0.0004141024056521845, 'samples': 8115456, 'steps': 42267, 'loss/train': 0.7243680357933044}}} 11/07/2021 03:09:32 - INFO - __main__ - Step 42268: {'lr': 0.0004141024056521845, 'samples': 8115456, 'steps': 42267, 'loss/train': 0.7243680357933044}}} 11/07/2021 03:09:36 - INFO - __main__ - Step 42274: {'lr': 0.00041407838372495883, 'samples': 8116608, 'steps': 42273, 'loss/train': 1.0360863208770752}} 11/07/2021 03:09:38 - INFO - __main__ - Step 42279: {'lr': 0.0004140583634191465, 'samples': 8117568, 'steps': 42278, 'loss/train': 1.5542657375335693}}} 11/07/2021 03:09:40 - INFO - __main__ - Step 42284: {'lr': 0.00041403834126528007, 'samples': 8118528, 'steps': 42283, 'loss/train': 1.6267235279083252}} 11/07/2021 03:09:40 - INFO - __main__ - Step 42284: {'lr': 0.00041403834126528007, 'samples': 8118528, 'steps': 42283, 'loss/train': 1.6267235279083252}} 11/07/2021 03:09:44 - INFO - __main__ - Step 42291: {'lr': 0.0004140103071455654, 'samples': 8119872, 'steps': 42290, 'loss/train': 1.7726240158081055}}} 11/07/2021 03:09:46 - INFO - __main__ - Step 42296: {'lr': 0.00041399028055728914, 'samples': 8120832, 'steps': 42295, 'loss/train': 1.5786479711532593}} 11/07/2021 03:09:46 - INFO - __main__ - Step 42296: {'lr': 0.00041399028055728914, 'samples': 8120832, 'steps': 42295, 'loss/train': 1.5786479711532593}} 11/07/2021 03:09:49 - INFO - __main__ - Step 42303: {'lr': 0.00041396224023031045, 'samples': 8122176, 'steps': 42302, 'loss/train': 1.121030569076538}}} 11/07/2021 03:09:51 - INFO - __main__ - Step 42307: {'lr': 0.00041394621556094805, 'samples': 8122944, 'steps': 42306, 'loss/train': 1.5338916778564453}} 11/07/2021 03:09:54 - INFO - __main__ - Step 42312: {'lr': 0.00041392618306214683, 'samples': 8123904, 'steps': 42311, 'loss/train': 1.2932323217391968}} 11/07/2021 03:09:56 - INFO - __main__ - Step 42316: {'lr': 0.00041391015573356805, 'samples': 8124672, 'steps': 42315, 'loss/train': 1.5465834140777588}} 11/07/2021 03:09:58 - INFO - __main__ - Step 42320: {'lr': 0.0004138941272233031, 'samples': 8125440, 'steps': 42319, 'loss/train': 1.346096396446228}8}} 11/07/2021 03:10:00 - INFO - __main__ - Step 42324: {'lr': 0.00041387809753146756, 'samples': 8126208, 'steps': 42323, 'loss/train': 1.7210004329681396}} 11/07/2021 03:10:02 - INFO - __main__ - Step 42329: {'lr': 0.0004138580587552654, 'samples': 8127168, 'steps': 42328, 'loss/train': 1.4846197366714478}}} 11/07/2021 03:10:04 - INFO - __main__ - Step 42333: {'lr': 0.0004138420264053184, 'samples': 8127936, 'steps': 42332, 'loss/train': 1.7020927667617798}}} 11/07/2021 03:10:06 - INFO - __main__ - Step 42337: {'lr': 0.0004138259928741764, 'samples': 8128704, 'steps': 42336, 'loss/train': 1.46265709400177}8}}} 11/07/2021 03:10:08 - INFO - __main__ - Step 42341: {'lr': 0.000413809958161955, 'samples': 8129472, 'steps': 42340, 'loss/train': 1.5473113059997559}}}} 11/07/2021 03:10:10 - INFO - __main__ - Step 42345: {'lr': 0.00041379392226876974, 'samples': 8130240, 'steps': 42344, 'loss/train': 1.669350028038025}}} 11/07/2021 03:10:12 - INFO - __main__ - Step 42350: {'lr': 0.0004137738757417339, 'samples': 8131200, 'steps': 42349, 'loss/train': 1.6222318410873413}}} 11/07/2021 03:10:12 - INFO - __main__ - Step 42350: {'lr': 0.0004137738757417339, 'samples': 8131200, 'steps': 42349, 'loss/train': 1.6222318410873413}}} 11/07/2021 03:10:16 - INFO - __main__ - Step 42357: {'lr': 0.0004137458075045871, 'samples': 8132544, 'steps': 42356, 'loss/train': 0.9286091923713684}}} 11/07/2021 03:10:16 - INFO - __main__ - Step 42357: {'lr': 0.0004137458075045871, 'samples': 8132544, 'steps': 42356, 'loss/train': 0.9286091923713684}}} 11/07/2021 03:10:20 - INFO - __main__ - Step 42364: {'lr': 0.00041371773565215494, 'samples': 8133888, 'steps': 42363, 'loss/train': 1.8453580141067505}} 11/07/2021 03:10:22 - INFO - __main__ - Step 42368: {'lr': 0.00041370169297067145, 'samples': 8134656, 'steps': 42367, 'loss/train': 2.0079658031463623}} 11/07/2021 03:10:24 - INFO - __main__ - Step 42374: {'lr': 0.0004136776267356387, 'samples': 8135808, 'steps': 42373, 'loss/train': 1.7191283702850342}}} 11/07/2021 03:10:27 - INFO - __main__ - Step 42378: {'lr': 0.00041366158110391375, 'samples': 8136576, 'steps': 42377, 'loss/train': 1.5728462934494019}} 11/07/2021 03:10:27 - INFO - __main__ - Step 42378: {'lr': 0.00041366158110391375, 'samples': 8136576, 'steps': 42377, 'loss/train': 1.5728462934494019}} 11/07/2021 03:10:30 - INFO - __main__ - Step 42385: {'lr': 0.0004136334984093446, 'samples': 8137920, 'steps': 42384, 'loss/train': 0.9145182967185974}}} 11/07/2021 03:10:32 - INFO - __main__ - Step 42389: {'lr': 0.00041361744953318923, 'samples': 8138688, 'steps': 42388, 'loss/train': 1.2505319118499756}} 11/07/2021 03:10:35 - INFO - __main__ - Step 42395: {'lr': 0.00041359337400728746, 'samples': 8139840, 'steps': 42394, 'loss/train': 0.7346595525741577}} 11/07/2021 03:10:35 - INFO - __main__ - Step 42395: {'lr': 0.00041359337400728746, 'samples': 8139840, 'steps': 42394, 'loss/train': 0.7346595525741577}} 11/07/2021 03:10:38 - INFO - __main__ - Step 42402: {'lr': 0.00041356528253983714, 'samples': 8141184, 'steps': 42401, 'loss/train': 1.5337250232696533}} 11/07/2021 03:10:40 - INFO - __main__ - Step 42406: {'lr': 0.00041354922865128316, 'samples': 8141952, 'steps': 42405, 'loss/train': 1.2541090250015259}} 11/07/2021 03:10:42 - INFO - __main__ - Step 42410: {'lr': 0.00041353317358364496, 'samples': 8142720, 'steps': 42409, 'loss/train': 1.0971400737762451}} 11/07/2021 03:10:44 - INFO - __main__ - Step 42415: {'lr': 0.00041351310309118653, 'samples': 8143680, 'steps': 42414, 'loss/train': 0.7985998392105103}} 11/07/2021 03:10:47 - INFO - __main__ - Step 42420: {'lr': 0.000413493030756816, 'samples': 8144640, 'steps': 42419, 'loss/train': 1.7966359853744507}3}} 11/07/2021 03:10:47 - INFO - __main__ - Step 42420: {'lr': 0.000413493030756816, 'samples': 8144640, 'steps': 42419, 'loss/train': 1.7966359853744507}3}} 11/07/2021 03:10:50 - INFO - __main__ - Step 42427: {'lr': 0.00041346492639471555, 'samples': 8145984, 'steps': 42426, 'loss/train': 1.1028519868850708}} 11/07/2021 03:10:52 - INFO - __main__ - Step 42431: {'lr': 0.00041344886513878485, 'samples': 8146752, 'steps': 42430, 'loss/train': 1.7651448249816895}} 11/07/2021 03:10:55 - INFO - __main__ - Step 42436: {'lr': 0.0004134287869118154, 'samples': 8147712, 'steps': 42435, 'loss/train': 1.740281343460083}5}} 11/07/2021 03:10:57 - INFO - __main__ - Step 42440: {'lr': 0.0004134127230047362, 'samples': 8148480, 'steps': 42439, 'loss/train': 1.5026410818099976}}} 11/07/2021 03:10:59 - INFO - __main__ - Step 42444: {'lr': 0.00041339665791955695, 'samples': 8149248, 'steps': 42443, 'loss/train': 1.6956210136413574}} 11/07/2021 03:11:00 - INFO - __main__ - Step 42448: {'lr': 0.0004133805916563935, 'samples': 8150016, 'steps': 42447, 'loss/train': 1.2641940116882324}}} 11/07/2021 03:11:02 - INFO - __main__ - Step 42452: {'lr': 0.0004133645242153617, 'samples': 8150784, 'steps': 42451, 'loss/train': 1.6520686149597168}}} 11/07/2021 03:11:05 - INFO - __main__ - Step 42457: {'lr': 0.00041334443825787097, 'samples': 8151744, 'steps': 42456, 'loss/train': 1.0859808921813965}} 11/07/2021 03:11:07 - INFO - __main__ - Step 42461: {'lr': 0.0004133283681670589, 'samples': 8152512, 'steps': 42460, 'loss/train': 1.2702975273132324}}} 11/07/2021 03:11:09 - INFO - __main__ - Step 42465: {'lr': 0.00041331229689875487, 'samples': 8153280, 'steps': 42464, 'loss/train': 1.96661376953125}}}} 11/07/2021 03:11:10 - INFO - __main__ - Step 42469: {'lr': 0.0004132962244530749, 'samples': 8154048, 'steps': 42468, 'loss/train': 1.902879238128662}}}} 11/07/2021 03:11:12 - INFO - __main__ - Step 42473: {'lr': 0.0004132801508301347, 'samples': 8154816, 'steps': 42472, 'loss/train': 1.3885867595672607}}} 11/07/2021 03:11:15 - INFO - __main__ - Step 42478: {'lr': 0.000413260057146114, 'samples': 8155776, 'steps': 42477, 'loss/train': 1.9076615571975708}}}} 11/07/2021 03:11:17 - INFO - __main__ - Step 42482: {'lr': 0.0004132439808747622, 'samples': 8156544, 'steps': 42481, 'loss/train': 1.372361421585083}}}} 11/07/2021 03:11:19 - INFO - __main__ - Step 42486: {'lr': 0.00041322790342652695, 'samples': 8157312, 'steps': 42485, 'loss/train': 0.6607436537742615}} 11/07/2021 03:11:19 - INFO - __main__ - Step 42486: {'lr': 0.00041322790342652695, 'samples': 8157312, 'steps': 42485, 'loss/train': 0.6607436537742615}} 11/07/2021 03:11:22 - INFO - __main__ - Step 42493: {'lr': 0.00041319976506058785, 'samples': 8158656, 'steps': 42492, 'loss/train': 0.9843049049377441}} 11/07/2021 03:11:24 - INFO - __main__ - Step 42498: {'lr': 0.00041317966402167923, 'samples': 8159616, 'steps': 42497, 'loss/train': 1.2073416709899902}} 11/07/2021 03:11:27 - INFO - __main__ - Step 42503: {'lr': 0.0004131595611446146, 'samples': 8160576, 'steps': 42502, 'loss/train': 1.4976180791854858}}} 11/07/2021 03:11:29 - INFO - __main__ - Step 42507: {'lr': 0.0004131434775196428, 'samples': 8161344, 'steps': 42506, 'loss/train': 1.2052451372146606}}} 11/07/2021 03:11:31 - INFO - __main__ - Step 42511: {'lr': 0.00041312739271851196, 'samples': 8162112, 'steps': 42510, 'loss/train': 1.5313423871994019}} 11/07/2021 03:11:32 - INFO - __main__ - Step 42515: {'lr': 0.00041311130674133824, 'samples': 8162880, 'steps': 42514, 'loss/train': 1.7059967517852783}} 11/07/2021 03:11:35 - INFO - __main__ - Step 42519: {'lr': 0.0004130952195882375, 'samples': 8163648, 'steps': 42518, 'loss/train': 1.2123099565505981}}} 11/07/2021 03:11:35 - INFO - __main__ - Step 42519: {'lr': 0.0004130952195882375, 'samples': 8163648, 'steps': 42518, 'loss/train': 1.2123099565505981}}} 11/07/2021 03:11:39 - INFO - __main__ - Step 42527: {'lr': 0.0004130630417547189, 'samples': 8165184, 'steps': 42526, 'loss/train': 1.6828103065490723}}} 11/07/2021 03:11:40 - INFO - __main__ - Step 42531: {'lr': 0.00041304695107453307, 'samples': 8165952, 'steps': 42530, 'loss/train': 1.6047009229660034}} 11/07/2021 03:11:42 - INFO - __main__ - Step 42535: {'lr': 0.0004130308592188842, 'samples': 8166720, 'steps': 42534, 'loss/train': 1.5886842012405396}}} 11/07/2021 03:11:45 - INFO - __main__ - Step 42540: {'lr': 0.0004130107427465049, 'samples': 8167680, 'steps': 42539, 'loss/train': 1.7081719636917114}}} 11/07/2021 03:11:47 - INFO - __main__ - Step 42544: {'lr': 0.0004129946482464883, 'samples': 8168448, 'steps': 42543, 'loss/train': 0.9306034445762634}}} 11/07/2021 03:11:49 - INFO - __main__ - Step 42548: {'lr': 0.00041297855257138577, 'samples': 8169216, 'steps': 42547, 'loss/train': 1.384551763534546}}} 11/07/2021 03:11:51 - INFO - __main__ - Step 42552: {'lr': 0.0004129624557213133, 'samples': 8169984, 'steps': 42551, 'loss/train': 1.28696870803833}6}}} 11/07/2021 03:11:53 - INFO - __main__ - Step 42556: {'lr': 0.0004129463576963869, 'samples': 8170752, 'steps': 42555, 'loss/train': 1.503050446510315}}}} 11/07/2021 03:11:55 - INFO - __main__ - Step 42560: {'lr': 0.0004129302584967227, 'samples': 8171520, 'steps': 42559, 'loss/train': 1.6396253108978271}}} 11/07/2021 03:11:57 - INFO - __main__ - Step 42565: {'lr': 0.0004129101328453442, 'samples': 8172480, 'steps': 42564, 'loss/train': 1.6780205965042114}}} 11/07/2021 03:11:59 - INFO - __main__ - Step 42569: {'lr': 0.0004128940310029443, 'samples': 8173248, 'steps': 42568, 'loss/train': 1.1596369743347168}}} 11/07/2021 03:11:59 - INFO - __main__ - Step 42569: {'lr': 0.0004128940310029443, 'samples': 8173248, 'steps': 42568, 'loss/train': 1.1596369743347168}}} 11/07/2021 03:12:02 - INFO - __main__ - Step 42576: {'lr': 0.0004128658499530091, 'samples': 8174592, 'steps': 42575, 'loss/train': 1.3641626834869385}}} 11/07/2021 03:12:05 - INFO - __main__ - Step 42581: {'lr': 0.0004128457184300454, 'samples': 8175552, 'steps': 42580, 'loss/train': 0.8959876894950867}}} 11/07/2021 03:12:05 - INFO - __main__ - Step 42581: {'lr': 0.0004128457184300454, 'samples': 8175552, 'steps': 42580, 'loss/train': 0.8959876894950867}}} 11/07/2021 03:12:09 - INFO - __main__ - Step 42589: {'lr': 0.00041281350417785777, 'samples': 8177088, 'steps': 42588, 'loss/train': 1.5267343521118164}} 11/07/2021 03:12:09 - INFO - __main__ - Step 42589: {'lr': 0.00041281350417785777, 'samples': 8177088, 'steps': 42588, 'loss/train': 1.5267343521118164}} 11/07/2021 03:12:12 - INFO - __main__ - Step 42596: {'lr': 0.0004127853128556962, 'samples': 8178432, 'steps': 42595, 'loss/train': 1.4697948694229126}}} 11/07/2021 03:12:12 - INFO - __main__ - Step 42596: {'lr': 0.0004127853128556962, 'samples': 8178432, 'steps': 42595, 'loss/train': 1.4697948694229126}}} 11/07/2021 03:12:16 - INFO - __main__ - Step 42603: {'lr': 0.0004127571179394557, 'samples': 8179776, 'steps': 42602, 'loss/train': 1.3474419116973877}}} 11/07/2021 03:12:18 - INFO - __main__ - Step 42608: {'lr': 0.00041273697651345785, 'samples': 8180736, 'steps': 42607, 'loss/train': 1.7024065256118774}} 11/07/2021 03:12:21 - INFO - __main__ - Step 42613: {'lr': 0.00041271683325429075, 'samples': 8181696, 'steps': 42612, 'loss/train': 0.977882981300354}}} 11/07/2021 03:12:23 - INFO - __main__ - Step 42617: {'lr': 0.0004127007173272278, 'samples': 8182464, 'steps': 42616, 'loss/train': 1.5213189125061035}}} 11/07/2021 03:12:23 - INFO - __main__ - Step 42617: {'lr': 0.0004127007173272278, 'samples': 8182464, 'steps': 42616, 'loss/train': 1.5213189125061035}}} 11/07/2021 03:12:26 - INFO - __main__ - Step 42624: {'lr': 0.0004126725116324858, 'samples': 8183808, 'steps': 42623, 'loss/train': 1.348909854888916}}}} 11/07/2021 03:12:29 - INFO - __main__ - Step 42629: {'lr': 0.000412652362508702, 'samples': 8184768, 'steps': 42628, 'loss/train': 0.7027224898338318}}}} 11/07/2021 03:12:31 - INFO - __main__ - Step 42634: {'lr': 0.0004126322115527021, 'samples': 8185728, 'steps': 42633, 'loss/train': 1.517307996749878}}}} 11/07/2021 03:12:33 - INFO - __main__ - Step 42638: {'lr': 0.0004126160894688591, 'samples': 8186496, 'steps': 42637, 'loss/train': 1.1772278547286987}}} 11/07/2021 03:12:33 - INFO - __main__ - Step 42638: {'lr': 0.0004126160894688591, 'samples': 8186496, 'steps': 42637, 'loss/train': 1.1772278547286987}}} 11/07/2021 03:12:36 - INFO - __main__ - Step 42645: {'lr': 0.00041258787300122026, 'samples': 8187840, 'steps': 42644, 'loss/train': 1.4660903215408325}} 11/07/2021 03:12:39 - INFO - __main__ - Step 42650: {'lr': 0.0004125677161836543, 'samples': 8188800, 'steps': 42649, 'loss/train': 1.4704724550247192}}} 11/07/2021 03:12:39 - INFO - __main__ - Step 42650: {'lr': 0.0004125677161836543, 'samples': 8188800, 'steps': 42649, 'loss/train': 1.4704724550247192}}} 11/07/2021 03:12:43 - INFO - __main__ - Step 42658: {'lr': 0.00041253546146661704, 'samples': 8190336, 'steps': 42657, 'loss/train': 1.883015513420105}}} 11/07/2021 03:12:45 - INFO - __main__ - Step 42662: {'lr': 0.00041251933235037695, 'samples': 8191104, 'steps': 42661, 'loss/train': 1.5997662544250488}} 11/07/2021 03:12:46 - INFO - __main__ - Step 42666: {'lr': 0.0004125032020624776, 'samples': 8191872, 'steps': 42665, 'loss/train': 1.5029171705245972}}} 11/07/2021 03:12:49 - INFO - __main__ - Step 42671: {'lr': 0.00041248303755513484, 'samples': 8192832, 'steps': 42670, 'loss/train': 1.66847562789917}}}} 11/07/2021 03:12:51 - INFO - __main__ - Step 42675: {'lr': 0.00041246690463142733, 'samples': 8193600, 'steps': 42674, 'loss/train': 0.7003434896469116}} 11/07/2021 03:12:53 - INFO - __main__ - Step 42679: {'lr': 0.00041245077053643866, 'samples': 8194368, 'steps': 42678, 'loss/train': 1.107468605041504}}} 11/07/2021 03:12:53 - INFO - __main__ - Step 42679: {'lr': 0.00041245077053643866, 'samples': 8194368, 'steps': 42678, 'loss/train': 1.107468605041504}}} 11/07/2021 03:12:56 - INFO - __main__ - Step 42686: {'lr': 0.0004124225330521626, 'samples': 8195712, 'steps': 42685, 'loss/train': 1.5804864168167114}}} 11/07/2021 03:12:58 - INFO - __main__ - Step 42691: {'lr': 0.0004124023612249479, 'samples': 8196672, 'steps': 42690, 'loss/train': 1.4882365465164185}}} 11/07/2021 03:13:01 - INFO - __main__ - Step 42696: {'lr': 0.0004123821875683333, 'samples': 8197632, 'steps': 42695, 'loss/train': 1.4438482522964478}}} 11/07/2021 03:13:03 - INFO - __main__ - Step 42700: {'lr': 0.0004123660473260263, 'samples': 8198400, 'steps': 42699, 'loss/train': 0.6675590872764587}}} 11/07/2021 03:13:03 - INFO - __main__ - Step 42700: {'lr': 0.0004123660473260263, 'samples': 8198400, 'steps': 42699, 'loss/train': 0.6675590872764587}}} 11/07/2021 03:13:06 - INFO - __main__ - Step 42707: {'lr': 0.00041233779908541316, 'samples': 8199744, 'steps': 42706, 'loss/train': 1.5503088235855103}} 11/07/2021 03:13:09 - INFO - __main__ - Step 42712: {'lr': 0.00041231761957624593, 'samples': 8200704, 'steps': 42711, 'loss/train': 1.6528187990188599}} 11/07/2021 03:13:09 - INFO - __main__ - Step 42712: {'lr': 0.00041231761957624593, 'samples': 8200704, 'steps': 42711, 'loss/train': 1.6528187990188599}} 11/07/2021 03:13:09 - INFO - __main__ - Step 42712: {'lr': 0.00041231761957624593, 'samples': 8200704, 'steps': 42711, 'loss/train': 1.6528187990188599}} 11/07/2021 03:13:14 - INFO - __main__ - Step 42723: {'lr': 0.0004122732182202703, 'samples': 8202816, 'steps': 42722, 'loss/train': 1.799514651298523}9}} 11/07/2021 03:13:17 - INFO - __main__ - Step 42728: {'lr': 0.0004122530328608781, 'samples': 8203776, 'steps': 42727, 'loss/train': 1.6887344121932983}}} 11/07/2021 03:13:19 - INFO - __main__ - Step 42732: {'lr': 0.0004122368832573967, 'samples': 8204544, 'steps': 42731, 'loss/train': 1.4255421161651611}}} 11/07/2021 03:13:19 - INFO - __main__ - Step 42732: {'lr': 0.0004122368832573967, 'samples': 8204544, 'steps': 42731, 'loss/train': 1.4255421161651611}}} 11/07/2021 03:13:22 - INFO - __main__ - Step 42739: {'lr': 0.00041220861863696886, 'samples': 8205888, 'steps': 42738, 'loss/train': 1.396811604499817}}} 11/07/2021 03:13:24 - INFO - __main__ - Step 42743: {'lr': 0.00041219246581730435, 'samples': 8206656, 'steps': 42742, 'loss/train': 1.721859097480774}}} 11/07/2021 03:13:27 - INFO - __main__ - Step 42748: {'lr': 0.00041217227314840535, 'samples': 8207616, 'steps': 42747, 'loss/train': 2.701878547668457}}} 11/07/2021 03:13:29 - INFO - __main__ - Step 42752: {'lr': 0.00041215611769797344, 'samples': 8208384, 'steps': 42751, 'loss/train': 1.8799508810043335}} 11/07/2021 03:13:29 - INFO - __main__ - Step 42752: {'lr': 0.00041215611769797344, 'samples': 8208384, 'steps': 42751, 'loss/train': 1.8799508810043335}} 11/07/2021 03:13:32 - INFO - __main__ - Step 42759: {'lr': 0.00041212784284678345, 'samples': 8209728, 'steps': 42758, 'loss/train': 1.4945995807647705}} 11/07/2021 03:13:35 - INFO - __main__ - Step 42764: {'lr': 0.00041210764433289936, 'samples': 8210688, 'steps': 42763, 'loss/train': 1.4414068460464478}} 11/07/2021 03:13:35 - INFO - __main__ - Step 42764: {'lr': 0.00041210764433289936, 'samples': 8210688, 'steps': 42763, 'loss/train': 1.4414068460464478}} 11/07/2021 03:13:39 - INFO - __main__ - Step 42772: {'lr': 0.0004120753229125329, 'samples': 8212224, 'steps': 42771, 'loss/train': 1.3817518949508667}}} 11/07/2021 03:13:41 - INFO - __main__ - Step 42776: {'lr': 0.00041205916044960406, 'samples': 8212992, 'steps': 42775, 'loss/train': 3.485616683959961}}} 11/07/2021 03:13:43 - INFO - __main__ - Step 42780: {'lr': 0.00041204299681833344, 'samples': 8213760, 'steps': 42779, 'loss/train': 1.4981358051300049}} 11/07/2021 03:13:45 - INFO - __main__ - Step 42785: {'lr': 0.00041202279063644234, 'samples': 8214720, 'steps': 42784, 'loss/train': 1.2365634441375732}} 11/07/2021 03:13:45 - INFO - __main__ - Step 42785: {'lr': 0.00041202279063644234, 'samples': 8214720, 'steps': 42784, 'loss/train': 1.2365634441375732}} 11/07/2021 03:13:49 - INFO - __main__ - Step 42792: {'lr': 0.00041199449891563694, 'samples': 8216064, 'steps': 42791, 'loss/train': 1.2733299732208252}} 11/07/2021 03:13:51 - INFO - __main__ - Step 42796: {'lr': 0.00041197833061216494, 'samples': 8216832, 'steps': 42795, 'loss/train': 1.3337584733963013}} 11/07/2021 03:13:53 - INFO - __main__ - Step 42801: {'lr': 0.00041195811859067756, 'samples': 8217792, 'steps': 42800, 'loss/train': 1.5047647953033447}} 11/07/2021 03:13:55 - INFO - __main__ - Step 42805: {'lr': 0.0004119419476599118, 'samples': 8218560, 'steps': 42804, 'loss/train': 1.4537402391433716}}} 11/07/2021 03:13:57 - INFO - __main__ - Step 42809: {'lr': 0.00041192577556164924, 'samples': 8219328, 'steps': 42808, 'loss/train': 3.0425539016723633}} 11/07/2021 03:13:59 - INFO - __main__ - Step 42813: {'lr': 0.0004119096022960067, 'samples': 8220096, 'steps': 42812, 'loss/train': 1.6381014585494995}}} 11/07/2021 03:14:01 - INFO - __main__ - Step 42817: {'lr': 0.00041189342786310067, 'samples': 8220864, 'steps': 42816, 'loss/train': 1.1310538053512573}} 11/07/2021 03:14:04 - INFO - __main__ - Step 42822: {'lr': 0.0004118732081806814, 'samples': 8221824, 'steps': 42821, 'loss/train': 1.7592384815216064}}} 11/07/2021 03:14:06 - INFO - __main__ - Step 42826: {'lr': 0.0004118570311218589, 'samples': 8222592, 'steps': 42825, 'loss/train': 1.5050877332687378}}} 11/07/2021 03:14:07 - INFO - __main__ - Step 42830: {'lr': 0.0004118408528961519, 'samples': 8223360, 'steps': 42829, 'loss/train': 1.5232681035995483}}} 11/07/2021 03:14:09 - INFO - __main__ - Step 42834: {'lr': 0.0004118246735036769, 'samples': 8224128, 'steps': 42833, 'loss/train': 1.2316757440567017}}} 11/07/2021 03:14:09 - INFO - __main__ - Step 42834: {'lr': 0.0004118246735036769, 'samples': 8224128, 'steps': 42833, 'loss/train': 1.2316757440567017}}} 11/07/2021 03:14:11 - INFO - __main__ - Step 42839: {'lr': 0.0004118044476224937, 'samples': 8225088, 'steps': 42838, 'loss/train': 1.2652711868286133}}} 11/07/2021 03:14:14 - INFO - __main__ - Step 42845: {'lr': 0.00041178017415917655, 'samples': 8226240, 'steps': 42844, 'loss/train': 1.3483153581619263}} 11/07/2021 03:14:17 - INFO - __main__ - Step 42849: {'lr': 0.0004117639903923611, 'samples': 8227008, 'steps': 42848, 'loss/train': 0.9353717565536499}}} 11/07/2021 03:14:19 - INFO - __main__ - Step 42854: {'lr': 0.0004117437590438674, 'samples': 8227968, 'steps': 42853, 'loss/train': 1.7855308055877686}}} 11/07/2021 03:14:19 - INFO - __main__ - Step 42854: {'lr': 0.0004117437590438674, 'samples': 8227968, 'steps': 42853, 'loss/train': 1.7855308055877686}}} 11/07/2021 03:14:23 - INFO - __main__ - Step 42860: {'lr': 0.00041171947902068006, 'samples': 8229120, 'steps': 42859, 'loss/train': 1.9934972524642944}} 11/07/2021 03:14:25 - INFO - __main__ - Step 42865: {'lr': 0.000411699243664129, 'samples': 8230080, 'steps': 42864, 'loss/train': 0.8765882253646851}4}} 11/07/2021 03:14:27 - INFO - __main__ - Step 42869: {'lr': 0.0004116830540674118, 'samples': 8230848, 'steps': 42868, 'loss/train': 1.5690141916275024}}} 11/07/2021 03:14:29 - INFO - __main__ - Step 42873: {'lr': 0.0004116668633050644, 'samples': 8231616, 'steps': 42872, 'loss/train': 1.3518669605255127}}} 11/07/2021 03:14:31 - INFO - __main__ - Step 42878: {'lr': 0.00041164662321314054, 'samples': 8232576, 'steps': 42877, 'loss/train': 1.2000056505203247}} 11/07/2021 03:14:33 - INFO - __main__ - Step 42882: {'lr': 0.00041163042982855194, 'samples': 8233344, 'steps': 42881, 'loss/train': 1.416580080986023}}} 11/07/2021 03:14:33 - INFO - __main__ - Step 42882: {'lr': 0.00041163042982855194, 'samples': 8233344, 'steps': 42881, 'loss/train': 1.416580080986023}}} 11/07/2021 03:14:37 - INFO - __main__ - Step 42889: {'lr': 0.00041160208860170725, 'samples': 8234688, 'steps': 42888, 'loss/train': 1.395034670829773}}} 11/07/2021 03:14:39 - INFO - __main__ - Step 42893: {'lr': 0.0004115858920129598, 'samples': 8235456, 'steps': 42892, 'loss/train': 1.9112871885299683}}} 11/07/2021 03:14:42 - INFO - __main__ - Step 42899: {'lr': 0.00041156159494563183, 'samples': 8236608, 'steps': 42898, 'loss/train': 1.627911925315857}}} 11/07/2021 03:14:42 - INFO - __main__ - Step 42899: {'lr': 0.00041156159494563183, 'samples': 8236608, 'steps': 42898, 'loss/train': 1.627911925315857}}} 11/07/2021 03:14:45 - INFO - __main__ - Step 42906: {'lr': 0.00041153324505483933, 'samples': 8237952, 'steps': 42905, 'loss/train': 1.5426833629608154}} 11/07/2021 03:14:47 - INFO - __main__ - Step 42910: {'lr': 0.0004115170435159469, 'samples': 8238720, 'steps': 42909, 'loss/train': 1.8392415046691895}}} 11/07/2021 03:14:49 - INFO - __main__ - Step 42915: {'lr': 0.0004114967899548606, 'samples': 8239680, 'steps': 42914, 'loss/train': 1.2665598392486572}}} 11/07/2021 03:14:51 - INFO - __main__ - Step 42919: {'lr': 0.00041148058579615733, 'samples': 8240448, 'steps': 42918, 'loss/train': 1.258515477180481}}} 11/07/2021 03:14:53 - INFO - __main__ - Step 42923: {'lr': 0.00041146438047328347, 'samples': 8241216, 'steps': 42922, 'loss/train': 1.325085163116455}}} 11/07/2021 03:14:55 - INFO - __main__ - Step 42927: {'lr': 0.000411448173986356, 'samples': 8241984, 'steps': 42926, 'loss/train': 1.5102177858352661}}}} 11/07/2021 03:14:57 - INFO - __main__ - Step 42931: {'lr': 0.0004114319663354915, 'samples': 8242752, 'steps': 42930, 'loss/train': 1.8404631614685059}}} 11/07/2021 03:14:59 - INFO - __main__ - Step 42936: {'lr': 0.00041141170513530267, 'samples': 8243712, 'steps': 42935, 'loss/train': 1.4220731258392334}} 11/07/2021 03:14:59 - INFO - __main__ - Step 42936: {'lr': 0.00041141170513530267, 'samples': 8243712, 'steps': 42935, 'loss/train': 1.4220731258392334}} 11/07/2021 03:15:02 - INFO - __main__ - Step 42942: {'lr': 0.0004113873892950186, 'samples': 8244864, 'steps': 42941, 'loss/train': 1.0686310529708862}}} 11/07/2021 03:15:05 - INFO - __main__ - Step 42946: {'lr': 0.0004113711772804315, 'samples': 8245632, 'steps': 42945, 'loss/train': 1.1424510478973389}}} 11/07/2021 03:15:07 - INFO - __main__ - Step 42951: {'lr': 0.0004113509106262058, 'samples': 8246592, 'steps': 42950, 'loss/train': 1.343586802482605}}}} 11/07/2021 03:15:09 - INFO - __main__ - Step 42956: {'lr': 0.00041133064215442415, 'samples': 8247552, 'steps': 42955, 'loss/train': 2.494164228439331}}} 11/07/2021 03:15:11 - INFO - __main__ - Step 42960: {'lr': 0.0004113144260685122, 'samples': 8248320, 'steps': 42959, 'loss/train': 1.2599166631698608}}} 11/07/2021 03:15:11 - INFO - __main__ - Step 42960: {'lr': 0.0004113144260685122, 'samples': 8248320, 'steps': 42959, 'loss/train': 1.2599166631698608}}} 11/07/2021 03:15:15 - INFO - __main__ - Step 42967: {'lr': 0.00041128604511983356, 'samples': 8249664, 'steps': 42966, 'loss/train': 2.0043632984161377}} 11/07/2021 03:15:17 - INFO - __main__ - Step 42972: {'lr': 0.00041126577083340797, 'samples': 8250624, 'steps': 42971, 'loss/train': 1.1354724168777466}} 11/07/2021 03:15:19 - INFO - __main__ - Step 42977: {'lr': 0.00041124549473038564, 'samples': 8251584, 'steps': 42976, 'loss/train': 1.7653063535690308}} 11/07/2021 03:15:19 - INFO - __main__ - Step 42977: {'lr': 0.00041124549473038564, 'samples': 8251584, 'steps': 42976, 'loss/train': 1.7653063535690308}} 11/07/2021 03:15:23 - INFO - __main__ - Step 42984: {'lr': 0.0004112171051347069, 'samples': 8252928, 'steps': 42983, 'loss/train': 1.8783169984817505}}} 11/07/2021 03:15:25 - INFO - __main__ - Step 42988: {'lr': 0.00041120088091044183, 'samples': 8253696, 'steps': 42987, 'loss/train': 1.4893476963043213}} 11/07/2021 03:15:27 - INFO - __main__ - Step 42993: {'lr': 0.00041118059899584503, 'samples': 8254656, 'steps': 42992, 'loss/train': 1.7796658277511597}} 11/07/2021 03:15:29 - INFO - __main__ - Step 42997: {'lr': 0.00041116437215689785, 'samples': 8255424, 'steps': 42996, 'loss/train': 1.6728167533874512}} 11/07/2021 03:15:31 - INFO - __main__ - Step 43001: {'lr': 0.00041114814415605977, 'samples': 8256192, 'steps': 43000, 'loss/train': 0.8801016807556152}} 11/07/2021 03:15:33 - INFO - __main__ - Step 43005: {'lr': 0.00041113191499344784, 'samples': 8256960, 'steps': 43004, 'loss/train': 1.4178489446640015}} 11/07/2021 03:15:35 - INFO - __main__ - Step 43009: {'lr': 0.000411115684669179, 'samples': 8257728, 'steps': 43008, 'loss/train': 1.5675806999206543}5}} 11/07/2021 03:15:37 - INFO - __main__ - Step 43014: {'lr': 0.00041109539513044127, 'samples': 8258688, 'steps': 43013, 'loss/train': 1.3206629753112793}} 11/07/2021 03:15:37 - INFO - __main__ - Step 43014: {'lr': 0.00041109539513044127, 'samples': 8258688, 'steps': 43013, 'loss/train': 1.3206629753112793}} 11/07/2021 03:15:41 - INFO - __main__ - Step 43021: {'lr': 0.00041106698672760145, 'samples': 8260032, 'steps': 43020, 'loss/train': 1.1121315956115723}} 11/07/2021 03:15:43 - INFO - __main__ - Step 43025: {'lr': 0.00041105075175787534, 'samples': 8260800, 'steps': 43024, 'loss/train': 1.5974022150039673}} 11/07/2021 03:15:45 - INFO - __main__ - Step 43030: {'lr': 0.0004110304564129742, 'samples': 8261760, 'steps': 43029, 'loss/train': 1.4041417837142944}}} 11/07/2021 03:15:45 - INFO - __main__ - Step 43030: {'lr': 0.0004110304564129742, 'samples': 8261760, 'steps': 43029, 'loss/train': 1.4041417837142944}}} 11/07/2021 03:15:45 - INFO - __main__ - Step 43030: {'lr': 0.0004110304564129742, 'samples': 8261760, 'steps': 43029, 'loss/train': 1.4041417837142944}}} 11/07/2021 03:15:51 - INFO - __main__ - Step 43041: {'lr': 0.000410985800269424, 'samples': 8263872, 'steps': 43040, 'loss/train': 1.612999439239502}4}}} 11/07/2021 03:15:54 - INFO - __main__ - Step 43046: {'lr': 0.00041096549912070067, 'samples': 8264832, 'steps': 43045, 'loss/train': 1.7899335622787476}} 11/07/2021 03:15:54 - INFO - __main__ - Step 43046: {'lr': 0.00041096549912070067, 'samples': 8264832, 'steps': 43045, 'loss/train': 1.7899335622787476}} 11/07/2021 03:15:54 - INFO - __main__ - Step 43046: {'lr': 0.00041096549912070067, 'samples': 8264832, 'steps': 43045, 'loss/train': 1.7899335622787476}} 11/07/2021 03:15:59 - INFO - __main__ - Step 43056: {'lr': 0.00041092489138384, 'samples': 8266752, 'steps': 43055, 'loss/train': 5.796188831329346}476}} 11/07/2021 03:16:02 - INFO - __main__ - Step 43062: {'lr': 0.0004109005232611134, 'samples': 8267904, 'steps': 43061, 'loss/train': 1.4820033311843872}}} 11/07/2021 03:16:04 - INFO - __main__ - Step 43066: {'lr': 0.00041088427639595206, 'samples': 8268672, 'steps': 43065, 'loss/train': 1.3141721487045288}} 11/07/2021 03:16:06 - INFO - __main__ - Step 43070: {'lr': 0.00041086802837091916, 'samples': 8269440, 'steps': 43069, 'loss/train': 2.006185293197632}}} 11/07/2021 03:16:06 - INFO - __main__ - Step 43070: {'lr': 0.00041086802837091916, 'samples': 8269440, 'steps': 43069, 'loss/train': 2.006185293197632}}} 11/07/2021 03:16:09 - INFO - __main__ - Step 43077: {'lr': 0.000410839591536523, 'samples': 8270784, 'steps': 43076, 'loss/train': 1.5742981433868408}}}} 11/07/2021 03:16:12 - INFO - __main__ - Step 43083: {'lr': 0.0004108152142806151, 'samples': 8271936, 'steps': 43082, 'loss/train': 1.5733226537704468}}} 11/07/2021 03:16:14 - INFO - __main__ - Step 43087: {'lr': 0.00041079896132743506, 'samples': 8272704, 'steps': 43086, 'loss/train': 1.5374175310134888}} 11/07/2021 03:16:16 - INFO - __main__ - Step 43091: {'lr': 0.0004107827072149984, 'samples': 8273472, 'steps': 43090, 'loss/train': 1.2737891674041748}}} 11/07/2021 03:16:16 - INFO - __main__ - Step 43091: {'lr': 0.0004107827072149984, 'samples': 8273472, 'steps': 43090, 'loss/train': 1.2737891674041748}}} 11/07/2021 03:16:19 - INFO - __main__ - Step 43098: {'lr': 0.00041075425972912595, 'samples': 8274816, 'steps': 43097, 'loss/train': 1.7125060558319092}} 11/07/2021 03:16:21 - INFO - __main__ - Step 43103: {'lr': 0.00041073393792332157, 'samples': 8275776, 'steps': 43102, 'loss/train': 1.6294130086898804}} 11/07/2021 03:16:24 - INFO - __main__ - Step 43108: {'lr': 0.00041071361430691143, 'samples': 8276736, 'steps': 43107, 'loss/train': 1.5375863313674927}} 11/07/2021 03:16:26 - INFO - __main__ - Step 43112: {'lr': 0.00041069735411030105, 'samples': 8277504, 'steps': 43111, 'loss/train': 1.5038973093032837}} 11/07/2021 03:16:26 - INFO - __main__ - Step 43112: {'lr': 0.00041069735411030105, 'samples': 8277504, 'steps': 43111, 'loss/train': 1.5038973093032837}} 11/07/2021 03:16:29 - INFO - __main__ - Step 43119: {'lr': 0.000410668895978605, 'samples': 8278848, 'steps': 43118, 'loss/train': 1.167095422744751}37}} 11/07/2021 03:16:31 - INFO - __main__ - Step 43124: {'lr': 0.0004106485665697948, 'samples': 8279808, 'steps': 43123, 'loss/train': 1.0472019910812378}}} 11/07/2021 03:16:34 - INFO - __main__ - Step 43129: {'lr': 0.00041062823535134053, 'samples': 8280768, 'steps': 43128, 'loss/train': 1.6788736581802368}} 11/07/2021 03:16:36 - INFO - __main__ - Step 43133: {'lr': 0.00041061196907378727, 'samples': 8281536, 'steps': 43132, 'loss/train': 0.20600904524326324} 11/07/2021 03:16:36 - INFO - __main__ - Step 43133: {'lr': 0.00041061196907378727, 'samples': 8281536, 'steps': 43132, 'loss/train': 0.20600904524326324} 11/07/2021 03:16:39 - INFO - __main__ - Step 43140: {'lr': 0.0004105835003019225, 'samples': 8282880, 'steps': 43139, 'loss/train': 1.4372590780258179}4} 11/07/2021 03:16:42 - INFO - __main__ - Step 43145: {'lr': 0.00041056316329414613, 'samples': 8283840, 'steps': 43144, 'loss/train': 1.4259611368179321}} 11/07/2021 03:16:44 - INFO - __main__ - Step 43150: {'lr': 0.00041054282447768763, 'samples': 8284800, 'steps': 43149, 'loss/train': 1.6200916767120361}} 11/07/2021 03:16:46 - INFO - __main__ - Step 43154: {'lr': 0.00041052655212242377, 'samples': 8285568, 'steps': 43153, 'loss/train': 1.352491021156311}}} 11/07/2021 03:16:46 - INFO - __main__ - Step 43154: {'lr': 0.00041052655212242377, 'samples': 8285568, 'steps': 43153, 'loss/train': 1.352491021156311}}} 11/07/2021 03:16:49 - INFO - __main__ - Step 43160: {'lr': 0.000410502141419641, 'samples': 8286720, 'steps': 43159, 'loss/train': 1.2417993545532227}}}} 11/07/2021 03:16:49 - INFO - __main__ - Step 43160: {'lr': 0.000410502141419641, 'samples': 8286720, 'steps': 43159, 'loss/train': 1.2417993545532227}}}} 11/07/2021 03:16:54 - INFO - __main__ - Step 43169: {'lr': 0.0004104655204840048, 'samples': 8288448, 'steps': 43168, 'loss/train': 1.7086986303329468}}} 11/07/2021 03:16:55 - INFO - __main__ - Step 43173: {'lr': 0.00041044924263264603, 'samples': 8289216, 'steps': 43172, 'loss/train': 1.5493334531784058}} 11/07/2021 03:16:57 - INFO - __main__ - Step 43177: {'lr': 0.0004104329636245521, 'samples': 8289984, 'steps': 43176, 'loss/train': 0.77286696434021}58}} 11/07/2021 03:17:00 - INFO - __main__ - Step 43182: {'lr': 0.00041041261323795437, 'samples': 8290944, 'steps': 43181, 'loss/train': 1.5282878875732422}} 11/07/2021 03:17:02 - INFO - __main__ - Step 43186: {'lr': 0.00041039633162763523, 'samples': 8291712, 'steps': 43185, 'loss/train': 1.0623544454574585}} 11/07/2021 03:17:04 - INFO - __main__ - Step 43190: {'lr': 0.0004103800488609622, 'samples': 8292480, 'steps': 43189, 'loss/train': 1.1504318714141846}}} 11/07/2021 03:17:04 - INFO - __main__ - Step 43190: {'lr': 0.0004103800488609622, 'samples': 8292480, 'steps': 43189, 'loss/train': 1.1504318714141846}}} 11/07/2021 03:17:07 - INFO - __main__ - Step 43197: {'lr': 0.00041035155123716127, 'samples': 8293824, 'steps': 43196, 'loss/train': 0.3642762005329132}} 11/07/2021 03:17:10 - INFO - __main__ - Step 43203: {'lr': 0.0004103271218846254, 'samples': 8294976, 'steps': 43202, 'loss/train': 1.2290693521499634}}} 11/07/2021 03:17:12 - INFO - __main__ - Step 43207: {'lr': 0.00041031083420475854, 'samples': 8295744, 'steps': 43206, 'loss/train': 1.379603624343872}}} 11/07/2021 03:17:14 - INFO - __main__ - Step 43211: {'lr': 0.0004102945453691542, 'samples': 8296512, 'steps': 43210, 'loss/train': 1.2275004386901855}}} 11/07/2021 03:17:16 - INFO - __main__ - Step 43215: {'lr': 0.00041027825537792993, 'samples': 8297280, 'steps': 43214, 'loss/train': 1.04563307762146}}}} 11/07/2021 03:17:18 - INFO - __main__ - Step 43219: {'lr': 0.0004102619642312031, 'samples': 8298048, 'steps': 43218, 'loss/train': 1.255764365196228}}}} 11/07/2021 03:17:18 - INFO - __main__ - Step 43219: {'lr': 0.0004102619642312031, 'samples': 8298048, 'steps': 43218, 'loss/train': 1.255764365196228}}}} 11/07/2021 03:17:21 - INFO - __main__ - Step 43226: {'lr': 0.0004102334519443565, 'samples': 8299392, 'steps': 43225, 'loss/train': 0.4940943717956543}}} 11/07/2021 03:17:24 - INFO - __main__ - Step 43230: {'lr': 0.00041021715762060336, 'samples': 8300160, 'steps': 43229, 'loss/train': 1.6046879291534424}} 11/07/2021 03:17:26 - INFO - __main__ - Step 43234: {'lr': 0.0004102008621417881, 'samples': 8300928, 'steps': 43233, 'loss/train': 1.7774174213409424}}} 11/07/2021 03:17:28 - INFO - __main__ - Step 43238: {'lr': 0.0004101845655080283, 'samples': 8301696, 'steps': 43237, 'loss/train': 1.5787715911865234}}} 11/07/2021 03:17:29 - INFO - __main__ - Step 43242: {'lr': 0.0004101682677194414, 'samples': 8302464, 'steps': 43241, 'loss/train': 1.368617057800293}}}} 11/07/2021 03:17:31 - INFO - __main__ - Step 43246: {'lr': 0.0004101519687761449, 'samples': 8303232, 'steps': 43245, 'loss/train': 1.2122910022735596}}} 11/07/2021 03:17:34 - INFO - __main__ - Step 43250: {'lr': 0.00041013566867825627, 'samples': 8304000, 'steps': 43249, 'loss/train': 1.616087794303894}}} 11/07/2021 03:17:36 - INFO - __main__ - Step 43254: {'lr': 0.0004101193674258931, 'samples': 8304768, 'steps': 43253, 'loss/train': 1.2787197828292847}}} 11/07/2021 03:17:37 - INFO - __main__ - Step 43258: {'lr': 0.00041010306501917287, 'samples': 8305536, 'steps': 43257, 'loss/train': 1.4174599647521973}} 11/07/2021 03:17:39 - INFO - __main__ - Step 43262: {'lr': 0.0004100867614582131, 'samples': 8306304, 'steps': 43261, 'loss/train': 1.5937113761901855}}} 11/07/2021 03:17:42 - INFO - __main__ - Step 43267: {'lr': 0.0004100663803840431, 'samples': 8307264, 'steps': 43266, 'loss/train': 0.8777159452438354}}} 11/07/2021 03:17:42 - INFO - __main__ - Step 43267: {'lr': 0.0004100663803840431, 'samples': 8307264, 'steps': 43266, 'loss/train': 0.8777159452438354}}} 11/07/2021 03:17:46 - INFO - __main__ - Step 43274: {'lr': 0.0004100378438510721, 'samples': 8308608, 'steps': 43273, 'loss/train': 1.5262449979782104}}} 11/07/2021 03:17:47 - INFO - __main__ - Step 43278: {'lr': 0.00041002153567432965, 'samples': 8309376, 'steps': 43277, 'loss/train': 1.4881389141082764}} 11/07/2021 03:17:50 - INFO - __main__ - Step 43282: {'lr': 0.0004100052263439355, 'samples': 8310144, 'steps': 43281, 'loss/train': 1.6344153881072998}}} 11/07/2021 03:17:52 - INFO - __main__ - Step 43286: {'lr': 0.00040998891586000716, 'samples': 8310912, 'steps': 43285, 'loss/train': 1.4235750436782837}} 11/07/2021 03:17:54 - INFO - __main__ - Step 43290: {'lr': 0.00040997260422266223, 'samples': 8311680, 'steps': 43289, 'loss/train': 1.5309162139892578}} 11/07/2021 03:17:55 - INFO - __main__ - Step 43294: {'lr': 0.0004099562914320183, 'samples': 8312448, 'steps': 43293, 'loss/train': 1.6182531118392944}}} 11/07/2021 03:17:58 - INFO - __main__ - Step 43298: {'lr': 0.000409939977488193, 'samples': 8313216, 'steps': 43297, 'loss/train': 2.0168118476867676}}}} 11/07/2021 03:18:01 - INFO - __main__ - Step 43303: {'lr': 0.0004099195834369292, 'samples': 8314176, 'steps': 43302, 'loss/train': 1.5457040071487427}}} 11/07/2021 03:18:04 - INFO - __main__ - Step 43309: {'lr': 0.0004098951081975421, 'samples': 8315328, 'steps': 43308, 'loss/train': 1.566868782043457}}}} 11/07/2021 03:18:06 - INFO - __main__ - Step 43313: {'lr': 0.00040987878993033417, 'samples': 8316096, 'steps': 43312, 'loss/train': 1.7013061046600342}} 11/07/2021 03:18:07 - INFO - __main__ - Step 43317: {'lr': 0.0004098624705105036, 'samples': 8316864, 'steps': 43316, 'loss/train': 1.5735280513763428}}} 11/07/2021 03:18:09 - INFO - __main__ - Step 43321: {'lr': 0.000409846149938168, 'samples': 8317632, 'steps': 43320, 'loss/train': 1.836496114730835}8}}} 11/07/2021 03:18:12 - INFO - __main__ - Step 43326: {'lr': 0.0004098257476022176, 'samples': 8318592, 'steps': 43325, 'loss/train': 1.7156808376312256}}} 11/07/2021 03:18:14 - INFO - __main__ - Step 43330: {'lr': 0.00040980942443717596, 'samples': 8319360, 'steps': 43329, 'loss/train': 1.0857611894607544}} 11/07/2021 03:18:14 - INFO - __main__ - Step 43330: {'lr': 0.00040980942443717596, 'samples': 8319360, 'steps': 43329, 'loss/train': 1.0857611894607544}} 11/07/2021 03:18:17 - INFO - __main__ - Step 43338: {'lr': 0.00040977677465084275, 'samples': 8320896, 'steps': 43337, 'loss/train': 1.318015217781067}}} 11/07/2021 03:18:20 - INFO - __main__ - Step 43342: {'lr': 0.00040976044802978645, 'samples': 8321664, 'steps': 43341, 'loss/train': 1.496842622756958}}} 11/07/2021 03:18:22 - INFO - __main__ - Step 43347: {'lr': 0.0004097400381338041, 'samples': 8322624, 'steps': 43346, 'loss/train': 1.7230134010314941}}} 11/07/2021 03:18:24 - INFO - __main__ - Step 43352: {'lr': 0.0004097196264384118, 'samples': 8323584, 'steps': 43351, 'loss/train': 1.0327445268630981}}} 11/07/2021 03:18:26 - INFO - __main__ - Step 43356: {'lr': 0.00040970329578667735, 'samples': 8324352, 'steps': 43355, 'loss/train': 1.532841444015503}}} 11/07/2021 03:18:26 - INFO - __main__ - Step 43356: {'lr': 0.00040970329578667735, 'samples': 8324352, 'steps': 43355, 'loss/train': 1.532841444015503}}} 11/07/2021 03:18:30 - INFO - __main__ - Step 43363: {'lr': 0.0004096747143757591, 'samples': 8325696, 'steps': 43362, 'loss/train': 0.833960771560669}}}} 11/07/2021 03:18:32 - INFO - __main__ - Step 43368: {'lr': 0.00040965429692380034, 'samples': 8326656, 'steps': 43367, 'loss/train': 1.3145965337753296}} 11/07/2021 03:18:34 - INFO - __main__ - Step 43372: {'lr': 0.00040963796166734257, 'samples': 8327424, 'steps': 43371, 'loss/train': 1.6246415376663208}} 11/07/2021 03:18:36 - INFO - __main__ - Step 43376: {'lr': 0.00040962162525999833, 'samples': 8328192, 'steps': 43375, 'loss/train': 0.593380868434906}}} 11/07/2021 03:18:38 - INFO - __main__ - Step 43380: {'lr': 0.00040960528770188554, 'samples': 8328960, 'steps': 43379, 'loss/train': 1.4032756090164185}} 11/07/2021 03:18:40 - INFO - __main__ - Step 43384: {'lr': 0.00040958894899312183, 'samples': 8329728, 'steps': 43383, 'loss/train': 1.640594244003296}}} 11/07/2021 03:18:42 - INFO - __main__ - Step 43389: {'lr': 0.00040956852398924383, 'samples': 8330688, 'steps': 43388, 'loss/train': 1.3369059562683105}} 11/07/2021 03:18:44 - INFO - __main__ - Step 43393: {'lr': 0.0004095521826919463, 'samples': 8331456, 'steps': 43392, 'loss/train': 1.5825469493865967}}} 11/07/2021 03:18:44 - INFO - __main__ - Step 43393: {'lr': 0.0004095521826919463, 'samples': 8331456, 'steps': 43392, 'loss/train': 1.5825469493865967}}} 11/07/2021 03:18:47 - INFO - __main__ - Step 43400: {'lr': 0.0004095235826539141, 'samples': 8332800, 'steps': 43399, 'loss/train': 1.3647236824035645}}} 11/07/2021 03:18:50 - INFO - __main__ - Step 43404: {'lr': 0.00040950723819366307, 'samples': 8333568, 'steps': 43403, 'loss/train': 1.461193323135376}}} 11/07/2021 03:18:50 - INFO - __main__ - Step 43404: {'lr': 0.00040950723819366307, 'samples': 8333568, 'steps': 43403, 'loss/train': 1.461193323135376}}} 11/07/2021 03:18:55 - INFO - __main__ - Step 43413: {'lr': 0.00040947045895379494, 'samples': 8335296, 'steps': 43412, 'loss/train': 1.4618539810180664}} 11/07/2021 03:18:57 - INFO - __main__ - Step 43417: {'lr': 0.00040945411075665674, 'samples': 8336064, 'steps': 43416, 'loss/train': 1.0641642808914185}} 11/07/2021 03:18:58 - INFO - __main__ - Step 43421: {'lr': 0.00040943776140995756, 'samples': 8336832, 'steps': 43420, 'loss/train': 1.336556077003479}}} 11/07/2021 03:18:58 - INFO - __main__ - Step 43421: {'lr': 0.00040943776140995756, 'samples': 8336832, 'steps': 43420, 'loss/train': 1.336556077003479}}} 11/07/2021 03:19:02 - INFO - __main__ - Step 43429: {'lr': 0.0004094050592683477, 'samples': 8338368, 'steps': 43428, 'loss/train': 1.762089729309082}}}} 11/07/2021 03:19:04 - INFO - __main__ - Step 43433: {'lr': 0.00040938870647367275, 'samples': 8339136, 'steps': 43432, 'loss/train': 1.3768030405044556}} 11/07/2021 03:19:06 - INFO - __main__ - Step 43437: {'lr': 0.00040937235252990834, 'samples': 8339904, 'steps': 43436, 'loss/train': 1.6887180805206299}} 11/07/2021 03:19:08 - INFO - __main__ - Step 43441: {'lr': 0.00040935599743717243, 'samples': 8340672, 'steps': 43440, 'loss/train': 1.6567108631134033}} 11/07/2021 03:19:10 - INFO - __main__ - Step 43446: {'lr': 0.0004093355519556908, 'samples': 8341632, 'steps': 43445, 'loss/train': 1.2826430797576904}}} 11/07/2021 03:19:12 - INFO - __main__ - Step 43450: {'lr': 0.0004093191942782001, 'samples': 8342400, 'steps': 43449, 'loss/train': 1.4493123292922974}}} 11/07/2021 03:19:14 - INFO - __main__ - Step 43454: {'lr': 0.0004093028354521209, 'samples': 8343168, 'steps': 43453, 'loss/train': 1.4804021120071411}}} 11/07/2021 03:19:16 - INFO - __main__ - Step 43458: {'lr': 0.0004092864754775713, 'samples': 8343936, 'steps': 43457, 'loss/train': 1.1358518600463867}}} 11/07/2021 03:19:18 - INFO - __main__ - Step 43462: {'lr': 0.00040927011435466933, 'samples': 8344704, 'steps': 43461, 'loss/train': 1.3408634662628174}} 11/07/2021 03:19:18 - INFO - __main__ - Step 43462: {'lr': 0.00040927011435466933, 'samples': 8344704, 'steps': 43461, 'loss/train': 1.3408634662628174}} 11/07/2021 03:19:22 - INFO - __main__ - Step 43470: {'lr': 0.00040923738866427986, 'samples': 8346240, 'steps': 43469, 'loss/train': 1.2424981594085693}} 11/07/2021 03:19:24 - INFO - __main__ - Step 43474: {'lr': 0.0004092210240970282, 'samples': 8347008, 'steps': 43473, 'loss/train': 1.4208704233169556}}} 11/07/2021 03:19:26 - INFO - __main__ - Step 43478: {'lr': 0.000409204658381896, 'samples': 8347776, 'steps': 43477, 'loss/train': 1.6098095178604126}}}} 11/07/2021 03:19:28 - INFO - __main__ - Step 43483: {'lr': 0.0004091841996239535, 'samples': 8348736, 'steps': 43482, 'loss/train': 1.563791275024414}}}} 11/07/2021 03:19:28 - INFO - __main__ - Step 43483: {'lr': 0.0004091841996239535, 'samples': 8348736, 'steps': 43482, 'loss/train': 1.563791275024414}}}} 11/07/2021 03:19:32 - INFO - __main__ - Step 43490: {'lr': 0.0004091555543503959, 'samples': 8350080, 'steps': 43489, 'loss/train': 1.5925389528274536}}} 11/07/2021 03:19:34 - INFO - __main__ - Step 43494: {'lr': 0.0004091391840449213, 'samples': 8350848, 'steps': 43493, 'loss/train': 1.5638720989227295}}} 11/07/2021 03:19:36 - INFO - __main__ - Step 43499: {'lr': 0.0004091187195497146, 'samples': 8351808, 'steps': 43498, 'loss/train': 2.1405017375946045}}} 11/07/2021 03:19:38 - INFO - __main__ - Step 43503: {'lr': 0.0004091023466630023, 'samples': 8352576, 'steps': 43502, 'loss/train': 1.8581750392913818}}} 11/07/2021 03:19:40 - INFO - __main__ - Step 43507: {'lr': 0.00040908597262926484, 'samples': 8353344, 'steps': 43506, 'loss/train': 1.5585148334503174}} 11/07/2021 03:19:42 - INFO - __main__ - Step 43511: {'lr': 0.0004090695974486206, 'samples': 8354112, 'steps': 43510, 'loss/train': 1.5004621744155884}}} 11/07/2021 03:19:44 - INFO - __main__ - Step 43515: {'lr': 0.0004090532211211874, 'samples': 8354880, 'steps': 43514, 'loss/train': 1.1567301750183105}}} 11/07/2021 03:19:46 - INFO - __main__ - Step 43520: {'lr': 0.0004090327490994038, 'samples': 8355840, 'steps': 43519, 'loss/train': 1.3841397762298584}}} 11/07/2021 03:19:46 - INFO - __main__ - Step 43520: {'lr': 0.0004090327490994038, 'samples': 8355840, 'steps': 43519, 'loss/train': 1.3841397762298584}}} 11/07/2021 03:19:51 - INFO - __main__ - Step 43528: {'lr': 0.0004089999901384456, 'samples': 8357376, 'steps': 43527, 'loss/train': 1.1758595705032349}}} 11/07/2021 03:19:51 - INFO - __main__ - Step 43528: {'lr': 0.0004089999901384456, 'samples': 8357376, 'steps': 43527, 'loss/train': 1.1758595705032349}}} 11/07/2021 03:19:54 - INFO - __main__ - Step 43535: {'lr': 0.00040897132228632035, 'samples': 8358720, 'steps': 43534, 'loss/train': 1.61453115940094}}}} 11/07/2021 03:19:57 - INFO - __main__ - Step 43540: {'lr': 0.0004089508431001504, 'samples': 8359680, 'steps': 43539, 'loss/train': 1.860291600227356}}}} 11/07/2021 03:19:57 - INFO - __main__ - Step 43540: {'lr': 0.0004089508431001504, 'samples': 8359680, 'steps': 43539, 'loss/train': 1.860291600227356}}}} 11/07/2021 03:20:00 - INFO - __main__ - Step 43547: {'lr': 0.00040892216923149073, 'samples': 8361024, 'steps': 43546, 'loss/train': 1.7720543146133423}} 11/07/2021 03:20:02 - INFO - __main__ - Step 43551: {'lr': 0.00040890578258827125, 'samples': 8361792, 'steps': 43550, 'loss/train': 1.712302327156067}}} 11/07/2021 03:20:04 - INFO - __main__ - Step 43556: {'lr': 0.00040888529767324966, 'samples': 8362752, 'steps': 43555, 'loss/train': 1.569291114807129}}} 11/07/2021 03:20:07 - INFO - __main__ - Step 43561: {'lr': 0.0004088648109684465, 'samples': 8363712, 'steps': 43560, 'loss/train': 1.0811123847961426}}} 11/07/2021 03:20:07 - INFO - __main__ - Step 43561: {'lr': 0.0004088648109684465, 'samples': 8363712, 'steps': 43560, 'loss/train': 1.0811123847961426}}} 11/07/2021 03:20:10 - INFO - __main__ - Step 43568: {'lr': 0.00040883612657532844, 'samples': 8365056, 'steps': 43567, 'loss/train': 0.7460005879402161}} 11/07/2021 03:20:12 - INFO - __main__ - Step 43573: {'lr': 0.00040881563557599107, 'samples': 8366016, 'steps': 43572, 'loss/train': 1.221165418624878}}} 11/07/2021 03:20:14 - INFO - __main__ - Step 43577: {'lr': 0.00040879924148843233, 'samples': 8366784, 'steps': 43576, 'loss/train': 1.5095930099487305}} 11/07/2021 03:20:17 - INFO - __main__ - Step 43582: {'lr': 0.0004087787472690668, 'samples': 8367744, 'steps': 43581, 'loss/train': 1.1368916034698486}}} 11/07/2021 03:20:19 - INFO - __main__ - Step 43586: {'lr': 0.00040876235060578476, 'samples': 8368512, 'steps': 43585, 'loss/train': 1.665792465209961}}} 11/07/2021 03:20:19 - INFO - __main__ - Step 43586: {'lr': 0.00040876235060578476, 'samples': 8368512, 'steps': 43585, 'loss/train': 1.665792465209961}}} 11/07/2021 03:20:22 - INFO - __main__ - Step 43593: {'lr': 0.0004087336536909815, 'samples': 8369856, 'steps': 43592, 'loss/train': 1.7234948873519897}}} 11/07/2021 03:20:25 - INFO - __main__ - Step 43597: {'lr': 0.0004087172538804058, 'samples': 8370624, 'steps': 43596, 'loss/train': 1.3239374160766602}}} 11/07/2021 03:20:27 - INFO - __main__ - Step 43601: {'lr': 0.00040870085292558147, 'samples': 8371392, 'steps': 43600, 'loss/train': 1.1627293825149536}} 11/07/2021 03:20:29 - INFO - __main__ - Step 43605: {'lr': 0.00040868445082662655, 'samples': 8372160, 'steps': 43604, 'loss/train': 1.4655396938323975}} 11/07/2021 03:20:30 - INFO - __main__ - Step 43609: {'lr': 0.0004086680475836594, 'samples': 8372928, 'steps': 43608, 'loss/train': 0.5646260976791382}}} 11/07/2021 03:20:32 - INFO - __main__ - Step 43613: {'lr': 0.0004086516431967984, 'samples': 8373696, 'steps': 43612, 'loss/train': 1.6255570650100708}}} 11/07/2021 03:20:35 - INFO - __main__ - Step 43619: {'lr': 0.0004086270344719642, 'samples': 8374848, 'steps': 43618, 'loss/train': 1.7593225240707397}}} 11/07/2021 03:20:37 - INFO - __main__ - Step 43623: {'lr': 0.0004086106272258856, 'samples': 8375616, 'steps': 43622, 'loss/train': 0.8363951444625854}}} 11/07/2021 03:20:37 - INFO - __main__ - Step 43623: {'lr': 0.0004086106272258856, 'samples': 8375616, 'steps': 43622, 'loss/train': 0.8363951444625854}}} 11/07/2021 03:20:40 - INFO - __main__ - Step 43629: {'lr': 0.00040858601421277956, 'samples': 8376768, 'steps': 43628, 'loss/train': 1.0271927118301392}} 11/07/2021 03:20:43 - INFO - __main__ - Step 43635: {'lr': 0.0004085613986272428, 'samples': 8377920, 'steps': 43634, 'loss/train': 5.793621063232422}2}} 11/07/2021 03:20:43 - INFO - __main__ - Step 43635: {'lr': 0.0004085613986272428, 'samples': 8377920, 'steps': 43634, 'loss/train': 5.793621063232422}2}} 11/07/2021 03:20:46 - INFO - __main__ - Step 43642: {'lr': 0.00040853267719338256, 'samples': 8379264, 'steps': 43641, 'loss/train': 1.3150479793548584}} 11/07/2021 03:20:48 - INFO - __main__ - Step 43646: {'lr': 0.0004085162633739095, 'samples': 8380032, 'steps': 43645, 'loss/train': 1.760745882987976}4}} 11/07/2021 03:20:51 - INFO - __main__ - Step 43651: {'lr': 0.0004084957444925198, 'samples': 8380992, 'steps': 43650, 'loss/train': 1.450783371925354}4}} 11/07/2021 03:20:51 - INFO - __main__ - Step 43651: {'lr': 0.0004084957444925198, 'samples': 8380992, 'steps': 43650, 'loss/train': 1.450783371925354}4}} 11/07/2021 03:20:54 - INFO - __main__ - Step 43657: {'lr': 0.0004084711194781533, 'samples': 8382144, 'steps': 43656, 'loss/train': 1.0405417680740356}}} 11/07/2021 03:20:56 - INFO - __main__ - Step 43662: {'lr': 0.00040845059666920323, 'samples': 8383104, 'steps': 43661, 'loss/train': 1.7791005373001099}} 11/07/2021 03:20:59 - INFO - __main__ - Step 43667: {'lr': 0.0004084300720753684, 'samples': 8384064, 'steps': 43666, 'loss/train': 1.6888883113861084}}} 11/07/2021 03:21:01 - INFO - __main__ - Step 43671: {'lr': 0.0004084136511153388, 'samples': 8384832, 'steps': 43670, 'loss/train': 1.9210768938064575}}} 11/07/2021 03:21:03 - INFO - __main__ - Step 43675: {'lr': 0.00040839722901324924, 'samples': 8385600, 'steps': 43674, 'loss/train': 0.7722538709640503}} 11/07/2021 03:21:05 - INFO - __main__ - Step 43679: {'lr': 0.0004083808057692181, 'samples': 8386368, 'steps': 43678, 'loss/train': 1.5521893501281738}}} 11/07/2021 03:21:06 - INFO - __main__ - Step 43683: {'lr': 0.00040836438138336384, 'samples': 8387136, 'steps': 43682, 'loss/train': 1.5819343328475952}} 11/07/2021 03:21:08 - INFO - __main__ - Step 43687: {'lr': 0.0004083479558558048, 'samples': 8387904, 'steps': 43686, 'loss/train': 1.5194342136383057}}} 11/07/2021 03:21:11 - INFO - __main__ - Step 43692: {'lr': 0.00040832742234101415, 'samples': 8388864, 'steps': 43691, 'loss/train': 1.4217921495437622}} 11/07/2021 03:21:11 - INFO - __main__ - Step 43692: {'lr': 0.00040832742234101415, 'samples': 8388864, 'steps': 43691, 'loss/train': 1.4217921495437622}} 11/07/2021 03:21:14 - INFO - __main__ - Step 43699: {'lr': 0.0004082986724240835, 'samples': 8390208, 'steps': 43698, 'loss/train': 1.9071747064590454}}} 11/07/2021 03:21:16 - INFO - __main__ - Step 43703: {'lr': 0.0004082822423308897, 'samples': 8390976, 'steps': 43702, 'loss/train': 0.9039545059204102}}} 11/07/2021 03:21:19 - INFO - __main__ - Step 43708: {'lr': 0.00040826170310972196, 'samples': 8391936, 'steps': 43707, 'loss/train': 1.3259520530700684}} 11/07/2021 03:21:21 - INFO - __main__ - Step 43713: {'lr': 0.0004082411621057971, 'samples': 8392896, 'steps': 43712, 'loss/train': 1.6946877241134644}}} 11/07/2021 03:21:23 - INFO - __main__ - Step 43717: {'lr': 0.0004082247280192276, 'samples': 8393664, 'steps': 43716, 'loss/train': 1.2968155145645142}}} 11/07/2021 03:21:23 - INFO - __main__ - Step 43717: {'lr': 0.0004082247280192276, 'samples': 8393664, 'steps': 43716, 'loss/train': 1.2968155145645142}}} 11/07/2021 03:21:26 - INFO - __main__ - Step 43724: {'lr': 0.00040819596562299793, 'samples': 8395008, 'steps': 43723, 'loss/train': 0.6590692400932312}} 11/07/2021 03:21:26 - INFO - __main__ - Step 43724: {'lr': 0.00040819596562299793, 'samples': 8395008, 'steps': 43723, 'loss/train': 0.6590692400932312}} 11/07/2021 03:21:30 - INFO - __main__ - Step 43731: {'lr': 0.00040816719973401586, 'samples': 8396352, 'steps': 43730, 'loss/train': 1.64004647731781}2}} 11/07/2021 03:21:33 - INFO - __main__ - Step 43736: {'lr': 0.0004081466505323892, 'samples': 8397312, 'steps': 43735, 'loss/train': 0.5817473530769348}}} 11/07/2021 03:21:35 - INFO - __main__ - Step 43740: {'lr': 0.0004081302098884249, 'samples': 8398080, 'steps': 43739, 'loss/train': 0.8534517288208008}}} 11/07/2021 03:21:35 - INFO - __main__ - Step 43740: {'lr': 0.0004081302098884249, 'samples': 8398080, 'steps': 43739, 'loss/train': 0.8534517288208008}}} 11/07/2021 03:21:38 - INFO - __main__ - Step 43747: {'lr': 0.00040810143601839377, 'samples': 8399424, 'steps': 43746, 'loss/train': 1.6005651950836182}} 11/07/2021 03:21:41 - INFO - __main__ - Step 43752: {'lr': 0.00040808088111690677, 'samples': 8400384, 'steps': 43751, 'loss/train': 1.6774927377700806}} 11/07/2021 03:21:43 - INFO - __main__ - Step 43757: {'lr': 0.00040806032443469967, 'samples': 8401344, 'steps': 43756, 'loss/train': 1.472224473953247}}} 11/07/2021 03:21:45 - INFO - __main__ - Step 43761: {'lr': 0.0004080438778069711, 'samples': 8402112, 'steps': 43760, 'loss/train': 1.4132717847824097}}} 11/07/2021 03:21:45 - INFO - __main__ - Step 43761: {'lr': 0.0004080438778069711, 'samples': 8402112, 'steps': 43760, 'loss/train': 1.4132717847824097}}} 11/07/2021 03:21:49 - INFO - __main__ - Step 43768: {'lr': 0.0004080150934668503, 'samples': 8403456, 'steps': 43767, 'loss/train': 1.1555874347686768}}} 11/07/2021 03:21:51 - INFO - __main__ - Step 43773: {'lr': 0.00040799453108789497, 'samples': 8404416, 'steps': 43772, 'loss/train': 1.7890753746032715}} 11/07/2021 03:21:51 - INFO - __main__ - Step 43773: {'lr': 0.00040799453108789497, 'samples': 8404416, 'steps': 43772, 'loss/train': 1.7890753746032715}} 11/07/2021 03:21:55 - INFO - __main__ - Step 43781: {'lr': 0.00040796162757978803, 'samples': 8405952, 'steps': 43780, 'loss/train': 2.036315679550171}}} 11/07/2021 03:21:56 - INFO - __main__ - Step 43785: {'lr': 0.0004079451741174737, 'samples': 8406720, 'steps': 43784, 'loss/train': 1.5919462442398071}}} 11/07/2021 03:21:59 - INFO - __main__ - Step 43789: {'lr': 0.00040792871951647657, 'samples': 8407488, 'steps': 43788, 'loss/train': 1.572245478630066}}} 11/07/2021 03:21:59 - INFO - __main__ - Step 43789: {'lr': 0.00040792871951647657, 'samples': 8407488, 'steps': 43788, 'loss/train': 1.572245478630066}}} 11/07/2021 03:22:02 - INFO - __main__ - Step 43795: {'lr': 0.0004079040354802109, 'samples': 8408640, 'steps': 43794, 'loss/train': 0.3329477906227112}}} 11/07/2021 03:22:05 - INFO - __main__ - Step 43800: {'lr': 0.00040788346349337156, 'samples': 8409600, 'steps': 43799, 'loss/train': 1.3185638189315796}} 11/07/2021 03:22:07 - INFO - __main__ - Step 43805: {'lr': 0.000407862889728036, 'samples': 8410560, 'steps': 43804, 'loss/train': 1.6155372858047485}6}} 11/07/2021 03:22:07 - INFO - __main__ - Step 43805: {'lr': 0.000407862889728036, 'samples': 8410560, 'steps': 43804, 'loss/train': 1.6155372858047485}6}} 11/07/2021 03:22:11 - INFO - __main__ - Step 43812: {'lr': 0.00040783408346913366, 'samples': 8411904, 'steps': 43811, 'loss/train': 1.1286613941192627}} 11/07/2021 03:22:12 - INFO - __main__ - Step 43816: {'lr': 0.0004078176211851328, 'samples': 8412672, 'steps': 43815, 'loss/train': 1.089661717414856}7}} 11/07/2021 03:22:15 - INFO - __main__ - Step 43821: {'lr': 0.0004077970417301665, 'samples': 8413632, 'steps': 43820, 'loss/train': 1.578310251235962}7}} 11/07/2021 03:22:15 - INFO - __main__ - Step 43821: {'lr': 0.0004077970417301665, 'samples': 8413632, 'steps': 43820, 'loss/train': 1.578310251235962}7}} 11/07/2021 03:22:19 - INFO - __main__ - Step 43829: {'lr': 0.00040776411090506944, 'samples': 8415168, 'steps': 43828, 'loss/train': 4.008372783660889}}} 11/07/2021 03:22:19 - INFO - __main__ - Step 43829: {'lr': 0.00040776411090506944, 'samples': 8415168, 'steps': 43828, 'loss/train': 4.008372783660889}}} 11/07/2021 03:22:22 - INFO - __main__ - Step 43836: {'lr': 0.00040773529270105816, 'samples': 8416512, 'steps': 43835, 'loss/train': 0.5030577182769775}} 11/07/2021 03:22:25 - INFO - __main__ - Step 43841: {'lr': 0.0004077147061373918, 'samples': 8417472, 'steps': 43840, 'loss/train': 1.5139338970184326}}} 11/07/2021 03:22:27 - INFO - __main__ - Step 43845: {'lr': 0.000407698235607299, 'samples': 8418240, 'steps': 43844, 'loss/train': 1.3933018445968628}}}} 11/07/2021 03:22:29 - INFO - __main__ - Step 43849: {'lr': 0.0004076817639403038, 'samples': 8419008, 'steps': 43848, 'loss/train': 1.659220576286316}}}} 11/07/2021 03:22:30 - INFO - __main__ - Step 43853: {'lr': 0.0004076652911365252, 'samples': 8419776, 'steps': 43852, 'loss/train': 1.7538639307022095}}} 11/07/2021 03:22:33 - INFO - __main__ - Step 43857: {'lr': 0.00040764881719608184, 'samples': 8420544, 'steps': 43856, 'loss/train': 1.5082013607025146}} 11/07/2021 03:22:35 - INFO - __main__ - Step 43862: {'lr': 0.0004076282231722737, 'samples': 8421504, 'steps': 43861, 'loss/train': 1.1427700519561768}}} 11/07/2021 03:22:37 - INFO - __main__ - Step 43866: {'lr': 0.00040761174667476883, 'samples': 8422272, 'steps': 43865, 'loss/train': 1.4639588594436646}} 11/07/2021 03:22:39 - INFO - __main__ - Step 43870: {'lr': 0.0004075952690409852, 'samples': 8423040, 'steps': 43869, 'loss/train': 1.6615227460861206}}} 11/07/2021 03:22:39 - INFO - __main__ - Step 43870: {'lr': 0.0004075952690409852, 'samples': 8423040, 'steps': 43869, 'loss/train': 1.6615227460861206}}} 11/07/2021 03:22:42 - INFO - __main__ - Step 43877: {'lr': 0.00040756643044805057, 'samples': 8424384, 'steps': 43876, 'loss/train': 2.9302780628204346}} 11/07/2021 03:22:45 - INFO - __main__ - Step 43882: {'lr': 0.00040754582932315007, 'samples': 8425344, 'steps': 43881, 'loss/train': 2.1872682571411133}} 11/07/2021 03:22:47 - INFO - __main__ - Step 43886: {'lr': 0.0004075293471454396, 'samples': 8426112, 'steps': 43885, 'loss/train': 1.765032172203064}3}} 11/07/2021 03:22:49 - INFO - __main__ - Step 43890: {'lr': 0.00040751286383204437, 'samples': 8426880, 'steps': 43889, 'loss/train': 1.5433449745178223}} 11/07/2021 03:22:50 - INFO - __main__ - Step 43894: {'lr': 0.00040749637938308336, 'samples': 8427648, 'steps': 43893, 'loss/train': 1.17872953414917}3}} 11/07/2021 03:22:52 - INFO - __main__ - Step 43898: {'lr': 0.0004074798937986753, 'samples': 8428416, 'steps': 43897, 'loss/train': 1.738026738166809}3}} 11/07/2021 03:22:52 - INFO - __main__ - Step 43898: {'lr': 0.0004074798937986753, 'samples': 8428416, 'steps': 43897, 'loss/train': 1.738026738166809}3}} 11/07/2021 03:22:57 - INFO - __main__ - Step 43906: {'lr': 0.0004074469192239936, 'samples': 8429952, 'steps': 43905, 'loss/train': 1.4113155603408813}}} 11/07/2021 03:22:58 - INFO - __main__ - Step 43910: {'lr': 0.0004074304302339576, 'samples': 8430720, 'steps': 43909, 'loss/train': 1.672196388244629}}}} 11/07/2021 03:23:00 - INFO - __main__ - Step 43914: {'lr': 0.00040741394010895013, 'samples': 8431488, 'steps': 43913, 'loss/train': 1.4635186195373535}} 11/07/2021 03:23:03 - INFO - __main__ - Step 43919: {'lr': 0.00040739332585681807, 'samples': 8432448, 'steps': 43918, 'loss/train': 1.896952509880066}}} 11/07/2021 03:23:05 - INFO - __main__ - Step 43923: {'lr': 0.0004073768331785592, 'samples': 8433216, 'steps': 43922, 'loss/train': 1.7586002349853516}}} 11/07/2021 03:23:05 - INFO - __main__ - Step 43923: {'lr': 0.0004073768331785592, 'samples': 8433216, 'steps': 43922, 'loss/train': 1.7586002349853516}}} 11/07/2021 03:23:08 - INFO - __main__ - Step 43930: {'lr': 0.00040734796826158226, 'samples': 8434560, 'steps': 43929, 'loss/train': 1.6103554964065552}} 11/07/2021 03:23:11 - INFO - __main__ - Step 43935: {'lr': 0.0004073273483367474, 'samples': 8435520, 'steps': 43934, 'loss/train': 0.11346945911645889}} 11/07/2021 03:23:11 - INFO - __main__ - Step 43935: {'lr': 0.0004073273483367474, 'samples': 8435520, 'steps': 43934, 'loss/train': 0.11346945911645889}} 11/07/2021 03:23:15 - INFO - __main__ - Step 43943: {'lr': 0.0004072943527708659, 'samples': 8437056, 'steps': 43942, 'loss/train': 1.10098397731781}89}} 11/07/2021 03:23:17 - INFO - __main__ - Step 43947: {'lr': 0.00040727785328687995, 'samples': 8437824, 'steps': 43946, 'loss/train': 1.4861782789230347}} 11/07/2021 03:23:19 - INFO - __main__ - Step 43951: {'lr': 0.0004072613526690223, 'samples': 8438592, 'steps': 43950, 'loss/train': 1.2344754934310913}}} 11/07/2021 03:23:19 - INFO - __main__ - Step 43951: {'lr': 0.0004072613526690223, 'samples': 8438592, 'steps': 43950, 'loss/train': 1.2344754934310913}}} 11/07/2021 03:23:23 - INFO - __main__ - Step 43959: {'lr': 0.00040722834803216834, 'samples': 8440128, 'steps': 43958, 'loss/train': 1.1566802263259888}} 11/07/2021 03:23:24 - INFO - __main__ - Step 43963: {'lr': 0.00040721184401340977, 'samples': 8440896, 'steps': 43962, 'loss/train': 1.6888006925582886}} 11/07/2021 03:23:27 - INFO - __main__ - Step 43967: {'lr': 0.0004071953388612555, 'samples': 8441664, 'steps': 43966, 'loss/train': 1.4906792640686035}}} 11/07/2021 03:23:29 - INFO - __main__ - Step 43972: {'lr': 0.00040717470582740634, 'samples': 8442624, 'steps': 43971, 'loss/train': 1.7003074884414673}} 11/07/2021 03:23:31 - INFO - __main__ - Step 43976: {'lr': 0.00040715819812554686, 'samples': 8443392, 'steps': 43975, 'loss/train': 1.956968069076538}}} 11/07/2021 03:23:31 - INFO - __main__ - Step 43976: {'lr': 0.00040715819812554686, 'samples': 8443392, 'steps': 43975, 'loss/train': 1.956968069076538}}} 11/07/2021 03:23:35 - INFO - __main__ - Step 43983: {'lr': 0.00040712930692106164, 'samples': 8444736, 'steps': 43982, 'loss/train': 1.5842952728271484}} 11/07/2021 03:23:37 - INFO - __main__ - Step 43988: {'lr': 0.0004071086682223909, 'samples': 8445696, 'steps': 43987, 'loss/train': 1.1857975721359253}}} 11/07/2021 03:23:39 - INFO - __main__ - Step 43993: {'lr': 0.00040708802775395165, 'samples': 8446656, 'steps': 43992, 'loss/train': 1.4375793933868408}} 11/07/2021 03:23:39 - INFO - __main__ - Step 43993: {'lr': 0.00040708802775395165, 'samples': 8446656, 'steps': 43992, 'loss/train': 1.4375793933868408}} 11/07/2021 03:23:43 - INFO - __main__ - Step 44000: {'lr': 0.0004070591281253682, 'samples': 8448000, 'steps': 43999, 'loss/train': 1.512953519821167}8}} 11/07/2021 03:23:45 - INFO - __main__ - Step 44004: {'lr': 0.0004070426124949458, 'samples': 8448768, 'steps': 44003, 'loss/train': 1.6318305730819702}}} 11/07/2021 03:23:47 - INFO - __main__ - Step 44009: {'lr': 0.0004070219663648098, 'samples': 8449728, 'steps': 44008, 'loss/train': 1.146945595741272}}}} 11/07/2021 03:23:49 - INFO - __main__ - Step 44013: {'lr': 0.0004070054481871597, 'samples': 8450496, 'steps': 44012, 'loss/train': 1.3653974533081055}}} 11/07/2021 03:23:49 - INFO - __main__ - Step 44013: {'lr': 0.0004070054481871597, 'samples': 8450496, 'steps': 44012, 'loss/train': 1.3653974533081055}}} 11/07/2021 03:23:53 - INFO - __main__ - Step 44020: {'lr': 0.00040697653865269057, 'samples': 8451840, 'steps': 44019, 'loss/train': 0.4142761826515198}} 11/07/2021 03:23:55 - INFO - __main__ - Step 44024: {'lr': 0.00040696001736258077, 'samples': 8452608, 'steps': 44023, 'loss/train': 1.4945148229599}98}} 11/07/2021 03:23:57 - INFO - __main__ - Step 44029: {'lr': 0.0004069393641586728, 'samples': 8453568, 'steps': 44028, 'loss/train': 1.9058088064193726}}} 11/07/2021 03:24:00 - INFO - __main__ - Step 44034: {'lr': 0.0004069187091869035, 'samples': 8454528, 'steps': 44033, 'loss/train': 1.5818285942077637}}} 11/07/2021 03:24:00 - INFO - __main__ - Step 44034: {'lr': 0.0004069187091869035, 'samples': 8454528, 'steps': 44033, 'loss/train': 1.5818285942077637}}} 11/07/2021 03:24:03 - INFO - __main__ - Step 44041: {'lr': 0.00040688978925686235, 'samples': 8455872, 'steps': 44040, 'loss/train': 1.8101271390914917}} 11/07/2021 03:24:05 - INFO - __main__ - Step 44045: {'lr': 0.0004068732620272856, 'samples': 8456640, 'steps': 44044, 'loss/train': 1.4922720193862915}}} 11/07/2021 03:24:07 - INFO - __main__ - Step 44050: {'lr': 0.00040685260139992343, 'samples': 8457600, 'steps': 44049, 'loss/train': 1.989881157875061}}} 11/07/2021 03:24:10 - INFO - __main__ - Step 44055: {'lr': 0.00040683193900567727, 'samples': 8458560, 'steps': 44054, 'loss/train': 1.1132365465164185}} 11/07/2021 03:24:12 - INFO - __main__ - Step 44059: {'lr': 0.0004068154078182802, 'samples': 8459328, 'steps': 44058, 'loss/train': 0.8346248269081116}}} 11/07/2021 03:24:12 - INFO - __main__ - Step 44059: {'lr': 0.0004068154078182802, 'samples': 8459328, 'steps': 44058, 'loss/train': 0.8346248269081116}}} 11/07/2021 03:24:15 - INFO - __main__ - Step 44066: {'lr': 0.00040678647552005087, 'samples': 8460672, 'steps': 44065, 'loss/train': 1.907792329788208}}} 11/07/2021 03:24:17 - INFO - __main__ - Step 44071: {'lr': 0.00040676580747334, 'samples': 8461632, 'steps': 44070, 'loss/train': 1.042952537536621}08}}} 11/07/2021 03:24:20 - INFO - __main__ - Step 44076: {'lr': 0.00040674513766072274, 'samples': 8462592, 'steps': 44075, 'loss/train': 0.9936408996582031}} 11/07/2021 03:24:22 - INFO - __main__ - Step 44080: {'lr': 0.00040672860053933286, 'samples': 8463360, 'steps': 44079, 'loss/train': 1.5099854469299316}} 11/07/2021 03:24:22 - INFO - __main__ - Step 44080: {'lr': 0.00040672860053933286, 'samples': 8463360, 'steps': 44079, 'loss/train': 1.5099854469299316}} 11/07/2021 03:24:25 - INFO - __main__ - Step 44087: {'lr': 0.00040669965785812193, 'samples': 8464704, 'steps': 44086, 'loss/train': 1.5070987939834595}} 11/07/2021 03:24:28 - INFO - __main__ - Step 44092: {'lr': 0.0004066789823961691, 'samples': 8465664, 'steps': 44091, 'loss/train': 1.5024408102035522}}} 11/07/2021 03:24:30 - INFO - __main__ - Step 44096: {'lr': 0.00040666244075584736, 'samples': 8466432, 'steps': 44095, 'loss/train': 1.9180774688720703}} 11/07/2021 03:24:30 - INFO - __main__ - Step 44096: {'lr': 0.00040666244075584736, 'samples': 8466432, 'steps': 44095, 'loss/train': 1.9180774688720703}} 11/07/2021 03:24:33 - INFO - __main__ - Step 44102: {'lr': 0.00040663762617771163, 'samples': 8467584, 'steps': 44101, 'loss/train': 1.5183041095733643}} 11/07/2021 03:24:36 - INFO - __main__ - Step 44108: {'lr': 0.00040661280905875, 'samples': 8468736, 'steps': 44107, 'loss/train': 1.2548373937606812}43}} 11/07/2021 03:24:38 - INFO - __main__ - Step 44112: {'lr': 0.0004065962629014044, 'samples': 8469504, 'steps': 44111, 'loss/train': 1.4359891414642334}}} 11/07/2021 03:24:38 - INFO - __main__ - Step 44112: {'lr': 0.0004065962629014044, 'samples': 8469504, 'steps': 44111, 'loss/train': 1.4359891414642334}}} 11/07/2021 03:24:41 - INFO - __main__ - Step 44119: {'lr': 0.00040656730440956677, 'samples': 8470848, 'steps': 44118, 'loss/train': 1.3201446533203125}} 11/07/2021 03:24:43 - INFO - __main__ - Step 44123: {'lr': 0.00040655075514787445, 'samples': 8471616, 'steps': 44122, 'loss/train': 1.3534348011016846}} 11/07/2021 03:24:45 - INFO - __main__ - Step 44127: {'lr': 0.00040653420475755245, 'samples': 8472384, 'steps': 44126, 'loss/train': 1.3555549383163452}} 11/07/2021 03:24:47 - INFO - __main__ - Step 44131: {'lr': 0.00040651765323872, 'samples': 8473152, 'steps': 44130, 'loss/train': 1.753224492073059}452}} 11/07/2021 03:24:50 - INFO - __main__ - Step 44135: {'lr': 0.00040650110059149664, 'samples': 8473920, 'steps': 44134, 'loss/train': 1.9925587177276611}} 11/07/2021 03:24:51 - INFO - __main__ - Step 44139: {'lr': 0.00040648454681600153, 'samples': 8474688, 'steps': 44138, 'loss/train': 1.4614596366882324}} 11/07/2021 03:24:53 - INFO - __main__ - Step 44144: {'lr': 0.00040646385301018243, 'samples': 8475648, 'steps': 44143, 'loss/train': 1.7908116579055786}} 11/07/2021 03:24:55 - INFO - __main__ - Step 44148: {'lr': 0.00040644729669651235, 'samples': 8476416, 'steps': 44147, 'loss/train': 1.154239535331726}}} 11/07/2021 03:24:58 - INFO - __main__ - Step 44152: {'lr': 0.0004064307392549585, 'samples': 8477184, 'steps': 44151, 'loss/train': 1.6127792596817017}}} 11/07/2021 03:25:00 - INFO - __main__ - Step 44156: {'lr': 0.00040641418068564024, 'samples': 8477952, 'steps': 44155, 'loss/train': 1.2231348752975464}} 11/07/2021 03:25:01 - INFO - __main__ - Step 44160: {'lr': 0.00040639762098867684, 'samples': 8478720, 'steps': 44159, 'loss/train': 1.1427425146102905}} 11/07/2021 03:25:03 - INFO - __main__ - Step 44164: {'lr': 0.00040638106016418785, 'samples': 8479488, 'steps': 44163, 'loss/train': 1.4833199977874756}} 11/07/2021 03:25:06 - INFO - __main__ - Step 44169: {'lr': 0.00040636035754817545, 'samples': 8480448, 'steps': 44168, 'loss/train': 1.5258365869522095}} 11/07/2021 03:25:08 - INFO - __main__ - Step 44173: {'lr': 0.0004063437941871903, 'samples': 8481216, 'steps': 44172, 'loss/train': 0.9237746000289917}}} 11/07/2021 03:25:08 - INFO - __main__ - Step 44173: {'lr': 0.0004063437941871903, 'samples': 8481216, 'steps': 44172, 'loss/train': 0.9237746000289917}}} 11/07/2021 03:25:11 - INFO - __main__ - Step 44180: {'lr': 0.000406314805593363, 'samples': 8482560, 'steps': 44179, 'loss/train': 1.4306188821792603}}}} 11/07/2021 03:25:13 - INFO - __main__ - Step 44185: {'lr': 0.0004062940973418865, 'samples': 8483520, 'steps': 44184, 'loss/train': 1.467079758644104}}}} 11/07/2021 03:25:16 - INFO - __main__ - Step 44190: {'lr': 0.0004062733873298172, 'samples': 8484480, 'steps': 44189, 'loss/train': 1.873066782951355}}}} 11/07/2021 03:25:16 - INFO - __main__ - Step 44190: {'lr': 0.0004062733873298172, 'samples': 8484480, 'steps': 44189, 'loss/train': 1.873066782951355}}}} 11/07/2021 03:25:20 - INFO - __main__ - Step 44197: {'lr': 0.0004062443903555687, 'samples': 8485824, 'steps': 44196, 'loss/train': 1.3376597166061401}}} 11/07/2021 03:25:21 - INFO - __main__ - Step 44201: {'lr': 0.00040622781910712826, 'samples': 8486592, 'steps': 44200, 'loss/train': 1.1684272289276123}} 11/07/2021 03:25:23 - INFO - __main__ - Step 44205: {'lr': 0.0004062112467323863, 'samples': 8487360, 'steps': 44204, 'loss/train': 1.7162210941314697}}} 11/07/2021 03:25:23 - INFO - __main__ - Step 44205: {'lr': 0.0004062112467323863, 'samples': 8487360, 'steps': 44204, 'loss/train': 1.7162210941314697}}} 11/07/2021 03:25:27 - INFO - __main__ - Step 44213: {'lr': 0.00040617809860447564, 'samples': 8488896, 'steps': 44212, 'loss/train': 0.8359420299530029}} 11/07/2021 03:25:29 - INFO - __main__ - Step 44217: {'lr': 0.00040616152285154607, 'samples': 8489664, 'steps': 44216, 'loss/train': 1.7768678665161133}} 11/07/2021 03:25:29 - INFO - __main__ - Step 44217: {'lr': 0.00040616152285154607, 'samples': 8489664, 'steps': 44216, 'loss/train': 1.7768678665161133}} 11/07/2021 03:25:34 - INFO - __main__ - Step 44225: {'lr': 0.00040612836796833556, 'samples': 8491200, 'steps': 44224, 'loss/train': 1.9082430601119995}} 11/07/2021 03:25:35 - INFO - __main__ - Step 44229: {'lr': 0.0004061117888382938, 'samples': 8491968, 'steps': 44228, 'loss/train': 1.5498803853988647}}} 11/07/2021 03:25:37 - INFO - __main__ - Step 44233: {'lr': 0.00040609520858278704, 'samples': 8492736, 'steps': 44232, 'loss/train': 1.6199346780776978}} 11/07/2021 03:25:40 - INFO - __main__ - Step 44238: {'lr': 0.00040607448168090044, 'samples': 8493696, 'steps': 44237, 'loss/train': 1.4443018436431885}} 11/07/2021 03:25:42 - INFO - __main__ - Step 44242: {'lr': 0.00040605789889353445, 'samples': 8494464, 'steps': 44241, 'loss/train': 1.4835010766983032}} 11/07/2021 03:25:43 - INFO - __main__ - Step 44246: {'lr': 0.00040604131498109193, 'samples': 8495232, 'steps': 44245, 'loss/train': 1.67646062374115}2}} 11/07/2021 03:25:45 - INFO - __main__ - Step 44250: {'lr': 0.0004060247299436925, 'samples': 8496000, 'steps': 44249, 'loss/train': 2.00783371925354}}2}} 11/07/2021 03:25:48 - INFO - __main__ - Step 44255: {'lr': 0.00040600399706515466, 'samples': 8496960, 'steps': 44254, 'loss/train': 1.435835599899292}}} 11/07/2021 03:25:48 - INFO - __main__ - Step 44255: {'lr': 0.00040600399706515466, 'samples': 8496960, 'steps': 44254, 'loss/train': 1.435835599899292}}} 11/07/2021 03:25:52 - INFO - __main__ - Step 44263: {'lr': 0.0004059708208043556, 'samples': 8498496, 'steps': 44262, 'loss/train': 2.3060762882232666}}} 11/07/2021 03:25:54 - INFO - __main__ - Step 44267: {'lr': 0.00040595423098722315, 'samples': 8499264, 'steps': 44266, 'loss/train': 1.3291518688201904}} 11/07/2021 03:25:55 - INFO - __main__ - Step 44271: {'lr': 0.00040593764004576166, 'samples': 8500032, 'steps': 44270, 'loss/train': 0.7102341651916504}} 11/07/2021 03:25:58 - INFO - __main__ - Step 44275: {'lr': 0.00040592104798009066, 'samples': 8500800, 'steps': 44274, 'loss/train': 1.1963216066360474}} 11/07/2021 03:25:58 - INFO - __main__ - Step 44275: {'lr': 0.00040592104798009066, 'samples': 8500800, 'steps': 44274, 'loss/train': 1.1963216066360474}} 11/07/2021 03:26:01 - INFO - __main__ - Step 44282: {'lr': 0.00040589200916039703, 'samples': 8502144, 'steps': 44281, 'loss/train': 1.7258718013763428}} 11/07/2021 03:26:04 - INFO - __main__ - Step 44287: {'lr': 0.00040587126503901664, 'samples': 8503104, 'steps': 44286, 'loss/train': 1.5036070346832275}} 11/07/2021 03:26:04 - INFO - __main__ - Step 44287: {'lr': 0.00040587126503901664, 'samples': 8503104, 'steps': 44286, 'loss/train': 1.5036070346832275}} 11/07/2021 03:26:08 - INFO - __main__ - Step 44295: {'lr': 0.0004058380707927798, 'samples': 8504640, 'steps': 44294, 'loss/train': 1.8551056385040283}}} 11/07/2021 03:26:08 - INFO - __main__ - Step 44295: {'lr': 0.0004058380707927798, 'samples': 8504640, 'steps': 44294, 'loss/train': 1.8551056385040283}}} 11/07/2021 03:26:11 - INFO - __main__ - Step 44302: {'lr': 0.0004058090221408326, 'samples': 8505984, 'steps': 44301, 'loss/train': 1.6845630407333374}}} 11/07/2021 03:26:14 - INFO - __main__ - Step 44307: {'lr': 0.0004057882709975359, 'samples': 8506944, 'steps': 44306, 'loss/train': 1.6465903520584106}}} 11/07/2021 03:26:14 - INFO - __main__ - Step 44307: {'lr': 0.0004057882709975359, 'samples': 8506944, 'steps': 44306, 'loss/train': 1.6465903520584106}}} 11/07/2021 03:26:14 - INFO - __main__ - Step 44307: {'lr': 0.0004057882709975359, 'samples': 8506944, 'steps': 44306, 'loss/train': 1.6465903520584106}}} 11/07/2021 03:26:20 - INFO - __main__ - Step 44318: {'lr': 0.00040574261230538267, 'samples': 8509056, 'steps': 44317, 'loss/train': 1.2443079948425293}} 11/07/2021 03:26:22 - INFO - __main__ - Step 44323: {'lr': 0.0004057218555472456, 'samples': 8510016, 'steps': 44322, 'loss/train': 1.5574032068252563}}} 11/07/2021 03:26:24 - INFO - __main__ - Step 44327: {'lr': 0.0004057052488777392, 'samples': 8510784, 'steps': 44326, 'loss/train': 1.6044999361038208}}} 11/07/2021 03:26:24 - INFO - __main__ - Step 44327: {'lr': 0.0004057052488777392, 'samples': 8510784, 'steps': 44326, 'loss/train': 1.6044999361038208}}} 11/07/2021 03:26:27 - INFO - __main__ - Step 44334: {'lr': 0.0004056761845050772, 'samples': 8512128, 'steps': 44333, 'loss/train': 0.6642376184463501}}} 11/07/2021 03:26:30 - INFO - __main__ - Step 44339: {'lr': 0.000405655422134494, 'samples': 8513088, 'steps': 44338, 'loss/train': 2.3924691677093506}}}} 11/07/2021 03:26:30 - INFO - __main__ - Step 44339: {'lr': 0.000405655422134494, 'samples': 8513088, 'steps': 44338, 'loss/train': 2.3924691677093506}}}} 11/07/2021 03:26:30 - INFO - __main__ - Step 44339: {'lr': 0.000405655422134494, 'samples': 8513088, 'steps': 44338, 'loss/train': 2.3924691677093506}}}} 11/07/2021 03:26:35 - INFO - __main__ - Step 44350: {'lr': 0.00040560973874757844, 'samples': 8515200, 'steps': 44349, 'loss/train': 1.7278624773025513}} 11/07/2021 03:26:38 - INFO - __main__ - Step 44355: {'lr': 0.0004055889707669441, 'samples': 8516160, 'steps': 44354, 'loss/train': 1.867988109588623}3}} 11/07/2021 03:26:40 - INFO - __main__ - Step 44360: {'lr': 0.0004055682010336601, 'samples': 8517120, 'steps': 44359, 'loss/train': 1.4113619327545166}}} 11/07/2021 03:26:42 - INFO - __main__ - Step 44364: {'lr': 0.00040555158398528237, 'samples': 8517888, 'steps': 44363, 'loss/train': 1.5277279615402222}} 11/07/2021 03:26:42 - INFO - __main__ - Step 44364: {'lr': 0.00040555158398528237, 'samples': 8517888, 'steps': 44363, 'loss/train': 1.5277279615402222}} 11/07/2021 03:26:46 - INFO - __main__ - Step 44370: {'lr': 0.0004055266563100788, 'samples': 8519040, 'steps': 44369, 'loss/train': 1.9142252206802368}}} 11/07/2021 03:26:46 - INFO - __main__ - Step 44370: {'lr': 0.0004055266563100788, 'samples': 8519040, 'steps': 44369, 'loss/train': 1.9142252206802368}}} 11/07/2021 03:26:50 - INFO - __main__ - Step 44378: {'lr': 0.00040549341548551415, 'samples': 8520576, 'steps': 44377, 'loss/train': 1.8282917737960815}} 11/07/2021 03:26:52 - INFO - __main__ - Step 44382: {'lr': 0.00040547679339166155, 'samples': 8521344, 'steps': 44381, 'loss/train': 1.7189534902572632}} 11/07/2021 03:26:54 - INFO - __main__ - Step 44387: {'lr': 0.00040545601419811236, 'samples': 8522304, 'steps': 44386, 'loss/train': 1.71592378616333}2}} 11/07/2021 03:26:54 - INFO - __main__ - Step 44387: {'lr': 0.00040545601419811236, 'samples': 8522304, 'steps': 44386, 'loss/train': 1.71592378616333}2}} 11/07/2021 03:26:58 - INFO - __main__ - Step 44395: {'lr': 0.0004054227638461348, 'samples': 8523840, 'steps': 44394, 'loss/train': 1.1267532110214233}}} 11/07/2021 03:26:58 - INFO - __main__ - Step 44395: {'lr': 0.0004054227638461348, 'samples': 8523840, 'steps': 44394, 'loss/train': 1.1267532110214233}}} 11/07/2021 03:27:02 - INFO - __main__ - Step 44402: {'lr': 0.000405393666111489, 'samples': 8525184, 'steps': 44401, 'loss/train': 1.2938517332077026}}}} 11/07/2021 03:27:04 - INFO - __main__ - Step 44407: {'lr': 0.00040537287991473627, 'samples': 8526144, 'steps': 44406, 'loss/train': 1.5492923259735107}} 11/07/2021 03:27:04 - INFO - __main__ - Step 44407: {'lr': 0.00040537287991473627, 'samples': 8526144, 'steps': 44406, 'loss/train': 1.5492923259735107}} 11/07/2021 03:27:08 - INFO - __main__ - Step 44415: {'lr': 0.000405339618359581, 'samples': 8527680, 'steps': 44414, 'loss/train': 0.21375149488449097}}} 11/07/2021 03:27:10 - INFO - __main__ - Step 44419: {'lr': 0.0004053229859020962, 'samples': 8528448, 'steps': 44418, 'loss/train': 1.4786136150360107}}} 11/07/2021 03:27:12 - INFO - __main__ - Step 44423: {'lr': 0.0004053063523248331, 'samples': 8529216, 'steps': 44422, 'loss/train': 1.6097091436386108}}} 11/07/2021 03:27:14 - INFO - __main__ - Step 44427: {'lr': 0.00040528971762791177, 'samples': 8529984, 'steps': 44426, 'loss/train': 1.7034776210784912}} 11/07/2021 03:27:17 - INFO - __main__ - Step 44431: {'lr': 0.000405273081811452, 'samples': 8530752, 'steps': 44430, 'loss/train': 1.1703969240188599}2}} 11/07/2021 03:27:18 - INFO - __main__ - Step 44435: {'lr': 0.00040525644487557366, 'samples': 8531520, 'steps': 44434, 'loss/train': 1.7728455066680908}} 11/07/2021 03:27:20 - INFO - __main__ - Step 44439: {'lr': 0.00040523980682039684, 'samples': 8532288, 'steps': 44438, 'loss/train': 1.6078511476516724}} 11/07/2021 03:27:22 - INFO - __main__ - Step 44444: {'lr': 0.000405219007677595, 'samples': 8533248, 'steps': 44443, 'loss/train': 1.638048529624939}24}} 11/07/2021 03:27:22 - INFO - __main__ - Step 44444: {'lr': 0.000405219007677595, 'samples': 8533248, 'steps': 44443, 'loss/train': 1.638048529624939}24}} 11/07/2021 03:27:27 - INFO - __main__ - Step 44450: {'lr': 0.0004051940463982569, 'samples': 8534400, 'steps': 44449, 'loss/train': 1.2241114377975464}}} 11/07/2021 03:27:29 - INFO - __main__ - Step 44454: {'lr': 0.0004051774041467789, 'samples': 8535168, 'steps': 44453, 'loss/train': 1.2485108375549316}}} 11/07/2021 03:27:30 - INFO - __main__ - Step 44458: {'lr': 0.00040516076077657233, 'samples': 8535936, 'steps': 44457, 'loss/train': 1.619061827659607}}} 11/07/2021 03:27:32 - INFO - __main__ - Step 44462: {'lr': 0.00040514411628775695, 'samples': 8536704, 'steps': 44461, 'loss/train': 1.083493709564209}}} 11/07/2021 03:27:35 - INFO - __main__ - Step 44467: {'lr': 0.00040512330910387706, 'samples': 8537664, 'steps': 44466, 'loss/train': 2.71468186378479}}}} 11/07/2021 03:27:35 - INFO - __main__ - Step 44467: {'lr': 0.00040512330910387706, 'samples': 8537664, 'steps': 44466, 'loss/train': 2.71468186378479}}}} 11/07/2021 03:27:38 - INFO - __main__ - Step 44474: {'lr': 0.00040509417611085864, 'samples': 8539008, 'steps': 44473, 'loss/train': 1.0161057710647583}} 11/07/2021 03:27:41 - INFO - __main__ - Step 44478: {'lr': 0.00040507752714880854, 'samples': 8539776, 'steps': 44477, 'loss/train': 1.3781660795211792}} 11/07/2021 03:27:43 - INFO - __main__ - Step 44482: {'lr': 0.00040506087706874966, 'samples': 8540544, 'steps': 44481, 'loss/train': 1.220317006111145}}} 11/07/2021 03:27:44 - INFO - __main__ - Step 44486: {'lr': 0.0004050442258708022, 'samples': 8541312, 'steps': 44485, 'loss/train': 0.9665480256080627}}} 11/07/2021 03:27:46 - INFO - __main__ - Step 44490: {'lr': 0.00040502757355508626, 'samples': 8542080, 'steps': 44489, 'loss/train': 0.7407692670822144}} 11/07/2021 03:27:49 - INFO - __main__ - Step 44495: {'lr': 0.0004050067565887621, 'samples': 8543040, 'steps': 44494, 'loss/train': 1.4558534622192383}}} 11/07/2021 03:27:51 - INFO - __main__ - Step 44499: {'lr': 0.0004049901017585058, 'samples': 8543808, 'steps': 44498, 'loss/train': 1.58024263381958}3}}} 11/07/2021 03:27:53 - INFO - __main__ - Step 44503: {'lr': 0.000404973445810871, 'samples': 8544576, 'steps': 44502, 'loss/train': 1.3407477140426636}}}} 11/07/2021 03:27:53 - INFO - __main__ - Step 44503: {'lr': 0.000404973445810871, 'samples': 8544576, 'steps': 44502, 'loss/train': 1.3407477140426636}}}} 11/07/2021 03:27:56 - INFO - __main__ - Step 44510: {'lr': 0.00040494429521417983, 'samples': 8545920, 'steps': 44509, 'loss/train': 1.0804678201675415}} 11/07/2021 03:27:59 - INFO - __main__ - Step 44516: {'lr': 0.00040491930626561525, 'samples': 8547072, 'steps': 44515, 'loss/train': 1.423165202140808}}} 11/07/2021 03:27:59 - INFO - __main__ - Step 44516: {'lr': 0.00040491930626561525, 'samples': 8547072, 'steps': 44515, 'loss/train': 1.423165202140808}}} 11/07/2021 03:28:03 - INFO - __main__ - Step 44523: {'lr': 0.0004048901493162251, 'samples': 8548416, 'steps': 44522, 'loss/train': 1.0734343528747559}}} 11/07/2021 03:28:04 - INFO - __main__ - Step 44527: {'lr': 0.0004048734866668421, 'samples': 8549184, 'steps': 44526, 'loss/train': 1.581064224243164}}}} 11/07/2021 03:28:06 - INFO - __main__ - Step 44531: {'lr': 0.00040485682290092144, 'samples': 8549952, 'steps': 44530, 'loss/train': 1.5065734386444092}} 11/07/2021 03:28:06 - INFO - __main__ - Step 44531: {'lr': 0.00040485682290092144, 'samples': 8549952, 'steps': 44530, 'loss/train': 1.5065734386444092}} 11/07/2021 03:28:11 - INFO - __main__ - Step 44539: {'lr': 0.00040482349201994785, 'samples': 8551488, 'steps': 44538, 'loss/train': 1.1694883108139038}} 11/07/2021 03:28:12 - INFO - __main__ - Step 44543: {'lr': 0.000404806824905135, 'samples': 8552256, 'steps': 44542, 'loss/train': 1.2956562042236328}8}} 11/07/2021 03:28:14 - INFO - __main__ - Step 44547: {'lr': 0.00040479015667426523, 'samples': 8553024, 'steps': 44546, 'loss/train': 1.5965445041656494}} 11/07/2021 03:28:17 - INFO - __main__ - Step 44552: {'lr': 0.0004047693198164058, 'samples': 8553984, 'steps': 44551, 'loss/train': 1.5908708572387695}}} 11/07/2021 03:28:17 - INFO - __main__ - Step 44552: {'lr': 0.0004047693198164058, 'samples': 8553984, 'steps': 44551, 'loss/train': 1.5908708572387695}}} 11/07/2021 03:28:20 - INFO - __main__ - Step 44559: {'lr': 0.00040474014528651514, 'samples': 8555328, 'steps': 44558, 'loss/train': 1.8229775428771973}} 11/07/2021 03:28:23 - INFO - __main__ - Step 44564: {'lr': 0.00040471930424485, 'samples': 8556288, 'steps': 44563, 'loss/train': 0.8524765968322754}73}} 11/07/2021 03:28:25 - INFO - __main__ - Step 44568: {'lr': 0.00040470263015665234, 'samples': 8557056, 'steps': 44567, 'loss/train': 1.0699820518493652}} 11/07/2021 03:28:27 - INFO - __main__ - Step 44572: {'lr': 0.0004046859549531487, 'samples': 8557824, 'steps': 44571, 'loss/train': 1.3198877573013306}}} 11/07/2021 03:28:27 - INFO - __main__ - Step 44572: {'lr': 0.0004046859549531487, 'samples': 8557824, 'steps': 44571, 'loss/train': 1.3198877573013306}}} 11/07/2021 03:28:30 - INFO - __main__ - Step 44579: {'lr': 0.00040465677066367424, 'samples': 8559168, 'steps': 44578, 'loss/train': 1.6030867099761963}} 11/07/2021 03:28:32 - INFO - __main__ - Step 44584: {'lr': 0.0004046359226520048, 'samples': 8560128, 'steps': 44583, 'loss/train': 1.42673921585083}63}} 11/07/2021 03:28:35 - INFO - __main__ - Step 44589: {'lr': 0.0004046150728984214, 'samples': 8561088, 'steps': 44588, 'loss/train': 1.2639451026916504}}} 11/07/2021 03:28:37 - INFO - __main__ - Step 44593: {'lr': 0.00040459839184153436, 'samples': 8561856, 'steps': 44592, 'loss/train': 1.6785650253295898}} 11/07/2021 03:28:39 - INFO - __main__ - Step 44597: {'lr': 0.0004045817096700929, 'samples': 8562624, 'steps': 44596, 'loss/train': 1.7540621757507324}}} 11/07/2021 03:28:40 - INFO - __main__ - Step 44601: {'lr': 0.0004045650263842174, 'samples': 8563392, 'steps': 44600, 'loss/train': 1.0853443145751953}}} 11/07/2021 03:28:42 - INFO - __main__ - Step 44605: {'lr': 0.0004045483419840281, 'samples': 8564160, 'steps': 44604, 'loss/train': 1.49483060836792}3}}} 11/07/2021 03:28:42 - INFO - __main__ - Step 44605: {'lr': 0.0004045483419840281, 'samples': 8564160, 'steps': 44604, 'loss/train': 1.49483060836792}3}}} 11/07/2021 03:28:42 - INFO - __main__ - Step 44605: {'lr': 0.0004045483419840281, 'samples': 8564160, 'steps': 44604, 'loss/train': 1.49483060836792}3}}} 11/07/2021 03:28:48 - INFO - __main__ - Step 44616: {'lr': 0.0004045024541388085, 'samples': 8566272, 'steps': 44615, 'loss/train': 1.4237236976623535}}} 11/07/2021 03:28:51 - INFO - __main__ - Step 44621: {'lr': 0.0004044815932425379, 'samples': 8567232, 'steps': 44620, 'loss/train': 2.050100088119507}}}} 11/07/2021 03:28:53 - INFO - __main__ - Step 44626: {'lr': 0.00040446073060609156, 'samples': 8568192, 'steps': 44625, 'loss/train': 1.1727397441864014}} 11/07/2021 03:28:55 - INFO - __main__ - Step 44630: {'lr': 0.00040444403924416614, 'samples': 8568960, 'steps': 44629, 'loss/train': 1.1327166557312012}} 11/07/2021 03:28:55 - INFO - __main__ - Step 44630: {'lr': 0.00040444403924416614, 'samples': 8568960, 'steps': 44629, 'loss/train': 1.1327166557312012}} 11/07/2021 03:28:58 - INFO - __main__ - Step 44637: {'lr': 0.0004044148266816501, 'samples': 8570304, 'steps': 44636, 'loss/train': 1.6011314392089844}}} 11/07/2021 03:29:00 - INFO - __main__ - Step 44641: {'lr': 0.00040439813225804977, 'samples': 8571072, 'steps': 44640, 'loss/train': 1.777889609336853}}} 11/07/2021 03:29:03 - INFO - __main__ - Step 44646: {'lr': 0.00040437726266325164, 'samples': 8572032, 'steps': 44645, 'loss/train': 1.547196388244629}}} 11/07/2021 03:29:05 - INFO - __main__ - Step 44651: {'lr': 0.00040435639132945314, 'samples': 8572992, 'steps': 44650, 'loss/train': 1.2456927299499512}} 11/07/2021 03:29:07 - INFO - __main__ - Step 44655: {'lr': 0.0004043396930104922, 'samples': 8573760, 'steps': 44654, 'loss/train': 1.440232515335083}2}} 11/07/2021 03:29:07 - INFO - __main__ - Step 44655: {'lr': 0.0004043396930104922, 'samples': 8573760, 'steps': 44654, 'loss/train': 1.440232515335083}2}} 11/07/2021 03:29:10 - INFO - __main__ - Step 44662: {'lr': 0.00040431046827497415, 'samples': 8575104, 'steps': 44661, 'loss/train': 1.582525610923767}}} 11/07/2021 03:29:13 - INFO - __main__ - Step 44667: {'lr': 0.00040428959137795475, 'samples': 8576064, 'steps': 44666, 'loss/train': 1.512468695640564}}} 11/07/2021 03:29:15 - INFO - __main__ - Step 44672: {'lr': 0.00040426871274292257, 'samples': 8577024, 'steps': 44671, 'loss/train': 1.5917364358901978}} 11/07/2021 03:29:15 - INFO - __main__ - Step 44672: {'lr': 0.00040426871274292257, 'samples': 8577024, 'steps': 44671, 'loss/train': 1.5917364358901978}} 11/07/2021 03:29:18 - INFO - __main__ - Step 44679: {'lr': 0.00040423947973446404, 'samples': 8578368, 'steps': 44678, 'loss/train': 1.7626153230667114}} 11/07/2021 03:29:21 - INFO - __main__ - Step 44683: {'lr': 0.00040422277362920614, 'samples': 8579136, 'steps': 44682, 'loss/train': 0.9555754661560059}} 11/07/2021 03:29:23 - INFO - __main__ - Step 44688: {'lr': 0.00040420188943411385, 'samples': 8580096, 'steps': 44687, 'loss/train': 1.834280014038086}}} 11/07/2021 03:29:25 - INFO - __main__ - Step 44692: {'lr': 0.00040418518082737087, 'samples': 8580864, 'steps': 44691, 'loss/train': 1.1153234243392944}} 11/07/2021 03:29:27 - INFO - __main__ - Step 44696: {'lr': 0.00040416847110905243, 'samples': 8581632, 'steps': 44695, 'loss/train': 1.1403671503067017}} 11/07/2021 03:29:29 - INFO - __main__ - Step 44700: {'lr': 0.00040415176027927915, 'samples': 8582400, 'steps': 44699, 'loss/train': 1.3662071228027344}} 11/07/2021 03:29:29 - INFO - __main__ - Step 44700: {'lr': 0.00040415176027927915, 'samples': 8582400, 'steps': 44699, 'loss/train': 1.3662071228027344}} 11/07/2021 03:29:33 - INFO - __main__ - Step 44706: {'lr': 0.00040412669195090466, 'samples': 8583552, 'steps': 44705, 'loss/train': 1.787896752357483}}} 11/07/2021 03:29:35 - INFO - __main__ - Step 44711: {'lr': 0.0004041057997674464, 'samples': 8584512, 'steps': 44710, 'loss/train': 1.3378225564956665}}} 11/07/2021 03:29:37 - INFO - __main__ - Step 44715: {'lr': 0.0004040890847707901, 'samples': 8585280, 'steps': 44714, 'loss/train': 1.6032570600509644}}} 11/07/2021 03:29:39 - INFO - __main__ - Step 44719: {'lr': 0.0004040723686632512, 'samples': 8586048, 'steps': 44718, 'loss/train': 1.422973394393921}}}} 11/07/2021 03:29:41 - INFO - __main__ - Step 44723: {'lr': 0.0004040556514449501, 'samples': 8586816, 'steps': 44722, 'loss/train': 1.0461225509643555}}} 11/07/2021 03:29:43 - INFO - __main__ - Step 44727: {'lr': 0.0004040389331160075, 'samples': 8587584, 'steps': 44726, 'loss/train': 1.6357241868972778}}} 11/07/2021 03:29:43 - INFO - __main__ - Step 44727: {'lr': 0.0004040389331160075, 'samples': 8587584, 'steps': 44726, 'loss/train': 1.6357241868972778}}} 11/07/2021 03:29:47 - INFO - __main__ - Step 44735: {'lr': 0.0004040054931266795, 'samples': 8589120, 'steps': 44734, 'loss/train': 1.4642417430877686}}} 11/07/2021 03:29:49 - INFO - __main__ - Step 44739: {'lr': 0.0004039887714665352, 'samples': 8589888, 'steps': 44738, 'loss/train': 1.4107799530029297}}} 11/07/2021 03:29:51 - INFO - __main__ - Step 44743: {'lr': 0.0004039720486962316, 'samples': 8590656, 'steps': 44742, 'loss/train': 1.3014174699783325}}} 11/07/2021 03:29:53 - INFO - __main__ - Step 44748: {'lr': 0.00040395114367237407, 'samples': 8591616, 'steps': 44747, 'loss/train': 1.1026297807693481}} 11/07/2021 03:29:55 - INFO - __main__ - Step 44752: {'lr': 0.0004039344184046525, 'samples': 8592384, 'steps': 44751, 'loss/train': 1.4764256477355957}}} 11/07/2021 03:29:57 - INFO - __main__ - Step 44756: {'lr': 0.00040391769202716333, 'samples': 8593152, 'steps': 44755, 'loss/train': 1.6105319261550903}} 11/07/2021 03:29:59 - INFO - __main__ - Step 44760: {'lr': 0.0004039009645400272, 'samples': 8593920, 'steps': 44759, 'loss/train': 1.4869424104690552}}} 11/07/2021 03:30:01 - INFO - __main__ - Step 44764: {'lr': 0.0004038842359433647, 'samples': 8594688, 'steps': 44763, 'loss/train': 1.3257486820220947}}} 11/07/2021 03:30:03 - INFO - __main__ - Step 44769: {'lr': 0.00040386332363744884, 'samples': 8595648, 'steps': 44768, 'loss/train': 1.7680883407592773}} 11/07/2021 03:30:05 - INFO - __main__ - Step 44773: {'lr': 0.0004038465925447929, 'samples': 8596416, 'steps': 44772, 'loss/train': 1.5884572267532349}}} 11/07/2021 03:30:05 - INFO - __main__ - Step 44773: {'lr': 0.0004038465925447929, 'samples': 8596416, 'steps': 44772, 'loss/train': 1.5884572267532349}}} 11/07/2021 03:30:09 - INFO - __main__ - Step 44780: {'lr': 0.00040381731046386295, 'samples': 8597760, 'steps': 44779, 'loss/train': 1.361086130142212}}} 11/07/2021 03:30:11 - INFO - __main__ - Step 44785: {'lr': 0.0004037963926125011, 'samples': 8598720, 'steps': 44784, 'loss/train': 1.8758233785629272}}} 11/07/2021 03:30:13 - INFO - __main__ - Step 44789: {'lr': 0.00040377965708403133, 'samples': 8599488, 'steps': 44788, 'loss/train': 1.6939905881881714}} 11/07/2021 03:30:16 - INFO - __main__ - Step 44794: {'lr': 0.0004037587361144166, 'samples': 8600448, 'steps': 44793, 'loss/train': 1.1653132438659668}}} 11/07/2021 03:30:18 - INFO - __main__ - Step 44798: {'lr': 0.0004037419980916499, 'samples': 8601216, 'steps': 44797, 'loss/train': 1.3725799322128296}}} 11/07/2021 03:30:18 - INFO - __main__ - Step 44798: {'lr': 0.0004037419980916499, 'samples': 8601216, 'steps': 44797, 'loss/train': 1.3725799322128296}}} 11/07/2021 03:30:21 - INFO - __main__ - Step 44805: {'lr': 0.0004037127038848404, 'samples': 8602560, 'steps': 44804, 'loss/train': 1.5042856931686401}}} 11/07/2021 03:30:23 - INFO - __main__ - Step 44809: {'lr': 0.00040369596281431816, 'samples': 8603328, 'steps': 44808, 'loss/train': 1.1962133646011353}} 11/07/2021 03:30:25 - INFO - __main__ - Step 44813: {'lr': 0.00040367922063574735, 'samples': 8604096, 'steps': 44812, 'loss/train': 1.7019158601760864}} 11/07/2021 03:30:25 - INFO - __main__ - Step 44813: {'lr': 0.00040367922063574735, 'samples': 8604096, 'steps': 44812, 'loss/train': 1.7019158601760864}} 11/07/2021 03:30:28 - INFO - __main__ - Step 44820: {'lr': 0.0004036499191573699, 'samples': 8605440, 'steps': 44819, 'loss/train': 2.1798417568206787}}} 11/07/2021 03:30:31 - INFO - __main__ - Step 44825: {'lr': 0.00040362898745295117, 'samples': 8606400, 'steps': 44824, 'loss/train': 1.492126703262329}}} 11/07/2021 03:30:33 - INFO - __main__ - Step 44830: {'lr': 0.00040360805401796124, 'samples': 8607360, 'steps': 44829, 'loss/train': 1.6936755180358887}} 11/07/2021 03:30:35 - INFO - __main__ - Step 44834: {'lr': 0.00040359130602411644, 'samples': 8608128, 'steps': 44833, 'loss/train': 1.508933186531067}}} 11/07/2021 03:30:37 - INFO - __main__ - Step 44838: {'lr': 0.00040357455692297765, 'samples': 8608896, 'steps': 44837, 'loss/train': 1.5283297300338745}} 11/07/2021 03:30:39 - INFO - __main__ - Step 44842: {'lr': 0.0004035578067146657, 'samples': 8609664, 'steps': 44841, 'loss/train': 0.952649712562561}5}} 11/07/2021 03:30:41 - INFO - __main__ - Step 44846: {'lr': 0.0004035410553993012, 'samples': 8610432, 'steps': 44845, 'loss/train': 1.8207674026489258}}} 11/07/2021 03:30:43 - INFO - __main__ - Step 44851: {'lr': 0.00040352011469848713, 'samples': 8611392, 'steps': 44850, 'loss/train': 0.7737088203430176}} 11/07/2021 03:30:45 - INFO - __main__ - Step 44855: {'lr': 0.0004035033608926963, 'samples': 8612160, 'steps': 44854, 'loss/train': 1.4348763227462769}}} 11/07/2021 03:30:45 - INFO - __main__ - Step 44855: {'lr': 0.0004035033608926963, 'samples': 8612160, 'steps': 44854, 'loss/train': 1.4348763227462769}}} 11/07/2021 03:30:49 - INFO - __main__ - Step 44862: {'lr': 0.00040347403906973445, 'samples': 8613504, 'steps': 44861, 'loss/train': 1.2716156244277954}} 11/07/2021 03:30:51 - INFO - __main__ - Step 44867: {'lr': 0.00040345309283584726, 'samples': 8614464, 'steps': 44866, 'loss/train': 1.4803099632263184}} 11/07/2021 03:30:53 - INFO - __main__ - Step 44872: {'lr': 0.0004034321448733701, 'samples': 8615424, 'steps': 44871, 'loss/train': 1.5915842056274414}}} 11/07/2021 03:30:55 - INFO - __main__ - Step 44876: {'lr': 0.00040341538525896233, 'samples': 8616192, 'steps': 44875, 'loss/train': 1.4717856645584106}} 11/07/2021 03:30:58 - INFO - __main__ - Step 44880: {'lr': 0.0004033986245385288, 'samples': 8616960, 'steps': 44879, 'loss/train': 1.655372977256775}6}} 11/07/2021 03:30:58 - INFO - __main__ - Step 44880: {'lr': 0.0004033986245385288, 'samples': 8616960, 'steps': 44879, 'loss/train': 1.655372977256775}6}} 11/07/2021 03:31:01 - INFO - __main__ - Step 44887: {'lr': 0.00040336929061675933, 'samples': 8618304, 'steps': 44886, 'loss/train': 1.292196273803711}}} 11/07/2021 03:31:03 - INFO - __main__ - Step 44892: {'lr': 0.0004033483357422825, 'samples': 8619264, 'steps': 44891, 'loss/train': 1.8648028373718262}}} 11/07/2021 03:31:06 - INFO - __main__ - Step 44897: {'lr': 0.0004033273791403959, 'samples': 8620224, 'steps': 44896, 'loss/train': 1.4491678476333618}}} 11/07/2021 03:31:06 - INFO - __main__ - Step 44897: {'lr': 0.0004033273791403959, 'samples': 8620224, 'steps': 44896, 'loss/train': 1.4491678476333618}}} 11/07/2021 03:31:09 - INFO - __main__ - Step 44904: {'lr': 0.0004032980369961555, 'samples': 8621568, 'steps': 44903, 'loss/train': 1.1294523477554321}}} 11/07/2021 03:31:11 - INFO - __main__ - Step 44908: {'lr': 0.00040328126853692606, 'samples': 8622336, 'steps': 44907, 'loss/train': 1.154900312423706}}} 11/07/2021 03:31:13 - INFO - __main__ - Step 44913: {'lr': 0.0004032603064089144, 'samples': 8623296, 'steps': 44912, 'loss/train': 0.12013912200927734}} 11/07/2021 03:31:15 - INFO - __main__ - Step 44917: {'lr': 0.0004032435354634726, 'samples': 8624064, 'steps': 44916, 'loss/train': 1.2690373659133911}}} 11/07/2021 03:31:17 - INFO - __main__ - Step 44921: {'lr': 0.00040322676341324415, 'samples': 8624832, 'steps': 44920, 'loss/train': 1.3669313192367554}} 11/07/2021 03:31:19 - INFO - __main__ - Step 44925: {'lr': 0.00040320999025834973, 'samples': 8625600, 'steps': 44924, 'loss/train': 1.509555459022522}}} 11/07/2021 03:31:21 - INFO - __main__ - Step 44929: {'lr': 0.0004031932159989105, 'samples': 8626368, 'steps': 44928, 'loss/train': 1.417038917541504}}}} 11/07/2021 03:31:23 - INFO - __main__ - Step 44934: {'lr': 0.0004031722466215293, 'samples': 8627328, 'steps': 44933, 'loss/train': 1.464234709739685}}}} 11/07/2021 03:31:26 - INFO - __main__ - Step 44938: {'lr': 0.0004031554698773061, 'samples': 8628096, 'steps': 44937, 'loss/train': 1.67475426197052}}}}} 11/07/2021 03:31:26 - INFO - __main__ - Step 44938: {'lr': 0.0004031554698773061, 'samples': 8628096, 'steps': 44937, 'loss/train': 1.67475426197052}}}}} 11/07/2021 03:31:29 - INFO - __main__ - Step 44945: {'lr': 0.00040312610791812286, 'samples': 8629440, 'steps': 44944, 'loss/train': 1.2826176881790161}} 11/07/2021 03:31:31 - INFO - __main__ - Step 44949: {'lr': 0.00040310932813777316, 'samples': 8630208, 'steps': 44948, 'loss/train': 1.1987284421920776}} 11/07/2021 03:31:34 - INFO - __main__ - Step 44954: {'lr': 0.0004030883518601044, 'samples': 8631168, 'steps': 44953, 'loss/train': 1.1118228435516357}}} 11/07/2021 03:31:36 - INFO - __main__ - Step 44959: {'lr': 0.00040306737385795437, 'samples': 8632128, 'steps': 44958, 'loss/train': 1.431859016418457}}} 11/07/2021 03:31:38 - INFO - __main__ - Step 44963: {'lr': 0.0004030505902147668, 'samples': 8632896, 'steps': 44962, 'loss/train': 1.8923532962799072}}} 11/07/2021 03:31:38 - INFO - __main__ - Step 44963: {'lr': 0.0004030505902147668, 'samples': 8632896, 'steps': 44962, 'loss/train': 1.8923532962799072}}} 11/07/2021 03:31:41 - INFO - __main__ - Step 44970: {'lr': 0.00040302121618421505, 'samples': 8634240, 'steps': 44969, 'loss/train': 1.7225620746612549}} 11/07/2021 03:31:43 - INFO - __main__ - Step 44974: {'lr': 0.0004030044295069803, 'samples': 8635008, 'steps': 44973, 'loss/train': 1.1578707695007324}}} 11/07/2021 03:31:46 - INFO - __main__ - Step 44979: {'lr': 0.00040298344460926866, 'samples': 8635968, 'steps': 44978, 'loss/train': 1.4797956943511963}} 11/07/2021 03:31:48 - INFO - __main__ - Step 44984: {'lr': 0.0004029624579882576, 'samples': 8636928, 'steps': 44983, 'loss/train': 2.202698230743408}3}} 11/07/2021 03:31:48 - INFO - __main__ - Step 44984: {'lr': 0.0004029624579882576, 'samples': 8636928, 'steps': 44983, 'loss/train': 2.202698230743408}3}} 11/07/2021 03:31:51 - INFO - __main__ - Step 44991: {'lr': 0.000402933073824149, 'samples': 8638272, 'steps': 44990, 'loss/train': 0.9720051288604736}3}} 11/07/2021 03:31:53 - INFO - __main__ - Step 44995: {'lr': 0.00040291628135718404, 'samples': 8639040, 'steps': 44994, 'loss/train': 1.681424617767334}}} 11/07/2021 03:31:56 - INFO - __main__ - Step 45000: {'lr': 0.00040289528922320334, 'samples': 8640000, 'steps': 44999, 'loss/train': 0.798037052154541}}} 11/07/2021 03:31:56 - INFO - __main__ - Step 45000: {'lr': 0.00040289528922320334, 'samples': 8640000, 'steps': 44999, 'loss/train': 0.798037052154541}}} 11/07/2021 03:31:56 - INFO - __main__ - Step 45000: {'lr': 0.00040289528922320334, 'samples': 8640000, 'steps': 44999, 'loss/train': 0.798037052154541}}} 11/07/2021 03:31:56 - INFO - __main__ - Step 45000: {'lr': 0.00040289528922320334, 'samples': 8640000, 'steps': 44999, 'loss/train': 0.798037052154541}}} 11/07/2021 03:31:56 - INFO - __main__ - Step 45000: {'lr': 0.00040289528922320334, 'samples': 8640000, 'steps': 44999, 'loss/train': 0.798037052154541}}} huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks... To disable this warning, you can either: - Avoid using `tokenizers` before the fork if possible - Explicitly set the environment variable TOKENIZERS_PARALLELISM=(true | false) huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks... To disable this warning, you can either: - Avoid using `tokenizers` before the fork if possible - Explicitly set the environment variable TOKENIZERS_PARALLELISM=(true | false) huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks... To disable this warning, you can either: - Avoid using `tokenizers` before the fork if possible - Explicitly set the environment variable TOKENIZERS_PARALLELISM=(true | false) huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks... To disable this warning, you can either: - Avoid using `tokenizers` before the fork if possible - Explicitly set the environment variable TOKENIZERS_PARALLELISM=(true | false) huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks... To disable this warning, you can either: - Avoid using `tokenizers` before the fork if possible - Explicitly set the environment variable TOKENIZERS_PARALLELISM=(true | false) 11/07/2021 03:31:56 - INFO - __main__ - Step 45000: {'lr': 0.00040289528922320334, 'samples': 8640000, 'steps': 44999, 'loss/train': 0.798037052154541}}} huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks... To disable this warning, you can either: - Avoid using `tokenizers` before the fork if possible - Explicitly set the environment variable TOKENIZERS_PARALLELISM=(true | false) huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks... To disable this warning, you can either: - Avoid using `tokenizers` before the fork if possible - Explicitly set the environment variable TOKENIZERS_PARALLELISM=(true | false) huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks... To disable this warning, you can either: - Avoid using `tokenizers` before the fork if possible 11/07/2021 03:35:24 - WARNING - huggingface_hub.repository - The progress bars may be unreliable.0000, 'steps': 44999, 'loss/train': 0.798037052154541}}} 11/07/2021 03:35:24 - WARNING - huggingface_hub.repository - The progress bars may be unreliable.0000, 'steps': 44999, 'loss/train': 0.798037052154541}}} huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks... To disable this warning, you can either: - Avoid using `tokenizers` before the fork if possible - Explicitly set the environment variable TOKENIZERS_PARALLELISM=(true | false) huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks... To disable this warning, you can either: - Avoid using `tokenizers` before the fork if possible - Explicitly set the environment variable TOKENIZERS_PARALLELISM=(true | false) huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks... To disable this warning, you can either: - Avoid using `tokenizers` before the fork if possible - Explicitly set the environment variable TOKENIZERS_PARALLELISM=(true | false) huggingface/tokenizers: The current process just got forked, after parallelism has already been used. Disabling parallelism to avoid deadlocks... To disable this warning, you can either: - Avoid using `tokenizers` before the fork if possible Upload file wandb/run-20211106_211610-dtkf2u0m/logs/debug-internal.log: 0%|▏ | 32.0k/19.3M [00:00