Text Generation
Transformers
Safetensors
qwen3
llama-factory
full
Generated from Trainer
conversational
text-generation-inference
Instructions to use DCAgent2/bugs-nl2bashseq with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use DCAgent2/bugs-nl2bashseq with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="DCAgent2/bugs-nl2bashseq") messages = [ {"role": "user", "content": "Who are you?"}, ] pipe(messages)# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("DCAgent2/bugs-nl2bashseq") model = AutoModelForCausalLM.from_pretrained("DCAgent2/bugs-nl2bashseq") messages = [ {"role": "user", "content": "Who are you?"}, ] inputs = tokenizer.apply_chat_template( messages, add_generation_prompt=True, tokenize=True, return_dict=True, return_tensors="pt", ).to(model.device) outputs = model.generate(**inputs, max_new_tokens=40) print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:])) - Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use DCAgent2/bugs-nl2bashseq with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "DCAgent2/bugs-nl2bashseq" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "DCAgent2/bugs-nl2bashseq", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker
docker model run hf.co/DCAgent2/bugs-nl2bashseq
- SGLang
How to use DCAgent2/bugs-nl2bashseq with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "DCAgent2/bugs-nl2bashseq" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "DCAgent2/bugs-nl2bashseq", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "DCAgent2/bugs-nl2bashseq" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "DCAgent2/bugs-nl2bashseq", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }' - Docker Model Runner
How to use DCAgent2/bugs-nl2bashseq with Docker Model Runner:
docker model run hf.co/DCAgent2/bugs-nl2bashseq
Training in progress, step 6600
Browse files
model-00001-of-00004.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 4902257696
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:f42fc8d29b38db32e6fcd75ee4bc5f70b45fad7d4435601bee629963aa01b216
|
| 3 |
size 4902257696
|
model-00002-of-00004.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 4915960368
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:4fef1a64a4cb327a7663da3edfd9b2636af632559c1c186d3d35a44da8adfff5
|
| 3 |
size 4915960368
|
model-00003-of-00004.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 4983068496
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:9b9edac273e4086b6bd939a9fd2b1ea7163bbb06a4182fdf4039879c00cb4c42
|
| 3 |
size 4983068496
|
model-00004-of-00004.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 1580230264
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a96f4d62829439602873f6022b05aed05bc37b9297fc426d1b32141718e35c60
|
| 3 |
size 1580230264
|
trainer_log.jsonl
CHANGED
|
@@ -963,3 +963,369 @@
|
|
| 963 |
{"current_steps": 4815, "total_steps": 6657, "loss": 0.0808, "lr": 8.635385598892881e-06, "epoch": 5.063091482649842, "percentage": 72.33, "elapsed_time": "4:54:53", "remaining_time": "1:52:48"}
|
| 964 |
{"current_steps": 4820, "total_steps": 6657, "loss": 0.0826, "lr": 8.592274651593482e-06, "epoch": 5.068349106203995, "percentage": 72.4, "elapsed_time": "4:55:16", "remaining_time": "1:52:32"}
|
| 965 |
{"current_steps": 4825, "total_steps": 6657, "loss": 0.1055, "lr": 8.549242126656814e-06, "epoch": 5.07360672975815, "percentage": 72.48, "elapsed_time": "4:55:36", "remaining_time": "1:52:14"}
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 963 |
{"current_steps": 4815, "total_steps": 6657, "loss": 0.0808, "lr": 8.635385598892881e-06, "epoch": 5.063091482649842, "percentage": 72.33, "elapsed_time": "4:54:53", "remaining_time": "1:52:48"}
|
| 964 |
{"current_steps": 4820, "total_steps": 6657, "loss": 0.0826, "lr": 8.592274651593482e-06, "epoch": 5.068349106203995, "percentage": 72.4, "elapsed_time": "4:55:16", "remaining_time": "1:52:32"}
|
| 965 |
{"current_steps": 4825, "total_steps": 6657, "loss": 0.1055, "lr": 8.549242126656814e-06, "epoch": 5.07360672975815, "percentage": 72.48, "elapsed_time": "4:55:36", "remaining_time": "1:52:14"}
|
| 966 |
+
{"current_steps": 4830, "total_steps": 6657, "loss": 0.0761, "lr": 8.506288319909793e-06, "epoch": 5.078864353312303, "percentage": 72.56, "elapsed_time": "4:55:57", "remaining_time": "1:51:57"}
|
| 967 |
+
{"current_steps": 4835, "total_steps": 6657, "loss": 0.0743, "lr": 8.463413526638186e-06, "epoch": 5.084121976866457, "percentage": 72.63, "elapsed_time": "4:56:14", "remaining_time": "1:51:37"}
|
| 968 |
+
{"current_steps": 4840, "total_steps": 6657, "loss": 0.0695, "lr": 8.420618041584604e-06, "epoch": 5.08937960042061, "percentage": 72.71, "elapsed_time": "4:56:31", "remaining_time": "1:51:19"}
|
| 969 |
+
{"current_steps": 4845, "total_steps": 6657, "loss": 0.088, "lr": 8.377902158946427e-06, "epoch": 5.094637223974764, "percentage": 72.78, "elapsed_time": "4:56:47", "remaining_time": "1:50:59"}
|
| 970 |
+
{"current_steps": 4850, "total_steps": 6657, "loss": 0.0868, "lr": 8.335266172373832e-06, "epoch": 5.099894847528917, "percentage": 72.86, "elapsed_time": "4:57:10", "remaining_time": "1:50:43"}
|
| 971 |
+
{"current_steps": 4855, "total_steps": 6657, "loss": 0.0883, "lr": 8.292710374967737e-06, "epoch": 5.105152471083071, "percentage": 72.93, "elapsed_time": "4:57:30", "remaining_time": "1:50:25"}
|
| 972 |
+
{"current_steps": 4860, "total_steps": 6657, "loss": 0.087, "lr": 8.250235059277792e-06, "epoch": 5.110410094637224, "percentage": 73.01, "elapsed_time": "4:57:51", "remaining_time": "1:50:08"}
|
| 973 |
+
{"current_steps": 4865, "total_steps": 6657, "loss": 0.1173, "lr": 8.207840517300398e-06, "epoch": 5.115667718191378, "percentage": 73.08, "elapsed_time": "4:58:14", "remaining_time": "1:49:51"}
|
| 974 |
+
{"current_steps": 4870, "total_steps": 6657, "loss": 0.0727, "lr": 8.165527040476666e-06, "epoch": 5.120925341745531, "percentage": 73.16, "elapsed_time": "4:58:29", "remaining_time": "1:49:31"}
|
| 975 |
+
{"current_steps": 4875, "total_steps": 6657, "loss": 0.0997, "lr": 8.123294919690413e-06, "epoch": 5.126182965299685, "percentage": 73.23, "elapsed_time": "4:58:46", "remaining_time": "1:49:12"}
|
| 976 |
+
{"current_steps": 4880, "total_steps": 6657, "loss": 0.1016, "lr": 8.081144445266201e-06, "epoch": 5.131440588853838, "percentage": 73.31, "elapsed_time": "4:59:12", "remaining_time": "1:48:57"}
|
| 977 |
+
{"current_steps": 4885, "total_steps": 6657, "loss": 0.0796, "lr": 8.039075906967293e-06, "epoch": 5.136698212407992, "percentage": 73.38, "elapsed_time": "4:59:33", "remaining_time": "1:48:39"}
|
| 978 |
+
{"current_steps": 4890, "total_steps": 6657, "loss": 0.1907, "lr": 7.99708959399368e-06, "epoch": 5.141955835962145, "percentage": 73.46, "elapsed_time": "5:00:00", "remaining_time": "1:48:24"}
|
| 979 |
+
{"current_steps": 4895, "total_steps": 6657, "loss": 0.1752, "lr": 7.955185794980117e-06, "epoch": 5.147213459516299, "percentage": 73.53, "elapsed_time": "5:00:24", "remaining_time": "1:48:07"}
|
| 980 |
+
{"current_steps": 4900, "total_steps": 6657, "loss": 0.0909, "lr": 7.913364797994111e-06, "epoch": 5.152471083070452, "percentage": 73.61, "elapsed_time": "5:00:45", "remaining_time": "1:47:50"}
|
| 981 |
+
{"current_steps": 4905, "total_steps": 6657, "loss": 0.1563, "lr": 7.871626890533917e-06, "epoch": 5.157728706624606, "percentage": 73.68, "elapsed_time": "5:01:14", "remaining_time": "1:47:35"}
|
| 982 |
+
{"current_steps": 4910, "total_steps": 6657, "loss": 0.099, "lr": 7.829972359526626e-06, "epoch": 5.162986330178759, "percentage": 73.76, "elapsed_time": "5:01:38", "remaining_time": "1:47:19"}
|
| 983 |
+
{"current_steps": 4915, "total_steps": 6657, "loss": 0.0814, "lr": 7.788401491326155e-06, "epoch": 5.168243953732913, "percentage": 73.83, "elapsed_time": "5:01:59", "remaining_time": "1:47:01"}
|
| 984 |
+
{"current_steps": 4920, "total_steps": 6657, "loss": 0.0876, "lr": 7.746914571711264e-06, "epoch": 5.173501577287066, "percentage": 73.91, "elapsed_time": "5:02:17", "remaining_time": "1:46:43"}
|
| 985 |
+
{"current_steps": 4925, "total_steps": 6657, "loss": 0.0766, "lr": 7.705511885883612e-06, "epoch": 5.1787592008412195, "percentage": 73.98, "elapsed_time": "5:02:36", "remaining_time": "1:46:25"}
|
| 986 |
+
{"current_steps": 4930, "total_steps": 6657, "loss": 0.0775, "lr": 7.664193718465814e-06, "epoch": 5.184016824395373, "percentage": 74.06, "elapsed_time": "5:02:59", "remaining_time": "1:46:08"}
|
| 987 |
+
{"current_steps": 4935, "total_steps": 6657, "loss": 0.0719, "lr": 7.622960353499438e-06, "epoch": 5.1892744479495265, "percentage": 74.13, "elapsed_time": "5:03:15", "remaining_time": "1:45:48"}
|
| 988 |
+
{"current_steps": 4940, "total_steps": 6657, "loss": 0.0756, "lr": 7.581812074443084e-06, "epoch": 5.19453207150368, "percentage": 74.21, "elapsed_time": "5:03:30", "remaining_time": "1:45:29"}
|
| 989 |
+
{"current_steps": 4945, "total_steps": 6657, "loss": 0.0663, "lr": 7.5407491641704464e-06, "epoch": 5.1997896950578335, "percentage": 74.28, "elapsed_time": "5:03:46", "remaining_time": "1:45:10"}
|
| 990 |
+
{"current_steps": 4950, "total_steps": 6657, "loss": 0.1158, "lr": 7.499771904968332e-06, "epoch": 5.205047318611987, "percentage": 74.36, "elapsed_time": "5:04:12", "remaining_time": "1:44:54"}
|
| 991 |
+
{"current_steps": 4955, "total_steps": 6657, "loss": 0.0591, "lr": 7.45888057853474e-06, "epoch": 5.2103049421661405, "percentage": 74.43, "elapsed_time": "5:04:27", "remaining_time": "1:44:34"}
|
| 992 |
+
{"current_steps": 4960, "total_steps": 6657, "loss": 0.0659, "lr": 7.418075465976944e-06, "epoch": 5.215562565720294, "percentage": 74.51, "elapsed_time": "5:04:45", "remaining_time": "1:44:16"}
|
| 993 |
+
{"current_steps": 4965, "total_steps": 6657, "loss": 0.0698, "lr": 7.3773568478095184e-06, "epoch": 5.220820189274448, "percentage": 74.58, "elapsed_time": "5:05:01", "remaining_time": "1:43:56"}
|
| 994 |
+
{"current_steps": 4970, "total_steps": 6657, "loss": 0.0855, "lr": 7.336725003952456e-06, "epoch": 5.226077812828602, "percentage": 74.66, "elapsed_time": "5:05:29", "remaining_time": "1:43:41"}
|
| 995 |
+
{"current_steps": 4975, "total_steps": 6657, "loss": 0.0973, "lr": 7.296180213729196e-06, "epoch": 5.231335436382755, "percentage": 74.73, "elapsed_time": "5:05:51", "remaining_time": "1:43:24"}
|
| 996 |
+
{"current_steps": 4980, "total_steps": 6657, "loss": 0.1023, "lr": 7.255722755864734e-06, "epoch": 5.236593059936909, "percentage": 74.81, "elapsed_time": "5:06:14", "remaining_time": "1:43:07"}
|
| 997 |
+
{"current_steps": 4985, "total_steps": 6657, "loss": 0.0792, "lr": 7.21535290848372e-06, "epoch": 5.241850683491062, "percentage": 74.88, "elapsed_time": "5:06:33", "remaining_time": "1:42:49"}
|
| 998 |
+
{"current_steps": 4990, "total_steps": 6657, "loss": 0.0926, "lr": 7.175070949108496e-06, "epoch": 5.247108307045216, "percentage": 74.96, "elapsed_time": "5:06:57", "remaining_time": "1:42:32"}
|
| 999 |
+
{"current_steps": 4995, "total_steps": 6657, "loss": 0.1088, "lr": 7.1348771546572315e-06, "epoch": 5.252365930599369, "percentage": 75.03, "elapsed_time": "5:07:20", "remaining_time": "1:42:15"}
|
| 1000 |
+
{"current_steps": 5000, "total_steps": 6657, "loss": 0.0805, "lr": 7.09477180144202e-06, "epoch": 5.257623554153523, "percentage": 75.11, "elapsed_time": "5:07:39", "remaining_time": "1:41:57"}
|
| 1001 |
+
{"current_steps": 5005, "total_steps": 6657, "loss": 0.0991, "lr": 7.054755165166945e-06, "epoch": 5.262881177707676, "percentage": 75.18, "elapsed_time": "5:08:50", "remaining_time": "1:41:56"}
|
| 1002 |
+
{"current_steps": 5010, "total_steps": 6657, "loss": 0.0822, "lr": 7.014827520926206e-06, "epoch": 5.26813880126183, "percentage": 75.26, "elapsed_time": "5:09:10", "remaining_time": "1:41:38"}
|
| 1003 |
+
{"current_steps": 5015, "total_steps": 6657, "loss": 0.0777, "lr": 6.9749891432022505e-06, "epoch": 5.273396424815983, "percentage": 75.33, "elapsed_time": "5:09:37", "remaining_time": "1:41:22"}
|
| 1004 |
+
{"current_steps": 5020, "total_steps": 6657, "loss": 0.0683, "lr": 6.935240305863844e-06, "epoch": 5.278654048370137, "percentage": 75.41, "elapsed_time": "5:09:56", "remaining_time": "1:41:04"}
|
| 1005 |
+
{"current_steps": 5025, "total_steps": 6657, "loss": 0.0686, "lr": 6.895581282164201e-06, "epoch": 5.28391167192429, "percentage": 75.48, "elapsed_time": "5:10:22", "remaining_time": "1:40:48"}
|
| 1006 |
+
{"current_steps": 5030, "total_steps": 6657, "loss": 0.0743, "lr": 6.856012344739138e-06, "epoch": 5.289169295478444, "percentage": 75.56, "elapsed_time": "5:10:39", "remaining_time": "1:40:29"}
|
| 1007 |
+
{"current_steps": 5035, "total_steps": 6657, "loss": 0.0936, "lr": 6.816533765605144e-06, "epoch": 5.294426919032597, "percentage": 75.63, "elapsed_time": "5:11:12", "remaining_time": "1:40:15"}
|
| 1008 |
+
{"current_steps": 5040, "total_steps": 6657, "loss": 0.0779, "lr": 6.7771458161575685e-06, "epoch": 5.299684542586751, "percentage": 75.71, "elapsed_time": "5:11:28", "remaining_time": "1:39:55"}
|
| 1009 |
+
{"current_steps": 5045, "total_steps": 6657, "loss": 0.0888, "lr": 6.737848767168709e-06, "epoch": 5.304942166140904, "percentage": 75.78, "elapsed_time": "5:11:45", "remaining_time": "1:39:36"}
|
| 1010 |
+
{"current_steps": 5050, "total_steps": 6657, "loss": 0.0619, "lr": 6.698642888785965e-06, "epoch": 5.310199789695058, "percentage": 75.86, "elapsed_time": "5:12:04", "remaining_time": "1:39:18"}
|
| 1011 |
+
{"current_steps": 5055, "total_steps": 6657, "loss": 0.0816, "lr": 6.659528450530006e-06, "epoch": 5.315457413249211, "percentage": 75.94, "elapsed_time": "5:12:29", "remaining_time": "1:39:02"}
|
| 1012 |
+
{"current_steps": 5060, "total_steps": 6657, "loss": 0.0762, "lr": 6.6205057212928755e-06, "epoch": 5.320715036803365, "percentage": 76.01, "elapsed_time": "5:12:45", "remaining_time": "1:38:42"}
|
| 1013 |
+
{"current_steps": 5065, "total_steps": 6657, "loss": 0.0672, "lr": 6.5815749693361645e-06, "epoch": 5.325972660357518, "percentage": 76.09, "elapsed_time": "5:13:04", "remaining_time": "1:38:24"}
|
| 1014 |
+
{"current_steps": 5070, "total_steps": 6657, "loss": 0.074, "lr": 6.542736462289188e-06, "epoch": 5.331230283911672, "percentage": 76.16, "elapsed_time": "5:13:23", "remaining_time": "1:38:05"}
|
| 1015 |
+
{"current_steps": 5075, "total_steps": 6657, "loss": 0.0767, "lr": 6.503990467147101e-06, "epoch": 5.336487907465825, "percentage": 76.24, "elapsed_time": "5:13:41", "remaining_time": "1:37:47"}
|
| 1016 |
+
{"current_steps": 5080, "total_steps": 6657, "loss": 0.0902, "lr": 6.465337250269086e-06, "epoch": 5.341745531019979, "percentage": 76.31, "elapsed_time": "5:14:05", "remaining_time": "1:37:30"}
|
| 1017 |
+
{"current_steps": 5085, "total_steps": 6657, "loss": 0.073, "lr": 6.426777077376538e-06, "epoch": 5.347003154574132, "percentage": 76.39, "elapsed_time": "5:14:24", "remaining_time": "1:37:11"}
|
| 1018 |
+
{"current_steps": 5090, "total_steps": 6657, "loss": 0.0888, "lr": 6.388310213551223e-06, "epoch": 5.352260778128286, "percentage": 76.46, "elapsed_time": "5:14:39", "remaining_time": "1:36:52"}
|
| 1019 |
+
{"current_steps": 5095, "total_steps": 6657, "loss": 0.0938, "lr": 6.349936923233422e-06, "epoch": 5.357518401682439, "percentage": 76.54, "elapsed_time": "5:15:05", "remaining_time": "1:36:35"}
|
| 1020 |
+
{"current_steps": 5100, "total_steps": 6657, "loss": 0.0765, "lr": 6.311657470220178e-06, "epoch": 5.3627760252365935, "percentage": 76.61, "elapsed_time": "5:15:22", "remaining_time": "1:36:16"}
|
| 1021 |
+
{"current_steps": 5105, "total_steps": 6657, "loss": 0.0636, "lr": 6.273472117663446e-06, "epoch": 5.368033648790747, "percentage": 76.69, "elapsed_time": "5:15:39", "remaining_time": "1:35:57"}
|
| 1022 |
+
{"current_steps": 5110, "total_steps": 6657, "loss": 0.0734, "lr": 6.2353811280682715e-06, "epoch": 5.3732912723449004, "percentage": 76.76, "elapsed_time": "5:15:55", "remaining_time": "1:35:38"}
|
| 1023 |
+
{"current_steps": 5115, "total_steps": 6657, "loss": 0.1007, "lr": 6.19738476329101e-06, "epoch": 5.378548895899054, "percentage": 76.84, "elapsed_time": "5:16:24", "remaining_time": "1:35:23"}
|
| 1024 |
+
{"current_steps": 5120, "total_steps": 6657, "loss": 0.0608, "lr": 6.159483284537533e-06, "epoch": 5.383806519453207, "percentage": 76.91, "elapsed_time": "5:16:39", "remaining_time": "1:35:03"}
|
| 1025 |
+
{"current_steps": 5125, "total_steps": 6657, "loss": 0.0771, "lr": 6.121676952361395e-06, "epoch": 5.389064143007361, "percentage": 76.99, "elapsed_time": "5:17:06", "remaining_time": "1:34:47"}
|
| 1026 |
+
{"current_steps": 5130, "total_steps": 6657, "loss": 0.0714, "lr": 6.083966026662076e-06, "epoch": 5.394321766561514, "percentage": 77.06, "elapsed_time": "5:17:25", "remaining_time": "1:34:28"}
|
| 1027 |
+
{"current_steps": 5135, "total_steps": 6657, "loss": 0.0605, "lr": 6.046350766683194e-06, "epoch": 5.399579390115668, "percentage": 77.14, "elapsed_time": "5:17:42", "remaining_time": "1:34:09"}
|
| 1028 |
+
{"current_steps": 5140, "total_steps": 6657, "loss": 0.0822, "lr": 6.0088314310107e-06, "epoch": 5.404837013669821, "percentage": 77.21, "elapsed_time": "5:18:00", "remaining_time": "1:33:51"}
|
| 1029 |
+
{"current_steps": 5145, "total_steps": 6657, "loss": 0.0664, "lr": 5.9714082775711115e-06, "epoch": 5.410094637223975, "percentage": 77.29, "elapsed_time": "5:18:21", "remaining_time": "1:33:33"}
|
| 1030 |
+
{"current_steps": 5150, "total_steps": 6657, "loss": 0.0968, "lr": 5.934081563629764e-06, "epoch": 5.415352260778128, "percentage": 77.36, "elapsed_time": "5:18:45", "remaining_time": "1:33:16"}
|
| 1031 |
+
{"current_steps": 5155, "total_steps": 6657, "loss": 0.0934, "lr": 5.896851545788987e-06, "epoch": 5.420609884332282, "percentage": 77.44, "elapsed_time": "5:19:06", "remaining_time": "1:32:58"}
|
| 1032 |
+
{"current_steps": 5160, "total_steps": 6657, "loss": 0.0622, "lr": 5.859718479986407e-06, "epoch": 5.425867507886435, "percentage": 77.51, "elapsed_time": "5:19:22", "remaining_time": "1:32:39"}
|
| 1033 |
+
{"current_steps": 5165, "total_steps": 6657, "loss": 0.0882, "lr": 5.822682621493132e-06, "epoch": 5.431125131440589, "percentage": 77.59, "elapsed_time": "5:19:39", "remaining_time": "1:32:20"}
|
| 1034 |
+
{"current_steps": 5170, "total_steps": 6657, "loss": 0.0711, "lr": 5.7857442249120155e-06, "epoch": 5.436382754994742, "percentage": 77.66, "elapsed_time": "5:19:55", "remaining_time": "1:32:00"}
|
| 1035 |
+
{"current_steps": 5175, "total_steps": 6657, "loss": 0.0777, "lr": 5.748903544175934e-06, "epoch": 5.441640378548896, "percentage": 77.74, "elapsed_time": "5:20:11", "remaining_time": "1:31:41"}
|
| 1036 |
+
{"current_steps": 5180, "total_steps": 6657, "loss": 0.0927, "lr": 5.712160832545992e-06, "epoch": 5.446898002103049, "percentage": 77.81, "elapsed_time": "5:20:34", "remaining_time": "1:31:24"}
|
| 1037 |
+
{"current_steps": 5185, "total_steps": 6657, "loss": 0.1011, "lr": 5.675516342609811e-06, "epoch": 5.452155625657203, "percentage": 77.89, "elapsed_time": "5:20:55", "remaining_time": "1:31:06"}
|
| 1038 |
+
{"current_steps": 5190, "total_steps": 6657, "loss": 0.076, "lr": 5.638970326279802e-06, "epoch": 5.457413249211356, "percentage": 77.96, "elapsed_time": "5:21:20", "remaining_time": "1:30:49"}
|
| 1039 |
+
{"current_steps": 5195, "total_steps": 6657, "loss": 0.0786, "lr": 5.602523034791407e-06, "epoch": 5.46267087276551, "percentage": 78.04, "elapsed_time": "5:21:40", "remaining_time": "1:30:31"}
|
| 1040 |
+
{"current_steps": 5200, "total_steps": 6657, "loss": 0.0702, "lr": 5.566174718701378e-06, "epoch": 5.467928496319663, "percentage": 78.11, "elapsed_time": "5:21:56", "remaining_time": "1:30:12"}
|
| 1041 |
+
{"current_steps": 5205, "total_steps": 6657, "loss": 0.0694, "lr": 5.529925627886079e-06, "epoch": 5.473186119873817, "percentage": 78.19, "elapsed_time": "5:23:04", "remaining_time": "1:30:07"}
|
| 1042 |
+
{"current_steps": 5210, "total_steps": 6657, "loss": 0.067, "lr": 5.493776011539749e-06, "epoch": 5.47844374342797, "percentage": 78.26, "elapsed_time": "5:23:20", "remaining_time": "1:29:48"}
|
| 1043 |
+
{"current_steps": 5215, "total_steps": 6657, "loss": 0.0812, "lr": 5.457726118172761e-06, "epoch": 5.483701366982124, "percentage": 78.34, "elapsed_time": "5:23:36", "remaining_time": "1:29:28"}
|
| 1044 |
+
{"current_steps": 5220, "total_steps": 6657, "loss": 0.066, "lr": 5.421776195609982e-06, "epoch": 5.488958990536277, "percentage": 78.41, "elapsed_time": "5:23:52", "remaining_time": "1:29:09"}
|
| 1045 |
+
{"current_steps": 5225, "total_steps": 6657, "loss": 0.0664, "lr": 5.385926490989e-06, "epoch": 5.494216614090431, "percentage": 78.49, "elapsed_time": "5:24:10", "remaining_time": "1:28:50"}
|
| 1046 |
+
{"current_steps": 5230, "total_steps": 6657, "loss": 0.0816, "lr": 5.350177250758479e-06, "epoch": 5.499474237644584, "percentage": 78.56, "elapsed_time": "5:24:27", "remaining_time": "1:28:31"}
|
| 1047 |
+
{"current_steps": 5235, "total_steps": 6657, "loss": 0.0631, "lr": 5.314528720676424e-06, "epoch": 5.504731861198739, "percentage": 78.64, "elapsed_time": "5:24:53", "remaining_time": "1:28:14"}
|
| 1048 |
+
{"current_steps": 5240, "total_steps": 6657, "loss": 0.0684, "lr": 5.2789811458085085e-06, "epoch": 5.509989484752892, "percentage": 78.71, "elapsed_time": "5:25:12", "remaining_time": "1:27:56"}
|
| 1049 |
+
{"current_steps": 5245, "total_steps": 6657, "loss": 0.0656, "lr": 5.243534770526404e-06, "epoch": 5.515247108307046, "percentage": 78.79, "elapsed_time": "5:25:37", "remaining_time": "1:27:39"}
|
| 1050 |
+
{"current_steps": 5250, "total_steps": 6657, "loss": 0.1677, "lr": 5.208189838506074e-06, "epoch": 5.520504731861199, "percentage": 78.86, "elapsed_time": "5:26:18", "remaining_time": "1:27:27"}
|
| 1051 |
+
{"current_steps": 5255, "total_steps": 6657, "loss": 0.0647, "lr": 5.172946592726109e-06, "epoch": 5.5257623554153525, "percentage": 78.94, "elapsed_time": "5:26:34", "remaining_time": "1:27:07"}
|
| 1052 |
+
{"current_steps": 5260, "total_steps": 6657, "loss": 0.0758, "lr": 5.137805275466072e-06, "epoch": 5.531019978969506, "percentage": 79.01, "elapsed_time": "5:27:00", "remaining_time": "1:26:51"}
|
| 1053 |
+
{"current_steps": 5265, "total_steps": 6657, "loss": 0.0954, "lr": 5.1027661283048036e-06, "epoch": 5.5362776025236595, "percentage": 79.09, "elapsed_time": "5:27:24", "remaining_time": "1:26:33"}
|
| 1054 |
+
{"current_steps": 5270, "total_steps": 6657, "loss": 0.1397, "lr": 5.067829392118775e-06, "epoch": 5.541535226077813, "percentage": 79.16, "elapsed_time": "5:27:40", "remaining_time": "1:26:14"}
|
| 1055 |
+
{"current_steps": 5275, "total_steps": 6657, "loss": 0.1331, "lr": 5.03299530708045e-06, "epoch": 5.5467928496319665, "percentage": 79.24, "elapsed_time": "5:27:52", "remaining_time": "1:25:53"}
|
| 1056 |
+
{"current_steps": 5280, "total_steps": 6657, "loss": 0.1254, "lr": 4.998264112656617e-06, "epoch": 5.55205047318612, "percentage": 79.32, "elapsed_time": "5:28:04", "remaining_time": "1:25:33"}
|
| 1057 |
+
{"current_steps": 5285, "total_steps": 6657, "loss": 0.1239, "lr": 4.963636047606712e-06, "epoch": 5.5573080967402735, "percentage": 79.39, "elapsed_time": "5:28:15", "remaining_time": "1:25:13"}
|
| 1058 |
+
{"current_steps": 5290, "total_steps": 6657, "loss": 0.1289, "lr": 4.929111349981244e-06, "epoch": 5.562565720294427, "percentage": 79.47, "elapsed_time": "5:28:28", "remaining_time": "1:24:52"}
|
| 1059 |
+
{"current_steps": 5295, "total_steps": 6657, "loss": 0.127, "lr": 4.894690257120114e-06, "epoch": 5.5678233438485805, "percentage": 79.54, "elapsed_time": "5:28:39", "remaining_time": "1:24:32"}
|
| 1060 |
+
{"current_steps": 5300, "total_steps": 6657, "loss": 0.1198, "lr": 4.860373005650985e-06, "epoch": 5.573080967402734, "percentage": 79.62, "elapsed_time": "5:28:51", "remaining_time": "1:24:12"}
|
| 1061 |
+
{"current_steps": 5305, "total_steps": 6657, "loss": 0.1305, "lr": 4.826159831487656e-06, "epoch": 5.578338590956887, "percentage": 79.69, "elapsed_time": "5:29:03", "remaining_time": "1:23:51"}
|
| 1062 |
+
{"current_steps": 5310, "total_steps": 6657, "loss": 0.1163, "lr": 4.792050969828474e-06, "epoch": 5.583596214511041, "percentage": 79.77, "elapsed_time": "5:29:14", "remaining_time": "1:23:31"}
|
| 1063 |
+
{"current_steps": 5315, "total_steps": 6657, "loss": 0.1261, "lr": 4.758046655154664e-06, "epoch": 5.588853838065194, "percentage": 79.84, "elapsed_time": "5:29:26", "remaining_time": "1:23:11"}
|
| 1064 |
+
{"current_steps": 5320, "total_steps": 6657, "loss": 0.1169, "lr": 4.72414712122875e-06, "epoch": 5.594111461619348, "percentage": 79.92, "elapsed_time": "5:29:39", "remaining_time": "1:22:50"}
|
| 1065 |
+
{"current_steps": 5325, "total_steps": 6657, "loss": 0.1205, "lr": 4.690352601092954e-06, "epoch": 5.599369085173501, "percentage": 79.99, "elapsed_time": "5:29:51", "remaining_time": "1:22:30"}
|
| 1066 |
+
{"current_steps": 5330, "total_steps": 6657, "loss": 0.1147, "lr": 4.656663327067563e-06, "epoch": 5.604626708727655, "percentage": 80.07, "elapsed_time": "5:30:03", "remaining_time": "1:22:10"}
|
| 1067 |
+
{"current_steps": 5335, "total_steps": 6657, "loss": 0.1068, "lr": 4.623079530749355e-06, "epoch": 5.609884332281808, "percentage": 80.14, "elapsed_time": "5:30:15", "remaining_time": "1:21:50"}
|
| 1068 |
+
{"current_steps": 5340, "total_steps": 6657, "loss": 0.1081, "lr": 4.589601443010012e-06, "epoch": 5.615141955835962, "percentage": 80.22, "elapsed_time": "5:30:26", "remaining_time": "1:21:29"}
|
| 1069 |
+
{"current_steps": 5345, "total_steps": 6657, "loss": 0.1208, "lr": 4.55622929399451e-06, "epoch": 5.620399579390115, "percentage": 80.29, "elapsed_time": "5:30:39", "remaining_time": "1:21:09"}
|
| 1070 |
+
{"current_steps": 5350, "total_steps": 6657, "loss": 0.118, "lr": 4.522963313119564e-06, "epoch": 5.625657202944269, "percentage": 80.37, "elapsed_time": "5:30:51", "remaining_time": "1:20:49"}
|
| 1071 |
+
{"current_steps": 5355, "total_steps": 6657, "loss": 0.1203, "lr": 4.48980372907202e-06, "epoch": 5.630914826498422, "percentage": 80.44, "elapsed_time": "5:31:04", "remaining_time": "1:20:29"}
|
| 1072 |
+
{"current_steps": 5360, "total_steps": 6657, "loss": 0.1198, "lr": 4.456750769807303e-06, "epoch": 5.636172450052577, "percentage": 80.52, "elapsed_time": "5:31:16", "remaining_time": "1:20:09"}
|
| 1073 |
+
{"current_steps": 5365, "total_steps": 6657, "loss": 0.1156, "lr": 4.4238046625478635e-06, "epoch": 5.641430073606729, "percentage": 80.59, "elapsed_time": "5:31:28", "remaining_time": "1:19:49"}
|
| 1074 |
+
{"current_steps": 5370, "total_steps": 6657, "loss": 0.1204, "lr": 4.390965633781579e-06, "epoch": 5.646687697160884, "percentage": 80.67, "elapsed_time": "5:31:42", "remaining_time": "1:19:29"}
|
| 1075 |
+
{"current_steps": 5375, "total_steps": 6657, "loss": 0.1185, "lr": 4.358233909260215e-06, "epoch": 5.651945320715037, "percentage": 80.74, "elapsed_time": "5:31:56", "remaining_time": "1:19:10"}
|
| 1076 |
+
{"current_steps": 5380, "total_steps": 6657, "loss": 0.1114, "lr": 4.3256097139978934e-06, "epoch": 5.657202944269191, "percentage": 80.82, "elapsed_time": "5:32:08", "remaining_time": "1:18:50"}
|
| 1077 |
+
{"current_steps": 5385, "total_steps": 6657, "loss": 0.107, "lr": 4.293093272269513e-06, "epoch": 5.662460567823344, "percentage": 80.89, "elapsed_time": "5:32:19", "remaining_time": "1:18:29"}
|
| 1078 |
+
{"current_steps": 5390, "total_steps": 6657, "loss": 0.1141, "lr": 4.260684807609217e-06, "epoch": 5.667718191377498, "percentage": 80.97, "elapsed_time": "5:32:31", "remaining_time": "1:18:09"}
|
| 1079 |
+
{"current_steps": 5395, "total_steps": 6657, "loss": 0.1163, "lr": 4.22838454280887e-06, "epoch": 5.672975814931651, "percentage": 81.04, "elapsed_time": "5:32:43", "remaining_time": "1:17:49"}
|
| 1080 |
+
{"current_steps": 5400, "total_steps": 6657, "loss": 0.1173, "lr": 4.196192699916528e-06, "epoch": 5.678233438485805, "percentage": 81.12, "elapsed_time": "5:32:55", "remaining_time": "1:17:29"}
|
| 1081 |
+
{"current_steps": 5405, "total_steps": 6657, "loss": 0.1177, "lr": 4.164109500234865e-06, "epoch": 5.683491062039958, "percentage": 81.19, "elapsed_time": "5:34:08", "remaining_time": "1:17:24"}
|
| 1082 |
+
{"current_steps": 5410, "total_steps": 6657, "loss": 0.1135, "lr": 4.1321351643197235e-06, "epoch": 5.688748685594112, "percentage": 81.27, "elapsed_time": "5:34:20", "remaining_time": "1:17:03"}
|
| 1083 |
+
{"current_steps": 5415, "total_steps": 6657, "loss": 0.1166, "lr": 4.100269911978549e-06, "epoch": 5.694006309148265, "percentage": 81.34, "elapsed_time": "5:34:32", "remaining_time": "1:16:43"}
|
| 1084 |
+
{"current_steps": 5420, "total_steps": 6657, "loss": 0.1148, "lr": 4.068513962268892e-06, "epoch": 5.699263932702419, "percentage": 81.42, "elapsed_time": "5:34:44", "remaining_time": "1:16:23"}
|
| 1085 |
+
{"current_steps": 5425, "total_steps": 6657, "loss": 0.1061, "lr": 4.036867533496895e-06, "epoch": 5.704521556256572, "percentage": 81.49, "elapsed_time": "5:34:55", "remaining_time": "1:16:03"}
|
| 1086 |
+
{"current_steps": 5430, "total_steps": 6657, "loss": 0.1111, "lr": 4.00533084321582e-06, "epoch": 5.709779179810726, "percentage": 81.57, "elapsed_time": "5:35:07", "remaining_time": "1:15:43"}
|
| 1087 |
+
{"current_steps": 5435, "total_steps": 6657, "loss": 0.1039, "lr": 3.9739041082245114e-06, "epoch": 5.715036803364879, "percentage": 81.64, "elapsed_time": "5:35:19", "remaining_time": "1:15:23"}
|
| 1088 |
+
{"current_steps": 5440, "total_steps": 6657, "loss": 0.1123, "lr": 3.942587544565932e-06, "epoch": 5.720294426919033, "percentage": 81.72, "elapsed_time": "5:35:30", "remaining_time": "1:15:03"}
|
| 1089 |
+
{"current_steps": 5445, "total_steps": 6657, "loss": 0.1093, "lr": 3.9113813675256816e-06, "epoch": 5.725552050473186, "percentage": 81.79, "elapsed_time": "5:35:42", "remaining_time": "1:14:43"}
|
| 1090 |
+
{"current_steps": 5450, "total_steps": 6657, "loss": 0.1081, "lr": 3.8802857916305006e-06, "epoch": 5.7308096740273395, "percentage": 81.87, "elapsed_time": "5:35:54", "remaining_time": "1:14:23"}
|
| 1091 |
+
{"current_steps": 5455, "total_steps": 6657, "loss": 0.1129, "lr": 3.849301030646797e-06, "epoch": 5.736067297581493, "percentage": 81.94, "elapsed_time": "5:36:06", "remaining_time": "1:14:03"}
|
| 1092 |
+
{"current_steps": 5460, "total_steps": 6657, "loss": 0.1092, "lr": 3.818427297579186e-06, "epoch": 5.7413249211356465, "percentage": 82.02, "elapsed_time": "5:36:18", "remaining_time": "1:13:43"}
|
| 1093 |
+
{"current_steps": 5465, "total_steps": 6657, "loss": 0.1116, "lr": 3.787664804669027e-06, "epoch": 5.7465825446898, "percentage": 82.09, "elapsed_time": "5:36:29", "remaining_time": "1:13:23"}
|
| 1094 |
+
{"current_steps": 5470, "total_steps": 6657, "loss": 0.11, "lr": 3.7570137633929647e-06, "epoch": 5.7518401682439535, "percentage": 82.17, "elapsed_time": "5:36:42", "remaining_time": "1:13:03"}
|
| 1095 |
+
{"current_steps": 5475, "total_steps": 6657, "loss": 0.118, "lr": 3.7264743844614424e-06, "epoch": 5.757097791798107, "percentage": 82.24, "elapsed_time": "5:36:54", "remaining_time": "1:12:44"}
|
| 1096 |
+
{"current_steps": 5480, "total_steps": 6657, "loss": 0.1153, "lr": 3.6960468778173097e-06, "epoch": 5.7623554153522605, "percentage": 82.32, "elapsed_time": "5:37:07", "remaining_time": "1:12:24"}
|
| 1097 |
+
{"current_steps": 5485, "total_steps": 6657, "loss": 0.1151, "lr": 3.665731452634347e-06, "epoch": 5.767613038906414, "percentage": 82.39, "elapsed_time": "5:37:19", "remaining_time": "1:12:04"}
|
| 1098 |
+
{"current_steps": 5490, "total_steps": 6657, "loss": 0.1084, "lr": 3.6355283173158153e-06, "epoch": 5.7728706624605675, "percentage": 82.47, "elapsed_time": "5:37:32", "remaining_time": "1:11:44"}
|
| 1099 |
+
{"current_steps": 5495, "total_steps": 6657, "loss": 0.1141, "lr": 3.6054376794930467e-06, "epoch": 5.778128286014722, "percentage": 82.54, "elapsed_time": "5:37:44", "remaining_time": "1:11:25"}
|
| 1100 |
+
{"current_steps": 5500, "total_steps": 6657, "loss": 0.1093, "lr": 3.5754597460240216e-06, "epoch": 5.783385909568874, "percentage": 82.62, "elapsed_time": "5:37:55", "remaining_time": "1:11:05"}
|
| 1101 |
+
{"current_steps": 5505, "total_steps": 6657, "loss": 0.1026, "lr": 3.5455947229919185e-06, "epoch": 5.788643533123029, "percentage": 82.69, "elapsed_time": "5:38:07", "remaining_time": "1:10:45"}
|
| 1102 |
+
{"current_steps": 5510, "total_steps": 6657, "loss": 0.1044, "lr": 3.515842815703716e-06, "epoch": 5.793901156677181, "percentage": 82.77, "elapsed_time": "5:38:18", "remaining_time": "1:10:25"}
|
| 1103 |
+
{"current_steps": 5515, "total_steps": 6657, "loss": 0.1127, "lr": 3.4862042286887943e-06, "epoch": 5.799158780231336, "percentage": 82.85, "elapsed_time": "5:38:31", "remaining_time": "1:10:05"}
|
| 1104 |
+
{"current_steps": 5520, "total_steps": 6657, "loss": 0.1184, "lr": 3.456679165697494e-06, "epoch": 5.804416403785489, "percentage": 82.92, "elapsed_time": "5:38:43", "remaining_time": "1:09:46"}
|
| 1105 |
+
{"current_steps": 5525, "total_steps": 6657, "loss": 0.1109, "lr": 3.427267829699741e-06, "epoch": 5.809674027339643, "percentage": 83.0, "elapsed_time": "5:38:54", "remaining_time": "1:09:26"}
|
| 1106 |
+
{"current_steps": 5530, "total_steps": 6657, "loss": 0.1081, "lr": 3.3979704228836586e-06, "epoch": 5.814931650893796, "percentage": 83.07, "elapsed_time": "5:39:07", "remaining_time": "1:09:06"}
|
| 1107 |
+
{"current_steps": 5535, "total_steps": 6657, "loss": 0.1176, "lr": 3.3687871466541424e-06, "epoch": 5.82018927444795, "percentage": 83.15, "elapsed_time": "5:39:20", "remaining_time": "1:08:47"}
|
| 1108 |
+
{"current_steps": 5540, "total_steps": 6657, "loss": 0.1069, "lr": 3.339718201631521e-06, "epoch": 5.825446898002103, "percentage": 83.22, "elapsed_time": "5:39:31", "remaining_time": "1:08:27"}
|
| 1109 |
+
{"current_steps": 5545, "total_steps": 6657, "loss": 0.1098, "lr": 3.3107637876501352e-06, "epoch": 5.830704521556257, "percentage": 83.3, "elapsed_time": "5:39:43", "remaining_time": "1:08:07"}
|
| 1110 |
+
{"current_steps": 5550, "total_steps": 6657, "loss": 0.1102, "lr": 3.2819241037569838e-06, "epoch": 5.83596214511041, "percentage": 83.37, "elapsed_time": "5:39:55", "remaining_time": "1:07:48"}
|
| 1111 |
+
{"current_steps": 5555, "total_steps": 6657, "loss": 0.104, "lr": 3.253199348210372e-06, "epoch": 5.841219768664564, "percentage": 83.45, "elapsed_time": "5:40:06", "remaining_time": "1:07:28"}
|
| 1112 |
+
{"current_steps": 5560, "total_steps": 6657, "loss": 0.11, "lr": 3.2245897184785103e-06, "epoch": 5.846477392218717, "percentage": 83.52, "elapsed_time": "5:40:19", "remaining_time": "1:07:08"}
|
| 1113 |
+
{"current_steps": 5565, "total_steps": 6657, "loss": 0.1164, "lr": 3.1960954112381825e-06, "epoch": 5.851735015772871, "percentage": 83.6, "elapsed_time": "5:40:32", "remaining_time": "1:06:49"}
|
| 1114 |
+
{"current_steps": 5570, "total_steps": 6657, "loss": 0.1064, "lr": 3.1677166223733934e-06, "epoch": 5.856992639327024, "percentage": 83.67, "elapsed_time": "5:40:43", "remaining_time": "1:06:29"}
|
| 1115 |
+
{"current_steps": 5575, "total_steps": 6657, "loss": 0.1119, "lr": 3.1394535469740273e-06, "epoch": 5.862250262881178, "percentage": 83.75, "elapsed_time": "5:40:54", "remaining_time": "1:06:09"}
|
| 1116 |
+
{"current_steps": 5580, "total_steps": 6657, "loss": 0.1116, "lr": 3.111306379334462e-06, "epoch": 5.867507886435331, "percentage": 83.82, "elapsed_time": "5:41:06", "remaining_time": "1:05:50"}
|
| 1117 |
+
{"current_steps": 5585, "total_steps": 6657, "loss": 0.1081, "lr": 3.083275312952301e-06, "epoch": 5.872765509989485, "percentage": 83.9, "elapsed_time": "5:41:18", "remaining_time": "1:05:30"}
|
| 1118 |
+
{"current_steps": 5590, "total_steps": 6657, "loss": 0.111, "lr": 3.055360540527006e-06, "epoch": 5.878023133543638, "percentage": 83.97, "elapsed_time": "5:41:30", "remaining_time": "1:05:11"}
|
| 1119 |
+
{"current_steps": 5595, "total_steps": 6657, "loss": 0.1097, "lr": 3.0275622539585556e-06, "epoch": 5.883280757097792, "percentage": 84.05, "elapsed_time": "5:41:42", "remaining_time": "1:04:51"}
|
| 1120 |
+
{"current_steps": 5600, "total_steps": 6657, "loss": 0.1142, "lr": 2.999880644346165e-06, "epoch": 5.888538380651945, "percentage": 84.12, "elapsed_time": "5:41:53", "remaining_time": "1:04:31"}
|
| 1121 |
+
{"current_steps": 5605, "total_steps": 6657, "loss": 0.1055, "lr": 2.9723159019869597e-06, "epoch": 5.893796004206099, "percentage": 84.2, "elapsed_time": "5:42:58", "remaining_time": "1:04:22"}
|
| 1122 |
+
{"current_steps": 5610, "total_steps": 6657, "loss": 0.1224, "lr": 2.9448682163746413e-06, "epoch": 5.899053627760252, "percentage": 84.27, "elapsed_time": "5:43:10", "remaining_time": "1:04:02"}
|
| 1123 |
+
{"current_steps": 5615, "total_steps": 6657, "loss": 0.111, "lr": 2.917537776198216e-06, "epoch": 5.904311251314406, "percentage": 84.35, "elapsed_time": "5:43:24", "remaining_time": "1:03:43"}
|
| 1124 |
+
{"current_steps": 5620, "total_steps": 6657, "loss": 0.117, "lr": 2.8903247693406932e-06, "epoch": 5.909568874868559, "percentage": 84.42, "elapsed_time": "5:43:36", "remaining_time": "1:03:24"}
|
| 1125 |
+
{"current_steps": 5625, "total_steps": 6657, "loss": 0.1089, "lr": 2.863229382877777e-06, "epoch": 5.914826498422713, "percentage": 84.5, "elapsed_time": "5:43:47", "remaining_time": "1:03:04"}
|
| 1126 |
+
{"current_steps": 5630, "total_steps": 6657, "loss": 0.1038, "lr": 2.8362518030765904e-06, "epoch": 5.920084121976867, "percentage": 84.57, "elapsed_time": "5:43:59", "remaining_time": "1:02:45"}
|
| 1127 |
+
{"current_steps": 5635, "total_steps": 6657, "loss": 0.113, "lr": 2.8093922153944065e-06, "epoch": 5.9253417455310196, "percentage": 84.65, "elapsed_time": "5:44:11", "remaining_time": "1:02:25"}
|
| 1128 |
+
{"current_steps": 5640, "total_steps": 6657, "loss": 0.1096, "lr": 2.782650804477347e-06, "epoch": 5.930599369085174, "percentage": 84.72, "elapsed_time": "5:44:23", "remaining_time": "1:02:06"}
|
| 1129 |
+
{"current_steps": 5645, "total_steps": 6657, "loss": 0.107, "lr": 2.7560277541591427e-06, "epoch": 5.9358569926393265, "percentage": 84.8, "elapsed_time": "5:44:35", "remaining_time": "1:01:46"}
|
| 1130 |
+
{"current_steps": 5650, "total_steps": 6657, "loss": 0.1116, "lr": 2.7295232474598445e-06, "epoch": 5.941114616193481, "percentage": 84.87, "elapsed_time": "5:44:46", "remaining_time": "1:01:27"}
|
| 1131 |
+
{"current_steps": 5655, "total_steps": 6657, "loss": 0.1229, "lr": 2.703137466584571e-06, "epoch": 5.946372239747634, "percentage": 84.95, "elapsed_time": "5:44:59", "remaining_time": "1:01:07"}
|
| 1132 |
+
{"current_steps": 5660, "total_steps": 6657, "loss": 0.1092, "lr": 2.6768705929222827e-06, "epoch": 5.951629863301788, "percentage": 85.02, "elapsed_time": "5:45:10", "remaining_time": "1:00:48"}
|
| 1133 |
+
{"current_steps": 5665, "total_steps": 6657, "loss": 0.1123, "lr": 2.6507228070444922e-06, "epoch": 5.956887486855941, "percentage": 85.1, "elapsed_time": "5:45:25", "remaining_time": "1:00:29"}
|
| 1134 |
+
{"current_steps": 5670, "total_steps": 6657, "loss": 0.1055, "lr": 2.6246942887040416e-06, "epoch": 5.962145110410095, "percentage": 85.17, "elapsed_time": "5:45:40", "remaining_time": "1:00:10"}
|
| 1135 |
+
{"current_steps": 5675, "total_steps": 6657, "loss": 0.1036, "lr": 2.5987852168338922e-06, "epoch": 5.967402733964248, "percentage": 85.25, "elapsed_time": "5:45:51", "remaining_time": "0:59:50"}
|
| 1136 |
+
{"current_steps": 5680, "total_steps": 6657, "loss": 0.1107, "lr": 2.5729957695458454e-06, "epoch": 5.972660357518402, "percentage": 85.32, "elapsed_time": "5:46:02", "remaining_time": "0:59:31"}
|
| 1137 |
+
{"current_steps": 5685, "total_steps": 6657, "loss": 0.1099, "lr": 2.5473261241293547e-06, "epoch": 5.977917981072555, "percentage": 85.4, "elapsed_time": "5:46:15", "remaining_time": "0:59:12"}
|
| 1138 |
+
{"current_steps": 5690, "total_steps": 6657, "loss": 0.1136, "lr": 2.521776457050302e-06, "epoch": 5.983175604626709, "percentage": 85.47, "elapsed_time": "5:46:27", "remaining_time": "0:58:52"}
|
| 1139 |
+
{"current_steps": 5695, "total_steps": 6657, "loss": 0.1134, "lr": 2.4963469439497703e-06, "epoch": 5.988433228180862, "percentage": 85.55, "elapsed_time": "5:46:39", "remaining_time": "0:58:33"}
|
| 1140 |
+
{"current_steps": 5700, "total_steps": 6657, "loss": 0.11, "lr": 2.4710377596428404e-06, "epoch": 5.993690851735016, "percentage": 85.62, "elapsed_time": "5:46:51", "remaining_time": "0:58:14"}
|
| 1141 |
+
{"current_steps": 5705, "total_steps": 6657, "loss": 0.1113, "lr": 2.4458490781174084e-06, "epoch": 5.998948475289169, "percentage": 85.7, "elapsed_time": "5:47:02", "remaining_time": "0:57:54"}
|
| 1142 |
+
{"current_steps": 5710, "total_steps": 6657, "loss": 0.0977, "lr": 2.4207810725329583e-06, "epoch": 6.004206098843323, "percentage": 85.77, "elapsed_time": "5:47:26", "remaining_time": "0:57:37"}
|
| 1143 |
+
{"current_steps": 5715, "total_steps": 6657, "loss": 0.0806, "lr": 2.395833915219401e-06, "epoch": 6.009463722397476, "percentage": 85.85, "elapsed_time": "5:47:44", "remaining_time": "0:57:19"}
|
| 1144 |
+
{"current_steps": 5720, "total_steps": 6657, "loss": 0.0731, "lr": 2.3710077776758713e-06, "epoch": 6.01472134595163, "percentage": 85.92, "elapsed_time": "5:48:05", "remaining_time": "0:57:01"}
|
| 1145 |
+
{"current_steps": 5725, "total_steps": 6657, "loss": 0.0876, "lr": 2.3463028305695447e-06, "epoch": 6.019978969505783, "percentage": 86.0, "elapsed_time": "5:48:22", "remaining_time": "0:56:42"}
|
| 1146 |
+
{"current_steps": 5730, "total_steps": 6657, "loss": 0.0713, "lr": 2.3217192437344925e-06, "epoch": 6.025236593059937, "percentage": 86.07, "elapsed_time": "5:48:45", "remaining_time": "0:56:25"}
|
| 1147 |
+
{"current_steps": 5735, "total_steps": 6657, "loss": 0.0748, "lr": 2.2972571861704784e-06, "epoch": 6.03049421661409, "percentage": 86.15, "elapsed_time": "5:49:02", "remaining_time": "0:56:06"}
|
| 1148 |
+
{"current_steps": 5740, "total_steps": 6657, "loss": 0.1273, "lr": 2.2729168260418224e-06, "epoch": 6.035751840168244, "percentage": 86.23, "elapsed_time": "5:49:22", "remaining_time": "0:55:48"}
|
| 1149 |
+
{"current_steps": 5745, "total_steps": 6657, "loss": 0.0737, "lr": 2.2486983306762332e-06, "epoch": 6.041009463722397, "percentage": 86.3, "elapsed_time": "5:49:41", "remaining_time": "0:55:30"}
|
| 1150 |
+
{"current_steps": 5750, "total_steps": 6657, "loss": 0.0943, "lr": 2.224601866563665e-06, "epoch": 6.046267087276551, "percentage": 86.38, "elapsed_time": "5:50:04", "remaining_time": "0:55:13"}
|
| 1151 |
+
{"current_steps": 5755, "total_steps": 6657, "loss": 0.0942, "lr": 2.2006275993551563e-06, "epoch": 6.051524710830704, "percentage": 86.45, "elapsed_time": "5:50:35", "remaining_time": "0:54:56"}
|
| 1152 |
+
{"current_steps": 5760, "total_steps": 6657, "loss": 0.0678, "lr": 2.176775693861719e-06, "epoch": 6.056782334384858, "percentage": 86.53, "elapsed_time": "5:50:57", "remaining_time": "0:54:39"}
|
| 1153 |
+
{"current_steps": 5765, "total_steps": 6657, "loss": 0.0742, "lr": 2.1530463140531886e-06, "epoch": 6.062039957939011, "percentage": 86.6, "elapsed_time": "5:51:20", "remaining_time": "0:54:21"}
|
| 1154 |
+
{"current_steps": 5770, "total_steps": 6657, "loss": 0.0729, "lr": 2.129439623057077e-06, "epoch": 6.067297581493165, "percentage": 86.68, "elapsed_time": "5:51:43", "remaining_time": "0:54:04"}
|
| 1155 |
+
{"current_steps": 5775, "total_steps": 6657, "loss": 0.105, "lr": 2.105955783157498e-06, "epoch": 6.072555205047319, "percentage": 86.75, "elapsed_time": "5:52:04", "remaining_time": "0:53:46"}
|
| 1156 |
+
{"current_steps": 5780, "total_steps": 6657, "loss": 0.0677, "lr": 2.0825949557940174e-06, "epoch": 6.0778128286014725, "percentage": 86.83, "elapsed_time": "5:52:20", "remaining_time": "0:53:27"}
|
| 1157 |
+
{"current_steps": 5785, "total_steps": 6657, "loss": 0.0757, "lr": 2.059357301560547e-06, "epoch": 6.083070452155626, "percentage": 86.9, "elapsed_time": "5:52:42", "remaining_time": "0:53:09"}
|
| 1158 |
+
{"current_steps": 5790, "total_steps": 6657, "loss": 0.062, "lr": 2.036242980204244e-06, "epoch": 6.0883280757097795, "percentage": 86.98, "elapsed_time": "5:52:59", "remaining_time": "0:52:51"}
|
| 1159 |
+
{"current_steps": 5795, "total_steps": 6657, "loss": 0.0809, "lr": 2.0132521506244294e-06, "epoch": 6.093585699263933, "percentage": 87.05, "elapsed_time": "5:53:15", "remaining_time": "0:52:32"}
|
| 1160 |
+
{"current_steps": 5800, "total_steps": 6657, "loss": 0.0783, "lr": 1.9903849708714664e-06, "epoch": 6.0988433228180865, "percentage": 87.13, "elapsed_time": "5:53:36", "remaining_time": "0:52:14"}
|
| 1161 |
+
{"current_steps": 5805, "total_steps": 6657, "loss": 0.0854, "lr": 1.967641598145684e-06, "epoch": 6.10410094637224, "percentage": 87.2, "elapsed_time": "5:54:54", "remaining_time": "0:52:05"}
|
| 1162 |
+
{"current_steps": 5810, "total_steps": 6657, "loss": 0.0833, "lr": 1.9450221887963194e-06, "epoch": 6.1093585699263935, "percentage": 87.28, "elapsed_time": "5:55:15", "remaining_time": "0:51:47"}
|
| 1163 |
+
{"current_steps": 5815, "total_steps": 6657, "loss": 0.1118, "lr": 1.922526898320407e-06, "epoch": 6.114616193480547, "percentage": 87.35, "elapsed_time": "5:55:38", "remaining_time": "0:51:29"}
|
| 1164 |
+
{"current_steps": 5820, "total_steps": 6657, "loss": 0.0671, "lr": 1.900155881361727e-06, "epoch": 6.1198738170347005, "percentage": 87.43, "elapsed_time": "5:55:54", "remaining_time": "0:51:11"}
|
| 1165 |
+
{"current_steps": 5825, "total_steps": 6657, "loss": 0.084, "lr": 1.8779092917097564e-06, "epoch": 6.125131440588854, "percentage": 87.5, "elapsed_time": "5:56:09", "remaining_time": "0:50:52"}
|
| 1166 |
+
{"current_steps": 5830, "total_steps": 6657, "loss": 0.0981, "lr": 1.85578728229858e-06, "epoch": 6.130389064143007, "percentage": 87.58, "elapsed_time": "5:56:32", "remaining_time": "0:50:34"}
|
| 1167 |
+
{"current_steps": 5835, "total_steps": 6657, "loss": 0.0838, "lr": 1.8337900052058732e-06, "epoch": 6.135646687697161, "percentage": 87.65, "elapsed_time": "5:56:58", "remaining_time": "0:50:17"}
|
| 1168 |
+
{"current_steps": 5840, "total_steps": 6657, "loss": 0.1296, "lr": 1.811917611651821e-06, "epoch": 6.140904311251314, "percentage": 87.73, "elapsed_time": "5:57:21", "remaining_time": "0:49:59"}
|
| 1169 |
+
{"current_steps": 5845, "total_steps": 6657, "loss": 0.2161, "lr": 1.7901702519981068e-06, "epoch": 6.146161934805468, "percentage": 87.8, "elapsed_time": "5:57:48", "remaining_time": "0:49:42"}
|
| 1170 |
+
{"current_steps": 5850, "total_steps": 6657, "loss": 0.09, "lr": 1.7685480757468765e-06, "epoch": 6.151419558359621, "percentage": 87.88, "elapsed_time": "5:58:09", "remaining_time": "0:49:24"}
|
| 1171 |
+
{"current_steps": 5855, "total_steps": 6657, "loss": 0.0871, "lr": 1.7470512315396894e-06, "epoch": 6.156677181913775, "percentage": 87.95, "elapsed_time": "5:58:33", "remaining_time": "0:49:06"}
|
| 1172 |
+
{"current_steps": 5860, "total_steps": 6657, "loss": 0.1593, "lr": 1.7256798671565111e-06, "epoch": 6.161934805467928, "percentage": 88.03, "elapsed_time": "5:59:01", "remaining_time": "0:48:49"}
|
| 1173 |
+
{"current_steps": 5865, "total_steps": 6657, "loss": 0.0735, "lr": 1.7044341295147116e-06, "epoch": 6.167192429022082, "percentage": 88.1, "elapsed_time": "5:59:22", "remaining_time": "0:48:31"}
|
| 1174 |
+
{"current_steps": 5870, "total_steps": 6657, "loss": 0.0779, "lr": 1.683314164668024e-06, "epoch": 6.172450052576235, "percentage": 88.18, "elapsed_time": "5:59:41", "remaining_time": "0:48:13"}
|
| 1175 |
+
{"current_steps": 5875, "total_steps": 6657, "loss": 0.0776, "lr": 1.6623201178055603e-06, "epoch": 6.177707676130389, "percentage": 88.25, "elapsed_time": "6:00:00", "remaining_time": "0:47:55"}
|
| 1176 |
+
{"current_steps": 5880, "total_steps": 6657, "loss": 0.0692, "lr": 1.6414521332508183e-06, "epoch": 6.182965299684542, "percentage": 88.33, "elapsed_time": "6:00:23", "remaining_time": "0:47:37"}
|
| 1177 |
+
{"current_steps": 5885, "total_steps": 6657, "loss": 0.0667, "lr": 1.6207103544606795e-06, "epoch": 6.188222923238696, "percentage": 88.4, "elapsed_time": "6:00:38", "remaining_time": "0:47:18"}
|
| 1178 |
+
{"current_steps": 5890, "total_steps": 6657, "loss": 0.0727, "lr": 1.6000949240244047e-06, "epoch": 6.193480546792849, "percentage": 88.48, "elapsed_time": "6:00:54", "remaining_time": "0:46:59"}
|
| 1179 |
+
{"current_steps": 5895, "total_steps": 6657, "loss": 0.0603, "lr": 1.5796059836626998e-06, "epoch": 6.198738170347003, "percentage": 88.55, "elapsed_time": "6:01:10", "remaining_time": "0:46:41"}
|
| 1180 |
+
{"current_steps": 5900, "total_steps": 6657, "loss": 0.1136, "lr": 1.5592436742267048e-06, "epoch": 6.203995793901156, "percentage": 88.63, "elapsed_time": "6:01:36", "remaining_time": "0:46:23"}
|
| 1181 |
+
{"current_steps": 5905, "total_steps": 6657, "loss": 0.0535, "lr": 1.5390081356970331e-06, "epoch": 6.20925341745531, "percentage": 88.7, "elapsed_time": "6:01:51", "remaining_time": "0:46:04"}
|
| 1182 |
+
{"current_steps": 5910, "total_steps": 6657, "loss": 0.0632, "lr": 1.5188995071828117e-06, "epoch": 6.214511041009464, "percentage": 88.78, "elapsed_time": "6:02:09", "remaining_time": "0:45:46"}
|
| 1183 |
+
{"current_steps": 5915, "total_steps": 6657, "loss": 0.0618, "lr": 1.498917926920731e-06, "epoch": 6.219768664563618, "percentage": 88.85, "elapsed_time": "6:02:24", "remaining_time": "0:45:27"}
|
| 1184 |
+
{"current_steps": 5920, "total_steps": 6657, "loss": 0.0752, "lr": 1.4790635322740855e-06, "epoch": 6.225026288117771, "percentage": 88.93, "elapsed_time": "6:02:50", "remaining_time": "0:45:10"}
|
| 1185 |
+
{"current_steps": 5925, "total_steps": 6657, "loss": 0.0966, "lr": 1.4593364597318305e-06, "epoch": 6.230283911671925, "percentage": 89.0, "elapsed_time": "6:03:14", "remaining_time": "0:44:52"}
|
| 1186 |
+
{"current_steps": 5930, "total_steps": 6657, "loss": 0.093, "lr": 1.4397368449076443e-06, "epoch": 6.235541535226078, "percentage": 89.08, "elapsed_time": "6:03:38", "remaining_time": "0:44:34"}
|
| 1187 |
+
{"current_steps": 5935, "total_steps": 6657, "loss": 0.0771, "lr": 1.4202648225390103e-06, "epoch": 6.240799158780232, "percentage": 89.15, "elapsed_time": "6:03:56", "remaining_time": "0:44:16"}
|
| 1188 |
+
{"current_steps": 5940, "total_steps": 6657, "loss": 0.0853, "lr": 1.4009205264862646e-06, "epoch": 6.246056782334385, "percentage": 89.23, "elapsed_time": "6:04:19", "remaining_time": "0:43:58"}
|
| 1189 |
+
{"current_steps": 5945, "total_steps": 6657, "loss": 0.104, "lr": 1.3817040897316903e-06, "epoch": 6.251314405888539, "percentage": 89.3, "elapsed_time": "6:04:44", "remaining_time": "0:43:41"}
|
| 1190 |
+
{"current_steps": 5950, "total_steps": 6657, "loss": 0.0769, "lr": 1.362615644378611e-06, "epoch": 6.256572029442692, "percentage": 89.38, "elapsed_time": "6:05:02", "remaining_time": "0:43:22"}
|
| 1191 |
+
{"current_steps": 5955, "total_steps": 6657, "loss": 0.0911, "lr": 1.3436553216504721e-06, "epoch": 6.261829652996846, "percentage": 89.45, "elapsed_time": "6:05:19", "remaining_time": "0:43:03"}
|
| 1192 |
+
{"current_steps": 5960, "total_steps": 6657, "loss": 0.0765, "lr": 1.324823251889924e-06, "epoch": 6.267087276550999, "percentage": 89.53, "elapsed_time": "6:05:39", "remaining_time": "0:42:45"}
|
| 1193 |
+
{"current_steps": 5965, "total_steps": 6657, "loss": 0.07, "lr": 1.3061195645579661e-06, "epoch": 6.2723449001051526, "percentage": 89.6, "elapsed_time": "6:06:02", "remaining_time": "0:42:27"}
|
| 1194 |
+
{"current_steps": 5970, "total_steps": 6657, "loss": 0.0688, "lr": 1.2875443882330218e-06, "epoch": 6.277602523659306, "percentage": 89.68, "elapsed_time": "6:06:25", "remaining_time": "0:42:09"}
|
| 1195 |
+
{"current_steps": 5975, "total_steps": 6657, "loss": 0.0667, "lr": 1.269097850610066e-06, "epoch": 6.2828601472134595, "percentage": 89.76, "elapsed_time": "6:06:51", "remaining_time": "0:41:52"}
|
| 1196 |
+
{"current_steps": 5980, "total_steps": 6657, "loss": 0.0691, "lr": 1.250780078499747e-06, "epoch": 6.288117770767613, "percentage": 89.83, "elapsed_time": "6:07:08", "remaining_time": "0:41:33"}
|
| 1197 |
+
{"current_steps": 5985, "total_steps": 6657, "loss": 0.0807, "lr": 1.2325911978275196e-06, "epoch": 6.2933753943217665, "percentage": 89.91, "elapsed_time": "6:07:36", "remaining_time": "0:41:16"}
|
| 1198 |
+
{"current_steps": 5990, "total_steps": 6657, "loss": 0.078, "lr": 1.214531333632769e-06, "epoch": 6.29863301787592, "percentage": 89.98, "elapsed_time": "6:07:58", "remaining_time": "0:40:58"}
|
| 1199 |
+
{"current_steps": 5995, "total_steps": 6657, "loss": 0.0848, "lr": 1.1966006100679596e-06, "epoch": 6.3038906414300735, "percentage": 90.06, "elapsed_time": "6:08:15", "remaining_time": "0:40:39"}
|
| 1200 |
+
{"current_steps": 6000, "total_steps": 6657, "loss": 0.0603, "lr": 1.1787991503977846e-06, "epoch": 6.309148264984227, "percentage": 90.13, "elapsed_time": "6:08:34", "remaining_time": "0:40:21"}
|
| 1201 |
+
{"current_steps": 6005, "total_steps": 6657, "loss": 0.073, "lr": 1.1611270769983051e-06, "epoch": 6.3144058885383805, "percentage": 90.21, "elapsed_time": "6:09:51", "remaining_time": "0:40:09"}
|
| 1202 |
+
{"current_steps": 6010, "total_steps": 6657, "loss": 0.0755, "lr": 1.143584511356115e-06, "epoch": 6.319663512092534, "percentage": 90.28, "elapsed_time": "6:10:10", "remaining_time": "0:39:51"}
|
| 1203 |
+
{"current_steps": 6015, "total_steps": 6657, "loss": 0.0581, "lr": 1.1261715740675205e-06, "epoch": 6.3249211356466875, "percentage": 90.36, "elapsed_time": "6:10:27", "remaining_time": "0:39:32"}
|
| 1204 |
+
{"current_steps": 6020, "total_steps": 6657, "loss": 0.0709, "lr": 1.108888384837683e-06, "epoch": 6.330178759200841, "percentage": 90.43, "elapsed_time": "6:10:48", "remaining_time": "0:39:14"}
|
| 1205 |
+
{"current_steps": 6025, "total_steps": 6657, "loss": 0.0711, "lr": 1.0917350624798262e-06, "epoch": 6.335436382754994, "percentage": 90.51, "elapsed_time": "6:11:05", "remaining_time": "0:38:55"}
|
| 1206 |
+
{"current_steps": 6030, "total_steps": 6657, "loss": 0.0859, "lr": 1.07471172491439e-06, "epoch": 6.340694006309148, "percentage": 90.58, "elapsed_time": "6:11:26", "remaining_time": "0:38:37"}
|
| 1207 |
+
{"current_steps": 6035, "total_steps": 6657, "loss": 0.07, "lr": 1.0578184891682408e-06, "epoch": 6.345951629863301, "percentage": 90.66, "elapsed_time": "6:11:49", "remaining_time": "0:38:19"}
|
| 1208 |
+
{"current_steps": 6040, "total_steps": 6657, "loss": 0.0824, "lr": 1.041055471373864e-06, "epoch": 6.351209253417455, "percentage": 90.73, "elapsed_time": "6:12:04", "remaining_time": "0:38:00"}
|
| 1209 |
+
{"current_steps": 6045, "total_steps": 6657, "loss": 0.0768, "lr": 1.0244227867685597e-06, "epoch": 6.356466876971609, "percentage": 90.81, "elapsed_time": "6:12:28", "remaining_time": "0:37:42"}
|
| 1210 |
+
{"current_steps": 6050, "total_steps": 6657, "loss": 0.0818, "lr": 1.0079205496936484e-06, "epoch": 6.361724500525763, "percentage": 90.88, "elapsed_time": "6:12:46", "remaining_time": "0:37:24"}
|
| 1211 |
+
{"current_steps": 6055, "total_steps": 6657, "loss": 0.0665, "lr": 9.915488735936995e-07, "epoch": 6.366982124079916, "percentage": 90.96, "elapsed_time": "6:13:04", "remaining_time": "0:37:05"}
|
| 1212 |
+
{"current_steps": 6060, "total_steps": 6657, "loss": 0.0664, "lr": 9.753078710157316e-07, "epoch": 6.37223974763407, "percentage": 91.03, "elapsed_time": "6:13:18", "remaining_time": "0:36:46"}
|
| 1213 |
+
{"current_steps": 6065, "total_steps": 6657, "loss": 0.0923, "lr": 9.59197653608448e-07, "epoch": 6.377497371188223, "percentage": 91.11, "elapsed_time": "6:13:48", "remaining_time": "0:36:29"}
|
| 1214 |
+
{"current_steps": 6070, "total_steps": 6657, "loss": 0.0619, "lr": 9.432183321214805e-07, "epoch": 6.382754994742377, "percentage": 91.18, "elapsed_time": "6:14:04", "remaining_time": "0:36:10"}
|
| 1215 |
+
{"current_steps": 6075, "total_steps": 6657, "loss": 0.0692, "lr": 9.273700164046162e-07, "epoch": 6.38801261829653, "percentage": 91.26, "elapsed_time": "6:14:30", "remaining_time": "0:35:52"}
|
| 1216 |
+
{"current_steps": 6080, "total_steps": 6657, "loss": 0.068, "lr": 9.11652815407027e-07, "epoch": 6.393270241850684, "percentage": 91.33, "elapsed_time": "6:14:49", "remaining_time": "0:35:34"}
|
| 1217 |
+
{"current_steps": 6085, "total_steps": 6657, "loss": 0.0585, "lr": 8.960668371765569e-07, "epoch": 6.398527865404837, "percentage": 91.41, "elapsed_time": "6:15:06", "remaining_time": "0:35:15"}
|
| 1218 |
+
{"current_steps": 6090, "total_steps": 6657, "loss": 0.0685, "lr": 8.806121888589492e-07, "epoch": 6.403785488958991, "percentage": 91.48, "elapsed_time": "6:15:24", "remaining_time": "0:34:57"}
|
| 1219 |
+
{"current_steps": 6095, "total_steps": 6657, "loss": 0.0724, "lr": 8.652889766971229e-07, "epoch": 6.409043112513144, "percentage": 91.56, "elapsed_time": "6:15:41", "remaining_time": "0:34:38"}
|
| 1220 |
+
{"current_steps": 6100, "total_steps": 6657, "loss": 0.0898, "lr": 8.500973060304374e-07, "epoch": 6.414300736067298, "percentage": 91.63, "elapsed_time": "6:16:09", "remaining_time": "0:34:20"}
|
| 1221 |
+
{"current_steps": 6105, "total_steps": 6657, "loss": 0.0892, "lr": 8.350372812939778e-07, "epoch": 6.419558359621451, "percentage": 91.71, "elapsed_time": "6:16:29", "remaining_time": "0:34:02"}
|
| 1222 |
+
{"current_steps": 6110, "total_steps": 6657, "loss": 0.0545, "lr": 8.201090060178174e-07, "epoch": 6.424815983175605, "percentage": 91.78, "elapsed_time": "6:16:46", "remaining_time": "0:33:43"}
|
| 1223 |
+
{"current_steps": 6115, "total_steps": 6657, "loss": 0.0863, "lr": 8.053125828263297e-07, "epoch": 6.430073606729758, "percentage": 91.86, "elapsed_time": "6:17:03", "remaining_time": "0:33:25"}
|
| 1224 |
+
{"current_steps": 6120, "total_steps": 6657, "loss": 0.067, "lr": 7.906481134374688e-07, "epoch": 6.435331230283912, "percentage": 91.93, "elapsed_time": "6:17:20", "remaining_time": "0:33:06"}
|
| 1225 |
+
{"current_steps": 6125, "total_steps": 6657, "loss": 0.0692, "lr": 7.761156986620677e-07, "epoch": 6.440588853838065, "percentage": 92.01, "elapsed_time": "6:17:35", "remaining_time": "0:32:47"}
|
| 1226 |
+
{"current_steps": 6130, "total_steps": 6657, "loss": 0.0817, "lr": 7.617154384031545e-07, "epoch": 6.445846477392219, "percentage": 92.08, "elapsed_time": "6:17:56", "remaining_time": "0:32:29"}
|
| 1227 |
+
{"current_steps": 6135, "total_steps": 6657, "loss": 0.1023, "lr": 7.474474316552638e-07, "epoch": 6.451104100946372, "percentage": 92.16, "elapsed_time": "6:18:19", "remaining_time": "0:32:11"}
|
| 1228 |
+
{"current_steps": 6140, "total_steps": 6657, "loss": 0.076, "lr": 7.33311776503749e-07, "epoch": 6.456361724500526, "percentage": 92.23, "elapsed_time": "6:18:45", "remaining_time": "0:31:53"}
|
| 1229 |
+
{"current_steps": 6145, "total_steps": 6657, "loss": 0.0697, "lr": 7.193085701241175e-07, "epoch": 6.461619348054679, "percentage": 92.31, "elapsed_time": "6:19:04", "remaining_time": "0:31:35"}
|
| 1230 |
+
{"current_steps": 6150, "total_steps": 6657, "loss": 0.0656, "lr": 7.054379087813568e-07, "epoch": 6.466876971608833, "percentage": 92.38, "elapsed_time": "6:19:20", "remaining_time": "0:31:16"}
|
| 1231 |
+
{"current_steps": 6155, "total_steps": 6657, "loss": 0.0662, "lr": 6.916998878292691e-07, "epoch": 6.472134595162986, "percentage": 92.46, "elapsed_time": "6:19:36", "remaining_time": "0:30:57"}
|
| 1232 |
+
{"current_steps": 6160, "total_steps": 6657, "loss": 0.0564, "lr": 6.780946017098289e-07, "epoch": 6.4773922187171395, "percentage": 92.53, "elapsed_time": "6:19:51", "remaining_time": "0:30:38"}
|
| 1233 |
+
{"current_steps": 6165, "total_steps": 6657, "loss": 0.0816, "lr": 6.646221439525225e-07, "epoch": 6.482649842271293, "percentage": 92.61, "elapsed_time": "6:20:07", "remaining_time": "0:30:20"}
|
| 1234 |
+
{"current_steps": 6170, "total_steps": 6657, "loss": 0.0601, "lr": 6.512826071737021e-07, "epoch": 6.4879074658254465, "percentage": 92.68, "elapsed_time": "6:20:24", "remaining_time": "0:30:01"}
|
| 1235 |
+
{"current_steps": 6175, "total_steps": 6657, "loss": 0.0631, "lr": 6.380760830759669e-07, "epoch": 6.4931650893796, "percentage": 92.76, "elapsed_time": "6:20:40", "remaining_time": "0:29:42"}
|
| 1236 |
+
{"current_steps": 6180, "total_steps": 6657, "loss": 0.0723, "lr": 6.250026624475092e-07, "epoch": 6.498422712933754, "percentage": 92.83, "elapsed_time": "6:20:58", "remaining_time": "0:29:24"}
|
| 1237 |
+
{"current_steps": 6185, "total_steps": 6657, "loss": 0.0669, "lr": 6.12062435161509e-07, "epoch": 6.503680336487907, "percentage": 92.91, "elapsed_time": "6:21:23", "remaining_time": "0:29:06"}
|
| 1238 |
+
{"current_steps": 6190, "total_steps": 6657, "loss": 0.0607, "lr": 5.992554901755121e-07, "epoch": 6.508937960042061, "percentage": 92.98, "elapsed_time": "6:21:43", "remaining_time": "0:28:47"}
|
| 1239 |
+
{"current_steps": 6195, "total_steps": 6657, "loss": 0.061, "lr": 5.865819155308039e-07, "epoch": 6.514195583596215, "percentage": 93.06, "elapsed_time": "6:22:08", "remaining_time": "0:28:29"}
|
| 1240 |
+
{"current_steps": 6200, "total_steps": 6657, "loss": 0.1393, "lr": 5.740417983518253e-07, "epoch": 6.519453207150368, "percentage": 93.14, "elapsed_time": "6:22:44", "remaining_time": "0:28:12"}
|
| 1241 |
+
{"current_steps": 6205, "total_steps": 6657, "loss": 0.0846, "lr": 5.61635224845567e-07, "epoch": 6.524710830704522, "percentage": 93.21, "elapsed_time": "6:24:01", "remaining_time": "0:27:58"}
|
| 1242 |
+
{"current_steps": 6210, "total_steps": 6657, "loss": 0.0701, "lr": 5.493622803009602e-07, "epoch": 6.529968454258675, "percentage": 93.29, "elapsed_time": "6:24:28", "remaining_time": "0:27:40"}
|
| 1243 |
+
{"current_steps": 6215, "total_steps": 6657, "loss": 0.0631, "lr": 5.372230490883246e-07, "epoch": 6.535226077812829, "percentage": 93.36, "elapsed_time": "6:24:46", "remaining_time": "0:27:21"}
|
| 1244 |
+
{"current_steps": 6220, "total_steps": 6657, "loss": 0.1451, "lr": 5.252176146587484e-07, "epoch": 6.540483701366982, "percentage": 93.44, "elapsed_time": "6:25:08", "remaining_time": "0:27:03"}
|
| 1245 |
+
{"current_steps": 6225, "total_steps": 6657, "loss": 0.1257, "lr": 5.133460595435447e-07, "epoch": 6.545741324921136, "percentage": 93.51, "elapsed_time": "6:25:19", "remaining_time": "0:26:44"}
|
| 1246 |
+
{"current_steps": 6230, "total_steps": 6657, "loss": 0.1173, "lr": 5.016084653536756e-07, "epoch": 6.550998948475289, "percentage": 93.59, "elapsed_time": "6:25:31", "remaining_time": "0:26:25"}
|
| 1247 |
+
{"current_steps": 6235, "total_steps": 6657, "loss": 0.119, "lr": 4.900049127791851e-07, "epoch": 6.556256572029443, "percentage": 93.66, "elapsed_time": "6:25:43", "remaining_time": "0:26:06"}
|
| 1248 |
+
{"current_steps": 6240, "total_steps": 6657, "loss": 0.1222, "lr": 4.785354815886445e-07, "epoch": 6.561514195583596, "percentage": 93.74, "elapsed_time": "6:25:56", "remaining_time": "0:25:47"}
|
| 1249 |
+
{"current_steps": 6245, "total_steps": 6657, "loss": 0.117, "lr": 4.6720025062862106e-07, "epoch": 6.56677181913775, "percentage": 93.81, "elapsed_time": "6:26:07", "remaining_time": "0:25:28"}
|
| 1250 |
+
{"current_steps": 6250, "total_steps": 6657, "loss": 0.1207, "lr": 4.559992978231087e-07, "epoch": 6.572029442691903, "percentage": 93.89, "elapsed_time": "6:26:19", "remaining_time": "0:25:09"}
|
| 1251 |
+
{"current_steps": 6255, "total_steps": 6657, "loss": 0.106, "lr": 4.4493270017301305e-07, "epoch": 6.577287066246057, "percentage": 93.96, "elapsed_time": "6:26:31", "remaining_time": "0:24:50"}
|
| 1252 |
+
{"current_steps": 6260, "total_steps": 6657, "loss": 0.125, "lr": 4.340005337556186e-07, "epoch": 6.58254468980021, "percentage": 94.04, "elapsed_time": "6:26:42", "remaining_time": "0:24:31"}
|
| 1253 |
+
{"current_steps": 6265, "total_steps": 6657, "loss": 0.1218, "lr": 4.232028737240623e-07, "epoch": 6.587802313354364, "percentage": 94.11, "elapsed_time": "6:26:54", "remaining_time": "0:24:12"}
|
| 1254 |
+
{"current_steps": 6270, "total_steps": 6657, "loss": 0.1125, "lr": 4.125397943068099e-07, "epoch": 6.593059936908517, "percentage": 94.19, "elapsed_time": "6:27:06", "remaining_time": "0:23:53"}
|
| 1255 |
+
{"current_steps": 6275, "total_steps": 6657, "loss": 0.1148, "lr": 4.0201136880716027e-07, "epoch": 6.598317560462671, "percentage": 94.26, "elapsed_time": "6:27:19", "remaining_time": "0:23:34"}
|
| 1256 |
+
{"current_steps": 6280, "total_steps": 6657, "loss": 0.1107, "lr": 3.9161766960273517e-07, "epoch": 6.603575184016824, "percentage": 94.34, "elapsed_time": "6:27:31", "remaining_time": "0:23:15"}
|
| 1257 |
+
{"current_steps": 6285, "total_steps": 6657, "loss": 0.1019, "lr": 3.8135876814497927e-07, "epoch": 6.608832807570978, "percentage": 94.41, "elapsed_time": "6:27:42", "remaining_time": "0:22:56"}
|
| 1258 |
+
{"current_steps": 6290, "total_steps": 6657, "loss": 0.1031, "lr": 3.7123473495866314e-07, "epoch": 6.614090431125131, "percentage": 94.49, "elapsed_time": "6:27:54", "remaining_time": "0:22:37"}
|
| 1259 |
+
{"current_steps": 6295, "total_steps": 6657, "loss": 0.1085, "lr": 3.61245639641421e-07, "epoch": 6.619348054679285, "percentage": 94.56, "elapsed_time": "6:28:06", "remaining_time": "0:22:19"}
|
| 1260 |
+
{"current_steps": 6300, "total_steps": 6657, "loss": 0.1172, "lr": 3.513915508632448e-07, "epoch": 6.624605678233438, "percentage": 94.64, "elapsed_time": "6:28:18", "remaining_time": "0:22:00"}
|
| 1261 |
+
{"current_steps": 6305, "total_steps": 6657, "loss": 0.1172, "lr": 3.4167253636602893e-07, "epoch": 6.629863301787592, "percentage": 94.71, "elapsed_time": "6:28:31", "remaining_time": "0:21:41"}
|
| 1262 |
+
{"current_steps": 6310, "total_steps": 6657, "loss": 0.1164, "lr": 3.3208866296310147e-07, "epoch": 6.635120925341745, "percentage": 94.79, "elapsed_time": "6:28:43", "remaining_time": "0:21:22"}
|
| 1263 |
+
{"current_steps": 6315, "total_steps": 6657, "loss": 0.1112, "lr": 3.2263999653876057e-07, "epoch": 6.6403785488958995, "percentage": 94.86, "elapsed_time": "6:28:55", "remaining_time": "0:21:03"}
|
| 1264 |
+
{"current_steps": 6320, "total_steps": 6657, "loss": 0.1145, "lr": 3.133266020478254e-07, "epoch": 6.645636172450052, "percentage": 94.94, "elapsed_time": "6:29:09", "remaining_time": "0:20:45"}
|
| 1265 |
+
{"current_steps": 6325, "total_steps": 6657, "loss": 0.1119, "lr": 3.0414854351519476e-07, "epoch": 6.6508937960042065, "percentage": 95.01, "elapsed_time": "6:29:23", "remaining_time": "0:20:26"}
|
| 1266 |
+
{"current_steps": 6330, "total_steps": 6657, "loss": 0.1066, "lr": 2.951058840353893e-07, "epoch": 6.65615141955836, "percentage": 95.09, "elapsed_time": "6:29:35", "remaining_time": "0:20:07"}
|
| 1267 |
+
{"current_steps": 6335, "total_steps": 6657, "loss": 0.1041, "lr": 2.861986857721388e-07, "epoch": 6.6614090431125135, "percentage": 95.16, "elapsed_time": "6:29:46", "remaining_time": "0:19:48"}
|
| 1268 |
+
{"current_steps": 6340, "total_steps": 6657, "loss": 0.105, "lr": 2.7742700995794457e-07, "epoch": 6.666666666666667, "percentage": 95.24, "elapsed_time": "6:29:58", "remaining_time": "0:19:29"}
|
| 1269 |
+
{"current_steps": 6345, "total_steps": 6657, "loss": 0.1122, "lr": 2.687909168936509e-07, "epoch": 6.6719242902208205, "percentage": 95.31, "elapsed_time": "6:30:10", "remaining_time": "0:19:11"}
|
| 1270 |
+
{"current_steps": 6350, "total_steps": 6657, "loss": 0.1108, "lr": 2.6029046594805206e-07, "epoch": 6.677181913774974, "percentage": 95.39, "elapsed_time": "6:30:22", "remaining_time": "0:18:52"}
|
| 1271 |
+
{"current_steps": 6355, "total_steps": 6657, "loss": 0.1168, "lr": 2.519257155574617e-07, "epoch": 6.682439537329127, "percentage": 95.46, "elapsed_time": "6:30:36", "remaining_time": "0:18:33"}
|
| 1272 |
+
{"current_steps": 6360, "total_steps": 6657, "loss": 0.1055, "lr": 2.436967232253218e-07, "epoch": 6.687697160883281, "percentage": 95.54, "elapsed_time": "6:30:48", "remaining_time": "0:18:15"}
|
| 1273 |
+
{"current_steps": 6365, "total_steps": 6657, "loss": 0.1104, "lr": 2.3560354552180976e-07, "epoch": 6.692954784437434, "percentage": 95.61, "elapsed_time": "6:31:00", "remaining_time": "0:17:56"}
|
| 1274 |
+
{"current_steps": 6370, "total_steps": 6657, "loss": 0.1056, "lr": 2.27646238083441e-07, "epoch": 6.698212407991588, "percentage": 95.69, "elapsed_time": "6:31:11", "remaining_time": "0:17:37"}
|
| 1275 |
+
{"current_steps": 6375, "total_steps": 6657, "loss": 0.1101, "lr": 2.1982485561269805e-07, "epoch": 6.703470031545741, "percentage": 95.76, "elapsed_time": "6:31:23", "remaining_time": "0:17:18"}
|
| 1276 |
+
{"current_steps": 6380, "total_steps": 6657, "loss": 0.1048, "lr": 2.1213945187763764e-07, "epoch": 6.708727655099895, "percentage": 95.84, "elapsed_time": "6:31:35", "remaining_time": "0:17:00"}
|
| 1277 |
+
{"current_steps": 6385, "total_steps": 6657, "loss": 0.1027, "lr": 2.0459007971154632e-07, "epoch": 6.713985278654048, "percentage": 95.91, "elapsed_time": "6:31:46", "remaining_time": "0:16:41"}
|
| 1278 |
+
{"current_steps": 6390, "total_steps": 6657, "loss": 0.0966, "lr": 1.9717679101254549e-07, "epoch": 6.719242902208202, "percentage": 95.99, "elapsed_time": "6:31:58", "remaining_time": "0:16:22"}
|
| 1279 |
+
{"current_steps": 6395, "total_steps": 6657, "loss": 0.1106, "lr": 1.898996367432604e-07, "epoch": 6.724500525762355, "percentage": 96.06, "elapsed_time": "6:32:10", "remaining_time": "0:16:04"}
|
| 1280 |
+
{"current_steps": 6400, "total_steps": 6657, "loss": 0.1019, "lr": 1.8275866693046263e-07, "epoch": 6.729758149316509, "percentage": 96.14, "elapsed_time": "6:32:22", "remaining_time": "0:15:45"}
|
| 1281 |
+
{"current_steps": 6405, "total_steps": 6657, "loss": 0.1077, "lr": 1.7575393066471714e-07, "epoch": 6.735015772870662, "percentage": 96.21, "elapsed_time": "6:33:28", "remaining_time": "0:15:28"}
|
| 1282 |
+
{"current_steps": 6410, "total_steps": 6657, "loss": 0.1047, "lr": 1.6888547610005802e-07, "epoch": 6.740273396424816, "percentage": 96.29, "elapsed_time": "6:33:40", "remaining_time": "0:15:10"}
|
| 1283 |
+
{"current_steps": 6415, "total_steps": 6657, "loss": 0.1105, "lr": 1.6215335045364656e-07, "epoch": 6.745531019978969, "percentage": 96.36, "elapsed_time": "6:33:52", "remaining_time": "0:14:51"}
|
| 1284 |
+
{"current_steps": 6420, "total_steps": 6657, "loss": 0.1003, "lr": 1.5555760000545595e-07, "epoch": 6.750788643533123, "percentage": 96.44, "elapsed_time": "6:34:03", "remaining_time": "0:14:32"}
|
| 1285 |
+
{"current_steps": 6425, "total_steps": 6657, "loss": 0.1121, "lr": 1.4909827009794486e-07, "epoch": 6.756046267087276, "percentage": 96.51, "elapsed_time": "6:34:16", "remaining_time": "0:14:14"}
|
| 1286 |
+
{"current_steps": 6430, "total_steps": 6657, "loss": 0.1122, "lr": 1.4277540513575328e-07, "epoch": 6.76130389064143, "percentage": 96.59, "elapsed_time": "6:34:30", "remaining_time": "0:13:55"}
|
| 1287 |
+
{"current_steps": 6435, "total_steps": 6657, "loss": 0.115, "lr": 1.3658904858538936e-07, "epoch": 6.766561514195583, "percentage": 96.67, "elapsed_time": "6:34:42", "remaining_time": "0:13:37"}
|
| 1288 |
+
{"current_steps": 6440, "total_steps": 6657, "loss": 0.1027, "lr": 1.3053924297493858e-07, "epoch": 6.771819137749737, "percentage": 96.74, "elapsed_time": "6:34:54", "remaining_time": "0:13:18"}
|
| 1289 |
+
{"current_steps": 6445, "total_steps": 6657, "loss": 0.1086, "lr": 1.2462602989376404e-07, "epoch": 6.77707676130389, "percentage": 96.82, "elapsed_time": "6:35:06", "remaining_time": "0:12:59"}
|
| 1290 |
+
{"current_steps": 6450, "total_steps": 6657, "loss": 0.1055, "lr": 1.1884944999222658e-07, "epoch": 6.782334384858045, "percentage": 96.89, "elapsed_time": "6:35:18", "remaining_time": "0:12:41"}
|
| 1291 |
+
{"current_steps": 6455, "total_steps": 6657, "loss": 0.0982, "lr": 1.1320954298140063e-07, "epoch": 6.787592008412197, "percentage": 96.97, "elapsed_time": "6:35:29", "remaining_time": "0:12:22"}
|
| 1292 |
+
{"current_steps": 6460, "total_steps": 6657, "loss": 0.1, "lr": 1.0770634763280552e-07, "epoch": 6.792849631966352, "percentage": 97.04, "elapsed_time": "6:35:41", "remaining_time": "0:12:03"}
|
| 1293 |
+
{"current_steps": 6465, "total_steps": 6657, "loss": 0.1051, "lr": 1.023399017781368e-07, "epoch": 6.798107255520505, "percentage": 97.12, "elapsed_time": "6:35:53", "remaining_time": "0:11:45"}
|
| 1294 |
+
{"current_steps": 6470, "total_steps": 6657, "loss": 0.1131, "lr": 9.711024230900423e-08, "epoch": 6.803364879074659, "percentage": 97.19, "elapsed_time": "6:36:05", "remaining_time": "0:11:26"}
|
| 1295 |
+
{"current_steps": 6475, "total_steps": 6657, "loss": 0.11, "lr": 9.201740517668089e-08, "epoch": 6.808622502628812, "percentage": 97.27, "elapsed_time": "6:36:17", "remaining_time": "0:11:08"}
|
| 1296 |
+
{"current_steps": 6480, "total_steps": 6657, "loss": 0.1004, "lr": 8.706142539185447e-08, "epoch": 6.813880126182966, "percentage": 97.34, "elapsed_time": "6:36:29", "remaining_time": "0:10:49"}
|
| 1297 |
+
{"current_steps": 6485, "total_steps": 6657, "loss": 0.1107, "lr": 8.224233702438966e-08, "epoch": 6.819137749737119, "percentage": 97.42, "elapsed_time": "6:36:41", "remaining_time": "0:10:31"}
|
| 1298 |
+
{"current_steps": 6490, "total_steps": 6657, "loss": 0.1042, "lr": 7.756017320309283e-08, "epoch": 6.8243953732912725, "percentage": 97.49, "elapsed_time": "6:36:54", "remaining_time": "0:10:12"}
|
| 1299 |
+
{"current_steps": 6495, "total_steps": 6657, "loss": 0.1061, "lr": 7.301496611547665e-08, "epoch": 6.829652996845426, "percentage": 97.57, "elapsed_time": "6:37:05", "remaining_time": "0:09:54"}
|
| 1300 |
+
{"current_steps": 6500, "total_steps": 6657, "loss": 0.108, "lr": 6.86067470075491e-08, "epoch": 6.8349106203995795, "percentage": 97.64, "elapsed_time": "6:37:17", "remaining_time": "0:09:35"}
|
| 1301 |
+
{"current_steps": 6505, "total_steps": 6657, "loss": 0.0954, "lr": 6.433554618359816e-08, "epoch": 6.840168243953733, "percentage": 97.72, "elapsed_time": "6:37:29", "remaining_time": "0:09:17"}
|
| 1302 |
+
{"current_steps": 6510, "total_steps": 6657, "loss": 0.1032, "lr": 6.020139300597638e-08, "epoch": 6.8454258675078865, "percentage": 97.79, "elapsed_time": "6:37:41", "remaining_time": "0:08:58"}
|
| 1303 |
+
{"current_steps": 6515, "total_steps": 6657, "loss": 0.1145, "lr": 5.620431589490105e-08, "epoch": 6.85068349106204, "percentage": 97.87, "elapsed_time": "6:37:54", "remaining_time": "0:08:40"}
|
| 1304 |
+
{"current_steps": 6520, "total_steps": 6657, "loss": 0.0984, "lr": 5.234434232826324e-08, "epoch": 6.8559411146161935, "percentage": 97.94, "elapsed_time": "6:38:06", "remaining_time": "0:08:21"}
|
| 1305 |
+
{"current_steps": 6525, "total_steps": 6657, "loss": 0.1082, "lr": 4.862149884143907e-08, "epoch": 6.861198738170347, "percentage": 98.02, "elapsed_time": "6:38:17", "remaining_time": "0:08:03"}
|
| 1306 |
+
{"current_steps": 6530, "total_steps": 6657, "loss": 0.1081, "lr": 4.503581102709875e-08, "epoch": 6.8664563617245005, "percentage": 98.09, "elapsed_time": "6:38:29", "remaining_time": "0:07:45"}
|
| 1307 |
+
{"current_steps": 6535, "total_steps": 6657, "loss": 0.1047, "lr": 4.1587303535040035e-08, "epoch": 6.871713985278654, "percentage": 98.17, "elapsed_time": "6:38:41", "remaining_time": "0:07:26"}
|
| 1308 |
+
{"current_steps": 6540, "total_steps": 6657, "loss": 0.1091, "lr": 3.827600007201282e-08, "epoch": 6.8769716088328074, "percentage": 98.24, "elapsed_time": "6:38:53", "remaining_time": "0:07:08"}
|
| 1309 |
+
{"current_steps": 6545, "total_steps": 6657, "loss": 0.1053, "lr": 3.510192340156149e-08, "epoch": 6.882229232386961, "percentage": 98.32, "elapsed_time": "6:39:04", "remaining_time": "0:06:49"}
|
| 1310 |
+
{"current_steps": 6550, "total_steps": 6657, "loss": 0.1064, "lr": 3.20650953438606e-08, "epoch": 6.887486855941114, "percentage": 98.39, "elapsed_time": "6:39:16", "remaining_time": "0:06:31"}
|
| 1311 |
+
{"current_steps": 6555, "total_steps": 6657, "loss": 0.102, "lr": 2.9165536775574987e-08, "epoch": 6.892744479495268, "percentage": 98.47, "elapsed_time": "6:39:27", "remaining_time": "0:06:12"}
|
| 1312 |
+
{"current_steps": 6560, "total_steps": 6657, "loss": 0.1126, "lr": 2.6403267629706575e-08, "epoch": 6.898002103049421, "percentage": 98.54, "elapsed_time": "6:39:39", "remaining_time": "0:05:54"}
|
| 1313 |
+
{"current_steps": 6565, "total_steps": 6657, "loss": 0.1092, "lr": 2.3778306895467785e-08, "epoch": 6.903259726603575, "percentage": 98.62, "elapsed_time": "6:39:53", "remaining_time": "0:05:36"}
|
| 1314 |
+
{"current_steps": 6570, "total_steps": 6657, "loss": 0.1128, "lr": 2.1290672618135e-08, "epoch": 6.908517350157728, "percentage": 98.69, "elapsed_time": "6:40:05", "remaining_time": "0:05:17"}
|
| 1315 |
+
{"current_steps": 6575, "total_steps": 6657, "loss": 0.1009, "lr": 1.8940381898946424e-08, "epoch": 6.913774973711882, "percentage": 98.77, "elapsed_time": "6:40:17", "remaining_time": "0:04:59"}
|
| 1316 |
+
{"current_steps": 6580, "total_steps": 6657, "loss": 0.1014, "lr": 1.6727450894959973e-08, "epoch": 6.919032597266035, "percentage": 98.84, "elapsed_time": "6:40:29", "remaining_time": "0:04:41"}
|
| 1317 |
+
{"current_steps": 6585, "total_steps": 6657, "loss": 0.1078, "lr": 1.4651894818966671e-08, "epoch": 6.92429022082019, "percentage": 98.92, "elapsed_time": "6:40:41", "remaining_time": "0:04:22"}
|
| 1318 |
+
{"current_steps": 6590, "total_steps": 6657, "loss": 0.1033, "lr": 1.2713727939364096e-08, "epoch": 6.929547844374342, "percentage": 98.99, "elapsed_time": "6:40:52", "remaining_time": "0:04:04"}
|
| 1319 |
+
{"current_steps": 6595, "total_steps": 6657, "loss": 0.1033, "lr": 1.091296358007643e-08, "epoch": 6.934805467928497, "percentage": 99.07, "elapsed_time": "6:41:04", "remaining_time": "0:03:46"}
|
| 1320 |
+
{"current_steps": 6600, "total_steps": 6657, "loss": 0.1032, "lr": 9.249614120450113e-09, "epoch": 6.94006309148265, "percentage": 99.14, "elapsed_time": "6:41:16", "remaining_time": "0:03:27"}
|
| 1321 |
+
{"current_steps": 6605, "total_steps": 6657, "loss": 0.1197, "lr": 7.723690995171673e-09, "epoch": 6.945320715036804, "percentage": 99.22, "elapsed_time": "6:42:23", "remaining_time": "0:03:10"}
|
| 1322 |
+
{"current_steps": 6610, "total_steps": 6657, "loss": 0.1035, "lr": 6.335204694196684e-09, "epoch": 6.950578338590957, "percentage": 99.29, "elapsed_time": "6:42:34", "remaining_time": "0:02:51"}
|
| 1323 |
+
{"current_steps": 6615, "total_steps": 6657, "loss": 0.1023, "lr": 5.084164762667598e-09, "epoch": 6.955835962145111, "percentage": 99.37, "elapsed_time": "6:42:48", "remaining_time": "0:02:33"}
|
| 1324 |
+
{"current_steps": 6620, "total_steps": 6657, "loss": 0.107, "lr": 3.970579800853802e-09, "epoch": 6.961093585699264, "percentage": 99.44, "elapsed_time": "6:43:04", "remaining_time": "0:02:15"}
|
| 1325 |
+
{"current_steps": 6625, "total_steps": 6657, "loss": 0.0962, "lr": 2.9944574640894398e-09, "epoch": 6.966351209253418, "percentage": 99.52, "elapsed_time": "6:43:15", "remaining_time": "0:01:56"}
|
| 1326 |
+
{"current_steps": 6630, "total_steps": 6657, "loss": 0.1073, "lr": 2.1558044627267847e-09, "epoch": 6.971608832807571, "percentage": 99.59, "elapsed_time": "6:43:27", "remaining_time": "0:01:38"}
|
| 1327 |
+
{"current_steps": 6635, "total_steps": 6657, "loss": 0.1016, "lr": 1.4546265620785094e-09, "epoch": 6.976866456361725, "percentage": 99.67, "elapsed_time": "6:43:39", "remaining_time": "0:01:20"}
|
| 1328 |
+
{"current_steps": 6640, "total_steps": 6657, "loss": 0.1103, "lr": 8.909285823910374e-10, "epoch": 6.982124079915878, "percentage": 99.74, "elapsed_time": "6:43:51", "remaining_time": "0:01:02"}
|
| 1329 |
+
{"current_steps": 6645, "total_steps": 6657, "loss": 0.1066, "lr": 4.647143988067981e-10, "epoch": 6.987381703470032, "percentage": 99.82, "elapsed_time": "6:44:03", "remaining_time": "0:00:43"}
|
| 1330 |
+
{"current_steps": 6650, "total_steps": 6657, "loss": 0.1026, "lr": 1.7598694132869853e-10, "epoch": 6.992639327024185, "percentage": 99.89, "elapsed_time": "6:44:15", "remaining_time": "0:00:25"}
|
| 1331 |
+
{"current_steps": 6655, "total_steps": 6657, "loss": 0.1053, "lr": 2.474819481568247e-11, "epoch": 6.997896950578339, "percentage": 99.97, "elapsed_time": "6:44:27", "remaining_time": "0:00:07"}
|