File size: 15,480 Bytes
9fbd070
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
df772d3
 
 
 
 
 
 
 
 
 
9fbd070
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
2024-04-11 12:24:41,039 INFO    StreamThr :1468 [internal.py:wandb_internal():86] W&B internal server running at pid: 1468, started at: 2024-04-11 12:24:41.038680
2024-04-11 12:24:41,041 DEBUG   HandlerThread:1468 [handler.py:handle_request():146] handle_request: status
2024-04-11 12:24:41,424 INFO    WriterThread:1468 [datastore.py:open_for_write():87] open: /kaggle/working/wandb/run-20240411_122441-3wiow4fo/run-3wiow4fo.wandb
2024-04-11 12:24:41,425 DEBUG   SenderThread:1468 [sender.py:send():379] send: header
2024-04-11 12:24:41,433 DEBUG   SenderThread:1468 [sender.py:send():379] send: run
2024-04-11 12:24:41,575 INFO    SenderThread:1468 [dir_watcher.py:__init__():211] watching files in: /kaggle/working/wandb/run-20240411_122441-3wiow4fo/files
2024-04-11 12:24:41,575 INFO    SenderThread:1468 [sender.py:_start_run_threads():1124] run started: 3wiow4fo with start time 1712838281.038479
2024-04-11 12:24:41,584 DEBUG   HandlerThread:1468 [handler.py:handle_request():146] handle_request: check_version
2024-04-11 12:24:41,584 DEBUG   SenderThread:1468 [sender.py:send_request():406] send_request: check_version
2024-04-11 12:24:41,681 DEBUG   HandlerThread:1468 [handler.py:handle_request():146] handle_request: run_start
2024-04-11 12:24:41,695 DEBUG   HandlerThread:1468 [system_info.py:__init__():26] System info init
2024-04-11 12:24:41,695 DEBUG   HandlerThread:1468 [system_info.py:__init__():41] System info init done
2024-04-11 12:24:41,695 INFO    HandlerThread:1468 [system_monitor.py:start():194] Starting system monitor
2024-04-11 12:24:41,695 INFO    SystemMonitor:1468 [system_monitor.py:_start():158] Starting system asset monitoring threads
2024-04-11 12:24:41,696 INFO    HandlerThread:1468 [system_monitor.py:probe():214] Collecting system info
2024-04-11 12:24:41,696 INFO    SystemMonitor:1468 [interfaces.py:start():190] Started cpu monitoring
2024-04-11 12:24:41,697 INFO    SystemMonitor:1468 [interfaces.py:start():190] Started disk monitoring
2024-04-11 12:24:41,699 INFO    SystemMonitor:1468 [interfaces.py:start():190] Started gpu monitoring
2024-04-11 12:24:41,700 INFO    SystemMonitor:1468 [interfaces.py:start():190] Started memory monitoring
2024-04-11 12:24:41,701 INFO    SystemMonitor:1468 [interfaces.py:start():190] Started network monitoring
2024-04-11 12:24:41,715 DEBUG   HandlerThread:1468 [system_info.py:probe():150] Probing system
2024-04-11 12:24:41,717 DEBUG   HandlerThread:1468 [gitlib.py:_init_repo():56] git repository is invalid
2024-04-11 12:24:41,717 DEBUG   HandlerThread:1468 [system_info.py:probe():198] Probing system done
2024-04-11 12:24:41,717 DEBUG   HandlerThread:1468 [system_monitor.py:probe():223] {'os': 'Linux-5.15.133+-x86_64-with-glibc2.31', 'python': '3.10.13', 'heartbeatAt': '2024-04-11T12:24:41.715654', 'startedAt': '2024-04-11T12:24:41.032425', 'docker': None, 'cuda': None, 'args': (), 'state': 'running', 'program': 'kaggle.ipynb', 'codePathLocal': None, 'root': '/kaggle/working', 'host': '55cd788bb41a', 'username': 'root', 'executable': '/opt/conda/bin/python3.10', 'cpu_count': 2, 'cpu_count_logical': 4, 'cpu_freq': {'current': 2000.18, 'min': 0.0, 'max': 0.0}, 'cpu_freq_per_core': [{'current': 2000.18, 'min': 0.0, 'max': 0.0}, {'current': 2000.18, 'min': 0.0, 'max': 0.0}, {'current': 2000.18, 'min': 0.0, 'max': 0.0}, {'current': 2000.18, 'min': 0.0, 'max': 0.0}], 'disk': {'/': {'total': 8062.387607574463, 'used': 5571.194152832031}}, 'gpu': 'Tesla T4', 'gpu_count': 2, 'gpu_devices': [{'name': 'Tesla T4', 'memory_total': 16106127360}, {'name': 'Tesla T4', 'memory_total': 16106127360}], 'memory': {'total': 31.357559204101562}}
2024-04-11 12:24:41,717 INFO    HandlerThread:1468 [system_monitor.py:probe():224] Finished collecting system info
2024-04-11 12:24:41,718 INFO    HandlerThread:1468 [system_monitor.py:probe():227] Publishing system info
2024-04-11 12:24:41,718 DEBUG   HandlerThread:1468 [system_info.py:_save_conda():207] Saving list of conda packages installed into the current environment
2024-04-11 12:24:42,577 INFO    Thread-12 :1468 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240411_122441-3wiow4fo/files/conda-environment.yaml
2024-04-11 12:24:56,733 ERROR   HandlerThread:1468 [system_info.py:_save_conda():221] Error saving conda packages: Command '['conda', 'env', 'export']' timed out after 15 seconds
Traceback (most recent call last):
  File "/opt/conda/lib/python3.10/site-packages/wandb/sdk/internal/system/system_info.py", line 214, in _save_conda
    subprocess.call(
  File "/opt/conda/lib/python3.10/subprocess.py", line 347, in call
    return p.wait(timeout=timeout)
  File "/opt/conda/lib/python3.10/subprocess.py", line 1209, in wait
    return self._wait(timeout=timeout)
  File "/opt/conda/lib/python3.10/subprocess.py", line 1951, in _wait
    raise TimeoutExpired(self.args, timeout)
subprocess.TimeoutExpired: Command '['conda', 'env', 'export']' timed out after 15 seconds
2024-04-11 12:24:56,734 DEBUG   HandlerThread:1468 [system_info.py:_save_conda():222] Saving conda packages done
2024-04-11 12:24:56,734 INFO    HandlerThread:1468 [system_monitor.py:probe():229] Finished publishing system info
2024-04-11 12:24:56,739 DEBUG   HandlerThread:1468 [handler.py:handle_request():146] handle_request: status_report
2024-04-11 12:24:56,739 DEBUG   HandlerThread:1468 [handler.py:handle_request():146] handle_request: keepalive
2024-04-11 12:24:56,739 DEBUG   HandlerThread:1468 [handler.py:handle_request():146] handle_request: status_report
2024-04-11 12:24:56,739 DEBUG   HandlerThread:1468 [handler.py:handle_request():146] handle_request: keepalive
2024-04-11 12:24:56,740 DEBUG   HandlerThread:1468 [handler.py:handle_request():146] handle_request: status_report
2024-04-11 12:24:56,740 DEBUG   HandlerThread:1468 [handler.py:handle_request():146] handle_request: keepalive
2024-04-11 12:24:56,740 DEBUG   SenderThread:1468 [sender.py:send():379] send: files
2024-04-11 12:24:56,740 INFO    SenderThread:1468 [sender.py:_save_file():1390] saving file wandb-metadata.json with policy now
2024-04-11 12:24:56,944 INFO    wandb-upload_0:1468 [upload_job.py:push():131] Uploaded file /tmp/tmp67awq2ybwandb/dyzxjezo-wandb-metadata.json
2024-04-11 12:24:57,580 INFO    Thread-12 :1468 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240411_122441-3wiow4fo/files/wandb-metadata.json
2024-04-11 12:24:57,688 DEBUG   HandlerThread:1468 [handler.py:handle_request():146] handle_request: python_packages
2024-04-11 12:24:57,689 DEBUG   SenderThread:1468 [sender.py:send_request():406] send_request: python_packages
2024-04-11 12:24:57,691 DEBUG   SenderThread:1468 [sender.py:send():379] send: telemetry
2024-04-11 12:24:57,701 DEBUG   SenderThread:1468 [sender.py:send():379] send: config
2024-04-11 12:24:57,703 DEBUG   SenderThread:1468 [sender.py:send():379] send: metric
2024-04-11 12:24:57,704 DEBUG   SenderThread:1468 [sender.py:send():379] send: telemetry
2024-04-11 12:24:57,704 DEBUG   SenderThread:1468 [sender.py:send():379] send: metric
2024-04-11 12:24:57,704 WARNING SenderThread:1468 [sender.py:send_metric():1341] Seen metric with glob (shouldn't happen)
2024-04-11 12:24:57,704 DEBUG   SenderThread:1468 [sender.py:send():379] send: telemetry
2024-04-11 12:24:57,704 DEBUG   HandlerThread:1468 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-11 12:24:57,705 DEBUG   HandlerThread:1468 [handler.py:handle_request():146] handle_request: stop_status
2024-04-11 12:24:57,706 DEBUG   SenderThread:1468 [sender.py:send_request():406] send_request: stop_status
2024-04-11 12:24:58,581 INFO    Thread-12 :1468 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240411_122441-3wiow4fo/files/output.log
2024-04-11 12:24:58,581 INFO    Thread-12 :1468 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240411_122441-3wiow4fo/files/requirements.txt
2024-04-11 12:25:00,581 INFO    Thread-12 :1468 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240411_122441-3wiow4fo/files/output.log
2024-04-11 12:25:01,973 DEBUG   HandlerThread:1468 [handler.py:handle_request():146] handle_request: status_report
2024-04-11 12:25:02,582 INFO    Thread-12 :1468 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240411_122441-3wiow4fo/files/output.log
2024-04-11 12:25:06,974 DEBUG   HandlerThread:1468 [handler.py:handle_request():146] handle_request: status_report
2024-04-11 12:25:11,989 DEBUG   HandlerThread:1468 [handler.py:handle_request():146] handle_request: status_report
2024-04-11 12:25:12,586 INFO    Thread-12 :1468 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240411_122441-3wiow4fo/files/config.yaml
2024-04-11 12:25:12,691 DEBUG   HandlerThread:1468 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-11 12:25:12,691 DEBUG   HandlerThread:1468 [handler.py:handle_request():146] handle_request: stop_status
2024-04-11 12:25:12,692 DEBUG   SenderThread:1468 [sender.py:send_request():406] send_request: stop_status
2024-04-11 12:25:14,587 INFO    Thread-12 :1468 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240411_122441-3wiow4fo/files/output.log
2024-04-11 12:25:17,774 DEBUG   HandlerThread:1468 [handler.py:handle_request():146] handle_request: status_report
2024-04-11 12:25:22,589 INFO    Thread-12 :1468 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240411_122441-3wiow4fo/files/output.log
2024-04-11 12:25:22,814 DEBUG   HandlerThread:1468 [handler.py:handle_request():146] handle_request: status_report
2024-04-11 12:25:27,691 DEBUG   HandlerThread:1468 [handler.py:handle_request():146] handle_request: stop_status
2024-04-11 12:25:27,691 DEBUG   HandlerThread:1468 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-11 12:25:27,691 DEBUG   SenderThread:1468 [sender.py:send_request():406] send_request: stop_status
2024-04-11 12:25:28,752 DEBUG   HandlerThread:1468 [handler.py:handle_request():146] handle_request: status_report
2024-04-11 12:25:33,068 DEBUG   SenderThread:1468 [sender.py:send():379] send: telemetry
2024-04-11 12:25:33,108 DEBUG   HandlerThread:1468 [handler.py:handle_request():146] handle_request: partial_history
2024-04-11 12:25:33,110 DEBUG   SenderThread:1468 [sender.py:send_request():406] send_request: summary_record
2024-04-11 12:25:33,112 INFO    SenderThread:1468 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
2024-04-11 12:25:33,112 DEBUG   SenderThread:1468 [sender.py:send_request():406] send_request: summary_record
2024-04-11 12:25:33,112 INFO    SenderThread:1468 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
2024-04-11 12:25:33,113 DEBUG   SenderThread:1468 [sender.py:send_request():406] send_request: summary_record
2024-04-11 12:25:33,113 INFO    SenderThread:1468 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
2024-04-11 12:25:33,113 DEBUG   SenderThread:1468 [sender.py:send_request():406] send_request: summary_record
2024-04-11 12:25:33,113 INFO    SenderThread:1468 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
2024-04-11 12:25:33,114 DEBUG   SenderThread:1468 [sender.py:send_request():406] send_request: summary_record
2024-04-11 12:25:33,114 INFO    SenderThread:1468 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
2024-04-11 12:25:33,114 DEBUG   SenderThread:1468 [sender.py:send():379] send: metric
2024-04-11 12:25:33,114 DEBUG   SenderThread:1468 [sender.py:send():379] send: history
2024-04-11 12:25:33,114 DEBUG   SenderThread:1468 [sender.py:send_request():406] send_request: summary_record
2024-04-11 12:25:33,114 INFO    SenderThread:1468 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
2024-04-11 12:25:33,594 INFO    Thread-12 :1468 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240411_122441-3wiow4fo/files/wandb-summary.json
2024-04-11 12:25:34,249 DEBUG   HandlerThread:1468 [handler.py:handle_request():146] handle_request: status_report
2024-04-11 12:25:34,594 INFO    Thread-12 :1468 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240411_122441-3wiow4fo/files/output.log
2024-04-11 12:25:39,250 DEBUG   HandlerThread:1468 [handler.py:handle_request():146] handle_request: status_report
2024-04-11 12:25:41,701 DEBUG   SystemMonitor:1468 [system_monitor.py:_start():172] Starting system metrics aggregation loop
2024-04-11 12:25:41,703 DEBUG   SenderThread:1468 [sender.py:send():379] send: stats
2024-04-11 12:25:42,689 DEBUG   HandlerThread:1468 [handler.py:handle_request():146] handle_request: stop_status
2024-04-11 12:25:42,690 DEBUG   HandlerThread:1468 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-11 12:25:42,690 DEBUG   SenderThread:1468 [sender.py:send_request():406] send_request: stop_status
2024-04-11 12:25:44,768 DEBUG   HandlerThread:1468 [handler.py:handle_request():146] handle_request: status_report
2024-04-11 12:25:45,599 INFO    Thread-12 :1468 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240411_122441-3wiow4fo/files/config.yaml
2024-04-11 12:25:49,891 DEBUG   HandlerThread:1468 [handler.py:handle_request():146] handle_request: status_report
2024-04-11 12:25:54,892 DEBUG   HandlerThread:1468 [handler.py:handle_request():146] handle_request: status_report
2024-04-11 12:25:57,689 DEBUG   HandlerThread:1468 [handler.py:handle_request():146] handle_request: stop_status
2024-04-11 12:25:57,690 DEBUG   SenderThread:1468 [sender.py:send_request():406] send_request: stop_status
2024-04-11 12:25:57,726 DEBUG   HandlerThread:1468 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-11 12:26:00,727 DEBUG   HandlerThread:1468 [handler.py:handle_request():146] handle_request: status_report
2024-04-11 12:26:05,729 DEBUG   HandlerThread:1468 [handler.py:handle_request():146] handle_request: status_report
2024-04-11 12:26:10,730 DEBUG   HandlerThread:1468 [handler.py:handle_request():146] handle_request: status_report
2024-04-11 12:26:11,704 DEBUG   SenderThread:1468 [sender.py:send():379] send: stats
2024-04-11 12:26:12,690 DEBUG   HandlerThread:1468 [handler.py:handle_request():146] handle_request: stop_status
2024-04-11 12:26:12,690 DEBUG   SenderThread:1468 [sender.py:send_request():406] send_request: stop_status
2024-04-11 12:26:12,731 DEBUG   HandlerThread:1468 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-11 12:26:15,819 DEBUG   HandlerThread:1468 [handler.py:handle_request():146] handle_request: status_report
2024-04-11 12:26:20,820 DEBUG   HandlerThread:1468 [handler.py:handle_request():146] handle_request: status_report
2024-04-11 12:26:25,820 DEBUG   HandlerThread:1468 [handler.py:handle_request():146] handle_request: status_report
2024-04-11 12:26:27,690 DEBUG   HandlerThread:1468 [handler.py:handle_request():146] handle_request: stop_status
2024-04-11 12:26:27,691 DEBUG   SenderThread:1468 [sender.py:send_request():406] send_request: stop_status
2024-04-11 12:26:27,726 DEBUG   HandlerThread:1468 [handler.py:handle_request():146] handle_request: internal_messages