2024-04-09 22:07:00,443 INFO    StreamThr :462 [internal.py:wandb_internal():86] W&B internal server running at pid: 462, started at: 2024-04-09 22:07:00.443187
2024-04-09 22:07:00,445 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: status
2024-04-09 22:07:00,767 INFO    WriterThread:462 [datastore.py:open_for_write():87] open: /kaggle/working/wandb/run-20240409_220700-9aom042n/run-9aom042n.wandb
2024-04-09 22:07:00,767 DEBUG   SenderThread:462 [sender.py:send():379] send: header
2024-04-09 22:07:00,770 DEBUG   SenderThread:462 [sender.py:send():379] send: run
2024-04-09 22:07:03,990 INFO    SenderThread:462 [dir_watcher.py:__init__():211] watching files in: /kaggle/working/wandb/run-20240409_220700-9aom042n/files
2024-04-09 22:07:03,990 INFO    SenderThread:462 [sender.py:_start_run_threads():1124] run started: 9aom042n with start time 1712700420.443095
2024-04-09 22:07:04,000 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: check_version
2024-04-09 22:07:04,000 DEBUG   SenderThread:462 [sender.py:send_request():406] send_request: check_version
2024-04-09 22:07:04,096 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: run_start
2024-04-09 22:07:04,107 DEBUG   HandlerThread:462 [system_info.py:__init__():26] System info init
2024-04-09 22:07:04,107 DEBUG   HandlerThread:462 [system_info.py:__init__():41] System info init done
2024-04-09 22:07:04,107 INFO    HandlerThread:462 [system_monitor.py:start():194] Starting system monitor
2024-04-09 22:07:04,107 INFO    SystemMonitor:462 [system_monitor.py:_start():158] Starting system asset monitoring threads
2024-04-09 22:07:04,107 INFO    HandlerThread:462 [system_monitor.py:probe():214] Collecting system info
2024-04-09 22:07:04,108 INFO    SystemMonitor:462 [interfaces.py:start():190] Started cpu monitoring
2024-04-09 22:07:04,109 INFO    SystemMonitor:462 [interfaces.py:start():190] Started disk monitoring
2024-04-09 22:07:04,111 INFO    SystemMonitor:462 [interfaces.py:start():190] Started gpu monitoring
2024-04-09 22:07:04,111 INFO    SystemMonitor:462 [interfaces.py:start():190] Started memory monitoring
2024-04-09 22:07:04,112 INFO    SystemMonitor:462 [interfaces.py:start():190] Started network monitoring
2024-04-09 22:07:04,126 DEBUG   HandlerThread:462 [system_info.py:probe():150] Probing system
2024-04-09 22:07:04,129 DEBUG   HandlerThread:462 [gitlib.py:_init_repo():56] git repository is invalid
2024-04-09 22:07:04,129 DEBUG   HandlerThread:462 [system_info.py:probe():198] Probing system done
2024-04-09 22:07:04,129 DEBUG   HandlerThread:462 [system_monitor.py:probe():223] {'os': 'Linux-5.15.133+-x86_64-with-glibc2.31', 'python': '3.10.13', 'heartbeatAt': '2024-04-09T22:07:04.127016', 'startedAt': '2024-04-09T22:07:00.437103', 'docker': None, 'cuda': None, 'args': (), 'state': 'running', 'program': 'kaggle.ipynb', 'codePathLocal': None, 'root': '/kaggle/working', 'host': '6e44b39f6877', 'username': 'root', 'executable': '/opt/conda/bin/python3.10', 'cpu_count': 2, 'cpu_count_logical': 4, 'cpu_freq': {'current': 2000.152, 'min': 0.0, 'max': 0.0}, 'cpu_freq_per_core': [{'current': 2000.152, 'min': 0.0, 'max': 0.0}, {'current': 2000.152, 'min': 0.0, 'max': 0.0}, {'current': 2000.152, 'min': 0.0, 'max': 0.0}, {'current': 2000.152, 'min': 0.0, 'max': 0.0}], 'disk': {'/': {'total': 8062.387607574463, 'used': 5569.50146484375}}, 'gpu': 'Tesla T4', 'gpu_count': 2, 'gpu_devices': [{'name': 'Tesla T4', 'memory_total': 16106127360}, {'name': 'Tesla T4', 'memory_total': 16106127360}], 'memory': {'total': 31.357559204101562}}
2024-04-09 22:07:04,129 INFO    HandlerThread:462 [system_monitor.py:probe():224] Finished collecting system info
2024-04-09 22:07:04,129 INFO    HandlerThread:462 [system_monitor.py:probe():227] Publishing system info
2024-04-09 22:07:04,129 DEBUG   HandlerThread:462 [system_info.py:_save_conda():207] Saving list of conda packages installed into the current environment
2024-04-09 22:07:04,992 INFO    Thread-12 :462 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240409_220700-9aom042n/files/conda-environment.yaml
2024-04-09 22:07:19,144 ERROR   HandlerThread:462 [system_info.py:_save_conda():221] Error saving conda packages: Command '['conda', 'env', 'export']' timed out after 15 seconds
Traceback (most recent call last):
  File "/opt/conda/lib/python3.10/site-packages/wandb/sdk/internal/system/system_info.py", line 214, in _save_conda
    subprocess.call(
  File "/opt/conda/lib/python3.10/subprocess.py", line 347, in call
    return p.wait(timeout=timeout)
  File "/opt/conda/lib/python3.10/subprocess.py", line 1209, in wait
    return self._wait(timeout=timeout)
  File "/opt/conda/lib/python3.10/subprocess.py", line 1951, in _wait
    raise TimeoutExpired(self.args, timeout)
subprocess.TimeoutExpired: Command '['conda', 'env', 'export']' timed out after 15 seconds
2024-04-09 22:07:19,144 DEBUG   HandlerThread:462 [system_info.py:_save_conda():222] Saving conda packages done
2024-04-09 22:07:19,145 INFO    HandlerThread:462 [system_monitor.py:probe():229] Finished publishing system info
2024-04-09 22:07:19,150 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:07:19,150 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: keepalive
2024-04-09 22:07:19,150 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:07:19,151 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: keepalive
2024-04-09 22:07:19,151 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:07:19,151 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: keepalive
2024-04-09 22:07:19,151 DEBUG   SenderThread:462 [sender.py:send():379] send: files
2024-04-09 22:07:19,151 INFO    SenderThread:462 [sender.py:_save_file():1390] saving file wandb-metadata.json with policy now
2024-04-09 22:07:19,461 INFO    wandb-upload_0:462 [upload_job.py:push():131] Uploaded file /tmp/tmpjk_7pw69wandb/fv30folz-wandb-metadata.json
2024-04-09 22:07:19,995 INFO    Thread-12 :462 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240409_220700-9aom042n/files/wandb-metadata.json
2024-04-09 22:07:20,150 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: python_packages
2024-04-09 22:07:20,151 DEBUG   SenderThread:462 [sender.py:send_request():406] send_request: python_packages
2024-04-09 22:07:20,154 DEBUG   SenderThread:462 [sender.py:send():379] send: telemetry
2024-04-09 22:07:20,164 DEBUG   SenderThread:462 [sender.py:send():379] send: config
2024-04-09 22:07:20,167 DEBUG   SenderThread:462 [sender.py:send():379] send: metric
2024-04-09 22:07:20,167 DEBUG   SenderThread:462 [sender.py:send():379] send: telemetry
2024-04-09 22:07:20,167 DEBUG   SenderThread:462 [sender.py:send():379] send: metric
2024-04-09 22:07:20,167 WARNING SenderThread:462 [sender.py:send_metric():1341] Seen metric with glob (shouldn't happen)
2024-04-09 22:07:20,168 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: stop_status
2024-04-09 22:07:20,168 DEBUG   SenderThread:462 [sender.py:send():379] send: telemetry
2024-04-09 22:07:20,169 DEBUG   SenderThread:462 [sender.py:send_request():406] send_request: stop_status
2024-04-09 22:07:20,171 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-09 22:07:20,996 INFO    Thread-12 :462 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240409_220700-9aom042n/files/output.log
2024-04-09 22:07:20,996 INFO    Thread-12 :462 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240409_220700-9aom042n/files/requirements.txt
2024-04-09 22:07:21,386 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:07:23,000 INFO    Thread-12 :462 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240409_220700-9aom042n/files/output.log
2024-04-09 22:07:26,387 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:07:31,393 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:07:32,004 INFO    Thread-12 :462 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240409_220700-9aom042n/files/config.yaml
2024-04-09 22:07:35,154 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: stop_status
2024-04-09 22:07:35,154 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-09 22:07:35,154 DEBUG   SenderThread:462 [sender.py:send_request():406] send_request: stop_status
2024-04-09 22:07:37,289 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:07:42,290 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:07:47,291 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:07:50,153 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-09 22:07:50,153 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: stop_status
2024-04-09 22:07:50,154 DEBUG   SenderThread:462 [sender.py:send_request():406] send_request: stop_status
2024-04-09 22:07:53,243 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:07:58,243 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:08:03,244 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:08:04,112 DEBUG   SystemMonitor:462 [system_monitor.py:_start():172] Starting system metrics aggregation loop
2024-04-09 22:08:04,114 DEBUG   SenderThread:462 [sender.py:send():379] send: stats
2024-04-09 22:08:05,151 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: stop_status
2024-04-09 22:08:05,152 DEBUG   SenderThread:462 [sender.py:send_request():406] send_request: stop_status
2024-04-09 22:08:05,155 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-09 22:08:08,312 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:08:13,313 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:08:14,561 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: partial_history
2024-04-09 22:08:14,564 DEBUG   SenderThread:462 [sender.py:send():379] send: metric
2024-04-09 22:08:14,564 DEBUG   SenderThread:462 [sender.py:send():379] send: metric
2024-04-09 22:08:14,564 DEBUG   SenderThread:462 [sender.py:send():379] send: metric
2024-04-09 22:08:14,565 DEBUG   SenderThread:462 [sender.py:send():379] send: metric
2024-04-09 22:08:14,565 DEBUG   SenderThread:462 [sender.py:send():379] send: history
2024-04-09 22:08:14,565 DEBUG   SenderThread:462 [sender.py:send_request():406] send_request: summary_record
2024-04-09 22:08:14,567 INFO    SenderThread:462 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
2024-04-09 22:08:15,020 INFO    Thread-12 :462 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240409_220700-9aom042n/files/wandb-summary.json
2024-04-09 22:08:17,021 INFO    Thread-12 :462 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240409_220700-9aom042n/files/output.log
2024-04-09 22:08:19,130 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:08:20,152 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: stop_status
2024-04-09 22:08:20,152 DEBUG   SenderThread:462 [sender.py:send_request():406] send_request: stop_status
2024-04-09 22:08:20,156 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-09 22:08:24,247 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:08:29,248 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:08:34,115 DEBUG   SenderThread:462 [sender.py:send():379] send: stats
2024-04-09 22:08:35,121 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:08:35,202 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-09 22:08:35,203 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: stop_status
2024-04-09 22:08:35,256 DEBUG   SenderThread:462 [sender.py:send_request():406] send_request: stop_status
2024-04-09 22:08:36,029 INFO    Thread-12 :462 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240409_220700-9aom042n/files/config.yaml
2024-04-09 22:08:40,388 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:08:45,389 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:08:50,159 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-09 22:08:50,160 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: stop_status
2024-04-09 22:08:50,160 DEBUG   SenderThread:462 [sender.py:send_request():406] send_request: stop_status
2024-04-09 22:08:51,289 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:08:56,289 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:09:01,290 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:09:04,116 DEBUG   SenderThread:462 [sender.py:send():379] send: stats
2024-04-09 22:09:05,159 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-09 22:09:05,160 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: stop_status
2024-04-09 22:09:05,160 DEBUG   SenderThread:462 [sender.py:send_request():406] send_request: stop_status
2024-04-09 22:09:06,333 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:09:11,334 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:09:12,622 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: partial_history
2024-04-09 22:09:12,623 DEBUG   SenderThread:462 [sender.py:send():379] send: history
2024-04-09 22:09:12,623 DEBUG   SenderThread:462 [sender.py:send_request():406] send_request: summary_record
2024-04-09 22:09:12,624 INFO    SenderThread:462 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
2024-04-09 22:09:13,028 DEBUG   SenderThread:462 [sender.py:send():379] send: telemetry
2024-04-09 22:09:13,028 DEBUG   SenderThread:462 [sender.py:send_request():406] send_request: summary_record
2024-04-09 22:09:13,030 DEBUG   HandlerThread:462 [handler.py:handle_request():146] handle_request: partial_history
2024-04-09 22:09:13,032 INFO    SenderThread:462 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
2024-04-09 22:09:13,032 DEBUG   SenderThread:462 [sender.py:send_request():406] send_request: summary_record
2024-04-09 22:09:13,032 INFO    SenderThread:462 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
2024-04-09 22:09:13,033 DEBUG   SenderThread:462 [sender.py:send_request():406] send_request: summary_record
2024-04-09 22:09:13,033 INFO    SenderThread:462 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
2024-04-09 22:09:13,033 DEBUG   SenderThread:462 [sender.py:send_request():406] send_request: summary_record
2024-04-09 22:09:13,034 INFO    SenderThread:462 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
2024-04-09 22:09:13,034 DEBUG   SenderThread:462 [sender.py:send_request():406] send_request: summary_record
2024-04-09 22:09:13,034 INFO    SenderThread:462 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
2024-04-09 22:09:13,034 DEBUG   SenderThread:462 [sender.py:send():379] send: history
2024-04-09 22:09:13,034 DEBUG   SenderThread:462 [sender.py:send_request():406] send_request: summary_record
2024-04-09 22:09:13,035 INFO    SenderThread:462 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
2024-04-09 22:09:13,042 INFO    Thread-12 :462 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240409_220700-9aom042n/files/wandb-summary.json