2024-04-09 22:07:00,443 INFO StreamThr :462 [internal.py:wandb_internal():86] W&B internal server running at pid: 462, started at: 2024-04-09 22:07:00.443187
2024-04-09 22:07:00,445 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: status
2024-04-09 22:07:00,767 INFO WriterThread:462 [datastore.py:open_for_write():87] open: /kaggle/working/wandb/run-20240409_220700-9aom042n/run-9aom042n.wandb
2024-04-09 22:07:00,767 DEBUG SenderThread:462 [sender.py:send():379] send: header
2024-04-09 22:07:00,770 DEBUG SenderThread:462 [sender.py:send():379] send: run
2024-04-09 22:07:03,990 INFO SenderThread:462 [dir_watcher.py:__init__():211] watching files in: /kaggle/working/wandb/run-20240409_220700-9aom042n/files
2024-04-09 22:07:03,990 INFO SenderThread:462 [sender.py:_start_run_threads():1124] run started: 9aom042n with start time 1712700420.443095
2024-04-09 22:07:04,000 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: check_version
2024-04-09 22:07:04,000 DEBUG SenderThread:462 [sender.py:send_request():406] send_request: check_version
2024-04-09 22:07:04,096 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: run_start
2024-04-09 22:07:04,107 DEBUG HandlerThread:462 [system_info.py:__init__():26] System info init
2024-04-09 22:07:04,107 DEBUG HandlerThread:462 [system_info.py:__init__():41] System info init done
2024-04-09 22:07:04,107 INFO HandlerThread:462 [system_monitor.py:start():194] Starting system monitor
2024-04-09 22:07:04,107 INFO SystemMonitor:462 [system_monitor.py:_start():158] Starting system asset monitoring threads
2024-04-09 22:07:04,107 INFO HandlerThread:462 [system_monitor.py:probe():214] Collecting system info
2024-04-09 22:07:04,108 INFO SystemMonitor:462 [interfaces.py:start():190] Started cpu monitoring
2024-04-09 22:07:04,109 INFO SystemMonitor:462 [interfaces.py:start():190] Started disk monitoring
2024-04-09 22:07:04,111 INFO SystemMonitor:462 [interfaces.py:start():190] Started gpu monitoring
2024-04-09 22:07:04,111 INFO SystemMonitor:462 [interfaces.py:start():190] Started memory monitoring
2024-04-09 22:07:04,112 INFO SystemMonitor:462 [interfaces.py:start():190] Started network monitoring
2024-04-09 22:07:04,126 DEBUG HandlerThread:462 [system_info.py:probe():150] Probing system
2024-04-09 22:07:04,129 DEBUG HandlerThread:462 [gitlib.py:_init_repo():56] git repository is invalid
2024-04-09 22:07:04,129 DEBUG HandlerThread:462 [system_info.py:probe():198] Probing system done
2024-04-09 22:07:04,129 DEBUG HandlerThread:462 [system_monitor.py:probe():223] {'os': 'Linux-5.15.133+-x86_64-with-glibc2.31', 'python': '3.10.13', 'heartbeatAt': '2024-04-09T22:07:04.127016', 'startedAt': '2024-04-09T22:07:00.437103', 'docker': None, 'cuda': None, 'args': (), 'state': 'running', 'program': 'kaggle.ipynb', 'codePathLocal': None, 'root': '/kaggle/working', 'host': '6e44b39f6877', 'username': 'root', 'executable': '/opt/conda/bin/python3.10', 'cpu_count': 2, 'cpu_count_logical': 4, 'cpu_freq': {'current': 2000.152, 'min': 0.0, 'max': 0.0}, 'cpu_freq_per_core': [{'current': 2000.152, 'min': 0.0, 'max': 0.0}, {'current': 2000.152, 'min': 0.0, 'max': 0.0}, {'current': 2000.152, 'min': 0.0, 'max': 0.0}, {'current': 2000.152, 'min': 0.0, 'max': 0.0}], 'disk': {'/': {'total': 8062.387607574463, 'used': 5569.50146484375}}, 'gpu': 'Tesla T4', 'gpu_count': 2, 'gpu_devices': [{'name': 'Tesla T4', 'memory_total': 16106127360}, {'name': 'Tesla T4', 'memory_total': 16106127360}], 'memory': {'total': 31.357559204101562}}
2024-04-09 22:07:04,129 INFO HandlerThread:462 [system_monitor.py:probe():224] Finished collecting system info
2024-04-09 22:07:04,129 INFO HandlerThread:462 [system_monitor.py:probe():227] Publishing system info
2024-04-09 22:07:04,129 DEBUG HandlerThread:462 [system_info.py:_save_conda():207] Saving list of conda packages installed into the current environment
2024-04-09 22:07:04,992 INFO Thread-12 :462 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240409_220700-9aom042n/files/conda-environment.yaml
2024-04-09 22:07:19,144 ERROR HandlerThread:462 [system_info.py:_save_conda():221] Error saving conda packages: Command '['conda', 'env', 'export']' timed out after 15 seconds
Traceback (most recent call last):
  File "/opt/conda/lib/python3.10/site-packages/wandb/sdk/internal/system/system_info.py", line 214, in _save_conda
    subprocess.call(
  File "/opt/conda/lib/python3.10/subprocess.py", line 347, in call
    return p.wait(timeout=timeout)
  File "/opt/conda/lib/python3.10/subprocess.py", line 1209, in wait
    return self._wait(timeout=timeout)
  File "/opt/conda/lib/python3.10/subprocess.py", line 1951, in _wait
    raise TimeoutExpired(self.args, timeout)
subprocess.TimeoutExpired: Command '['conda', 'env', 'export']' timed out after 15 seconds
2024-04-09 22:07:19,144 DEBUG HandlerThread:462 [system_info.py:_save_conda():222] Saving conda packages done
2024-04-09 22:07:19,145 INFO HandlerThread:462 [system_monitor.py:probe():229] Finished publishing system info
2024-04-09 22:07:19,150 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:07:19,150 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: keepalive
2024-04-09 22:07:19,150 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:07:19,151 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: keepalive
2024-04-09 22:07:19,151 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:07:19,151 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: keepalive
2024-04-09 22:07:19,151 DEBUG SenderThread:462 [sender.py:send():379] send: files
2024-04-09 22:07:19,151 INFO SenderThread:462 [sender.py:_save_file():1390] saving file wandb-metadata.json with policy now
2024-04-09 22:07:19,461 INFO wandb-upload_0:462 [upload_job.py:push():131] Uploaded file /tmp/tmpjk_7pw69wandb/fv30folz-wandb-metadata.json
2024-04-09 22:07:19,995 INFO Thread-12 :462 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240409_220700-9aom042n/files/wandb-metadata.json
2024-04-09 22:07:20,150 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: python_packages
2024-04-09 22:07:20,151 DEBUG SenderThread:462 [sender.py:send_request():406] send_request: python_packages
2024-04-09 22:07:20,154 DEBUG SenderThread:462 [sender.py:send():379] send: telemetry
2024-04-09 22:07:20,164 DEBUG SenderThread:462 [sender.py:send():379] send: config
2024-04-09 22:07:20,167 DEBUG SenderThread:462 [sender.py:send():379] send: metric
2024-04-09 22:07:20,167 DEBUG SenderThread:462 [sender.py:send():379] send: telemetry
2024-04-09 22:07:20,167 DEBUG SenderThread:462 [sender.py:send():379] send: metric
2024-04-09 22:07:20,167 WARNING SenderThread:462 [sender.py:send_metric():1341] Seen metric with glob (shouldn't happen)
2024-04-09 22:07:20,168 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: stop_status
2024-04-09 22:07:20,168 DEBUG SenderThread:462 [sender.py:send():379] send: telemetry
2024-04-09 22:07:20,169 DEBUG SenderThread:462 [sender.py:send_request():406] send_request: stop_status
2024-04-09 22:07:20,171 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-09 22:07:20,996 INFO Thread-12 :462 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240409_220700-9aom042n/files/output.log
2024-04-09 22:07:20,996 INFO Thread-12 :462 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240409_220700-9aom042n/files/requirements.txt
2024-04-09 22:07:21,386 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:07:23,000 INFO Thread-12 :462 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240409_220700-9aom042n/files/output.log
2024-04-09 22:07:26,387 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:07:31,393 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:07:32,004 INFO Thread-12 :462 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240409_220700-9aom042n/files/config.yaml
2024-04-09 22:07:35,154 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: stop_status
2024-04-09 22:07:35,154 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-09 22:07:35,154 DEBUG SenderThread:462 [sender.py:send_request():406] send_request: stop_status
2024-04-09 22:07:37,289 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:07:42,290 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:07:47,291 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:07:50,153 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-09 22:07:50,153 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: stop_status
2024-04-09 22:07:50,154 DEBUG SenderThread:462 [sender.py:send_request():406] send_request: stop_status
2024-04-09 22:07:53,243 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:07:58,243 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:08:03,244 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:08:04,112 DEBUG SystemMonitor:462 [system_monitor.py:_start():172] Starting system metrics aggregation loop
2024-04-09 22:08:04,114 DEBUG SenderThread:462 [sender.py:send():379] send: stats
2024-04-09 22:08:05,151 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: stop_status
2024-04-09 22:08:05,152 DEBUG SenderThread:462 [sender.py:send_request():406] send_request: stop_status
2024-04-09 22:08:05,155 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-09 22:08:08,312 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:08:13,313 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:08:14,561 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: partial_history
2024-04-09 22:08:14,564 DEBUG SenderThread:462 [sender.py:send():379] send: metric
2024-04-09 22:08:14,564 DEBUG SenderThread:462 [sender.py:send():379] send: metric
2024-04-09 22:08:14,564 DEBUG SenderThread:462 [sender.py:send():379] send: metric
2024-04-09 22:08:14,565 DEBUG SenderThread:462 [sender.py:send():379] send: metric
2024-04-09 22:08:14,565 DEBUG SenderThread:462 [sender.py:send():379] send: history
2024-04-09 22:08:14,565 DEBUG SenderThread:462 [sender.py:send_request():406] send_request: summary_record
2024-04-09 22:08:14,567 INFO SenderThread:462 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
2024-04-09 22:08:15,020 INFO Thread-12 :462 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240409_220700-9aom042n/files/wandb-summary.json
2024-04-09 22:08:17,021 INFO Thread-12 :462 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240409_220700-9aom042n/files/output.log
2024-04-09 22:08:19,130 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:08:20,152 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: stop_status
2024-04-09 22:08:20,152 DEBUG SenderThread:462 [sender.py:send_request():406] send_request: stop_status
2024-04-09 22:08:20,156 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-09 22:08:24,247 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:08:29,248 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:08:34,115 DEBUG SenderThread:462 [sender.py:send():379] send: stats
2024-04-09 22:08:35,121 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:08:35,202 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-09 22:08:35,203 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: stop_status
2024-04-09 22:08:35,256 DEBUG SenderThread:462 [sender.py:send_request():406] send_request: stop_status
2024-04-09 22:08:36,029 INFO Thread-12 :462 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240409_220700-9aom042n/files/config.yaml
2024-04-09 22:08:40,388 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:08:45,389 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:08:50,159 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-09 22:08:50,160 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: stop_status
2024-04-09 22:08:50,160 DEBUG SenderThread:462 [sender.py:send_request():406] send_request: stop_status
2024-04-09 22:08:51,289 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:08:56,289 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:09:01,290 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:09:04,116 DEBUG SenderThread:462 [sender.py:send():379] send: stats
2024-04-09 22:09:05,159 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-09 22:09:05,160 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: stop_status
2024-04-09 22:09:05,160 DEBUG SenderThread:462 [sender.py:send_request():406] send_request: stop_status
2024-04-09 22:09:06,333 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:09:11,334 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: status_report
2024-04-09 22:09:12,622 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: partial_history
2024-04-09 22:09:12,623 DEBUG SenderThread:462 [sender.py:send():379] send: history
2024-04-09 22:09:12,623 DEBUG SenderThread:462 [sender.py:send_request():406] send_request: summary_record
2024-04-09 22:09:12,624 INFO SenderThread:462 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
2024-04-09 22:09:13,028 DEBUG SenderThread:462 [sender.py:send():379] send: telemetry
2024-04-09 22:09:13,028 DEBUG SenderThread:462 [sender.py:send_request():406] send_request: summary_record
2024-04-09 22:09:13,030 DEBUG HandlerThread:462 [handler.py:handle_request():146] handle_request: partial_history
2024-04-09 22:09:13,032 INFO SenderThread:462 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
2024-04-09 22:09:13,032 DEBUG SenderThread:462 [sender.py:send_request():406] send_request: summary_record
2024-04-09 22:09:13,032 INFO SenderThread:462 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
2024-04-09 22:09:13,033 DEBUG SenderThread:462 [sender.py:send_request():406] send_request: summary_record
2024-04-09 22:09:13,033 INFO SenderThread:462 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
2024-04-09 22:09:13,033 DEBUG SenderThread:462 [sender.py:send_request():406] send_request: summary_record
2024-04-09 22:09:13,034 INFO SenderThread:462 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
2024-04-09 22:09:13,034 DEBUG SenderThread:462 [sender.py:send_request():406] send_request: summary_record
2024-04-09 22:09:13,034 INFO SenderThread:462 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
2024-04-09 22:09:13,034 DEBUG SenderThread:462 [sender.py:send():379] send: history
2024-04-09 22:09:13,034 DEBUG SenderThread:462 [sender.py:send_request():406] send_request: summary_record
2024-04-09 22:09:13,035 INFO SenderThread:462 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
2024-04-09 22:09:13,042 INFO Thread-12 :462 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240409_220700-9aom042n/files/wandb-summary.json