2024-04-14 07:31:17,436 INFO StreamThr :183 [internal.py:wandb_internal():86] W&B internal server running at pid: 183, started at: 2024-04-14 07:31:17.436199
2024-04-14 07:31:17,438 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: status
2024-04-14 07:31:17,994 INFO WriterThread:183 [datastore.py:open_for_write():87] open: /kaggle/working/wandb/run-20240414_073117-xuqlocdo/run-xuqlocdo.wandb
2024-04-14 07:31:17,994 DEBUG SenderThread:183 [sender.py:send():379] send: header
2024-04-14 07:31:17,998 DEBUG SenderThread:183 [sender.py:send():379] send: run
2024-04-14 07:31:18,114 INFO SenderThread:183 [dir_watcher.py:__init__():211] watching files in: /kaggle/working/wandb/run-20240414_073117-xuqlocdo/files
2024-04-14 07:31:18,115 INFO SenderThread:183 [sender.py:_start_run_threads():1124] run started: xuqlocdo with start time 1713079877.439223
2024-04-14 07:31:18,123 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: check_version
2024-04-14 07:31:18,123 DEBUG SenderThread:183 [sender.py:send_request():406] send_request: check_version
2024-04-14 07:31:18,219 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: run_start
2024-04-14 07:31:18,230 DEBUG HandlerThread:183 [system_info.py:__init__():26] System info init
2024-04-14 07:31:18,230 DEBUG HandlerThread:183 [system_info.py:__init__():41] System info init done
2024-04-14 07:31:18,230 INFO HandlerThread:183 [system_monitor.py:start():194] Starting system monitor
2024-04-14 07:31:18,230 INFO SystemMonitor:183 [system_monitor.py:_start():158] Starting system asset monitoring threads
2024-04-14 07:31:18,231 INFO HandlerThread:183 [system_monitor.py:probe():214] Collecting system info
2024-04-14 07:31:18,231 INFO SystemMonitor:183 [interfaces.py:start():190] Started cpu monitoring
2024-04-14 07:31:18,232 INFO SystemMonitor:183 [interfaces.py:start():190] Started disk monitoring
2024-04-14 07:31:18,233 INFO SystemMonitor:183 [interfaces.py:start():190] Started gpu monitoring
2024-04-14 07:31:18,234 INFO SystemMonitor:183 [interfaces.py:start():190] Started memory monitoring
2024-04-14 07:31:18,235 INFO SystemMonitor:183 [interfaces.py:start():190] Started network monitoring
2024-04-14 07:31:18,245 DEBUG HandlerThread:183 [system_info.py:probe():150] Probing system
2024-04-14 07:31:18,247 DEBUG HandlerThread:183 [gitlib.py:_init_repo():56] git repository is invalid
2024-04-14 07:31:18,247 DEBUG HandlerThread:183 [system_info.py:probe():198] Probing system done
2024-04-14 07:31:18,247 DEBUG HandlerThread:183 [system_monitor.py:probe():223] {'os': 'Linux-5.15.133+-x86_64-with-glibc2.31', 'python': '3.10.13', 'heartbeatAt': '2024-04-14T07:31:18.245345', 'startedAt': '2024-04-14T07:31:17.430147', 'docker': None, 'cuda': None, 'args': (), 'state': 'running', 'program': 'kaggle.ipynb', 'codePathLocal': None, 'root': '/kaggle/working', 'host': 'f694866fb244', 'username': 'root', 'executable': '/opt/conda/bin/python3.10', 'cpu_count': 2, 'cpu_count_logical': 4, 'cpu_freq': {'current': 2000.172, 'min': 0.0, 'max': 0.0}, 'cpu_freq_per_core': [{'current': 2000.172, 'min': 0.0, 'max': 0.0}, {'current': 2000.172, 'min': 0.0, 'max': 0.0}, {'current': 2000.172, 'min': 0.0, 'max': 0.0}, {'current': 2000.172, 'min': 0.0, 'max': 0.0}], 'disk': {'/': {'total': 8062.387607574463, 'used': 5574.948589324951}}, 'gpu': 'Tesla T4', 'gpu_count': 2, 'gpu_devices': [{'name': 'Tesla T4', 'memory_total': 16106127360}, {'name': 'Tesla T4', 'memory_total': 16106127360}], 'memory': {'total': 31.357559204101562}}
2024-04-14 07:31:18,247 INFO HandlerThread:183 [system_monitor.py:probe():224] Finished collecting system info
2024-04-14 07:31:18,247 INFO HandlerThread:183 [system_monitor.py:probe():227] Publishing system info
2024-04-14 07:31:18,247 DEBUG HandlerThread:183 [system_info.py:_save_conda():207] Saving list of conda packages installed into the current environment
2024-04-14 07:31:19,116 INFO Thread-12 :183 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240414_073117-xuqlocdo/files/conda-environment.yaml
2024-04-14 07:31:33,262 ERROR HandlerThread:183 [system_info.py:_save_conda():221] Error saving conda packages: Command '['conda', 'env', 'export']' timed out after 15 seconds
Traceback (most recent call last):
  File "/opt/conda/lib/python3.10/site-packages/wandb/sdk/internal/system/system_info.py", line 214, in _save_conda
    subprocess.call(
  File "/opt/conda/lib/python3.10/subprocess.py", line 347, in call
    return p.wait(timeout=timeout)
  File "/opt/conda/lib/python3.10/subprocess.py", line 1209, in wait
    return self._wait(timeout=timeout)
  File "/opt/conda/lib/python3.10/subprocess.py", line 1951, in _wait
    raise TimeoutExpired(self.args, timeout)
subprocess.TimeoutExpired: Command '['conda', 'env', 'export']' timed out after 15 seconds
2024-04-14 07:31:33,268 DEBUG HandlerThread:183 [system_info.py:_save_conda():222] Saving conda packages done
2024-04-14 07:31:33,268 INFO HandlerThread:183 [system_monitor.py:probe():229] Finished publishing system info
2024-04-14 07:31:33,278 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: status_report
2024-04-14 07:31:33,278 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: keepalive
2024-04-14 07:31:33,278 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: status_report
2024-04-14 07:31:33,278 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: keepalive
2024-04-14 07:31:33,278 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: status_report
2024-04-14 07:31:33,279 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: keepalive
2024-04-14 07:31:33,279 DEBUG SenderThread:183 [sender.py:send():379] send: files
2024-04-14 07:31:33,279 INFO SenderThread:183 [sender.py:_save_file():1390] saving file wandb-metadata.json with policy now
2024-04-14 07:31:33,552 INFO wandb-upload_0:183 [upload_job.py:push():131] Uploaded file /tmp/tmpbxtmxuylwandb/9u0zqbnq-wandb-metadata.json
2024-04-14 07:31:34,120 INFO Thread-12 :183 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240414_073117-xuqlocdo/files/wandb-metadata.json
2024-04-14 07:31:34,297 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: python_packages
2024-04-14 07:31:34,297 DEBUG SenderThread:183 [sender.py:send_request():406] send_request: python_packages
2024-04-14 07:31:34,301 DEBUG SenderThread:183 [sender.py:send():379] send: telemetry
2024-04-14 07:31:34,311 DEBUG SenderThread:183 [sender.py:send():379] send: config
2024-04-14 07:31:34,312 DEBUG SenderThread:183 [sender.py:send():379] send: metric
2024-04-14 07:31:34,313 DEBUG SenderThread:183 [sender.py:send():379] send: telemetry
2024-04-14 07:31:34,313 DEBUG SenderThread:183 [sender.py:send():379] send: metric
2024-04-14 07:31:34,313 WARNING SenderThread:183 [sender.py:send_metric():1341] Seen metric with glob (shouldn't happen)
2024-04-14 07:31:34,313 DEBUG SenderThread:183 [sender.py:send():379] send: telemetry
2024-04-14 07:31:34,314 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: stop_status
2024-04-14 07:31:34,314 DEBUG SenderThread:183 [sender.py:send_request():406] send_request: stop_status
2024-04-14 07:31:34,317 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-14 07:31:35,120 INFO Thread-12 :183 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240414_073117-xuqlocdo/files/requirements.txt
2024-04-14 07:31:36,120 INFO Thread-12 :183 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240414_073117-xuqlocdo/files/output.log
2024-04-14 07:31:38,121 INFO Thread-12 :183 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240414_073117-xuqlocdo/files/output.log
2024-04-14 07:31:38,719 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: status_report
2024-04-14 07:31:43,720 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: status_report
2024-04-14 07:31:48,726 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: status_report
2024-04-14 07:31:49,125 INFO Thread-12 :183 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240414_073117-xuqlocdo/files/config.yaml
2024-04-14 07:31:49,299 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: stop_status
2024-04-14 07:31:49,300 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-14 07:31:49,300 DEBUG SenderThread:183 [sender.py:send_request():406] send_request: stop_status
2024-04-14 07:31:54,400 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: status_report
2024-04-14 07:31:56,464 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: partial_history
2024-04-14 07:31:56,465 DEBUG SenderThread:183 [sender.py:send():379] send: metric
2024-04-14 07:31:56,466 DEBUG SenderThread:183 [sender.py:send():379] send: metric
2024-04-14 07:31:56,466 DEBUG SenderThread:183 [sender.py:send():379] send: metric
2024-04-14 07:31:56,466 DEBUG SenderThread:183 [sender.py:send():379] send: metric
2024-04-14 07:31:56,466 DEBUG SenderThread:183 [sender.py:send():379] send: history
2024-04-14 07:31:56,466 DEBUG SenderThread:183 [sender.py:send_request():406] send_request: summary_record
2024-04-14 07:31:56,468 INFO SenderThread:183 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
2024-04-14 07:31:57,129 INFO Thread-12 :183 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240414_073117-xuqlocdo/files/wandb-summary.json
2024-04-14 07:32:00,130 INFO Thread-12 :183 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240414_073117-xuqlocdo/files/output.log
2024-04-14 07:32:00,285 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: status_report
2024-04-14 07:32:04,298 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: stop_status
2024-04-14 07:32:04,298 DEBUG SenderThread:183 [sender.py:send_request():406] send_request: stop_status
2024-04-14 07:32:04,301 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-14 07:32:05,328 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: status_report
2024-04-14 07:32:10,328 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: status_report
2024-04-14 07:32:15,329 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: status_report
2024-04-14 07:32:17,474 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: partial_history
2024-04-14 07:32:17,475 DEBUG SenderThread:183 [sender.py:send():379] send: history
2024-04-14 07:32:17,475 DEBUG SenderThread:183 [sender.py:send_request():406] send_request: summary_record
2024-04-14 07:32:17,477 INFO SenderThread:183 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
2024-04-14 07:32:18,137 INFO Thread-12 :183 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240414_073117-xuqlocdo/files/wandb-summary.json
2024-04-14 07:32:18,235 DEBUG SystemMonitor:183 [system_monitor.py:_start():172] Starting system metrics aggregation loop
2024-04-14 07:32:18,237 DEBUG SenderThread:183 [sender.py:send():379] send: stats
2024-04-14 07:32:19,298 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: stop_status
2024-04-14 07:32:19,298 DEBUG SenderThread:183 [sender.py:send_request():406] send_request: stop_status
2024-04-14 07:32:19,301 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-14 07:32:20,138 INFO Thread-12 :183 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240414_073117-xuqlocdo/files/output.log
2024-04-14 07:32:20,418 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: status_report
2024-04-14 07:32:21,138 INFO Thread-12 :183 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240414_073117-xuqlocdo/files/config.yaml
2024-04-14 07:32:25,504 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: status_report
2024-04-14 07:32:30,505 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: status_report
2024-04-14 07:32:34,298 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: stop_status
2024-04-14 07:32:34,299 DEBUG SenderThread:183 [sender.py:send_request():406] send_request: stop_status
2024-04-14 07:32:34,338 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-14 07:32:36,378 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: status_report
2024-04-14 07:32:38,751 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: partial_history
2024-04-14 07:32:38,753 DEBUG SenderThread:183 [sender.py:send():379] send: history
2024-04-14 07:32:38,753 DEBUG SenderThread:183 [sender.py:send_request():406] send_request: summary_record
2024-04-14 07:32:38,753 INFO SenderThread:183 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
2024-04-14 07:32:39,145 INFO Thread-12 :183 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240414_073117-xuqlocdo/files/wandb-summary.json
2024-04-14 07:32:41,573 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: status_report
2024-04-14 07:32:42,147 INFO Thread-12 :183 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240414_073117-xuqlocdo/files/output.log
2024-04-14 07:32:46,574 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: status_report
2024-04-14 07:32:48,238 DEBUG SenderThread:183 [sender.py:send():379] send: stats
2024-04-14 07:32:49,298 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: stop_status
2024-04-14 07:32:49,299 DEBUG SenderThread:183 [sender.py:send_request():406] send_request: stop_status
2024-04-14 07:32:49,301 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-14 07:32:52,417 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: status_report
2024-04-14 07:32:57,418 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: status_report
2024-04-14 07:32:59,581 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: partial_history
2024-04-14 07:32:59,582 DEBUG SenderThread:183 [sender.py:send():379] send: history
2024-04-14 07:32:59,582 DEBUG SenderThread:183 [sender.py:send_request():406] send_request: summary_record
2024-04-14 07:32:59,582 INFO SenderThread:183 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
2024-04-14 07:33:00,154 INFO Thread-12 :183 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240414_073117-xuqlocdo/files/wandb-summary.json
2024-04-14 07:33:02,155 INFO Thread-12 :183 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240414_073117-xuqlocdo/files/output.log
2024-04-14 07:33:03,319 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: status_report
2024-04-14 07:33:04,298 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: stop_status
2024-04-14 07:33:04,299 DEBUG SenderThread:183 [sender.py:send_request():406] send_request: stop_status
2024-04-14 07:33:04,301 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-14 07:33:08,369 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: status_report
2024-04-14 07:33:13,370 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: status_report
2024-04-14 07:33:18,240 DEBUG SenderThread:183 [sender.py:send():379] send: stats
2024-04-14 07:33:19,241 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: status_report
2024-04-14 07:33:19,298 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: stop_status
2024-04-14 07:33:19,299 DEBUG SenderThread:183 [sender.py:send_request():406] send_request: stop_status
2024-04-14 07:33:19,339 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-14 07:33:22,849 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: partial_history
2024-04-14 07:33:22,850 DEBUG SenderThread:183 [sender.py:send():379] send: history
2024-04-14 07:33:22,851 DEBUG SenderThread:183 [sender.py:send_request():406] send_request: summary_record
2024-04-14 07:33:22,852 INFO SenderThread:183 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
2024-04-14 07:33:23,163 INFO Thread-12 :183 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240414_073117-xuqlocdo/files/wandb-summary.json
2024-04-14 07:33:24,597 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: status_report
2024-04-14 07:33:26,164 INFO Thread-12 :183 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240414_073117-xuqlocdo/files/output.log
2024-04-14 07:33:29,598 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: status_report
2024-04-14 07:33:34,298 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: stop_status
2024-04-14 07:33:34,299 DEBUG SenderThread:183 [sender.py:send_request():406] send_request: stop_status
2024-04-14 07:33:34,303 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-14 07:33:35,421 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: status_report
2024-04-14 07:33:40,422 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: status_report
2024-04-14 07:33:44,117 DEBUG HandlerThread:183 [handler.py:handle_request():146] handle_request: partial_history
2024-04-14 07:33:44,118 DEBUG SenderThread:183 [sender.py:send():379] send: history
2024-04-14 07:33:44,118 DEBUG SenderThread:183 [sender.py:send_request():406] send_request: summary_record
2024-04-14 07:33:44,119 INFO SenderThread:183 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
2024-04-14 07:33:44,171 INFO Thread-12 :183 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240414_073117-xuqlocdo/files/wandb-summary.json