File size: 16,239 Bytes
ad67729 c8aef10 5a70ff6 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 |
2024-04-13 03:07:20,811 INFO StreamThr :162 [internal.py:wandb_internal():86] W&B internal server running at pid: 162, started at: 2024-04-13 03:07:20.811043
2024-04-13 03:07:20,813 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status
2024-04-13 03:07:21,260 INFO WriterThread:162 [datastore.py:open_for_write():87] open: /kaggle/working/wandb/run-20240413_030720-hqqism3w/run-hqqism3w.wandb
2024-04-13 03:07:21,261 DEBUG SenderThread:162 [sender.py:send():379] send: header
2024-04-13 03:07:21,264 DEBUG SenderThread:162 [sender.py:send():379] send: run
2024-04-13 03:07:21,377 INFO SenderThread:162 [dir_watcher.py:__init__():211] watching files in: /kaggle/working/wandb/run-20240413_030720-hqqism3w/files
2024-04-13 03:07:21,378 INFO SenderThread:162 [sender.py:_start_run_threads():1124] run started: hqqism3w with start time 1712977640.812572
2024-04-13 03:07:21,385 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: check_version
2024-04-13 03:07:21,385 DEBUG SenderThread:162 [sender.py:send_request():406] send_request: check_version
2024-04-13 03:07:21,476 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: run_start
2024-04-13 03:07:21,487 DEBUG HandlerThread:162 [system_info.py:__init__():26] System info init
2024-04-13 03:07:21,487 DEBUG HandlerThread:162 [system_info.py:__init__():41] System info init done
2024-04-13 03:07:21,487 INFO HandlerThread:162 [system_monitor.py:start():194] Starting system monitor
2024-04-13 03:07:21,487 INFO SystemMonitor:162 [system_monitor.py:_start():158] Starting system asset monitoring threads
2024-04-13 03:07:21,488 INFO HandlerThread:162 [system_monitor.py:probe():214] Collecting system info
2024-04-13 03:07:21,488 INFO SystemMonitor:162 [interfaces.py:start():190] Started cpu monitoring
2024-04-13 03:07:21,488 INFO SystemMonitor:162 [interfaces.py:start():190] Started disk monitoring
2024-04-13 03:07:21,489 INFO SystemMonitor:162 [interfaces.py:start():190] Started gpu monitoring
2024-04-13 03:07:21,490 INFO SystemMonitor:162 [interfaces.py:start():190] Started memory monitoring
2024-04-13 03:07:21,490 INFO SystemMonitor:162 [interfaces.py:start():190] Started network monitoring
2024-04-13 03:07:21,509 DEBUG HandlerThread:162 [system_info.py:probe():150] Probing system
2024-04-13 03:07:21,511 DEBUG HandlerThread:162 [gitlib.py:_init_repo():56] git repository is invalid
2024-04-13 03:07:21,511 DEBUG HandlerThread:162 [system_info.py:probe():198] Probing system done
2024-04-13 03:07:21,512 DEBUG HandlerThread:162 [system_monitor.py:probe():223] {'os': 'Linux-5.15.133+-x86_64-with-glibc2.31', 'python': '3.10.13', 'heartbeatAt': '2024-04-13T03:07:21.509640', 'startedAt': '2024-04-13T03:07:20.804476', 'docker': None, 'cuda': None, 'args': (), 'state': 'running', 'program': 'kaggle.ipynb', 'codePathLocal': None, 'root': '/kaggle/working', 'host': '4be9d1bc899e', 'username': 'root', 'executable': '/opt/conda/bin/python3.10', 'cpu_count': 2, 'cpu_count_logical': 4, 'cpu_freq': {'current': 2000.156, 'min': 0.0, 'max': 0.0}, 'cpu_freq_per_core': [{'current': 2000.156, 'min': 0.0, 'max': 0.0}, {'current': 2000.156, 'min': 0.0, 'max': 0.0}, {'current': 2000.156, 'min': 0.0, 'max': 0.0}, {'current': 2000.156, 'min': 0.0, 'max': 0.0}], 'disk': {'/': {'total': 8062.387607574463, 'used': 5569.521141052246}}, 'gpu': 'Tesla T4', 'gpu_count': 2, 'gpu_devices': [{'name': 'Tesla T4', 'memory_total': 16106127360}, {'name': 'Tesla T4', 'memory_total': 16106127360}], 'memory': {'total': 31.357559204101562}}
2024-04-13 03:07:21,512 INFO HandlerThread:162 [system_monitor.py:probe():224] Finished collecting system info
2024-04-13 03:07:21,512 INFO HandlerThread:162 [system_monitor.py:probe():227] Publishing system info
2024-04-13 03:07:21,512 DEBUG HandlerThread:162 [system_info.py:_save_conda():207] Saving list of conda packages installed into the current environment
2024-04-13 03:07:22,380 INFO Thread-12 :162 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240413_030720-hqqism3w/files/conda-environment.yaml
2024-04-13 03:07:36,526 ERROR HandlerThread:162 [system_info.py:_save_conda():221] Error saving conda packages: Command '['conda', 'env', 'export']' timed out after 15 seconds
Traceback (most recent call last):
File "/opt/conda/lib/python3.10/site-packages/wandb/sdk/internal/system/system_info.py", line 214, in _save_conda
subprocess.call(
File "/opt/conda/lib/python3.10/subprocess.py", line 347, in call
return p.wait(timeout=timeout)
File "/opt/conda/lib/python3.10/subprocess.py", line 1209, in wait
return self._wait(timeout=timeout)
File "/opt/conda/lib/python3.10/subprocess.py", line 1951, in _wait
raise TimeoutExpired(self.args, timeout)
subprocess.TimeoutExpired: Command '['conda', 'env', 'export']' timed out after 15 seconds
2024-04-13 03:07:36,530 DEBUG HandlerThread:162 [system_info.py:_save_conda():222] Saving conda packages done
2024-04-13 03:07:36,530 INFO HandlerThread:162 [system_monitor.py:probe():229] Finished publishing system info
2024-04-13 03:07:36,538 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:07:36,538 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: keepalive
2024-04-13 03:07:36,538 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:07:36,538 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: keepalive
2024-04-13 03:07:36,538 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:07:36,538 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: keepalive
2024-04-13 03:07:36,539 DEBUG SenderThread:162 [sender.py:send():379] send: files
2024-04-13 03:07:36,539 INFO SenderThread:162 [sender.py:_save_file():1390] saving file wandb-metadata.json with policy now
2024-04-13 03:07:36,786 INFO wandb-upload_0:162 [upload_job.py:push():131] Uploaded file /tmp/tmp3ubn2n35wandb/jjsi9om0-wandb-metadata.json
2024-04-13 03:07:37,383 INFO Thread-12 :162 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240413_030720-hqqism3w/files/wandb-metadata.json
2024-04-13 03:07:37,560 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: python_packages
2024-04-13 03:07:37,560 DEBUG SenderThread:162 [sender.py:send_request():406] send_request: python_packages
2024-04-13 03:07:37,564 DEBUG SenderThread:162 [sender.py:send():379] send: telemetry
2024-04-13 03:07:37,574 DEBUG SenderThread:162 [sender.py:send():379] send: config
2024-04-13 03:07:37,576 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-13 03:07:37,576 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: stop_status
2024-04-13 03:07:37,577 DEBUG SenderThread:162 [sender.py:send():379] send: metric
2024-04-13 03:07:37,578 DEBUG SenderThread:162 [sender.py:send():379] send: telemetry
2024-04-13 03:07:37,578 DEBUG SenderThread:162 [sender.py:send_request():406] send_request: stop_status
2024-04-13 03:07:37,693 DEBUG SenderThread:162 [sender.py:send():379] send: metric
2024-04-13 03:07:37,693 WARNING SenderThread:162 [sender.py:send_metric():1341] Seen metric with glob (shouldn't happen)
2024-04-13 03:07:37,693 DEBUG SenderThread:162 [sender.py:send():379] send: telemetry
2024-04-13 03:07:38,383 INFO Thread-12 :162 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240413_030720-hqqism3w/files/requirements.txt
2024-04-13 03:07:39,384 INFO Thread-12 :162 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240413_030720-hqqism3w/files/output.log
2024-04-13 03:07:41,385 INFO Thread-12 :162 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240413_030720-hqqism3w/files/output.log
2024-04-13 03:07:41,723 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:07:46,724 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:07:51,730 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:07:52,389 INFO Thread-12 :162 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240413_030720-hqqism3w/files/config.yaml
2024-04-13 03:07:52,563 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: stop_status
2024-04-13 03:07:52,563 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-13 03:07:52,564 DEBUG SenderThread:162 [sender.py:send_request():406] send_request: stop_status
2024-04-13 03:07:57,681 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:08:02,682 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:08:07,563 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: stop_status
2024-04-13 03:08:07,563 DEBUG SenderThread:162 [sender.py:send_request():406] send_request: stop_status
2024-04-13 03:08:07,604 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-13 03:08:07,699 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:08:10,146 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: partial_history
2024-04-13 03:08:10,148 DEBUG SenderThread:162 [sender.py:send():379] send: metric
2024-04-13 03:08:10,148 DEBUG SenderThread:162 [sender.py:send():379] send: metric
2024-04-13 03:08:10,148 DEBUG SenderThread:162 [sender.py:send():379] send: metric
2024-04-13 03:08:10,148 DEBUG SenderThread:162 [sender.py:send():379] send: metric
2024-04-13 03:08:10,148 DEBUG SenderThread:162 [sender.py:send():379] send: history
2024-04-13 03:08:10,148 DEBUG SenderThread:162 [sender.py:send_request():406] send_request: summary_record
2024-04-13 03:08:10,150 INFO SenderThread:162 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
2024-04-13 03:08:10,396 INFO Thread-12 :162 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240413_030720-hqqism3w/files/wandb-summary.json
2024-04-13 03:08:12,938 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:08:13,397 INFO Thread-12 :162 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240413_030720-hqqism3w/files/output.log
2024-04-13 03:08:17,938 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:08:21,491 DEBUG SystemMonitor:162 [system_monitor.py:_start():172] Starting system metrics aggregation loop
2024-04-13 03:08:21,492 DEBUG SenderThread:162 [sender.py:send():379] send: stats
2024-04-13 03:08:22,561 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: stop_status
2024-04-13 03:08:22,562 DEBUG SenderThread:162 [sender.py:send_request():406] send_request: stop_status
2024-04-13 03:08:22,566 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-13 03:08:23,629 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:08:24,401 INFO Thread-12 :162 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240413_030720-hqqism3w/files/config.yaml
2024-04-13 03:08:28,735 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:08:33,736 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:08:37,561 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: stop_status
2024-04-13 03:08:37,562 DEBUG SenderThread:162 [sender.py:send_request():406] send_request: stop_status
2024-04-13 03:08:37,602 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-13 03:08:39,617 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:08:44,618 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:08:45,115 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: partial_history
2024-04-13 03:08:45,116 DEBUG SenderThread:162 [sender.py:send():379] send: history
2024-04-13 03:08:45,116 DEBUG SenderThread:162 [sender.py:send_request():406] send_request: summary_record
2024-04-13 03:08:45,118 INFO SenderThread:162 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
2024-04-13 03:08:45,409 INFO Thread-12 :162 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240413_030720-hqqism3w/files/wandb-summary.json
2024-04-13 03:08:47,410 INFO Thread-12 :162 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240413_030720-hqqism3w/files/output.log
2024-04-13 03:08:49,876 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:08:51,493 DEBUG SenderThread:162 [sender.py:send():379] send: stats
2024-04-13 03:08:52,562 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: stop_status
2024-04-13 03:08:52,562 DEBUG SenderThread:162 [sender.py:send_request():406] send_request: stop_status
2024-04-13 03:08:52,565 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-13 03:08:55,643 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:09:00,644 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:09:05,644 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:09:07,562 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: stop_status
2024-04-13 03:09:07,562 DEBUG SenderThread:162 [sender.py:send_request():406] send_request: stop_status
2024-04-13 03:09:07,603 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-13 03:09:10,690 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:09:15,691 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:09:20,692 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:09:21,494 DEBUG SenderThread:162 [sender.py:send():379] send: stats
2024-04-13 03:09:22,562 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: stop_status
2024-04-13 03:09:22,563 DEBUG SenderThread:162 [sender.py:send_request():406] send_request: stop_status
2024-04-13 03:09:22,603 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-13 03:09:26,621 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:09:31,010 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: partial_history
2024-04-13 03:09:31,012 DEBUG SenderThread:162 [sender.py:send():379] send: history
2024-04-13 03:09:31,012 DEBUG SenderThread:162 [sender.py:send_request():406] send_request: summary_record
2024-04-13 03:09:31,012 INFO SenderThread:162 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
2024-04-13 03:09:31,427 INFO Thread-12 :162 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240413_030720-hqqism3w/files/wandb-summary.json
2024-04-13 03:09:31,740 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
|