Femboyuwu2000's picture
Training in progress, step 60
5a70ff6 verified
raw
history blame
No virus
16.2 kB
2024-04-13 03:07:20,811 INFO StreamThr :162 [internal.py:wandb_internal():86] W&B internal server running at pid: 162, started at: 2024-04-13 03:07:20.811043
2024-04-13 03:07:20,813 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status
2024-04-13 03:07:21,260 INFO WriterThread:162 [datastore.py:open_for_write():87] open: /kaggle/working/wandb/run-20240413_030720-hqqism3w/run-hqqism3w.wandb
2024-04-13 03:07:21,261 DEBUG SenderThread:162 [sender.py:send():379] send: header
2024-04-13 03:07:21,264 DEBUG SenderThread:162 [sender.py:send():379] send: run
2024-04-13 03:07:21,377 INFO SenderThread:162 [dir_watcher.py:__init__():211] watching files in: /kaggle/working/wandb/run-20240413_030720-hqqism3w/files
2024-04-13 03:07:21,378 INFO SenderThread:162 [sender.py:_start_run_threads():1124] run started: hqqism3w with start time 1712977640.812572
2024-04-13 03:07:21,385 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: check_version
2024-04-13 03:07:21,385 DEBUG SenderThread:162 [sender.py:send_request():406] send_request: check_version
2024-04-13 03:07:21,476 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: run_start
2024-04-13 03:07:21,487 DEBUG HandlerThread:162 [system_info.py:__init__():26] System info init
2024-04-13 03:07:21,487 DEBUG HandlerThread:162 [system_info.py:__init__():41] System info init done
2024-04-13 03:07:21,487 INFO HandlerThread:162 [system_monitor.py:start():194] Starting system monitor
2024-04-13 03:07:21,487 INFO SystemMonitor:162 [system_monitor.py:_start():158] Starting system asset monitoring threads
2024-04-13 03:07:21,488 INFO HandlerThread:162 [system_monitor.py:probe():214] Collecting system info
2024-04-13 03:07:21,488 INFO SystemMonitor:162 [interfaces.py:start():190] Started cpu monitoring
2024-04-13 03:07:21,488 INFO SystemMonitor:162 [interfaces.py:start():190] Started disk monitoring
2024-04-13 03:07:21,489 INFO SystemMonitor:162 [interfaces.py:start():190] Started gpu monitoring
2024-04-13 03:07:21,490 INFO SystemMonitor:162 [interfaces.py:start():190] Started memory monitoring
2024-04-13 03:07:21,490 INFO SystemMonitor:162 [interfaces.py:start():190] Started network monitoring
2024-04-13 03:07:21,509 DEBUG HandlerThread:162 [system_info.py:probe():150] Probing system
2024-04-13 03:07:21,511 DEBUG HandlerThread:162 [gitlib.py:_init_repo():56] git repository is invalid
2024-04-13 03:07:21,511 DEBUG HandlerThread:162 [system_info.py:probe():198] Probing system done
2024-04-13 03:07:21,512 DEBUG HandlerThread:162 [system_monitor.py:probe():223] {'os': 'Linux-5.15.133+-x86_64-with-glibc2.31', 'python': '3.10.13', 'heartbeatAt': '2024-04-13T03:07:21.509640', 'startedAt': '2024-04-13T03:07:20.804476', 'docker': None, 'cuda': None, 'args': (), 'state': 'running', 'program': 'kaggle.ipynb', 'codePathLocal': None, 'root': '/kaggle/working', 'host': '4be9d1bc899e', 'username': 'root', 'executable': '/opt/conda/bin/python3.10', 'cpu_count': 2, 'cpu_count_logical': 4, 'cpu_freq': {'current': 2000.156, 'min': 0.0, 'max': 0.0}, 'cpu_freq_per_core': [{'current': 2000.156, 'min': 0.0, 'max': 0.0}, {'current': 2000.156, 'min': 0.0, 'max': 0.0}, {'current': 2000.156, 'min': 0.0, 'max': 0.0}, {'current': 2000.156, 'min': 0.0, 'max': 0.0}], 'disk': {'/': {'total': 8062.387607574463, 'used': 5569.521141052246}}, 'gpu': 'Tesla T4', 'gpu_count': 2, 'gpu_devices': [{'name': 'Tesla T4', 'memory_total': 16106127360}, {'name': 'Tesla T4', 'memory_total': 16106127360}], 'memory': {'total': 31.357559204101562}}
2024-04-13 03:07:21,512 INFO HandlerThread:162 [system_monitor.py:probe():224] Finished collecting system info
2024-04-13 03:07:21,512 INFO HandlerThread:162 [system_monitor.py:probe():227] Publishing system info
2024-04-13 03:07:21,512 DEBUG HandlerThread:162 [system_info.py:_save_conda():207] Saving list of conda packages installed into the current environment
2024-04-13 03:07:22,380 INFO Thread-12 :162 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240413_030720-hqqism3w/files/conda-environment.yaml
2024-04-13 03:07:36,526 ERROR HandlerThread:162 [system_info.py:_save_conda():221] Error saving conda packages: Command '['conda', 'env', 'export']' timed out after 15 seconds
Traceback (most recent call last):
File "/opt/conda/lib/python3.10/site-packages/wandb/sdk/internal/system/system_info.py", line 214, in _save_conda
subprocess.call(
File "/opt/conda/lib/python3.10/subprocess.py", line 347, in call
return p.wait(timeout=timeout)
File "/opt/conda/lib/python3.10/subprocess.py", line 1209, in wait
return self._wait(timeout=timeout)
File "/opt/conda/lib/python3.10/subprocess.py", line 1951, in _wait
raise TimeoutExpired(self.args, timeout)
subprocess.TimeoutExpired: Command '['conda', 'env', 'export']' timed out after 15 seconds
2024-04-13 03:07:36,530 DEBUG HandlerThread:162 [system_info.py:_save_conda():222] Saving conda packages done
2024-04-13 03:07:36,530 INFO HandlerThread:162 [system_monitor.py:probe():229] Finished publishing system info
2024-04-13 03:07:36,538 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:07:36,538 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: keepalive
2024-04-13 03:07:36,538 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:07:36,538 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: keepalive
2024-04-13 03:07:36,538 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:07:36,538 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: keepalive
2024-04-13 03:07:36,539 DEBUG SenderThread:162 [sender.py:send():379] send: files
2024-04-13 03:07:36,539 INFO SenderThread:162 [sender.py:_save_file():1390] saving file wandb-metadata.json with policy now
2024-04-13 03:07:36,786 INFO wandb-upload_0:162 [upload_job.py:push():131] Uploaded file /tmp/tmp3ubn2n35wandb/jjsi9om0-wandb-metadata.json
2024-04-13 03:07:37,383 INFO Thread-12 :162 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240413_030720-hqqism3w/files/wandb-metadata.json
2024-04-13 03:07:37,560 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: python_packages
2024-04-13 03:07:37,560 DEBUG SenderThread:162 [sender.py:send_request():406] send_request: python_packages
2024-04-13 03:07:37,564 DEBUG SenderThread:162 [sender.py:send():379] send: telemetry
2024-04-13 03:07:37,574 DEBUG SenderThread:162 [sender.py:send():379] send: config
2024-04-13 03:07:37,576 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-13 03:07:37,576 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: stop_status
2024-04-13 03:07:37,577 DEBUG SenderThread:162 [sender.py:send():379] send: metric
2024-04-13 03:07:37,578 DEBUG SenderThread:162 [sender.py:send():379] send: telemetry
2024-04-13 03:07:37,578 DEBUG SenderThread:162 [sender.py:send_request():406] send_request: stop_status
2024-04-13 03:07:37,693 DEBUG SenderThread:162 [sender.py:send():379] send: metric
2024-04-13 03:07:37,693 WARNING SenderThread:162 [sender.py:send_metric():1341] Seen metric with glob (shouldn't happen)
2024-04-13 03:07:37,693 DEBUG SenderThread:162 [sender.py:send():379] send: telemetry
2024-04-13 03:07:38,383 INFO Thread-12 :162 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240413_030720-hqqism3w/files/requirements.txt
2024-04-13 03:07:39,384 INFO Thread-12 :162 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240413_030720-hqqism3w/files/output.log
2024-04-13 03:07:41,385 INFO Thread-12 :162 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240413_030720-hqqism3w/files/output.log
2024-04-13 03:07:41,723 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:07:46,724 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:07:51,730 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:07:52,389 INFO Thread-12 :162 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240413_030720-hqqism3w/files/config.yaml
2024-04-13 03:07:52,563 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: stop_status
2024-04-13 03:07:52,563 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-13 03:07:52,564 DEBUG SenderThread:162 [sender.py:send_request():406] send_request: stop_status
2024-04-13 03:07:57,681 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:08:02,682 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:08:07,563 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: stop_status
2024-04-13 03:08:07,563 DEBUG SenderThread:162 [sender.py:send_request():406] send_request: stop_status
2024-04-13 03:08:07,604 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-13 03:08:07,699 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:08:10,146 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: partial_history
2024-04-13 03:08:10,148 DEBUG SenderThread:162 [sender.py:send():379] send: metric
2024-04-13 03:08:10,148 DEBUG SenderThread:162 [sender.py:send():379] send: metric
2024-04-13 03:08:10,148 DEBUG SenderThread:162 [sender.py:send():379] send: metric
2024-04-13 03:08:10,148 DEBUG SenderThread:162 [sender.py:send():379] send: metric
2024-04-13 03:08:10,148 DEBUG SenderThread:162 [sender.py:send():379] send: history
2024-04-13 03:08:10,148 DEBUG SenderThread:162 [sender.py:send_request():406] send_request: summary_record
2024-04-13 03:08:10,150 INFO SenderThread:162 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
2024-04-13 03:08:10,396 INFO Thread-12 :162 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240413_030720-hqqism3w/files/wandb-summary.json
2024-04-13 03:08:12,938 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:08:13,397 INFO Thread-12 :162 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240413_030720-hqqism3w/files/output.log
2024-04-13 03:08:17,938 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:08:21,491 DEBUG SystemMonitor:162 [system_monitor.py:_start():172] Starting system metrics aggregation loop
2024-04-13 03:08:21,492 DEBUG SenderThread:162 [sender.py:send():379] send: stats
2024-04-13 03:08:22,561 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: stop_status
2024-04-13 03:08:22,562 DEBUG SenderThread:162 [sender.py:send_request():406] send_request: stop_status
2024-04-13 03:08:22,566 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-13 03:08:23,629 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:08:24,401 INFO Thread-12 :162 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240413_030720-hqqism3w/files/config.yaml
2024-04-13 03:08:28,735 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:08:33,736 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:08:37,561 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: stop_status
2024-04-13 03:08:37,562 DEBUG SenderThread:162 [sender.py:send_request():406] send_request: stop_status
2024-04-13 03:08:37,602 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-13 03:08:39,617 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:08:44,618 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:08:45,115 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: partial_history
2024-04-13 03:08:45,116 DEBUG SenderThread:162 [sender.py:send():379] send: history
2024-04-13 03:08:45,116 DEBUG SenderThread:162 [sender.py:send_request():406] send_request: summary_record
2024-04-13 03:08:45,118 INFO SenderThread:162 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
2024-04-13 03:08:45,409 INFO Thread-12 :162 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240413_030720-hqqism3w/files/wandb-summary.json
2024-04-13 03:08:47,410 INFO Thread-12 :162 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240413_030720-hqqism3w/files/output.log
2024-04-13 03:08:49,876 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:08:51,493 DEBUG SenderThread:162 [sender.py:send():379] send: stats
2024-04-13 03:08:52,562 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: stop_status
2024-04-13 03:08:52,562 DEBUG SenderThread:162 [sender.py:send_request():406] send_request: stop_status
2024-04-13 03:08:52,565 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-13 03:08:55,643 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:09:00,644 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:09:05,644 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:09:07,562 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: stop_status
2024-04-13 03:09:07,562 DEBUG SenderThread:162 [sender.py:send_request():406] send_request: stop_status
2024-04-13 03:09:07,603 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-13 03:09:10,690 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:09:15,691 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:09:20,692 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:09:21,494 DEBUG SenderThread:162 [sender.py:send():379] send: stats
2024-04-13 03:09:22,562 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: stop_status
2024-04-13 03:09:22,563 DEBUG SenderThread:162 [sender.py:send_request():406] send_request: stop_status
2024-04-13 03:09:22,603 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-13 03:09:26,621 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:09:31,010 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: partial_history
2024-04-13 03:09:31,012 DEBUG SenderThread:162 [sender.py:send():379] send: history
2024-04-13 03:09:31,012 DEBUG SenderThread:162 [sender.py:send_request():406] send_request: summary_record
2024-04-13 03:09:31,012 INFO SenderThread:162 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
2024-04-13 03:09:31,427 INFO Thread-12 :162 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240413_030720-hqqism3w/files/wandb-summary.json
2024-04-13 03:09:31,740 DEBUG HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report