File size: 20,449 Bytes
6db4657 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 |
2024-04-10 00:59:59,347 INFO StreamThr :446 [internal.py:wandb_internal():86] W&B internal server running at pid: 446, started at: 2024-04-10 00:59:59.347095
2024-04-10 00:59:59,349 DEBUG HandlerThread:446 [handler.py:handle_request():146] handle_request: status
2024-04-10 00:59:59,733 INFO WriterThread:446 [datastore.py:open_for_write():87] open: /kaggle/working/wandb/run-20240410_005959-52om3vq0/run-52om3vq0.wandb
2024-04-10 00:59:59,733 DEBUG SenderThread:446 [sender.py:send():379] send: header
2024-04-10 00:59:59,736 DEBUG SenderThread:446 [sender.py:send():379] send: run
2024-04-10 00:59:59,972 INFO SenderThread:446 [dir_watcher.py:__init__():211] watching files in: /kaggle/working/wandb/run-20240410_005959-52om3vq0/files
2024-04-10 00:59:59,972 INFO SenderThread:446 [sender.py:_start_run_threads():1124] run started: 52om3vq0 with start time 1712710799.346898
2024-04-10 00:59:59,980 DEBUG HandlerThread:446 [handler.py:handle_request():146] handle_request: check_version
2024-04-10 00:59:59,980 DEBUG SenderThread:446 [sender.py:send_request():406] send_request: check_version
2024-04-10 01:00:00,050 DEBUG HandlerThread:446 [handler.py:handle_request():146] handle_request: run_start
2024-04-10 01:00:00,061 DEBUG HandlerThread:446 [system_info.py:__init__():26] System info init
2024-04-10 01:00:00,061 DEBUG HandlerThread:446 [system_info.py:__init__():41] System info init done
2024-04-10 01:00:00,061 INFO HandlerThread:446 [system_monitor.py:start():194] Starting system monitor
2024-04-10 01:00:00,061 INFO SystemMonitor:446 [system_monitor.py:_start():158] Starting system asset monitoring threads
2024-04-10 01:00:00,061 INFO SystemMonitor:446 [interfaces.py:start():190] Started cpu monitoring
2024-04-10 01:00:00,062 INFO SystemMonitor:446 [interfaces.py:start():190] Started disk monitoring
2024-04-10 01:00:00,062 INFO HandlerThread:446 [system_monitor.py:probe():214] Collecting system info
2024-04-10 01:00:00,063 INFO SystemMonitor:446 [interfaces.py:start():190] Started gpu monitoring
2024-04-10 01:00:00,064 INFO SystemMonitor:446 [interfaces.py:start():190] Started memory monitoring
2024-04-10 01:00:00,065 INFO SystemMonitor:446 [interfaces.py:start():190] Started network monitoring
2024-04-10 01:00:00,075 DEBUG HandlerThread:446 [system_info.py:probe():150] Probing system
2024-04-10 01:00:00,077 DEBUG HandlerThread:446 [gitlib.py:_init_repo():56] git repository is invalid
2024-04-10 01:00:00,077 DEBUG HandlerThread:446 [system_info.py:probe():198] Probing system done
2024-04-10 01:00:00,077 DEBUG HandlerThread:446 [system_monitor.py:probe():223] {'os': 'Linux-5.15.133+-x86_64-with-glibc2.31', 'python': '3.10.13', 'heartbeatAt': '2024-04-10T01:00:00.075285', 'startedAt': '2024-04-10T00:59:59.340959', 'docker': None, 'cuda': None, 'args': (), 'state': 'running', 'program': 'kaggle.ipynb', 'codePathLocal': None, 'root': '/kaggle/working', 'host': 'd91c9dc8354a', 'username': 'root', 'executable': '/opt/conda/bin/python3.10', 'cpu_count': 2, 'cpu_count_logical': 4, 'cpu_freq': {'current': 2000.156, 'min': 0.0, 'max': 0.0}, 'cpu_freq_per_core': [{'current': 2000.156, 'min': 0.0, 'max': 0.0}, {'current': 2000.156, 'min': 0.0, 'max': 0.0}, {'current': 2000.156, 'min': 0.0, 'max': 0.0}, {'current': 2000.156, 'min': 0.0, 'max': 0.0}], 'disk': {'/': {'total': 8062.387607574463, 'used': 5569.50146484375}}, 'gpu': 'Tesla T4', 'gpu_count': 2, 'gpu_devices': [{'name': 'Tesla T4', 'memory_total': 16106127360}, {'name': 'Tesla T4', 'memory_total': 16106127360}], 'memory': {'total': 31.357559204101562}}
2024-04-10 01:00:00,077 INFO HandlerThread:446 [system_monitor.py:probe():224] Finished collecting system info
2024-04-10 01:00:00,077 INFO HandlerThread:446 [system_monitor.py:probe():227] Publishing system info
2024-04-10 01:00:00,077 DEBUG HandlerThread:446 [system_info.py:_save_conda():207] Saving list of conda packages installed into the current environment
2024-04-10 01:00:00,974 INFO Thread-12 :446 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240410_005959-52om3vq0/files/conda-environment.yaml
2024-04-10 01:00:15,091 ERROR HandlerThread:446 [system_info.py:_save_conda():221] Error saving conda packages: Command '['conda', 'env', 'export']' timed out after 15 seconds
Traceback (most recent call last):
File "/opt/conda/lib/python3.10/site-packages/wandb/sdk/internal/system/system_info.py", line 214, in _save_conda
subprocess.call(
File "/opt/conda/lib/python3.10/subprocess.py", line 347, in call
return p.wait(timeout=timeout)
File "/opt/conda/lib/python3.10/subprocess.py", line 1209, in wait
return self._wait(timeout=timeout)
File "/opt/conda/lib/python3.10/subprocess.py", line 1951, in _wait
raise TimeoutExpired(self.args, timeout)
subprocess.TimeoutExpired: Command '['conda', 'env', 'export']' timed out after 15 seconds
2024-04-10 01:00:15,092 DEBUG HandlerThread:446 [system_info.py:_save_conda():222] Saving conda packages done
2024-04-10 01:00:15,092 INFO HandlerThread:446 [system_monitor.py:probe():229] Finished publishing system info
2024-04-10 01:00:15,097 DEBUG HandlerThread:446 [handler.py:handle_request():146] handle_request: status_report
2024-04-10 01:00:15,097 DEBUG HandlerThread:446 [handler.py:handle_request():146] handle_request: keepalive
2024-04-10 01:00:15,098 DEBUG HandlerThread:446 [handler.py:handle_request():146] handle_request: status_report
2024-04-10 01:00:15,098 DEBUG HandlerThread:446 [handler.py:handle_request():146] handle_request: keepalive
2024-04-10 01:00:15,098 DEBUG HandlerThread:446 [handler.py:handle_request():146] handle_request: status_report
2024-04-10 01:00:15,098 DEBUG SenderThread:446 [sender.py:send():379] send: files
2024-04-10 01:00:15,098 INFO SenderThread:446 [sender.py:_save_file():1390] saving file wandb-metadata.json with policy now
2024-04-10 01:00:15,388 INFO wandb-upload_0:446 [upload_job.py:push():131] Uploaded file /tmp/tmp0ozzi1ivwandb/9tvoe406-wandb-metadata.json
2024-04-10 01:00:15,977 INFO Thread-12 :446 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240410_005959-52om3vq0/files/wandb-metadata.json
2024-04-10 01:00:17,470 DEBUG SenderThread:446 [sender.py:send():379] send: exit
2024-04-10 01:00:17,470 INFO SenderThread:446 [sender.py:send_exit():586] handling exit code: 0
2024-04-10 01:00:17,470 INFO SenderThread:446 [sender.py:send_exit():588] handling runtime: 17
2024-04-10 01:00:17,472 INFO SenderThread:446 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
2024-04-10 01:00:17,472 INFO SenderThread:446 [sender.py:send_exit():594] send defer
2024-04-10 01:00:17,472 DEBUG HandlerThread:446 [handler.py:handle_request():146] handle_request: defer
2024-04-10 01:00:17,472 INFO HandlerThread:446 [handler.py:handle_request_defer():172] handle defer: 0
2024-04-10 01:00:17,472 DEBUG SenderThread:446 [sender.py:send_request():406] send_request: defer
2024-04-10 01:00:17,472 INFO SenderThread:446 [sender.py:send_request_defer():610] handle sender defer: 0
2024-04-10 01:00:17,473 INFO SenderThread:446 [sender.py:transition_state():614] send defer: 1
2024-04-10 01:00:17,473 DEBUG HandlerThread:446 [handler.py:handle_request():146] handle_request: defer
2024-04-10 01:00:17,473 INFO HandlerThread:446 [handler.py:handle_request_defer():172] handle defer: 1
2024-04-10 01:00:17,473 DEBUG SenderThread:446 [sender.py:send_request():406] send_request: defer
2024-04-10 01:00:17,473 INFO SenderThread:446 [sender.py:send_request_defer():610] handle sender defer: 1
2024-04-10 01:00:17,473 INFO SenderThread:446 [sender.py:transition_state():614] send defer: 2
2024-04-10 01:00:17,473 DEBUG HandlerThread:446 [handler.py:handle_request():146] handle_request: defer
2024-04-10 01:00:17,473 INFO HandlerThread:446 [handler.py:handle_request_defer():172] handle defer: 2
2024-04-10 01:00:17,473 INFO HandlerThread:446 [system_monitor.py:finish():203] Stopping system monitor
2024-04-10 01:00:17,474 DEBUG SystemMonitor:446 [system_monitor.py:_start():172] Starting system metrics aggregation loop
2024-04-10 01:00:17,474 DEBUG SystemMonitor:446 [system_monitor.py:_start():179] Finished system metrics aggregation loop
2024-04-10 01:00:17,474 DEBUG SystemMonitor:446 [system_monitor.py:_start():183] Publishing last batch of metrics
2024-04-10 01:00:17,474 INFO HandlerThread:446 [interfaces.py:finish():202] Joined cpu monitor
2024-04-10 01:00:17,475 INFO HandlerThread:446 [interfaces.py:finish():202] Joined disk monitor
2024-04-10 01:00:17,489 INFO HandlerThread:446 [interfaces.py:finish():202] Joined gpu monitor
2024-04-10 01:00:17,489 INFO HandlerThread:446 [interfaces.py:finish():202] Joined memory monitor
2024-04-10 01:00:17,489 INFO HandlerThread:446 [interfaces.py:finish():202] Joined network monitor
2024-04-10 01:00:17,490 DEBUG SenderThread:446 [sender.py:send_request():406] send_request: defer
2024-04-10 01:00:17,490 INFO SenderThread:446 [sender.py:send_request_defer():610] handle sender defer: 2
2024-04-10 01:00:17,490 INFO SenderThread:446 [sender.py:transition_state():614] send defer: 3
2024-04-10 01:00:17,490 DEBUG SenderThread:446 [sender.py:send():379] send: stats
2024-04-10 01:00:17,490 DEBUG HandlerThread:446 [handler.py:handle_request():146] handle_request: defer
2024-04-10 01:00:17,490 INFO HandlerThread:446 [handler.py:handle_request_defer():172] handle defer: 3
2024-04-10 01:00:17,490 DEBUG SenderThread:446 [sender.py:send_request():406] send_request: defer
2024-04-10 01:00:17,491 INFO SenderThread:446 [sender.py:send_request_defer():610] handle sender defer: 3
2024-04-10 01:00:17,491 INFO SenderThread:446 [sender.py:transition_state():614] send defer: 4
2024-04-10 01:00:17,491 DEBUG HandlerThread:446 [handler.py:handle_request():146] handle_request: defer
2024-04-10 01:00:17,491 INFO HandlerThread:446 [handler.py:handle_request_defer():172] handle defer: 4
2024-04-10 01:00:17,491 DEBUG SenderThread:446 [sender.py:send_request():406] send_request: defer
2024-04-10 01:00:17,491 INFO SenderThread:446 [sender.py:send_request_defer():610] handle sender defer: 4
2024-04-10 01:00:17,491 INFO SenderThread:446 [sender.py:transition_state():614] send defer: 5
2024-04-10 01:00:17,491 DEBUG HandlerThread:446 [handler.py:handle_request():146] handle_request: defer
2024-04-10 01:00:17,491 INFO HandlerThread:446 [handler.py:handle_request_defer():172] handle defer: 5
2024-04-10 01:00:17,492 DEBUG SenderThread:446 [sender.py:send():379] send: summary
2024-04-10 01:00:17,492 INFO SenderThread:446 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
2024-04-10 01:00:17,492 DEBUG SenderThread:446 [sender.py:send_request():406] send_request: defer
2024-04-10 01:00:17,492 INFO SenderThread:446 [sender.py:send_request_defer():610] handle sender defer: 5
2024-04-10 01:00:17,492 INFO SenderThread:446 [sender.py:transition_state():614] send defer: 6
2024-04-10 01:00:17,492 DEBUG HandlerThread:446 [handler.py:handle_request():146] handle_request: defer
2024-04-10 01:00:17,492 INFO HandlerThread:446 [handler.py:handle_request_defer():172] handle defer: 6
2024-04-10 01:00:17,493 DEBUG SenderThread:446 [sender.py:send_request():406] send_request: defer
2024-04-10 01:00:17,493 INFO SenderThread:446 [sender.py:send_request_defer():610] handle sender defer: 6
2024-04-10 01:00:17,493 INFO SenderThread:446 [sender.py:transition_state():614] send defer: 7
2024-04-10 01:00:17,493 DEBUG HandlerThread:446 [handler.py:handle_request():146] handle_request: status_report
2024-04-10 01:00:17,493 DEBUG HandlerThread:446 [handler.py:handle_request():146] handle_request: defer
2024-04-10 01:00:17,493 INFO HandlerThread:446 [handler.py:handle_request_defer():172] handle defer: 7
2024-04-10 01:00:17,493 DEBUG SenderThread:446 [sender.py:send_request():406] send_request: defer
2024-04-10 01:00:17,493 INFO SenderThread:446 [sender.py:send_request_defer():610] handle sender defer: 7
2024-04-10 01:00:17,494 INFO SenderThread:446 [sender.py:transition_state():614] send defer: 8
2024-04-10 01:00:17,494 DEBUG HandlerThread:446 [handler.py:handle_request():146] handle_request: defer
2024-04-10 01:00:17,494 INFO HandlerThread:446 [handler.py:handle_request_defer():172] handle defer: 8
2024-04-10 01:00:17,494 DEBUG SenderThread:446 [sender.py:send_request():406] send_request: defer
2024-04-10 01:00:17,494 INFO SenderThread:446 [sender.py:send_request_defer():610] handle sender defer: 8
2024-04-10 01:00:17,494 INFO SenderThread:446 [job_builder.py:build():318] Attempting to build job artifact
2024-04-10 01:00:17,494 WARNING SenderThread:446 [job_builder.py:_log_if_verbose():210] No requirements.txt found, not creating job artifact. See https://docs.wandb.ai/guides/launch/create-job
2024-04-10 01:00:17,494 INFO SenderThread:446 [sender.py:transition_state():614] send defer: 9
2024-04-10 01:00:17,494 DEBUG HandlerThread:446 [handler.py:handle_request():146] handle_request: defer
2024-04-10 01:00:17,495 INFO HandlerThread:446 [handler.py:handle_request_defer():172] handle defer: 9
2024-04-10 01:00:17,495 DEBUG SenderThread:446 [sender.py:send_request():406] send_request: defer
2024-04-10 01:00:17,495 INFO SenderThread:446 [sender.py:send_request_defer():610] handle sender defer: 9
2024-04-10 01:00:17,495 INFO SenderThread:446 [dir_watcher.py:finish():358] shutting down directory watcher
2024-04-10 01:00:17,978 INFO SenderThread:446 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240410_005959-52om3vq0/files/wandb-summary.json
2024-04-10 01:00:17,979 INFO SenderThread:446 [dir_watcher.py:finish():388] scan: /kaggle/working/wandb/run-20240410_005959-52om3vq0/files
2024-04-10 01:00:17,979 INFO SenderThread:446 [dir_watcher.py:finish():402] scan save: /kaggle/working/wandb/run-20240410_005959-52om3vq0/files/wandb-summary.json wandb-summary.json
2024-04-10 01:00:17,979 INFO SenderThread:446 [dir_watcher.py:finish():402] scan save: /kaggle/working/wandb/run-20240410_005959-52om3vq0/files/conda-environment.yaml conda-environment.yaml
2024-04-10 01:00:17,979 INFO SenderThread:446 [dir_watcher.py:finish():402] scan save: /kaggle/working/wandb/run-20240410_005959-52om3vq0/files/wandb-metadata.json wandb-metadata.json
2024-04-10 01:00:17,982 INFO SenderThread:446 [dir_watcher.py:finish():402] scan save: /kaggle/working/wandb/run-20240410_005959-52om3vq0/files/config.yaml config.yaml
2024-04-10 01:00:17,983 INFO SenderThread:446 [sender.py:transition_state():614] send defer: 10
2024-04-10 01:00:17,986 DEBUG HandlerThread:446 [handler.py:handle_request():146] handle_request: defer
2024-04-10 01:00:17,987 INFO HandlerThread:446 [handler.py:handle_request_defer():172] handle defer: 10
2024-04-10 01:00:17,988 DEBUG SenderThread:446 [sender.py:send_request():406] send_request: defer
2024-04-10 01:00:17,988 INFO SenderThread:446 [sender.py:send_request_defer():610] handle sender defer: 10
2024-04-10 01:00:17,988 INFO SenderThread:446 [file_pusher.py:finish():172] shutting down file pusher
2024-04-10 01:00:18,197 INFO wandb-upload_0:446 [upload_job.py:push():131] Uploaded file /kaggle/working/wandb/run-20240410_005959-52om3vq0/files/wandb-summary.json
2024-04-10 01:00:18,241 INFO wandb-upload_1:446 [upload_job.py:push():131] Uploaded file /kaggle/working/wandb/run-20240410_005959-52om3vq0/files/config.yaml
2024-04-10 01:00:18,442 INFO Thread-11 (_thread_body):446 [sender.py:transition_state():614] send defer: 11
2024-04-10 01:00:18,442 DEBUG HandlerThread:446 [handler.py:handle_request():146] handle_request: defer
2024-04-10 01:00:18,442 INFO HandlerThread:446 [handler.py:handle_request_defer():172] handle defer: 11
2024-04-10 01:00:18,442 DEBUG SenderThread:446 [sender.py:send_request():406] send_request: defer
2024-04-10 01:00:18,442 INFO SenderThread:446 [sender.py:send_request_defer():610] handle sender defer: 11
2024-04-10 01:00:18,442 INFO SenderThread:446 [file_pusher.py:join():178] waiting for file pusher
2024-04-10 01:00:18,443 INFO SenderThread:446 [sender.py:transition_state():614] send defer: 12
2024-04-10 01:00:18,443 DEBUG HandlerThread:446 [handler.py:handle_request():146] handle_request: defer
2024-04-10 01:00:18,443 INFO HandlerThread:446 [handler.py:handle_request_defer():172] handle defer: 12
2024-04-10 01:00:18,443 DEBUG SenderThread:446 [sender.py:send_request():406] send_request: defer
2024-04-10 01:00:18,443 INFO SenderThread:446 [sender.py:send_request_defer():610] handle sender defer: 12
2024-04-10 01:00:18,443 INFO SenderThread:446 [file_stream.py:finish():614] file stream finish called
2024-04-10 01:00:18,469 DEBUG HandlerThread:446 [handler.py:handle_request():146] handle_request: poll_exit
2024-04-10 01:00:18,628 INFO SenderThread:446 [file_stream.py:finish():618] file stream finish is done
2024-04-10 01:00:18,628 INFO SenderThread:446 [sender.py:transition_state():614] send defer: 13
2024-04-10 01:00:18,628 DEBUG SenderThread:446 [sender.py:send_request():406] send_request: poll_exit
2024-04-10 01:00:18,628 DEBUG HandlerThread:446 [handler.py:handle_request():146] handle_request: defer
2024-04-10 01:00:18,628 INFO HandlerThread:446 [handler.py:handle_request_defer():172] handle defer: 13
2024-04-10 01:00:18,629 DEBUG SenderThread:446 [sender.py:send_request():406] send_request: defer
2024-04-10 01:00:18,629 INFO SenderThread:446 [sender.py:send_request_defer():610] handle sender defer: 13
2024-04-10 01:00:18,629 INFO SenderThread:446 [sender.py:transition_state():614] send defer: 14
2024-04-10 01:00:18,629 DEBUG HandlerThread:446 [handler.py:handle_request():146] handle_request: defer
2024-04-10 01:00:18,629 INFO HandlerThread:446 [handler.py:handle_request_defer():172] handle defer: 14
2024-04-10 01:00:18,629 DEBUG SenderThread:446 [sender.py:send():379] send: final
2024-04-10 01:00:18,629 DEBUG SenderThread:446 [sender.py:send():379] send: footer
2024-04-10 01:00:18,629 DEBUG SenderThread:446 [sender.py:send_request():406] send_request: defer
2024-04-10 01:00:18,629 INFO SenderThread:446 [sender.py:send_request_defer():610] handle sender defer: 14
2024-04-10 01:00:18,631 DEBUG HandlerThread:446 [handler.py:handle_request():146] handle_request: poll_exit
2024-04-10 01:00:18,631 DEBUG SenderThread:446 [sender.py:send_request():406] send_request: poll_exit
2024-04-10 01:00:18,631 DEBUG HandlerThread:446 [handler.py:handle_request():146] handle_request: poll_exit
2024-04-10 01:00:18,632 DEBUG SenderThread:446 [sender.py:send_request():406] send_request: poll_exit
2024-04-10 01:00:18,632 DEBUG HandlerThread:446 [handler.py:handle_request():146] handle_request: server_info
2024-04-10 01:00:18,632 DEBUG SenderThread:446 [sender.py:send_request():406] send_request: server_info
2024-04-10 01:00:18,635 DEBUG HandlerThread:446 [handler.py:handle_request():146] handle_request: get_summary
2024-04-10 01:00:18,636 DEBUG HandlerThread:446 [handler.py:handle_request():146] handle_request: sampled_history
2024-04-10 01:00:18,636 DEBUG HandlerThread:446 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-10 01:00:18,691 INFO MainThread:446 [wandb_run.py:_footer_history_summary_info():3920] rendering history
2024-04-10 01:00:18,691 INFO MainThread:446 [wandb_run.py:_footer_history_summary_info():3952] rendering summary
2024-04-10 01:00:18,691 INFO MainThread:446 [wandb_run.py:_footer_sync_info():3879] logging synced files
2024-04-10 01:00:18,692 DEBUG HandlerThread:446 [handler.py:handle_request():146] handle_request: shutdown
2024-04-10 01:00:18,692 INFO HandlerThread:446 [handler.py:finish():866] shutting down handler
2024-04-10 01:00:19,633 INFO WriterThread:446 [datastore.py:close():296] close: /kaggle/working/wandb/run-20240410_005959-52om3vq0/run-52om3vq0.wandb
2024-04-10 01:00:19,691 INFO SenderThread:446 [sender.py:finish():1546] shutting down sender
2024-04-10 01:00:19,691 INFO SenderThread:446 [file_pusher.py:finish():172] shutting down file pusher
2024-04-10 01:00:19,691 INFO SenderThread:446 [file_pusher.py:join():178] waiting for file pusher
|