File size: 16,239 Bytes
ad67729
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
c8aef10
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
5a70ff6
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
2024-04-13 03:07:20,811 INFO    StreamThr :162 [internal.py:wandb_internal():86] W&B internal server running at pid: 162, started at: 2024-04-13 03:07:20.811043
2024-04-13 03:07:20,813 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: status
2024-04-13 03:07:21,260 INFO    WriterThread:162 [datastore.py:open_for_write():87] open: /kaggle/working/wandb/run-20240413_030720-hqqism3w/run-hqqism3w.wandb
2024-04-13 03:07:21,261 DEBUG   SenderThread:162 [sender.py:send():379] send: header
2024-04-13 03:07:21,264 DEBUG   SenderThread:162 [sender.py:send():379] send: run
2024-04-13 03:07:21,377 INFO    SenderThread:162 [dir_watcher.py:__init__():211] watching files in: /kaggle/working/wandb/run-20240413_030720-hqqism3w/files
2024-04-13 03:07:21,378 INFO    SenderThread:162 [sender.py:_start_run_threads():1124] run started: hqqism3w with start time 1712977640.812572
2024-04-13 03:07:21,385 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: check_version
2024-04-13 03:07:21,385 DEBUG   SenderThread:162 [sender.py:send_request():406] send_request: check_version
2024-04-13 03:07:21,476 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: run_start
2024-04-13 03:07:21,487 DEBUG   HandlerThread:162 [system_info.py:__init__():26] System info init
2024-04-13 03:07:21,487 DEBUG   HandlerThread:162 [system_info.py:__init__():41] System info init done
2024-04-13 03:07:21,487 INFO    HandlerThread:162 [system_monitor.py:start():194] Starting system monitor
2024-04-13 03:07:21,487 INFO    SystemMonitor:162 [system_monitor.py:_start():158] Starting system asset monitoring threads
2024-04-13 03:07:21,488 INFO    HandlerThread:162 [system_monitor.py:probe():214] Collecting system info
2024-04-13 03:07:21,488 INFO    SystemMonitor:162 [interfaces.py:start():190] Started cpu monitoring
2024-04-13 03:07:21,488 INFO    SystemMonitor:162 [interfaces.py:start():190] Started disk monitoring
2024-04-13 03:07:21,489 INFO    SystemMonitor:162 [interfaces.py:start():190] Started gpu monitoring
2024-04-13 03:07:21,490 INFO    SystemMonitor:162 [interfaces.py:start():190] Started memory monitoring
2024-04-13 03:07:21,490 INFO    SystemMonitor:162 [interfaces.py:start():190] Started network monitoring
2024-04-13 03:07:21,509 DEBUG   HandlerThread:162 [system_info.py:probe():150] Probing system
2024-04-13 03:07:21,511 DEBUG   HandlerThread:162 [gitlib.py:_init_repo():56] git repository is invalid
2024-04-13 03:07:21,511 DEBUG   HandlerThread:162 [system_info.py:probe():198] Probing system done
2024-04-13 03:07:21,512 DEBUG   HandlerThread:162 [system_monitor.py:probe():223] {'os': 'Linux-5.15.133+-x86_64-with-glibc2.31', 'python': '3.10.13', 'heartbeatAt': '2024-04-13T03:07:21.509640', 'startedAt': '2024-04-13T03:07:20.804476', 'docker': None, 'cuda': None, 'args': (), 'state': 'running', 'program': 'kaggle.ipynb', 'codePathLocal': None, 'root': '/kaggle/working', 'host': '4be9d1bc899e', 'username': 'root', 'executable': '/opt/conda/bin/python3.10', 'cpu_count': 2, 'cpu_count_logical': 4, 'cpu_freq': {'current': 2000.156, 'min': 0.0, 'max': 0.0}, 'cpu_freq_per_core': [{'current': 2000.156, 'min': 0.0, 'max': 0.0}, {'current': 2000.156, 'min': 0.0, 'max': 0.0}, {'current': 2000.156, 'min': 0.0, 'max': 0.0}, {'current': 2000.156, 'min': 0.0, 'max': 0.0}], 'disk': {'/': {'total': 8062.387607574463, 'used': 5569.521141052246}}, 'gpu': 'Tesla T4', 'gpu_count': 2, 'gpu_devices': [{'name': 'Tesla T4', 'memory_total': 16106127360}, {'name': 'Tesla T4', 'memory_total': 16106127360}], 'memory': {'total': 31.357559204101562}}
2024-04-13 03:07:21,512 INFO    HandlerThread:162 [system_monitor.py:probe():224] Finished collecting system info
2024-04-13 03:07:21,512 INFO    HandlerThread:162 [system_monitor.py:probe():227] Publishing system info
2024-04-13 03:07:21,512 DEBUG   HandlerThread:162 [system_info.py:_save_conda():207] Saving list of conda packages installed into the current environment
2024-04-13 03:07:22,380 INFO    Thread-12 :162 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240413_030720-hqqism3w/files/conda-environment.yaml
2024-04-13 03:07:36,526 ERROR   HandlerThread:162 [system_info.py:_save_conda():221] Error saving conda packages: Command '['conda', 'env', 'export']' timed out after 15 seconds
Traceback (most recent call last):
  File "/opt/conda/lib/python3.10/site-packages/wandb/sdk/internal/system/system_info.py", line 214, in _save_conda
    subprocess.call(
  File "/opt/conda/lib/python3.10/subprocess.py", line 347, in call
    return p.wait(timeout=timeout)
  File "/opt/conda/lib/python3.10/subprocess.py", line 1209, in wait
    return self._wait(timeout=timeout)
  File "/opt/conda/lib/python3.10/subprocess.py", line 1951, in _wait
    raise TimeoutExpired(self.args, timeout)
subprocess.TimeoutExpired: Command '['conda', 'env', 'export']' timed out after 15 seconds
2024-04-13 03:07:36,530 DEBUG   HandlerThread:162 [system_info.py:_save_conda():222] Saving conda packages done
2024-04-13 03:07:36,530 INFO    HandlerThread:162 [system_monitor.py:probe():229] Finished publishing system info
2024-04-13 03:07:36,538 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:07:36,538 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: keepalive
2024-04-13 03:07:36,538 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:07:36,538 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: keepalive
2024-04-13 03:07:36,538 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:07:36,538 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: keepalive
2024-04-13 03:07:36,539 DEBUG   SenderThread:162 [sender.py:send():379] send: files
2024-04-13 03:07:36,539 INFO    SenderThread:162 [sender.py:_save_file():1390] saving file wandb-metadata.json with policy now
2024-04-13 03:07:36,786 INFO    wandb-upload_0:162 [upload_job.py:push():131] Uploaded file /tmp/tmp3ubn2n35wandb/jjsi9om0-wandb-metadata.json
2024-04-13 03:07:37,383 INFO    Thread-12 :162 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240413_030720-hqqism3w/files/wandb-metadata.json
2024-04-13 03:07:37,560 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: python_packages
2024-04-13 03:07:37,560 DEBUG   SenderThread:162 [sender.py:send_request():406] send_request: python_packages
2024-04-13 03:07:37,564 DEBUG   SenderThread:162 [sender.py:send():379] send: telemetry
2024-04-13 03:07:37,574 DEBUG   SenderThread:162 [sender.py:send():379] send: config
2024-04-13 03:07:37,576 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-13 03:07:37,576 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: stop_status
2024-04-13 03:07:37,577 DEBUG   SenderThread:162 [sender.py:send():379] send: metric
2024-04-13 03:07:37,578 DEBUG   SenderThread:162 [sender.py:send():379] send: telemetry
2024-04-13 03:07:37,578 DEBUG   SenderThread:162 [sender.py:send_request():406] send_request: stop_status
2024-04-13 03:07:37,693 DEBUG   SenderThread:162 [sender.py:send():379] send: metric
2024-04-13 03:07:37,693 WARNING SenderThread:162 [sender.py:send_metric():1341] Seen metric with glob (shouldn't happen)
2024-04-13 03:07:37,693 DEBUG   SenderThread:162 [sender.py:send():379] send: telemetry
2024-04-13 03:07:38,383 INFO    Thread-12 :162 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240413_030720-hqqism3w/files/requirements.txt
2024-04-13 03:07:39,384 INFO    Thread-12 :162 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240413_030720-hqqism3w/files/output.log
2024-04-13 03:07:41,385 INFO    Thread-12 :162 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240413_030720-hqqism3w/files/output.log
2024-04-13 03:07:41,723 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:07:46,724 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:07:51,730 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:07:52,389 INFO    Thread-12 :162 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240413_030720-hqqism3w/files/config.yaml
2024-04-13 03:07:52,563 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: stop_status
2024-04-13 03:07:52,563 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-13 03:07:52,564 DEBUG   SenderThread:162 [sender.py:send_request():406] send_request: stop_status
2024-04-13 03:07:57,681 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:08:02,682 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:08:07,563 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: stop_status
2024-04-13 03:08:07,563 DEBUG   SenderThread:162 [sender.py:send_request():406] send_request: stop_status
2024-04-13 03:08:07,604 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-13 03:08:07,699 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:08:10,146 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: partial_history
2024-04-13 03:08:10,148 DEBUG   SenderThread:162 [sender.py:send():379] send: metric
2024-04-13 03:08:10,148 DEBUG   SenderThread:162 [sender.py:send():379] send: metric
2024-04-13 03:08:10,148 DEBUG   SenderThread:162 [sender.py:send():379] send: metric
2024-04-13 03:08:10,148 DEBUG   SenderThread:162 [sender.py:send():379] send: metric
2024-04-13 03:08:10,148 DEBUG   SenderThread:162 [sender.py:send():379] send: history
2024-04-13 03:08:10,148 DEBUG   SenderThread:162 [sender.py:send_request():406] send_request: summary_record
2024-04-13 03:08:10,150 INFO    SenderThread:162 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
2024-04-13 03:08:10,396 INFO    Thread-12 :162 [dir_watcher.py:_on_file_created():271] file/dir created: /kaggle/working/wandb/run-20240413_030720-hqqism3w/files/wandb-summary.json
2024-04-13 03:08:12,938 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:08:13,397 INFO    Thread-12 :162 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240413_030720-hqqism3w/files/output.log
2024-04-13 03:08:17,938 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:08:21,491 DEBUG   SystemMonitor:162 [system_monitor.py:_start():172] Starting system metrics aggregation loop
2024-04-13 03:08:21,492 DEBUG   SenderThread:162 [sender.py:send():379] send: stats
2024-04-13 03:08:22,561 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: stop_status
2024-04-13 03:08:22,562 DEBUG   SenderThread:162 [sender.py:send_request():406] send_request: stop_status
2024-04-13 03:08:22,566 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-13 03:08:23,629 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:08:24,401 INFO    Thread-12 :162 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240413_030720-hqqism3w/files/config.yaml
2024-04-13 03:08:28,735 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:08:33,736 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:08:37,561 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: stop_status
2024-04-13 03:08:37,562 DEBUG   SenderThread:162 [sender.py:send_request():406] send_request: stop_status
2024-04-13 03:08:37,602 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-13 03:08:39,617 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:08:44,618 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:08:45,115 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: partial_history
2024-04-13 03:08:45,116 DEBUG   SenderThread:162 [sender.py:send():379] send: history
2024-04-13 03:08:45,116 DEBUG   SenderThread:162 [sender.py:send_request():406] send_request: summary_record
2024-04-13 03:08:45,118 INFO    SenderThread:162 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
2024-04-13 03:08:45,409 INFO    Thread-12 :162 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240413_030720-hqqism3w/files/wandb-summary.json
2024-04-13 03:08:47,410 INFO    Thread-12 :162 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240413_030720-hqqism3w/files/output.log
2024-04-13 03:08:49,876 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:08:51,493 DEBUG   SenderThread:162 [sender.py:send():379] send: stats
2024-04-13 03:08:52,562 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: stop_status
2024-04-13 03:08:52,562 DEBUG   SenderThread:162 [sender.py:send_request():406] send_request: stop_status
2024-04-13 03:08:52,565 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-13 03:08:55,643 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:09:00,644 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:09:05,644 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:09:07,562 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: stop_status
2024-04-13 03:09:07,562 DEBUG   SenderThread:162 [sender.py:send_request():406] send_request: stop_status
2024-04-13 03:09:07,603 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-13 03:09:10,690 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:09:15,691 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:09:20,692 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:09:21,494 DEBUG   SenderThread:162 [sender.py:send():379] send: stats
2024-04-13 03:09:22,562 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: stop_status
2024-04-13 03:09:22,563 DEBUG   SenderThread:162 [sender.py:send_request():406] send_request: stop_status
2024-04-13 03:09:22,603 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: internal_messages
2024-04-13 03:09:26,621 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report
2024-04-13 03:09:31,010 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: partial_history
2024-04-13 03:09:31,012 DEBUG   SenderThread:162 [sender.py:send():379] send: history
2024-04-13 03:09:31,012 DEBUG   SenderThread:162 [sender.py:send_request():406] send_request: summary_record
2024-04-13 03:09:31,012 INFO    SenderThread:162 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
2024-04-13 03:09:31,427 INFO    Thread-12 :162 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240413_030720-hqqism3w/files/wandb-summary.json
2024-04-13 03:09:31,740 DEBUG   HandlerThread:162 [handler.py:handle_request():146] handle_request: status_report