|
[2025-01-15 22:08:28,344 I 4679 4679] (gcs_server) gcs_server_main.cc:52: Ray cluster metadata ray_version=2.40.0 ray_commit=22541c38dbef25286cd6d19f1c151bf4fd62f2ed |
|
[2025-01-15 22:08:28,345 I 4679 4679] (gcs_server) io_service_pool.cc:35: IOServicePool is running with 1 io_service. |
|
[2025-01-15 22:08:28,351 I 4679 4679] (gcs_server) event.cc:493: Ray Event initialized for GCS |
|
[2025-01-15 22:08:28,351 I 4679 4679] (gcs_server) event.cc:493: Ray Event initialized for EXPORT_NODE |
|
[2025-01-15 22:08:28,351 I 4679 4679] (gcs_server) event.cc:493: Ray Event initialized for EXPORT_ACTOR |
|
[2025-01-15 22:08:28,351 I 4679 4679] (gcs_server) event.cc:493: Ray Event initialized for EXPORT_DRIVER_JOB |
|
[2025-01-15 22:08:28,351 I 4679 4679] (gcs_server) event.cc:324: Set ray event level to warning |
|
[2025-01-15 22:08:28,357 I 4679 4679] (gcs_server) gcs_server.cc:73: GCS storage type is StorageType::IN_MEMORY |
|
[2025-01-15 22:08:28,360 I 4679 4679] (gcs_server) gcs_init_data.cc:42: Loading job table data. |
|
[2025-01-15 22:08:28,360 I 4679 4679] (gcs_server) gcs_init_data.cc:54: Loading node table data. |
|
[2025-01-15 22:08:28,360 I 4679 4679] (gcs_server) gcs_init_data.cc:80: Loading actor table data. |
|
[2025-01-15 22:08:28,360 I 4679 4679] (gcs_server) gcs_init_data.cc:93: Loading actor task spec table data. |
|
[2025-01-15 22:08:28,360 I 4679 4679] (gcs_server) gcs_init_data.cc:66: Loading placement group table data. |
|
[2025-01-15 22:08:28,360 I 4679 4679] (gcs_server) gcs_init_data.cc:46: Finished loading job table data, size = 0 |
|
[2025-01-15 22:08:28,360 I 4679 4679] (gcs_server) gcs_init_data.cc:58: Finished loading node table data, size = 0 |
|
[2025-01-15 22:08:28,360 I 4679 4679] (gcs_server) gcs_init_data.cc:84: Finished loading actor table data, size = 0 |
|
[2025-01-15 22:08:28,360 I 4679 4679] (gcs_server) gcs_init_data.cc:97: Finished loading actor task spec table data, size = 0 |
|
[2025-01-15 22:08:28,360 I 4679 4679] (gcs_server) gcs_init_data.cc:71: Finished loading placement group table data, size = 0 |
|
[2025-01-15 22:08:28,360 I 4679 4679] (gcs_server) gcs_server.cc:162: No existing server cluster ID found. Generating new ID: f7f01344024dc7c8bd522189074e80daef59427598c9f82cb7d2eec8 |
|
[2025-01-15 22:08:28,361 I 4679 4679] (gcs_server) gcs_server.cc:644: Autoscaler V2 enabled: 0 |
|
[2025-01-15 22:08:28,364 I 4679 4679] (gcs_server) grpc_server.cc:134: GcsServer server started, listening on port 45980. |
|
[2025-01-15 22:08:28,650 I 4679 4679] (gcs_server) gcs_server.cc:245: Gcs Debug state: |
|
|
|
GcsNodeManager: |
|
- RegisterNode request count: 0 |
|
- DrainNode request count: 0 |
|
- GetAllNodeInfo request count: 0 |
|
|
|
GcsActorManager: |
|
- RegisterActor request count: 0 |
|
- CreateActor request count: 0 |
|
- GetActorInfo request count: 0 |
|
- GetNamedActorInfo request count: 0 |
|
- GetAllActorInfo request count: 0 |
|
- KillActor request count: 0 |
|
- ListNamedActors request count: 0 |
|
- Registered actors count: 0 |
|
- Destroyed actors count: 0 |
|
- Named actors count: 0 |
|
- Unresolved actors count: 0 |
|
- Pending actors count: 0 |
|
- Created actors count: 0 |
|
- owners_: 0 |
|
- actor_to_register_callbacks_: 0 |
|
- actor_to_restart_callbacks_: 0 |
|
- actor_to_create_callbacks_: 0 |
|
- sorted_destroyed_actor_list_: 0 |
|
|
|
GcsResourceManager: |
|
- GetAllAvailableResources request count: 0 |
|
- GetAllTotalResources request count: 0 |
|
- GetAllResourceUsage request count: 0 |
|
|
|
GcsPlacementGroupManager: |
|
- CreatePlacementGroup request count: 0 |
|
- RemovePlacementGroup request count: 0 |
|
- GetPlacementGroup request count: 0 |
|
- GetAllPlacementGroup request count: 0 |
|
- WaitPlacementGroupUntilReady request count: 0 |
|
- GetNamedPlacementGroup request count: 0 |
|
- Scheduling pending placement group count: 0 |
|
- Registered placement groups count: 0 |
|
- Named placement group count: 0 |
|
- Pending placement groups count: 0 |
|
- Infeasible placement groups count: 0 |
|
|
|
Publisher: |
|
|
|
[runtime env manager] ID to URIs table: |
|
[runtime env manager] URIs reference table: |
|
|
|
GcsTaskManager: |
|
-Total num task events reported: 0 |
|
-Total num status task events dropped: 0 |
|
-Total num profile events dropped: 0 |
|
-Current num of task events stored: 0 |
|
-Total num of actor creation tasks: 0 |
|
-Total num of actor tasks: 0 |
|
-Total num of normal tasks: 0 |
|
-Total num of driver tasks: 0 |
|
|
|
GcsAutoscalerStateManager: |
|
- last_seen_autoscaler_state_version_: 0 |
|
- last_cluster_resource_state_version_: 0 |
|
- pending demands: |
|
|
|
|
|
|
|
[2025-01-15 22:08:28,651 I 4679 4679] (gcs_server) gcs_server.cc:843: Main service Event stats: |
|
|
|
|
|
Global stats: 25 total (5 active) |
|
Queueing time: mean = 102.966 ms, max = 285.263 ms, min = 4.253 us, total = 2.574 s |
|
Execution time: mean = 11.607 ms, total = 290.170 ms |
|
Event stats: |
|
GcsInMemoryStore.Put - 9 total (0 active), Execution time: mean = 31.701 ms, total = 285.311 ms, Queueing time: mean = 220.757 ms, max = 284.301 ms, min = 4.253 us, total = 1.987 s |
|
GcsInMemoryStore.GetAll - 5 total (0 active), Execution time: mean = 17.248 us, total = 86.240 us, Queueing time: mean = 110.600 us, max = 126.060 us, min = 96.507 us, total = 552.998 us |
|
PeriodicalRunner.RunFnPeriodically - 4 total (2 active, 1 running), Execution time: mean = 3.583 us, total = 14.334 us, Queueing time: mean = 142.577 ms, max = 285.263 ms, min = 285.047 ms, total = 570.310 ms |
|
event_loop_lag_probe - 2 total (0 active), Execution time: mean = 15.429 us, total = 30.858 us, Queueing time: mean = 7.701 ms, max = 15.021 ms, min = 381.543 us, total = 15.402 ms |
|
NodeInfoGcsService.grpc_server.GetClusterId - 1 total (1 active), Execution time: mean = 0.000 s, total = 0.000 s, Queueing time: mean = 0.000 s, max = -0.000 s, min = 9223372036.855 s, total = 0.000 s |
|
RayletLoadPulled - 1 total (1 active), Execution time: mean = 0.000 s, total = 0.000 s, Queueing time: mean = 0.000 s, max = -0.000 s, min = 9223372036.855 s, total = 0.000 s |
|
ClusterResourceManager.ResetRemoteNodeView - 1 total (1 active), Execution time: mean = 0.000 s, total = 0.000 s, Queueing time: mean = 0.000 s, max = -0.000 s, min = 9223372036.855 s, total = 0.000 s |
|
GcsInMemoryStore.Get - 1 total (0 active), Execution time: mean = 26.830 us, total = 26.830 us, Queueing time: mean = 9.167 us, max = 9.167 us, min = 9.167 us, total = 9.167 us |
|
NodeInfoGcsService.grpc_server.GetClusterId.HandleRequestImpl - 1 total (0 active), Execution time: mean = 4.701 ms, total = 4.701 ms, Queueing time: mean = 1.059 ms, max = 1.059 ms, min = 1.059 ms, total = 1.059 ms |
|
|
|
|
|
[2025-01-15 22:08:28,651 I 4679 4679] (gcs_server) gcs_server.cc:847: task_io_context Event stats: |
|
|
|
|
|
Global stats: 5 total (1 active) |
|
Queueing time: mean = 1.699 ms, max = 8.360 ms, min = 10.087 us, total = 8.497 ms |
|
Execution time: mean = 35.053 us, total = 175.265 us |
|
Event stats: |
|
event_loop_lag_probe - 3 total (0 active), Execution time: mean = 55.485 us, total = 166.455 us, Queueing time: mean = 2.797 ms, max = 8.360 ms, min = 10.087 us, total = 8.390 ms |
|
PeriodicalRunner.RunFnPeriodically - 1 total (0 active), Execution time: mean = 8.810 us, total = 8.810 us, Queueing time: mean = 107.354 us, max = 107.354 us, min = 107.354 us, total = 107.354 us |
|
GcsTaskManager.GcJobSummary - 1 total (1 active), Execution time: mean = 0.000 s, total = 0.000 s, Queueing time: mean = 0.000 s, max = -0.000 s, min = 9223372036.855 s, total = 0.000 s |
|
|
|
|
|
[2025-01-15 22:08:28,651 I 4679 4679] (gcs_server) gcs_server.cc:847: pubsub_io_context Event stats: |
|
|
|
|
|
Global stats: 5 total (1 active) |
|
Queueing time: mean = 1.521 ms, max = 7.455 ms, min = 14.102 us, total = 7.607 ms |
|
Execution time: mean = 93.792 us, total = 468.959 us |
|
Event stats: |
|
event_loop_lag_probe - 3 total (0 active), Execution time: mean = 146.502 us, total = 439.506 us, Queueing time: mean = 2.501 ms, max = 7.455 ms, min = 14.102 us, total = 7.502 ms |
|
PeriodicalRunner.RunFnPeriodically - 1 total (0 active), Execution time: mean = 29.453 us, total = 29.453 us, Queueing time: mean = 104.252 us, max = 104.252 us, min = 104.252 us, total = 104.252 us |
|
Publisher.CheckDeadSubscribers - 1 total (1 active), Execution time: mean = 0.000 s, total = 0.000 s, Queueing time: mean = 0.000 s, max = -0.000 s, min = 9223372036.855 s, total = 0.000 s |
|
|
|
|
|
[2025-01-15 22:08:28,651 I 4679 4679] (gcs_server) gcs_server.cc:847: ray_syncer_io_context Event stats: |
|
|
|
|
|
Global stats: 5 total (0 active) |
|
Queueing time: mean = 1.830 ms, max = 8.984 ms, min = 9.503 us, total = 9.150 ms |
|
Execution time: mean = 54.855 us, total = 274.277 us |
|
Event stats: |
|
event_loop_lag_probe - 3 total (0 active), Execution time: mean = 90.824 us, total = 272.472 us, Queueing time: mean = 3.016 ms, max = 8.984 ms, min = 9.503 us, total = 9.049 ms |
|
RaySyncerRegister - 2 total (0 active), Execution time: mean = 902.500 ns, total = 1.805 us, Queueing time: mean = 50.687 us, max = 51.319 us, min = 50.054 us, total = 101.373 us |
|
|
|
|
|
[2025-01-15 22:08:31,083 I 4679 4679] (gcs_server) gcs_node_manager.cc:85: Registering node info, address = 192.168.0.2, node name = 192.168.0.2 node_id=8121c06173321a984912c08809f74f3e3b338294980e2cc0600da3fb |
|
[2025-01-15 22:08:31,083 I 4679 4679] (gcs_server) gcs_node_manager.cc:91: Finished registering node info, address = 192.168.0.2, node name = 192.168.0.2, is_head_node = 1 node_id=8121c06173321a984912c08809f74f3e3b338294980e2cc0600da3fb |
|
[2025-01-15 22:08:31,084 I 4679 4679] (gcs_server) gcs_placement_group_manager.cc:819: A new node: 8121c06173321a984912c08809f74f3e3b338294980e2cc0600da3fb registered, will try to reschedule all the infeasible placement groups. |
|
[2025-01-15 22:08:31,091 I 4679 4759] (gcs_server) ray_syncer.cc:377: Get connection node_id=8121c06173321a984912c08809f74f3e3b338294980e2cc0600da3fb |
|
[2025-01-15 22:08:32,020 I 4679 4679] (gcs_server) gcs_job_manager.cc:90: Adding job, job id = 01000000, driver pid = 4611 |
|
[2025-01-15 22:08:32,020 I 4679 4679] (gcs_server) gcs_job_manager.cc:111: Finished adding job, job id = 01000000, driver pid = 4611 |
|
[2025-01-15 22:08:38,365 W 4679 4702] (gcs_server) metric_exporter.cc:105: [1] Export metrics to agent failed: RpcError: RPC Error message: failed to connect to all addresses; last error: UNKNOWN: ipv4:127.0.0.1:54666: Failed to connect to remote host: Connection refused; RPC Error details: . This won't affect Ray, but you can lose metrics from the cluster. |
|
[2025-01-15 22:08:38,623 I 4679 4679] (gcs_server) gcs_job_manager.cc:149: Finished marking job state, job id = 01000000 |
|
[2025-01-15 22:08:38,659 I 4679 4679] (gcs_server) gcs_node_manager.cc:366: Removing node, node name = 192.168.0.2, death reason = EXPECTED_TERMINATION, death message = received SIGTERM node_id=8121c06173321a984912c08809f74f3e3b338294980e2cc0600da3fb |
|
[2025-01-15 22:08:38,660 I 4679 4679] (gcs_server) gcs_placement_group_manager.cc:789: Node failed, rescheduling the placement groups on the dead node. node_id=8121c06173321a984912c08809f74f3e3b338294980e2cc0600da3fb |
|
[2025-01-15 22:08:38,660 I 4679 4679] (gcs_server) gcs_actor_manager.cc:1274: Node failed, reconstructing actors. node_id=8121c06173321a984912c08809f74f3e3b338294980e2cc0600da3fb |
|
[2025-01-15 22:08:38,660 I 4679 4679] (gcs_server) gcs_job_manager.cc:454: Node failed, mark all jobs from this node as finished node_id=8121c06173321a984912c08809f74f3e3b338294980e2cc0600da3fb |
|
[2025-01-15 22:08:38,887 I 4679 4728] (gcs_server) ray_syncer-inl.h:318: Failed to read the message from: 8121c06173321a984912c08809f74f3e3b338294980e2cc0600da3fb |
|
[2025-01-15 22:08:38,888 I 4679 4728] (gcs_server) ray_syncer.cc:373: Connection is broken. node_id=8121c06173321a984912c08809f74f3e3b338294980e2cc0600da3fb |
|
[2025-01-15 22:08:38,923 I 4679 4679] (gcs_server) gcs_server_main.cc:130: GCS server received SIGTERM, shutting down... |
|
[2025-01-15 22:08:38,925 I 4679 4679] (gcs_server) gcs_server.cc:267: Stopping GCS server. |
|
[2025-01-15 22:08:39,022 I 4679 4679] (gcs_server) gcs_server.cc:284: GCS server stopped. |
|
[2025-01-15 22:08:39,022 I 4679 4679] (gcs_server) io_service_pool.cc:47: IOServicePool is stopped. |
|
[2025-01-15 22:08:39,066 I 4679 4679] (gcs_server) stats.h:120: Stats module has shutdown. |
|
|