if server_printed_ready_message and last_queue_req == 0 and time.time() - last_semaphore_release > 30 and semaphore.locked():
When starting up vllm_server, why do we need the 30 s timeout? If `server_printed_ready_message` is `True`, can we release the semaphore directly?
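
To make the question concrete, here is how I read the condition, as a minimal sketch rather than the actual implementation. The function name, its parameter names, and `STALL_TIMEOUT_S` are hypothetical. I am also assuming that `last_queue_req` is a count of queued requests and that `last_semaphore_release` is a `time.time()` timestamp; neither is confirmed by the diff.

```python
import time

# Hypothetical name for the 30 s threshold in the condition under review.
STALL_TIMEOUT_S = 30


def should_force_release(server_ready, last_queue_req, last_semaphore_release,
                         semaphore_locked, now=None):
    """Return True when every term of the reviewed condition holds.

    Assumptions (not confirmed by the source):
    - last_queue_req is the number of queued requests last observed
    - last_semaphore_release is a time.time() timestamp
    - semaphore_locked is the result of semaphore.locked()
    """
    now = time.time() if now is None else now
    stalled = (now - last_semaphore_release) > STALL_TIMEOUT_S
    return bool(server_ready and last_queue_req == 0 and stalled and semaphore_locked)


# Server is ready and idle, but only 10 s have passed since the last release.
print(should_force_release(True, 0, 0.0, True, now=10.0))
# Same state after 31 s.
print(should_force_release(True, 0, 0.0, True, now=31.0))
```

With this reading, the `stalled` term is the only thing that separates "ready" from "ready and released", which is the part the question is about.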