LLM inference server with continuous batching & SSD caching for Apple Silicon โ managed from the macOS menu bar