inferctl
kubectl for local LLMs
A local inference control plane. Inspect, route across, and coordinate local backends — Ollama, llama.cpp, LM Studio, MLX — from one command. It explains your stack; it doesn't run inference.
Try inferctl doctor.
Launching soon.