inferctl

kubectl for local LLMs

A local inference control plane. Inspect, route across, and coordinate local backends — Ollama, llama.cpp, LM Studio, MLX — from one command. It explains your stack; it doesn't run inference.
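As a rough illustration of what "inspecting" local backends could involve, here is a minimal Python sketch that checks whether anything is listening on the ports these servers commonly use by default. It is not inferctl's implementation, and nothing in it comes from inferctl. The port table, `BackendStatus`, `probe`, and `probe_all` are hypothetical names for this example. The ports are assumptions about typical defaults and may differ on your machine; MLX is left out because no default is assumed for it here.

```python
import socket
from dataclasses import dataclass

# Assumed default ports for illustration only; adjust to your own setup.
DEFAULT_BACKENDS = {
    "ollama": 11434,
    "lm-studio": 1234,
    "llama.cpp": 8080,
}


@dataclass
class BackendStatus:
    name: str
    port: int
    reachable: bool


def probe(name, port, host="127.0.0.1", timeout=0.5):
    """Report whether a TCP connection to host:port can be opened."""
    try:
        # Open and immediately close a connection; no inference request is sent.
        with socket.create_connection((host, port), timeout=timeout):
            return BackendStatus(name, port, True)
    except OSError:
        # Covers refused connections, timeouts, and unreachable hosts.
        return BackendStatus(name, port, False)


def probe_all(backends=None, host="127.0.0.1", timeout=0.5):
    """Probe every backend in the mapping and return one status per entry."""
    backends = DEFAULT_BACKENDS if backends is None else backends
    return [probe(name, port, host, timeout) for name, port in backends.items()]


if __name__ == "__main__":
    for status in probe_all():
        state = "up" if status.reachable else "down"
        print(f"{status.name:<10} :{status.port:<6} {state}")
```

A reachable port only shows that something is listening there, not which server it is or which models are loaded. A real inspection tool would presumably go further and query each backend's own API.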

Try inferctl doctor.

Launching soon.