Overview

The open-model control room for SMB & mid-market teams.

catalog updated 31m ago

Stop guessing which model to use.

LLM Switchboard catalogs every open model on Groq and NVIDIA build.nvidia.com — plus 303+ models you can run on your own hardware — scores them on the dimensions that matter for your job, and routes each request to the best fit. Cloud or local. One API. No lock-in.

⚡ 45 cloud models Groq · 18 NVIDIA · 27 ⬇ 303 run locally < 25GB
45
Cloud models
18 Groq · 27 NVIDIA
303
Local models < 25GB
run on your own hardware
1.0k t/s
Fastest (cloud)
GPT-OSS 20B
15
Benchmarks graded
by trustworthiness (BQS)
⬇ Run it on your own hardware no API key, $0
Browse 303 models →

303 open models under 25 GB with one-command Ollama/Docker setup, picked by benchmark — and a built-in sandbox to test them in-browser. reasoning 80 · coding 35 · vision 46 · STT 45 · TTS 38 · embeddings 59

What's inside
Talk to the SDR live demo
✸ icompaas SDR · routed to fastest Groq model
This is use case #3, embedded

The homepage SDR chat is just a LLM Switchboard recipe: job type chat, policy fastest that passes, pinned to Groq for sub-second replies. Drop a key in .env and it answers live; without one it shows which model it would call.

See all business recipes →
Sign in to continue

LLM Switchboard is private — sign in with Authly to access the control room.

Sign in with Authly
← Back to home