Stop guessing which model to use.
LLM Switchboard catalogs every open model on Groq and NVIDIA (build.nvidia.com), plus 303+ models you can run on your own hardware. It scores them on the dimensions that matter for your job and routes each request to the best fit. Cloud or local. One API. No lock-in.
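The score-then-route idea can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not LLM Switchboard's actual implementation: the `Model` record, the `route` function, the dimension names (`speed`, `quality`), the weighted-sum scoring, and the model names and scores are all hypothetical placeholders.

```python
from dataclasses import dataclass


@dataclass
class Model:
    """Hypothetical catalog entry; not the product's real schema."""
    name: str
    provider: str              # e.g. "groq", "nvidia", or "local"
    scores: dict[str, float]   # dimension name -> score in [0, 1]


def route(models, weights, allowed_providers=None):
    """Return the model with the highest weighted score.

    `weights` maps dimension name -> importance for the current job.
    `allowed_providers`, if given, restricts the candidates (for example
    {"local"} to keep requests on your own hardware).
    """
    candidates = [
        m for m in models
        if allowed_providers is None or m.provider in allowed_providers
    ]
    if not candidates:
        raise ValueError("no candidate models match the provider filter")

    def fit(model):
        # Dimensions a model has no score for count as 0.
        return sum(w * model.scores.get(dim, 0.0) for dim, w in weights.items())

    return max(candidates, key=fit)


# Made-up catalog entries and scores, for illustration only.
CATALOG = [
    Model("fast-cloud-model", "groq", {"speed": 0.9, "quality": 0.5}),
    Model("strong-cloud-model", "nvidia", {"speed": 0.4, "quality": 0.9}),
    Model("local-model", "local", {"speed": 0.3, "quality": 0.6}),
]

latency_sensitive = route(CATALOG, {"speed": 0.8, "quality": 0.2})
quality_sensitive = route(CATALOG, {"speed": 0.2, "quality": 0.8})
local_only = route(CATALOG, {"speed": 0.5, "quality": 0.5}, {"local"})
```

With these made-up numbers, a latency-weighted job picks `fast-cloud-model`, a quality-weighted job picks `strong-cloud-model`, and restricting providers to `local` picks `local-model`. A weighted sum is just one simple way to combine dimensions; a real router may use something quite different.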
⚡ 45 cloud models (18 Groq · 27 NVIDIA) · ⬇ 303 run locally (< 25GB)
45 cloud models: 18 Groq · 27 NVIDIA
303 local models < 25GB: run on your own hardware
1.0k t/s: fastest (cloud), GPT-OSS 20B
15 benchmarks graded by trustworthiness (BQS)
⬇ Run it on your own hardware: no API key, $0
Browse 303 models →

What's inside
⚡ Smart router
◑ Business use cases
▤ Benchmark intelligence
◢ Groq + NVIDIA catalogs
⬇ Run locally
⬡ Agent harnesses & skills
◇ Vector databases

Talk to the SDR (live demo)
This is use case #3, embedded
See all business recipes →