▤
Security questionnaire autofill
ComplianceAuto-answer a prospect's security questionnaire from your own policy library — grounded, with citations and a 'needs human review' flag.
The routing recipe
Job typeRAG / grounded Q&APolicyBalancedConstraintsctx ≥ 128k
routes to
Llama 4 Maverick 17B-128E Instruct nvidia
nvidia:meta/llama-4-maverick-17b-128e-instruct
87.7
match score
93
task fit
100
cost
53.4
speed
Why this model from benchmarks & capabilities
- "RAG / grounded Q&A" leans hardest on instruction; Llama 4 Maverick 17B-128E Instruct scores 94/100 there — top-2 of 32 chat models.
- Strength on this class of work shows up on MMLU-Pro, DROP, Natural Questions, where Meta models are competitive.
- Holds the 32k+ context this job needs (this model: 1M).
- Cost: runs on NVIDIA free credits / self-hostable NIM — effectively $0 marginal. Speed: ~45 tok/s (estimated).
- Chosen over Qwen3 Coder 480B-A35B Instruct for higher task-fit (93 vs 88.2) & more context.
| Capability the job needs | Weight | Llama 4 Maverick 17B-128E Instruct | Catalog rank |
|---|---|---|---|
| instruction | 0.90 | #2 of 32 | |
| long_context | 0.90 | #5 of 32 | |
| knowledge | 0.80 | #6 of 32 |
Relevant benchmarksMMLU-Pro DROP Natural Questions RULER (long-context)
Why it qualifiesHolds the 32k+ context this job needs (this model: 1M).
EconomicsCost: runs on NVIDIA free credits / self-hostable NIM — effectively $0 marginal.
Speed: ~45 tok/s (estimated). vs runner-upChosen over Qwen3 Coder 480B-A35B Instruct for higher task-fit (93 vs 88.2) & more context. (Qwen3 Coder 480B-A35B Instruct, score 86.8)
Speed: ~45 tok/s (estimated). vs runner-upChosen over Qwen3 Coder 480B-A35B Instruct for higher task-fit (93 vs 88.2) & more context. (Qwen3 Coder 480B-A35B Instruct, score 86.8)
Example result
Sample input ▾
Question: 'Do you encrypt customer data in transit and at rest?' Context: 'Policy ENC-2: All data encrypted at rest with AES-256 (KMS). TLS 1.2+ enforced on all public endpoints.'
Answer: Yes. Customer data is encrypted at rest with AES-256 (managed via KMS) and in transit via TLS 1.2+ on all public endpoints.
Source: Policy ENC-2.
needs_review: false
Test it on your own data