NadirClaw Documentation
Open-source LLM router that classifies prompt complexity and routes to the optimal model. Cut AI API costs by 40–70% with zero code changes.
Installation
pip (recommended)
pip install nadirclaw
One-line install script
curl -fsSL https://raw.githubusercontent.com/doramirdor/NadirClaw/main/install.sh | sh
This clones to ~/.nadirclaw, creates a venv, installs deps, and adds nadirclaw to your PATH. Run it again to update.
From source
git clone https://github.com/doramirdor/NadirClaw.git
cd NadirClaw
python3 -m venv venv
source venv/bin/activate
pip install -e .
Docker
git clone https://github.com/doramirdor/NadirClaw.git && cd NadirClaw
docker compose up
Uninstall
rm -rf ~/.nadirclaw
sudo rm -f /usr/local/bin/nadirclaw
Quick Start
# Install
pip install nadirclaw
# Interactive setup (providers, API keys, models)
nadirclaw setup
# Start the router
nadirclaw serve --verbose
NadirClaw starts on http://localhost:8856 with sensible defaults (Gemini Flash for simple, Codex for complex). If you skip nadirclaw setup, the serve command will offer to run it on first launch.
First Run
On first request, NadirClaw downloads the all-MiniLM-L6-v2 sentence embedding model (~80 MB). This takes 2–3 seconds. Subsequent requests classify in ~10ms.
Prerequisites: Python 3.10+ and at least one LLM provider — a Gemini API key (free tier available), Ollama running locally (free), or any cloud provider API key.
Environment Variables
NadirClaw loads config from ~/.nadirclaw/.env. If that doesn't exist, it falls back to .env in the current directory.
| Variable | Default | Description |
|---|---|---|
| NADIRCLAW_SIMPLE_MODEL | gemini-3-flash-preview | Model for simple prompts |
| NADIRCLAW_COMPLEX_MODEL | openai-codex/gpt-5.3-codex | Model for complex prompts |
| NADIRCLAW_REASONING_MODEL | falls back to complex | Model for reasoning tasks |
| NADIRCLAW_FREE_MODEL | falls back to simple | Free/local fallback model |
| NADIRCLAW_FALLBACK_CHAIN | all tier models | Comma-separated cascade on failure |
| NADIRCLAW_CONFIDENCE_THRESHOLD | 0.06 | Classification threshold (lower = more complex) |
| NADIRCLAW_PORT | 8856 | Server port |
| NADIRCLAW_AUTH_TOKEN | empty (auth disabled) | Bearer token requirement |
| NADIRCLAW_LOG_DIR | ~/.nadirclaw/logs | Log directory |
| NADIRCLAW_LOG_RAW | false | Log full raw requests/responses |
| NADIRCLAW_DAILY_BUDGET | none | Daily spend limit in USD |
| NADIRCLAW_MONTHLY_BUDGET | none | Monthly spend limit in USD |
| NADIRCLAW_BUDGET_WARN_THRESHOLD | 0.8 | Alert at this fraction of budget |
| NADIRCLAW_BUDGET_WEBHOOK_URL | none | Webhook for budget alerts |
| NADIRCLAW_BUDGET_STDOUT_ALERTS | false | Print budget alerts to stdout |
| NADIRCLAW_CACHE_TTL | default | Cache time-to-live in seconds |
| NADIRCLAW_CACHE_MAX_SIZE | default | Max cache entries |
| GEMINI_API_KEY | — | Google Gemini API key |
| ANTHROPIC_API_KEY | — | Anthropic API key |
| OPENAI_API_KEY | — | OpenAI API key |
| OLLAMA_API_BASE | http://localhost:11434 | Ollama base URL |
| OTEL_EXPORTER_OTLP_ENDPOINT | empty | OpenTelemetry collector endpoint |
Config File
The primary configuration lives in ~/.nadirclaw/.env. Example:
# ~/.nadirclaw/.env
# API keys
GEMINI_API_KEY=AIza...
OPENAI_API_KEY=sk-...
ANTHROPIC_API_KEY=sk-ant-...
# Model routing
NADIRCLAW_SIMPLE_MODEL=gemini-3-flash-preview
NADIRCLAW_COMPLEX_MODEL=gemini-2.5-pro
# Server
NADIRCLAW_PORT=8856
# Budget
NADIRCLAW_DAILY_BUDGET=10.00
NADIRCLAW_MONTHLY_BUDGET=200.00
Credentials are stored separately in ~/.nadirclaw/credentials.json (managed via nadirclaw auth). Logs go to ~/.nadirclaw/logs/.
Model Setup
Configure which model handles each routing tier:
| Setup | Simple Model | Complex Model | Keys Needed |
|---|---|---|---|
| Gemini + Gemini | gemini-2.5-flash | gemini-2.5-pro | GEMINI_API_KEY |
| Gemini + Claude | gemini-2.5-flash | claude-sonnet-4-5-20250929 | GEMINI_API_KEY + ANTHROPIC_API_KEY |
| Claude + Claude | claude-haiku-4-5-20251001 | claude-sonnet-4-5-20250929 | ANTHROPIC_API_KEY |
| OpenAI + OpenAI | gpt-4.1-mini | gpt-4.1 | OPENAI_API_KEY |
| Fully local | ollama/llama3.1:8b | ollama/qwen3:32b | None |
Gemini models are called natively via the Google GenAI SDK. All other models go through LiteLLM (100+ providers).
Model Aliases
Use short names instead of full model IDs:
| Alias | Resolves To |
|---|---|
| sonnet | claude-sonnet-4-5-20250929 |
| opus | claude-opus-4-6-20250918 |
| haiku | claude-haiku-4-5-20251001 |
| gpt4 | gpt-4.1 |
| flash | gemini-2.5-flash |
| deepseek | deepseek/deepseek-chat |
| llama | ollama/llama3.1:8b |
Authentication
NadirClaw checks credentials in order: OpenClaw stored token → NadirClaw credential → environment variable.
# Add API keys
nadirclaw auth add --provider google --key AIza...
nadirclaw auth add --provider anthropic --key sk-ant-...
nadirclaw auth add --provider openai --key sk-...
# OAuth login (no API key needed)
nadirclaw auth openai login
nadirclaw auth anthropic login
nadirclaw auth gemini login
# Store Claude subscription token
nadirclaw auth setup-token
# Check status
nadirclaw auth status
# Remove
nadirclaw auth remove google
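The precedence order above amounts to a first-non-empty lookup. The helper below is an illustrative sketch of that order only; the function name and arguments are hypothetical, not NadirClaw's actual API:

```python
def resolve_credential(openclaw_token, nadirclaw_credential, env_value):
    """Return the first available credential, in NadirClaw's documented order:
    OpenClaw stored token -> NadirClaw credential -> environment variable."""
    for source in (openclaw_token, nadirclaw_credential, env_value):
        if source:
            return source
    return None
```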
Routing
How Classification Works
NadirClaw uses a binary complexity classifier based on sentence embeddings:
- Pre-computed centroids — two tiny vectors (~1.5 KB each) derived from ~170 seed prompts, shipped with the package.
- Classification — computes the prompt's embedding via all-MiniLM-L6-v2 and measures cosine similarity to both centroids. Closer to the complex centroid → complex model.
- Borderline handling — when confidence is below threshold (default 0.06), defaults to complex. It's cheaper to over-serve than under-serve.
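The centroid comparison can be sketched in a few lines of Python. Note the centroid vectors below are tiny made-up stand-ins for the ~1.5 KB vectors NadirClaw ships, and the embedding is passed in directly rather than computed with all-MiniLM-L6-v2:

```python
import numpy as np

# Hypothetical stand-ins for the shipped centroids (real ones are 384-dim).
SIMPLE_CENTROID = np.array([0.9, 0.1, 0.0])
COMPLEX_CENTROID = np.array([0.1, 0.9, 0.2])
THRESHOLD = 0.06  # NADIRCLAW_CONFIDENCE_THRESHOLD

def cosine(a, b):
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

def classify(embedding, threshold=THRESHOLD):
    sim_simple = cosine(embedding, SIMPLE_CENTROID)
    sim_complex = cosine(embedding, COMPLEX_CENTROID)
    confidence = abs(sim_simple - sim_complex)
    if confidence < threshold:
        return "complex"  # borderline: cheaper to over-serve than under-serve
    return "simple" if sim_simple > sim_complex else "complex"
```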
Routing Tiers
| Tier | When Used | Typical Prompts |
|---|---|---|
| Simple | Prompt closer to simple centroid with confidence above threshold | "What does this function do?", "Format this JSON", "Add a docstring" |
| Complex | Prompt closer to complex centroid, or borderline | "Refactor this module", "Design a caching layer", "Debug this deadlock" |
| Reasoning | 2+ reasoning markers detected ("step by step", "prove that", "analyze tradeoffs") | Mathematical proofs, architecture analysis, critical evaluations |
| Agentic | Tool definitions, tool-role messages, agent system prompts, deep conversations (>10 messages) | Any multi-step agent workflow, coding agent sessions |
Routing Modifiers
After base classification, these overrides apply in order:
- Agentic detection — forces complex when tool definitions, tool-role messages, or agent system prompts are detected
- Reasoning detection — routes to reasoning model when 2+ reasoning markers found
- Context window check — swaps to a model with larger context if conversation exceeds model's limit
- Session persistence — reuses the same model for follow-up messages (30-minute TTL)
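The agentic and reasoning checks above can be approximated as simple predicates over the request. The marker list and message shapes here are illustrative, not NadirClaw's actual internals:

```python
# Illustrative subset of reasoning markers
REASONING_MARKERS = ["step by step", "prove that", "analyze tradeoffs"]

def is_reasoning(prompt: str) -> bool:
    """Route to the reasoning model when 2+ markers appear."""
    text = prompt.lower()
    return sum(marker in text for marker in REASONING_MARKERS) >= 2

def is_agentic(request: dict) -> bool:
    """Tool definitions, tool-role messages, or deep conversations force complex."""
    messages = request.get("messages", [])
    return (
        bool(request.get("tools"))
        or any(m.get("role") == "tool" for m in messages)
        or len(messages) > 10
    )
```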
Confidence Threshold
The NADIRCLAW_CONFIDENCE_THRESHOLD (default 0.06) controls borderline routing. Lower values route more prompts to complex. Adjust based on your quality tolerance:
# More conservative (routes more to complex)
NADIRCLAW_CONFIDENCE_THRESHOLD=0.03
# More aggressive savings (routes more to simple)
NADIRCLAW_CONFIDENCE_THRESHOLD=0.10
Routing Profiles
Override routing strategy per-request via the model field:
| Profile | Model Field | Behavior |
|---|---|---|
| auto | auto or omit | Smart routing (default) |
| eco | eco | Always use simple model |
| premium | premium | Always use complex model |
| free | free | Use free/local fallback model |
| reasoning | reasoning | Use reasoning model |
# Use eco mode for maximum savings
curl http://localhost:8856/v1/chat/completions \
-H "Content-Type: application/json" \
-d '{"model": "eco", "messages": [{"role": "user", "content": "Hello"}]}'
Fallback Chains
When a model fails (429 rate limit, 5xx error, or timeout), NadirClaw cascades through a configurable chain of fallback models until one succeeds.
# Configure fallback order
NADIRCLAW_FALLBACK_CHAIN=gpt-4.1,claude-sonnet-4-5-20250929,gemini-2.5-flash
Default behavior: If no fallback chain is configured, NadirClaw uses all your configured tier models. For example, if the simple model hits a 429, it retries once, then tries the complex model.
Rate limit handling: On 429 errors, NadirClaw automatically retries once before moving to the next model in the chain. If all models are exhausted, it returns a friendly error message.
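The cascade described above could look roughly like the sketch below; call_model and the error class are placeholders standing in for the real provider calls and error types:

```python
class RateLimitError(Exception):
    """Placeholder for a provider 429 error."""

def cascade(chain, call_model, retries_on_429=1):
    """Try each model in order; retry once on a rate limit, then move on."""
    errors = []
    for model in chain:
        for _ in range(1 + retries_on_429):
            try:
                return call_model(model)
            except RateLimitError as e:
                errors.append((model, e))  # 429: retry same model once
            except Exception as e:
                errors.append((model, e))
                break  # 5xx/timeout: no retry, go to next model
    raise RuntimeError(f"All models exhausted: {errors}")
```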
Prompt Caching
NadirClaw includes an in-memory LRU cache for identical chat completions, skipping redundant LLM calls entirely.
# Configure cache
NADIRCLAW_CACHE_TTL=300 # TTL in seconds (default varies)
NADIRCLAW_CACHE_MAX_SIZE=1000 # Max cached entries
Monitor cache:
# CLI
nadirclaw cache
# API endpoint
curl http://localhost:8856/v1/cache
Cache is keyed on the full message content. Streaming requests with identical content will also hit the cache. Only exact matches count — no fuzzy matching.
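A minimal version of such a cache, keyed on the exact message content, might look like this (assumed behavior only, not NadirClaw's implementation):

```python
import hashlib
import json
import time
from collections import OrderedDict

class ResponseCache:
    """LRU cache with TTL, keyed on the full message list (exact match only)."""

    def __init__(self, ttl=300, max_size=1000):
        self.ttl = ttl
        self.max_size = max_size
        self._store = OrderedDict()  # key -> (expires_at, response)

    def _key(self, messages):
        # Canonical JSON so identical content always hashes the same
        payload = json.dumps(messages, sort_keys=True).encode()
        return hashlib.sha256(payload).hexdigest()

    def get(self, messages):
        key = self._key(messages)
        entry = self._store.get(key)
        if entry is None or entry[0] < time.monotonic():
            self._store.pop(key, None)  # drop expired entry
            return None
        self._store.move_to_end(key)  # mark as recently used
        return entry[1]

    def put(self, messages, response):
        key = self._key(messages)
        self._store[key] = (time.monotonic() + self.ttl, response)
        self._store.move_to_end(key)
        if len(self._store) > self.max_size:
            self._store.popitem(last=False)  # evict least recently used
```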
Claude Code
NadirClaw works as a drop-in proxy for Claude Code:
# Point Claude Code at NadirClaw
export ANTHROPIC_BASE_URL=http://localhost:8856/v1
export ANTHROPIC_API_KEY=local
# Start NadirClaw, then use Claude Code normally
nadirclaw serve --verbose
claude
Or use a shell alias:
alias claude-routed='ANTHROPIC_BASE_URL=http://localhost:8856/v1 ANTHROPIC_API_KEY=local claude'
Simple prompts ("read this file", "what does this function do?") route to a cheap model like Gemini Flash. Complex prompts ("refactor this module") stay on Claude. Typical savings: 40–70%.
Using your Claude subscription
# OAuth login (opens browser)
nadirclaw auth anthropic login
# Or store token directly
nadirclaw auth setup-token
OpenClaw
# Auto-configure OpenClaw to use NadirClaw
nadirclaw openclaw onboard
# Start the router
nadirclaw serve
This writes NadirClaw as a provider in ~/.openclaw/openclaw.json with model nadirclaw/auto. OpenClaw auto-reloads — no restart needed.
The generated config:
{
"models": {
"providers": {
"nadirclaw": {
"baseUrl": "http://localhost:8856/v1",
"apiKey": "local",
"api": "openai-completions",
"models": [{ "id": "auto", "name": "auto" }]
}
}
},
"agents": {
"defaults": {
"model": { "primary": "nadirclaw/auto" }
}
}
}
Codex
# Auto-configure Codex
nadirclaw codex onboard
# Start the router
nadirclaw serve
This writes ~/.codex/config.toml:
model_provider = "nadirclaw"
[model_providers.nadirclaw]
base_url = "http://localhost:8856/v1"
api_key = "local"
OpenAI OAuth
# Use ChatGPT subscription instead of API key
nadirclaw auth openai login
Any OpenAI-Compatible Client
NadirClaw exposes a standard OpenAI-compatible API. Point any tool at it:
# Base URL: http://localhost:8856/v1
# Model: "auto" (or omit)
# API Key: "local" (or anything — auth disabled by default)
Python (openai SDK)
from openai import OpenAI
client = OpenAI(
base_url="http://localhost:8856/v1",
api_key="local",
)
response = client.chat.completions.create(
model="auto",
messages=[{"role": "user", "content": "What is 2+2?"}],
)
print(response.choices[0].message.content)
curl
curl http://localhost:8856/v1/chat/completions \
-H "Content-Type: application/json" \
-d '{
"messages": [{"role": "user", "content": "What is 2+2?"}],
"stream": true
}'
Works with Continue, aider, Cursor, or any tool that speaks the OpenAI chat completions API. Just set the base URL to http://localhost:8856/v1.
CLI Reference
nadirclaw serve
Start the router server.
nadirclaw serve [OPTIONS]
Options:
--port INTEGER Port (default: 8856)
--simple-model TEXT Model for simple prompts
--complex-model TEXT Model for complex prompts
--token TEXT Auth token
--verbose Debug logging
--log-raw Log full raw requests/responses to JSONL
nadirclaw setup
Interactive setup wizard — guides you through providers, API keys, and model selection.
nadirclaw setup
nadirclaw classify
Classify a prompt locally without running the server:
$ nadirclaw classify "What is 2+2?"
Tier: simple
Confidence: 0.2848
Score: 0.0000
Model: gemini-3-flash-preview
$ nadirclaw classify "Design a distributed system for real-time trading"
Tier: complex
Confidence: 0.1843
Score: 1.0000
Model: gemini-2.5-pro
nadirclaw report
Analyze request logs:
nadirclaw report # full report
nadirclaw report --since 24h # last 24 hours
nadirclaw report --since 7d # last 7 days
nadirclaw report --model gemini # filter by model
nadirclaw report --format json # machine-readable JSON
nadirclaw report --export report.txt # save to file
nadirclaw savings
Show cost savings with monthly projections:
nadirclaw savings
nadirclaw savings --since 7d
nadirclaw dashboard
Live terminal dashboard with real-time stats. Also available as web UI at http://localhost:8856/dashboard.
pip install nadirclaw[dashboard]
nadirclaw dashboard
nadirclaw status
Show current config, credentials, and server status:
$ nadirclaw status
NadirClaw Status
----------------------------------------
Simple model: gemini-3-flash-preview
Complex model: gemini-2.5-pro
Port: 8856
Threshold: 0.06
Server: RUNNING (ok)
Other Commands
nadirclaw auth add/status/remove # Manage credentials
nadirclaw auth openai login # OAuth login
nadirclaw codex onboard # Configure Codex integration
nadirclaw openclaw onboard # Configure OpenClaw integration
nadirclaw ollama discover # Auto-discover Ollama instances
nadirclaw cache # View cache stats
nadirclaw build-centroids # Regenerate centroid vectors
Budget & Cost Tracking
NadirClaw tracks per-request costs in real time and supports budget limits with alerts.
Setting Budgets
# In ~/.nadirclaw/.env
NADIRCLAW_DAILY_BUDGET=10.00
NADIRCLAW_MONTHLY_BUDGET=200.00
NADIRCLAW_BUDGET_WARN_THRESHOLD=0.8 # Alert at 80% of budget
Alerts
When spend crosses the warning threshold, NadirClaw can:
- Webhook: POST a JSON payload to NADIRCLAW_BUDGET_WEBHOOK_URL
- Stdout: Print alerts if NADIRCLAW_BUDGET_STDOUT_ALERTS=true
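The warn-threshold logic amounts to a fraction check per recorded spend. The function below is a hedged sketch of that check, not NadirClaw's accounting code:

```python
def check_budget(spent, budget, warn_threshold=0.8):
    """Return 'ok', 'warn', or 'exceeded' for the current spend in USD."""
    if budget is None:
        return "ok"  # no budget configured
    if spent >= budget:
        return "exceeded"
    if spent >= budget * warn_threshold:
        return "warn"  # this is where the webhook / stdout alert would fire
    return "ok"
```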
Reporting
# See savings
nadirclaw savings
# Detailed report with cost breakdown
nadirclaw report --since 7d
# Live monitoring
nadirclaw dashboard
Reports include: total requests, tier distribution, per-model usage and tokens, latency percentiles (p50/p95), fallback counts, and error rates.
Docker
NadirClaw + Ollama (fully local, zero cost)
git clone https://github.com/doramirdor/NadirClaw.git && cd NadirClaw
docker compose up
This starts Ollama and NadirClaw on port 8856. Pull a model:
docker compose exec ollama ollama pull llama3.1:8b
With cloud providers
Create a .env file with API keys and model config (see .env.example), then restart:
# .env
GEMINI_API_KEY=AIza...
NADIRCLAW_SIMPLE_MODEL=gemini-3-flash-preview
NADIRCLAW_COMPLEX_MODEL=gemini-2.5-pro
Standalone (no Ollama)
docker build -t nadirclaw .
docker run -p 8856:8856 --env-file .env nadirclaw
API Reference
Auth is disabled by default (local-only). Set NADIRCLAW_AUTH_TOKEN to require a bearer token.
| Endpoint | Method | Description |
|---|---|---|
| /v1/chat/completions | POST | OpenAI-compatible completions with auto routing (supports stream: true) |
| /v1/classify | POST | Classify a prompt without calling an LLM |
| /v1/classify/batch | POST | Classify multiple prompts at once |
| /v1/models | GET | List available models |
| /v1/logs | GET | View recent request logs |
| /v1/cache | GET | Cache stats |
| /health | GET | Health check (no auth) |
| /dashboard | GET | Web dashboard UI |
Chat Completions Request
POST /v1/chat/completions
{
"model": "auto", // or "eco", "premium", "free", "reasoning", or a model alias
"messages": [
{"role": "system", "content": "You are a helpful assistant."},
{"role": "user", "content": "What is 2+2?"}
],
"stream": true, // optional, SSE streaming
"temperature": 0.7, // optional, passed through to provider
"tools": [...] // optional, triggers agentic detection
}
Classify Request
POST /v1/classify
{
"messages": [
{"role": "user", "content": "What is 2+2?"}
]
}
// Response:
{
"tier": "simple",
"confidence": 0.2848,
"score": 0.0,
"model": "gemini-3-flash-preview"
}
Troubleshooting
First request is slow (2–3 seconds)
Normal — NadirClaw downloads the sentence embedding model (~80 MB) on first use. Subsequent requests classify in ~10ms.
Port 8856 already in use
# Use a different port
nadirclaw serve --port 9000
# Or set in env
NADIRCLAW_PORT=9000
Model returning errors
- Check nadirclaw auth status to verify credentials
- Run with --verbose for detailed error messages
- Ensure your API key has access to the configured models
Ollama not found
# Auto-discover Ollama instances
nadirclaw ollama discover
# Or set manually
OLLAMA_API_BASE=http://192.168.1.100:11434 nadirclaw serve
Too many prompts routed to complex
Raise the confidence threshold to route more to simple:
NADIRCLAW_CONFIDENCE_THRESHOLD=0.10
Too many prompts routed to simple (quality issues)
Lower the confidence threshold:
NADIRCLAW_CONFIDENCE_THRESHOLD=0.03
Streaming not working
NadirClaw supports full SSE streaming. Ensure your request includes "stream": true and your client handles SSE format. Check that you're not behind a reverse proxy that buffers responses.
Rate limits (429 errors)
NadirClaw handles these automatically — retries once, then falls through the fallback chain. If all models are exhausted, configure additional fallbacks:
NADIRCLAW_FALLBACK_CHAIN=gpt-4.1,claude-sonnet-4-5-20250929,gemini-2.5-flash
Viewing logs
# Request logs
ls ~/.nadirclaw/logs/
# Full raw logging (for debugging)
nadirclaw serve --log-raw
# Analyze logs
nadirclaw report --since 24h