Anthropic filed for an IPO on June 1. The numbers tell the story. On June 1, 2026, Anthropic confidentially filed an S-1 with the SEC. The company raised $65 billion in its Series H at a $965 billion post-money valuation, eclipsing OpenAI for the first time. Analysts expect the listing in October 2026, likely above $1 trillion. Source: Fortune, "Anthropic confidentially files for IPO after raising $65 billion in a funding round at a $965 billion valuation," June 2026 Source: TechCrunch, "Anthropic files to go public," June 2026 The revenue trajectory behind the valuation: $9 billion annualized at the end of 2025. $14 billion in February 2026. $19 billion in March. $30 billion in April. $47 billion in May. Anthropic told investors the run rate will exceed $50 billion by the end of July. That is roughly 80x growth in annualized revenue over two years. Source: CNBC, "Anthropic tops OpenAI as most valuable AI startup, nears $1 trillion valuation in latest round," May 2026 Source: VentureBeat, "Anthropic says it hit a $30 billion revenue run rate after 'crazy' 80x growth," 2026 Eighty percent of that revenue comes from enterprise API customers. Over 1,000 companies now spend more than $1 million per year on Claude, double the 500 reported in April. Eight of the Fortune 10 are customers. Source: Sacra, "Anthropic revenue, valuation & funding" The revenue is real. The growth is extraordinary. And the source is your token bill. The revenue model is per-token billing at frontier prices. Anthropic's pricing as of June 2026: | Model | Input ($/M tokens) | Output ($/M tokens) | |---|---:|---:| | Claude Opus 4.8 | $5.00 | $25.00 | | Claude Opus 4.8 Fast | $10.00 | $50.00 | | Claude Sonnet 4.6 | $3.00 | $15.00 | | Claude Haiku 4.5 | $1.00 | $5.00 | Source: Anthropic, "Pricing" Opus 4.8 launched May 28 with better agentic coding scores (64.3% to 69.2% on SWE-bench) and a 3x cheaper fast mode ($10/$50, down from $30/$150 for Opus 4.7). The standard rate stayed the same: $5/$25 per million tokens. Source: Neowin, "Anthropic launches Claude Opus 4.8 with better coding and lower fast mode pricing," May 2026 Source: VentureBeat, "Anthropic's Claude Opus 4.8 is here with 3X cheaper fast mode and near-Mythos level alignment," May 2026 The output price spread between Haiku and Opus is 5x. Between Haiku and Opus Fast, it is 10x. Every API call that hits Opus when Haiku would suffice contributes 5x more to that $47 billion revenue number than it needs to. This is not a criticism of Anthropic's pricing. Frontier models cost more to build and run. The price reflects the capability. The problem is that most production API calls do not need frontier capability. JPMorgan says the token bill is eating profits. In May 2026, JPMorgan published a research note titled "AI Token Costs are Eating Internet Profits Alive." The analysts identified several public companies, including Shopify, Spotify, ServiceNow, and Roku, where AI inference costs were surging as a share of operating expenditures. Source: Investing.com, "The AI Token Pricing Crisis Behind OpenAI and Anthropic's Revenue Race," 2026 The pattern is consistent across industries. Enterprise AI budgets were set based on 2024 token rates and chat-era usage patterns. Agentic workloads, coding agents, and production inference at 2026 adoption levels consume multiples of what the spreadsheets projected. The data points keep arriving: Uber burned its entire 2026 AI budget in four months after Claude Code adoption jumped from 32% to 84%. Per-engineer costs ran $500 to $2,000 per month. An unnamed enterprise ran up a $500 million Anthropic bill in a single month with no usage limits. Microsoft cancelled Claude Code licenses in its Experiences and Devices division by June 30 after per-developer costs exceeded forecasts. Meta built an internal leaderboard called Claudeonomics to track token spend across 85,000 employees. Source: Fortune, "Microsoft reports are exposing AI's real cost problem: Using the tech is more expensive than paying human employees," May 2026 Derek Thompson called it "The Great AI Cost Panic of 2026." The AI boom has entered its "wait, is this worth it?" phase. The answer is yes, it is worth it, but not at frontier prices for every call. Source: Derek Thompson, "The AI Boom Has Entered Its 'Wait, Is This Worth It?' Phase," 2026 The problem is not Claude. It is uniform model selection. Anthropic built a great model. Opus 4.8 is arguably the best coding model available. The 69.2% agentic coding score, the improved self-review capability, the dynamic workflow features, these represent genuine advances that justify the price for tasks that need them. The problem is that most production API calls do not need them. Datadog measured production telemetry across thousands of companies and found that 69% of all input tokens are system prompts, tool schemas, and policy definitions that repeat verbatim on every call. The actual user query is 31% of the token volume. Most of those queries are classification, formatting, summarization, simple Q&A, file reads, and status checks. Source: Datadog, "State of AI Engineering 2026" The AICC's analysis of 2.4 billion API calls found that organizations with intelligent multi-model routing achieved median blended costs of $2.31 per million tokens. Organizations without routing paid $18.40 per million tokens. That is an 87% difference. Source: AICC, "Enterprise Token Costs Drop 67% Year-Over-Year as Multi-Model AI Adoption Hits Record High," May 2026 The gap is not about using less AI. It is about using the right model for each call. The enterprise paying $18.40 per million tokens is sending everything to Opus. The one paying $2.31 is routing simple tasks to Haiku and reserving Opus for the 10-20% of calls that actually need frontier reasoning. What $47 billion in revenue looks like with routing. Anthropic's $47 billion revenue run rate means enterprise customers are collectively spending roughly $3.9 billion per month on Claude tokens. Eighty percent of that, $3.1 billion, comes from enterprise API usage. We cannot know the exact prompt-complexity distribution across all of Anthropic's customers. But the industry data is consistent: 50% to 70% of production API calls are low or medium complexity. Model the blended spend at three tiers: Without routing (current state for most enterprises): All calls go to Opus at $5/$25 blended. 100% of tokens at frontier price. With three-tier routing: 60% of calls to Haiku at $1/$5: ~80% cost reduction on those calls 30% of calls to Sonnet at $3/$15: ~40% cost reduction 10% of calls stay on Opus at $5/$25: no change Blended cost reduction: roughly 55%. On $3.1 billion per month in enterprise Claude spend, that is $1.7 billion per month in potential savings. $20.4 billion per year. Not from using less AI. Not from switching providers. From routing each call to the cheapest Claude model that can handle it. Anthropic still gets paid. Every routed request still hits a Claude model. The total token volume stays the same or increases (because teams that save 55% tend to deploy more AI, not less). But the per-request cost matches the per-request complexity. The IPO creates a structural tension. Anthropic's business model scales with token volume at frontier prices. The company's revenue growth depends on more customers sending more tokens to Opus and Sonnet. A $965 billion valuation implies continued growth toward $50 billion and beyond in annualized revenue. Your business model requires controlling costs. You need AI to be productive, not to maximize your cloud provider's revenue. The tension is structural: what is good for Anthropic's IPO prospectus is not necessarily good for your P&L. This is not unique to Anthropic. OpenAI faces the same dynamic. Google faces it with Gemini. Every token-priced provider benefits from customers using the most expensive model available. The incentives are aligned on usage but misaligned on efficiency. CNBC reported in May that cheap AI models could actually threaten OpenAI and Anthropic's IPO valuations. If enterprises route to cheaper models, the blended revenue per token falls. This is the honest version of the market: providers want you on Opus, your CFO wants you on Haiku, and routing is the mechanism that resolves the tension by matching the model to the task. Source: CNBC, "Cheap AI could derail OpenAI and Anthropic's IPOs," May 2026 Josh Bersin's analysis puts it bluntly: AI prices are going up across the board for enterprise customers, and the companies with the best cost management practices are pulling ahead of competitors who treat AI spend as uncontrollable. Source: Josh Bersin, "AI Prices Are Going Up, Up, Up - And What This Means For Enterprise AI," May 2026 Five things to do before the Q3 budget review. 1. Audit your model selection. Pull your API logs from the last 30 days. What percentage of calls go to Opus versus Sonnet versus Haiku? If the answer is "mostly Opus" or "we do not know," you are overpaying. 2. Classify your prompt distribution. Tag a sample of 1,000 requests by complexity. Classification, formatting, simple Q&A, and status checks are Haiku-tier tasks. Multi-step reasoning, long-form generation, and novel problem-solving are Opus-tier. Most teams find 50-70% of their calls are in the cheap tier. 3. Enable prompt caching. Datadog found only 28% of teams cache their system prompts, despite 69% of tokens being static scaffolding. Anthropic supports automatic prefix caching. Enabling it costs nothing and saves 60%+ on input tokens for repeated content. 4. Route per request, not per application. Stop hardcoding model selection. A classifier that evaluates each call independently and routes to the cheapest capable model captures the 5x pricing gap between Haiku and Opus on the majority of production calls. 5. Plan for agentic growth, not chat-era budgets. Goldman Sachs projects 24x token consumption growth by 2030. Your AI budget should assume volume growth, not just price changes. Routing is the lever that lets volume grow without the bill growing proportionally. Source: Goldman Sachs, "AI Agents Forecast to Boost Tech Cash Flow as Usage Soars," May 2026 Where Nadir fits. Nadir sits between your application and Claude. The trained classifier evaluates each API call in under 10 ms and routes to the cheapest Claude model that can handle it. On 11,420 RouterBench held-out triples, the verifier-gated cascade preserves 98% of always-Opus quality at 60% lower cost. The integration is two lines: change the base URL, set \model="auto"\. Per-request response headers (\x-nadir-routed-to\, \x-nadir-cost-usd\, \x-nadir-cost-saved\) show exactly where each call went and what it saved. The dashboard aggregates savings by day, week, and month. Anthropic built a trillion-dollar company on token revenue. That revenue comes from your API bill. You do not have to stop using Claude to control the cost. You have to stop sending every call to the most expensive model. Sources: Fortune, "Anthropic confidentially files for IPO" (June 2026). TechCrunch, "Anthropic files to go public" (June 2026). CNBC, "Anthropic tops OpenAI as most valuable AI startup" (May 2026). VentureBeat, "Anthropic says it hit a $30 billion revenue run rate" (2026). Sacra, "Anthropic revenue, valuation & funding". Anthropic, "Pricing". Neowin, "Anthropic launches Claude Opus 4.8" (May 2026). VentureBeat, "Claude Opus 4.8 is here with 3X cheaper fast mode" (May 2026). Investing.com, "The AI Token Pricing Crisis Behind OpenAI and Anthropic's Revenue Race" (2026). Fortune, "Microsoft reports are exposing AI's real cost problem" (May 2026). Derek Thompson, "The AI Boom Has Entered Its 'Wait, Is This Worth It?' Phase" (2026). Datadog, "State of AI Engineering 2026". AICC, "Enterprise Token Costs Drop 67% Year-Over-Year" (May 2026). CNBC, "Cheap AI could derail OpenAI and Anthropic's IPOs" (May 2026). Josh Bersin, "AI Prices Are Going Up, Up, Up" (May 2026). Goldman Sachs, "AI Agents Forecast to Boost Tech Cash Flow" (May 2026). Anthropic model pricing as of June 2026.