Cerebras (CBRS) — wafer-scale inference, S&P 500 fast-track, and the 86% UAE concentration
S&P 500 fast-tracked Cerebras 12 days after its IPO. Passive funds now own a $48.8B name with 86% UAE revenue concentration. The wafer-scale inference thesis and the customer risk you buy through SPY.
The standard $CBRS story is simple: hot AI IPO, +68% on day one, now in the S&P 500. That's true. It also misses the structural read.
Cerebras is the first specialist inference chip on the public tape. And as of May 25, 2026 — twelve days after its IPO priced — S&P 500 fast-tracked the stock, skipping the standard twelve-month observation period. Every SPY, IVV, and VOO holder is now long a $48.8B AI company whose top three customers explain roughly 99% of disclosed and contracted revenue, and two of those three are related-party Abu Dhabi entities. This piece walks through what the WSE-3 chip actually does, where Cerebras fits in QA's AI Compute Accelerators bubble, how the customer book is structured, and what the passive-flow buyers just acquired without checking.
The TL;DR. Cerebras builds wafer-scale inference accelerators — one chip the size of a dinner plate, 4 trillion transistors, ~21× the inference throughput of an NVIDIA B200 on Llama 3 70B reasoning. The architectural pitch is real and structurally defensible. The customer book is not: 86% of 2025 revenue from G42 + MBZUAI (both Abu Dhabi), 80% of $24.6B backlog from a single OpenAI contract. S&P fast-track added passive demand to that concentration before any of it diversified.
For the May 14 IPO-event coverage, see Cerebras (CBRS) IPO: the first pure-play inference bet hits the tape. This piece is the evergreen explainer with the S&P inclusion lens.
What does Cerebras do?
Cerebras designs and sells AI accelerator hardware built around the Wafer-Scale Engine — a single chip etched on a full 300mm silicon wafer instead of being diced into individual dies. The current generation, WSE-3, is one die the size of a dinner plate: 4 trillion transistors, 900,000 cores, 21 PB/s of on-wafer memory bandwidth. That bandwidth number is roughly 2,600× a single NVIDIA B200. The CS-3 system that wraps the wafer delivers 125 PFLOPS of AI compute, draws 23 kW, and occupies 15U of rack space.
The architectural pitch is not flops. It's dataflow. A conventional GPU cluster runs an LLM by sharding the model across many chips and moving activations between them over interconnect (NVLink within a server, InfiniBand across racks). Token-by-token inference is latency-bound by that cross-chip synchronization. Cerebras's "layer-by-layer" dataflow runs the entire wafer on one layer of the model for all in-flight requests, then steps to the next layer. Cross-chip synchronization disappears — there's only one chip. On Llama 3 70B reasoning workloads, Cerebras claims ~21× faster inference than B200 at ~32% lower total cost of ownership. Independent Artificial Analysis benchmarks on Llama 4 Maverick (400B params): CS-3 delivers ~2,500 tokens/sec/user vs NVIDIA DGX B200 at ~1,000, SambaNova at ~794, Groq at ~549.
The company is explicit about what this doesn't do. WSE-3 isn't a training chip — $NVDA's H100/B200/Blackwell stack still owns that surface. WSE-3 isn't general-purpose compute — the wafer is hardwired around dense transformer inference. The pitch is narrower and sharper: latency-critical inference for frontier models, the slice where every additional millisecond of time-to-first-token is a measurable UX cost and a customer-grade pain point.
How they make money
Cerebras sells CS-3 systems (and the wafers inside them) to a small number of large customers who want pre-built, pre-tuned inference capacity rather than building it themselves on GPUs. Full-year 2025 revenue was $510M, +76% YoY, with $87.9M in net profit on the bottom line. That's the headline.
The structure underneath is the read. Of that $510M in 2025 revenue:
- G42 (Abu Dhabi-based AI conglomerate, related to MBZUAI) — ~24%
- MBZUAI (Mohamed bin Zayed University of AI, Abu Dhabi state-affiliated) — ~62%
- Everyone else combined — ~14%
That's 86% from two related-party UAE entities. The customer concentration risk on this name is not a footnote; it's the dominant signal in the financials.
Forward, the backlog tells a different but adjacent story. $24.6B in contracted revenue. Of that:
- OpenAI — roughly $20B, via a $10B+ deal for 750 MW of inference capacity over multiple deliveries through 2028
- G42 + MBZUAI continuation — most of the remainder
The two-customer current revenue concentration replaces with a different three-customer forward concentration (OpenAI dominates; G42 + MBZUAI continue). Concentration doesn't go away; it migrates. The bull-case framing is that the backlog represents commercial validation by the single most-prominent AI lab in the world. The bear-case framing is that if any one of those three relationships breaks — OpenAI shifts to in-house silicon, UAE export controls tighten, G42 reorganizes — the forward revenue trajectory is structurally different.
Where it sits in AI Compute Accelerators
CBRS is the newest member of QA's AI Compute Accelerators bubble, seeded at weight 0.70 (primary). For context, weights of established members:
- $NVDA — 1.00 (the cluster anchor)
- $AMD, $TSM — 0.95
- $AVGO — 0.90
- $MRVL — 0.80
- $CBRS — 0.70
- $QCOM, $INTC — 0.65 / 0.60 (partial-thesis)
The 0.70 weight reflects a real but specialist thesis: pure-play AI exposure, but scoped to inference rather than the full compute stack. No 252-day correlation history yet (the stock has traded for two weeks), so the weight will get revisited after 30 sessions of post-IPO tape. Expect lower residualized correlation to the bloc than NVDA/AMD/AVGO — single-thesis exposure clusters less tightly than diversified compute.
Structurally, CBRS is the inference-specialist counter to NVIDIA's general-purpose GPU dominance. NVIDIA's moat is not the silicon — it's CUDA, the 19-year-old software ecosystem every model is written against. Cerebras's bet is that inference is sufficiently different from training that the software-moat logic that locks hyperscalers into NVIDIA for training doesn't translate to inference, where the workload is narrower (token generation, dense GEMMs), latency is the dominant cost, and a vertically-integrated stack can beat a general-purpose one on the relevant metric. Whether that bet plays out is the central long-term question on this name.
The numbers
| Metric | Value | As of | | --- | --- | --- | | Market cap (fully-diluted) | $48.8B | 2026-05-18 | | TTM revenue | $510M (+76% YoY) | FY 2025 | | Net income (FY 2025) | $87.9M | FY 2025 | | Customer concentration | 86% from 2 UAE entities | FY 2025 | | Contracted backlog | $24.6B (~80% OpenAI) | 2026-05-18 | | IPO price / Day 1 close | $185 / $311.07 (+68.2%) | 2026-05-14 | | Exchange / Sector | NASDAQ / IT Semiconductors | — |
At $48.8B fully-diluted on $510M of revenue, CBRS trades at ~96× sales. That multiple compresses substantially if the OpenAI backlog converts to ratable revenue on schedule — $24.6B over ~3 years implies an ~$8B/yr forward run-rate, which would put the multiple at ~6× forward sales by 2028 if delivery and recognition land. The valuation is not pricing the current business; it's pricing the backlog conversion plus an inference-market share narrative.
The bull case
- Architectural advantage on the right workload. Wafer-scale dataflow wins on token-generation latency in ways general-purpose GPUs structurally cannot match. The Artificial Analysis benchmarks aren't marketing — they're independent third-party measurement.
- OpenAI lock-in is a real moat. The $10B+ / 750 MW deal through 2028 is not a try-before-you-buy. It's a multi-year capacity commitment from the single most credible AI lab in the world, with switching costs (model porting, infrastructure validation) that lock in revenue for the contract duration.
- Inference-cycle bet, not training-cycle bet. Training capex is the cycle that already happened. Inference scaling is the next 5-10 years, and the workload mix favors specialists more than general-purpose silicon.
- Passive demand floor. S&P 500 inclusion creates a mechanical buyer regardless of fundamentals. The free-float pressure during regular SPY/IVV/VOO rebalance windows is structural support for the stock through 2028 unless removed.
- Sufficient capital to execute. IPO raised $5.55B against a backlog of $24.6B. The cash position is large enough to fund the wafer-fab capacity ramp and the customer-acquisition push that diversifies the book.
The bear case
- 86% UAE concentration is not a sentence on a slide; it's the business. Two related-party entities representing the overwhelming majority of revenue means a single decision in Abu Dhabi materially restructures the company. Geopolitical risk (US export controls on advanced AI hardware to certain jurisdictions) compounds the concentration risk.
- OpenAI backlog is a single counterparty. If OpenAI's compute strategy shifts to in-house silicon (the Stargate datacenter buildout, the Broadcom co-design rumors, the rumored TSMC capacity reservation), the $20B forward becomes substantially less reliable. Backlog ≠ recognized revenue.
- NVIDIA software moat may translate to inference too. TensorRT-LLM, vLLM-CUDA, Triton kernel optimization — NVIDIA has shipped substantial inference-specific software that closes the latency gap on the specific workloads where Cerebras claims architectural advantage. The 21× number is on a specific Llama 3 70B benchmark; what's the read on Blackwell vs WSE-3 on a tuned vLLM-CUDA stack a year from now?
- 96× sales is a multiple. Even if backlog converts perfectly, the multiple has to be defended. A single OpenAI delivery slip or a single re-rating in semis sentiment compresses the equity quickly.
- No clean US institutional shareholder base yet. Pre-S&P, the float was tightly held; S&P inclusion creates index demand but doesn't create active fundamental conviction. The buy-side institutional book is still forming.
How to access
Direct stock. $CBRS trades on NASDAQ in USD. Any US-retail brokerage with NASDAQ access can buy it directly — Interactive Brokers, Schwab, Fidelity, Robinhood. For US-resident retail looking at this as a long, the IBKR-via-LLC structure is the cleanest path; see /stack/ibkr for the access mechanics.
Through the S&P 500 index — the new path. As of the May 25, 2026 effective date, every S&P 500 index fund holds CBRS proportional to its market-cap weight. That means:
- $SPY, $IVV, $VOO — S&P 500 trackers, ~0.1-0.2% CBRS weight initially
- $QQQ — Nasdaq-100 (CBRS qualifies on the exchange listing); inclusion timing typically lags S&P by 1-2 rebalance cycles
- $SOXX, $SMH — semiconductor ETFs; inclusion depends on each fund's methodology
QA tracks ETF holdings for CBRS on the /stocks/cbrs page; the holdings table will populate over the next 4-6 weeks as 13F and ETF holdings databases catch up to the S&P inclusion event. The first wave of passive demand has already happened — the more interesting question is what the second wave looks like once the rebalance windows compound.
Through related thematic ETFs. Cerebras's architectural relevance ties it to several editorial themes: AI infrastructure pure-plays, specialist accelerator names, and the AI Compute Accelerators bubble cluster. As the position builds across thematic vehicles, the /etfs page tracks the inclusion timeline.
What to watch
- Q1 2026 earnings (first as a public company) — expected mid-August 2026. Watch the customer mix disclosure, the OpenAI delivery cadence, and any signal on customer #4 / customer #5.
- OpenAI inference capacity decisions. Any commentary from OpenAI on their Stargate / in-house silicon plans is material. A clear directional read on whether OpenAI is doubling down on Cerebras or hedging toward in-house silicon resets the bear-case probability weight.
- UAE export controls. US Commerce Department announcements on advanced AI hardware exports to UAE / Gulf jurisdictions directly affect the G42 + MBZUAI customer relationships. Watch the headline tape.
- The post-S&P passive flow tape. Roughly 4-6 weeks post-effective-date is when the bulk of passive inflows have completed and active-fund positioning takes over as the marginal flow. The tape behavior around that transition is informative on the institutional read.
- Bubble-level read. If the AI Compute Accelerators bubble breaks correlation with NVDA on the next correction, CBRS's beta to the bloc — and the structural read on inference-specialist independence — gets meaningfully clearer.
Live data on this ticker: /stocks/cbrs — price, ETF holdings (populating post-S&P), bubble correlation, bot positions.
Bubble context: /bubbles/semiconductors — the AI Compute Accelerators cluster CBRS belongs to and how it's moving.
QuantAbundancia is educational research. Nothing here is investment advice. See /disclosures.
Related bubbles
Get the daily digest.
One email a day · alerts + bubble shifts + new research. Free during beta.
No spam. One email per day max. Telegram alerts coming with the paid tier.