Decoding Discontinuity

Decoding Discontinuity

The Cerebras IPO Test, Part II: The Architectural Math Behind a $17-20 Billion Fair Value

Cerebras bet the future of AI on ultra-fast wafer-scale inference. But the agentic economy emerging in 2026 is heterogeneous, CPU-centric, and increasingly optimized for cheaper FP8 compute.

Raphaëlle d'Ornano's avatar
Raphaëlle d'Ornano
May 19, 2026
∙ Paid
Image by Maryam Sicard via UnSplash

A follow-up to last week’s piece, now that the deal is live. A deep dive into whether the wafer-scale chip is built for the inference economy that has actually arrived.

Cerebras’s wafer-scale architecture was designed for monolithic, high-precision, single-model inference at extreme speed. The Inference Economy that has materialized in 2026 is heterogeneous, orchestrated, multi-model, context-heavy, and CPU-centric. This creates a structural architectural mismatch. Three misalignments limit Cerebras’ durable moat. At ~$66 billion, the stock trades 3-4× our fair-value range of $17-20 billion. The Inference Economy is real. Whether Cerebras’s architecture is positioned to capture its economic surplus remains an open question.

Keep reading with a 7-day free trial

Subscribe to Decoding Discontinuity to keep reading this post and get 7 days of free access to the full post archives.

Already a paid subscriber? Sign in
© 2026 Raphaëlle d'Ornano · Privacy ∙ Terms ∙ Collection notice
Start your SubstackGet the app
Substack is the home for great culture