Cerebras Files for $26.6 Billion Nasdaq IPO as Cloud Revenue Surges
Cerebras files for a $26.6 billion Nasdaq IPO after securing a $20 billion OpenAI deal. The firm pivots from hardware sales to inference-cloud revenue, betting wafer-scale compute can survive as a recurring business.
Cerebras filed for a second Nasdaq IPO, targeting a listing on May 14, 2026, and seeking up to $3.5 billion in capital at a valuation near $26.6 billion. The offering prices shares between $115 and $125, with the institutional order book exceeding 20x oversubscription. Behind the paperwork lies a structural shift: Cerebras is moving away from bespoke hardware sales toward recurring inference-cloud revenue, validated by a >$20 billion compute agreement with OpenAI and the launch of CS-3 systems on Amazon Web Services. This pivot transforms a hardware vendor into a utility-like operator, though the transition introduces new dependencies on wafer-scale manufacturing and concentrated customers.
The IPO mechanics and diversification
The prospectus reveals a company recalibrating after regulators blocked a late 2024 filing due to heavy reliance on Abu Dhabi-backed G42. At that stage, G42 accounted for 87% of revenue. The current filing reflects a broader distribution network. MBZUAI now represents 62% of 2025 revenue, while G42 drops to 24%. OpenAI and AWS serve as anchor partners, reducing concentration risk.
Full-year 2025 revenue reached $510 million,
Growth stems largely from cloud service adoption rather than pure chip shipments. Inference cloud revenue, rising from negligible levels previously. The order book's 20x demand underscores appetite for the new model.
De-risking the balance sheet via inference economics
The transition from upfront capex hardware sales to subscription-style inference usage fundamentally alters Cerebras's risk profile. Pre-profit AI chip vendors face brutal cyclicality when relying solely on capital expenditure cycles. Recurring revenue smooths volatility and improves visibility for investors. This mirrors the trajectory of successful semiconductor firms that expanded into services to lock in long-term contracts.
Cerebras isn't attempting to dethrone Nvidia across all domains. Instead, it targets high-throughput inference slices where deterministic, low-latency performance commands a premium. The Wafer-Scale Engine 3 delivers 250 times more on-chip SRAM and 2,625 times greater bandwidth compared to Nvidia's B200. Such metrics appeal to workloads sensitive to memory bottlenecks. However, wafer-scale production remains tied exclusively to TSMC, introducing a single-point-of-failure risk that persists regardless of the revenue model.
Pluralism in the accelerator stack
Integrating CS-3 onto Amazon Web Services in 2026 marked a milestone: the non-GPU accelerator entering a major hyperscaler's primary supply chain. This validates that wafer-scale architectures can slot into established cloud workflows without requiring complete architectural overhaul. Hyperscalers are stress-testing alternative designs to mitigate geopolitical exposure and supply-concentration hazards. Cerebras secures a foothold by complementing rather than replacing incumbent stacks.
The compute agreement with OpenAI draws 750 megawatts of capacity—echoing the grid constraints outlined in Power Is The New Bottleneck For AI Infrastructure—and highlights how power scarcity drives buyers toward efficient, dedicated solutions. Cerebras positions itself as that solution for latency-sensitive workloads, capturing a wedge in the market where raw transistor count matters less than effective utilization.
Our read
We see this IPO as a critical stress test for specialized silicon businesses. The era of selling custom hardware to well-heeled labs is giving way to managed services that guarantee uptime and output. Cerebras has demonstrated it can secure multi-year, multi-billion-dollar commitments before going public—a rare feat for a pre-profit tech firm. The question is whether the inference cloud can generate sufficient gross margins to justify the $26.6 billion mark once the initial contract tail winds down.
The 30% cloud revenue share is promising but incomplete. Until inference services dominate the income statement, Cerebras remains vulnerable to hardware order fluctuations. Furthermore, the reliance on TSMC for wafer-scale yields keeps execution risk elevated. If the company can maintain the OpenAI and AWS anchors while scaling the cloud footprint, it establishes a durable niche. Otherwise, it risks becoming another boutique accelerator swallowed by the general-purpose giants.
Investors watching the May 14 listing will scrutinize how quickly the remaining 70% of revenue migrates to the cloud model—and whether the unit economics of inference can sustain the valuation.
Cerebras’s IPO validates its strategic pivot to recurring inference-cloud revenue, but sustaining a $26.6 billion valuation requires successfully converting the majority of its business to software-defined services.
Stance · CautiousConfidence · Established
The article acknowledges a structurally sound pivot to recurring revenue but emphasizes execution hurdles, valuation pressure, and concentrated manufacturing dependencies that limit immediate optimism.
Key takeaways
Customer concentration has shifted significantly, with MBZUAI accounting for 62 percent of 2025 revenue and G42 falling to 24 percent, supplemented by anchor deals with OpenAI and AWS.
Total 2025 revenue reached $510 million, fueled by a rapid expansion in inference-cloud subscriptions that now represent approximately 30 percent of the portfolio.
The Wafer-Scale Engine 3 targets latency-sensitive inference workloads with superior memory density and bandwidth, deliberately avoiding direct competition in broad training markets.
Manufacturing relies exclusively on TSMC for wafer-scale chips, preserving a critical single-point-of-failure risk regardless of the new recurring-revenue model.
Long-term viability depends on migrating the remaining 70 percent of revenue to the cloud platform and proving that inference services can support the proposed valuation once early contract tails expire.
What to watch next
Speed of migration for the remaining 70 percent of revenue to the cloud model
Unit economics and gross margin trajectories for the inference-as-a-service segment
Wafer-scale yield stability at TSMC and any announced manufacturing diversification
Who should care
AI infrastructure investorsCloud strategistsSemiconductor supply chain analystsEnterprise AI architects
Key players
CerebrasOpenAIAmazon Web ServicesTSMCMBZUAI
Auto-generated from the article by our model — a reading aid, not a replacement for the piece.