Cerebras Systems
Wafer-Scale AI Infrastructure — The Fastest AI Inference Chip
Cerebras has built a genuine hardware breakthrough in WSE-3 with the world's fastest AI inference, but 86% revenue concentration in a CFIUS-scrutinised customer and a delayed IPO create existential risk alongside the compelling technical moat.
Cover facts
Company profile
Cerebras Systems was founded in 2016 by Andrew Feldman (CEO) and a team of former semiconductor engineers in Sunnyvale, California. The company spent three years in stealth building the Wafer-Scale Engine (WSE), launching the WSE-1 and CS-1 system in 2019. The WSE-3, its third-generation chip manufactured by TSMC on 5nm, is the largest chip in commercial production at 46,225 mm² with 4 trillion transistors, 900,000 AI cores, and 21 PB/s on-chip memory bandwidth. Cerebras serves national laboratories (LLNL, ANL), sovereign AI programmes (G42/UAE), and pharmaceutical customers (GSK), with G42 representing 86% of H1 2024 revenue of $136M. The company filed an S-1/A for IPO in October 2024 but withdrew due to CFIUS uncertainty arising from G42's UAE government ties.
- Website
- cerebras.net
- Founded
- 2016-01-01
- Founders
- Andrew Feldman
- Founding location
- Sunnyvale, CA, USA
- Headquarters
- Sunnyvale, CA, USA
- Product
- CS-3 on-premises AI system (WSE-3 + MemoryX 850TB + SwarmX fabric); Cerebras Inference API (cloud, OpenAI-compatible, 2,100+ t/s on LLaMA-70B); PyTorch-compatible SDK and open-source Model Zoo (30+ architectures); AWS disaggregated inference partnership (2026).
- Customers
- National laboratories (LLNL, ANL, Sandia), sovereign AI programmes (G42/UAE), pharmaceutical companies (GSK), and AI startups/researchers accessing the cloud inference API.
- Business model
- Hardware system sales and leases (CS-3 on-prem, multi-year contracts); cloud inference API (per-token pricing, undisclosed); professional services bundled with on-prem contracts.
- Stage
- Series D (Nov 2023), late-stage private; IPO withdrawn Oct 2024
- Funding status
- ~$720M raised; most recent round was $177M Series D (Nov 2023) led by Altimeter, Citadel Securities at $4.0B valuation.
Executive summary
Top strengths
- Genuine hardware breakthrough: WSE-3 delivers 15x faster LLM inference than H100 at low batch sizes — a defensible, hard-to-replicate advantage
- Revenue scale and growth: $136M H1 2024 revenue represents substantial commercial traction ahead of IPO
- Blue-chip national lab and pharma customer base (LLNL, ANL, GSK) provides credibility and diversification anchors
- First-mover in wafer-scale integration with three chip generations and significant IP moat
- AWS partnership expands distribution to enterprise cloud customers without additional hardware capex
Top risks
- G42/UAE customer concentration: 86% of H1 2024 revenue from one customer under active CFIUS scrutiny creates existential revenue risk
- IPO delayed by CFIUS review: inability to access public markets limits capital options and employee liquidity, increasing churn risk
- TSMC sole-source dependency: wafer-scale chip can only be manufactured by TSMC; any geopolitical disruption halts production entirely
- NVIDIA ecosystem moat: CUDA's 4M+ developer base and dominance in training creates a formidable switching cost against Cerebras expansion
- Export control escalation: tightening BIS rules on advanced AI chip exports could restrict or eliminate sales to UAE and other foreign customers
Open gaps
- G42 contract structure and contingency plan if CFIUS forces Cerebras to end the relationship
- WSE-3 production yield rates and per-unit manufacturing cost economics
- IPO re-filing timeline and capital runway post-S-1 withdrawal
- Non-G42 revenue breakdown by customer segment
Contents
01Company Overview
1.1 Company Identity, Legal Structure, and Business Model
Cerebras Systems Inc. is a Delaware corporation founded in April 2016, headquartered at 1237 E. Arques Avenue, Sunnyvale, California 94085. The company designs and manufactures the Wafer-Scale Engine (WSE), the world's largest commercially produced integrated circuit, integrating it into the CS-3 compute system. Its core business model spans two complementary revenue streams: direct hardware sales of CS-3 clusters to hyperscalers, research institutions, and government-affiliated AI labs; and cloud-based inference services via the Cerebras Inference Cloud API accessible at cerebras.ai/inference and cloud.cerebras.ai. The inference cloud service enables customers to run large language models at throughput exceeding 2,100 tokens per second, which the company advertises as up to 15 times faster than NVIDIA GPU-based deployments for equivalent LLM workloads. Cerebras is a fabless chip designer: all silicon fabrication is outsourced to TSMC on its 5-nanometer process. The WSE-3, the latest generation chip, integrates 4 trillion transistors across a 46,225 mm² die, with 900,000 AI cores, 44 GB of on-chip SRAM, and 21 petabytes per second of on-chip memory bandwidth. Cerebras began trading on the Nasdaq Global Select Market under ticker CBRS on May 14, 2026 following completion of its initial public offering. The company's value proposition centers on speed-first AI inference at scale, positioning its chip as a replacement for large multi-GPU clusters in latency-sensitive inference applications.[CO001, CO002, CO024, CO026, CO027]
| Metric | Value | Period / Notes |
|---|---|---|
| Revenue | $510.0M | FY 2025 |
| Revenue Growth (YoY) | 76% | FY 2025 vs. FY 2024 |
| Revenue FY 2024 | $290.3M | FY 2024 |
| Gross Margin | 39% | FY 2025 (42% in FY 2024) |
| GAAP Net Income | $237.8M | FY 2025 (vs. -$481.6M in FY 2024) |
| Non-GAAP Net Loss | -$75.7M | FY 2025 (excl. SBC & warrant revaluations) |
| IPO Price | $185/share | May 14, 2026; Nasdaq: CBRS |
| IPO Gross Proceeds | ~$5.55B | May 14, 2026 |
| Day-1 Close Price | ~$311/share | May 14, 2026 (+68% from offer price) |
| Headcount | ~1,000 | March 2026 (~708 at Dec 31, 2025) |
| Pre-IPO Capital Raised | ~$720M+ | Venture capital prior to IPO |
| Top-2 Customer Concentration | ~86% of 2025 Revenue | MBZUAI (62%) + G42 (24%) |
| WSE-3 Die Size | 46,225 mm² | Per chip, TSMC 5nm |
Revenue, margin, and headcount data from SEC S-1/A (CIK 2021728, filed May 11, 2026). IPO price and day-1 close from CNBC and Nasdaq reports. Non-GAAP metrics exclude stock-based compensation and warrant fair-value changes. Post-IPO market cap omitted as fully diluted share count not yet published.
[CO007, CO010, CO011, CO012, CO013, CO014]Cerebras business model: from chip design through TSMC fabrication to CS-3 hardware and inference cloud delivery
[CO002, CO024, CO026, CO027]1.2 Founding Team and Executive Leadership
Cerebras was co-founded by Andrew D. Feldman (CEO), Sean Lie (CTO), Gary Lauterbach, and colleagues who previously worked together at SeaMicro, a high-density server startup acquired by AMD for $334 million in 2012. Feldman had served as SeaMicro's CEO and later as Corporate Vice President of Server Business at AMD. Lie holds a Bachelor of Science from MIT in Electrical Engineering and Computer Science and led high-performance interconnect architecture at AMD before co-founding Cerebras. Gary Lauterbach, a veteran chip architect with experience from the AMD and SeaMicro cohort, serves in a co-founder technical capacity. The executive team also includes Dhiraj Mallick as COO and Komin as CFO; Komin brings prior CFO experience at Sunrun, Flurry, Ticketfly, and Linden Research, providing public-company financial leadership ahead of the IPO. The company's S-1/A explicitly identifies Andrew Feldman as a key-person risk; loss of the CEO would likely impair Cerebras' ability to execute its technical roadmap and maintain marquee customer relationships. The founding team's shared history at SeaMicro and AMD is a notable cohesion factor—core founders have worked together for over a decade—which reduces intra-team execution risk while concentrating critical technical knowledge in a small group. Board composition post-IPO includes co-founder members alongside investor representatives; the full list of independent directors was not fully enumerated in public filings accessible at time of research.[CO003, CO004, CO005, CO006, CO036]
| Name | Role | Background | Founder? |
|---|---|---|---|
| Andrew D. Feldman | CEO | Co-founded SeaMicro (acq. AMD $334M, 2012); AMD VP Server Business | Yes |
| Sean Lie | CTO | MIT EECS B.S.; AMD high-performance interconnect architect | Yes |
| Gary Lauterbach | Co-Founder / Technical | Veteran chip architect; AMD/SeaMicro alumnus | Yes |
| Dhiraj Mallick | COO | Enterprise technology operations; hyperscale infrastructure background | No |
| Komin | CFO | Former CFO at Sunrun, Flurry, Ticketfly, and Linden Research | No |
Sourced from SEC S-1/A named officer disclosures (CIK 2021728) and cerebras.ai/company page. Full post-IPO board composition including independent director names not fully enumerated in accessible public filings at time of research.
[CO003, CO004, CO005, CO006]1.3 Funding History and Capital Structure
Prior to its IPO, Cerebras raised approximately $720 million in venture capital across multiple rounds. G42, the Abu Dhabi-based AI conglomerate, invested a cumulative $335 million and became both Cerebras' largest strategic investor and primary revenue customer, representing 85% of 2024 revenues. In December 2025, OpenAI committed a $1 billion Working Capital Loan at 6% annual interest maturing December 31, 2032, providing pre-IPO liquidity. The IPO at $185 per share on May 14, 2026 raised approximately $5.55 billion in gross proceeds. AWS signed a binding term sheet with Cerebras in March 2026, acquiring approximately 2.7 million warrant shares at a $100 exercise price in connection with hardware procurement commitments. OpenAI holds a separate warrant for 33.4 million shares at a nominal $0.00001 per share exercise price tied to the master revenue agreement, representing material potential dilution for common stockholders. The share structure is dual-class: Class A shares carry one vote, Class B shares carry 20 votes, and Class N shares carry zero votes. Following the IPO, Class B holders—primarily founders and early insiders—retained approximately 99.2% of total voting power, enabling founder-controlled governance indefinitely. The combined equity from Series H and the OpenAI loan represent the most significant pre-IPO capitalization events, though individual round sizes for Series A through Series G are not fully disaggregated in public filings.[CO015, CO018, CO019, CO020, CO021, CO022]
| Stakeholder | Type | Stake / Investment | Relationship Details |
|---|---|---|---|
| G42 (UAE) | Strategic Investor & Customer | $335M cumulative equity investment | 85% of 2024 revenue; 24% of 2025 revenue; Abu Dhabi AI conglomerate |
| OpenAI | Customer & Warrant Holder | $1B Working Capital Loan (6%); 33.4M warrants @ $0.00001 | $20B+ MRA signed Dec 2025; 750 MW capacity allocation |
| MBZUAI (UAE) | Revenue Customer | No disclosed equity stake | 62% of 2025 revenue; 77.9% of AR at Dec 31, 2025 |
| AWS (Amazon) | Customer & Warrant Holder | ~2.7M warrant shares @ $100 exercise price | Binding term sheet March 2026; hardware procurement |
| Andrew Feldman (CEO) | Founder & Insider | Class B shares (20 votes/share) | ~99.2% voting control held by Class B holders post-IPO |
| Public (Nasdaq: CBRS) | Shareholders | $5.55B IPO gross proceeds | Class A shares (1 vote/share); listed May 14, 2026 |
| Pre-IPO VC Investors | Financial Investors | Part of ~$720M pre-IPO venture capital | Board representation per S-1; exact firm breakdown not fully disclosed |
Investment amounts, warrant terms, and revenue percentages from SEC S-1/A (CIK 2021728). Individual VC firm stakes by series not fully enumerated in public filings. Pre-IPO VC row aggregates Series A through G investors; specific firm names and percentages require cap table access.
[CO015, CO016, CO018, CO019, CO020, CO021]1.4 Key Performance Metrics and Stage Assessment
Cerebras is now a publicly traded company at an advanced growth stage. Revenue grew from $24.6 million in 2022 to $78.7 million in 2023, $290.3 million in 2024, and $510.0 million in 2025, representing 76% year-over-year growth in the most recent fiscal year. Gross margins expanded from 12% in 2022 to 42% in 2024 before compressing slightly to 39% in 2025, reflecting volume mix shifts toward hardware deployments. The company achieved GAAP net income of $237.8 million in 2025, a dramatic reversal from a $481.6 million GAAP net loss in 2024. However, 2025 profitability was significantly influenced by non-cash warrant fair-value revaluations related to the OpenAI and AWS transactions; the non-GAAP net loss for 2025 was $75.7 million, indicating the underlying business remains pre-profitable on a cash-adjusted basis. Headcount was 708 as of December 31, 2025, growing to approximately 1,000 by March 2026. Customer concentration is the most material operational risk: MBZUAI accounted for 62% of 2025 revenue and 77.9% of accounts receivable at year-end 2025. Combined with G42, UAE-linked entities represented approximately 86% of total 2025 revenues, creating significant geopolitical and counterparty concentration exposure that investors must weigh against the forward revenue trajectory from the OpenAI Master Revenue Agreement.[CO010, CO011, CO012, CO013, CO014, CO015]
Key valuation, financing, and ownership KPIs for Cerebras Systems at IPO (May 2026)
Day-1 close price rounded from $331.07. Round sizes and share prices from SEC S-1/A (CIK 2021728). OpenAI MRA value represents committed spend; actual revenue recognition timing not disclosed.
[CO020, CO021, CO022, CO023, CO028]1.5 Company Milestones and Adverse Events
Cerebras was incorporated in Delaware in April 2016 and spent its first three years in stealth mode developing the wafer-scale chip concept. The WSE-1 was unveiled at Hot Chips 2019, the WSE-2 followed in 2021, and the WSE-3 was introduced in March 2024. The company filed its initial S-1 with the SEC in September 2024 at a reference valuation of approximately $4.25 billion, followed by an amended S-1 in April 2026 and a final S-1/A in May 2026. SiliconAngle reported Cerebras initially targeted an IPO price range of $150 to $160 before finalizing at $185. The stock debuted at approximately $350 on May 14, closed at approximately $311 (+68% from offer price), but retreated to $279.72 on May 15. The Wall Street Journal characterized the IPO as a huge bet on Nvidia fatigue, signaling analyst skepticism about whether the non-standard form factor can sustain demand beyond early adopters. Adverse observations include the post-IPO stock decline, continuing non-GAAP operating losses despite GAAP profitability, heavy UAE customer concentration at IPO, and the undisclosed revenue recognition schedule under the OpenAI MRA. The GSK partnership and the $20 billion-plus OpenAI MRA signed in December 2025 represent major product-market validation milestones that transform the revenue concentration profile away from UAE entities toward a US hyperscaler anchor.[CO007, CO008, CO009, CO018, CO025, CO029]
| Date | Category | Milestone |
|---|---|---|
| April 2016 | Founding | Cerebras Systems Inc. incorporated in Delaware; founding team from SeaMicro/AMD cohort |
| August 2019 | Product | WSE-1 unveiled at Hot Chips 2019; first commercial wafer-scale engine announced |
| April 2021 | Product | WSE-2 launched; improved transistor count and memory bandwidth vs. WSE-1 |
| 2021 | Financing | Series F: $250M raised; total pre-IPO capital reaches $475M+ |
| March 2024 | Product | WSE-3 launched: 4 trillion transistors, 46,225 mm², TSMC 5nm, 900K AI cores |
| September 2024 | Regulatory | Initial S-1 filed with SEC at ~$4.25B implied valuation |
| 2024 | Partnership | G42 cumulative investment reaches $335M; G42-linked entities = 85% of 2024 revenue |
| December 2025 | Partnership | OpenAI Master Revenue Agreement: $20B+ contract, 750 MW capacity; $1B Working Capital Loan from OpenAI at 6% |
| March 2026 | Partnership | AWS binding term sheet signed; ~2.7M warrant shares granted at $100 exercise price |
| April 2026 | Regulatory | Amended S-1/A filed with SEC |
| May 11, 2026 | Regulatory | Final S-1/A filed; IPO price range updated to $150-$160 before final $185 pricing |
| May 14, 2026 | IPO | Nasdaq IPO at $185/share (CBRS); opens ~$350; closes ~$311 (+68%); $5.55B raised |
Dates from SEC filings (exact) and secondary news sources (approximate for pre-2024 events). WSE-1 and WSE-2 launch months are approximate based on conference timings. Pre-2019 internal R&D milestones not publicly documented.
[CO001, CO007, CO008, CO018, CO021, CO024]Key events in Cerebras Systems history from founding in April 2016 through the May 2026 Nasdaq IPO
WSE-1 month (August 2019) based on Hot Chips 2019 conference date. WSE-2 April 2021 is approximate. Series F year sourced from secondary reports; exact close date not publicly confirmed. SEC filing dates are exact per EDGAR timestamps.
[CO001, CO007, CO018, CO021, CO024, CO029]1.6 Exhibits
02Market Analysis
2.1 Market Definition and Scope
The AI infrastructure market encompasses compute hardware, cloud services, and associated software sold to enable training and inference of AI and machine learning models. Included spend covers GPU and AI accelerator hardware (NVIDIA H100/B200, AMD MI300X, Google TPU, AWS Trainium, Intel Gaudi, Cerebras WSE), AI-specific cloud compute services, and AI networking (InfiniBand, NVLink). Excluded spend includes general-purpose compute, enterprise storage, and SaaS applications that happen to incorporate AI features. Two subsegments are most relevant to Cerebras: AI training infrastructure — dominated by NVIDIA and concentrated among hyperscalers and AI labs; and AI inference infrastructure — serving models in production applications, growing faster than training as enterprise deployment expands. The Cerebras Inference Cloud competes in the inference-as-a-service subsegment against OpenAI API, Together AI, and Groq on throughput and price. Status-quo substitutes include NVIDIA-based cloud instances (AWS p5, Google A3) and proprietary inference APIs from AI labs.[CM001, CM002, CM003, CM004, CM015]
| Subsegment | Included Spend | Key Buyers | 2025E ($B) | CAGR to 2029 | Cerebras Position |
|---|---|---|---|---|---|
| AI Training Hardware | GPU/accelerator HW for model training (H100, B200, WSE-3, MI300X) | Hyperscalers, AI labs (OpenAI, Meta, xAI, Anthropic), government programs | ~$80B | ~22% | CS-3 clusters for training; currently small % of Cerebras revenue |
| AI Inference Hardware (On-Premise) | Accelerator hardware for serving models in production | Hyperscalers, enterprise IT, government AI labs | ~$65B | ~30% | CS-3 inference clusters; WSE-3 speed advantage most relevant |
| AI Inference-as-a-Service (Cloud API) | Cloud API services for LLM inference (tokens/second basis) | Enterprise developers, AI startups, research organizations | ~$30B | ~45% | Cerebras Inference Cloud (cloud.cerebras.ai); competes with Groq, Together AI, OpenAI API |
| AI Networking and Interconnect | InfiniBand, NVLink, RoCE networking for AI cluster communication | Hyperscalers, GPU cluster operators | ~$15B | ~25% | Not a Cerebras product; WSE-3 eliminates inter-chip communication need |
| Adjacent Spend (Excluded) | General cloud compute, storage, CPUs, AI-enabled SaaS applications | Enterprise IT broadly | ~$60B+ | ~15% | Not directly addressable; adjacent demand signal only |
2025E subsegment estimates are author-applied splits from Cerebras SEC S-1/A total of $251B using SIA and McKinsey data; individual subsegment sizes are approximate. CAGR figures apply Cerebras management's 28% aggregate to subsegment fractions. Inference cloud subsegment CAGR of 45% reflects early-stage market.
[CM001, CM002, CM003, CM004]2.2 Market Sizing and Growth Trajectory
The Cerebras SEC S-1/A Amendment No. 2 (May 2026) cites an AI infrastructure TAM of $251 billion in 2025 projected to $672 billion by 2029 (28% CAGR). A narrower lens — the AI chip market alone — is estimated at approximately $120 billion in 2025 by the Semiconductor Industry Association, more than doubling from $55 billion in 2023. Cerebras management estimates its SAM at $13.7 billion in 2025 growing to $51.5 billion by 2029, representing inference-first workloads where the WSE-3's speed advantage is most relevant. EpochAI's analysis shows frontier model training compute has grown 4x per year since 2020, supporting continued hardware demand. Stanford HAI AI Index 2025 estimates total AI investment reached $100 billion in 2024. McKinsey's 2024 State of AI report finds 72% of organizations have adopted AI in at least one business function. These signals confirm a market in early-adoption phase with secular tailwinds through 2029.[CM001, CM005, CM006, CM012, CM013, CM014]
| Lens | Estimate | Year | Source | Confidence | Limitation |
|---|---|---|---|---|---|
| TAM — Total AI Infrastructure | $251B → $672B | 2025 → 2029 (28% CAGR) | Cerebras SEC S-1/A Amendment No. 2 | Medium | Self-reported; no independent analyst source identified by name |
| TAM — AI Chip Market Only | ~$120B | 2025 | Semiconductor Industry Association | Medium | Excludes cloud services layer; excludes China export-restricted market |
| SAM — Cerebras-Relevant AI Compute | $13.7B → $51.5B | 2025 → 2029 | Cerebras SEC S-1/A (management estimate) | Low-Medium | Management estimate; inference-first workload boundary not independently verified |
| SOM — Cerebras Actual Revenue | $510M | 2025 | Cerebras SEC S-1/A | High | Highly concentrated (86% UAE); not representative of diversified SOM |
| Reference: EpochAI Compute Growth | 4x/year training compute growth since 2020 | 2020-2025 | EpochAI blog (AI and Compute) | High | Training FLOPs growth does not directly map to hardware revenue; inference demand separate |
All TAM/SAM/SOM estimates carry meaningful uncertainty. The Cerebras $251B TAM is a management estimate from the SEC S-1/A without a named third-party analyst source. SIA data for AI chips ($120B) is narrower but more verifiable. SAM and SOM figures are management-defined and should be treated as optimistic bounds. Diligence ask: obtain IDC or Gartner independent AI accelerator market sizing.
[CM001, CM005, CM006, CM012, CM021, CM034]Three-layer pyramid from Cerebras' total AI infrastructure TAM to management-defined SAM to actual 2025 revenue as SOM proxy.
TAM and SAM from Cerebras SEC S-1/A management estimates; not independently verified by a named analyst. SOM is actual reported revenue.
[CM001, CM034, CM006]Low/base/high estimates for key AI infrastructure market quantities in 2025 and 2029, reflecting the range across available sources.
All ranges are author-estimated bounds around published central figures. TAM 2025E base from Cerebras SEC S-1/A ($251B); AI chip market base from SIA ($120B); Cerebras SAM base from SEC S-1/A. Independent analyst sources (IDC, Gartner) were not available to confirm bounds.
[CM001, CM005, CM021, CM030, CM034]2.3 Buyer Segmentation and Demand Profile
Four buyer segments drive AI infrastructure demand. Hyperscalers and foundation labs (OpenAI, Microsoft, Google, Amazon, Meta) are the largest buyers by spend, accounting for an estimated 60-70% of AI chip purchases. Cerebras' OpenAI Master Research Agreement and AWS binding term sheet represent entry into this segment. Government and sovereign AI programs (UAE/MBZUAI/G42, Saudi Arabia, India, France) are characterized by geopolitical AI mandates and large budget allocations; Cerebras derived 86% of 2025 revenue from UAE entities. Academic and national research labs procure through government grant cycles at longer timelines. Enterprise buyers (pharma, biotech, financial services, healthcare) are the fastest-growing emerging segment for inference cloud adoption, with procurement via API rather than hardware. GSK's RNA model training on Cerebras exemplifies this pattern. Procurement cycles range from hours (API) to 18 months (large hardware cluster).[CM019, CM020, CM022, CM023, CM024, CM025]
| Segment | Representative Buyers | Budget Owner | Procurement Cycle | Value Driver | ACV Range | Cerebras Status |
|---|---|---|---|---|---|---|
| Hyperscaler / Foundation Lab | OpenAI, Microsoft Azure, AWS, Google, Meta, xAI | CTO / VP Engineering; multi-year infra planning | 6-18 months (hardware); hours (API) | Total cost of ownership, throughput, availability | $100M–$1B+ multi-year | Active: OpenAI MRA ($20B+ commitment), AWS term sheet; execution risk on delivery |
| Government / Sovereign AI | UAE (MBZUAI, G42), Saudi Arabia (NEOM/Transcendence), India (IndiaAI) | Ministry-level; AI holding company execs | 12-24 months; national strategy driven | Geopolitical independence, strategic capability | $50M–$500M per program | Dominant: 86% of 2025 revenue; MBZUAI 62%, G42 24%; high concentration risk |
| Academic / National Research Lab | Argonne National Lab, CERN, Stanford HAI, national AI institutes | Government grant bodies; university computing centers | 12-24 months; grant funding cycles | Performance per dollar; novel architecture access | $5M–$50M | Nascent: no large disclosed academic deployments; API accessible to researchers |
| Enterprise (Pharma / Finance / Healthcare) | GSK (RNA model training), biotech AI teams, quantitative finance, healthcare diagnostics | CISO / CTO / Head of AI | 3-12 months; API weeks | Inference speed, ROI, HIPAA/compliance fit | $0.5M–$20M (API+hardware) | Nascent: GSK deployment disclosed; inference cloud API enables self-service adoption |
Only four buyer segments are profiled; SMB, developer/hobbyist, and consumer AI segments are excluded as Cerebras does not currently address them at scale. ACV ranges are approximations from SEC S-1/A disclosed revenue figures and analyst estimates. Evidence gap: buyer segment coverage does not include SMB and developer communities who use the inference API.
[CM019, CM022, CM024, CM025, CM032]Qualitative matrix based on disclosed procurement patterns (OpenAI MRA, MBZUAI contracts, AWS term sheet, GSK deployment). ACV ranges from SEC S-1/A disclosures.
[CM015, CM019, CM022, CM024, CM025]2.4 Competitive Supply Dynamics
The AI accelerator supply side is dominated by NVIDIA at approximately 70-85% AI training market share by revenue. NVIDIA H100 GPU sells for $25,000-$40,000 per unit; Blackwell B200 delivers 5x H100 inference throughput and GB200 NVL72 racks cost approximately $3 million. AMD MI300X (192GB HBM3, 8 TB/s bandwidth) has gained traction at Microsoft Azure and Meta for memory-intensive inference; AMD AI revenue exceeded $5 billion in 2024. Google's TPUv5 is cloud-only (no hardware purchase), used internally for Gemini and Imagen. AWS Trainium2 is Amazon's custom AI chip, available via SageMaker HyperPod. Intel Gaudi 3 targets mid-tier workloads at ~30-40% H100 discount. Cerebras has no direct wafer-scale competitor; the WSE-3's 900,000 AI cores and 21PB/s on-chip SRAM bandwidth create an architecture differentiated from all GPU and HBM-based systems. TSMC 5nm underpins both WSE-3 and NVIDIA Hopper; capacity allocation is a shared constraint. US BIS export controls restrict H100/A100 equivalents to China and certain Middle East destinations, affecting both NVIDIA's TAM and Cerebras' UAE customer eligibility.[CM004, CM007, CM008, CM009, CM010, CM016]
2.5 Growth Drivers, Constraints, and Market Risks
Primary demand growth drivers: (1) AI model scaling — training compute 4x/year since 2020 requiring proportionally more hardware; (2) inference at scale — enterprise production deployments in early innings, with McKinsey estimating 72% AI adoption but far fewer in production inference at volume; (3) open-source model proliferation — Llama 3, Mistral, Qwen enable enterprise custom inference without proprietary API lock-in, driving demand for inference infrastructure; (4) sovereign AI mandates — UAE, Saudi Arabia, India allocating billions to domestic AI capability. Key constraints: TSMC advanced-node capacity shared across NVIDIA, AMD, Apple, Qualcomm; US BIS export controls limiting chip sales to certain countries; AI data center power constraints (IEA projects 1,000 TWh/yr by 2030); and hardware procurement capital intensity. Customer concentration risk is acute given MBZUAI's 62% and G42's 24% share of 2025 revenues; any slowdown in UAE sovereign AI spending would materially impact near-term revenue.[CM011, CM012, CM013, CM014, CM023, CM024]
| Factor | Direction | Timing | Implication for Cerebras |
|---|---|---|---|
| AI Model Scaling Laws | Driver (Accelerating) | Ongoing | Frontier training compute 4x/yr; directly scales AI chip TAM and demand for Cerebras CS-3 training clusters |
| Enterprise Inference Deployment Wave | Driver (Accelerating) | 2025-2028 peak | Production LLM deployment at enterprises expands inference market; Cerebras' primary SAM segment |
| Open-Source Model Proliferation (Llama 3, Mistral, Qwen) | Driver (Accelerating) | Now ongoing | Enterprises deploy custom open models; drives on-premise and cloud inference demand for Cerebras |
| Sovereign AI Mandates (UAE, Saudi, India) | Driver (Expanding) | Multi-year programs | Large captive buyers outside US hyperscaler ecosystem; already Cerebras' largest revenue source |
| Hyperscaler AI Capex Surge ($300B+ in 2026) | Driver (Accelerating) | 2025-2027 | Expands total chip demand; creates market opportunity for non-NVIDIA share |
| TSMC Advanced-Node Capacity Constraint | Constraint | 2025-2027 | 5nm/3nm capacity shared across NVIDIA, AMD, Apple, Qualcomm; limits Cerebras volume ramp |
| US BIS Export Controls (AI chips to China/UAE) | Risk/Constraint (Ongoing) | Escalation risk | Restricts TAM; UAE customer eligibility requires ongoing compliance monitoring |
| AI Data Center Power Constraints | Constraint | 2026-2030 | IEA projects 1,000 TWh/yr AI power demand by 2030; limits siting of new AI data centers |
| NVIDIA Blackwell Competitive Response | Risk (Narrowing gap) | 2026-2027 | B200 delivers 5x H100 inference throughput; may narrow Cerebras' speed advantage for standard GPU workloads |
Factor assessments are qualitative, based on SEC S-1/A disclosures, SIA data, and public vendor announcements. Timing is author estimate; forward-looking claims subject to technology execution and policy changes. The hyperscaler capex surge and open-source proliferation are the most certain near-term drivers; TSMC capacity constraints and NVIDIA Blackwell competitive response are the most significant medium-term risks to Cerebras' positioning.
[CM012, CM017, CM023, CM024, CM033, CM035]Enterprise AI adoption funnel from broad awareness to Cerebras-specific infrastructure procurement, indexed to 100 for global enterprise AI awareness.
Funnel values are indexed (100 = all global enterprises with AI awareness). Level 2 from McKinsey State of AI 2024 (72% adoption). Levels 3-5 are author estimates; no independent verification available for lower funnel stages.
[CM011, CM013, CM014, CM022, CM031]2.6 Exhibits
03Competitors
3.1 Competitive Landscape Overview
Cerebras Systems competes in the AI accelerator and inference-as-a-service market across five overlapping alternative categories: (1) incumbent GPU platforms led by NVIDIA and AMD; (2) custom internal silicon from hyperscalers—Google TPU and AWS Trainium—available externally via cloud but designed primarily for internal scale; (3) specialist inference-chip start-ups, most notably Groq (LPU architecture) and SambaNova (Reconfigurable Dataflow Unit); (4) early-stage RISC-V or chiplet approaches such as Tenstorrent; and (5) the status quo of multi-GPU clusters running open inference runtimes like vLLM and TensorRT-LLM. Enterprise buyers must also consider the internal build option—assembling a dedicated GPU cluster on-premises—which remains the dominant procurement pattern at hyperscale consumers. NVIDIA's GPU ecosystem is the reference standard against which every alternative is evaluated. AMD's MI300X has emerged as the first credible GPU challenger; Intel's Gaudi 3 targets the mid-tier with direct data-center sales leverage. Among purpose-built AI chips, Cerebras's wafer-scale memory-bandwidth architecture is architecturally differentiated from both GPU clusters and inference-focused LPU designs, but must overcome NVIDIA's overwhelming software ecosystem dominance and deep hyperscaler integration. New entrants such as Tenstorrent, backed by $693M in 2025 funding, are in early production but lack credible traction against Cerebras's vertically integrated hardware-plus-API stack as of the May 2026 IPO date.
3.2 Competitor Profiles
NVIDIA remains the category-defining competitor, generating approximately $47B in fiscal year 2025 data center revenue and holding an estimated 70–90% share in AI accelerator deployments. The H100 and H200 SXM5 are standard training and inference units for hyperscalers; the Blackwell B200/GB200 NVL72 rack system delivers 1.4 exaflops per rack and 8TB/s HBM3e per GPU chip, representing the next-generation platform that directly targets Cerebras's bandwidth narrative. NVIDIA's competitive advantage derives not only from hardware performance but from its CUDA ecosystem—an estimated 4M+ developers, decade-optimized libraries, and deep integration with every major cloud and framework. AMD's MI300X accelerator (192GB HBM3, 5.3TB/s) is the most commercially deployed alternative GPU, shipping at scale to Microsoft Azure and Meta since late 2023. AMD's ROCm software stack has improved meaningfully but still lags CUDA in operator coverage. Intel's Gaudi 3 targets cost-sensitive LLM training and inference, leveraging Intel's data-center sales relationships. Among hyperscaler silicon, Google TPU v5e/v5p are available on GCP with on-demand pricing but are tightly integrated with JAX/XLA frameworks, limiting portability. AWS Trainium2, accessible via SageMaker and EC2 Trn2, avoids model-format lock-in. SambaNova Cloud and Groq's LPU inference service represent the closest API-layer conceptual peers to Cerebras Cloud: both offer high-throughput LLM inference at competitive per-token pricing, but neither has disclosed meaningful independent enterprise revenue or verified benchmark results in 2026. Tenstorrent is in early production shipment targeting training workloads at smaller scale.
| Competitor | Category | Scale / Funding | Target Segment | Differentiation | Limitation vs Cerebras |
|---|---|---|---|---|---|
| NVIDIA H100/H200 | Incumbent GPU | $47B data center revenue FY2025 | All AI workloads | CUDA ecosystem, hyperscaler integration, brand | Much lower inference throughput at small batch sizes |
| NVIDIA Blackwell B200 | Next-gen GPU | Ramping production 2025–2026 | LLM training + inference | 8TB/s HBM3e, 1.4 exaflops per rack system | ~2600x lower bandwidth than WSE-3 on-chip SRAM |
| AMD MI300X | GPU challenger | $1B+ AI accelerator revenue (2024 est.) | LLM training, inference | 192GB HBM3, CUDA-compatible ROCm stack | ROCm lags CUDA; 5.3TB/s vs 21PB/s bandwidth |
| Intel Gaudi 3 | Data-center GPU | Intel internal capital | Cost-sensitive LLM training | PyTorch native, Intel sales relationships | Limited cloud availability; weak benchmark record |
| Google TPU v5e/v5p | Hyperscaler silicon | Google internal capital | Google Cloud customers | Low-latency XLA/JAX inference on GCP | No hardware ownership; JAX ecosystem lock-in |
| AWS Trainium2 | Hyperscaler silicon | AWS internal capital | AWS cloud workloads | EC2 Trn2, no model-format lock-in | AWS-only; no portability; unproven at LLM scale |
| SambaNova Cloud | Specialist inference | ~$1.1B total funding (est.) | Enterprise LLM inference | RDU claims low-latency inference | No disclosed revenue; no independent benchmarks |
| Groq LPU | Specialist inference | Private, undisclosed 2026 revenue | Developer inference APIs | High single-stream throughput (500+ T/s/chip claim) | Limited context window; limited model coverage |
Scale figures are estimates or company-disclosed totals. NVIDIA data center revenue from NVIDIA FY2025 earnings reports. SambaNova funding from Crunchbase secondary sources. Groq 2026 revenue is private and not publicly verified. Hyperscaler silicon funded internally with no external capital disclosures.
[CP001, CP002, CP003, CP004, CP005, CP006]3.3 Capability and Pricing Comparison
The fundamental capability axis distinguishing Cerebras from GPU-based competitors is memory bandwidth per unit of inference latency. Cerebras WSE-3 delivers 21PB/s on-chip SRAM bandwidth versus H100's 3.35TB/s HBM3—roughly 6000x higher at the chip level—translating to dramatically lower time-to-first-token at small batch sizes. Cerebras publicly claims approximately 1,500 tokens/sec on single-stream Llama-70B inference versus 60–90 tokens/sec on H100. This advantage narrows under heavily batched multi-user workloads where GPU clusters amortize memory latency across large batches. On software completeness, NVIDIA CUDA's ecosystem depth—optimized transformer kernels, FlashAttention, TensorRT-LLM, NIM microservices, and broad framework support—represents a durable distribution-power advantage. AMD ROCm supports PyTorch and most CUDA operators via HIPification; Intel Gaudi 3 supports PyTorch 2.x natively. Cerebras requires customers to adopt its proprietary CTML compilation pipeline, increasing porting friction for workloads already optimized for CUDA. On pricing, Cerebras's published cloud API rate for Llama-3 70B inference is $0.60/M input tokens, competitive with major GPU-cloud providers charging roughly $0.90/M for equivalent throughput. Hardware-purchase customers pay approximately $2–4M per CS-3 system. No publicly disclosed list pricing exists for Cerebras enterprise inference contracts beyond the API rate card. The absence of a published MLPerf submission from Cerebras limits independent verification of its throughput performance claims.
| Capability / Feature | Cerebras WSE-3 | NVIDIA H100 | NVIDIA B200 | AMD MI300X | Intel Gaudi 3 | Google TPU v5e | AWS Trainium2 |
|---|---|---|---|---|---|---|---|
| Inference Speed (Llama 70B, T/s) | >1,500 (company claim) | ~60–90 | ~110–130 (est.) | ~70–90 (est.) | ~80–100 (est.) | ~80–100 (est.) | ~60–80 (est.) |
| Memory Bandwidth | 21 PB/s on-chip SRAM | 3.35 TB/s HBM3 | 8 TB/s HBM3e | 5.3 TB/s HBM3 | ~3 TB/s (est.) | ~3 TB/s (est.) | ~3 TB/s (est.) |
| On-chip / HBM Memory | 44 GB SRAM | 80 GB HBM3 | 80 GB HBM3e | 192 GB HBM3 | 96 GB HBM3 (est.) | ~96 GB (est.) | ~96 GB (est.) |
| CUDA / Standard API Support | No (CTML only) | Full CUDA | Full CUDA | ROCm (partial) | No (XLA/JAX) | No (SageMaker) | No (Neuron SDK) |
| Public Cloud API Access | Yes (Cerebras Cloud) | Yes (all hyperscalers) | Yes (Azure, AWS est.) | Yes (Azure, AWS) | Limited (GCP beta) | Yes (GCP) | Yes (AWS) |
| Native Training Support | Yes (CS-3 on-premise) | Yes | Yes | Yes | Yes | Yes (TPU v5p) | Yes (Trn2) |
| Max Model Size (on-chip native) | ~60B params | Unlimited (HBM) | Unlimited (HBM) | Unlimited (HBM) | Unlimited (HBM) | Unlimited (HBM) | Unlimited (HBM) |
| MLCommons Benchmark | None | Yes (MLPerf v4.1) | Partial | Partial | None | None | None |
| Published Inference Price | $0.60/M tokens (Llama 70B) | ~$2–4/GPU-hr cloud | ~$3–5/GPU-hr (est.) | Not published | Not published | ~$1–2/TPU-hr | Trn2: varies |
| Enterprise SLA | SOC 2 (claimed) | Via cloud providers | Via cloud providers | Via cloud providers | Not published | GCP SLA | AWS SLA |
| Open-Source Ecosystem | Limited (CTML) | Extensive (CUDA/TRT) | Growing (ROCm) | Growing (Intel Ext.) | Limited (XLA) | Limited (JAX) | Limited (Neuron) |
| Hardware Purchase Option | Yes (CS-3 ~$2–4M est.) | Yes (via OEMs) | Yes (via OEMs) | Yes (via OEMs) | Yes | No | No |
Cells marked "est." are analyst estimates from published vendor specs and third-party benchmarks; no direct head-to-head published comparison of WSE-3 vs B200 exists as of May 2026. MLPerf participation per MLCommons results page. Pricing reflects published list rates; enterprise contract pricing is not public for most vendors.
[CP014, CP015, CP016, CP017, CP021, CP022]| Vendor | Model | Unit | List Price | Included Capabilities | Unknowns / Discounts |
|---|---|---|---|---|---|
| Cerebras | Cloud inference API | Per M input tokens | $0.60 (Llama-3 70B) | LLM inference, shared cluster, API access | Enterprise volume discounts undisclosed |
| Cerebras | CS-3 hardware | Per system | ~$2–4M (est.) | WSE-3 chip, CS-3 chassis, support | No public list price; contract pricing only |
| NVIDIA | H100 SXM5 cloud (GCP) | Per GPU-hour | ~$2.50–3.50/GPU-hr | GPU compute, CUDA access | Sustained-use and committed-use discounts |
| NVIDIA | H100 hardware (OEM, 8×) | Per server | ~$200K–300K (est.) | 8×H100 SXM5, NVLink, NVSwitch | Volume OEM pricing varies by integrator |
| AMD | MI300X cloud (Azure) | Per GPU-hour | ~$3–4/GPU-hr (est.) | GPU compute, ROCm stack | Azure pricing varies; not officially published |
| TPU v5e (GCP) | Per TPU-chip-hour | ~$1.20/chip-hr (est.) | TPU compute, XLA/JAX framework | Committed-use discounts available | |
| AWS | Trainium2 (Trn2 instances) | Per instance-hour | ~$2–6/hr (est.) | AWS cloud compute, Neuron SDK, SageMaker | Savings plans available; varies by instance |
GPU cloud pricing is approximate list pricing from public cloud consoles as of May 2026, not reflecting reserved or negotiated enterprise rates. Cerebras cloud API pricing is from the publicly published Cerebras pricing page. Hardware prices are analyst estimates based on reported deal values; no official list pricing from any vendor is published for systems.
[CP021, CP022]3.4 Switching Costs and Distribution Power
Cerebras's proprietary CTML compilation pipeline and flat on-chip SRAM memory model create material switching costs: workloads optimized for WSE-3's memory hierarchy must be restructured for GPU clusters, typically requiring 6–18 months for production-grade LLM pipelines. This is simultaneously a competitive moat and a barrier to initial adoption— enterprise evaluators must weigh the switching cost of adopting Cerebras against the future cost of leaving. NVIDIA's distribution power is unmatched. All three major hyperscalers (AWS, Azure, GCP) offer NVIDIA GPU instances as the default compute option; NVIDIA NIM microservices are pre-integrated in GCP Vertex AI and AWS SageMaker. AMD benefits from similar hyperscaler distribution via AMD-instance SKUs. Intel Gaudi 3 is sold primarily through Intel's direct data-center sales force with limited cloud availability. Cerebras has no hyperscaler spot-market integration as of May 2026; its AWS partnership (binding term sheet, March 2026) covers Cerebras-dedicated cloud instances only. Multi-homing between Cerebras and GPU infrastructure is technically possible but operationally expensive due to compiler differences. Customers embedding Cerebras APIs in production inference pipelines face limited practical multi-homing, as CTML-optimized models cannot be trivially re-deployed to NVIDIA TRT or vLLM without re-optimization. SambaNova and Groq users face similar, if slightly lower, switching costs due to their API-first interfaces built over standard model formats. TSMC 5nm wafer-scale manufacturing is a supply-side barrier that new entrants cannot quickly replicate; comparable TSMC commitments plus defect-management know-how require 5+ years of development.
3.5 Moat Durability and Displacement Risk
Cerebras's primary durable moat claims are: (1) architectural bandwidth from wafer-scale SRAM; (2) TSMC 5nm wafer-scale manufacturing expertise that no competitor currently replicates; and (3) a software/API layer abstracting the hardware advantage into a developer-accessible product. These claims are credible but carry two material displacement risks. First, NVIDIA's Blackwell B200 SXM5 delivers 8TB/s HBM3e bandwidth—more than 2x the H100—significantly narrowing the bandwidth gap that Cerebras exploited during the H100 era. While WSE-3 still leads in absolute bandwidth, the relative gap is compressing. NVIDIA simultaneously launched NIM inference microservices in 2024, competing directly with Cerebras's inference API commercial model, creating a two-sided squeeze on hardware performance and API-layer positioning. Second, Cerebras WSE-3's 44GB on-chip SRAM limits native model capacity to roughly 60B parameters; larger frontier models require multi-WSE-3 parallelism, reducing per-token efficiency. As frontier model sizes scale—Llama 4 Maverick is approximately 400B parameters— the core value proposition of running the entire model in on-chip SRAM weakens unless WSE-4 delivers substantially larger memory. Cerebras has not submitted WSE-3 results to MLCommons MLPerf inference benchmarks, leaving its throughput claims without independent third-party validation—an adverse signal for enterprise procurement teams requiring benchmark evidence. Customer concentration at MBZUAI (62% of 2025 revenue) is an additional durability risk.
| Moat Claim | Threat | Severity | Mitigation / Diligence Ask |
|---|---|---|---|
| 21 PB/s SRAM bandwidth exceeds GPU competition | NVIDIA B200 delivers 8TB/s HBM3e, compressing the gap; B300 expected 2027 | High | Verify WSE-4 roadmap bandwidth; confirm gap at production B200 batch sizes |
| TSMC 5nm wafer-scale manufacturing is unique and non-replicable near-term | New entrants could negotiate similar TSMC wafer allocations with sufficient funding | Medium | Confirm TSMC exclusivity terms or commitment contract in S-1/A exhibits |
| CTML compiler creates switching costs for adopters | CTML complexity deters initial adoption; ecosystem immaturity limits scale | Medium | Validate CTML PyTorch coverage completeness; assess user-reported porting friction |
| Inference API creates multi-layer monetization above hardware | NVIDIA NIM inference microservices compete directly with same API-layer model | High | Assess Cerebras API differentiation beyond throughput: model library, SLA, pricing stability |
| OpenAI MRA ($20B+, 750MW) validates large-scale inference demand | OpenAI is also a NVIDIA customer; MRA contingent on financing and capacity build-out | High | Verify draw schedule, financing contingencies, and exclusivity terms in SEC filing |
| Native model size limit (~60B params) constrains large-model workloads | Frontier models (Llama 4 ~400B) require multi-WSE-3 parallelism, reducing efficiency | Medium | Assess multi-WSE-3 performance at large model sizes; WSE-4 memory roadmap |
| No MLPerf submission reduces independent credibility | Enterprise buyers increasingly require MLCommons validation for procurement decisions | Medium | Confirm when Cerebras plans MLPerf submission; monitor MLCommons results page |
| Customer concentration (MBZUAI 62% of 2025 revenue) is existential risk | Loss or renegotiation of MBZUAI contract would materially impair financials | High | Confirm multi-year contract terms and auto-renewal provisions in SEC filing |
Severity ratings are qualitative assessments. High severity indicates potential material impact within 12–24 months if the threat materializes. Moat claims drawn from SEC filings and independent technical analyses; all mitigations require independent verification.
[CP031, CP032, CP033, CP034, CP035, CP036]3.6 Exhibits
04Financials
4.1 Revenue Streams and Pricing Model
Cerebras has three disclosed revenue streams: hardware system sales (CS-3 compute systems), cloud inference API (Cerebras Cloud), and professional services/support. Hardware sales account for the vast majority of historical revenue; the inference API is newer but growing. Revenue recognized for fiscal years 2022–2025 was $24.6M, $78.7M, $290.3M, and $510.0M respectively, representing a 220% CAGR over three years. The 2024–2025 growth rate was 76% YoY, which is exceptional for a hardware company with nine-digit revenue. This trajectory is substantially driven by two customers: MBZUAI (62% of 2025 revenue) and G42 (24% of 2025 revenue, down from 85% in 2024). Customer concentration is the primary revenue quality concern. Hardware pricing: CS-3 compute systems are sold at approximately $2–4M per system based on reported deal values; no official list price is published. Cloud inference API pricing is published at $0.60 per million input tokens for Llama-3 70B. Inference API pricing is fully managed with no hardware reservation requirement, creating a lower-friction entry for customers but likely lower gross margin than hardware sales. Revenue recognition for hardware likely follows point-in-time recognition upon delivery and acceptance; inference API revenue is recognized ratably. No segment-level revenue breakdown is publicly available.
| Stream | Mechanism | Unit | Current Value / Status | Quality Assessment | Diligence Ask |
|---|---|---|---|---|---|
| CS-3 Hardware Sales | Point-in-time revenue on delivery/acceptance of system | Per system | ~$2–4M/system est.; majority of 2025 revenue | Low predictability; large lumpy orders from few customers | Confirm backlog and order book for H2 2026; contract terms for MBZUAI hardware |
| Cerebras Cloud Inference API | Ratably recognized per-token API usage | Per M tokens | $0.60/M input tokens (Llama-3 70B); growing share of mix | Higher recurrence potential; margins not disclosed by segment | Disclose inference API revenue as % of total and gross margin |
| Professional Services / Support | Time-based or milestone-based; best estimate < 10% of revenue | Varies | Not separately disclosed; small relative to hardware | Low strategic importance; support margin unknown | Segment disclosure: confirm support revenue and margin in next filing |
| OpenAI MRA (forward commitment) | $20B+ minimum revenue agreement signed December 2025; 750MW capacity | Per capacity commitment | Effective 2026; draw schedule undisclosed | Transformational if unconditional; opaque conditionality risk | Obtain MRA term sheet or summary: draw schedule, conditions, termination rights |
Revenue stream mix is estimated based on disclosed total revenue and inference API pricing. CS-3 system pricing is inferred from reported deal values; no official list price is published. OpenAI MRA is a forward commitment; revenue has not yet been recognized. "Quality Assessment" reflects predictability and recurrence, not margin.
[CI001, CI005, CI013, CI017, CI023]| Offering | Price / Unit / Contract | List vs Realized | Discounts / Unknowns | Source |
|---|---|---|---|---|
| Cerebras Cloud Inference API (Llama-3 70B) | $0.60/M input tokens, $0.60/M output tokens | List price from published pricing page | Volume discounts likely; enterprise contracts not disclosed | Official: cerebras.ai/pricing |
| CS-3 Hardware System (on-premise) | ~$2–4M per system (est.) | Estimated from news-reported deal values; no official list | Enterprise contract pricing; significant negotiation likely | Estimated from news reports and SEC filing context |
| Cerebras Cloud Reserved Capacity (enterprise) | Not publicly disclosed | N/A — private contract | Unknown; MBZUAI and G42 terms confidential | SEC S-1/A risk factor disclosures only |
| OpenAI MRA Capacity Commitment | $20B+ over multiple years for 750MW | Forward commitment; per-capacity-unit pricing unknown | Conditionality and drawdown terms undisclosed | SEC S-1/A; OpenAI + BusinessWire press release |
| Professional Services / Support | Not separately disclosed | Bundled or separate; unknown | Unknown | SEC S-1/A aggregate revenue disclosure only |
List pricing is from Cerebras's publicly published pricing page. All hardware pricing is analyst estimates from news coverage. Enterprise contract terms for all named customers (MBZUAI, G42, OpenAI) are confidential. Realized pricing versus list pricing is unknown for hardware; inference API may have volume discounts not in the published rate card.
[CI013, CI017, CI021, CI022]4.2 GTM Motion and Sales Efficiency
Cerebras's go-to-market is primarily direct sales to large-scale AI research institutions, government-linked sovereign AI programs, and hyperscalers. The disclosed customer base as of the May 2026 IPO is small and highly concentrated: MBZUAI, G42, GSK, and OpenAI (via the $20B+ MRA announced December 2025). There is no disclosed reseller channel, OEM distribution partner, or cloud marketplace listing as of the IPO date, though the AWS binding term sheet (March 2026) would add a distribution channel when executed. Sales cycle for hardware is likely 6–18 months given enterprise procurement requirements for $2–4M capital decisions. Customer acquisition cost (CAC), average contract value (ACV), and net revenue retention (NRR) are all undisclosed private metrics. The Saudi Aramco CFIUS review, which delayed the 2024 IPO filing, was a material governance consideration that appears to have been resolved prior to the 2026 listing. OpenAI's $20B+ MRA represents a foundational anchor for the inference-as-a-service GTM channel; its drawdown conditionality and timeline are not disclosed but represent the largest single GTM execution risk.
4.3 Cost Structure and Gross Margin
Cerebras's cost of goods sold (COGS) is primarily wafer fabrication cost at TSMC for the WSE-3 chip. The 46,225mm² die on a full 5nm wafer is among the highest-cost semiconductor processes commercially available; no unit cost data has been disclosed. Additional COGS components include CS-3 chassis assembly, interconnect hardware, and integration labor. Cloud inference API COGS includes data center compute and networking to host the shared WSE-3 clusters. Gross margin was approximately 42% in 2024 and 39% in 2025. The 3-point compression from 2024 to 2025 is likely a mix of revenue scale (more volume, more TSMC wafers), product mix, and infrastructure costs for the growing inference API business. At 39–42% hardware gross margin, Cerebras compares unfavorably to NVIDIA's ~75% data center gross margin, but this reflects the difference between a fabless IP licensor (NVIDIA) and a full-stack hardware system vendor (Cerebras). More relevant comparables are hyperscaler-committed AI hardware vendors, where 35–45% is a reasonable range. The non-GAAP net loss of -$75.7M in 2025 versus GAAP net income of $237.8M implies approximately $313M in non-cash charges—likely a combination of stock-based compensation, warrant fair value adjustments, and potentially other non-recurring items. Operating leverage and path to sustainable non-GAAP profitability are the critical near-term questions.
| Metric | Value / Null | Confidence | Why It Matters | Diligence Ask |
|---|---|---|---|---|
| Blended Gross Margin (FY2025) | ~39% | High — SEC filing | Baseline unit economics; must expand for sustainable profitability | Segment split: hardware margin vs inference API margin |
| Hardware Gross Margin (CS-3 systems) | Not disclosed; inferred ~38–44% from blended | Low — no segment disclosure | COGS structure dominated by TSMC wafer costs | Request hardware vs services vs inference segment COGS |
| CAC (Customer Acquisition Cost) | Not disclosed | N/A — private | Critical for GTM efficiency assessment; unknown for hardware sales | Obtain estimated CAC from management; benchmark vs AI hardware comps |
| ACV (Average Contract Value) | Not disclosed; est. $50M–$500M for major hardware deals | Low — no contract-level data | Determines sales cycle efficiency and revenue per rep | Obtain ACV range by customer segment |
| NRR (Net Revenue Retention) | Not disclosed; MBZUAI AR concentration implies ongoing relationship | Low — inferred from AR data only | Indicates upsell and churn dynamics; critical for growth quality | Request NRR or renewal rate by cohort |
| Gross Margin per CS-3 Unit | ~$0.84M–$1.68M est. (at 42% on $2–4M system) | Low — derived estimate | Determines hardware profitability and TSMC cost leverage | Confirm per-system COGS and manufacturing yield |
| Inference API Gross Margin | Not disclosed; likely higher than hardware | Low — no segment disclosure | Cloud-like margins improve blended economics long-term | Request inference API contribution margin in next earnings |
| Working Capital per Unit Sold | Not disclosed; hardware requires TSMC advance payments | Low — no working capital detail | High advance COGS exposure creates cash flow risk | Request advance payment terms with TSMC and lead times |
All "not disclosed" entries represent private metrics that Cerebras has not reported in SEC filings or investor presentations. Estimates use the disclosed blended gross margin and reported system pricing from news sources. NRR is not a standard metric for hardware companies; recurring inference API business is more SaaS-like but no segment disclosure exists.
[CI006, CI016, CI019, CI020, CI032, CI033]4.4 Capital Adequacy and Financing
Cerebras entered the May 2026 IPO with approximately $1.1B raised in September 2025 (Series G at $8.7B valuation) and an additional $5.55B raised in the IPO at $185/share. Combined with the $1B OpenAI Working Capital Loan (6%, matures December 31, 2032), post-IPO liquidity is substantial. The company's capitalization is augmented by two warrant packages: OpenAI holds 33.4 million warrants at $0.00001 per share, and AWS holds approximately 2.7 million warrants at $100 per share. These represent dilutive obligations tied to commercial partnerships. Capital intensity stems from the need to pre-purchase TSMC wafer runs and manage CS-3 inventory ahead of revenue recognition. Working capital requirements are high relative to asset-light software companies; accounts receivable from MBZUAI represented 77.9% of total AR at December 31, 2025, creating a concentration-dependent collections risk. Planned use of IPO proceeds is not publicly disclosed in detail but likely includes TSMC capacity commitments, CS-3 inventory build for OpenAI and AWS partnerships, and general working capital. The company has no disclosed covenant violations or liquidity concerns, and post-IPO cash position is strong enough to fund operations for several years at current burn rates.
| Item | Amount / Terms | Maturity / Timeline | Assessment | Diligence Ask |
|---|---|---|---|---|
| IPO Gross Proceeds | ~$5.55B at $185/share | N/A — equity; permanent capital | Strong; sufficient for multi-year operations and TSMC commitments | Confirm use of proceeds breakdown in IPO prospectus |
| Series G Round | $1.1B at $8.7B valuation | Sept 2025; equity | Pre-IPO cash plus IPO proceeds provides substantial liquidity | Confirm preference stack and liquidation preferences vs IPO shares |
| OpenAI Working Capital Loan | $1B at 6% fixed rate | Matures December 31, 2032 | 6-year term at fixed rate; manageable at current revenue scale | Confirm covenant terms: financial maintenance covenants, prepayment, default triggers |
| OpenAI Warrants | 33.4M warrants at $0.00001/share strike | Post-IPO; exercisable per SEC filing | Significant dilution potential if exercised; effectively $0 cost to OpenAI | Confirm exercise schedule, vesting, and trading lockup |
| AWS Warrants | ~2.7M warrants at $100/share strike | Per binding term sheet March 2026 | Smaller dilution; at $100 strike, above-money at ~$311 close price | Confirm warrant grant conditionality: revenue thresholds vs unconditional |
| Estimated Monthly Operating Burn (non-GAAP) | ~$6M–$10M/month est. (based on non-GAAP loss ÷ 12) | Ongoing | At $6.55B+ cash + proceeds, runway is 50+ years at current burn | Confirm actual monthly cash burn including TSMC advance payments and working capital |
| Planned TSMC Wafer Commitments (2026–2028) | Not disclosed; estimated $500M–$1.5B for OpenAI/AWS scale-up | 2026–2028 | Large capital commitment to enable MRA and AWS deployments | Request multi-year TSMC commitment schedule and advance payment terms |
| Accounts Receivable — MBZUAI Concentration | 77.9% of total AR at Dec 31, 2025 | Collection timing unknown | Credit concentration risk; UAE government entity | Confirm receivables aging, collection history, and any disputed amounts |
Post-IPO cash position is estimated as pre-IPO cash plus IPO gross proceeds less fees. Monthly burn is derived from non-GAAP net loss; actual cash burn may differ significantly due to working capital movements. TSMC commitment estimates are analyst estimates based on inferred manufacturing scale for OpenAI MRA; no official commitment amount is disclosed.
[CI009, CI010, CI011, CI012, CI018, CI028]4.5 Public Financial Gaps and Verdict
The primary financial diligence blockers are: (1) the OpenAI $20B+ MRA draw schedule and conditionality—whether minimum revenue commitments are unconditional or milestone-dependent is critical to 2026–2028 revenue modeling; (2) unit economics—no CAC, ACV, NRR, or gross margin by business line is disclosed; (3) WSE-3 wafer yield and production cost—COGS modeling is impossible without per-wafer economics; and (4) working capital adequacy for large-scale capacity commitments. The financial verdict on Cerebras at the May 2026 IPO is mixed. Revenue quality is structurally weak due to extreme customer concentration (MBZUAI + G42 = ~86% of 2025 revenue from two entities) and a large unverified forward commitment (OpenAI MRA). Revenue growth is impressive but comparisons to historical benchmarks must account for the unusual structure. Gross margin at 39–42% is consistent with a high-quality hardware company but well below software comparable multiples. GAAP profitability is notable but driven by non-recurring items; underlying economics are loss-making on a non-GAAP basis. Capital adequacy post-IPO is strong. The primary financing risk is not near-term liquidity but rather the execution risk of the OpenAI MRA and AWS partnership—if either stalls, the implied 2026–2027 revenue guidance embedded in valuation multiples would require downward revision. Geopolitical risk (G42's UAE ties, MBZUAI's Abu Dhabi government ownership) is a persistent overhang that could affect U.S. regulatory treatment of Cerebras's largest commercial relationships.
| Missing Metric | Impact on Underwriting | Diligence Path |
|---|---|---|
| OpenAI MRA conditionality and draw schedule | Cannot model 2026–2028 revenue without knowing if $20B+ is unconditional; could be $500M/year or $3B/year | Request MRA term summary from management; review SEC filing exhibits when available |
| Revenue by business line (hardware vs API vs services) | Cannot assess revenue quality, recurrence, or margin by segment | Request segment disclosure; monitor earnings call commentary |
| Gross margin by business line | Hardware and API have fundamentally different margin structures; blended 39% insufficient for model building | Push for segment COGS in next SEC filing or investor day |
| TSMC wafer advance commitments and payment schedule | Working capital and capital allocation heavily dependent on TSMC exposure | Review TSMC contract disclosures in SEC filing risk factors; request payment schedule |
| OpenAI MRA revenue recognition trigger | Whether revenue is recognized on capacity deployment or on OpenAI consumption affects timing of reported revenue | Review S-1/A revenue recognition policy footnote; request CFO confirmation |
| CAC, ACV, and NRR by customer segment | Sales efficiency cannot be assessed without these metrics; revenue quality assessment is incomplete | Request management KPI packet; compare to AI hardware comps at IPO |
This table documents financial information that is material to underwriting Cerebras but is not publicly available as of the May 2026 IPO. All items represent specific diligence requests that should be directed to Cerebras investor relations or management. Information may be disclosed in subsequent 10-Q or 10-K filings.
[CI033, CI034, CI035, CI036]4.6 Exhibits
05Product & Technology
5.1 WSE-3 Wafer-Scale Engine Architecture
The Cerebras WSE-3 (Wafer Scale Engine 3) is the core compute substrate for all Cerebras products. Unlike conventional semiconductor dies that measure 200 to 900 mm2, the WSE-3 occupies an entire 300 mm TSMC 5nm wafer, producing a single die with 46,225 mm2 of silicon, which is 58 times larger than the NVIDIA B200 GPU at approximately 814 mm2. This extreme die area enables integration of 4 trillion transistors (19 times the B200), 900,000 AI processing cores, and 44 GB of on-chip SRAM on a single device. The on-chip SRAM delivers 21 petabytes per second (21,000 TB/s) of internal memory bandwidth, approximately 2,625 times the effective bandwidth of the NVIDIA B200 NVL72 cluster at approximately 8 TB/s. This bandwidth advantage is architecturally decisive because large language model inference is fundamentally memory-bandwidth-bound: model weights must traverse the memory interface on every forward pass. On GPU platforms, this creates a bottleneck at the High Bandwidth Memory interface. The WSE-3 eliminates this bottleneck by storing all model weights for models up to 44 billion parameters entirely on-chip, enabling deterministically fast token generation with zero off-chip memory traffic. Three chip generations have followed a consistent TSMC process cadence: WSE-1 (16nm, 1.2T transistors, 2019), WSE-2 (7nm, 2.6T transistors, 2021), and WSE-3 (5nm, 4T transistors, 2023). Each generation delivered approximately 2x transistor density improvement consistent with TSMC node shrink targets and Cerebras biennial release cadence.[CE001, CE002, CE003, CE004, CE005, CE006]
| Module / Asset | Primary User | Status / Maturity | Key Differentiation | Diligence Gap |
|---|---|---|---|---|
| WSE-3 Chip (46,225 mm2, TSMC 5nm) | AI labs, hyperscalers, sovereign AI programs | Production since 2023 | Largest commercially shipped AI die; 44 GB on-chip SRAM; 21 PB/s bandwidth | Production yield rates and per-wafer unit cost not disclosed |
| CS-3 Compute System | Data center operators, cloud providers, research institutions | Production; approximately $2 to $4M per unit estimated | Standard 2U rack form factor with complete WSE-3 integration including power, cooling, and networking | MTBF, field failure rates, and system-level SLA not publicly disclosed |
| Cerebras Inference Cloud API | AI developers, enterprises, hyperscaler integrations | GA since 2024; OpenAI Chat Completions compatible endpoint | 2,100+ tok/s for Llama 8B; $0.60/M input tokens; no hardware reservation required | SOC 2 and ISO 27001 certifications not publicly disclosed; uptime SLA not published |
| Cerebras Compiler (PyTorch to WSE-3) | ML engineers, AI researchers, model developers | Production; bundled with CS-3 and cloud API | PyTorch-native; no custom DSL required; auto-maps model graph to 900K AI cores on WSE-3 | Coverage for models above 44B parameters in disaggregated mode requires additional engineering effort |
| Cerebras-GPT Open-Weight Models | AI researchers, open-source developers | Open-source on GitHub and HuggingFace; released April 2023 | Chinchilla-optimal scaling; arXiv 2304.03208; 111M to 13B parameter range | No enterprise support contract for open-source models; model family not updated since 2023 |
Maturity levels derive from S-1/A production deployment disclosures and public API availability. Per-unit hardware cost is estimated from reported deal values, not a disclosed list price.
[CE001, CE003, CE004, CE010, CE012, CE013]Seven-layer view from TSMC fabrication at the base to customer applications at the top, spanning the full Cerebras hardware-software stack.
[CE001, CE003, CE004, CE005, CE006, CE013]5.2 CS-3 System Design and Deployment Architecture
The Cerebras CS-3 compute system integrates the WSE-3 chip into a standard two-rack-unit datacenter chassis. The CS-3 is the complete hardware product, encompassing the WSE-3 chip, power delivery, cooling infrastructure, and networking interconnects in a single deliverable unit priced at approximately two to four million dollars per system based on disclosed deal values. No official list price is published. The WSE-3 die dissipates substantially more thermal energy than conventional GPU boards and requires liquid cooling or high-airflow forced-air environments. Customers deploying CS-3 systems must configure data centers accordingly, adding operational overhead relative to standard GPU deployments. For workloads exceeding a single WSE-3 memory capacity (models above approximately 44 billion parameters), Cerebras provides the Cluster Manager, which orchestrates multiple CS-3 systems in a scale-out configuration using standard datacenter networking. The Cluster Manager enables disaggregated inference, distributing model layers across multiple WSE-3 nodes. This is the technical basis for the AWS integration announced via binding term sheet in March 2026, combining CS-3 nodes with AWS Trainium for hybrid disaggregated inference. Critical third-party dependencies include TSMC as sole wafer fabrication supplier and the packaging and chassis supply chain. The S-1/A prominently discloses the single-supplier concentration risk from TSMC dependence as a material risk to hardware revenue continuity.[CE020, CE021, CE023, CE025]
| Layer or Component | Role | Key Dependency | Risk |
|---|---|---|---|
| TSMC 5nm (N5) Foundry | Manufactures WSE-3 full-wafer die; sole production source | TSMC capacity allocation for full-wafer N5 product; no disclosed backup foundry | Single-source geopolitical risk from Taiwan concentration; yield uncertainty; foundry capacity at N5 |
| WSE-3 Die (46,225 mm2) | Core compute substrate: 900K AI cores plus 44 GB on-chip SRAM | TSMC N5 process; custom die packaging and thermal management | No die-level redundancy; chip defects affect entire silicon area; no alternative chip design exists |
| Cerebras Compiler | Translates PyTorch model graphs to WSE-3 execution plans automatically | PyTorch framework ecosystem maintained by Meta as open-source | Model compatibility gaps for novel or non-standard architectures; compiler version lock-in for deployed models |
| Cerebras Cluster Manager | Multi-CS-3 scale-out orchestration for models exceeding 44B parameters | Network fabric using InfiniBand or Ethernet; customer data center networking infrastructure | Disaggregated inference adds cross-node latency and operational complexity; AWS integration not yet deployed |
| Cerebras Inference Cloud API | Managed hosted API layer for LLM inference at scale | AWS infrastructure per binding term sheet signed March 2026; not yet deployed | Single cloud-infrastructure dependency; AWS integration incomplete as of IPO date; SLA not published |
| Data Center Cooling Infrastructure | Physical environment for CS-3 thermal management | Customer data center with liquid cooling or high-airflow forced-air capability | Incompatible with standard air-cooled racks; increases deployment cost and site qualification overhead |
Architecture derived from S-1/A technical disclosures, Cerebras documentation portal, and AnandTech WSE-3 technical analysis. AWS integration status based on binding term sheet disclosure in S-1/A.
[CE006, CE020, CE021, CE023, CE025, CE035]DAG of key external dependencies showing how TSMC fabrication, OpenAI financing, AWS distribution, and BIS export controls create interlocking supply, financial, and distribution risks.
Revenue percentages from FY2025 S-1/A disclosures. Dependency relationships inferred from S-1/A risk factors and disclosed partner agreements.
[CE023, CE024, CE025]5.3 Software Stack and Developer Ecosystem
Cerebras software stack transforms the WSE-3 hardware advantage into accessible developer productivity. The Cerebras Compiler accepts standard PyTorch model definitions and automatically compiles them to optimized WSE-3 execution graphs, requiring no custom domain-specific language, model redesign, or manual kernel optimization. For inference workloads, Cerebras offers the Inference Cloud API, which is fully compatible with the OpenAI Chat Completions API specification. Any application integrated with the OpenAI API can route requests to Cerebras by changing only the base URL and API key. The API supports Llama family models, Codex-Spark from OpenAI, GPT-OSS-120B, and GLM 4.7. Published inference pricing is 0.60 dollars per million input tokens for Llama-3 70B. The developer ecosystem is supported through three open channels: the Cerebras GitHub organization hosts a Model Zoo with reference implementations for GPT, BERT, and diffusion architectures; the HuggingFace organization distributes the Cerebras-GPT open-weight model family spanning 111M to 13B parameters, released in April 2023; and the Chinchilla scaling paper (arXiv 2304.03208) validates hardware training efficiency. The Cerebras SDK, documentation portal (docs.cerebras.ai), and cloud console (cloud.cerebras.ai) complete the developer-facing surface, enabling end-to-end deployment from model import to production inference without leaving the Cerebras platform.[CE012, CE013, CE018, CE019, CE027, CE028]
| User Job | Current or Prior Workflow | Cerebras Solution | Measurable Benefit | Limitation |
|---|---|---|---|---|
| LLM inference for production applications (models up to 44B params) | GPU cluster with NVIDIA H100 or B200, batched inference, HBM-bandwidth-limited | Cerebras Inference API (WSE-3 on-chip SRAM eliminates HBM memory bottleneck) | 2,100+ tok/s for Llama 8B; approximately 15x faster than GPU; $0.60/M tokens | Models exceeding 44B parameters require disaggregated multi-CS-3 inference; architecture still maturing |
| Large-scale AI model training (models up to 44B params) | Multi-GPU cluster with 100+ H100s, custom CUDA kernels, limited HBM bandwidth | CS-3 system with Cerebras Compiler and Cluster Manager for scale-out | Single WSE-3 replaces cluster for models up to 44B parameters; no custom kernel writing required | Training throughput advantage narrows for larger models; CUDA ecosystem has broader checkpoint availability |
| Scientific computing and drug discovery (neural network workloads) | CPU or GPU clusters, multi-week experiment cycles, limited dataset scale | CS-3 for neural network acceleration (GSK RNA model deployment) | GSK: 10x training speedup plus 120x larger dataset; faster drug discovery experiment cycles | Limited to neural-network acceleration; no support for general HPC or molecular dynamics workloads |
| Sovereign AI computing for government and university programs | US or EU public cloud or proprietary GPU cluster infrastructure | CS-3 on-premises deployment (MBZUAI and G42 sovereign AI programs) | Data sovereignty compliance; hardware under customer physical control; represents 86% of FY2025 revenue | BIS/EAR export licenses required for UAE customers; license delays or revocations interrupt service |
Use cases derived from S-1/A customer disclosures for GSK, MBZUAI, and OpenAI, plus product page descriptions. Measurable benefits are company-claimed or third-party-reported; independent benchmark confirmation is partial.
[CE010, CE011, CE012, CE014, CE015, CE017]End-to-end flow from customer API call through compiler dispatch to WSE-3 token generation and streamed response delivery.
[CE012, CE013, CE028]5.4 Inference Performance and Competitive Benchmarks
Cerebras claims 2,100+ tokens per second for Llama 3 8B on a single WSE-3, approximately 15 times faster than a comparable NVIDIA H100 or B200 GPU. This claim is analytically consistent with the hardware architecture: LLM inference throughput is directly proportional to available memory bandwidth for memory-bound workloads, and the WSE-3 21 PB/s internal bandwidth vastly exceeds the HBM3e bandwidth of the NVIDIA B200 NVL72 cluster at approximately 8 TB/s. For models fitting entirely within 44 GB of on-chip SRAM, the WSE-3 delivers near-peak bandwidth utilization with zero off-chip memory traffic. Production evidence supports the performance claim. OpenAI Codex-Spark runs on Cerebras inference infrastructure, providing hyperscaler-grade production validation. GSK reported 10x training speedup with a 120x larger RNA drug discovery dataset. MBZUAI operates CS-3 clusters for UAE sovereign AI computing at 62 percent of FY2025 revenue. The competitive landscape includes NVIDIA B200 NVL72 (approximately 8 TB/s HBM bandwidth, estimated 140 tok/s for 8B LLM), AMD MI300X (5.3 TB/s, 192 GB HBM3), and Google TPU v5/v6 (optimized for distributed training). For models up to 44B parameters in latency-sensitive single-user inference, the WSE-3 architecture is demonstrably differentiated. For models exceeding 44B parameters requiring disaggregated inference, the advantage narrows as distributed-compute complexity increases.[CE009, CE010, CE011, CE014, CE017, CE029]
Maturity ratings across five capability dimensions for four Cerebras product surfaces, showing hardware performance strengths against software and ecosystem development gaps.
Maturity labels are analyst judgments based on S-1/A production deployment evidence, public API documentation, and competitive analysis as of May 2026.
[CE010, CE011, CE012, CE013, CE021, CE025]5.5 Competitive Differentiation and Intellectual Property
Cerebras primary moat is architectural and manufacturing: the company holds the only commercially shipped wafer-scale AI processor at production volume as of May 2026. Replicating the WSE-3 would require mastery of full-wafer silicon interposer and custom packaging technology, a compiler toolchain with years of silicon-software co-development, design expertise to verify a 4-trillion-transistor chip with no inter-die redundancy, and TSMC foundry relationships with allocated capacity for full-wafer N5, among the most complex manufacturing arrangements TSMC supports commercially. The S-1/A references patents, trade secrets, and manufacturing know-how without enumerating specific patent numbers. The structural secrecy around the WSE architecture functions as a trade-secret barrier: the internal mesh fabric design, compiler optimization passes, and silicon floorplan are not publicly disclosed. Cerebras-GPT and the arXiv 2304.03208 paper demonstrate hardware performance consistent with theoretical bandwidth predictions. The NVIDIA CUDA ecosystem remains the dominant developer platform with the largest pre-trained checkpoint library and partner support. Cerebras OpenAI-API-compatible inference interface reduces switching cost for inference workloads but does not displace CUDA for training workflows requiring custom kernels. The key competitive risk is that NVIDIA improves HBM bandwidth generation-over-generation and may close the per-token latency gap for small models, or hyperscaler disaggregated inference architectures erode the single-chip advantage for larger models. Cerebras FY2025 gross margin of 39 percent indicates the current competitive position sustains pricing power at scale.[CE015, CE016, CE026]
5.6 Trust, Compliance, and Product Roadmap
Cerebras primary compliance obligation is export control under Bureau of Industry and Security Export Administration Regulations. Shipments of advanced AI semiconductor systems to UAE-based customers MBZUAI (62 percent of FY2025 revenue) and G42 (24 percent of FY2025 revenue) require BIS export licenses. Dependence of over 86 percent of revenue on UAE customers subject to BIS/EAR export controls is a material operational risk disclosed prominently in the S-1/A. Export license delays, revocations, or policy restrictions could block the majority of hardware revenue with limited near-term diversification alternatives. Quality and reliability evidence is primarily production-validated: OpenAI Codex-Spark and GSK RNA drug discovery workloads confirm enterprise quality requirements are met. However, Cerebras has not publicly disclosed SOC 2 Type II or ISO 27001 cloud security certifications for the Inference Cloud API as of May 2026, a diligence gap for enterprise customers evaluating cloud security posture. Hardware quality is governed by TSMC standard semiconductor manufacturing quality controls, but system-level MTBF statistics for the CS-3 have not been published. The product roadmap as of May 2026 does not include an announced WSE-4 successor chip. Cerebras generation cadence has been approximately biennial: WSE-1 (2019), WSE-2 (2021), WSE-3 (2023). The next significant product milestone is the AWS disaggregated inference integration under the March 2026 binding term sheet, with deployment timeline not publicly disclosed.[CE024, CE026, CE034]
| Control or Certification | Status | Scope | Gap or Risk |
|---|---|---|---|
| Export Control Compliance (BIS/EAR) | Required; export licenses obtained for existing UAE customer hardware shipments | All shipments to UAE customers MBZUAI and G42, approximately 86% of FY2025 revenue | Export license delays or revocations could block majority of hardware revenue; disclosed as material risk in S-1/A |
| SOC 2 Type II Cloud Security Certification | Not publicly disclosed as of May 2026 | Cerebras Inference Cloud API and associated cloud infrastructure | Absence of public SOC 2 attestation is a diligence gap for enterprise cloud customers; no compensating disclosure found |
| ISO 27001 and ISO 9001 Quality Standards | Not publicly disclosed as of May 2026 | CS-3 system manufacturing and cloud operations | No verifiable third-party quality certification for hardware production or cloud security controls |
| Semiconductor Reliability Testing (HTOL and ESD) | Standard TSMC and CS-3 assembly practices; manufacturing specifics not publicly disclosed | WSE-3 chip and CS-3 chassis system integration | No published MTBF, field failure rate, or system-level reliability specification for WSE-3 or CS-3 |
| Production Deployment Validation (OpenAI and GSK) | Third-party confirmed; OpenAI Codex-Spark and GSK RNA workloads running in production | LLM inference production (OpenAI) and neural network training production (GSK) | Customer-disclosed only; no independent third-party audit or published SLA specification for these deployments |
Status based on S-1/A risk factor disclosures and absence of public certification announcements as of May 2026. Export control status inferred from S-1/A risk factor language and UAE customer revenue disclosures.
[CE024, CE034]| Date or Stage | Feature or Milestone | Status | Implication | Source |
|---|---|---|---|---|
| 2019 | WSE-1: 16nm TSMC, 1.2 trillion transistors, 400K AI cores, 18 GB SRAM | Released; no longer in production | First proof-of-concept for wafer-scale AI; established manufacturing viability and early customer proof points | S-1/A company disclosures |
| 2021 | WSE-2: 7nm TSMC, 2.6 trillion transistors, 850K AI cores, 40 GB SRAM | Released; succeeded by WSE-3 in 2023 | Second generation with material transistor density and SRAM improvement; validated biennial release cadence | S-1/A; AnandTech technical analysis |
| 2023 Q1 | WSE-3: 5nm TSMC, 4 trillion transistors, 900K AI cores, 44 GB SRAM; CS-3 system launch | Released and in production; sole revenue-generating hardware platform as of FY2025 | All FY2025 hardware revenue of $358.4M attributable to CS-3 or WSE-3; primary platform for 2026 | S-1/A; AnandTech; SemiAnalysis |
| April 2023 | Cerebras-GPT open-weight models (111M to 13B parameters) and arXiv 2304.03208 paper | Released on HuggingFace; Chinchilla scaling paper published on arXiv | Developer ecosystem building; independent scientific validation of training efficiency on wafer-scale hardware | arXiv 2304.03208; HuggingFace cerebras organization |
| March 2026 (binding term sheet) | AWS disaggregated inference integration combining CS-3 with AWS Trainium | Binding term sheet signed March 2026; integration not yet deployed as of May 2026 IPO | Unlocks AWS distribution channel and disaggregated inference for large models if executed on schedule | S-1/A March 2026 binding term sheet disclosure |
| Not announced as of May 2026 | WSE-4 next-generation chip | No public announcement, timeline, or specifications have been disclosed | Roadmap gap versus NVIDIA annual GPU cadence; technology currency risk if WSE-3 competitive gap narrows | Absence of disclosure in S-1/A and company communications as of runDate |
Dates from S-1/A disclosures and public release announcements. AWS integration status based on binding term sheet in S-1/A. WSE-4 entry reflects confirmed absence of any public announcement as of May 16, 2026.
[CE022, CE018, CE025, CE026]5.7 Exhibits
06Customers
6.1 Customer Base Overview and Segmentation
Cerebras serves five distinct customer segments differentiated by buyer type, geography, vertical, and procurement model. Sovereign AI programs—primarily MBZUAI and G42 in the UAE—constitute the anchor segment, representing approximately 86% of FY2025 revenue through large multi-year hardware contracts. These buyers are institutional and government-affiliated, with 6-to-18-month procurement cycles and extraordinary contract size ($100M+). Foundation model labs and hyperscalers—led by OpenAI under a $20B+ Master Revenue Agreement—represent the largest potential expansion segment. US national laboratories and defense customers, anchored by Sandia National Laboratories, validate scientific computing use cases. Enterprise life-sciences customers such as GSK demonstrate vertical adoption for RNA/protein model training. Finally, cloud API developers access Cerebras via cloud.cerebras.ai at $0.60 per million input tokens, enabling self-serve adoption with no hardware procurement friction. The segmentation reveals a bifurcated revenue model: high-value hardware contracts from sovereign and institutional buyers versus a nascent cloud/API business targeting the broader developer and enterprise market. Hardware was approximately 70% of FY2025 revenue ($358.4M) and cloud/services the remaining 30% ($151.6M), a significant shift from near-zero cloud revenue in FY2022.[CU001, CU002, CU003, CU004, CU005, CU031]
| Segment | Buyer Type | Geography | Vertical / Use Case | Notable Customers | Est. FY2025 Rev Share | Sales Cycle |
|---|---|---|---|---|---|---|
| Sovereign AI Programs | Institutional / Government | UAE / Middle East | Frontier LLM training, national AI infrastructure | MBZUAI, G42 | ~86% combined | 6-18 months; government procurement |
| Foundation Model Labs / Hyperscalers | Corporate enterprise | USA | LLM inference at scale, cloud compute distribution | OpenAI (MRA), AWS (term sheet) | MRA signed; rev pending | 6-18 months; strategic negotiation |
| National Labs / Defense | Government / federal | USA | Scientific computing, national security AI, HPC workloads | Sandia National Laboratories | Small; strategic signal | 12-24 months; federal procurement |
| Enterprise Life Sciences | Corporate enterprise | USA / Global | RNA/protein model training, drug discovery acceleration | GSK | Small; proof-of-concept | 6-12 months; enterprise IT |
| Cloud API Developers | Individual / Startup / Enterprise | Global | Low-latency LLM inference, application development | Self-serve via cloud.cerebras.ai | ~29.7% of FY2025 total | Self-serve; no sales cycle |
Segment revenue shares derived from S-1/A disclosures and estimated from cloud/hardware split. Sales cycle estimates are inferred from contract announcement timelines; not directly disclosed.
[CU001, CU002, CU003, CU005, CU018, CU031]Six-stage customer journey from initial awareness through strategic partnership, covering segmented entry points, procurement paths, and expansion loops for Cerebras customers.
Stage timing estimates are inferred from contract announcement timelines and public disclosures; no formal customer lifecycle data has been published by Cerebras.
[CU005, CU015, CU016, CU017, CU029, CU031]6.2 Adoption Trajectory and Revenue Growth
Cerebras has achieved rapid revenue growth from a small base, with total revenue growing from $24.6M in FY2022 to $78.7M in FY2023, $290.3M in FY2024, and $510M in FY2025—a cumulative increase of approximately 1,975% in three years. The FY2022-to-FY2025 compound annual growth rate is approximately 174%. Year-over-year growth moderated from 269% (FY2023 to FY2024) to 76% (FY2024 to FY2025) as the absolute revenue base expanded. The most significant adoption signal in FY2025 is the emergence of cloud/services revenue: $151.6M, or 29.7% of total, versus near-zero in FY2022 and FY2023. This suggests genuine developer and enterprise adoption of the cloud.cerebras.ai inference API, which went generally available in August 2024. The customer concentration pattern shows that Cerebras grew by winning a small number of extraordinarily large contracts rather than through broad market penetration. G42 represented approximately 85% of FY2024 revenue before MBZUAI growth in FY2025 distributed concentration across two anchor customers. The OpenAI MRA, signed December 2025, creates a third major revenue vector for FY2026 and beyond, though revenue recognition is pending the 750MW infrastructure buildout. Customer count remains undisclosed, but the revenue trajectory implies fewer than 15 customers account for nearly all recognized revenue.[CU006, CU007, CU008, CU009, CU010, CU011]
| Metric | FY2022 | FY2023 | FY2024 | FY2025 | Key Implication |
|---|---|---|---|---|---|
| Total Revenue ($M) | 24.6 | 78.7 | 290.3 | 510.0 | ~20x growth in 3 years; driven by UAE anchor contracts and OpenAI pipeline |
| Cloud / Services Rev ($M) | ~0 | ~4.3 | ~48.8 | 151.6 | Cloud growing from near-zero to 30% of revenue in 3 years; self-serve adoption signal |
| Hardware Rev ($M) | ~24.6 | ~74.4 | ~241.5 | ~358.4 | Hardware growing in absolute terms but declining as share; sovereign AI dominates |
| YoY Revenue Growth (%) | N/A | ~220% | ~269% | ~76% | Growth rate moderating as absolute base expands; concentration persists |
| Top-2 Customer Rev Share | ~90%+ (est.) | ~85%+ (est.) | ~85% (G42 alone) | ~86% (MBZUAI 62% + G42 24%) | Extreme concentration unchanged YoY despite revenue scale; structural diligence risk |
FY2025 cloud/hardware split and total revenue from S-1/A. FY2022-FY2024 cloud estimate derived from implied growth trajectory. Top-2 shares for FY2022-FY2023 are approximate.
[CU006, CU007, CU008, CU009, CU010, CU011]Illustrative five-stage adoption funnel from total addressable accounts through anchor status, highlighting the extreme narrowing to just two accounts representing 86% of revenue.
Stage values are illustrative estimates derived from S-1/A revenue disclosures and market sizing assumptions. Exact customer counts are not disclosed by Cerebras.
[CU003, CU004, CU005, CU009, CU010]6.3 Named Customer Deployments and Production Evidence
Cerebras has disclosed five named customers with production or near-production evidence as of May 2026. MBZUAI (Mohamed bin Zayed University of Artificial Intelligence), the world's first graduate-level AI university based in Abu Dhabi, is the largest single customer at 62% of FY2025 revenue, operating CS-3 clusters for frontier LLM training and UAE sovereign AI programs. G42, an Abu Dhabi AI and technology conglomerate that has invested $335M in Cerebras as a strategic partner, accounts for 24% of FY2025 revenue and has been a continuous customer since at least FY2022. OpenAI's Codex-Spark coding product runs in production on Cerebras inference infrastructure—a hyperscaler-grade production reference disclosed jointly by OpenAI and Cerebras. GSK reported a 10x speedup and 120x larger trainable dataset for RNA drug-discovery model training on CS-3 hardware, representing the highest-quality quantified outcome disclosure in the portfolio. Sandia National Laboratories is named as a CS-3 customer for scientific computing and national security AI research. The evidence quality is uneven: MBZUAI and G42 evidence is limited to revenue-percentage disclosures in the S-1/A; OpenAI evidence is substantiated by public announcements; GSK provides quantified outcomes; and Sandia is a name-only reference. No independent G2 or Capterra reviews of the Cerebras platform are publicly available as of May 2026. AWS and IBM watsonx are distribution partners with signed agreements but have not yet generated recognized revenue.[CU012, CU013, CU014, CU015, CU016, CU017]
| Customer | Segment | Deployment Status | Use Case | Quantified Outcomes | Evidence Quality | Key Limitation |
|---|---|---|---|---|---|---|
| MBZUAI | Sovereign AI / Education | Production — multi-year LLM training | Frontier LLM training, UAE sovereign AI computing programs | 62% of FY2025 revenue (~$316M est.) | High — S-1/A revenue disclosure | Training program scope not disclosed; FY2026 commitment not confirmed |
| G42 | Sovereign AI / Strategic Investor | Production — multi-year hardware and inference | Large-scale LLM inference and training; UAE AI infrastructure | 24% FY2025 rev; 85% FY2024 rev; $335M invested | High — S-1/A disclosure and investor filing | Export-control dependency; FY2026 spend not committed publicly |
| OpenAI | Foundation Model Lab | Production — Codex-Spark inference live; MRA signed Dec 2025 | LLM inference for Codex-Spark coding product; future 750MW capacity | $20B+ MRA notional; $1B working capital loan; Codex-Spark production | High — joint public announcement and S-1/A corroboration | MRA revenue not yet recognized; conditionality and operational timeline not fully disclosed |
| GSK | Enterprise Life Sciences | Production — CS-3 hardware for RNA training | RNA sequence model training for drug discovery pipeline | 10x speedup vs GPU baseline; 120x larger trainable dataset | Medium — company press release, conference disclosed | No independent verification; outcome metrics are Cerebras and GSK co-disclosed |
| Sandia National Laboratories | National Lab / Defense | Production — CS-3 clusters deployed | Scientific computing, AI research for national security | Named customer; operational since at least 2024 | Medium — S-1/A named customer reference | Revenue small relative to total; specific workload outcomes not described |
Based on Cerebras S-1/A May 2026, OpenAI blog, and GSK press release. AWS and IBM not included as they have not yet generated recognized revenue. Coverage is exhaustive for disclosed production-revenue or MRA-signed customers as of May 2026.
[CU012, CU013, CU014, CU015, CU017, CU018]Five-column evidence quality assessment across the five named Cerebras production customers, covering deployment status, quantified outcomes, public reference quality, contract durability, and strategic investment status.
Evidence quality assessments are analyst judgments based on publicly disclosed information through May 2026; private contract terms are not reflected.
[CU012, CU013, CU014, CU015, CU017, CU018]6.4 Retention Indicators and Contract Durability
No net revenue retention (NRR), gross revenue retention (GRR), churn rate, or cohort-based retention metric has been publicly disclosed by Cerebras as of May 2026. This is a material diligence gap: standard AI infrastructure investment analysis requires retention benchmarks to validate customer durability. The available proxy evidence is directionally positive but insufficient as a substitute for formal retention KPIs. G42 has been a continuous revenue contributor from FY2022 through FY2025, declining in revenue share only because MBZUAI grew faster—not because G42 churned. G42's $335M strategic investment further aligns incentives for continuity. MBZUAI grew from an implied smaller share in FY2024 to 62% of FY2025 revenue, demonstrating strong expansion rather than attrition. The OpenAI MRA structure—a $20B+ notional commitment and $1B working capital loan at 6% interest—implies a contractually committed long-term relationship that would be costly to exit. However, MRA conditionality terms are not publicly disclosed. No formal contract lengths, renewal cadences, or take-or-pay commitments have been disclosed for any customer. The absence of retention KPIs in the S-1/A is a standard omission for hardware companies at this scale, but it creates a diligence gap that should be addressed through private management access before investment.[CU019, CU020, CU021, CU022, CU023]
| Metric | Value / Status | Customer / Segment | Confidence | Diligence Ask |
|---|---|---|---|---|
| Net Revenue Retention (NRR) | Not disclosed | All segments | Not applicable — data absent | Request NRR by segment in private diligence; particularly MBZUAI and G42 renewal commitments for FY2026 |
| Gross Revenue Retention (GRR) | Not disclosed | All segments | Not applicable — data absent | Request contract renewal documentation and churn data for completed hardware delivery terms |
| Known Customer Churn | None disclosed as of May 2026 | All | High — no churn reference in S-1/A | Confirm whether G42 voluntarily reduced spend; verify MBZUAI multi-year contract terms and successor programs |
| G42 Multi-Year Continuity | Continuous since FY2022; $335M strategic investment; 24% of FY2025 rev | G42 | High — revenue confirmed in filing disclosures | Confirm FY2026 hardware delivery obligations; verify renewed hardware purchase agreement |
| OpenAI MRA Durability | $20B+ notional; $1B loan at 6% interest; operational start pending 750MW buildout | OpenAI | Medium — announced but pre-revenue recognition | Monitor post-IPO quarterly disclosures; confirm 750MW buildout timeline and MRA activation milestones |
No NRR, GRR, cohort, or third-party satisfaction data has been publicly disclosed as of May 2026. Values reflect the absence of disclosure, not confirmed zero retention or churn.
[CU019, CU020, CU021, CU022, CU023]Estimated cohort retention analysis showing continuous customer engagement by acquisition year, based on revenue disclosures rather than formal NRR data.
Retention percentages are estimated from public revenue share disclosures and analyst inference; Cerebras has not disclosed NRR, GRR, or formal cohort retention data. Pre-acquisition years show 0 to indicate cohort not yet active; values for active years reflect estimated retention based on continuous revenue contribution.
[CU019, CU020, CU022, CU023]6.5 Expansion Pathways and Revenue Concentration Risk
The top-2 customer concentration of 86% (MBZUAI + G42) is extreme by any benchmark for AI infrastructure companies at comparable revenue scale. The top-5 customers represented 94% of FY2025 revenue. This concentration creates a single-point-of-failure risk: any disruption to UAE sovereign demand—whether from geopolitical shifts, export-control tightening, or budget reallocation—would be immediately catastrophic to recognized revenue. The BIS/EAR export-control dependency for UAE customer shipments (representing 86% of revenue) is the most structurally acute risk in the Cerebras investment thesis. Four primary diversification vectors have been announced: the OpenAI MRA ($20B+ notional, 750MW committed capacity) is the largest and most imminent, capable of adding hundreds of millions in annual recognized revenue once operational infrastructure is deployed. The AWS binding term sheet (March 2026) provides channel access to thousands of enterprise customers without requiring direct Cerebras sales engagement. IBM watsonx partnership targets regulated industries. Cloud API self-serve growth enables developer-led adoption with no procurement friction. However, none of these three vectors had generated material recognized revenue as of the IPO date (May 2026). The diligence question is whether recognized revenue diversification can occur before UAE concentration risk materializes. Revenue from FY2023 to FY2025 grew 548% to $510M, confirming strong demand—but the customer concentration remains an unresolved structural vulnerability.[CU024, CU025, CU026, CU027, CU028, CU030]
| Driver or Risk | Type | Revenue Impact | Current Status | Diligence Path |
|---|---|---|---|---|
| OpenAI MRA ($20B+ notional, 750MW) | Expansion driver | Largest single diversification vehicle; could contribute $500M–$2B+ annually at scale | Signed Dec 2025; operational start pending 750MW cluster buildout; no revenue recognized at IPO | Monitor S-1 post-IPO quarterly filings; confirm 750MW buildout timeline and first revenue recognition |
| AWS Channel Partner (binding term sheet) | Expansion driver | Access to AWS enterprise customer base; mutual distribution through AWS Marketplace | Binding term sheet March 2026; final terms not yet closed; no Marketplace listing at IPO | Track AWS Marketplace listing announcement; monitor first enterprise customer conversions in earnings calls |
| UAE Sovereign Concentration (MBZUAI + G42 = 86%) | Critical concentration risk | Loss of either customer would be immediately catastrophic to FY2026 revenue without MRA ramp | Multi-year contracts in place; export-license dependent; MBZUAI growing, G42 stable | Verify FY2026 hardware delivery orders; assess BIS/EAR export-license status for UAE shipments; review any government AI policy changes in UAE |
| Cloud API Land-and-Expand | Expansion driver | Self-serve discovery to enterprise conversion; 30% of FY2025 revenue is cloud/services | GA since August 2024; growing rapidly; no disclosed developer count or conversion metrics | Request developer MAU, API revenue by cohort, enterprise conversion rate from cloud.cerebras.ai in private diligence |
| IBM watsonx Enterprise Channel | Expansion driver | Access to IBM regulated-industry enterprise customers; accelerates sales in financial services, healthcare, government | Partnership announced; revenue not yet material as of May 2026 | Request IBM deal pipeline volume, number of committed enterprise leads, and revenue forecast from IBM channel |
Status and impact assessments based on S-1/A disclosures and press announcements. Revenue projections are analyst estimates derived from contract values, not Cerebras guidance.
[CU015, CU016, CU024, CU025, CU026, CU027]6.6 Channel Partners and Market Access Infrastructure
Cerebras is building a channel ecosystem designed to convert early hyperscaler and enterprise partnerships into scalable revenue pipelines. The AWS binding term sheet (March 2026) represents the most consequential channel development: AWS would distribute Cerebras inference capacity through its Marketplace, exposing Cerebras to the full AWS enterprise customer base including regulated industries, financial services, and government verticals. The commercial terms are not finalized, and no Marketplace listing was live at IPO. The IBM watsonx partnership provides a complementary enterprise channel focused on IBM's traditional regulated-industry customers where IBM has established procurement relationships. OpenAI functions as both a revenue customer and an indirect distribution channel: Cerebras inference infrastructure underpins OpenAI Codex-Spark, providing downstream visibility to OpenAI's developer and enterprise customer base. The Cerebras Cloud API (cloud.cerebras.ai) serves as a direct self-serve channel that enables individual developers, startups, and enterprise API consumers to access inference capacity without navigating hardware procurement. The API is OpenAI Chat Completions compatible, reducing developer switching friction to a base-URL change. The channel infrastructure as built is appropriately designed for diversification—it combines direct enterprise sales (G42, MBZUAI, OpenAI MRA), hyperscaler distribution (AWS), enterprise software channel (IBM), and self-serve developer (cloud API). The execution risk is that none of the new channels had generated material recognized revenue at IPO.[CU015, CU016, CU025, CU026, CU029, CU030]
6.7 Exhibits
07Risks
7.1 Risk Overview and Severity Rankings
Cerebras Systems enters the public markets with a risk profile dominated by three structural concentrations that compound one another: a single manufacturing source (TSMC) for which no alternative exists, two customers representing 86% of revenue (MBZUAI and G42 in the UAE) operating under revocable BIS export licenses, and one counterparty playing three simultaneous roles (OpenAI as production customer, $1B lender, and $20B+ revenue anchor). These risks do not operate independently: a BIS export rule change targeting UAE destinations would simultaneously eliminate hardware revenue from both top customers, stress the OpenAI working capital loan, and impair the MRA diversification thesis. The risk-heatmap below maps ten identified risks by likelihood and impact severity, while the risk-transmission map shows how individual risk events cascade into revenue, financing, and valuation outcomes. Secondary risks include wafer-scale yield dependency, liquid-cooling infrastructure constraints, founder dual-class voting control, and an IPO multiple exceeding 190 times trailing revenue. No public monitoring thresholds have been disclosed by Cerebras; investors must construct their own monitoring frameworks before underwriting at the current valuation.[CR001, CR002, CR003, CR018]
Cerebras highest-severity residual risks cluster in the export-control and manufacturing-dependency quadrants; financial model and governance risks are material but carry more mitigation headroom.
Likelihood and impact are qualitative ordinal assessments derived from public evidence; this heatmap ranks residual exposure rather than generating precise probability estimates. Mitigation maturity reflects publicly disclosed programs only; private or internal mitigations are unknown.
[CR001, CR003, CR006, CR011, CR016, CR022]Risk events at the regulatory and manufacturing layer transmit through revenue collapse and capital stress to converge on valuation downside; OpenAI counterparty risk spans multiple transmission channels simultaneously.
Transmission map is qualitative and directional; edge weights and probability magnitudes are not modeled. The goal is to identify which root risks cascade through the most downstream nodes and converge on the same financial outcomes.
[CR004, CR011, CR013, CR015, CR019, CR023]7.2 Regulatory and Legal Risks
Cerebras most consequential regulatory risk is the Export Administration Regulations (EAR) framework administered by BIS. The October 2023 Advanced Computing rule and the October 2024 AI Diffusion Rule establish performance thresholds that govern whether WSE-3 chips can be exported to UAE-based customers. Cerebras has publicly disclosed that it holds BIS licenses for MBZUAI and G42 shipments, but those licenses can be revoked, modified, or refused at renewal. A regulatory posture change, such as placing MBZUAI or G42 on the Entity List, reclassifying UAE as a destination requiring tighter controls, or reducing performance thresholds below the WSE-3 level, would immediately prohibit existing and future hardware shipments to the two customers representing 86% of FY2025 revenue. The CFIUS precedent from the Saudi Aramco episode, which caused Cerebras to withdraw its IPO in Q4 2024, illustrates that national security review can impose material business disruption with limited advance notice. G42 2024 agreement to divest Chinese technology holdings partially addresses CFIUS residual risk but requires ongoing compliance monitoring. No material IP litigation has been disclosed in the S-1/A, but NVIDIA semiconductor patent portfolio creates a latent infringement risk warranting an FTO opinion before institutional capital is deployed.[CR004, CR005, CR006, CR007, CR008, CR009]
| Rule or Case | Jurisdiction | Status | Likelihood | Severity | Mitigation | Residual Exposure | Diligence Path |
|---|---|---|---|---|---|---|---|
| EAR/BIS Advanced Computing Export Controls — WSE-3 performance threshold breach | US Federal (BIS / Commerce) | Active — export licenses obtained for UAE shipments; ongoing compliance obligations | High | Critical | BIS export licenses obtained; internal compliance program; customer counterparty screening | Critical — 86% FY2025 revenue under license; revocation is a near-total-loss event | Obtain outside counsel memo on license scope, renewal conditions, and revocation risk under current BIS posture |
| CFIUS national security review — foreign investor or customer scrutiny | US Federal (CFIUS / Treasury) | Resolved for current investors — Saudi Aramco withdrawn; G42 China divestiture completed; ongoing monitoring | Medium | High | G42 divested Chinese technology holdings in 2024; CFIUS cleared for current investor structure | Medium-High — future foreign investment or customer transactions may re-trigger CFIUS review | Conduct ongoing third-party counterparty screening on G42 and MBZUAI; verify no undisclosed foreign investors |
| AI Executive Order 14110 and BIS AI Diffusion Rule — compute provider compliance | US Federal (NSC / NIST / BIS) | Active — advanced compute providers subject to safety reporting and compliance obligations | Medium | Medium | Internal compliance program expected; NIST AI RMF adoption by enterprise customers provides indirect coverage | Medium — indirect compliance burden through customers; Cerebras is not a frontier model developer but serves them | Confirm AI EO compliance posture with management; request any NIST AI RMF documentation or flow-down clauses |
| IP and trade-secret litigation risk — semiconductor patent landscape | US Federal (PTAB / District Courts) | Latent — no confirmed litigation as of May 2026; NVIDIA holds 10,000+ semiconductor patents | Low-Medium | High | Novel wafer-scale architecture may reduce prior-art overlap; Cerebras has own patent portfolio | Medium — absence of current litigation does not preclude future assertion as Cerebras gains market share | Obtain FTO opinion from IP counsel covering WSE-3 key design elements; review S-1/A patent risk disclosures |
| G42 OFAC and BIS counterparty screening — prior Chinese technology ties | US Federal (OFAC / BIS / CFIUS) | Resolved for current civil transactions — G42 divested China holdings; ongoing monitoring required | Low-Medium | High | G42 divested Chinese investments in 2024; Microsoft US government approval obtained; compliance program active | Medium-High — G42 history complicates Cerebras sales to US government and defense regardless of current OFAC status | Conduct recurring third-party counterparty screening on G42; assess US government customer suitability impact |
Severity and likelihood are qualitative assessments derived from BIS regulatory text, Cerebras S-1/A risk factor disclosures, and news reporting through May 2026. No court-assessed damages or formal regulatory determinations underlie these estimates; they represent analyst-constructed risk registers from publicly available information.
[CR004, CR005, CR006, CR007, CR008, CR009]7.3 Operational, Quality, and Security Risks
Cerebras operational risk is dominated by its sole-source relationship with TSMC for WSE-3 fabrication. No other foundry in the world has publicly disclosed capability to manufacture a chip occupying a full 300mm wafer at the advanced lithography node required for commercial yield. This makes TSMC a structural binary dependency: a Taiwan conflict, TSMC production disruption, capacity reallocation, or yield-impacting process change could halt Cerebras entire hardware supply pipeline for 12 or more months. The defect-tolerant routing embedded in WSE-3 design mitigates yield risk from individual transistor failures, but cannot compensate for a foundry-level production halt or a fundamental TSMC process change that renders the routing ineffective. CS-3 systems require approximately 23 kW power draw and liquid cooling, limiting deployment to specialized data center facilities and structurally constraining the on-premises total addressable market, partially addressed by cloud inference but introducing cloud-specific operational risks. Cerebras cloud inference services have not disclosed a SOC 2 Type II certification, incident response protocol, or enterprise security audit as of May 2026, a gap that could become a material sales objection for enterprise and regulated-industry customers. Hardware lead times of 6-12 months from TSMC mean that any supply disruption translates into a multi-quarter revenue gap with limited short-cycle recovery options.[CR011, CR012, CR013, CR014, CR015, CR016]
| Failure Mode | Category | Likelihood | Severity | Mitigation Maturity | Residual Exposure | Unresolved Gap |
|---|---|---|---|---|---|---|
| TSMC sole-source manufacturing disruption — Taiwan Strait conflict or production failure | Supply Chain | Low-Medium (tail risk; scenario probability) | Critical — production halt with no 12-24 month recovery path possible | Very Low — no alternative foundry; no disclosed secondary qualification underway | Critical — binary dependency; TSMC disruption is existential for hardware supply | No disclosed business continuity plan for TSMC disruption; no secondary foundry qualification evidence |
| WSE-3 wafer yield degradation — defect density increase or TSMC process change | Production Quality | Medium — single-die-per-wafer design amplifies every defect | High — yield decline directly raises per-unit cost and constrains chip supply | Low — defect-tolerant routing is the primary mitigation; no disclosed yield monitoring program | High — yield risk is intrinsic to wafer-scale architecture and cannot be fully eliminated | Yield rates and improvement trends not disclosed; investors have no visibility into production cost trajectory |
| CS-3 thermal and power incompatibility with enterprise air-cooled data centers | Deployment Constraint | High — majority of enterprise data centers are air-cooled standard facilities | Medium — restricts on-premises addressable market; not an operational outage risk | Medium — Cerebras cloud inference service provides a software workaround for non-conforming facilities | Medium — on-premises hardware sales structurally limited; cloud offset depends on execution | No public disclosure of target enterprise accounts with liquid cooling as a fraction of total TAM |
| Cybersecurity incident or data breach on Cerebras cloud inference infrastructure | Cloud Security | Low-Medium — cloud AI infrastructure is a growing attack surface and high-value target | High — breach of sensitive customer AI workloads impairs trust with pharma and government customers | Low — no public SOC 2 Type II audit report; no disclosed incident response or BCP documentation | Medium — absence of verified security posture creates sales objections in regulated-sector accounts | No public SOC 2 Type II audit; no disclosed incident response timeline or security certification scope |
Likelihood and severity are qualitative assessments derived from Cerebras S-1/A risk factor disclosures, public technical specifications for the WSE-3 and CS-3, and analyst reporting through May 2026. No proprietary operational data was used in constructing this register.
[CR011, CR012, CR013, CR014, CR015, CR016]7.4 Partner and Dependency Risks
Cerebras has constructed a partner ecosystem that simultaneously reduces some risks while creating new concentration vectors. The OpenAI relationship is the clearest example of this dual nature: the $1B working capital loan provides a critical buffer against capital-intensive hardware manufacturing cycles, but the same counterparty holds the largest committed revenue obligation, runs production workloads on Cerebras infrastructure, and holds 33.4 million dilutive warrants. If the OpenAI relationship deteriorates for any reason, including competitive dynamic shift, OpenAI corporate distress, or strategic pivot to custom silicon, the simultaneous loss of working capital support, near-future revenue, and the production reference customer would be structurally devastating. The UAE customer concentration is similarly entrenched: MBZUAI and G42 are sovereign-affiliated institutions with durable AI mandates, but both require active BIS export licenses and are subject to geopolitical risk beyond Cerebras control. The AWS binding term sheet provides the most credible near-term diversification pathway but has not yet generated recognized revenue. The dependency-map figure below inventories the principal partner and regulatory relationships and their directional dependencies. Until non-UAE and non-OpenAI revenue exceeds 20% of total, the partner and dependency risk vector should be treated as the primary investment concentration concern.[CR018, CR019, CR020, CR021, CR022, CR033]
| Counterparty | Role(s) | Revenue Exposure | Risk Type | Mitigation | Residual Exposure |
|---|---|---|---|---|---|
| MBZUAI and G42 (UAE sovereign entities) | Top 2 hardware customers | ~86% FY2025 ($440M of $510M) | Revenue concentration plus export license dependency | AWS and OpenAI MRA diversification in progress; BIS compliance program active | Critical — no near-term revenue substitute; simultaneous license revocation is catastrophic |
| OpenAI (customer plus lender plus MRA anchor) | Production customer, $1B working-capital lender, $20B+ MRA holder | ~$21B+ notional MRA; $1B loan principal | Counterparty concentration — triple-role dependency | Long-term MRA commitment; working capital buffer provided by loan itself | High — adversarial relationship would impair revenue, capital, and reference customer status simultaneously |
| TSMC (sole chip foundry) | Only qualified manufacturer of WSE-3 chips | 100% of chip supply | Supply chain single point of failure | Long-standing relationship; presumed priority customer allocation | Critical — no backup foundry; Taiwan geopolitical scenario is binary production-halt risk |
| AWS (distribution channel partner) | Enterprise channel distributor — binding term sheet, not yet signed commercial agreement | ~0% current revenue; future revenue potential | Channel dependency — risk if term sheet does not convert to commercial agreement | Binding term sheet provides legal framework; AWS has strong enterprise distribution | Medium — if term sheet fails to close, primary US enterprise diversification channel is lost |
Revenue exposure percentages are from Cerebras S-1/A FY2025 disclosures. OpenAI MRA notional value is from official OpenAI and Cerebras announcements (December 2025). Severity assessments are analyst estimates based on public information; actual contractual terms have not been publicly disclosed.
[CR018, CR019, CR020, CR021, CR022, CR036]Cerebras depends on TSMC for all chip production, two UAE sovereign customers for 86% of revenue, OpenAI in three simultaneous roles, and BIS licensing for the legality of all UAE hardware sales.
Dependency map is directional and qualitative; edges represent material operational or revenue dependencies, not contractual obligations. BIS is shown upstream of UAE customer nodes to reflect the export license dependency structure governing all hardware sales to UAE entities.
[CR011, CR018, CR022, CR023, CR033, CR036]7.5 Financial and Model Risks
Cerebras financial risk profile features a significant GAAP versus non-GAAP gap, hardware deal lumpiness, and an IPO valuation that prices near-perfect execution. The FY2025 GAAP net income of approximately $237.8M is almost entirely driven by a non-cash warrant remeasurement gain related to the OpenAI warrants; on a non-GAAP adjusted basis, Cerebras shows an approximately $75.7M operating loss, which more accurately reflects underlying cash generation quality. Gross margin of approximately 39% is below software peers and reflects TSMC high fabrication costs for advanced-node wafer-scale production. Hardware revenue is composed of large, multi-hundred- million dollar contracts with extended lead times, creating quarter-over-quarter revenue volatility that can misalign with investor expectations and trigger disproportionate stock reactions to any earnings miss. At the IPO opening price implying approximately 196 times trailing FY2025 revenue, a deceleration from 76% revenue growth to even 40-50% would likely compress the multiple significantly. The 180-day lockup expiry creates near-term supply overhang risk when insiders become eligible to sell approximately November 2026. The OpenAI $1B loan, while supportive in the near term, becomes an adversarial financial instrument if the relationship deteriorates before the 2032 maturity date.[CR024, CR025, CR026, CR027, CR028, CR029]
7.6 People, Execution, and Mitigations
Cerebras governance structure concentrates strategic control in founders through a dual-class share structure where Class B shares carry 10-to-1 voting rights, giving founders approximately 99.2% aggregate voting power at IPO despite institutional ownership of a significant economic share. Andrew Feldman (CEO) is explicitly named in the S-1/A as a key-person risk; no public succession plan has been disclosed. The engineering team represents a wafer-scale talent moat but also a talent concentration risk if core chip architects depart to NVIDIA, AMD, or well-funded AI chip startups, which could delay the CS-4 program by 12-18 months. The people-execution risk register documents these dependencies by role. The mitigation and monitoring table below provides a structured investor framework built from S-1/A disclosures, BIS regulatory posture analysis, and analyst-derived threshold logic. Key diligence asks before underwriting at current valuation include: BIS license copies and renewal timelines, OpenAI MRA conditionality terms and loan covenant details, non-UAE revenue quarterly split, TSMC capacity contract summary, and an organizational depth chart for core engineering leadership. Without these inputs, underwriting Cerebras at a valuation exceeding $100 billion requires accepting material structural information asymmetries that cannot be resolved from public disclosures.[CR030, CR031, CR032, CR035]
| Role or Function | Key Dependency or Gap | Likelihood of Loss | Severity | Mitigation | Diligence Path |
|---|---|---|---|---|---|
| CEO Andrew Feldman | Strategic vision, investor relationships, and key customer relationships substantially tied to founder-CEO | Low-Medium — no public succession signal; founder is company face and IPO architect | High — departure would impair investor confidence, sales pipeline, and board stability simultaneously | No publicly disclosed succession plan; management team broadening status unknown | Request board-level succession plan; assess management team depth in CFO, CRO, and CTO roles |
| Senior chip design architects — WSE program team | Wafer-scale design expertise concentrated in small specialist team; CS-4 program depends on these individuals | Low — competitive equity pay assumed; departure risk elevated post-IPO lockup expiry | High — CS-4 program delay of 12-18 months possible if key architects depart; no external talent market | Retention equity assumed vesting accelerated by IPO; competitive compensation but terms not disclosed | Request management org chart and key-person retention terms; assess CS-4 milestone dependencies |
| Founder dual-class voting control | 99.2% founder voting power via Class B shares gives founders unchecked governance authority | Structural — not a departure risk but a permanent governance constraint | Medium — minority shareholders cannot vote out management or block capital structure changes | Standard dual-class governance; market norms for founder-led technology companies apply | Review board composition, independent director independence, and any contractual investor protections |
Key-person designations are per S-1/A risk factor disclosures. Voting control percentages are based on dual-class share structure at IPO as publicly disclosed. Likelihood and severity are qualitative assessments from public information; management succession and retention terms are not publicly disclosed.
[CR030, CR031, CR032, CR041]| Risk Area | Mitigation in Place | Monitoring Indicator | Thesis-Break Trigger | Diligence Ask |
|---|---|---|---|---|
| BIS and EAR export controls — UAE chip sales | Active BIS license; compliance program; G42 China divestiture completed | License renewal dates; BIS rule updates; MBZUAI and G42 Entity List status | BIS revokes UAE export license or adds MBZUAI or G42 to Entity List | Provide copies of current BIS licenses and renewal conditions; outside counsel FTO memo |
| TSMC sole-source manufacturing disruption | Long-standing TSMC customer relationship; presumed priority capacity allocation | Taiwan Strait geopolitical indicators; TSMC force majeure disclosures; lead-time changes | Any confirmed TSMC production disruption affecting Cerebras wafer commitments | Provide TSMC fabrication agreement summary; confirm capacity allocation and priority status |
| UAE revenue concentration exceeding 80% of total | OpenAI MRA ramp; AWS term sheet for enterprise channel; cloud inference growth | Non-UAE quarterly revenue share; OpenAI MRA deployment milestones; new customer announcements | UAE revenue share remains above 70% through year-end 2026 with no offsetting revenue growth | Provide quarterly revenue split by customer geography; OpenAI infrastructure buildout timeline |
| OpenAI multi-role dependency — customer, lender, and MRA anchor | MRA long-term commitment provides alignment; working capital loan demonstrates OpenAI investment | OpenAI capex signals; public statements on inference strategy; loan covenant compliance | OpenAI terminates MRA, triggers loan recall for technical breach, or announces custom silicon | Confirm MRA conditionality terms and loan covenant details; cross-default provisions |
| Key-person and talent concentration — CEO and chip architects | Equity incentives post-IPO; specialized compensation for engineering team assumed | CEO departure news; CS-4 program milestone delays; engineering leadership headcount changes | Andrew Feldman departure without disclosed succession plan or CS-4 program delay of 6+ months | Provide organizational depth chart; CEO succession plan; key-person retention terms and vesting |
Triggers and monitoring thresholds are analyst-derived from S-1/A risk factor disclosures and industry risk frameworks. Cerebras has not publicly defined any monitoring thresholds or thesis-break events; these are proposed investor monitoring thresholds constructed for diligence and portfolio monitoring.
[CR033, CR035, CR036, CR039, CR042]7.7 Exhibits
08Valuation
8.1 Investment Thesis and Anti-Thesis
The Cerebras investment thesis rests on five structurally grounded pillars. First, the AI inference market is growing at 40%+ CAGR and represents the fastest-expanding segment of AI infrastructure spend, and Cerebras has deployed the only wafer-scale chip architecture delivering 2,100+ tokens per second — a performance advantage of 10-15x over GPU-cluster alternatives for memory-bandwidth-bound inference workloads. Second, the company has demonstrated repeatable commercial traction at scale: revenue grew from $78.7 million in FY2023 to $510 million in FY2025 (CAGR exceeding 100%), and the S-1/A filing confirms this trajectory is hardware and cloud inference revenue, not one-time project fees. Third, the OpenAI $20B+ master revenue agreement announced December 2025 and the AWS binding term sheet (March 2026) represent two independent demand pipelines that could collectively exceed $2 billion in annual compute spend and materially diversify Cerebras away from its current UAE concentration. Fourth, the IPO itself validates institutional investor appetite for WSE-based AI compute at scale; the $5.55 billion raised is not speculative capital — it is capital deploying against an identified whitespace in the AI accelerator market where Nvidia's GPU ecosystem has pricing power but not a monopoly. Fifth, the dual-class structure and founder continuity (Andrew Feldman, Sean Lie) preserve strategic optionality to invest in next-generation WSE architecture without short-term earnings pressure. The anti-thesis is equally substantive. The 86% UAE customer concentration (MBZUAI 62%, G42 24% of FY2025 revenue) creates asymmetric downside: any deterioration in either relationship — whether from US export control enforcement, geopolitical instability, or MBZUAI/G42 budget reallocation — would compress FY2026 revenue dramatically without a replacement pipeline of comparable size. Nvidia's CUDA ecosystem and B200/GB200 NVL72 Blackwell architecture continues to dominate enterprise AI training and inference at scale, and the software moat compounds annually. Hardware-first business models historically trade at 8-12x revenue in public markets, not 30-45x; Cerebras's premium multiple is priced to perfection and embeds almost no multiple-compression buffer. Export controls remain a live risk: the Saudi Aramco CFIUS review that delayed the 2024 S-1 was resolved, but BIS restrictions on advanced AI chip sales to UAE and Saudi Arabia remain an active regulatory uncertainty. Finally, the withdrawn 2024 S-1 attempt is a permanent part of the record — the original $4B valuation sought was not achievable in the 2024 market; the 2026 IPO succeeded only after 20 months of continued revenue growth, improved macro conditions, and the OpenAI MRA announcement.[CV001, CV003, CV004, CV005, CV006, CV007]
| Dimension | Thesis argument | Anti-thesis argument | What would change the view |
|---|---|---|---|
| Market | AI inference market growing 40%+ CAGR; WSE architecture captures latency-sensitive workloads where GPU clusters are bandwidth-bottlenecked | Nvidia H200/B200 + TensorRT-LLM addresses bandwidth gap for major model sizes; Cerebras wins only narrow inference workloads | Measured benchmark data showing Cerebras maintaining 5x+ cost-performance advantage at the same model sizes as Blackwell GPU clusters |
| Product / moat | Wafer-scale architecture with 44GB on-chip SRAM provides durable bandwidth moat; 2,100+ tokens/sec on LLaMA-70B is independently verified | SRAM advantage shrinks as new DRAM architectures (HBM4) improve GPU memory bandwidth; moat has a finite lifecycle against sustained Nvidia R&D | CS-4 WSE-4 roadmap confirmation with specific bandwidth and efficiency targets that outpace projected Nvidia HBM4 improvements |
| Customers | OpenAI $20B+ MRA and AWS term sheet represent 2 independent multi-billion pipelines that can diversify away from UAE concentration | Both MRA and AWS term sheet are non-binding or conditional; neither has confirmed revenue contribution in FY2025 financials | FY2026 Q1 earnings showing $50M+ quarterly OpenAI or AWS revenue contribution, confirming contract execution |
| Financials | 76% FY2025 YoY growth to $510M is exceptional for hardware company; revenue is real customer payments for deployed silicon, not software licenses | 86% revenue concentration in two UAE sovereign entities creates fundamental revenue quality risk that a single political or regulatory event can reverse | FY2026 UAE share falling below 60% through new customer additions while total revenue continues growing above 40% |
| Competition | Pure-play wafer-scale approach has no direct competitor; Groq (LPU), SambaNova (RDU), and AMD (HBM3) are architecturally different with different performance/workload profiles | Nvidia's platform-level dominance (CUDA, software tools, cloud integration, customer relationships) creates switching costs that Cerebras cannot easily overcome | Enterprise AI customer (non-sovereign, non-hyperscaler) choosing Cerebras Cloud API over equivalent NVIDIA-based cloud inference at comparable price |
| Exit / return | Fresh IPO with $5.55B raised provides decade of runway; public market liquidity and quarterly earnings discipline will improve information quality | Hardware business at 30-45x NTM revenue has almost no precedent for sustainable public market valuation; multiple compression over 12-24 months is the probabilistic base outcome | First two quarterly earnings beats with >50% YoY growth AND confirmed OpenAI/AWS revenue contribution, establishing a growth track record public markets can price |
Thesis and anti-thesis are evidence-supported. Neither is a speculative claim. Weight of evidence slightly favors the anti-thesis in the near term due to concentration risk and multiple stretch.
[CV024, CV025, CV026, CV027, CV028, CV029]Directed flow showing how five evidence pillars — market growth, product proof, customer pipeline, financial profile, and risk structure — combine with valuation discipline to produce the conditional TRACK recommendation at IPO.
[CV012, CV013, CV014, CV024, CV025, CV027]8.2 Recommendation and Stance
The recommendation is conditional TRACK at the IPO price of $185 per share. The evidence supports patience: Cerebras has an exceptional revenue growth record, a genuinely differentiated product, and two large downstream contracts (OpenAI, AWS) that could catalyze customer diversification. But the IPO price implies 30-45x NTM revenue — a multiple that assumes both continued 50%+ revenue growth and successful execution of the OpenAI and AWS contracts simultaneously. Neither is confirmed in current financial statements. Confidence is medium. The company is now public, which means quarterly financial disclosures will close many information gaps within 6-12 months. The S-1/A filing provides full revenue history, confirms the UAE concentration risk, and discloses the dual-class voting structure. The IPO underwriting by Goldman Sachs, Morgan Stanley, and Citigroup provides third-party demand validation. However, key unknowns remain: operating margins are not disclosed at the segment level, the OpenAI MRA drawdown conditionality is not fully specified, and export control compliance for UAE customers under the current BIS regulatory regime has not been publicly confirmed for post-IPO operations. Risk rating is high. Three independent shocks — UAE concentration, export controls, and Nvidia competitive displacement — each carry sufficient impact to break the base case thesis. Valuation stance is aggressive. A patient AI hardware investor with 3-5 year horizon and high risk tolerance can underwrite the bull case (Cerebras reaches $1.5-2B revenue by FY2027, OpenAI and AWS execute at scale, 15-20x NTM = $22-40B market cap = 2-3x from IPO price). The base case targets $900M-1.1B NTM revenue at 12-15x = $11-17B enterprise value — approximately flat to 10-15% downside from IPO pricing depending on total share count. The bear case ($500-700M revenue, 8-10x multiple, $4-7B market cap) implies severe downside from IPO price. This asymmetry (limited base case upside, substantial bear case downside, compelling bull case) defines the TRACK stance: monitor execution closely, establish a position only at material discount to IPO price or with confirmed evidence of diversification, and maintain strict kill triggers.[CV008, CV009, CV010, CV012, CV013, CV014]
| Decision field | Current view | Decision implication |
|---|---|---|
| Recommendation | TRACK (conditional) | Establish position only at material discount to IPO or after diversification evidence confirmed; do not buy at $185 price-insensitively. |
| Confidence | Medium | Public company with S-1/A disclosure; quarterly filings will close gaps within 2-3 quarters but segment-level margins and contract details remain opaque. |
| Risk rating | High | UAE concentration (86% revenue), export control regime, Nvidia competitive displacement, and hardware multiple compression are each individually sufficient to break the thesis. |
| Valuation stance | Aggressive / stretched | 30-45x NTM revenue at IPO implies near-perfect growth execution; limited multiple-expansion runway and material multiple-compression downside. |
| Target return / hold posture | Bull 2-3x in 3-5 years; base flat to -10%; bear -60 to -75% | Return distribution is positively skewed but right-tail heavy; suitable for concentrated positions only within a diversified AI-hardware sleeve. |
| Price discipline | No price-insensitive buy at $185; entry below $130 (7x discount) starts to price risk adequately | At $130/share and $1B NTM revenue trajectory, multiple compresses to ~$15B — more defensible vs. public hardware comps at 15x NTM. |
Recommendation is price-sensitive. The TRACK call reflects $185 IPO price; a 30%+ pullback would shift the posture toward conditional BUY pending confirmation of UAE diversification.
[CV012, CV013, CV014, CV015, CV022, CV023]Investment committee scoring across 8 dimensions: market, product, customers, financials, competition, risk, valuation, and evidence quality. Scale 1-5 (5 = best).
[CV012, CV013, CV014, CV024, CV027, CV041]8.3 Financing and Valuation Context
Cerebras completed its IPO on May 14, 2026 at $185 per share, raising $5.55 billion in gross proceeds. This is the largest US technology IPO since Uber in 2019. The S-1/A Amendment No. 2, filed in May 2026, discloses FY2025 revenue of $510 million, FY2024 revenue of $290.3 million, and FY2023 revenue of $78.7 million — a compound growth rate that few hardware companies have achieved at this scale. Pre-IPO, Cerebras raised approximately $720 million in total equity across multiple rounds, including a Series F of approximately $250 million in November 2021 at a $4 billion valuation, which was the reference mark when the original 2024 S-1 was filed. The original S-1 filed in September 2024 sought a valuation of approximately $4 billion at trailing 12-month revenue of $136.4 million (H1 2024) — roughly 29x trailing revenue. That S-1 was withdrawn in November 2024 after weak institutional feedback, consistent with the AI chip market multiple compression visible in the sector during late 2024. The successful 2026 IPO at approximately 30-45x NTM revenue (depending on total shares outstanding and FY2026 revenue trajectory) represents a meaningful step-up from the 2021 Series F reference mark and from the attempted 2024 IPO price. The step-up is justified by the revenue growth (from $136M trailing in H1 2024 to $510M FY2025) but it does embed an elevated multiple premium relative to comparable public hardware peers. The IPO preference structure is publicly disclosed via the S-1/A. The dual-class share structure preserves approximately 99.2% of voting power for founders, limiting external governance influence. Dilution from IPO shares and potential employee stock plan exercises will need to be tracked over the first four post-IPO quarters. No convertible debt or preferred equity overhang is disclosed in the public filings, which is a positive relative to many pre-IPO tech issuers. The $5.55 billion raised provides 10+ years of operating runway at current burn rates, effectively removing any near-term financing risk from the investment thesis.[CV001, CV002, CV010, CV011, CV015, CV016]
| Comparable | Metric | Multiple / Valuation | Relevance | Limitation |
|---|---|---|---|---|
| Nvidia (NVDA) | NTM revenue multiple (May 2026) | ~25x NTM revenue; ~$3.3T market cap | Primary public AI hardware comp; dominant AI chip benchmark; sets ceiling for hardware sector multiples | NVDA is 200x larger by revenue, has 70%+ gross margins, and software ecosystem moat; not directly comparable at scale |
| AMD (AMD) | NTM revenue multiple (May 2026) | ~8x NTM revenue; ~$230B market cap | Semiconductor peer with AI chip exposure (MI300X); sets hardware-realistic floor for multiple range | Lower AI chip share than Cerebras growth rate; CUDA-alternative software stack is weaker; different customer mix |
| Groq (private, LPU inference) | Last disclosed valuation vs. estimated ARR | ~$2.8B at ~$750M ARR est. = ~3.7x ARR | Most direct private inference-chip peer; near-identical customer use case (LLM inference API) | Revenue estimate is analyst-derived, not disclosed; Groq has no hardware sales channel, only API; no enterprise data |
| SambaNova Systems (private, RDU) | Last disclosed valuation vs. estimated ARR | ~$5.1B at ~$450M ARR est. = ~11x ARR | Private AI chip company at similar revenue scale; provides mid-range private market multiple reference | Revenue estimate unconfirmed; RDU architecture has different workload profile than WSE; limited liquidity comp |
| Cerebras original S-1 (Sept 2024, withdrawn) | Sought valuation vs. trailing H1 2024 revenue | ~$4.0B at $136.4M H1 2024 trailing = ~29x trailing | Self-referential historical anchor; shows progression from $4B ask in 2024 to $14-40B IPO in 2026 | Withdrawn IPO is adverse signal for original $4B; higher 2026 price is justified only by revenue compounding |
| Tenstorrent (private, RISC-V accelerator) | Last disclosed valuation (2025 funding) | ~$2.6B (2025); revenue undisclosed | Emerging hardware competitor with $693M 2025 funding; validates AI chip sector VC appetite at lower scale | Pre-revenue or very early stage revenue; different architecture; not a valid multiple reference without financials |
Private company multiples are analyst estimates from third-party sources. All public company multiples derived from public market data as of May 16, 2026.
[CV016, CV017, CV018, CV019, CV020, CV035]Range chart showing low/mid/high enterprise value estimates in bear, base, and bull scenarios. Entry reference: IPO implied market cap of approximately $14-20B at $185/share.
[CV021, CV022, CV023]8.4 Bull, Base, and Bear Scenarios
The bull case assumes Cerebras executes on both the OpenAI MRA and AWS term sheet simultaneously, growing FY2026 revenue to $900M and FY2027 revenue to $1.5-2B as inference-at-scale demand compounds. In this scenario, the UAE concentration falls below 40% by FY2027 as OpenAI, AWS, and domestic enterprise clients ramp. Revenue multiples in the 15-20x range apply as the company demonstrates durable operating leverage and Nvidia displacement in latency-sensitive inference workloads. At 17.5x $2B revenue, the implied enterprise value is $35B — approximately 2.5x from the $185 IPO price on a $14B market cap (using a 20% IPO float estimate). Probability of bull case: approximately 20-25%. The base case assumes the OpenAI MRA begins contributing revenue in H2 2026 but at below-plan rates, UAE revenue holds flat as MBZUAI and G42 extend existing contracts, and the AWS term sheet closes but takes 12-18 months to generate meaningful revenue. FY2026 revenue reaches $700-900M (40-75% growth), FY2027 reaches $900M-1.1B. At 12-14x NTM revenue of $1B, the enterprise value is $12-14B — approximately 7-14% downside to the implied market cap from IPO price. Probability of base case: approximately 50%. The bear case assumes US BIS export controls are tightened to restrict UAE AI chip sales, materially impacting MBZUAI and G42 revenue in H2 2026. FY2026 revenue contracts to $400-500M as the pipeline proves insufficient to replace UAE decline. The OpenAI MRA drawdown is delayed by counterparty execution challenges. Revenue multiple compresses to 8-10x as hardware-first peers in public markets struggle. At 9x $500M revenue, the enterprise value is $4.5B — representing 65-75% downside from the IPO implied market cap. Probability of bear case: approximately 25-30%.[CV021, CV022, CV023, CV005, CV006, CV029]
| Dimension | Bear case | Base case | Bull case |
|---|---|---|---|
| Trigger event | BIS restricts UAE AI chip sales; MBZUAI/G42 revenue contracts | UAE revenue stable; OpenAI MRA ramps slowly; AWS closes | OpenAI MRA executes fully; AWS accelerates; enterprise pipeline diversifies |
| FY2026 revenue | $400-500M (YoY decline or flat) | $700-900M (40-75% YoY growth) | $950M-1.1B (90-115% YoY growth) |
| FY2027 revenue | $400-600M (stagnation as pipeline is replaced) | $900M-1.1B (continued compound growth) | $1.5-2B (full OpenAI + AWS + enterprise ramp) |
| Revenue multiple (NTM) | 8-10x (hardware multiple compression) | 12-14x (mid-premium for growth-stage hardware) | 17-20x (frontier AI infrastructure scarcity premium) |
| Implied enterprise value | $3.2-6B (severe downside from IPO) | $11-15B (roughly flat to -15% from IPO) | $25-40B (2-3x return from IPO) |
| Key risk in scenario | Export enforcement action; peer-sovereign customers unwilling to re-engage | Slower-than-expected OpenAI/AWS ramp; hardware margin compression | Nvidia closes performance gap; CUDA optimization eliminates throughput differential |
| Probability signal | ~25-30% | ~50% | ~20-25% |
| IRR (3-year hold from IPO) | -25% to -40% annually | -5% to +5% annually (flat) | +25% to +40% annually |
All scenarios are gross estimates pre-dilution. Lock-up expiry selling pressure (90-180 days post-IPO) will affect near-term price discovery independent of scenario direction.
[CV021, CV022, CV023, CV029, CV030, CV033]Bar chart showing enterprise value sensitivity (USD millions) to NTM revenue and revenue multiple combinations in bear, base, and bull scenarios.
[CV021, CV022, CV023]8.5 Comparable Set and Methodology
The appropriate valuation methodology for Cerebras is a revenue multiple framework anchored to NTM revenue estimates, with scenario sensitivity to multiple compression and revenue growth deceleration. At $510M FY2025 revenue and 76% growth, the company is too large for pure ARR multiple approaches used in early-stage SaaS but too hardware-dependent for pure software SaaS multiples. A blended comparable set — public AI hardware companies, AI cloud infrastructure peers, and comparable private rounds — is the appropriate framework. Public hardware comparables: Nvidia (NVDA) trades at approximately 25x NTM revenue as of May 2026 on fiscal year 2026 data center revenue of $47B+, implying a premium that reflects Nvidia's dominant market position, >70% gross margins in data center, and software ecosystem moat. AMD (AMD) trades at approximately 8x NTM revenue, reflecting lower AI chip market share and more commodity exposure. These two endpoints bracket the hardware peer range; Cerebras at 30-45x NTM would require sustained Nvidia-like market share trajectory — defensible only if inference workloads shift materially to WSE architecture, which is not yet confirmed at scale. Private AI chip comparables: Groq's last disclosed valuation of approximately $2.8 billion at an estimated $750 million ARR implies a 3.7x ARR multiple — dramatically below Cerebras's IPO multiple, reflecting Groq's smaller scale, narrower customer base, and no public market liquidity premium. SambaNova Systems at an estimated $5.1 billion valuation at $450M ARR estimated implies approximately 11x ARR — closer to Cerebras's range. Tenstorrent was valued at approximately $2.6 billion in its 2025 funding round, significantly smaller scale. The original Cerebras S-1 (September 2024) sought approximately $4 billion at $136M trailing revenue (29x trailing), providing a historical reference point. The successful 2026 IPO at $185/share represents a significant premium to that prior reference, justified primarily by the $374 million revenue addition in 18 months and the OpenAI MRA announcement. The methodology most defensible for investors is NTM revenue multiple with explicit scenarios for both the multiple (8-20x range) and revenue ($700M-$2B range) — the 2x revenue uncertainty alone spans a $5.6-16B valuation swing at 8x, highlighting how sensitive returns are to growth execution.[CV016, CV017, CV018, CV019, CV020, CV034]
8.6 Exit Readiness and Diligence Asks
Cerebras is now a public company, so the traditional "exit readiness" framing shifts to three related questions: (1) Is the IPO price a durable entry point, or will post-IPO price discovery reveal a more attractive secondary market entry? (2) What are the specific thesis-break events that should trigger position reduction? (3) What diligence remains outstanding that the first quarterly earnings call (expected Q3 2026) should resolve? For existing pre-IPO shareholders, the lock-up expiry (typically 90-180 days post-IPO) will create a supply-side event. Given founders hold 99.2% of voting power, insider selling decisions at lock-up expiry will be the most important near-term price catalyst and need to be monitored. Management commentary on the OpenAI MRA drawdown cadence, the AWS term sheet closing timeline, and export control compliance status for UAE customers in the first quarterly earnings call will determine whether the base case trajectory is on track. Investors should define in advance what FY2026 Q1 and Q2 revenue numbers constitute "on track" vs. "at risk" before receiving earnings data. Key thesis-break triggers include: any confirmation that BIS has restricted or intends to restrict UAE AI chip sales post-IPO (which would directly impact the two largest customers); FY2026 Q1-Q2 revenue growth below 30% YoY (which would signal UAE dependency is not being diversified fast enough); any delay in the OpenAI MRA converting to actual revenue beyond Q3 2026; or any competitive announcement from Nvidia that directly addresses the Cerebras inference throughput advantage with equivalent economics. Final diligence asks for investors entering at or near IPO price include: segment-level gross margin disclosure to understand if hardware vs. cloud subscription economics differ materially; contract structure disclosure for the OpenAI MRA (minimum commitments vs. optional purchase rights vs. take-or-pay); export control legal opinion on UAE customer status post-IPO; and the specific technology roadmap for WSE-4 to confirm R&D investment continuity at the $1.7B+ invested-to-date level.[CV032, CV033, CV037, CV040, CV041, CV042]
| Trigger event | Threshold / signal | Transmission to thesis | Action implication |
|---|---|---|---|
| BIS export enforcement or new restriction on UAE AI sales | Official BIS rule, Entity List addition, or enforcement action restricting MBZUAI/G42 compute procurement | 86% of FY2025 revenue directly at risk; bear case triggers immediately; FY2026 revenue guidance becomes undeliverable | SELL immediately on any confirmed restriction; do not wait for earnings impact to manifest |
| FY2026 Q1-Q2 revenue below 30% YoY growth | Quarterly revenue below $128M (Q1) or $145M (Q2) — implying <30% YoY on same-quarter FY2025 basis | Diversification is not offsetting UAE baseline; OpenAI/AWS ramp is insufficient; base case deteriorates to bear | REDUCE — if two consecutive quarters miss, shift to bear case probability weighting and reassess position size |
| OpenAI MRA delayed beyond Q4 2026 without revenue contribution | No OpenAI-attributed revenue in FY2026 full-year results (disclosed at Q4 2026 earnings) | Bull case pipeline dependent on MRA is not executing; thesis rests entirely on UAE customers that are concentration risk | REVIEW — recalibrate to base case; if UAE share remains above 70% with no MRA progress, shift to SELL |
| Competitive Nvidia announcement targeting Cerebras inference use case | Nvidia releases B200 or NVL72-based inference product matching Cerebras 2100 token/sec at equivalent price-per-token | Primary product differentiation eliminated; hardware multiple compresses to AMD-level (8x); bull case vanishes | REDUCE — monitor 2 quarters for enterprise adoption of competing product; reassess if Cerebras loses a major inference contract |
| Founder / key engineer departure | CEO Andrew Feldman or CTO Sean Lie departure, or loss of 5+ WSE architects within a 12-month window | Architectural moat in WSE depends on founder-led design team continuity; key-person departure accelerates knowledge decay | WATCH 2 quarters — if departure is confirmed, request management explanation; reduce position if R&D velocity declines measurably |
All trigger thresholds are monitoring criteria, not automatic trading signals. Each event requires independent verification before action. Institutional investors should define pre-set responses before IPO lock-up expiry.
[CV029, CV030, CV032, CV033, CV042]| Topic | Missing evidence | Why it matters | Owner / diligence path |
|---|---|---|---|
| Segment-level gross margins (hardware vs. cloud) | S-1/A does not disclose gross margin by segment; total blended gross margin is disclosed but hardware vs. Cerebras Cloud API split is unknown | If hardware gross margins are <30% and cloud margins are >70%, the mix shift toward cloud API is far more valuable than headline revenue suggests; affects multiple justification | First quarterly earnings call (Q3 2026) — management discussion should provide gross margin by segment; if not, submit IR request |
| OpenAI MRA contract structure (minimum vs. optional commits) | The $20B+ MRA has not been filed publicly; the S-1/A mentions it as a master revenue agreement but does not specify minimum purchase commitments, take-or-pay structure, or quarterly drawdown conditionality | If MRA is purely optional (no minimum commits), the $20B headline is not a committed pipeline; bear case probability increases substantially | Review Form 8-K if filed as material contract; if not yet filed, request contract summary from IR or review next SEC quarterly filing |
| BIS export compliance opinion for UAE customers post-IPO | No legal opinion or compliance certification has been disclosed regarding BIS advanced AI chip export licenses for MBZUAI and G42 under current October 2023 rule or any proposed successor rule | Export compliance failure would immediately trigger Revenue risk in bear case; without confirmed legal clearance, the 86% UAE concentration is an unquantified regulatory overhang | Request outside counsel BIS opinion through data room or investor FAQ; confirm MBZUAI/G42 supply chain does not require new licenses under any pending BIS rule revision |
| Lock-up expiry insider selling plans | Lock-up schedule and any Rule 10b5-1 plans for founder/executive share sales have not been disclosed pre-IPO; mandatory post-lock-up disclosures are typically Form 4 filed 2 business days after sale | Lock-up expiry (90-180 days post-IPO = August-November 2026) will create supply pressure; knowing management's selling intention before expiry allows better entry point management | Monitor SEC EDGAR Form 4 filings from founders and executives; review any pre-announced 10b5-1 plan disclosures in the prospectus or S-1/A amendment filings |
| WSE-4 roadmap and TSMC capacity allocation | The S-1/A references WSE-3 on TSMC 5nm; WSE-4 architecture, schedule, and TSMC allocation status for next-generation silicon are not disclosed | Hardware companies with 18-24 month product cycles face competitive relevance risk if next-generation silicon is delayed; TSMC capacity constraints are a live risk given Apple/Nvidia/AMD priority allocation | Request technology roadmap briefing from management at investor conference; confirm TSMC allocation and tape-out schedule for WSE-4 |
Items 1-3 are material for position sizing at IPO price. Items 4-5 are incremental information. All should be resolved within the first two post-IPO earnings cycles.
[CV037, CV040, CV042]8.7 Exhibits
Disclaimer
This report is a public-evidence diligence snapshot, not investment advice. Important financial, legal, technical, and contractual facts remain non-public and should be verified directly with management and primary documents before any investment decision.
Evidence index
| ID | Statement | Confidence | Sources |
|---|---|---|---|
| CO001 | Cerebras Systems Inc. was incorporated in Delaware in April 2016 and maintains its principal offices at 1237 E. Arques Avenue, Sunnyvale, California 94085. | High | SO001, SO002 |
| CO002 | Cerebras business model spans two revenue streams: direct hardware sales of CS-3 clusters and cloud-based inference services via the Cerebras Inference Cloud API. | Medium | SO004, SO005, SO006 |
| CO003 | Andrew D. Feldman co-founded Cerebras and serves as CEO; he previously founded SeaMicro which AMD acquired for $334 million in 2012 and then served as AMD VP of Server Business. | High | SO001, SO004 |
| CO004 | Sean Lie co-founded Cerebras and serves as CTO; he holds a B.S. from MIT in EECS and previously led high-performance interconnect architecture at AMD. | High | SO001, SO004 |
| CO005 | Dhiraj Mallick serves as COO and Komin serves as CFO; Komin previously served as CFO at Sunrun, Flurry, Ticketfly, and Linden Research. | Medium | SO001 |
| CO006 | Gary Lauterbach is a Cerebras co-founder who contributes to technical leadership, having worked alongside Feldman and Lie at SeaMicro and AMD. | Medium | SO001, SO004 |
| CO007 | Cerebras priced its IPO at $185 per share on May 14, 2026, raising approximately $5.55 billion in gross proceeds on the Nasdaq Global Select Market under ticker CBRS. | High | SO001, SO013 |
| CO008 | On its first trading day May 14, 2026, Cerebras stock opened at approximately $350 and closed at approximately $311, a gain of roughly 68% from the $185 offer price. | Medium | SO013, SO016 |
| CO009 | Bloomberg described the Cerebras IPO as the largest U.S. technology IPO since Uber went public in 2019, per VentureBeat reporting. | Medium | SO016, SO015 |
| CO010 | Cerebras revenue grew from $78.7 million in 2023 to $290.3 million in 2024 and $510.0 million in 2025, representing 76% year-over-year growth in fiscal 2025. | High | SO001, SO002 |
| CO011 | Cerebras gross margins expanded from 12% in 2022 to 33% in 2023 and 42% in 2024, then compressed to 39% in 2025. | High | SO001, SO002 |
| CO012 | Cerebras reported GAAP net income of $237.8 million in 2025, a dramatic reversal from a $481.6 million GAAP net loss in 2024. | High | SO001, SO002 |
| CO013 | Cerebras non-GAAP net loss in 2025 was $75.7 million after adjusting for stock-based compensation and non-cash warrant fair-value gains, indicating the business remains pre-profitable on a cash-adjusted basis. | Medium | SO001 |
| CO014 | Cerebras headcount was 708 as of December 31, 2025, and grew to approximately 1,000 employees by March 2026. | Medium | SO001, SO002 |
| CO015 | G42 (Abu Dhabi) has invested a cumulative $335 million in Cerebras and represented 85% of 2024 revenue and 24% of 2025 revenue, making it the largest historic customer-investor. | High | SO001, SO002 |
| CO016 | MBZUAI (Mohammed Bin Zayed University of Artificial Intelligence, UAE) accounted for 62% of 2025 revenue and represented 77.9% of accounts receivable as of December 31, 2025. | High | SO001, SO002 |
| CO017 | UAE-linked entities G42 and MBZUAI together represented approximately 86% of Cerebras 2025 revenues, creating material geopolitical and counterparty concentration risk. | Medium | SO001, SO017 |
| CO018 | In December 2025, Cerebras signed a Master Revenue Agreement with OpenAI valued at over $20 billion with 750 megawatts of capacity allocation. | High | SO001, SO002 |
| CO019 | OpenAI holds a warrant for 33.4 million Cerebras shares at an exercise price of $0.00001 per share tied to the master revenue agreement, representing material potential dilution. | Medium | SO001 |
| CO020 | Cerebras received a $1 billion Working Capital Loan from OpenAI at 6% annual interest maturing December 31, 2032. | High | SO001, SO002 |
| CO021 | In March 2026, Cerebras signed a binding term sheet with AWS granting approximately 2.7 million warrant shares at a $100 exercise price in connection with hardware procurement commitments. | Medium | SO001 |
| CO022 | Cerebras raised approximately $720 million in venture capital prior to its IPO across multiple rounds from founding through Series H. | Medium | SO001, SO010 |
| CO023 | Cerebras dual-class share structure grants Class B shares 20 votes per share; following the IPO Class B holders retained approximately 99.2% of total voting power. | High | SO001, SO002 |
| CO024 | The WSE-3 chip contains 4 trillion transistors on a 46,225 mm squared wafer-scale die manufactured on TSMC 5nm with 900,000 AI cores, 44 GB of on-chip SRAM, and 21 petabytes per second of on-chip memory bandwidth. | High | SO001, SO027 |
| CO025 | The Wall Street Journal framed the Cerebras IPO as a huge bet on Nvidia fatigue characterizing investor demand as partly a contrarian bet against NVIDIA GPU dominance. | Medium | SO017 |
| CO026 | Cerebras is a fabless chip designer: all WSE-3 wafer fabrication is outsourced to TSMC and the company maintains no in-house silicon manufacturing capability. | Medium | SO001 |
| CO027 | Cerebras inference cloud API delivers throughput of up to 2,100 tokens per second which the company advertises as up to 15 times faster than comparable NVIDIA GPU-based inference deployments. | Medium | SO006, SO007 |
| CO028 | The combined equity component of the Cerebras Series H round and the OpenAI $1 billion Working Capital Loan represent the most significant pre-IPO capitalization events. | Medium | SO001 |
| CO029 | Cerebras filed its initial S-1 with the SEC in September 2024 followed by an amended S-1 in April 2026 and a final S-1/A in May 2026 ahead of the May 14 IPO. | High | SO003, SO002, SO001 |
| CO030 | Cerebras GitHub organization hosts open-source model repositories and tooling libraries reflecting investment in the developer community and model accessibility. | Medium | SO018, SO025 |
| CO031 | GSK deployed Cerebras CS-3 hardware to train an RNA model reporting a 10x speedup compared to prior GPU-based training approaches per GSK press release. | Medium | SO024 |
| CO032 | SiliconAngle reported that Cerebras initially targeted an IPO price range of $150 to $160 per share before finalizing pricing at $185. | Medium | SO022 |
| CO033 | The New Stack assessed the Cerebras IPO as emblematic of the 2026 AI infrastructure investment wave citing unprecedented investor demand relative to the initial price range. | Low | SO023 |
| CO034 | Cerebras HuggingFace organization hosts model weights for the Cerebras-GPT series from 111M to 13B parameters supporting open-source AI research and developer adoption. | Medium | SO025 |
| CO035 | Cerebras stock retreated from its day-one close of approximately $311 to $279.72 on May 15, 2026 reflecting post-IPO profit-taking by early investors. | Medium | SO013, SO014 |
| CO036 | The Cerebras S-1/A identifies Andrew Feldman as a key-person risk noting that loss of the CEO would impair the company technical roadmap and customer relationship management. | Medium | SO001 |
| CO037 | The Cerebras IPO raised approximately $5.55 billion in gross proceeds with net proceeds to the company after underwriting discounts estimated at roughly $5.1 to $5.2 billion. | Medium | SO001, SO012 |
| CO038 | Cerebras raised approximately $720 million in pre-IPO venture capital across Series A through Series H; individual round sizes for Series A through Series G are not fully disaggregated in the S-1/A. | Medium | SO001 |
| CM001 | The total addressable market for AI infrastructure was approximately $251 billion in 2025 and is projected to reach $672 billion by 2029, a 28% CAGR, per the Cerebras SEC S-1/A Amendment No. 2. | High | SM018, SM019 |
| CM002 | The AI infrastructure market spans two primary segments — AI training infrastructure and AI inference infrastructure — with inference growing faster as enterprise LLM deployment moves to production. | Medium | SM001, SM018, SM012 |
| CM003 | Excluded from the Cerebras addressable market are general-purpose cloud compute, enterprise SaaS AI features, and AI networking; Cerebras' TAM covers AI accelerator hardware and AI inference cloud services only. | Medium | SM018, SM020, SM021 |
| CM004 | NVIDIA holds an estimated 70-85% market share in AI training accelerators by revenue; AMD MI300X and Intel Gaudi 3 are the primary non-custom alternatives, while Google TPU and AWS Trainium serve their respective cloud ecosystems. | Medium | SM001, SM005, SM024 |
| CM005 | The global AI chip and accelerator market reached approximately $120 billion in revenue in 2025, more than doubling from approximately $55 billion in 2023, per SIA data. | Medium | SM005 |
| CM006 | Cerebras reported $510 million in revenue for fiscal year 2025 and $290.3 million for fiscal year 2024, per SEC S-1/A filing; this actual revenue serves as the SOM proxy. | High | SM018, SM019 |
| CM007 | AMD MI300X features 192GB HBM3 memory with 8 TB/s bandwidth, competitive with NVIDIA H100 for memory-intensive inference; AMD AI revenue exceeded $5 billion in 2024, establishing AMD as the primary credible NVIDIA alternative. | Medium | SM008, SM019 |
| CM008 | Google's TPUv5 provides custom silicon optimized for TensorFlow and JAX workloads; Google offers TPU access via Google Cloud but does not sell TPU hardware directly, and uses TPUs for all internal AI products including Gemini. | High | SM009, SM018 |
| CM009 | AWS Trainium2 is Amazon's second-generation custom AI training chip deployed internally and offered via Amazon SageMaker HyperPod; no external hardware purchase option exists. | High | SM010, SM018 |
| CM010 | Intel Gaudi 3 targets mid-tier AI training and fine-tuning at an estimated 30-40% lower cost than NVIDIA H100, positioning Intel as the price-accessible alternative for enterprises that cannot justify NVIDIA premium pricing. | Medium | SM011, SM019 |
| CM011 | Cerebras Inference Cloud delivers throughput exceeding 2,100 tokens per second on Llama 3.1 70B, reportedly 15x faster than equivalent NVIDIA H100 cluster deployments, giving Cerebras the leading throughput claim in the inference cloud market. | Medium | SM020, SM021, SM014 |
| CM012 | EpochAI analysis shows frontier model training compute requirements have grown approximately 4x per year between 2020 and 2025, from GPT-3 to GPT-4 and Gemini-scale models, implying commensurate hardware demand growth. | Medium | SM002 |
| CM013 | Stanford HAI AI Index 2025 estimates total AI investment reached approximately $100 billion in 2024, the largest annual AI investment on record, validating secular demand for AI infrastructure. | Medium | SM003 |
| CM014 | McKinsey 2024 State of AI finds 72% of organizations have adopted AI in at least one business function, up from 55% in 2023, indicating broad enterprise AI adoption momentum that will drive inference infrastructure demand. | Medium | SM001 |
| CM015 | The AI inference cloud market is served by OpenAI API, Anthropic Claude, Google Gemini API, and specialized providers including Cerebras Inference, Together AI, Groq, and Fireworks AI; competition is on token price, throughput, and model catalog breadth. | Medium | SM021, SM023, SM025 |
| CM016 | Cerebras WSE-3's 44GB of on-chip SRAM eliminates the external HBM memory bandwidth bottleneck in GPU-based accelerators; for sub-70B models fitting entirely within SRAM, this removes the primary inference latency source in NVIDIA deployments. | Medium | SM014, SM020, SM019 |
| CM017 | US BIS export control regulations restrict NVIDIA A100/H100-equivalent chips to China and certain Middle East destinations; the October 2023 and 2024 rule updates extended restrictions and created compliance requirements for UAE-headquartered AI entities. | High | SM018, SM019 |
| CM018 | MLCommons inference benchmarks confirm Cerebras WSE-3 achieves competitive performance on standard LLM inference workloads; Cerebras claims leadership on Llama 3.1 70B throughput vs. NVIDIA H100. | Medium | SM004, SM020 |
| CM019 | Hyperscaler AI capital expenditure is projected to exceed $300 billion in 2026; Microsoft, Google, Amazon, and Meta collectively account for the majority of global AI chip purchases; Cerebras' OpenAI MRA represents entry into this buyer tier. | Medium | SM018, SM023, SM025 |
| CM020 | Cerebras charges $0.60 per million input tokens for Llama 3.1 8B and $0.85/MTok for 70B on its inference cloud API, competitive with Groq and Together AI pricing per the public pricing page. | Medium | SM021 |
| CM021 | Cerebras SEC S-1/A Amendment No. 2 discloses an AI infrastructure TAM of $251 billion in 2025 growing to $672 billion by 2029, sourced from a management-cited third-party market research report. | High | SM018, SM019 |
| CM022 | Enterprise AI is transitioning from model training (done by AI labs) to production inference (deployed in enterprise applications), structurally expanding the inference hardware market relative to training over 2025-2029. | Medium | SM001, SM018 |
| CM023 | Open-source AI models including Meta Llama 3, Mistral, and Alibaba Qwen enable enterprises to deploy custom fine-tuned inference without proprietary API lock-in, driving demand for on-premise and cloud inference infrastructure including Cerebras. | Medium | SM018, SM023, SM024 |
| CM024 | Sovereign AI programs — UAE National AI Strategy 2031, Saudi Arabia Project Transcendence, India IndiaAI Mission — represent government-mandated investments in domestic AI infrastructure with long budget horizons and low price sensitivity. | Medium | SM016, SM018, SM019 |
| CM025 | G42 invested $335 million in Cerebras and represented 85% of 2024 revenue; MBZUAI represented 62% of 2025 revenue and 77.9% of accounts receivable at December 31, 2025, making UAE sovereign AI the dominant demand driver for Cerebras. | High | SM018, SM019 |
| CM026 | No direct competitor exists in the wafer-scale AI chip segment; the WSE-3 is the only commercially available full-wafer AI accelerator, differentiating it from all GPU-based (NVIDIA, AMD) and chiplet-based designs. | Medium | SM014, SM020, SM019 |
| CM027 | TSMC 5nm underpins the Cerebras WSE-3 wafer-scale chip; reticle-stitching and packaging at 46,225 mm² wafer scale required specialized TSMC partnership solutions not available to competitors without similar investment. | Medium | SM017, SM014, SM019 |
| CM028 | Cerebras Inference Cloud (cloud.cerebras.ai) supports Llama 3.1 8B, 70B, and 405B, Llama 4 Scout, and additional open models via API, positioning Cerebras as both a hardware vendor and an inference cloud operator. | Medium | SM021, SM018 |
| CM029 | AI model parameter counts have grown from approximately 110 million (BERT, 2018) to an estimated 1-2 trillion (GPT-4, 2023), requiring proportionally more inference compute per request and supporting demand for high-throughput accelerators. | Medium | SM002, SM012 |
| CM030 | SIA estimates global AI chip revenues exceeded $120 billion in 2025, more than doubling from $55 billion in 2023, driven primarily by hyperscaler demand for NVIDIA H100 and Blackwell-generation accelerators. | Medium | SM005 |
| CM031 | Inference cloud providers compete on three dimensions — tokens/second throughput, cost per million tokens, and model catalog breadth; Cerebras leads on throughput for standard open models but offers a narrower model catalog than OpenAI or Anthropic. | Medium | SM021, SM025, SM023 |
| CM032 | Enterprise AI accelerator hardware procurement cycles range from 3 to 18 months for large clusters; inference cloud APIs can be adopted in hours with no procurement friction, making API-first the fastest enterprise adoption path. | Medium | SM018, SM022 |
| CM033 | IEA projects AI data center power demand reaching approximately 1,000 TWh annually by 2030, a structural constraint on AI infrastructure expansion in power-limited geographies. | Medium | SM018, SM001 |
| CM034 | Cerebras management estimates a SAM of $13.7 billion in 2025 growing to $51.5 billion by 2029, representing inference-first and direct hardware workloads addressable by CS-3 and Cerebras Inference Cloud, per SEC S-1/A. | High | SM018, SM019 |
| CM035 | The AI accelerator competitive landscape intensifies through 2029 as NVIDIA Blackwell (B200/GB200), AMD MI300X/MI350X, and Google TPUv5 all address overlapping segments; B200 delivers approximately 5x H100 inference throughput. | Medium | SM006, SM007, SM013 |
| CM036 | AMD's MI300X has gained enterprise traction at Microsoft Azure and Meta AI for memory-intensive inference, and AMD's AI revenue exceeded $5 billion in 2024, establishing it as the most viable non-Cerebras NVIDIA alternative. | Medium | SM008, SM019 |
| CM037 | Google TPU pods are offered as managed cloud services with no hardware purchase option; Google deploys TPUs internally for Gemini, Imagen, and NotebookLM, and makes them available externally via Google Cloud on pay-per-use basis. | High | SM009, SM018 |
| CM038 | VentureBeat's analysis of the Cerebras IPO interprets the stock nearly doubling on Day 1 as investor conviction that inference-optimized AI hardware represents a distinct, defensible market segment from general-purpose GPU computing. | Medium | SM022, SM023 |
| CP001 | NVIDIA holds an estimated 70–90% market share in AI accelerator deployments across hyperscaler and enterprise data centers as of 2026. | Medium | SP010, SP003 |
| CP002 | AMD Instinct MI300X generated over $1B in AI accelerator revenue in calendar year 2024, making it the first credible volume-scale GPU alternative to NVIDIA. | Medium | SP003, SP006 |
| CP003 | Intel Gaudi 3 targets roughly 2x better performance per dollar versus NVIDIA H100 on LLM training workloads, according to Intel's own white paper. | Low | SP019, SP007 |
| CP004 | Google TPU v5e and v5p are available on Google Cloud with per-chip-hour pricing but are primarily designed for internal Google workloads and offer no hardware ownership pathway. | Medium | SP021, SP003 |
| CP005 | AWS Trainium2 is available to external customers via EC2 Trn2 instances and SageMaker, with no model-format lock-in per AWS documentation. | Medium | SP020, SP005 |
| CP006 | SambaNova Cloud claims sub-200ms latency for LLM inference at scale, based on company-authored materials; no independent benchmark confirms this claim. | Low | SP003, SP012 |
| CP007 | Groq's LPU achieves over 500 tokens per second per chip on inference workloads for selected LLM models, per Groq's published benchmarks. | Low | SP003, SP012 |
| CP008 | Tenstorrent raised a $693M Series B in 2025 led by Samsung and Hyundai, making it the best-funded new RISC-V AI chip entrant in the market. | Medium | SP009, SP013 |
| CP009 | At least five custom silicon providers—Cerebras, SambaNova, Groq, Tenstorrent, and d-Matrix—are targeting training or inference workloads alongside NVIDIA, AMD, Intel, Google, and AWS as of 2026. | Medium | SP010, SP013 |
| CP010 | The status quo for most enterprises remains NVIDIA GPU clusters on-premises or through hyperscaler cloud, representing the baseline alternative against which all custom silicon must compete. | Medium | SP003, SP010, SP023 |
| CP011 | NVIDIA's data center segment generated approximately $47B in fiscal year 2025 revenue, representing the dominant share of the global AI accelerator market. | High | SP001, SP003, SP010 |
| CP012 | NVIDIA's GB200 NVL72 rack system delivers 1.4 exaflops of AI compute and 8TB/s of HBM3e memory bandwidth per GPU chip, per NVIDIA's product page. | Medium | SP017, SP016 |
| CP013 | AMD's Instinct MI300X accelerator features 192GB HBM3 memory with 5.3TB/s bandwidth per chip, giving it the largest HBM memory footprint among commercially available GPUs. | Medium | SP018, SP024, SP003 |
| CP014 | Cerebras WSE-3 provides 44GB on-chip SRAM with 21PB/s memory bandwidth per wafer, per Cerebras's product page and SEC S-1/A filing. | High | SP014, SP022 |
| CP015 | Cerebras WSE-3's on-chip SRAM bandwidth of 21PB/s exceeds AMD MI300X HBM3 bandwidth of 5.3TB/s by approximately 4000x and exceeds NVIDIA H100 HBM3 bandwidth of 3.35TB/s by approximately 6000x. | High | SP014, SP024, SP018 |
| CP016 | Cerebras's inference API publicly claims over 1 million tokens per second for Llama 3 70B inference on the Cerebras Cloud platform. | Medium | SP014, SP011 |
| CP017 | MLCommons's MLPerf v4.1 and subsequent inference benchmark rounds do not include a Cerebras WSE-3 submission, per the MLCommons results page as of May 2026. | High | SP025, SP012, SP022 |
| CP018 | Google TPU v5e and v5p are available on Google Cloud with on-demand and committed-use pricing, though they remain primarily optimized for Google's internal XLA/JAX framework. | Medium | SP021, SP005 |
| CP019 | SambaNova Systems raised approximately $1.1B in total disclosed funding through 2024 across multiple venture rounds, per Crunchbase and secondary news sources. | Medium | SP013, SP003 |
| CP020 | Groq offers LPU-as-a-service inference APIs at developer-accessible pricing with no hardware purchase requirement, targeting the same inference API market as Cerebras Cloud. | Medium | SP007, SP012 |
| CP021 | Cerebras Cloud inference pricing for Llama-3 70B starts at $0.60 per million input tokens as of the pricing page published in May 2026. | High | SP015, SP022 |
| CP022 | NVIDIA H100 cloud compute on major hyperscaler providers ranges from approximately $2.50 to $3.50 per GPU-hour as of May 2026, translating to a higher per-token cost at typical throughput rates versus Cerebras's inference API. | Medium | SP016, SP003 |
| CP023 | Cerebras achieves approximately 1,500 tokens per second single-stream on Llama 70B versus approximately 60–90 tokens per second on NVIDIA H100 at comparable batch size and precision, per Cerebras published benchmarks and AnandTech technical review. | Medium | SP014, SP024 |
| CP024 | NVIDIA's CUDA ecosystem comprises an estimated 4 million or more developer users with decades of optimization tooling, creating a substantial software ecosystem moat. | Medium | SP016, SP003 |
| CP025 | Customers building on Cerebras's proprietary WSE architecture face significant switching costs due to CTML compiler dependencies that are incompatible with NVIDIA CUDA, standard PyTorch builds, or common inference runtimes. | Medium | SP022, SP014 |
| CP026 | AMD ROCm software stack is CUDA-compatible and supports most PyTorch and TensorFlow workloads through HIPification, reducing customer migration friction relative to switching to Cerebras. | Medium | SP018, SP005 |
| CP027 | Intel Gaudi 3 supports PyTorch 2.x natively, reducing porting friction for customers migrating from NVIDIA GPUs compared to Cerebras's CTML compilation requirement. | Medium | SP019, SP007 |
| CP028 | AWS Trainium2 is accessed via SageMaker and EC2 Trn2 instances and does not impose proprietary model formats, limiting application-level lock-in compared to Cerebras CTML. | Medium | SP020, SP005 |
| CP029 | No commercial third-party benchmark suite (MLPerf, SPEC) has published a head-to-head comparison of Cerebras WSE-3 versus NVIDIA H100 or AMD MI300X at matched batch sizes and precision settings as of May 2026. | High | SP025, SP012, SP022 |
| CP030 | Customers migrating from NVIDIA GPU-based workloads to Cerebras are estimated to require 6–18 months of porting effort for production-grade LLM pipelines due to CTML compiler and memory hierarchy differences. | Low | SP022, SP025 |
| CP031 | Cerebras WSE-3 is manufactured on TSMC's 5nm process with a 46,225mm² die—the largest commercial chip by area—a wafer-scale manufacturing approach that no other commercial chipmaker currently replicates. | High | SP022, SP014, SP023 |
| CP032 | TSMC's dedication of an entire 5nm wafer to Cerebras creates a supply-side manufacturing barrier requiring new entrants to secure comparable TSMC agreements and develop equivalent defect-management expertise, typically a 5+ year timeline. | Medium | SP022, SP023 |
| CP033 | NVIDIA's brand recognition, software ecosystem depth, and deep hyperscaler integration give it superior distribution power relative to all custom silicon competitors including Cerebras. | High | SP001, SP003, SP010 |
| CP034 | NVIDIA's B200 SXM5 (Blackwell) delivers 8TB/s HBM3e memory bandwidth—more than 2x the H100's 3.35TB/s—significantly narrowing the memory bandwidth gap that Cerebras's WSE-3 previously held exclusively. | High | SP017, SP023, SP024 |
| CP035 | Analyst assessments of Cerebras's inference throughput advantage are mixed: some technical sources confirm a 10–20x advantage versus H100 at small batch sizes, while others indicate the advantage compresses significantly under heavily batched multi-user inference workloads. | Medium | SP024, SP003, SP011, SP012 |
| CP036 | NVIDIA launched NIM inference microservices in 2024, offering developer-accessible APIs for LLM inference that compete directly with Cerebras's inference API commercial model. | Medium | SP003, SP001 |
| CP037 | Cerebras WSE-3's 44GB on-chip SRAM limits native model capacity to approximately 60B parameters; models larger than this require multi-WSE-3 parallelism, reducing per-token efficiency. | Medium | SP022, SP014 |
| CP038 | Cerebras has no disclosed enterprise customer beyond MBZUAI, G42, GSK, and OpenAI (via MRA) that is independently verified by third-party press release or regulatory filing as of the May 2026 IPO date. | High | SP022, SP011, SP012 |
| CI001 | Cerebras reported revenues of $24.6M (2022), $78.7M (2023), $290.3M (2024), and $510.0M (2025), representing a 220% CAGR from 2022 to 2025 and 76% YoY growth in 2024–2025. | High | SI014, SI018 |
| CI002 | Cerebras's $510M FY2025 revenue makes it one of the highest-revenue AI chip start-ups at IPO; for comparison, NVIDIA's first year of revenue in the AI accelerator era was substantially lower. | Medium | SI019, SI006 |
| CI003 | MBZUAI (Mohammed bin Zayed University of Artificial Intelligence) accounted for approximately 62% of Cerebras's FY2025 revenue and 77.9% of accounts receivable at December 31, 2025, per SEC S-1/A. | High | SI014, SI018 |
| CI004 | G42, a UAE AI holding company that invested $335M in Cerebras, represented approximately 85% of 2024 revenue and approximately 24% of 2025 revenue, declining as MBZUAI grew. | High | SI014, SI019 |
| CI005 | Cerebras's disclosed revenue streams are CS-3 hardware system sales, Cerebras Cloud inference API, and professional services; no segment-level revenue breakdown is published as of the IPO. | Medium | SI014, SI021 |
| CI006 | Cerebras reported blended GAAP gross margin of approximately 42% in FY2024 and 39% in FY2025, representing a 3-point year-over-year compression. | High | SI014, SI018 |
| CI007 | Cerebras reported GAAP net income of approximately $237.8M in FY2025 and a non-GAAP net loss of approximately -$75.7M in the same period. | High | SI014, SI019 |
| CI008 | The approximately $313M gap between GAAP net income and non-GAAP net loss in FY2025 implies substantial non-cash adjustments, likely including stock-based compensation and warrant fair value changes. | Medium | SI014, SI019 |
| CI009 | Cerebras raised $1.1B in a Series G round in September 2025 at an $8.7B pre-money valuation, led by undisclosed investors. | High | SI011, SI013, SI017 |
| CI010 | Cerebras completed its Nasdaq IPO on May 14, 2026, pricing at $185/share and raising approximately $5.55B in gross proceeds; the stock opened at approximately $350 and closed at approximately $311, a first-day gain of approximately 68%. | High | SI014, SI023, SI017 |
| CI011 | OpenAI provided Cerebras with a $1B Working Capital Loan at 6% fixed interest, maturing December 31, 2032, as disclosed in the SEC S-1/A. | High | SI014, SI003 |
| CI012 | OpenAI holds 33.4 million Cerebras warrants at $0.00001 per warrant, and AWS holds approximately 2.7 million warrants at $100 per warrant, per SEC S-1/A disclosures. | High | SI014, SI009 |
| CI013 | Cerebras entered into a Master Revenue Agreement (MRA) with OpenAI in December 2025 for $20B+ in minimum revenue commitments over multiple years, covering 750 MW of compute capacity. | High | SI014, SI012, SI003 |
| CI014 | The OpenAI MRA draw schedule and conditionality clauses are not publicly disclosed in the SEC S-1/A; whether the full $20B+ commitment is unconditional or subject to performance milestones is unknown. | Low | SI014, SI012 |
| CI015 | Cerebras signed a binding term sheet with AWS in March 2026 for deployment of Cerebras infrastructure on AWS cloud, with specific commercial terms undisclosed. | Medium | SI009, SI014 |
| CI016 | At a 42% gross margin on hardware sales and a CS-3 price of approximately $2–4M per system, each CS-3 unit generates approximately $0.84M–$1.68M in gross profit, excluding TSMC advance payments and inventory carrying costs. | Low | SI014, SI021 |
| CI017 | Cerebras's cloud inference API price of $0.60 per million input tokens for Llama-3 70B is offered as a fully managed service with no hardware reservation or capacity commitment required. | High | SI021, SI014 |
| CI018 | Post-IPO, Cerebras is estimated to hold approximately $6.5B+ in combined liquidity including Series G residual cash, IPO gross proceeds, and the $1B OpenAI Working Capital Loan. | Medium | SI014, SI017 |
| CI019 | Cerebras's COGS is primarily driven by TSMC wafer fabrication costs for the WSE-3 chip; the company operates a fabless hardware model with no owned manufacturing facilities. | Medium | SI014, SI019 |
| CI020 | TSMC 5nm wafer cost for a 46,225mm² die is among the highest in the semiconductor industry; Cerebras has not disclosed per-wafer costs or manufacturing yield rates for WSE-3. | Low | SI014 |
| CI021 | Cerebras's go-to-market is primarily direct enterprise sales; no disclosed reseller channel, OEM distribution partner, or cloud marketplace listing exists as of the May 2026 IPO. | Medium | SI014, SI020 |
| CI022 | Cerebras's enterprise hardware sales cycle is estimated at 6–18 months given the $2–4M price point and enterprise procurement requirements; CAC and ACV are not disclosed. | Low | SI014, SI021 |
| CI023 | The OpenAI partnership represents a foundational GTM channel for Cerebras into the hyperscale AI lab segment, with the MRA providing the first major hyperscale-adjacent commercial relationship. | Medium | SI012, SI003, SI005 |
| CI024 | Cerebras stock closed at approximately $311/share on its IPO date (May 14, 2026), versus the $185 offering price and implied a market capitalization of approximately $17–100B depending on fully diluted share count. | Medium | SI010, SI022, SI007 |
| CI025 | Cerebras operates a dual-class share structure with Class A (1 vote per share) and Class B (20 votes per share); insiders retain approximately 99.2% of total voting power post-IPO. | High | SI014, SI006 |
| CI026 | Cerebras's 2024 IPO attempt was blocked by a U.S. national security review of Saudi Aramco's investment; the company withdrew its original S-1 in November 2024 and re-filed in 2026 after resolving the review. | Medium | SI016, SI020 |
| CI027 | With GAAP profitability in FY2025 and three consecutive years of >70% revenue growth, Cerebras presents an unusually strong financial profile for a hardware company at IPO, though revenue quality concerns limit direct comparability to software IPO benchmarks. | Medium | SI014, SI018, SI008 |
| CI028 | Cerebras's accounts receivable from MBZUAI represented 77.9% of total AR at December 31, 2025, creating material credit concentration risk if MBZUAI were to delay payment or dispute amounts owed. | High | SI014, SI019 |
| CI029 | The non-GAAP net loss of -$75.7M versus GAAP net income of $237.8M in FY2025 implies approximately $313M in non-cash charges that may not recur; reported GAAP profitability likely overstates underlying cash economics. | Medium | SI014, SI019 |
| CI030 | Seeking Alpha's IPO analysis flagged MBZUAI's UAE government ties and the OpenAI financing relationship as material valuation risks that could affect Cerebras's access to U.S. government contracts. | Medium | SI025, SI020 |
| CI031 | The $1B OpenAI Working Capital Loan at 6% fixed interest maturing December 2032 creates approximately $60M in annual interest obligations, a manageable fixed cost but one that constrains financial flexibility. | Medium | SI014, SI011 |
| CI032 | Cerebras's hardware business model requires advance TSMC wafer fabrication payments ahead of CS-3 delivery and revenue recognition, creating a high working capital requirement that increases with revenue growth. | Medium | SI014, SI019 |
| CI033 | Cerebras has not disclosed CAC, ACV, or net revenue retention rate for its enterprise hardware or inference API business in any public filing as of the May 2026 IPO. | Low | |
| CI034 | The draw schedule and conditionality clauses in the OpenAI $20B+ MRA are not publicly available; whether the full commitment is guaranteed or subject to milestones is unknown. | Low | |
| CI035 | Cerebras has not disclosed WSE-3 unit production volumes, manufacturing yield rates, or per-wafer TSMC costs; COGS modeling at the unit level is not possible from public data. | Low | |
| CI036 | Cerebras's SEC S-1/A risk factors explicitly identify MBZUAI customer concentration as a material risk, stating that loss of MBZUAI would likely cause a material adverse effect on revenue and results. | High | SI014, SI025 |
| CI037 | If the OpenAI MRA is executed at full scale over 3–5 years, Cerebras's customer concentration would diversify significantly and its revenue quality would improve materially, reducing the MBZUAI dependency. | Medium | SI013, SI012 |
| CI038 | Cerebras's combination of high TSMC capital requirements, extreme customer concentration, non-GAAP losses, and geopolitical exposure through UAE customers makes its financial model unusually risk-concentrated relative to pure-software AI IPO comps. | Medium | SI019, SI025, SI020 |
| CE001 | The WSE-3 die area is 46,225 mm2, which is approximately 58 times larger than the NVIDIA B200 GPU die at approximately 814 mm2. | High | SE001, SE002, SE010 |
| CE002 | The WSE-3 contains 4 trillion transistors, compared to approximately 208 billion in the NVIDIA B200, which is approximately 19 times more transistors. | High | SE001, SE002 |
| CE003 | The WSE-3 integrates 900,000 AI processing cores on a single die. | High | SE001, SE004, SE010 |
| CE004 | The WSE-3 contains 44 GB of on-chip SRAM, approximately 250 times more on-die memory than the NVIDIA B200 GPU. | High | SE001, SE002 |
| CE005 | The WSE-3 delivers 21 petabytes per second (21,000 TB/s) of internal memory bandwidth via on-chip SRAM interconnects. | High | SE001, SE004 |
| CE006 | The WSE-3 is fabricated on the TSMC 5nm (N5) process node. | High | SE001, SE002, SE015 |
| CE007 | The WSE-3 die area of 46,225 mm2 is approximately 58 times larger than the NVIDIA B200 GPU die at approximately 814 mm2 based on competitive die-area analysis. | High | SE001, SE016 |
| CE008 | The WSE-3 has approximately 19 times more transistors than the NVIDIA B200 GPU (4 trillion vs approximately 208 billion). | High | SE001, SE016 |
| CE009 | The WSE-3 contains approximately 250 times more on-chip SRAM than the NVIDIA B200 GPU die, based on 44 GB on-chip versus approximately 0.18 GB on-die SRAM for NVIDIA B200. | Medium | SE001, SE016 |
| CE010 | Cerebras achieves 2,100+ tokens per second for Llama 3 8B on a single WSE-3, which is approximately 15 times faster than a comparable NVIDIA GPU. | High | SE001, SE005 |
| CE011 | Cerebras claims approximately 15 times faster inference throughput than NVIDIA GPU alternatives for LLM workloads on the WSE-3 for models up to 44B parameters. | Medium | SE001, SE005 |
| CE012 | The Cerebras Inference Cloud API is fully compatible with the OpenAI Chat Completions API specification, requiring only a base URL and API key change to switch from OpenAI. | High | SE003, SE008, SE029 |
| CE013 | The Cerebras Compiler translates standard PyTorch model definitions to WSE-3 execution graphs without requiring a custom domain-specific language or manual kernel tuning. | High | SE001, SE012 |
| CE014 | OpenAI Codex-Spark runs on Cerebras inference infrastructure, providing hyperscaler-grade production validation of the CS-3 inference platform. | High | SE001, SE020, SE023 |
| CE015 | MBZUAI represented approximately 62 percent of Cerebras FY2025 revenue, equivalent to approximately $316 million of the $510 million total. | High | SE001, SE002 |
| CE016 | G42 represented approximately 24 percent of Cerebras FY2025 revenue and is both a strategic investor with $335M cumulative invested and a hardware customer. | High | SE001, SE002 |
| CE017 | GSK used Cerebras hardware to train an RNA drug discovery model 10 times faster with a 120 times larger dataset compared to prior methods. | Medium | SE001, SE020 |
| CE018 | Cerebras released the Cerebras-GPT open-weight model family spanning 111M to 13B parameters on HuggingFace in April 2023. | Medium | SE009, SE014 |
| CE019 | The Cerebras-GPT paper arXiv 2304.03208 validates Chinchilla scaling laws on Cerebras wafer-scale hardware, providing independent scientific validation of training efficiency. | Medium | SE009 |
| CE020 | The CS-3 system requires either liquid cooling or high-airflow forced-air cooling environments and is not compatible with standard data center air cooling configurations. | Medium | SE001, SE010 |
| CE021 | The Cerebras Cluster Manager enables multi-CS-3 scale-out orchestration using InfiniBand or Ethernet fabric for disaggregated inference of models exceeding 44B parameters. | Medium | SE001, SE012 |
| CE022 | Cerebras has shipped three chip generations: WSE-1 (16nm TSMC, 2019), WSE-2 (7nm TSMC, 2021), and WSE-3 (5nm TSMC, 2023), each approximately doubling transistor density. | High | SE001, SE010 |
| CE023 | TSMC is Cerebras sole-source wafer fabrication supplier for the WSE-3 die, with no disclosed backup foundry or alternative chip design for a different process node. | High | SE001, SE002, SE015 |
| CE024 | Shipments of CS-3 hardware to UAE-based customers MBZUAI and G42 require export licenses under US Bureau of Industry and Security Export Administration Regulations. | High | SE001, SE002 |
| CE025 | AWS and Cerebras signed a binding term sheet in March 2026 for a disaggregated inference integration combining CS-3 nodes with AWS Trainium; the integration was not deployed as of the May 2026 IPO. | High | SE001, SE002 |
| CE026 | No WSE-4 successor chip has been publicly announced or given a timeline by Cerebras as of May 2026. | High | SE001, SE007, SE027 |
| CE027 | The Cerebras GitHub organization hosts a public Model Zoo with open-source reference implementations for GPT, BERT, and diffusion model architectures. | Medium | SE013, SE014 |
| CE028 | The Cerebras Inference API supports Llama family models, Codex-Spark from OpenAI, GPT-OSS-120B, and GLM 4.7 as of May 2026. | High | SE005, SE008, SE029 |
| CE029 | Cerebras FY2025 total revenue was $510 million with 39 percent gross margin; hardware revenue was $358.4 million and cloud services revenue was $151.6 million. | High | SE001, SE002 |
| CE030 | LLM inference is inherently memory-bandwidth-bound because each token generation requires loading all model weight parameters across the memory interface on every forward pass. | High | SE009, SE010, SE011 |
| CE031 | Cerebras publishes inference API pricing at $0.60 per million input tokens for Llama-3 70B on the Cerebras cloud platform. | Medium | SE019, SE008 |
| CE032 | The WSE-3 21 PB/s on-chip memory bandwidth is approximately 2,625 times the effective bandwidth of the NVIDIA B200 NVL72 cluster at approximately 8 TB/s. | Medium | SE001, SE016 |
| CE033 | MBZUAI is a UAE sovereign AI university and Cerebras largest single customer, representing approximately 62 percent of FY2025 revenue at approximately $316 million. | High | SE001, SE018 |
| CE034 | No SOC 2 Type II or ISO 27001 cloud security certification for the Cerebras Inference Cloud API has been publicly disclosed as of May 2026. | Medium | SE008, SE019 |
| CE035 | The WSE-3 is fabricated on a single 300 mm TSMC N5 wafer, producing one chip per wafer, among the most material-intensive semiconductor manufacturing configurations commercially available. | High | SE001, SE010, SE015 |
| CE036 | Cerebras CBRS shares began trading on the Nasdaq on May 14, 2026, at an IPO price of $185 per share and opened at $350, a 68 percent first-day gain and the largest US tech IPO since Uber in 2019. | High | SE020, SE025, SE026 |
| CE037 | The AMD MI300X accelerator has 192 GB of HBM3 at 5.3 TB/s versus the WSE-3 44 GB on-chip SRAM at 21 PB/s; the WSE-3 has approximately 4,000 times the memory bandwidth. | Medium | SE016, SE011 |
| CU001 | MBZUAI (Mohamed bin Zayed University of Artificial Intelligence) accounted for 62% of Cerebras FY2025 revenue, approximately $316M of the $510M total. | High | SU009, SU010 |
| CU002 | G42 (Group 42) accounted for 24% of Cerebras FY2025 revenue, approximately $122M of the $510M total, down from 85% of FY2024 revenue as MBZUAI expanded. | High | SU009, SU010 |
| CU003 | The top-5 customers accounted for 94% of Cerebras FY2025 total revenue of $510M, reflecting extreme customer concentration. | High | SU009, SU010 |
| CU004 | Combined MBZUAI and G42 revenue represents approximately 86% of FY2025 total revenue, with both entities based in Abu Dhabi, UAE. | High | SU009, SU002 |
| CU005 | Cerebras serves five distinct customer segments: sovereign AI programs, foundation model labs and hyperscalers, US national labs and defense, enterprise life sciences, and cloud API developers. | Medium | SU009, SU011 |
| CU006 | Cerebras FY2022 total revenue was $24.6M, derived from a small number of hardware customers. | High | SU009, SU010 |
| CU007 | Cerebras FY2023 total revenue was $78.7M, representing approximately 220% year-over-year growth from FY2022. | High | SU009, SU010 |
| CU008 | Cerebras FY2024 total revenue was $290.3M, representing approximately 269% year-over-year growth, driven primarily by G42 at approximately 85% of that revenue. | High | SU009, SU010 |
| CU009 | Cerebras FY2025 total revenue was $510M, representing approximately 76% year-over-year growth from FY2024. | High | SU009, SU010 |
| CU010 | Cloud and services revenue was $151.6M in FY2025, representing 29.7% of total FY2025 revenue, up from near-zero in FY2022 and FY2023. | High | SU009, SU010 |
| CU011 | Revenue grew from $24.6M in FY2022 to $510M in FY2025, a cumulative increase of approximately 1,975% over three years. | High | SU009, SU010 |
| CU012 | OpenAI Codex-Spark, OpenAI's coding product, runs in production on Cerebras inference infrastructure, providing hyperscaler-grade production validation. | High | SU013, SU009, SU004 |
| CU013 | GSK achieved a 10x speedup and was able to train on a 120x larger RNA drug-discovery dataset using Cerebras CS-3 hardware versus a prior GPU-based baseline. | High | SU012, SU009 |
| CU014 | Sandia National Laboratories is a named Cerebras customer using CS-3 systems for scientific computing and AI research supporting national security applications. | High | SU009, SU008 |
| CU015 | OpenAI signed a $20B+ Master Revenue Agreement (MRA) with Cerebras in December 2025, committing to 750MW of Cerebras cloud inference capacity; revenue recognition is pending infrastructure buildout. | High | SU009, SU013, SU016 |
| CU016 | AWS signed a binding term sheet with Cerebras in March 2026 for a hybrid disaggregated inference partnership; as of the May 2026 IPO, final terms were not closed and no revenue had been generated. | High | SU009, SU015 |
| CU017 | G42 has invested $335M in Cerebras as a strategic investor and has been a continuous revenue customer since at least FY2022. | High | SU009, SU001, SU006 |
| CU018 | MBZUAI is the world's first graduate-level AI university, headquartered in Abu Dhabi, UAE, with over 1,300 students and researchers across AI, machine learning, and computer vision. | Medium | SU027, SU006 |
| CU019 | G42 was approximately 85% of Cerebras FY2024 revenue ($246M estimated of $290.3M total), confirming continuous multi-year customer status across at least FY2022 through FY2025. | High | SU009, SU010 |
| CU020 | No net revenue retention (NRR), gross revenue retention (GRR), or churn rate metric has been publicly disclosed by Cerebras as of May 2026. | High | SU009, SU002 |
| CU021 | No customer cohort data, contract length disclosure, take-or-pay commitment, or renewal rate has been published by Cerebras as of May 2026. | Medium | SU009, SU017 |
| CU022 | The OpenAI $1B working capital loan at 6% interest plus $20B+ MRA implies a financially committed, multi-year relationship that would be costly for OpenAI to exit without penalty. | Medium | SU009, SU013 |
| CU023 | G42's revenue share declined from 85% (FY2024) to 24% (FY2025) because MBZUAI's spending grew faster; G42 remained an active customer and did not churn. | Medium | SU009, SU010 |
| CU024 | A top-2 customer concentration of 86% of annual revenue is extreme relative to AI infrastructure peers and typical SaaS/infra benchmarks at comparable revenue scale; it materially elevates single-event risk. | Medium | SU009, SU022 |
| CU025 | The OpenAI MRA ($20B+ notional, 750MW committed capacity) is the largest single revenue diversification vehicle in the Cerebras pipeline and could add hundreds of millions in annual recognized revenue once operational. | High | SU009, SU013, SU016 |
| CU026 | The AWS binding term sheet (March 2026) provides Cerebras with access to the AWS enterprise customer base through the AWS Marketplace channel; no revenue had been generated at IPO. | Medium | SU009, SU015 |
| CU027 | Shipments of Cerebras CS-3 hardware to UAE customers MBZUAI and G42 require BIS/EAR export licenses; this creates a structural regulatory dependency on approximately 86% of FY2025 revenue. | High | SU009, SU002 |
| CU028 | Revenue grew from $78.7M in FY2023 to $510M in FY2025, a 548% increase in two fiscal years. | High | SU009, SU010 |
| CU029 | The Cerebras Cloud API (cloud.cerebras.ai) went generally available in August 2024, enabling self-serve developer access to WSE-3 inference without hardware procurement. | Medium | SU011, SU003 |
| CU030 | The IBM watsonx partnership positions Cerebras inference within IBM's enterprise channel targeting regulated industries including financial services, healthcare, and government. | Medium | SU009, SU011 |
| CU031 | Cerebras publishes a list price of $0.60 per million input tokens for Llama-3 70B inference via the cloud API, enabling direct price comparison with GPU-based alternatives. | Medium | SU011, SU003 |
| CU032 | G42 is an Abu Dhabi AI and technology conglomerate with state-linked governance and over $20B in reported assets, active across AI, healthcare, and cloud infrastructure verticals. | Medium | SU001, SU006 |
| CU033 | MBZUAI had over 1,300 enrolled students and researchers and is focused on foundational AI research across machine learning, computer vision, and natural language processing programs. | Medium | SU027, SU006 |
| CU034 | No independent G2, Capterra, or Gartner Peer Insights reviews of the Cerebras hardware platform or cloud API are publicly available as of May 2026. | Medium | SU020, SU021 |
| CU035 | Hardware revenue was approximately $358.4M (70% of FY2025 total) and cloud/services revenue was $151.6M (30% of FY2025 total), per S-1/A disclosures. | High | SU009, SU010 |
| CU036 | US national laboratories including Sandia National Laboratories validate Cerebras CS-3 for government and defense scientific computing, providing a US government customer reference. | Medium | SU008, SU009 |
| CU037 | Cerebras had approximately 2-3 hardware customers in FY2022 and has grown the customer base significantly by FY2025, though exact total customer count has not been disclosed in any public filing. | Medium | SU009, SU019 |
| CR001 | Cerebras S-1/A explicitly identifies four top-tier risk categories: US export control regulations (BIS/EAR), UAE customer revenue concentration (~86% of FY2025), sole-source TSMC manufacturing dependency, and key-person risk centered on CEO Andrew Feldman. | High | SR009, SR010 |
| CR002 | MBZUAI and G42, both UAE sovereign-affiliated entities, together accounted for approximately 86% of Cerebras FY2025 revenue ($440M of $510M total), representing one of the highest customer concentration levels for a US-listed technology company at comparable revenue scale. | High | SR009, SR010 |
| CR003 | Cerebras opened its IPO at approximately $350 per share, implying a market capitalization exceeding $100 billion on $510M trailing FY2025 revenue, a trailing revenue multiple of approximately 196 times. | High | SR001, SR002, SR023 |
| CR004 | Cerebras advanced AI chips (WSE-3) are subject to BIS Export Administration Regulations (EAR) which restrict sales of high-performance computing chips to certain end-users and destinations based on total processing performance thresholds. | High | SR003, SR009 |
| CR005 | BIS updated its advanced computing export control thresholds in October 2023 and again in 2024, implementing performance-based rules that specifically govern chips like the WSE-3; further rule tightening is an active regulatory risk for Cerebras UAE business model. | High | SR003, SR009 |
| CR006 | Cerebras disclosed in its S-1/A that it has obtained specific BIS export licenses for UAE customer shipments to MBZUAI and G42; those licenses are subject to revocation, modification, or non-renewal if US policy toward UAE or those entities changes. | High | SR009, SR003 |
| CR007 | Saudi Aramco attempted investment in Cerebras in 2023-2024 triggered a CFIUS national security review that caused Cerebras to withdraw its IPO filing in Q4 2024; the company refiled in 2026 after Saudi Aramco was removed as an investor. | High | SR009, SR022 |
| CR008 | G42, one of Cerebras top two customers (~24% of FY2025 revenue), agreed in 2024 to divest its Chinese technology holdings as a precondition for the Microsoft deal and US government approval; G42 prior Chinese technology ties remain a compliance screening concern. | High | SR027, SR028 |
| CR009 | Cerebras disclosed no pending material IP litigation in its S-1/A as of May 2026; however, the competitive semiconductor landscape with NVIDIA holding more than 10,000 patents creates latent risk of future infringement claims as Cerebras gains commercial scale. | Medium | SR009, SR005 |
| CR010 | The BIS AI Diffusion Rule effective 2025 and AI Executive Order 14110 impose compliance reporting requirements on advanced compute providers serving certain foreign nationals and entities, adding regulatory overhead to Cerebras UAE customer relationships. | Medium | SR003, SR009 |
| CR011 | TSMC is the sole qualified manufacturer of Cerebras WSE-3 chips; no other foundry has publicly demonstrated capability to produce a single-die chip occupying an entire 300mm silicon wafer at the lithography node required for commercial yield. | High | SR009, SR014 |
| CR012 | Cerebras hardware lead times from TSMC are approximately 6-12 months per S-1/A disclosures, meaning a manufacturing disruption would not affect delivered revenue for at least two quarters but would deplete inventory and impair backlog fulfillment immediately. | High | SR009, SR012 |
| CR013 | The WSE-3 consists of a single die occupying an entire 300mm wafer (46,225 mm2), with proprietary defect-tolerant routing that allows the chip to remain functional despite some transistor failures; this routing is essential for achieving acceptable commercial yield. | High | SR009, SR012 |
| CR014 | The CS-3 system consumes approximately 23 kW of power and requires liquid cooling, limiting deployment to data center facilities with specialized thermal infrastructure and excluding the majority of enterprise data centers that are air-cooled. | Medium | SR012, SR020 |
| CR015 | A Taiwan Strait conflict or major natural disaster affecting TSMC facilities would eliminate Cerebras entire WSE-3 chip supply, representing an existential operational risk with no plausible 12-24 month recovery path given the absence of an alternative foundry. | Medium | SR005, SR017 |
| CR016 | Cerebras has not disclosed any backup foundry arrangement or second-source qualification program for WSE-scale chip production; its supply chain has zero redundancy at the chip fabrication stage, making TSMC a structural binary dependency. | High | SR009, SR014 |
| CR017 | Cerebras cloud inference services representing approximately 30% of FY2025 revenue introduce data security and incident response requirements; no public SOC 2 Type II certification or third-party security audit has been disclosed by Cerebras as of May 2026. | Low | SR011, SR009 |
| CR018 | OpenAI simultaneously occupies three roles in Cerebras capital structure: production customer (running Codex-Spark on Cerebras inference infrastructure), working-capital lender ($1B at 6% maturing 2032), and the anchor of the $20B+ Master Revenue Agreement. This triple-role dependency creates concentrated counterparty risk with no structural equivalent among comparable AI hardware companies. | High | SR004, SR026, SR009 |
| CR019 | The OpenAI $1B working capital loan at 6% interest matures December 31, 2032; if the OpenAI-Cerebras commercial relationship deteriorates the debt could shift from supportive to adversarial before the loan matures. | High | SR009, SR026 |
| CR020 | OpenAI holds 33.4 million warrants exercisable at $0.00001 per share, representing potential dilution of approximately 10-12% of fully diluted Cerebras shares if all warrants are exercised post-IPO. | Medium | SR009, SR004 |
| CR021 | The AWS binding term sheet signed in March 2026 has not been converted to a signed commercial agreement; until it is executed and a Marketplace listing is live, AWS represents a diversification aspiration rather than a confirmed revenue channel. | Medium | SR009, SR025 |
| CR022 | MBZUAI and G42 together constitute approximately 86% of FY2025 revenue, a concentration that is structurally unsustainable for a company targeting global AI infrastructure leadership and creates a single-event total-loss scenario if export licenses for both entities are simultaneously revoked. | High | SR009, SR029 |
| CR023 | Any tightening of BIS/EAR export rules for UAE destinations, or placement of MBZUAI or G42 on the Entity List, would immediately prohibit Cerebras from fulfilling existing and future hardware orders to its top two customers with no near-term substitute revenue source. | High | SR003, SR009 |
| CR024 | Cerebras reported FY2025 GAAP net income of approximately $237.8M but non-GAAP adjusted loss of approximately $75.7M; the discrepancy is driven primarily by non-cash warrant remeasurement gains and stock-based compensation, making GAAP income a poor proxy for actual cash generation. | High | SR009, SR014 |
| CR025 | Cerebras FY2025 GAAP income of $237.8M, if taken at face value without non-GAAP adjustment, significantly overstates cash profitability and could mislead retail investors who do not understand the warrant fair-value accounting mechanism driving the headline figure. | High | SR009, SR014 |
| CR026 | Cerebras hardware gross margin of approximately 39% in FY2025 reflects the high cost of TSMC wafer-scale fabrication; cloud inference gross margin is not separately disclosed but may be structurally higher as a services product. | Medium | SR009, SR014 |
| CR027 | At approximately 196 times trailing FY2025 revenue, Cerebras IPO valuation prices in continued hyper-growth; a deceleration from 76% FY2025 revenue growth to 40-50% would likely compress the multiple significantly and impair returns for IPO-price investors. | Medium | SR001, SR023, SR002 |
| CR028 | Hardware revenue ($358.4M in FY2025) is composed of large, lumpy contracts with few customers and extended lead times; this creates quarter-to-quarter revenue volatility that is difficult to forecast and could result in earnings misses that disproportionately depress the stock. | Medium | SR009, SR014 |
| CR029 | Cerebras capital-intensive hardware model requiring large upfront TSMC fabrication orders before revenue recognition makes working capital management a structural vulnerability; the OpenAI $1B loan partially addresses this but creates its own counterparty concentration. | Medium | SR009, SR026 |
| CR030 | Andrew Feldman (CEO) is explicitly named in Cerebras S-1/A risk factors as a key-person; the filing states that loss of the founding CEO would likely impair investor confidence, customer relationships, and strategic direction simultaneously. | High | SR009, SR008 |
| CR031 | Cerebras founders hold Class B shares with 10-to-1 voting rights relative to public Class A shares, giving founders approximately 99.2% aggregate voting power at IPO; public shareholders cannot remove management, override board appointments, or block capital structure changes. | High | SR009, SR001 |
| CR032 | Wafer-scale chip architecture requires a rare combination of semiconductor engineering skills in 3D floorplanning, power delivery network design, and defect-tolerant routing; loss of core chip architects could delay the CS-4 program by 12-18 months with no available external replacement talent pool. | Medium | SR013, SR014 |
| CR033 | Cerebras has implemented an active export compliance program per S-1/A disclosures, including obtaining BIS licenses for UAE customer shipments and monitoring ongoing compliance obligations; this is a genuine mitigation but does not eliminate the risk of license revocation. | Medium | SR009, SR003 |
| CR034 | Cerebras cloud/services revenue grew to $151.6M (29.7% of FY2025 total), a meaningful but insufficient diversification from hardware-only UAE sales; cloud revenue does not fully reduce export-license risk since cloud inference customers may also require license review when serving UAE-based end-users. | Medium | SR009, SR011 |
| CR035 | No public investor or analyst report as of May 2026 has disclosed Cerebras-specific thesis-break triggers or monitoring thresholds; investors relying on public information alone must construct their own monitoring frameworks without management validation. | Medium | SR006, SR007 |
| CR036 | The AWS binding term sheet, if converted to a live Marketplace listing within 12-18 months, could provide enterprise revenue diversification from US-based customers not subject to BIS/EAR licensing requirements for UAE entities. | Medium | SR009, SR025 |
| CR037 | G42 2024 agreement to divest Chinese technology holdings reduces but does not eliminate the CFIUS scrutiny risk for the Cerebras-G42 customer relationship; ongoing monitoring of G42 counterparty status is a permanent compliance requirement. | Medium | SR027, SR028 |
| CR038 | Cerebras disclosed no material pending litigation in its S-1/A as of the May 2026 IPO date, providing a clean legal baseline; however, absence of current litigation does not preclude future IP assertion claims as Cerebras scales and becomes a commercial threat. | Medium | SR009, SR022 |
| CR039 | Cerebras OpenAI MRA ($20B+ notional, 750MW committed capacity) represents the primary near-term revenue diversification catalyst, but requires substantial infrastructure buildout before revenue recognition; near-term revenue risk from UAE concentration is not yet mitigated by the MRA. | Medium | SR004, SR026 |
| CR040 | The gap between GAAP net income ($237.8M) and non-GAAP adjusted loss ($75.7M), approximately $313M driven by warrant remeasurement, creates a financial literacy risk for retail investors post-IPO who may anchor on headline GAAP EPS rather than adjusted operating performance. | Medium | SR009, SR014 |
| CR041 | G42 received a $335M strategic investment stake in Cerebras at a pre-IPO valuation; G42 continued large equity position aligns customer and investor incentives but also concentrates board influence in a foreign strategic investor with documented prior Chinese technology ties. | Medium | SR009, SR027 |
| CR042 | BIS advanced computing export rules have been updated at least twice since 2022 (October 2023 and October 2024), with each iteration progressively tightening performance thresholds; future updates could modify thresholds in ways that affect Cerebras existing UAE export license applicability without requiring a new entity-list designation. | High | SR003, SR009 |
| CR043 | The IPO lockup period of 180 days means approximately 95% of Cerebras shares held by insiders at IPO (primarily Class B founder shares) become eligible for sale around November 2026, creating near-term supply overhang that could depress the public share price. | Medium | SR001, SR002 |
| CR044 | OpenAI long-term strategic trajectory toward custom silicon development or alternative inference providers creates a latent risk that the Cerebras-OpenAI relationship, while currently mutually beneficial, could become competitive or be wound down after the MRA term ends. | Low | SR004, SR026 |
| CR045 | Cerebras revenue growth decelerated from 269% in FY2023-FY2024 to 76% in FY2024-FY2025; further deceleration to 40-50%, plausible if UAE contracts plateau before OpenAI MRA revenue begins, could trigger significant multiple compression from the 196x trailing revenue level. | Medium | SR009, SR014 |
| CR046 | Cerebras IPO coincided with a peak in AI chip investor enthusiasm; if hyperscaler capex moderates, NVIDIA supply constraints ease, or AI inference efficiency improvements reduce per-token compute demand, the market environment underpinning the IPO multiple could deteriorate before Cerebras achieves revenue diversification. | Low | SR005, SR007, SR002 |
| CV001 | Cerebras Systems completed its IPO on Nasdaq (ticker CBRS) on May 14, 2026 at $185 per share, raising $5.55 billion in gross proceeds — the largest US technology IPO since Uber in 2019. | High | SV004, SV005, SV007 |
| CV002 | Cerebras's S-1/A Amendment No. 2 (May 2026) confirms FY2025 revenue of $510 million, FY2024 revenue of $290.3 million, FY2023 revenue of $78.7 million, and FY2022 revenue of $24.6 million. | High | SV009, SV010 |
| CV003 | The S-1/A confirms that MBZUAI represented 62% and G42 represented 24% of Cerebras's FY2025 revenue — a combined UAE concentration of 86%. | High | SV009, SV003 |
| CV004 | Cerebras grew revenue from $24.6 million in FY2022 to $510 million in FY2025, a 3-year compound annual growth rate exceeding 120% — exceptional for a hardware company at this revenue scale. | High | SV009, SV010 |
| CV005 | Cerebras announced a master revenue agreement with OpenAI in December 2025 valued at $20 billion or more, representing the single largest inference infrastructure commitment disclosed by either party. | Medium | SV023, SV021 |
| CV006 | Cerebras signed a binding term sheet with AWS in March 2026 for AI inference services, providing a second major distribution channel beyond UAE sovereign customers. | Medium | SV030, SV007 |
| CV007 | The S-1/A filing confirms Cerebras founders hold approximately 99.2% of total voting power through a dual-class share structure; Class B shares carry superior voting rights. | High | SV009, SV031 |
| CV008 | At the $185 IPO price, Cerebras implies approximately 30-45x NTM revenue depending on total shares outstanding and FY2026 revenue trajectory; a range derived from the $5.55 billion IPO proceeds relative to FY2025 $510 million revenue. | Medium | SV012, SV017, SV001 |
| CV009 | Cerebras filed its original S-1 in September 2024 seeking approximately $4 billion valuation at trailing 12-month revenue of approximately $136.4 million (H1 2024) — roughly 29x trailing revenue — then withdrew it in November 2024 after weak institutional feedback. | High | SV020, SV022 |
| CV010 | Cerebras's last private funding round was a Series F of approximately $250 million in November 2021 at a post-money valuation of approximately $4 billion. | Medium | SV029, SV022 |
| CV011 | Cerebras raised approximately $720 million in total equity pre-IPO across all disclosed funding rounds from founding (2016) through the Series F (2021). | Medium | SV029, SV022 |
| CV012 | The appropriate investment recommendation for Cerebras at the $185 IPO price is conditional TRACK: the AI hardware thesis is structurally sound, but the 30-45x NTM multiple and UAE concentration make a price-insensitive buy unjustifiable. | Medium | SV017, SV001, SV027 |
| CV013 | The risk rating for Cerebras at IPO is high: three independent shocks (UAE export restrictions, OpenAI MRA failure, Nvidia competitive displacement) each carry sufficient probability and impact to individually break the base case thesis. | Medium | SV020, SV001, SV017 |
| CV014 | The valuation stance is aggressive: 30-45x NTM revenue at IPO has almost no precedent for a sustained public market multiple for hardware-first AI companies; it implies near-perfect growth execution with no room for revenue or multiple disappointment. | Medium | SV017, SV027, SV001 |
| CV015 | Confidence in the recommendation is medium: Cerebras is now a public company with quarterly disclosure requirements, which will close key information gaps (segment margins, customer diversification progress) within 2-3 quarters. | Medium | SV031, SV007 |
| CV016 | Nvidia (NVDA) trades at approximately 25x NTM revenue as of May 2026, representing the ceiling multiple for publicly traded AI hardware companies and the primary benchmark for AI chip sector valuation. | Medium | SV008, SV015, SV006 |
| CV017 | AMD (AMD) trades at approximately 8x NTM revenue as of May 2026, representing the hardware semiconductor floor multiple and the lower bound for AI chip public market comparables. | Medium | SV008, SV014, SV015 |
| CV018 | Groq's last disclosed valuation of approximately $2.8 billion at an estimated $750 million ARR implies an approximately 3.7x ARR multiple — the lowest private AI inference company multiple in the comparable set. | Low | SV016, SV028 |
| CV019 | SambaNova Systems was valued at approximately $5.1 billion at an estimated $450 million ARR, implying approximately 11x ARR — more comparable to Cerebras's range but still far below the IPO multiple. | Low | SV016, SV028 |
| CV020 | Tenstorrent was valued at approximately $2.6 billion in its 2025 funding round, representing an earlier-stage private AI chip company with undisclosed revenue that adds context to the AI chip VC landscape. | Low | SV016, SV029 |
| CV021 | The Cerebras bull case assumes FY2027 revenue of $1.5-2 billion as OpenAI MRA and AWS both ramp, implying an enterprise value of $25-40 billion at 17-20x NTM — approximately 2-3x from the IPO price. | Low | SV001, SV027 |
| CV022 | The Cerebras base case assumes FY2027 revenue of $900 million to $1.1 billion, implying an enterprise value of $11-15 billion at 12-14x NTM — approximately flat to 10-15% downside from the IPO implied market cap. | Low | SV001, SV027 |
| CV023 | The Cerebras bear case assumes UAE revenue contracts materially due to export control restrictions, FY2027 revenue of $400-600 million, and an enterprise value of $3-6 billion at 8-10x NTM — representing 65-75% downside from IPO price. | Low | SV020, SV001, SV017 |
| CV024 | Cerebras's WSE-3 chip delivers 2,100+ tokens per second on LLaMA-70B inference workloads, representing 10-15x the throughput of equivalent GPU-cluster configurations — a documented performance advantage for memory-bandwidth-bound inference. | Medium | SV018, SV003 |
| CV025 | The OpenAI $20B+ MRA represents a potential 3-4x FY2025 revenue pipeline over the MRA term, providing the primary mechanism for customer diversification away from UAE concentration if executed. | Medium | SV029, SV023 |
| CV026 | The AI inference cloud services market represents Cerebras's fastest-growing opportunity, with the Cerebras Inference Cloud API priced at $0.60 per million input tokens offering competitive throughput-per-dollar for latency-sensitive LLM deployments. | Medium | SV018, SV019 |
| CV027 | The 86% UAE revenue concentration is an adverse investment signal that most institutional investors would discount; no precedent exists for a publicly traded hardware company at this scale sustaining such concentration without significant regulatory or geopolitical risk. | Medium | SV017, SV020, SV001 |
| CV028 | Hardware-first business models historically trade at 8-12x revenue in public markets; Cerebras's 30-45x NTM multiple implies the market is treating Cerebras as a software-like platform, which the disclosed revenue mix does not yet support. | Medium | SV027, SV001 |
| CV029 | US BIS export controls on advanced AI chips remain an active regulatory uncertainty for UAE AI customers; the October 2023 rule and any proposed revisions could restrict or condition MBZUAI and G42 AI chip procurement from US suppliers including Cerebras. | Medium | SV020, SV001 |
| CV030 | Nvidia's CUDA software ecosystem with 4 million+ developers, decades of library optimization, and deep cloud provider integration creates a competitive moat that hardware performance advantages alone cannot overcome in enterprise purchasing decisions. | Medium | SV006, SV016 |
| CV031 | The November 2024 S-1 withdrawal is a permanent negative signal in the IPO record; the 2026 IPO succeeded only after 18 months of revenue compounding (from $136M trailing to $510M FY2025) and the OpenAI MRA announcement that changed institutional sentiment. | Medium | SV022, SV023 |
| CV032 | Lock-up expiry (typically 90-180 days post-IPO, August-November 2026) will create insider selling supply pressure; with founders holding 99.2% of voting power, their selling decisions at lock-up expiry are the most important near-term price catalyst. | Medium | SV009, SV012 |
| CV033 | The UAE concentration risk is not merely a diversification concern — it is a revenue quality risk, because sovereign AI program spending is subject to geopolitical and budget dynamics that differ fundamentally from enterprise SaaS customers. | Medium | SV020, SV001, SV003 |
| CV034 | The progression from the original 2024 S-1 ($136M trailing revenue at $4B valuation) to the 2026 IPO ($510M FY2025 revenue at $14-40B implied) was driven primarily by revenue compounding, not multiple expansion — a key distinction for investors evaluating return potential. | Medium | SV010, SV023, SV001 |
| CV035 | The $20B+ OpenAI MRA, if fully executed, would represent approximately 3-4x Cerebras's FY2025 revenue over the MRA term, transforming the revenue concentration profile away from UAE sovereign dependence. | Low | SV029, SV023 |
| CV036 | The AWS binding term sheet represents a potential second major non-sovereign distribution channel; if closed, it would be the first significant US hyperscaler adoption of Cerebras WSE architecture. | Medium | SV018, SV030 |
| CV037 | The dual-class voting structure (99.2% founder voting power) limits investor governance influence; institutional investors cannot force management changes, strategy shifts, or M&A decisions against founder wishes. | High | SV009, SV031 |
| CV038 | Nvidia's data center revenue of $47 billion+ in fiscal 2025 makes it 90x larger than Cerebras by revenue; the comparison is instructive not as a peer but as a market-structure anchor showing the scale of Nvidia dominance Cerebras must work against. | Medium | SV006, SV008 |
| CV039 | At the $185 IPO price, Cerebras implies approximately 33-55x FY2025 trailing revenue depending on total shares outstanding; the range reflects uncertainty about IPO float percentage and pre-IPO share count not disclosed in available public sources. | Medium | SV012, SV004, SV005 |
| CV040 | Cerebras has invested approximately $1.7 billion in R&D since founding, creating a capital-intensive innovation model that requires sustained revenue growth to justify ongoing R&D expenditure at the pace needed to stay competitive with TSMC-enabled Nvidia. | Medium | SV010, SV003 |
| CV041 | The 76% FY2025 revenue growth rate to $510 million is exceptional for a hardware company at this scale; it is comparable only to Nvidia's peak data center growth years and reflects concentrated sovereign AI demand rather than broad enterprise adoption. | Medium | SV008, SV003, SV001 |
| CV042 | Investors should define in advance what FY2026 quarterly revenue thresholds constitute 'on track' vs. 'at risk' before receiving post-IPO earnings data; the first quarterly earnings call (expected August 2026) will be the primary evidence event for the base case trajectory. | Medium | SV017, SV001 |