Startup Diligence
Diligence report AI infrastructure / semiconductors Series E 2026-05-27

SambaNova Systems

Differentiated inference infrastructure with real sovereign traction, but opaque economics and a reset valuation

SambaNova has credible technical differentiation and sovereign/government traction, but opaque economics, customer concentration, and valuation ambiguity keep the equity story in research-more territory.

Cover facts

Series E cash raised 01
$350M [CI015]
Estimated ARR (Feb 2026) 02
~$180M [CI011]
Supportable primary valuation 03
~$2.34B [CV002]
Total capital raised 04
~$1.49B [CI016]

Company profile

SambaNova Systems is a Palo Alto-based private AI infrastructure company founded in 2017 by Rodrigo Liang, Kunle Olukotun, and Christopher Ré. The company sells proprietary Reconfigurable Dataflow Unit-based systems, SambaFlow software, and SambaNova Cloud for large-model inference, with traction concentrated in sovereign and government environments plus selected regulated enterprises. Public evidence supports fast recent ARR growth and a substantial capital base, but the business remains hard to fully underwrite because customer concentration, revenue mix, margins, and the economically correct February 2026 valuation are all only partially disclosed.

Website
sambanova.ai
Founded
2017-01-01
Founders
Rodrigo Liang, Kunle Olukotun, Christopher Ré
Founding location
Palo Alto, California, USA
Headquarters
Palo Alto, California, USA
Product
RDU-based AI chips and full-stack systems such as SN40L and SN50, delivered through on-premises deployments, managed infrastructure, and the SambaNova Cloud inference API.
Customers
Sovereign AI programs, US national laboratories and government agencies, plus large regulated enterprises in financial services, telecom, energy, and related sectors.
Business model
Monetizes through hardware/system sales, token-priced cloud inference, and professional services tied to deployment and model optimization.
Stage
Late-stage private / Series E
Funding status
Raised $350M in a February 2026 Series E; supportable share-price data implies roughly $2.34B post-money, while some databases continue to cite an undisclosed ~$4.8B mark.
[CO001, CO002, CO010, CO011, CI011, CI015, CI016, CV002]

Executive summary

Top strengths

  • SambaNova owns a differentiated full-stack inference platform: proprietary RDU silicon, SambaFlow software, and cloud/on-prem deployment modes that target agentic and sovereign AI workloads.
  • The company has real reference traction in national labs and sovereign infrastructure, including Argonne, LLNL, LANL, and SoftBank-backed SN50 deployment in Japan.
  • Commercial momentum improved materially into 2025-2026, with ARR estimated to have reached roughly $100M in mid-2025 and more than $180M by the February 2026 financing.

Top risks

  • Nvidia's hardware-software moat and CUDA lock-in remain the central strategic threat, and independent production-scale proof of SambaNova's benchmark claims is still limited.
  • Publicly named customers remain concentrated in DOE, NNSA, and other government or academic institutions, while disclosed commercial Fortune 500 hardware customers are sparse.
  • Capital efficiency is weak relative to peers: about $1.49B of capital has produced a supportable implied valuation near $2.34B, and the February 2026 price remains far below the 2021 peak.

Open gaps

  • Audited 2025-2026 revenue, gross margin, EBITDA, and cash figures are still unavailable.
  • Public evidence does not cleanly resolve whether the economically correct February 2026 post-money valuation is the E-1 implied ~$2.34B, the blended round value, or the third-party ~$4.8B figure.
  • Customer concentration by revenue, government share, and renewal durability are not publicly disclosed.
  • Net revenue retention, cloud-vs-hardware mix, and preference-stack terms remain opaque.

Contents

Chapter 01

01Company Overview

1.1 Identity, Headquarters & Business Model

SambaNova Systems, Inc. is a private AI chip and full-stack systems company headquartered in Palo Alto, California. It was founded in 2017 by three Stanford-affiliated technologists—Rodrigo Liang (CEO), Kunle Olukotun (CTO), and Christopher Ré—who set out to build a purpose-built AI processor architecture that could outperform general-purpose GPUs on modern AI workloads, particularly large-language-model inference. The company's core hardware is the Reconfigurable Dataflow Unit (RDU), a proprietary chip that executes AI workloads using a dataflow architecture, minimizing expensive memory movement and enabling multi-model serving at scale. Product lines include SambaRack (hardware rack systems), SambaNova Cloud (a cloud inference API offering token-level access to open-source LLMs), and SambaStack (the full-stack on-premises or hybrid AI platform). SambaNova targets enterprise and government customers—financial services, telecommunications, energy, healthcare, and sovereign AI programs—who require private, low-latency AI inference without relying solely on GPU-centric cloud providers. The company's business model spans hardware sales, subscription cloud services, and professional services. As of the February 2026 Series E close, SambaNova reported record bookings and revenue for 2025, though no specific revenue figures have been disclosed publicly. [CO001, CO002, CO010, CO016, CO037, CO041]

Snapshot KPI Table
MetricValue / StatusAs-of DateConfidenceGap / Caveat
HeadquartersPalo Alto, California, USA2026-05-27HighNone
Founding Year20172017HighNone
StageSeries E, private2026-02-24HighNone
Total Capital Raised~$1.49–1.5 billion2026-02-24HighExcludes unconfirmed pre-Series A bridge
Latest Round$350M Series E (Feb 2026)2026-02-24HighNone
Latest Round ValuationNot disclosed; likely below $5.1B2026-02-24MediumPrivate company; Series E valuation not disclosed
Peak Valuation (Series D 2021)$5.1 billion2021-04-13HighImplies material decline from peak
Headcount (pre-2025 layoffs)~500 employees2025-04MediumPost-layoff estimated ~400–425; 2026 count unknown
Annual Recurring RevenueNot publicly disclosed2026-05-27LowPrivate company; CEO cited record 2025 bookings/revenue without figures
Customer CountNot publicly disclosed2026-05-27LowKnown named customers: SoftBank, Hugging Face, Meta, government agencies

Valuation data from public press releases and corroborated media reports; revenue and customer count are private and unavailable from public sources. Headcount reflects pre-layoff figure from April 2025 reporting; post-Series E headcount is not disclosed. Confidence ratings reflect evidence quality: High = primary-source confirmed; Medium = corroborated secondary reports; Low = inferred or company-claimed without independent verification.

[CO001, CO019, CO021, CO023, CO024, CO025]
FO002: Company Snapshot Logic

How SambaNova's core identity—proprietary RDU chips, full-stack software, and cloud inference— connects to its customers, capital structure, and governance dependencies.

[CO001, CO010, CO016, CO033, CO034, CO035]

1.2 Leadership, Founders & Governance

SambaNova was co-founded by three individuals whose complementary backgrounds—academic AI research and industry chip execution—represent strong founder-market fit for an AI systems company. Rodrigo Liang, the CEO, previously served as Senior Vice President at Oracle (formerly Sun Microsystems), where he led a team of approximately 1,000 chip designers across 12 major processor generations. Kunle Olukotun, the CTO, is a Professor of Electrical Engineering and Computer Science at Stanford University, renowned for pioneering multicore processor design with the Niagara chip. Christopher Ré, the third co-founder, is an Associate Professor of Computer Science at Stanford and a machine learning systems expert; he also co-founded Snorkel AI. A governance concern is the role of Lip-Bu Tan, Intel's CEO, who has served as SambaNova's Executive Chairman since the company's founding in 2017. Tan's venture firm, Walden International, led the Series A. Intel Capital is also an investor. This interlocking governance arrangement raised significant conflict-of-interest questions in late 2025 when Intel entered acquisition discussions with SambaNova. Tan recused himself from those deliberations, with Intel's data center chief Kevork Kechichian acting as executive sponsor. Key-person risk is high: both Liang (product and strategy) and Olukotun (technology roadmap) carry deep institutional knowledge with limited documented succession planning in public disclosures. Board composition beyond Lip-Bu Tan is not publicly detailed. [CO002, CO003, CO004, CO005, CO006, CO007]

Leadership and Founder Table
PersonRoleBackgroundFounder-Market Fit / CoverageKey-Person Dependency
Rodrigo LiangCo-founder & CEOFormer SVP at Oracle/Sun Microsystems; led ~1,000 chip designers across 12 major processor generationsDeep industry chip execution; enterprise sales and go-to-market leadershipHigh – primary strategic and commercial decision-maker
Kunle OlukotunCo-founder & CTO / Chief TechnologistStanford EE/CS Professor; pioneer of multicore processor design (Niagara chip)World-class AI chip architecture research depth; academic credibilityHigh – technical roadmap and RDU architecture ownership
Christopher RéCo-founderStanford CS Professor; ML systems expert; co-founder of Snorkel AI; contributor to widely-adopted ML frameworksML software systems, data management, and AI infrastructure expertiseMedium – ML software depth; less operationally central
Lip-Bu TanExecutive ChairmanCEO of Intel (as of 2025); founding investor via Walden International; semiconductor industry veteranGovernance bridge between SambaNova and Intel; industry network accessHigh – governance influence; source of conflict-of-interest risk during Intel acquisition talks
Kevork KechichianEVP, Data Center Group, Intel (Intel-SambaNova deal sponsor)Intel executive; acted as executive sponsor for the Intel-SambaNova collaboration dealIntel relationship management; post-acquisition alternative governance if neededLow for SambaNova directly – Intel-side sponsor only

Data sourced from company about pages, CNBC, EE Times, and BusinessWire press releases. Lip-Bu Tan's Intel CEO role is concurrent with his SambaNova chairmanship, creating a governance overlay. Christopher Ré's current day-to-day role at SambaNova relative to his Stanford responsibilities is not publicly detailed. Kevork Kechichian is an Intel executive listed here for governance context of the Intel partnership.

[CO002, CO003, CO004, CO005, CO006, CO007]

1.3 Funding History, Investors & Capital Structure

SambaNova has raised approximately $1.49–$1.5 billion in total capital across five identified financing rounds since its founding. The first external capital came in March 2018 when GV (Google Ventures) and Walden International co-led a $56 million Series A—GV's first-ever investment in an AI chip startup. In 2020, a follow-on round of approximately $250 million from BlackRock, Intel Capital, and GV pushed the implied valuation to roughly $2.5 billion. The peak came in April 2021 when SoftBank Vision Fund 2 led a $676 million Series D at a $5.1 billion post-money valuation, joined by Temasek, GIC, and existing investors. SambaNova did not raise again until February 2026, when it closed a $350 million Series E led by Vista Equity Partners and Cambium Capital, with additional participation from Intel Capital, Qatar Investment Authority (QIA), GV, Battery Ventures, T. Rowe Price Associates, Seligman Ventures, Assam Ventures, and sovereign wealth from Saudi Arabia. No post-money valuation was disclosed for the Series E; multiple reports indicate it is below the $5.1 billion Series D peak. BlackRock had already marked down its SambaNova holdings by approximately 17% by 2024–2025, implying a valuation of roughly $2.4 billion. CEO Rodrigo Liang described the Series E as "grossly, grossly oversubscribed," signalling strong investor interest despite the valuation uncertainty. No revenue figures, debt facilities, or secondary transactions have been publicly disclosed. [CO017, CO018, CO019, CO020, CO021, CO022]

Stakeholder or Investor Map
Stakeholder / InvestorRole / TypeRound / RelationshipEconomic / Control ImportanceDiligence Ask
GV (Google Ventures)Lead venture investorSeries A (2018), Series B (~2020), Series E (2026)Founding-round VC; Alphabet strategic alignmentConfirm current stake size and governance rights
Walden International (Lip-Bu Tan)Lead venture investor + governanceSeries A (2018) lead; Tan became Executive ChairmanGovernance influence; conflict-of-interest risk during Intel talksDisclose Tan's dual Intel/SambaNova role management post-Series E
Intel CapitalStrategic investorMultiple rounds including Series E (2026)Key strategic partnership; Intel CEO chairs SambaNova boardAssess ongoing strategic alignment and equity position given multi-year collaboration
BlackRockInstitutional investorSeries B (~2020), Series D (2021)Major financial backer; marked down stake 17% in 2024–2025Current stake value and mark; exit intentions given valuation decline
SoftBank Vision Fund 2Lead investorSeries D (2021) lead at $5.1B valuationLargest single-round investor historically; also first SN50 customerVerify current stake valuation and whether Vision Fund has marked down
TemasekInstitutional investorSeries D (2021)Singapore sovereign wealth fundCurrent stake and strategic interest in AI infrastructure
GICInstitutional investorSeries D (2021)Singapore sovereign wealth fundCurrent stake and strategic interest
Vista Equity PartnersLead private equity investorSeries E (2026) co-leadNew PE backer; brings structured growth capitalPE exit horizon and governance expectations post-Series E
Cambium CapitalCo-lead investorSeries E (2026) co-leadNew strategic backer with agentic AI focusStrategic rationale and board representation
Qatar Investment Authority (QIA)Sovereign wealth investorSeries E (2026)Middle East sovereign capital; geopolitical dimensionSovereign governance terms and potential national deployment agreements

Investor data compiled from multiple press releases and media reports; round sizes are from company-issued announcements where possible and third-party media otherwise. BlackRock markdown figure (17%) from The Information via Data Center Dynamics. T. Rowe Price, Battery Ventures, Seligman Ventures, Assam Ventures, and Saudi First Data participated in Series E per CNBC and EE Times but are omitted from this map as smaller co-investors.

[CO017, CO018, CO019, CO020, CO021, CO022]
FO003: Snapshot KPIs

Key financial and scale metrics for SambaNova Systems as of the February 2026 Series E close, with confidence ratings reflecting evidence quality.

Total raised (~$1.49B) is an approximation based on disclosed round sizes; the Series B amount (~$250M) is from secondary sources and may include debt or bridge components. Series E valuation is undisclosed; $5.1B figure is the last publicly confirmed valuation (Series D 2021). Headcount is based on the pre-layoff figure of ~500 from April 2025 EE Times reporting.

[CO019, CO021, CO022, CO024, CO025, CO032]

1.4 Scale, Milestones & Adverse Events

SambaNova progressed through five chip generations from founding through 2026, culminating in the SN40L (September 2023, TSMC 5nm, HBM3 + DDR5 + SRAM memory) and the SN50 (announced February 2026, shipping H2 2026). The company launched SambaNova Cloud in September 2024 as part of a pivot from hardware-focused training systems to cloud-first AI inference services. This strategic pivot, however, required a painful workforce reduction: in early 2025 SambaNova laid off approximately 77 employees—roughly 15% of its ~500-person workforce—to realign the team toward inference and cloud. The layoffs were followed by a period of financial pressure and strategic optionality-seeking. In late 2024, the company began exploring a potential sale, hiring an investment bank to manage the process; reported acquirers included large cloud providers and private equity. By late 2025, Intel—whose CEO Lip-Bu Tan chairs SambaNova—had signed a non-binding term sheet for an approximately $1.6 billion acquisition. Those talks ultimately stalled, and in early 2026 SambaNova abandoned the Intel deal and chose independence via the Series E recapitalization. Customers include Hugging Face, Meta, major AI labs, and SoftBank (first customer for the SN50 chip in Japan). SambaNova has also announced sovereign AI partnerships in Germany, the UK, Australia, Japan, and France. Current headcount post-layoff is estimated at roughly 400–425 employees; the post-Series E hiring trajectory is not publicly disclosed. [CO011, CO012, CO013, CO014, CO015, CO025]

Milestone Table
DateEventTypeAmount / Valuation / StatusParticipantsImplication
2017Company founded in Palo Alto, CaliforniafoundingRodrigo Liang, Kunle Olukotun, Christopher RéEstablished AI chip startup with Stanford academic roots and Oracle chip-exec leadership
2018-03Series A closed; GV first-ever AI chip investmentfinancing$56MGV (lead), Walden International (lead), Atlantic Bridge, Redline CapitalEarly validation; Lip-Bu Tan's Walden establishes governance influence from day one
2020Series B closed; BlackRock, Intel Capital joinfinancing~$250M at ~$2.5B valuationBlackRock, Intel Capital, GVIntel strategic tie deepened; institutional capital opens enterprise credibility
2021-04Series D closed; SoftBank leads at $5.1B peakfinancing$676M at $5.1B post-moneySoftBank Vision Fund 2 (lead), Temasek, GIC, BlackRock, Intel Capital, GV, Walden, WRVIPeak valuation; unicorn milestone; largest single round in company history
2023-09SN40L chip launched; fourth-generation RDUproductSambaNovaHBM3 + DDR5 + SRAM three-tier memory; TSMC 5nm; 5-trillion-parameter model support
2024-09SambaNova Cloud (SambaCloud) publicly launchedproductSambaNovaPivot from hardware-only to cloud-first inference services; token-level API for LLM access
2025-04 (approx)~15% workforce reduction; 77 employees laid offadverse~77 roles eliminated (~15% of ~500 headcount)SambaNovaPainful pivot from training to inference; financial stress signal; workforce now ~400–425
2025-Q4Intel acquisition discussions; non-binding term sheet signedadverseReported ~$1.6B indicative price (below $5.1B Series D)Intel, SambaNovaValuation decline from peak; governance conflict (Tan dual role); deal ultimately did not close
2026-01Intel acquisition talks stall; company seeks up to $500MadverseDeal abandoned; Bloomberg reports $300–500M fundraise soughtIntel, SambaNova, new investorsIndependence at risk; Series E process initiated after stalled M&A
2026-02Series E closes; SN50 chip announced; Intel collaborationfinancing$350M Series E + multi-year Intel collaborationVista Equity (lead), Cambium Capital (lead), Intel Capital, QIA, GV, Battery, T. Rowe Price, othersRecapitalization; fifth-gen chip unveiled; SoftBank first SN50 customer; record 2025 bookings cited

Dates for Series A–D and product launches are from primary sources; Series B amount and implied valuation are from Sacra, Tracxn, and TechZine secondary sources. The workforce reduction date is approximate (EE Times published April 25, 2025; event described as "this week"). Intel acquisition valuation (~$1.6B) is from EE Times reporting citing Bloomberg; not confirmed by either company. The "adverse" type is applied to events with negative financial or governance implications, consistent with the chapter milestone taxonomy.

[CO017, CO018, CO019, CO020, CO021, CO025]
FO001: Company Milestone Timeline

Key milestones from SambaNova's founding in 2017 through the February 2026 Series E and SN50 launch, showing the arc from formation to unicorn status, through workforce reduction and acquisition pressure, to recapitalization.

Series B amount (~$250M) and implied valuation (~$2.5B) are from secondary analyst and media sources; Series B date is approximate (2020). Intel acquisition indicative price (~$1.6B) is from EE Times citing Bloomberg and was not confirmed by either company. Workforce reduction date is approximate (published April 2025; event described as occurring "this week").

[CO017, CO018, CO019, CO021, CO025, CO026]

1.5 Exhibits

Chapter 02

02Market Analysis

2.1 Market Boundary and Definition

The AI chip and accelerator market encompasses specialized semiconductors designed for machine learning training and inference workloads across data centers, cloud platforms, and increasingly edge deployments. The market is not monolithic: definitional boundaries vary significantly across analysts, producing non-comparable estimates that must be interpreted with their scope stated explicitly. The largest definitional lens—adopted by Deloitte and WSTS—captures all AI-optimized semiconductors including logic processors, high-bandwidth memory (HBM), and networking silicon, yielding a 2026 estimate near $500B. A narrower lens, used by IDC for its "intelligent datacenter" segment (CPUs, AI accelerators, GPUs, custom ASICs, and networking silicon), arrives at $281B for that specific category within a total data center semiconductor revenue of $477B. The narrowest lens—merchant AI accelerator chips only (GPUs and alternatives sold externally)—is used by SiliconAnalysts and produces a $200B+ figure. The $118B figure cited by AllAboutAI for 2024 reflects the narrowest merchant-only GPU and AI chip category and does not include memory or networking. AI training workloads involve periodic, compute-intensive model development cycles; AI inference involves continuous, production-scale serving of model outputs to end users. By 2026, inference workloads account for approximately two-thirds of all AI compute cycles globally—up from one-third in 2023—making inference the dominant operational sub-segment by volume. Training still commands high per-accelerator spend (Nvidia's H100 and B200 lead for training) but represents a less frequent purchase event. Adjacent and included spend categories relevant to SambaNova's market context include: AI cloud inference services (SaaS layer), AI infrastructure systems and racks, enterprise on-premises AI deployments, and sovereign/government AI compute programs. Status-quo substitutes for enterprise AI inference include CPU-only inference, public cloud GPU services (AWS, Azure, GCP), and existing on-premises GPU clusters. [CM001, CM002, CM003, CM004, CM005, CM039]

AI Market Boundary — Included and Excluded Spend
Segment/CategoryIncluded SpendExcluded SpendPrimary Buyer/PayerRelevance to SambaNova
Merchant AI accelerators (GPU, ASIC, RDU)GPU/accelerator chips, RDU systems, rack-level AI computeCustom silicon built for internal use onlyEnterprises, cloud providers, neocloudsDirect—SN50 is a merchant AI accelerator
AI inference cloud servicesInference API and managed inference platform spendModel training cloud costs, edge inference SaaSEnterprises, developers, model providersIndirect—SambaCloud competes here
Enterprise AI infrastructure systemsOn-prem rack systems, air-cooled AI clusters, hybrid AI deploymentsHyperscaler internal capex, consumer edge devicesLarge enterprises, government, sovereign programsPrimary—SambaNova's core on-prem enterprise market
Sovereign / government AI computeDomestically-hosted AI infrastructure, government data center AIPublic cloud AI (unless with data-residency compliance)National governments, defense agencies, public sectorStrategic—SoftBank Japan is first SN50 sovereign deployment
Status-quo substitutesExisting on-prem GPU clusters, CPU-only inference, public cloud AI APIsN/AExisting enterprise IT buyers evaluating upgrade/switchMust displace—SambaNova must prove superior economics vs. incumbent GPU clusters

Segment definitions reflect analyst and company usage. Spending figures are not comparable across rows; market boundary definitions differ across analyst reports cited in TM002. SambaNova competes primarily in the merchant accelerator and enterprise AI infrastructure segments.

[CM001, CM002, CM003, CM004, CM005]

2.2 Market Sizing—TAM, SAM, and Multiple Analyst Lenses

Global AI chip and accelerator market estimates for 2026 span a 2–3x range depending on market boundary definitions, making a single TAM figure misleading. The appropriate sizing lens for SambaNova depends on the segment it targets: enterprise and sovereign on-premises AI inference hardware and systems, not the full semiconductor market. The broadest TAM estimate is Deloitte's updated $500B for 2026 AI chips globally, revised upward in spring 2026 from an initial $300B estimate after WSTS reported a December 2025 $175B global semiconductor market upward revision driven entirely by AI demand. IDC (April 2026) projects total semiconductor revenues at $1.29T in 2026, with data center semiconductor revenues at $477.1B—of which the "intelligent datacenter" segment (CPUs + AI accelerators + GPUs + ASICs + networking) is $281B. By 2030, IDC projects data center semiconductors will reach $843B and total semiconductor revenues $1.75T. AMD CEO Lisa Su (Reuters, November 2025) projected the AI data center chip market will exceed $500B by 2028 and grow to $1 trillion by 2030. This figure is not corroborated by IDC, Gartner, or other independent analyst firms for that time horizon, and appears to encompass a broader definition than merchant silicon alone—illustrating that vendor-stated TAM claims require boundary scrutiny. For the inference sub-market specifically, Fortune Business Insights estimates the AI inference market at $117.8B–$126.2B in 2026, growing at approximately 17–19% CAGR. MarketsAndMarkets projects a similar range ($119–$126B) for 2026. Inference accounts for roughly two-thirds of AI compute cycles and 80–90% of the lifetime cost of a deployed AI system, making it the dominant operational spend category even if training chips carry higher unit prices. SambaNova's addressable market—enterprise and sovereign on-premises and partner-cloud AI inference—is a subset of the $200B+ merchant accelerator market and the $120B+ AI inference market. The company does not currently serve the hyperscaler internal capex segment (where custom silicon dominates) or the consumer device edge market. A conservative SAM estimate for SambaNova's segment (enterprise inference hardware excluding hyperscaler captive) is $30–$60B globally in 2026, based on applying a ~20–30% enterprise fraction to the $200B accelerator market, with substantial uncertainty. [CM006, CM007, CM008, CM009, CM010, CM011]

TAM/SAM/SOM Sizing — Multiple Analyst Lenses (2026)
PublisherYearGeographyMarket Value ($B)CAGRMethodology / ScopeConfidenceLimitation
Deloitte2026Global~500~30% (AI chip)All AI-optimized chips incl. HBM and networking silicon; revised up from $300B after Dec 2025 WSTS upward revisionHighBroadest definition; includes memory and networking that SambaNova does not sell
IDC2026Global477 (data center semis); 281 (intelligent datacenter)~35–40% (data center)Data center semiconductor revenue; 'intelligent datacenter' = CPUs + accelerators + GPUs + ASICs + networking siliconHighTwo sub-totals not directly comparable; intelligent datacenter segment most relevant but still includes non-accelerator silicon
SiliconAnalysts2026Global200+~30%Merchant AI accelerator market only (excludes captive custom silicon); based on public earnings + TrendForce + Morgan StanleyMediumExcludes hyperscaler custom silicon which is a growing share; may undercount total AI hardware spend
AllAboutAI2030Global293 (2024: $118)33.2%Merchant AI chip market (narrowest definition); GPU and AI chip revenue onlyMedium2024 baseline of $118B; 2026 extrapolated ~$165B at stated CAGR; narrowest scope—excludes memory and networking
Fortune Business Insights / MarketsAndMarkets2026Global118–126 (AI inference only)17–19% (inference segment)AI inference market specifically—hardware, software, and services for model servingMediumInference sub-market only; not total chip market; includes software/services in some versions
AMD CEO (Reuters, Nov 2025)2028–2030Global500 (2028); 1,000 (2030)Not specifiedVendor-stated TAM claim from earnings call; exact scope not formally definedLowVendor claim without formal analyst methodology; no independent corroboration for $1T figure by 2030; likely broadest possible scope

All figures in USD billions. Market boundary definitions differ materially across publishers—direct comparisons are invalid without scope alignment. Confidence ratings reflect source independence and methodology transparency, not claim direction. The AMD $1T claim is included to preserve a disclosed contradictory estimate.

[CM006, CM007, CM008, CM009, CM010, CM011]
FM001: AI Chip Market — TAM/SAM/SOM Pyramid (2026, $B)

Three-tier market sizing showing SambaNova's addressable slice within the broader AI chip and inference markets.

TAM ($500B) uses Deloitte's broadest AI chip definition including memory and networking. SAM ($40–60B) is estimated as the enterprise + sovereign inference hardware fraction of the $200B+ merchant accelerator market, excluding hyperscaler captive silicon. SOM ($1–3B) reflects SambaNova's current bookings and near-term deployment capacity based on the $350M Series E production ramp; no public revenue figure is disclosed.

[CM006, CM008, CM011]
FM002: AI Chip / Inference Market — Analyst Estimate Range (2026, $B)

Wide dispersion in 2026 AI market estimates across analysts reflects fundamentally different market boundary definitions, not just forecast uncertainty.

Each row covers a different market scope definition; low/base/high reflect analyst confidence intervals and definition boundary uncertainty, not forecast error alone. Deloitte and IDC cover the broadest definitions; SiliconAnalysts covers merchant accelerators only; Fortune Business Insights covers AI inference specifically. All values in USD billions. Items are NOT directly comparable because they measure different market boundaries.

[CM038, CM044, CM006, CM007, CM008, CM009]

2.3 Buyer, User, and Payer Segmentation

AI inference infrastructure procurement is segmented across five distinct buyer archetypes, each with different budget ownership, purchasing criteria, risk tolerance, and switching costs. Hyperscalers (AWS, Azure, GCP, Meta) represent the largest absolute dollar buyers but are increasingly deploying captive custom silicon (Google TPU, AWS Trainium, Meta MTIA) that does not flow through the merchant chip market. Their procurement is led by infrastructure engineering teams, operates on multi-year supply agreements, and prioritizes throughput, power efficiency, and deep software integration. SambaNova does not currently target this segment. Large enterprises (financial services, healthcare, manufacturing, defense) are the primary addressable buyers for SambaNova. These organizations are running AI workloads at scale for fraud detection, clinical diagnostics, autonomous systems, and workflow automation, but face compliance requirements that make public cloud AI problematic. Azumo reports 87% of large enterprises implement AI solutions, but only 9% have achieved full AI maturity—indicating most are still in deployment ramp phases. Budget ownership typically sits with CTOs and CIOs, with procurement cycles of 6–24 months. Financial services, with 85–89% AI adoption rates, and healthcare, with 8x YoY usage growth, are priority verticals. Government and sovereign buyers are an accelerating segment. NTT DATA's 2026 Global AI Report (surveying ~5,000 senior decision-makers) found 95%+ of organizations consider private and sovereign AI important to strategy. Only 29% are prioritizing it concretely in the near term, but 96% are considering relocating AI infrastructure to specific regions due to geopolitical pressures. Forrester predicts half of G20 nations will mandate domestically tuned AI models for public-sector services by end-2026. SoftBank Corp.'s deployment of SambaNova's SN50 in Japan's sovereign AI data centers is a proof point in this segment. Neoclouds and AI inference service providers are a growing intermediary buyer: they purchase hardware (including SambaNova systems) and resell inference capacity to enterprise customers. The Intel-SambaNova collaboration explicitly targets this channel alongside direct enterprise sales. SMEs represent a lower-priority segment for SambaNova's current hardware model due to high capital intensity ($700K–$7M for on-premises AI clusters), though the SambaCloud service provides an entry point. [CM025, CM026, CM027, CM028, CM029, CM030]

Buyer and Segment Map — AI Inference Infrastructure
SegmentBuyerUserPayerWorkflow / AI Use CaseBudget OwnerAdoption Trigger
HyperscalerInfrastructure engineering teamInternal ML teams; external API customersCloud provider P&LLLM training and inference at billion-query scaleVP Infrastructure / CTOCustom silicon economics; NVIDIA supply scarcity
Large enterprise (FSI, healthcare, manufacturing)CIO / CTO office; IT procurementData scientists; application developers; operational staffBusiness unit + central IT budgetFraud detection, clinical AI, workflow automation, compliance analyticsCIO / Head of AICompliance requirements; data residency; production ROI from pilots
Government / sovereignGovernment IT agency; ministry of defense / digital ministriesCivil servants; intelligence analysts; military operatorsNational budget; defense procurementCitizen services AI, threat detection, decision support, sovereign infrastructureMinistry-level CIO / procurement agencyRegulatory mandate; sovereign AI policy; national security; vendor independence from US hyperscalers
Neocloud / AI inference service providerInfrastructure CEO/CTOEnterprise tenants; model providers; developersNeocloud operating budget funded by customer contractsMulti-tenant inference, model hosting, API servingCEO / CTODifferentiated performance economics vs. GPU-only neoclouds; SambaNova Intel partnership
SME / mid-marketCTO / IT managerDevelopers; business analystsDepartmental or company budgetChatbots, document analysis, lightweight code generationCTO / CFOSambaCloud subscription entry point; cost per token vs. OpenAI APIs

Budget ownership and adoption triggers are based on NTT DATA, Forrester, and Flexential survey data plus SambaNova press materials. Hyperscaler row reflects why SambaNova does not actively compete there. SME row reflects SambaCloud inference service rather than hardware sales.

[CM025, CM026, CM027, CM028, CM029, CM030]
FM003: Enterprise AI Buyer Segment Matrix

Key buying criteria, budget patterns, and SambaNova fit differ materially across the five primary buyer archetypes for AI inference infrastructure.

Budget sensitivity ratings (High/Medium/Low) are qualitative assessments based on NTT DATA, Forrester, and Flexential survey findings, not measured metrics. SambaNova fit ratings are author assessments based on product positioning and channel data from the SambaNova and Intel press materials.

[CM025, CM026, CM029, CM034, CM041]

2.4 Growth Drivers and Adoption Constraints

The most consequential growth driver for enterprise AI inference is the inference shift itself: as generative and agentic AI deployments scale from pilot to production, inference workloads compound in volume and cost. By 2026, inference is projected to represent 80–90% of the lifetime cost of a deployed AI system. The rise of agentic AI—autonomous, multi-step reasoning pipelines with sequential model calls—intensifies demand for low-latency inference specifically, because latency compounds multiplicatively across tool-calling sequences. Futurum's analyst assessment notes this creates an economic tension: GPU architectures optimized for training throughput struggle to deliver agentic inference speed at economics that enterprises can scale, creating the whitespace SambaNova targets. Sovereign AI regulation is a structural demand driver. The EU AI Act, Korea's AI Framework Act (effective January 2026), China's Generative AI Regulations, and US Executive Order 14179 collectively mandate data governance requirements that make fully cloud-based AI problematic for high-risk applications in regulated sectors. Polyglotsoft documents that 67% of enterprises cited sensitive data exposure as their top AI adoption concern in McKinsey's 2025 survey. CloudLatitude reports AWS launched its European Sovereign Cloud in 2026, and the US FedRAMP 20x initiative is finalizing security certification for federal cloud providers—confirming regulatory tailwinds for sovereign and on-premises deployments. On the constraint side, power availability has displaced budget as the primary barrier to scaling AI infrastructure. Flexential's 2026 State of AI Infrastructure Survey (350+ enterprise IT leaders) found 89% say reliable grid power influences AI deployment decisions, and 55% rank power cost differences as the top factor in AI workload location choices. AI data centers require 4x more power than the grid adds annually, creating physical deployment ceilings. The share of enterprises expecting measurable AI financial returns within one year dropped from 51% to 36% between 2025 and 2026 surveys, reflecting lengthening ROI timelines as infrastructure costs rise. CUDA ecosystem lock-in is the dominant switching-cost barrier for GPU alternatives. Nvidia's CUDA ecosystem spans 20+ years and 4M+ developers; every major ML framework is optimized for CUDA first. Switching to alternative architectures—including SambaNova's RDU—requires software re-engineering investment measured in months to years, not just hardware spend. Additionally, 62% of organizations have not moved AI projects beyond the pilot stage, creating uncertainty about which inference workloads will actually scale and warrant infrastructure investment decisions. Hyperscaler capex is a structural tailwind for the broader market but a competitive pressure for merchant silicon vendors: hyperscalers are expected to increase capex by approximately 40% in 2026 to ~$600B, but a growing share is directed toward captive custom silicon, directly reducing their addressable purchases from merchant chip vendors. US-China export controls have removed $5–10B in addressable Nvidia revenue, creating geopolitical complexity but also demand for non-US alternatives in some sovereign markets. [CM031, CM032, CM033, CM035, CM036, CM037]

Growth Drivers and Adoption Constraints
Driver / ConstraintDirectionTimingImplication for SambaNovaDiligence Ask
Inference workload surge (inference reaching 2/3 of AI compute by 2026)DriverCurrent—ongoing through 2030Expands addressable market for inference-specific hardwareVerify inference share of total AI compute from independent source
Agentic AI workload growth (sequential multi-turn reasoning)DriverCurrent and accelerating 2026–2028Creates specific latency demand that favors RDU over GPU throughput architecturesIndependently benchmark SN50 agentic workload latency vs. Nvidia B200
Sovereign AI regulation (EU AI Act, Korea AI Framework Act, G20 mandates)DriverCurrent—regulatory enforcement 2025–2026 onwardsExpands qualified buyer pool for on-premises and in-jurisdiction deploymentsTrack which sovereign programs have committed capex and timeline
CUDA ecosystem lock-in and software switching costsConstraintPersistent—measured in years to overcomeLimits SambaNova's ability to win workloads with deep CUDA optimizationMap which enterprise inference workloads are CUDA-agnostic vs. CUDA-dependent
Power availability as primary infrastructure constraintConstraintCurrent—grid interconnect delays up to 4 yearsLimits deployment site selection; SambaNova's air-cooled 10kW rack is a differentiatorVerify SambaNova's actual power density claims vs. comparable GPU racks
ROI uncertainty and extended payback timelines (1-year ROI expectations dropped from 51% to 36%)Constraint2025–2027Enterprises lengthening evaluation cycles; harder to close deals without proven production ROITrack customer case studies demonstrating measurable ROI from SambaNova deployments
Hyperscaler capex shift to captive custom siliconConstraintOngoing—accelerating 2026+Reduces addressable market among hyperscalers; forces focus on enterprise/sovereign segmentMonitor hyperscaler custom silicon capex as % of total AI infrastructure spend
US-China export controls on AI chipsMixedOngoingReduces Nvidia's China revenue; creates sovereign AI procurement in non-US-aligned markets where SambaNova could competeAssess SambaNova's ability to serve non-US sovereign markets under current export regulations

Driver/constraint direction is relative to SambaNova's market opportunity. Timing reflects current 2026 conditions and near-term trajectory. Data sources: Flexential (power, ROI), NTT DATA (sovereignty), Futurum/SiliconAnalysts (CUDA moat), IDC (capex trends), Forrester (regulatory).

[CM031, CM032, CM033, CM035, CM036, CM039]
FM004: Enterprise AI Adoption Funnel (2026, % of Large Enterprises)

Most enterprises are stuck in pilot or early deployment phases; only 9% have achieved full AI maturity, compressing near-term demand for premium inference infrastructure.

All values represent percentages of large enterprises (1,000+ employees) from Azumo/Deloitte synthesis. Stage boundaries are approximate; different surveys use different definitions of 'pilot' vs. 'production'. The 88% exploration figure overlaps with pilot—enterprises can be in both stages simultaneously across business units.

[CM034, CM035, CM033]

2.5 Nvidia Dominance and Whitespace for Alternatives

Nvidia commands approximately 75–80% of the AI accelerator market by revenue in 2026, down from a peak of ~87% in 2024 as the total market expands faster than Nvidia can capture. SiliconAnalysts projects Nvidia's FY2026 data center revenue at $150B+, with Blackwell (B200, GB200) as the primary growth driver. The H100 manufacturing cost is approximately $3,320 to produce but sells for $28,000—an 88.1% gross margin that funds a structural R&D and supply chain advantage competitors cannot easily match. Nvidia has secured approximately 60% of TSMC's CoWoS advanced packaging capacity, creating a supply allocation moat. AMD is the only meaningful merchant alternative, holding approximately 6–10% of AI accelerator revenue in 2026 with MI300X/MI355X products. Custom silicon from hyperscalers (Google TPU v5p/Trillium, AWS Trainium 2, Microsoft Maia 200, Meta MTIA v2) collectively accounts for $25–50B+ in 2026 but is not available for external sale. Intel's Gaudi 3 holds approximately 1–3% of the merchant market. In AI inference specifically, Nvidia's share is estimated at 60–75%—materially lower than its 90%+ training dominance—because inference workloads are more tolerant of alternative architectures, CPU-based inference is viable for smaller models, and economic optimization matters more in production than raw throughput. This inference whitespace is the primary commercial opportunity for SambaNova. SambaNova's SN50 chip (announced February 24, 2026) claims 5x maximum speed and 3x lower total cost of ownership versus Nvidia's Blackwell B200 on agentic inference workloads, based on internal benchmarking of models including Llama 3.3 70B, GPT-OSS 120B, and DeepSeek 671B. These claims have not yet been independently validated in production at the time of this report. Futurum's analyst assessment notes that "NVIDIA's inference software ecosystem, hyperscaler platform integration, and workload optimization for reasoning models create switching costs and inertia that specialized inference chips must overcome through demonstrable economic advantages." The Intel-SambaNova multi-year collaboration (announced February 24, 2026) extends SambaNova's reach through Intel's global enterprise, cloud, and partner channels, reducing the distribution constraint that has historically limited semiconductor startups. SoftBank Corp.'s deployment of SN50 in Japan's sovereign AI data centers validates the sovereign AI segment as a near-term revenue pathway, but broader enterprise adoption beyond sovereign deployments remains unproven. [CM014, CM015, CM016, CM017, CM018, CM019]

2.6 Exhibits

Chapter 03

03Competitors

3.1 Competitive Landscape Overview

SambaNova operates in a rapidly evolving AI infrastructure market where the dominant training-era framing—NVIDIA GPUs as the universal platform—is giving way to a more fragmented inference-era landscape. In 2026, at least three architectural bets compete for the inference workload: wafer-scale monolithic chips (Cerebras WSE-3), streaming language processing units (Groq LPU, now under NVIDIA), and reconfigurable dataflow units with tiered memory (SambaNova RDU). Simultaneously, incumbent GPU platforms led by NVIDIA Blackwell (B200) and AMD MI300X have closed the performance gap for many production inference workloads. Cloud hyperscalers—AWS Trainium3 and Google Cloud TPU Ironwood—offer captive inference infrastructure that is substantially cheaper per token but requires deep cloud commitment. A fourth layer of inference-as-a-service platforms (Together AI, Lambda Labs, Anyscale) runs on GPU clusters and competes at the API level with SambaCloud. The buyer's primary decision criteria in 2026 are inference throughput per user, time-to-first-token for agentic workloads, total cost of ownership, deployment model (cloud vs. on-premise), data-sovereignty requirements, and software ecosystem breadth. SambaNova is well-positioned on the deployment-model and sovereignty axes but faces acute pressure on ecosystem breadth and brand awareness relative to both NVIDIA and the two recently better-capitalized pure-play rivals.[CP001, CP002, CP003, CP004]

Competitor Profile Table
CompetitorCategoryScale / Funding (2025–2026)Target SegmentCore DifferentiationKey Limitation
SambaNova (SN50 RDU)Full-stack inference HW + cloud~$5B valuation (2021); exploring sale (Oct 2025)Enterprise, sovereign AI, national labsOn-prem air-cooled appliance; heterogeneous Intel/GPU blueprint; multi-model resident memoryCUDA ecosystem gap; thin brand awareness; capital constraint
Cerebras (WSE-3 / CS-3)Pure-play inference HW + cloudIPO May 2026 ~$27B; $510M revenue; 76% YoY growthHyperscale LLM inference, training, national labsWafer-scale 4T transistors; 1,000+ tps on 405B; CS-3 on-premHeavy OpenAI/G42 customer concentration; high capex
Groq (LPU)Purpose-built inference silicon (IP licensed to NVIDIA)$6.9B valuation; ~$20B IP licensing deal by NVIDIA (late 2025)Real-time/low-latency inference, developer APIDeterministic streaming; 840 tps on 8B; lowest TTFT in segmentIP absorbed by NVIDIA; standalone roadmap unclear
NVIDIA (H100 / B200 DGX)GPU incumbent (training + inference)Public; ~$3.2T market cap; ~80–90% AI HW market shareBroad enterprise, cloud, research, governmentCUDA ecosystem depth; B200 4x H100 throughput (FP4); DGX platform 9 U.S. gov institutionsHighest TCO for pure-inference workloads; B200 liquid-cooling requirement
AMD (MI300X / MI350)GPU alternativePublic; Instinct ~$3–4B revenue run-rateMemory-bound LLM inference, HPC192 GB HBM3 per GPU; 5.3 TB/s bandwidth; no-sharding on 70B+ modelsROCm software maturity lags CUDA; third-party library gaps
Intel Gaudi 3Accelerator alternativePublic (Intel Data Center Group)Cost-sensitive enterprises; Ethernet-native data centers~$125K for 8-chip system; open Ethernet fabric; open-source SynapseAISmaller ecosystem; Intel DCG restructuring uncertainty
AWS Trainium (Trn3)Hyperscaler custom siliconHyperscaler; Amazon publicAWS-native ML training and inference30–40% better price/perf vs P5 GPU instances; 4.4x Trn2 performance; SageMaker integrationAWS lock-in; Neuron SDK porting friction; no on-premise option
Google Cloud TPU (Ironwood / TPU 8i)Hyperscaler custom siliconHyperscaler; Alphabet publicGoogle Cloud AI workloads; frontier model serving42.5 ExaFlops/pod; 4x Trillium performance; powers Gemini at scaleGoogle lock-in; XLA/JAX adaptation required; no on-premise option
Together AICloud inference platform (GPU-based)~$1.25B valuation (2024); Series B fundedDeveloper teams, ML researchers, fine-tuningWidest open-model catalog; $0.03–$4.50/M tokens; fine-tuning APINo custom silicon; slower vs purpose-built inference HW; no free tier

Valuations as of most recently reported rounds or IPO filings. Market share estimates for NVIDIA are analyst consensus (2025). Revenue figures for Cerebras are from IPO filing (2025 actuals). SambaNova valuation reflects 2021 Series D; 2026 capital status unconfirmed.

[CP001, CP002, CP005, CP009, CP013, CP016]
FP001: Competitive Positioning Map — Inference Throughput vs. Enterprise/Sovereign Fit

Positions nine principal competitors on inference throughput (x-axis, 1=lowest to 10=highest) against enterprise and sovereign AI deployment fit (y-axis, 1=developer-only to 10=full enterprise/sovereign). SambaNova and Cerebras CS-3 occupy the high-enterprise, high-throughput quadrant; NVIDIA DGX B200 leads in ecosystem breadth; hyperscalers score low on sovereign fit.

Axis scores are ordinal, evidence-backed estimates, not precise numerical measurements. Throughput axis reflects relative tokens-per-second on 70B-class models from published benchmarks. Enterprise/sovereign fit axis reflects deployment model (on-prem capability, sovereign certifications, air-gap support). Points subject to change as vendors release new hardware generations.

[CP001, CP002, CP005, CP013, CP020, CP022]

3.2 Direct Inference Hardware Challengers — Cerebras and Groq

Cerebras Systems (Menlo Park, CA; founded 2016) and Groq represent SambaNova's closest architectural peers: all three built custom silicon expressly for inference rather than training. Cerebras' Wafer-Scale Engine 3 (WSE-3) is the most extreme bet—a 46,225 mm² die with 4 trillion transistors, 900,000 AI cores, and 44 GB of on-chip SRAM, enabling 1,000+ tokens per second on Llama 3.1 405B with sub-100 ms time-to-first-token. Cerebras completed its IPO on Nasdaq (ticker: CBRS) in mid-May 2026 at an initial valuation of approximately $26–27 billion on the back of $510 million in 2025 revenue (76% YoY growth) and a $20 billion compute deal with OpenAI. Its CS-3 on-premises system deploys in 16 RU and competes directly with SambaNova DataScale for sovereign and enterprise on-prem buyers. Customer concentration is a material risk: G42 and OpenAI together represent the majority of projected revenue through 2028. Groq's Language Processing Unit (LPU) architecture interleaves streaming compute and SRAM for industry-leading deterministic latency, achieving 840 tokens per second on Llama 3.1 8B and 394 tokens per second on Llama 3.3 70B with the lowest time-to-first-token in the segment. NVIDIA reported a $20 billion IP licensing deal with Groq in late 2025, incorporating LPU technology into its Rubin GPU series, which simultaneously validates the architectural thesis and removes Groq as a fully independent competitive threat. Both Cerebras and Groq target the same enterprise and research-lab buyers as SambaNova. Cerebras' wafer-scale approach is harder to replicate at commodity scale while Groq's IP absorption by NVIDIA partially reduces its differentiation as a standalone vendor. SambaNova's differentiated response is the heterogeneous SN50 + Intel Xeon 6 blueprint for agentic AI decode, a workload pattern Cerebras (focused on large-batch throughput) is less optimized for.[CP005, CP006, CP007, CP008, CP009, CP010]

Feature / Capability Matrix
CapabilitySambaNova RDUCerebras WSE-3Groq LPUNVIDIA DGX (B200)AMD MI300XIntel Gaudi 3AWS TrainiumGoogle TPU
Inference throughput (>400 tps on 70B)✓ (400–580 tps)✓ (2,000+ tps)✓ (394–840 tps)✓ (B200 cluster)✓ (memory-bandwidth optimized)Partial (competitive at mid-range)Unknown (workload-dependent)✓ (TPU pod scale)
Air-cooled on-premises deployment✓ (design requirement)Partial (CS-3 requires specialized power)✗ (cloud-only API)✗ (B200 requires liquid cooling)Partial (H100-era nodes air-cooled; B200 liquid)✓ (Ethernet-native, standard DC)✗ (cloud-only)✗ (cloud-only)
Sovereign / classified deployment✓ (SOC2 T2, ISO 27001; air-gapped capable)Partial (on-prem CS-3 possible)✗ (no announced sovereign program)Partial (classified gov contracts possible)Partial (air-gapped GPU clusters possible)Partial (open hardware, customer-managed)✗ (US sovereign cloud only)✗ (US sovereign cloud only)
Multi-model resident in memory✓ (three-tier memory; near-zero switch latency)✗ (single large model fits WSE-3)✗ (single model per deployment)Partial (GPU memory allows multi-model on B200)Partial (192 GB enables multi-model)PartialUnknownUnknown
Open-source model support (Llama, DeepSeek, etc.)✓ (Llama 4, DeepSeek R1, Qwen, MiniMax)✓ (Llama, Qwen families)✓ (Llama 4, Qwen3, GPT-OSS)✓ (all major models via CUDA)✓ (vLLM on ROCm)✓ (SynapseAI + vLLM)✓ (Neuron SDK + vLLM)✓ (JAX/PyTorch XLA + vLLM)
Fine-tuning capability✓ (DataScale fine-tune + SambaStudio)✓ (CS-3 training + fine-tuning)Partial (enterprise tier only)✓ (full training + fine-tuning)✓ (full training)✓ (SynapseAI training)✓ (Neuron SDK training)✓ (TPU training pods)
Agentic / multi-step workflow optimization✓ (SN50 decode + Intel Xeon 6 tool execution)Partial (throughput-focused; less optimized for agentic decode)✓ (compound AI system tools)Partial (GPU general purpose)PartialPartialPartialPartial (TPU 8i targeting)
Cloud API access✓ (SambaCloud)✓ (Cerebras cloud inference)✓ (GroqCloud)✓ (DGX Cloud + NGC)✓ (via cloud partners)✓ (via cloud partners)✓ (AWS EC2 Trn3)✓ (Google Cloud)

Capability assessments based on official product documentation, published benchmarks, and third-party analyst comparisons current as of May 2026. "Partial" indicates the capability exists but is limited by architecture, ecosystem, or deployment constraints. "Unknown" indicates insufficient public evidence.

[CP006, CP007, CP013, CP015, CP017, CP018]
FP002: Capability Coverage Map by Competitor

Matrix of eight key enterprise buyer criteria across eight competitors. SambaNova leads on air-cooled sovereign deployment and multi-model resident memory; NVIDIA leads on ecosystem breadth; hyperscalers are locked out of sovereign/classified use cases.

[CP006, CP013, CP015, CP017, CP026, CP027]

3.3 Incumbent GPU Platforms — NVIDIA, AMD, and Intel Gaudi

NVIDIA retains the dominant position in AI infrastructure: CUDA's developer ecosystem (millions of libraries, tools, and pre-trained integrations) remains the highest switching cost in the market. The DGX platform, featuring Blackwell B200 GPUs, delivers up to 4x higher inference throughput than H100 on FP4 workloads, with 192 GB HBM3e per GPU and approximately 3.35 TB/s bandwidth. NVIDIA serves 8 of the top 10 global telcos, 7 global pharma companies, 10 global car manufacturers, and 9 U.S. government institutions through its DGX platform, creating deep distribution entrenchment. The B200 has eliminated the performance premium that pure-play inference silicon once held on many mainstream LLM workloads, and the Rubin architecture (incorporating LPU elements from Groq) is projected to further close the gap in 2026-2027. AMD Instinct MI300X is the strongest GPU alternative: 192 GB HBM3 memory (5.3 TB/s bandwidth) allows running 70B+ parameter models without sharding, achieving high throughput with a lower memory-bottleneck than H100. AMD's MI350 series (CDNA 4 architecture) extends this lead in 2026. ROCm software compatibility continues to improve but still lags CUDA's third-party library coverage. Intel Gaudi 3 is the lowest-cost at-scale option: an 8-chip system is priced at approximately $125K including Ethernet networking, compared to $350K+ for an equivalent DGX H100 node. Gaudi 3 uses native Ethernet interconnects (Ethernet-first, not InfiniBand), making it easier to integrate into standard data center fabrics—a property shared with SambaNova's on-prem deployment model. However, Intel's data center group is under restructuring pressure and Gaudi's commercial momentum remains uncertain relative to the GPU incumbents. For enterprise buyers comparing SambaNova to GPU platforms, the key tradeoffs are: NVIDIA offers unmatched software ecosystem and training flexibility but higher TCO for pure inference; AMD offers large memory pools at competitive pricing; Intel Gaudi offers lowest upfront cost but smallest ecosystem.[CP013, CP014, CP015, CP016, CP017, CP018]

Cloud Inference API Pricing and Speed Comparison
ProviderHardware BasisInput Price ($/1M tokens)Output Price ($/1M tokens)Inference Speed (tps, 70B model)Free Tier
SambaNova Cloud (SambaCloud)SN50 / SN40L RDU~$0.06–$0.70~$0.70–$4.50400–580Limited credit (~100K tokens)
Cerebras Inference APICerebras WSE-3~$0.10–$0.80~$0.10–$6.00600–2,000+1M tokens/day free
Groq GroqCloudGroq LPU$0.05–$0.29$0.08–$0.79394–840500K–1M tokens/day free
Together AINVIDIA H100 / B200 (GPU)$0.03–$3.60$0.12–$4.50180–400None
AWS Trainium3 (Trn3 EC2)AWS Trainium3 (custom)~$0.30–$0.50 est.~$0.30–$0.50 est.Workload-dependentNone (SageMaker credit only)
Google Cloud TPU v5eGoogle TPU (custom)~$0.20–$0.30 est.~$0.20–$0.30 est.~2,175 (8-chip batch)None ($300 free credit)
SambaNova DataScale (on-prem)SN50 / SN40L RDUCapEx (undisclosed list price)N/A129 tps/user (405B model)N/A

Token prices are indicative list prices or analyst estimates as of May 2026; realized prices may differ via volume discounts or reserved instances. AWS and Google per-token estimates are derived from published chip-hour pricing and representative token outputs; they are not officially quoted per-token list prices. Speed benchmarks on 70B models are from provider claims and third-party inference benchmarks (costbench.com, jamesm.blog). DataScale on-prem pricing has not been publicly disclosed; row reflects deployment model.

[CP003, CP007, CP008, CP021, CP034, CP035]

3.4 Hyperscaler Custom Silicon — AWS Trainium and Google TPU

AWS Trainium and Google Cloud TPU represent the highest-volume segment of the non-NVIDIA inference market, but they are structurally inaccessible to on-premise buyers and impose deep cloud lock-in. AWS Trainium3 (3nm; 2.52 PFLOPS FP8 per chip) delivers 30–40% better price-performance than GPU-based EC2 P5e/P5en instances and is used by Anthropic, Databricks, and Decart for frontier model inference. The AWS Neuron SDK requires model porting and integrates tightly with SageMaker, EKS, and ECS, creating a cloud-native deployment that has no on-premise analogue. Trn3 UltraServers (up to 144 chips) target the highest-throughput training and reasoning-model inference workloads. Google Cloud TPU Ironwood (7th generation) achieves 42.5 ExaFlops per pod across 9,216 liquid-cooled chips and 4x better performance per chip over its predecessor Trillium. TPU 8i (upcoming) targets low-latency MoE inference with an 80% performance-per-dollar improvement over prior generations. Google TPUs power Gemini and all of Google's consumer AI applications at scale, giving Google unmatched production validation. The Google XLA ecosystem (JAX, TF) creates a meaningful porting barrier for PyTorch-native enterprise teams. For SambaNova, the hyperscalers represent both a competitive threat (for cloud inference buyers) and an indirect validation: enterprises that require data sovereignty, classified environments, or air-cooled on-premise deployments cannot use AWS Trainium or Google TPU—and that segment is exactly where SambaNova and its sovereign AI partnerships (Australia SCX, Germany Infecom, UK Argyll, Japan SoftBank) are positioned. Inference accounts for roughly two-thirds of all AI compute in 2026, making hyperscaler custom silicon a growing competitive pressure even as the sovereign niche insulates SambaNova from the most direct displacement.[CP020, CP021, CP022, CP023, CP024, CP025]

Moat Durability and Competitive Risk Register
SambaNova Moat ClaimPrimary ThreatSeverityEvidence / Mitigation Ask
RDU three-tier memory architecture (SRAM + HBM + DRAM) enables multi-model resident memoryNVIDIA B200 (192 GB HBM3e) and AMD MI350 increase GPU memory pool to near-parity on multi-model workloadsHighBenchmark SN50 multi-model switch latency vs. B200 NVLink pod; assess whether memory advantage holds at 100B+ param models
Air-cooled on-prem deployment compatible with existing enterprise data centersNext-gen GPU racks improving power density with direct liquid cooling becoming more prevalent in standard data centersMediumTrack data center modernization pace; assess whether air-cooled constraint erodes as liquid cooling becomes standard
Intel-SambaNova heterogeneous blueprint (GPUs prefill + RDUs decode + Xeon 6 tools)Intel Data Center Group under restructuring; Gaudi commercial trajectory uncertain; blueprint may lose Intel cost-sharing supportMediumEvaluate Intel commitment durability post-restructuring; assess viability of extending blueprint to AMD CPUs or alternative partners
Sovereign AI anchor deployments (Argonne DOE, SoftBank Japan, Australia, Germany, UK)Cerebras CS-3 and NVIDIA DGX also pursuing DOE national labs and sovereign programs; re-procurement riskMediumMap recompete timelines; assess multi-year contract length and switching cost per sovereign customer
Open-source model catalog on SambaCloud (DeepSeek R1, MiniMax M2.7, Llama 4, Qwen)Groq and Cerebras offer overlapping open-source model catalogs; multi-homing cost is a single API key changeHighTrack model exclusivity deals; assess SambaCloud model-latency advantage vs. competitors per model tier
Full-stack hardware + software + services integration (reduce deployment friction)Open-source inference runtimes (vLLM, SGLang) commoditize middleware; buyers may DIY on commodity GPUsHighMeasure deployment time-to-value vs. DIY GPU stack; assess whether integration premium is commercially defensible

Risk severity ratings are qualitative assessments based on the analyst's reading of competitive dynamics as of May 2026. These are diligence inputs, not conclusions.

[CP026, CP027, CP028, CP029, CP030, CP031]

3.5 Competitive Positioning, Moat Analysis, and Adverse Evidence

SambaNova's defensible competitive advantages rest on three pillars: (1) the three-tier memory architecture of its RDU (SRAM + HBM + DRAM) that keeps multiple large models resident and enables near-zero model-switching latency, a property critical for agentic workflows; (2) the full-stack integration of chip, system, software, and services that reduces deployment friction for sovereign AI programs; and (3) the April 2026 heterogeneous inference blueprint with Intel (GPUs for prefill, SambaNova RDUs for decode, Xeon 6 CPUs for agentic tool execution), which is designed for deployment in existing air-cooled data centers—avoiding the liquid-cooling infrastructure barrier that pure GPU-scale-out requires. Sovereign AI deployments (Argonne DOE, SoftBank Japan, SCX Australia, Infecom Germany, Argyll UK) create institutional anchor customers with multi-year procurement cycles that provide some revenue visibility. The adverse evidence is material. In October 2025, The Information reported that SambaNova was exploring a sale after failing to raise a new funding round; the company had been last valued at $5 billion in its 2021 Series D, and the inability to raise at or above that mark signals investor skepticism about the path to liquidity or standalone scale. SambaNova's CUDA-equivalent software ecosystem is nascent compared to both NVIDIA and the improving ROCm stack from AMD, creating integration risk for enterprise buyers with existing GPU-based workflows. NVIDIA's B200 performance improvements partially close the throughput gap that previously justified RDU hardware purchase. Commoditization risk from open-source inference runtimes (vLLM, SGLang) that abstract over hardware also reduces the middleware defensibility of the SambaNova software stack. Multi-homing is high in the cloud inference layer—SambaCloud buyers can switch to Groq or Cerebras APIs with a one-line change—making cloud API retention dependent on consistent performance leadership and pricing, not platform lock-in.[CP026, CP027, CP028, CP029, CP030, CP031]

FP003: SambaNova Competitive Durability — Key Performance Indicators

Snapshot of SambaNova's competitive positioning metrics as of May 2026, including benchmark performance, deployment certifications, sovereign partnerships, and moat durability factors relative to key rivals.

[CP002, CP005, CP009, CP011, CP024, CP028]

3.6 Exhibits

Chapter 04

04Financials

4.1 Revenue Model and Streams

SambaNova generates revenue through three primary streams: hardware sales of its DataScale systems (powered by proprietary Reconfigurable Dataflow Units, or RDUs), cloud-based AI inference API subscriptions, and professional services including deployment, data preparation, and model optimization. The company pivoted materially toward cloud and inference services in 2024 after spending its earlier years focused on training workloads and hardware-led engagements. The cloud API product, SambaNova Cloud, launched in September 2024, offers token-based billing ranging from $0.10 to $4.50 per million tokens depending on model complexity. Three tiers exist: a free tier for experimentation, a developer pay-as-you-go tier for latency-critical workloads on open-weight models, and a custom enterprise tier for high-throughput production deployments with dedicated capacity, SLAs, and volume discounts. Enterprise and sovereign contracts for on-premise DataScale hardware deployments continue alongside the cloud offering, with professional services comprising an estimated 25–33% of new customer engagements according to Sacra's company analysis. Revenue recognition is complex: hardware deals generate upfront or milestone-based revenue while cloud API subscriptions generate recurring token-consumption revenue. The precise split between hardware, cloud, and services is not publicly disclosed. Government and national laboratory customers — including Lawrence Livermore, Los Alamos, Argonne, and Oak Ridge National Laboratories — are material contributors, though no segment breakdown is available.[CI001, CI002, CI003, CI004, CI005, CI006]

Revenue Streams Table
StreamMechanismBilling UnitCurrent Status / ScaleRevenue QualityDiligence Ask
Cloud API (SambaNova Cloud)Token-based inference on SambaNova RDUs; free, developer pay-go, enterprise tiersPer million tokens ($0.10–$4.50)Launched Sept 2024; primary growth driver; ARR ~$100M+ mid-2025High (recurring, scalable, sticky with enterprise SLAs)Confirm ARR split between cloud vs. hardware; NRR data
Hardware (DataScale / SN-series chips)Sale or deployment of on-premise RDU systems to enterprises and governmentsPer system / contractLegacy revenue stream; pivoted away post-2024 but still active for sovereign/govMedium (lumpy, capex-dependent, customer funding cycles)Hardware backlog size, average deal size, payment terms
Professional ServicesData prep, model customization, deployment, and optimization consultingProject-based / hourlyEst. 25–33% of new customer engagements (Sacra)Medium (labour-intensive, non-recurring, but builds stickiness)Services attach rate, billable utilization, margin vs. cloud
Government / Sovereign AIOn-premise DataScale deployments at national labs, DOE, sovereign AI programsContract / multi-yearLLNL, LANL, Argonne, Oak Ridge; SoftBank Japan sovereign AIMedium-High (long-term, government budget-backed, but procurement cycles long)Government contract pipeline, renewal rates, revenue concentration

Revenue figures are third-party estimates or company-claimed; no audited breakdown is publicly available. Status as of Q1 2026 based on publicly announced information.

[CI001, CI002, CI003, CI004, CI005]
Pricing / Monetization Table
Plan / ProductList Price / UnitContract ModelTypical BuyerDiscount / UnknownSource
Cloud API Free Tier$0No commitment; rate-limitedDeveloper / experimentalNo discounts; limited quotaSambaNova official pricing (costbench.com verification May 2026)
Cloud API Developer (Pay-as-you-go)$0.10–$4.50 per million tokens (model-dependent)No commitment; metered consumptionStartup / developerVolume pricing likely; not publishedcostbench.com May 2026; llm-stats.com
Cloud API EnterpriseCustom quote (contact sales)Annual or multi-year SLA contract; dedicated capacityEnterprise / large accountsSignificant volume discounts expected; terms undisclosedSambaNova official; costbench.com
Hardware / On-Premise (DataScale)Multi-million-dollar per system (not publicly listed)Capital purchase or lease; includes SambaFlow softwareGovernment, national labs, large enterpriseCustom; government contract pricing subject to procurement rulesSacra; TechCrunch 2021; energy.gov NNSA announcement

List prices reflect publicly available API pricing as of May 2026. Hardware system pricing is not published and estimated to be in the multi-million-dollar range per system based on industry references. Realized enterprise pricing may differ materially from list rates due to volume commitments and negotiated terms.

[CI006, CI007, CI008]
FI001: Revenue Model Bridge

How SambaNova customer activities convert into revenue across cloud API, hardware, and professional services streams, flowing to gross profit.

Node weights and margin estimates are qualitative; precise gross margin by stream is not publicly disclosed. Cloud margin assumed higher than hardware based on industry benchmarks.

[CI001, CI002, CI003, CI007]

4.2 Revenue Traction and Growth

SambaNova achieved a significant revenue milestone in June 2025, reaching approximately $100M in annual recurring revenue. This followed a reported fourfold (4x) increase in ARR during calendar year 2024, representing a dramatic acceleration from the company's earlier pace. By February 2026, at the time of the Series E announcement, ARR was estimated to have grown to over $180M, implying roughly 80%+ year-over-year growth from 2025 to early 2026. In its Series E press release, SambaNova confirmed "record bookings and revenue as they closed out 2025," citing accelerating demand from financial services, telecommunications, energy, and sovereign deployments. SoftBank Corp. was publicly announced as the first customer for SambaNova's new SN50 chip, and additional sovereign-AI and enterprise deployments are expected to contribute meaningfully to 2026 bookings. Despite rapid top-line growth, SambaNova remains a private company with no audited financial disclosures. The ARR figures cited by data aggregators (Latka, Compworth, Tracxn) are estimates drawn from self-reported or inferred data, not audited results. Analysts tracking the company note revenue growth exceeding industry peers but also highlight the gap between revenue scale and capital deployed to achieve it.[CI009, CI010, CI011, CI012, CI013, CI014]

FI003: Financial Estimate Range

Source-backed and analyst-estimated ranges for SambaNova's key financial metrics as of Q1 2026, reflecting uncertainty from private-company opacity.

All ranges reflect third-party estimates and analyst inferences; no audited figures are publicly available. The wide bands reflect genuine information asymmetry. The valuation range captures the discrepancy between the official post-money and implied secondary/marked-down values.

[CI009, CI011, CI019, CI027, CI028, CI036]

4.3 Capital Structure and Funding History

SambaNova has raised approximately $1.49B in total venture capital across five reported primary rounds since its founding in 2017. The company's most recent primary raise was a $350M Series E in February 2026, led by Vista Equity Partners and Cambium Capital, with Intel Capital, Qatar's sovereign wealth fund (QIA), GV, Battery Ventures, and accounts advised by T. Rowe Price Associates also participating. This round was used to fund expanded manufacturing capacity for the SN50 chip and to scale cloud infrastructure. The valuation story is nuanced. The Series D in April 2021 was led by SoftBank Vision Fund 2 at a $5.1B post-money valuation. The company's SEC Form D filing confirms $677,999,515 raised in that round. SambaNova did not disclose a post-money valuation in its Series E press release; third-party data sources cite the post-money figure at approximately $4.8B, while BlackRock marked down its SambaNova position by 17% in late 2025, implying an effective value closer to $2.4B. Secondary market trades referenced by ainvest placed implied valuation as low as ~$2.24B at certain points before the Series E closed. The discrepancy represents a material down-round risk signal relative to the 2021 peak. Prior to the Series E, SambaNova had gone approximately five years without a primary capital raise, during which time the company reportedly struggled to close new funding at a favorable valuation, explored a sale to Intel at an estimated $1.6B (including debt), and worked with investment bankers to evaluate strategic options. The decision to raise the Series E rather than sell to Intel reflects confidence in the standalone path but also underscores the depth of financial pressure the company faced heading into 2026.[CI015, CI016, CI017, CI018, CI019, CI020]

4.4 Unit Economics and Capital Adequacy

SambaNova's gross margin structure is not publicly disclosed. The company operates a hardware-software-services stack, where hardware margins are typically lower than pure software margins and capital-intensive manufacturing creates significant cost pressure. Peer AI chip companies and custom-silicon vendors generally operate at hardware gross margins in the 40–60% range before scaling, with software and cloud services commanding substantially higher gross margins. SambaNova's blended margin is estimated by analysts to be below 50%, weighted down by hardware components and manufacturing overhead. Capital intensity is high. SambaNova has deployed approximately $1.49B in total raised capital since 2017, against estimated cumulative revenue substantially below that figure. The $350M Series E was described as funding "expanded manufacturing and cloud capacity," confirming continued capex obligations tied to chip production, datacentre infrastructure, and supply chain scale-up. Employee headcount was approximately 417 in late 2025, down from prior peak levels suggesting some cost discipline, but R&D and engineering spend remains material. Burn rate and cash position are not publicly disclosed. Based on Series E timing and management commentary about funding needs, independent analysts have estimated monthly burn in the $10–25M range. With $350M freshly raised and assuming similar burn levels, the company has an estimated runway of 14–35 months from the February 2026 close, though this depends heavily on revenue growth and capital deployment pace. Future financing dependency is expected: at current growth rates and capital deployment, a Series F or strategic partnership transaction is probable within the next 2–3 years.[CI025, CI026, CI027, CI028, CI029, CI030]

Unit Economics Table
MetricValue / EstimateConfidenceWhy It MattersDiligence Ask
ARR (annual recurring revenue, mid-2025)~$100MMedium (third-party estimate; company milestone implied)Top-line scale; signals product-market fit in inference marketConfirm with audited or investor-disclosed ARR figure
ARR (early 2026 estimate)~$180M+Low-Medium (analyst estimate; not company-disclosed)Signals continued growth trajectory ahead of Series ECompany ARR disclosure or board-level update in data room
Revenue growth rate (2024)~4x (approximately 300% YoY)Medium (reported by multiple independent sources)Validates product velocity and market pullConfirm growth base year; are comparables hardware or cloud ARR
Gross margin (blended)Not disclosed; est. 40–60% (industry proxy)Low (inferred from hardware-software mix and peer comps)Determines scalability and path to profitabilityAudited gross margin disclosure in data room
Monthly burn rate (est.)$10–25M / month (analyst estimate)Low (no disclosed figure; inferred from runway / capital raised)Determines cash adequacy and next financing timelineFull P&L including cash flow statement in data room
CAC / sales payback periodNot disclosedNot availableKey efficiency metric; particularly important given hardware sales cyclesCAC, payback, and NRR by segment in data room

All metrics except funding amounts are either third-party estimates, analyst inferences, or industry proxies. SambaNova does not publicly disclose unit economics. Null fields represent genuine data gaps requiring data-room disclosure.

[CI009, CI010, CI011, CI025, CI026]
Capital Adequacy Table
ItemValue / EstimateConfidenceNotes
Total capital raised (cumulative)~$1.49BHigh (multiple corroborated sources including SEC Form D)Series Seed through Series E; SEC Form D confirms $678M Series D alone
Latest round (Series E, Feb 2026)$350MHigh (official BusinessWire press release)Led by Vista Equity Partners and Cambium Capital; Intel Capital strategic investor
Est. cash on hand (post-Series E)Unknown (est. $200–350M available)Low (no disclosed balance; estimate assumes ~$150M pre-close + $350M raised minus closing costs)Burn rate and existing obligations will determine actual cash; not publicly disclosed
Est. monthly burn rate$10–25M / monthLow (analyst estimate; not company-disclosed)Based on headcount (~417), R&D intensity, capex needs, and fundraising cadence
Est. runway (from Feb 2026)~14–35 months (through mid-2027 to late 2028)Low (depends on burn and revenue offset)Range reflects best-case (burn declines as revenue grows) vs. worst-case scenarios
Planned use of Series E proceedsExpand SN50 manufacturing and cloud capacity; Intel collaborationHigh (company-stated in official press release)Intel collaboration manufacturing scaling; no capex amounts disclosed
Next-round trigger / financing dependencyLikely Series F or strategic partnership within 2–3 yearsMedium (inferred from capital intensity and growth trajectory)Prior 5-year gap between Series D and E should not recur; market conditions may force earlier raise

Capital adequacy assessment is largely estimated due to SambaNova's private status. Cash position is inferred; burn rate is analyst estimate; runway is scenario-based range. Total raised is corroborated by SEC Form D (Series D) and BusinessWire (Series E).

[CI015, CI016, CI017, CI027, CI028, CI029]
FI002: Unit Economics Bridge

Key inputs to SambaNova's unit economics — from customer deal to gross margin — with approximation notes where data is unavailable.

All internal cost nodes are estimated; SambaNova does not disclose COGS, gross margin, or OpEx breakdown. Hardware COGS estimated 40–60% of hardware revenue based on peer comps. Cloud infra cost estimated 20–40% of cloud revenue. Net loss is inferred from burn rate and revenue estimates.

[CI025, CI026, CI031]
FI004: Capital Intensity / Cash-Flow Map

Simplified waterfall of cumulative capital raised against estimated cumulative cash deployment, illustrating SambaNova's capital intensity since founding.

All estimates except total raised are rough analyst approximations. Revenue estimate builds from ~$100M ARR in 2025 and assumed lower earlier-year revenue. OpEx and capex based on headcount, chip R&D cycle, and comparable company burn profiles. This is illustrative, not audited.

[CI015, CI016, CI027, CI030]

4.5 Financial Verdict and Diligence Gaps

SambaNova's financial profile is that of a high-growth, deeply capital-intensive infrastructure company with positive revenue momentum but persistent opacity. Revenue quality is mixed: cloud API revenue is recurring and scalable, while hardware sales are lumpy and dependent on large customer wins. The 4x ARR growth in 2024 and reported "record bookings" at end-2025 are encouraging, but the company's inability to grow into its 2021 valuation — and the near-distressed sale process that preceded the 2026 Series E — raise legitimate concerns about long-term capital efficiency. Key financial risks include: (1) valuation uncertainty — the $4.8B official post-money contrasts sharply with BlackRock's $2.4B mark, creating ambiguity about true enterprise value; (2) concentration risk — SoftBank as the first SN50 customer and government labs as confirmed reference accounts suggest non-trivial customer concentration; (3) capital intensity — chip manufacturing, cloud buildout, and R&D require continuous large capital infusions; (4) competitive pricing pressure — token pricing as low as $0.10/M tokens suggests commoditization risk in cloud inference. Critical data that is unavailable and required for full underwriting: audited gross margin and EBITDA, unit-level economics (CAC, payback, NRR), revenue mix by stream, government contract share and renewal terms, and actual cash balance post-Series E. Until these are disclosed under a data room, financial modeling relies on third-party estimates with wide confidence intervals.[CI033, CI034, CI035, CI036, CI037, CI038]

Public Financial Gaps Table
Missing Private MetricImpact on AnalysisWhy UnavailableExact Diligence Path
Gross margin (by segment and blended)Cannot assess unit economics or path to profitability without itPrivate company; not reported in any public filingRequest audited P&L with segment gross margin breakdown in data room
Monthly / quarterly burn rate and cash positionCannot verify runway or financing risk without itNo SEC reporting obligation; company has not disclosedRequest monthly cash flow statement and trailing 12-month P&L in data room
Revenue mix (hardware vs. cloud vs. services %)Cannot assess revenue quality (recurring vs. one-time) or growth durabilityCompany guidance is narrative only; no segment disclosureRequest revenue disaggregation by stream and customer type for last 3 fiscal years
Government / sovereign revenue concentrationCannot assess customer concentration risk, renewal risk, or margin on gov contractsGovernment contract awards are public in aggregate (USAspending) but revenue not segmentedRequest customer concentration report and top-10 customer revenue share
Customer acquisition cost and net revenue retentionCannot model LTV/CAC ratio or upsell efficiencyNo public disclosure; not mentioned in any press materialsRequest CAC by segment (hardware, cloud, services), NRR, and cohort retention in data room

All items represent genuine data gaps that would be material to a full financial underwriting. They cannot be resolved from public sources and require data-room access under a signed NDA.

[CI033, CI034, CI035]

4.6 Exhibits

Chapter 05

05Product & Technology

5.1 RDU Hardware Platform and Chip Architecture

SambaNova's hardware foundation is the Reconfigurable Dataflow Unit (RDU), a purpose-built AI inference accelerator that replaces the instruction-set architecture (ISA) used by GPUs with a dataflow execution model. Instead of launching sequential GPU kernels that repeatedly load weights from external memory, the RDU configures a continuous processing pipeline aligned with the model's computational graph, reducing data movement and enabling operator fusion across entire transformer decoder layers in a single pass. The fourth-generation SN40L chip—announced 2023 and published at IEEE MICRO 2024—implements a three-tier memory hierarchy: 520 MB on-chip SRAM for immediate computation, HBM (64 GB per chip in the SN40L-16 configuration) for active model weights, and DDR DRAM (768 GB per chip) for capacity-tier storage that holds hundreds of models simultaneously. This architecture allows a 16-chip SambaRack to address roughly 12 TB of total DRAM plus 1 TB HBM, enabling trillion-parameter models—including Llama 3.1 405B and DeepSeek R1 671B at full 16-bit precision—without quantization. The IEEE Micro paper measuring Composition of Experts (CoE) workloads found that an eight-socket SN40L node achieves a 3.7× end-to-end speedup over a DGX H100 and a 6.6× speedup over a DGX A100, with model switching 15–31× faster than GPU baselines. The fifth-generation SN50, introduced February 2026, delivers five times the compute per accelerator and four times more network bandwidth than the SN40L. It supports up to 256 interconnected accelerators over a multi-terabyte-per-second fabric, enabling models up to 10 trillion parameters with up to 10 million token context lengths. The SambaRack SN50 packs 16 chips per rack at 20 kW, is fully air-cooled, and does not require liquid-cooling infrastructure—a significant operational advantage for existing data centers. SoftBank Corp. is the first confirmed SN50 customer, targeting deployment in Japanese AI data centers in H2 2026. Intel, which has taken a strategic investment as part of the Series E, is co-developing a heterogeneous inference blueprint in which Intel GPUs handle the prefill (prompt processing) phase and SambaNova RDUs handle the decode (token generation) phase. [CE001, CE002, CE003, CE004, CE005, CE006]

SambaNova Product Module and Asset Matrix
Module / ProductPrimary UserStatus / MaturityKey DifferentiationDiligence Gap
SN40L RDU chip (4th gen)Enterprise / government on-prem buyersGA — shipped since 2023, production deployments at Argonne, Softbank, Lawrence LivermoreThree-tier SRAM/HBM/DDR memory; 520 MB on-chip SRAM per chip; enables full-precision 405B inferenceIndependent wafer-level yield / procurement terms not disclosed
SN50 RDU chip (5th gen)Agentic AI inference deploymentsAnnounced Feb 2026; H2 2026 production target; SoftBank as first customer5× compute vs SN40L; 4× network bandwidth; 256-chip interconnect; 10T param capacityNo independent benchmark validation yet; production timeline unconfirmed
SambaRackData center operators, sovereign AI programsGA — 16 SN40/SN50 chips per rack; 10–20 kW, air-cooledAir-cooling with no liquid infrastructure; integrates with existing data center thermal envelopesRack-level pricing and lead times not publicly disclosed
SambaStack (on-prem full stack)Enterprises with data residency / compliance requirementsGA — includes hardware, SambaFlow software, model bundles; 90-day deployment SLAFull-stack from chip to pre-trained model; hot-swap model bundles at inference time; 4× energy savings vs GPUs claimedNo third-party security certifications (SOC 2, FedRAMP, ISO 27001) publicly listed
SambaCloud (public API)Developers, AI-native companies, enterprises exploring inferenceGA — launched Sept 2024; OpenAI-compatible; free tier + Developer + Enterprise tiers132 t/s on Llama 3.1 405B (full 16-bit); only provider offering DeepSeek R1 671B at full precision at production speedModel catalog depth (~10 models) and fine-tuning absence limit developer surface versus GPU-cloud alternatives
SambaManaged (managed cloud)Organizations wanting inference-as-a-service in controlled environmentsGA — available via AWS Marketplace; SambaNova-operated hardware at customer or partner facilityCombines on-prem data sovereignty with managed operations; 90-day deploymentUptime SLA terms, geographic availability, and pricing not fully public

Status reflects information available as of May 2026. SN50 production availability is vendor-stated H2 2026 and unconfirmed by independent sources. Pricing for SambaStack, SambaRack, and SambaManaged requires direct sales engagement and is not publicly listed.

[CE001, CE003, CE007, CE014, CE016]
Technology and Operating Architecture Table
Layer / ComponentRoleKey DependencyRisk
SN40L / SN50 RDU chipCore compute and inference execution; dataflow-based operator fusionTSMC fabrication (5 nm for SN40L; process node for SN50 not publicly confirmed)Foundry concentration at TSMC; geopolitical risk to advanced node allocation
On-chip SRAM (520 MB per SN40L chip)Holds active weights for immediate computation; eliminates repeated DDR/HBM fetches for fused operationsCustom SRAM cell design; tightly coupled to RDU microarchitectureSRAM density limits scalability of on-chip tier; future models may require larger SRAM budgets
HBM tier (64 GB per chip, SN40L-16 config)Active model weight buffer between SRAM and DDR; high-bandwidth access for layers in flightHBM supply from SK Hynix / Samsung / Micron; constrained by industry-wide HBM demandHBM supply constraints can affect system availability; SambaNova uses less HBM per chip than H100 by design
DDR DRAM tier (768 GB per chip, SN40L-16)Capacity-tier storage for hundreds of models simultaneously; enables microsecond hot-swap to HBMStandard DDR DRAM supply; commodity market, lower supply risk than HBMCache miss performance impact if model is not prefetched from DDR to HBM; eviction policy proprietary
SambaFlow compilerCompiles PyTorch / standard ML graphs to RDU dataflow programs; operator fusion; spatial layout optimizationMLIR compiler infrastructure; SambaNova proprietary compiler passesCustom compiler creates vendor lock-in; model portability from SambaNova hardware to GPU requires recompilation or fallback
SambaCloud API infrastructureOpenAI-compatible API gateway; rate limiting, auth, model routing, streamingThird-party hosting (SoftBank in APAC; colocation partners in US/EU)API outages tied to partner data center reliability; no public SLA for free/developer tiers

TSMC process node for SN50 is not confirmed in public documentation. HBM per-chip figures are for the SN40L-16 configuration per the Weicloud datasheet; SN50 HBM configuration is not publicly specified. SambaFlow compiler details are proprietary.

[CE001, CE004, CE005, CE009]
FE001: SambaNova Full-Stack Product Architecture

Five-layer architecture from RDU silicon to deployed AI applications, showing how SambaNova's vertically integrated stack differentiates at each tier.

SN50 specifications at chip layer are vendor-announced; independent confirmation pending production shipment. SambaCloud model catalog is as of May 2026 and changes frequently.

[CE001, CE003, CE007, CE012, CE034]

5.2 Software Stack, SambaCloud, and Deployment Modes

SambaNova's software layer—SambaFlow—is a compiler-driven stack that maps neural network computation graphs directly onto RDU hardware through spatial programming and operator fusion. Rather than scheduling individual GPU kernels, SambaFlow generates dataflow patterns tailored to each layer's memory access requirements, achieving high hardware utilization by keeping data in flight through fused operations. SambaFlow supports Red Hat Enterprise Linux and Ubuntu, and exposes a Python SDK and REST API compatible with the OpenAI client interface. Three product tiers address distinct buyer segments. SambaStack is the on-premises full-stack platform: SambaRack hardware plus SambaFlow software plus pre-loaded model bundles, delivered with 90-day deployment commitments and integrated model hot-swapping at inference time. SambaManaged is a turnkey managed service where SambaNova operates the hardware—either in the customer's data center or a partner facility—and is available via AWS Marketplace. SambaCloud is the public API at cloud.sambanova.ai, fully OpenAI-API-compatible, with no credit card required for signup and $5 in free credits for new accounts. The API supports streaming (SSE), function calling, JSON mode, and audio transcription (Whisper-Large-v3). SambaCloud's model catalog as of May 2026 is selective: approximately 10 models including DeepSeek V3.1 (131K context), DeepSeek V3.2, Llama 4 Maverick, Llama 3.3 70B, MiniMax M2.7, and gpt-oss-120b in high and low tiers. Fine-tuning is not offered as a public cloud product. The Accenture partnership (signed 2023) targets regulated enterprise buyers requiring model ownership, data governance, and the ability to export model weights—use cases where GPU-cloud providers typically do not guarantee data sovereignty. Sovereign AI deployments in Japan (SoftBank/SambaCloud APAC), Australia (SouthernCrossAI), Germany (Infercom), and the UK (Argyll) demonstrate the on-premises product in production. The Argonne National Laboratory ALCF AI Testbed—which includes both a legacy SN30 training cluster and a newer SN40L inference cluster of 16 RDUs—makes the system available to the scientific research community under the NAIRR Pilot. TEPCO Systems (Japan) has selected SambaNova for inclusion in Japan's NEDO Post-5G R&D project (April 2026). [CE012, CE013, CE014, CE015, CE016, CE017]

Customer Workflow and Use-Case Coverage Table
User Job / Use CaseCurrent Workflow PainSambaNova SolutionMeasurable BenefitLimitation
Enterprise large-model inference (405B–671B)GPU providers require quantization (FP4/INT8), reducing accuracy; single-user latency >100 sec for 405B on H100SambaCloud or SambaStack with SN40L/SN50 keeping full-precision weights in DDR tierLlama 3.1 405B at 132 t/s (full 16-bit); DeepSeek R1 671B at 231–255 t/s; no accuracy degradationContext window limits (131K for DeepSeek V3.1 standard); no fine-tuning API; model catalog limited to ~10 models
Agentic / multi-step reasoning workflowsGPU inference latency stacks up across agent turns; 1000-token response at 50 t/s takes 20 sec vs 1 sec at 1000 t/sSambaCloud API with fast token generation; SN50 agentic caching; multi-model resident memory70B model at 461 t/s enables agentic loop latency below human perception threshold; SN50 targets 895 t/sSN50 hardware not at full production availability as of run date; agentic orchestration requires external frameworks (LangChain, CrewAI)
Sovereign / regulated AI deploymentData cannot leave on-premises or national jurisdiction; GPU cloud APIs insufficient for GDPR, ITAR, or national AI requirementsSambaStack on-premises or SambaManaged in controlled facility; SambaCloud APAC hosted by SoftBank in JapanFull model ownership; exportable model weights; air-cooled hardware deployable in existing data centersNo public SOC 2 / FedRAMP / ISO 27001 certifications; compliance attestations must be obtained through direct audit
Scientific research / large-scale simulationHPC centers need fast inference on large foundation models for science workflows (climate, drug discovery, fusion)SambaNova DataScale SN40L inference cluster at ALCF AI Testbed; 16 RDUs for AuroraGPT fine-tuning and evaluationResearchers can rapidly evaluate and adjust AI models; instant model switching vs multi-minute GPU reloadAccess requires proposal submission; ALCF AI Testbed usage queuing and prioritization apply

Benefits are from vendor claims or vendor-sponsored benchmarks except where cited as Artificial Analysis independent data. Independent validation of agentic workflow latency improvements and SN50 claims is limited as of the run date.

[CE022, CE023, CE018, CE019]
Trust, Quality, and Compliance Controls Table
Control / CertificationStatusScopeGap / Diligence Ask
Data privacy (cloud API)Company-stated: API does not collect or log user promptsSambaCloud public API only; enterprise tier may have separate agreementsNo third-party attestation; requires trust in vendor's infrastructure controls; independent audit not publicly confirmed
On-premises data sovereigntySupported — SambaStack and SambaManaged deploy in customer-controlled facilities; model weights exportableAll on-prem products; air-gap deployment possible per vendor statementsAir-gap deployment procedures and tooling not publicly documented; requires vendor engagement
SOC 2 / ISO 27001 / FedRAMPNot publicly listed on sambanova.ai as of May 2026Unknown — not stated for cloud or on-prem productsSignificant gap for regulated enterprise and government buyers; must be obtained via direct diligence with SambaNova security team
EU GDPR compliance (sovereign deployments)Claimed — Infercom (Germany/Luxembourg) deployment described as GDPR and EU AI Act compliant per Infercom announcementsEU sovereign cloud deployments operated by InfercomGDPR compliance relies on Infercom as the data processor; SambaNova's own DPA terms not publicly available

Compliance information is based on public statements and partner announcements as of May 2026. Regulatory buyers should obtain formal compliance documentation and data processing agreements directly from SambaNova.

[CE020, CE038]
FE002: Enterprise Customer Deployment Workflow

How an enterprise customer evaluates, acquires, and deploys SambaNova inference infrastructure, from API trial to production on-premises.

Procurement timeline (months for infosec review) is stated by SambaNova SVP of Product in EE Times interview; not independently measured.

[CE014, CE016, CE017]

5.3 Performance, Benchmarks, and Technical Tradeoffs

SambaNova's strongest differentiation is sustained single-user inference throughput on large models (70B–671B parameters) at full 16-bit precision. Artificial Analysis, an independent benchmark aggregator, confirmed 132 output tokens/sec on Llama 3.1 405B at the September 2024 SambaCloud launch—the fastest speed for any provider on that model at that time. On the Llama 3.1 70B model, SambaNova measured 461 tokens/sec versus Cerebras at 445 tokens/sec and Groq at 250 tokens/sec at that benchmark window. For DeepSeek R1 671B, SambaNova reports 231–255 tokens/sec at full 16-bit precision; GPU-based providers on the same model average approximately 19 tokens/sec due to memory bandwidth constraints forcing quantization. The SN50 benchmarks (vendor-supplied, sourced to SemiAnalysis InferenceX) claim 895 tokens/sec/user on Llama 3.3 70B versus 184 tokens/sec on Nvidia B200. Three technical tradeoffs limit competitive comparability. First, all SambaNova performance figures represent single-user (batch=1) latency rather than aggregate throughput, which is the metric GPU-based systems optimize for (e.g., DGX H100 delivers 24,544 tokens/sec on Llama 2-70B in MLPerf batch throughput mode while delivering only ~20 tokens/sec per user). This is a structural consequence of SambaNova's dataflow architecture, which excels at low-latency sequential execution but does not publish equivalent batch throughput data. Second, benchmark methodology is predominantly vendor-controlled or sponsored; independent third-party validation beyond Artificial Analysis's API endpoint measurements is limited. SN50 claims cite SemiAnalysis InferenceX, a commercial benchmarking firm, rather than open-methodology results. Third, model catalog depth is narrow: SambaCloud offers approximately 10 models compared to 50–200 at GPU-cloud competitors, and does not support image or video generation. Developers requiring frequent model switching or fine-tuning as part of the workflow have no alternative within the SambaNova ecosystem. The GPU memory-bandwidth bottleneck—described by Cerebras CEO Andrew Feldman and confirmed in Databricks engineering benchmarks showing H100 tensor-parallel efficiency declining from ~60% at 2 GPUs to ~25% at 8 GPUs—is the structural condition SambaNova's three-tier memory is designed to exploit. At batch sizes encountered in production enterprise workloads (small to medium), this advantage is real. Whether it persists as Nvidia and AMD improve HBM capacity and software batching efficiency in Hopper/Blackwell successors is an open technical question. [CE022, CE023, CE024, CE025, CE026, CE027]

Product Generation and Roadmap Table
Date / StageProduct / MilestoneStatusImplicationSource
2017SambaNova Systems founded; SN10/SN20 first-gen RDU development beginsHistoricalEarly RDU generations established dataflow architecture fundamentalsCompany overview / press releases
~2020–2021SN30 RDU — training-focused cluster; deployed at Argonne ALCF AI Testbed as training clusterHistorical — in production; training use caseSN30 established national lab credibility; inference capability limited on this generationArgonne ALCF press release
2023SN40L RDU (4th gen) — inference-optimized chip with three-tier memory; DataScale SN40L systems ship to enterprise and government customersGA — in productionEnabled full-precision large-model inference; benchmark differentiation vs H100 establishedIEEE Micro paper; SambaNova datasheet; BusinessWire cloud launch announcement
Sept 2024SambaNova Cloud (SambaCloud) public API launch — Llama 3.1 405B at 132 t/s; 70B at 461 t/s; free tier, Developer, Enterprise tiersGA — in productionOpened developer market; independently benchmarked by Artificial AnalysisBusinessWire press release Sept 2024
Feb 2026SN50 RDU (5th gen) announced — 5× compute vs SN40L; 4× network bandwidth; 256-chip interconnect; 10T+ params; $350M Series E closedAnnounced — H2 2026 shipment target; SoftBank as first customerNext-generation agentic inference capability; production availability unconfirmed at run dateBusinessWire SN50 press release Feb 2026; SambaNova press page
Apr 2026Intel-SambaNova heterogeneous inference blueprint — GPUs for prefill, RDUs for decode, Xeon 6 CPUs for agentic toolsAnnounced — H2 2026 deployment targetFirst production-ready multi-vendor heterogeneous inference architecture; expands addressable market via Intel channelsBusinessWire Intel Blueprint Apr 2026
H2 2026 (target)SN50 commercial shipments; Intel-powered SambaCloud expansion; SambaRack SN50 general availabilityPlanned — not yet confirmedCritical milestone for revenue growth and SN50 benchmark validation in productionVendor stated; not independently confirmed

SN10/SN20 generation dates are approximate; SambaNova has not published a public chip-generation chronology. SN50 production dates and Intel blueprint deployment timeline are vendor-stated targets as of May 2026.

[CE003, CE007, CE023, CE036]
FE004: Product Capability and Maturity Matrix

Maturity assessment across five capability dimensions for SambaNova's three core product tiers.

Maturity ratings are qualitative assessments based on public documentation, benchmarks, and analyst reviews as of May 2026. 'High' = production-ready with independent evidence; 'Medium' = functional but gaps; 'Low' = missing or unverified.

[CE028, CE037, CE038]

5.4 Developer Ecosystem and Integration

SambaNova maintains an active open-source developer footprint for a hardware-focused company. The ai-starter-kit repository on GitHub provides open-source Python examples organized into four categories: Data Ingestion and Preparation, Model Development and Optimization, Intelligent Information Retrieval, and Advanced AI Capabilities. Example kits cover enterprise knowledge retrieval (RAG), benchmarking, financial assistants, function calling, custom chat templates, and multimodal knowledge retrieval. The kit supports both SambaCloud API access (via SAMBANOVA_API_KEY) and on-premises SambaStack endpoint integration. The official sambanova Python SDK on PyPI (pip install sambanova) requires Python 3.9+ and exposes both synchronous and asynchronous clients via httpx, with aiohttp as an optional high-concurrency backend. The SDK covers chat completions, a Responses API, streaming (SSE), file uploads (audio transcription), typed request/response models via Pydantic, automatic retries with exponential backoff, and a full error hierarchy. The package version at time of authoring is not explicitly stated in public PyPI docs but the project is actively maintained. SambaNova's HuggingFace organization (sambanovasystems) hosts 32 model checkpoints including SambaLingo multilingual variants (Arabic, Turkish, Hungarian, Thai at 7B and 70B scales) and a draft model for the QwQ-0.5B-SFT architecture. Integration ecosystem breadth is substantial relative to SambaNova's size: LangChain (langchain-sambanova), LlamaIndex, CrewAI, AutoGen, OpenRouter, n8n, AWS, and approximately 50 third-party integrations are listed in the AI Starter Kit documentation. OpenAI-compatible endpoints mean existing codebases can switch inference providers by changing a base URL and API key without rewriting client code. Rate limits and context windows vary by tier (Free, Developer, Enterprise), with Enterprise offering dedicated support and higher rate limits for production workloads. [CE031, CE032, CE033, CE034, CE035]

FE003: Critical Dependency Map

Key external dependencies for SambaNova's chip supply, manufacturing, cloud infrastructure, and software ecosystem.

TSMC as SN40L foundry is referenced in IEEE Micro paper and industry reporting; SN50 foundry and process node not confirmed in public documentation.

[CE004, CE005, CE039]

5.5 Technology Roadmap, Product Gaps, and Strategic Risks

SambaNova's product roadmap centers on three near-term deliverables: full SN50 hardware production and customer shipments (H2 2026), Intel-SambaNova heterogeneous inference clusters targeting enterprise and sovereign deployments (H2 2026), and expanded SambaCloud capacity on SN50 silicon. The Intel collaboration announced April 2026 specifies that Xeon 6 processors will serve as both the host CPU and the "action CPU" for agentic tool execution and code compilation, while Intel GPUs handle prefill and SambaNova RDUs handle decode—a three-way heterogeneous architecture that has no production deployments yet as of the run date. Material product gaps identified from independent review include: (1) No public fine-tuning API—customers requiring custom model adaptation must use a GPU cloud for fine-tuning and then switch to SambaNova for inference, introducing pipeline friction and multi-vendor dependency. (2) Narrow model catalog breadth (~10 models versus 50–200 at GPU competitors), with no image, video, or text-to-speech generation. (3) Context window ceilings: the standard DeepSeek V3.1 offering is limited to 131K input context with 7K completion tokens; the extended-context variant (DeepSeek V3.1-cb) has 32K completion tokens but only 32K input context, constraining long-document and agentic use cases. (4) SN50 production hardware is not yet fully commercially available at the run date. (5) No public compliance certifications (SOC 2, FedRAMP, ISO 27001) are listed on the SambaNova website; security posture relies on customer-facing controls (data-not-logged policy, on-premises deployment options) rather than third-party attestations. Strategic risks include: (a) SambaNova's enterprise-focus sales model involves long procurement and security review cycles, which limits scalability relative to GPU clouds and developer-friendly API competitors; (b) benchmark claims for both SN40L and SN50 are predominantly vendor-controlled, and independent third-party validation remains limited; (c) Nvidia's continued improvement in memory bandwidth (Blackwell B200: 192 GB HBM3e, 8 TB/s bandwidth) may narrow the inference latency gap as GPU software batching improves; (d) the Intel collaboration creates a dependency on Intel supply chain and sales execution for a significant portion of the enterprise go-to-market. [CE036, CE037, CE038, CE039, CE040, CE041]

5.6 Exhibits

Chapter 06

06Customers

6.1 Customer Segments and Adoption Overview

SambaNova's customer base spans five broad segments: (1) U.S. Department of Energy and NNSA national laboratories, which account for the largest cluster of named, production-grade deployments; (2) academic and government high-performance computing centers such as TACC and RIKEN; (3) global cloud and sovereign AI channel partners (SoftBank Japan, OVHcloud, SCX Australia, Argyll UK, Infercom Germany); (4) enterprise professional-services firms including Accenture and financial institutions such as OTP Bank; and (5) developer API users who access SambaNova Cloud's inference endpoints directly. The national-lab and research-HPC segment is the best-evidenced cohort, with multiple official DOE/NNSA announcements confirming production-grade DataScale installations. The channel-and-sovereign-AI segment has accelerated sharply since mid-2025, with three sovereign AI cloud announcements (SCX, Argyll, Infercom) in October 2025 and OVHcloud's flagship AI Endpoints partnership announced in November 2025. The enterprise and financial-services segment is real but opaque: SambaNova marketing references "Fortune 500" accounts and financial-services deployments, yet no Fortune 500 client has publicly acknowledged an SN-series production installation. The developer API segment is proven by the SambaNova Cloud product's existence, free tier, and listed customers on the AWS Marketplace, but usage metrics remain undisclosed.[CU001, CU002, CU003, CU004, CU005]

Customer Segment Overview
SegmentRepresentative Customers / EvidencePrimary Use CaseDeployment ModeRevenue / Strategic ValueKey Evidence Gap
DOE / NNSA National LabsArgonne (ALCF), Oak Ridge (ORNL), LLNL, LANLAI inference for science; cognitive simulation; HPC offloadOn-premises DataScale hardwareLargest named hardware revenue cohort; contract values undisclosedContract dollar values not public; renewal/expansion not confirmed
Academic HPC CentersTACC (UT Austin / NSF); RIKEN (Japan / Fugaku)Scientific inference integration; Fugaku-LLM hostingOn-premises DataScale / SambaNova SuiteNSF-funded; strategic for NAIRR and U.S. AI research infrastructureMulti-year contract terms undisclosed
Cloud & Sovereign AI ChannelSoftBank Japan; OVHcloud; SCX (AU); Argyll (UK); Infercom (DE)Fast inference API; sovereign AI cloud for enterprises and public sectorManaged hardware in partner data centers (SambaManaged / SambaRack)High strategic value; revenue sharing model not disclosedRevenue share terms; SLA performance data not public
Enterprise Professional Services / SIsAccenture; Saudi Aramco (MOU)Contact Center Intelligence; Document Intelligence; industrial AIOn-premises or hybridTier-1 SI channel multiplier; Aramco MOU not confirmed as production deploymentNamed end-client deployments through Accenture not disclosed
Financial ServicesOTP Bank (Hungary); unnamed European banks (company-claimed)Language models for CEE languages; risk modeling; fraud detection (claimed)On-premises AI supercomputerLimited verified examples; broader FS penetration unconfirmedNamed Western bank customers; NRR data
Developer API (SambaNova Cloud)Blackbox.AI; Argyll Data Development; AWS Marketplace usersLLM inference API; agentic AI workflows; open-source model accessCloud API (pay-as-you-go)Growing but revenue scale not disclosedActive user count; monthly token consumption; churn rate

Revenue/strategic-value column is qualitative; no contract values or revenue splits are publicly disclosed. Deployment mode reflects evidence from press releases and product pages. "Company-claimed" entries lack independent corroboration.

[CU001, CU002, CU016, CU019]
FU001: SambaNova Customer Journey Map

Customer segments, primary adoption surfaces, and expansion signals across SambaNova's five identified buyer cohorts.

Journey stages are inferred from press-release evidence and product documentation; formal funnel metrics are not publicly available.

[CU001, CU002, CU003, CU004]

6.2 Government and National Laboratory Customers

The U.S. government and DOE national-laboratory segment provides SambaNova's most credible and independently verifiable customer proof. A formal strategic partnership agreement between DOE/NNSA, Lawrence Livermore National Laboratory, and Los Alamos National Laboratory, announced jointly with SambaNova, established the first multi-lab commitment and anchored SambaNova's federal footprint. LLNL integrated SambaNova DataScale (SN10 RDUs) into the NNSA's Corona supercomputing cluster for cognitive-simulation work in inertial confinement fusion and COVID-19 drug discovery; LANL integrated the same platform into its Darwin cluster for quantum-chemistry modeling. Argonne National Laboratory (ALCF, DOE Office of Science) is SambaNova's most publicly active reference customer: it originally deployed DataScale SN30 hardware and in late 2024 expanded with a new SN40L inference cluster containing 16 RDUs, supporting AuroraGPT development and open-science access through the NAIRR Pilot. Oak Ridge National Laboratory (ORNL) selected SambaNova Suite with SN40L and Composition of Experts (CoE) in November 2024 for parallel scientific inferencing across its AI for Science portfolio, leveraging energy savings versus the Frontier supercomputer. Texas Advanced Computing Center (UT Austin, NSF-funded) deployed SambaNova Suite in November 2024 as its dedicated inference platform for scientific models trained on Frontera; TACC is also home to the upcoming NSF Leadership-Class Computing Facility (LCCF) linked to NAIRR. RIKEN Center for Computational Science (Japan) adopted SambaNova DataScale and integrated Fugaku-LLM into SambaNova's Samba-1 CoE platform in 2024. Carahsoft has listed SambaNova on federal, state, and local government contracts to facilitate agency procurement. Procurely data shows 3 state-level awards totaling approximately $2.5 million in recorded value, likely underrepresenting total government revenue given direct lab-level procurement channels.[CU006, CU007, CU008, CU009, CU010, CU011]

Customer Adoption Trajectory
MetricValue / StatusDate / PeriodSourceConfidenceImplication
Named national-lab / HPC deployments (confirmed)5 (Argonne, ORNL, TACC, LLNL, LANL)2021–2024Official DOE/NNSA press releases; individual lab announcementsHighAnchors government segment credibility; limited revenue visibility
Sovereign AI channel partnerships3 (SCX AU, Argyll UK, Infercom DE)October 2025SambaNova press release via HPCwireHighExpands addressable market beyond U.S. government
APAC cloud partner deployment (SoftBank)SambaNova Cloud in SoftBank Japan AI data center; first SN50 customerMarch 2025 (initial); February 2026 (SN50 expansion)BusinessWire press releases (both dates)HighDemonstrates ongoing expansion and strategic partnership depth
European cloud partner (OVHcloud)AI Endpoints powered by SambaNova; 99.8% uptime SLANovember 2025 announcement; 2026 service launchOVHcloud corporate press releaseHighAdds major EU cloud distribution channel
Government contract awards (Procurely)3 state-level awards; ~$2.5M recorded valueAs of May 2026Procurely.app federal/state awards databaseMediumLikely understates total government revenue; direct federal lab procurement not captured
Developer API availabilitySambaNova Cloud on AWS Marketplace; free and paid tiersSeptember 2024 launchBusinessWire SambaNova Cloud launch; eesel.ai reviewHighDeveloper adoption surface created; usage metrics not public
Layoffs indicating customer-mix reset77 employees (~15% of workforce) cut to pivot from training to inferenceApril 2025Data Center Dynamics; EE TimesHighTraining-focused customer base was insufficient; inference pivot underway

Dollar values are from Procurely's state/local contract database and likely exclude directly awarded federal lab contracts. Developer API user counts are not publicly disclosed. Confidence ratings reflect independence and specificity of source.

[CU008, CU009, CU017, CU018, CU024, CU025]
Named Customer Proof Table
CustomerSegmentDeployment / Use CaseProduction vs. PilotDocumented OutcomeLimitation / Gap
Argonne National Laboratory (ALCF)DOE / NNSA (Office of Science)SN40L inference cluster (16 RDUs) in ALCF AI Testbed; supports AuroraGPT; open science via NAIRR PilotProduction (expansion of existing SN30 training cluster)Named lab-director quotes; open-access inference for research community; AuroraGPT LLM inferenceContract value not disclosed; energy-efficiency benchmarks are self-reported
Oak Ridge National Laboratory (ORNL)DOE (Office of Science; Frontier supercomputer)SambaNova Suite SN40L + CoE; parallel multi-model inference across scientific domainsProduction (announced Nov 2024)Named Associate Lab Director and Director of AI Programs quotes; claimed energy savings vs. FrontierQuantified energy savings not independently verified
Texas Advanced Computing Center (TACC)Academic HPC / NSF (UT Austin)SambaNova Suite as dedicated inference platform for NSF LCCF / NAIRR; scientific-model hostingProduction (announced Nov 2024)Named Executive Director quote; always-on model hosting for researcher inferenceRevenue / contract value undisclosed
Lawrence Livermore National Laboratory (LLNL)DOE / NNSADataScale SN10 integrated into Corona cluster for cognitive simulation; ICF fusion; COVID-19 drug designProduction (initial NNSA partnership agreement)NNSA official announcement; LLNL CTO and computer-scientist named quotes; 5× speedup claimed vs. GPUEarly-generation hardware (SN10); current-generation upgrade status unknown
Los Alamos National Laboratory (LANL)DOE / NNSADataScale integrated into Darwin heterogeneous cluster; quantum-chemistry / DFT modelingProduction (NNSA partnership)NNSA official announcement; up to 5× speedup potential vs. GPU for quantum chemistryUpgrade to SN40L/SN50 not confirmed; ongoing utilization undocumented
SoftBank Corp. (Japan)Channel / Sovereign AISambaNova Cloud on SoftBank AI data center; fast inference for APAC developers; Swallow/Llama/Qwen models; first SN50 customerProduction (March 2025); SN50 deployment planned 2026CEO-level bilateral press releases; first SN50 customer designation; SoftBank VP named quoteRevenue terms and developer uptake metrics not disclosed
OVHcloudChannel / Sovereign AI (Europe)AI Endpoints powered by SambaStack RDUs; real-time and batch inference; 99.8% SLA; France deployment by end-2025Production (launched end-2025; announced Nov 2025)Named OVHcloud CEO and SambaNova CEO quotes; EU-sovereign framing for regulated industriesRevenue share terms; live customer usage data not disclosed
OTP Bank (Hungary)Financial ServicesAI supercomputer co-built with OTP Group and ITM; GPT-3-level models for CEE languagesProduction (named customer per CB Insights)Named executive quote from OTP Bank head; CB Insights documentationNo outcome metrics (model performance, user adoption) in public domain
AccentureEnterprise SI / Professional ServicesSambaNova Suite deployed internally; Contact Center Intelligence + Document Intelligence for enterprise clientsProductionSambaNova blog + Data Center Dynamics; enterprise-grade governance, auditability, model ownershipEnd-client names not disclosed; scale of deployment not public
RIKEN Center for Computational Science (Japan)Academic Research HPCSambaNova DataScale; Fugaku-LLM in Samba-1 CoE on SN40L for research and Society 5.0Production (DataScale adopted Mar 2023; Fugaku-LLM integrated May 2024)RIKEN official announcement; RIKEN Director Matsuoka named quote at ISC24Usage metrics; training vs. inference workload split not disclosed

"Production vs. pilot" reflects press-release language and official announcements; independent operational audits have not been conducted. Outcome metrics are self-reported by customers unless otherwise noted.

[CU006, CU007, CU008, CU009, CU010, CU011]
FU002: SambaNova Adoption and Deployment Funnel

Estimated relative scale of customer movement from named logos through confirmed production deployments to publicly documented expansion events.

Funnel values are estimated counts based on public evidence as of May 2026; actual signed-contract count or ARR is unknown. The funnel is illustrative of evidence quality rather than exact revenue progression.

[CU005, CU028, CU029]

6.3 Enterprise, Channel, and Sovereign AI Customers

Outside the DOE ecosystem, SambaNova has built a distinct set of enterprise and channel-partner customers. Accenture deployed SambaNova Suite to enable enterprise generative AI solutions — Contact Center Intelligence and Document Intelligence — for its clients, positioning SambaNova as the on-premises and sovereign AI backbone for large regulated organizations. SoftBank Corp. (Japan) deployed SambaNova Cloud in a new AI data center in Japan in March 2025, offering fast inference to Japanese and APAC developers via the SambaNova Cloud API; SoftBank was also designated the first customer for SambaNova's new SN50 chip announced in February 2026. OVHcloud, Europe's largest cloud provider, selected SambaNova in November 2025 to power its flagship AI Endpoints service with SambaStack hardware and RDU technology, targeting financial-trading, cybersecurity, industrial-automation, and logistics use cases across the EU. OTP Bank (Hungary, Central and Eastern Europe) deployed an AI supercomputer co-built by OTP Group, ITM, and SambaNova to train GPT-3-level language models for CEE regional languages. Saudi Aramco signed a memorandum of understanding with SambaNova to explore AI capabilities and infrastructure deployment aligned with Vision 2030. Three sovereign AI cloud partnerships — SCX (Australia), Argyll (UK), and Infercom (Germany) — were announced in October 2025, establishing SambaNova-powered inference clouds running on SN40L systems at 10 kW per rack with renewable energy. On the developer API side, SambaNova Cloud is available on AWS Marketplace with consumption-based pricing and is used by companies such as Blackbox.AI (developer tools), Argyll Data Development (energy sector), and other developer-oriented firms documented by CB Insights.[CU016, CU017, CU018, CU019, CU020, CU021]

Retention, Repeat Usage, and Customer Satisfaction
MetricValue / StatusSegmentConfidenceDiligence Ask
Repeat / multi-generation hardware upgradeArgonne upgraded from SN30 (training) to SN40L (inference) — confirmed expansionDOE National LabsHighConfirm whether LLNL/LANL have upgraded from SN10 to SN40L or SN50
SoftBank multi-stage deploymentExpanded from SambaNova Cloud hosting to first SN50 customer designationChannel (APAC)HighRevenue value of SoftBank relationship; contracted volume commitments
Net Revenue Retention (NRR)Not publicly disclosedAll segmentsUnknownRequest NRR/GRR in investor or customer due-diligence process
Gross Revenue Retention (GRR) / churnNot publicly disclosedAll segmentsUnknownAudit for any lost government or enterprise accounts
Customer satisfaction (structured)No G2 / Gartner Peer Insights / Forrester data identifiedAll segmentsLowSolicit structured reviews; check AWS Marketplace ratings
Employee review (Glassdoor proxy for internal sentiment)3.1 / 5 on Glassdoor — below industry average; concerns about strategy instabilityInternal (proxy)MediumObtain formal customer NPS or CSAT scores; separate from employee sentiment
AWS Marketplace developer feedbackPositive informal reviews on inference speed cited in third-party writeupsDeveloper APILowObtain formal AWS Marketplace star rating and review count

No structured NRR, GRR, cohort retention, or CSAT data is publicly available for SambaNova. Retention evidence is inferred from multi-generation hardware expansions and multi-stage channel-partner deployments. Glassdoor rating is an employee metric and not a direct customer-satisfaction indicator.

[CU029, CU030]
FU003: Named Customer Proof Quality Matrix

Evidence quality assessment across nine named customers on four dimensions — bilateral announcement, production status, documented outcome, and retention visibility.

Matrix cells are qualitative assessments based on publicly available evidence as of May 2026. "Yes" = strong primary-source confirmation; "Partial" = some evidence but gaps remain; "Unknown" = no public data.

[CU020, CU023, CU027, CU029, CU030, CU031]

6.4 Customer Concentration Risk, Proof Quality, and Adverse Signals

Customer concentration risk is material and underinvestigated due to limited public disclosure. The publicly named customer list is dominated by U.S. government and academic institutions; if these account for the majority of hardware-system revenue, SambaNova carries significant concentration in a segment where procurement cycles are long and successor contracts require re-competition. The October 2025 report that SambaNova had failed to close a new funding round and was exploring a sale — described as occurring despite the company's strong technology — raised legitimate questions about commercial revenue diversification: if enterprise Fortune 500 and financial-services deployments were generating robust repeatable revenue, the funding crisis would likely have been averted. SambaNova's November 2024 layoff of 77 employees (approximately 15% of the workforce) to pivot from training to inference workloads indicates that the earlier customer base anchored on training use cases was not large enough to absorb the transition, suggesting that customer expansion into inference has required a reset. Proof quality varies substantially: national-lab deployments are confirmed by official DOE/NNSA press releases and named quotes from lab directors; SoftBank, OVHcloud, and Accenture deployments are confirmed by bilateral press releases with named executive quotes; the OTP Bank deployment is documented by CB Insights with a named executive quote; broader enterprise claims (Fortune 500, unnamed financial-services banks, multiple fortune-500 references in marketing) lack independent corroboration and should be treated as company-claimed rather than independently verified. No public NRR, GRR, or customer-cohort retention data has been disclosed for any segment.[CU024, CU025, CU026, CU027, CU028, CU029]

Expansion and Customer Concentration Risk
Driver or Risk FactorConcentration / Impact LevelEvidenceDiligence Path
Government / national-lab revenue dominanceHigh concentration risk if >50% of hardware revenue from DOE/NNSAAll publicly named hardware deployments are in government or academic labsRequest revenue breakdown by segment; estimate government share of hardware bookings
SoftBank as anchor APAC channel customerMedium; SoftBank is a founding investor and channel partner — dual relationshipMultiple press releases; first SN50 customer designationConfirm arm's-length commercial terms vs. investor relationship; test for captive revenue
Dependence on Carahsoft as government procurement channelMedium; Carahsoft is listed as a SambaNova customer in CB Insights and as government channelCarahsoft contracts page; CB Insights listingConfirm contract vehicle coverage (GSA, SEWP V, ITES); confirm SambaNova's listing status
Undisclosed enterprise pipelineUnknown; SambaNova claims Fortune 500 traction but no names are publicMarketing materials; canvasbusinessmodel.com secondary report (not primary)Obtain reference list of Fortune 500 accounts; NDA-permitted due diligence
Funding failure / sale exploration (adverse)High risk signal; implies commercial revenue insufficient to sustain operations at current burnThe Information (Oct 2025); webpronews.com; techstartups.comConfirm whether Series E ($350M, Feb 2026) resolved the liquidity concern; review post-close revenue trajectory

Concentration levels are qualitative assessments based on publicly observable customer mix. Actual revenue concentration is unknown. The adverse funding signal is from October 2025; the February 2026 Series E fundraise may have resolved the immediate liquidity risk but does not itself resolve the customer concentration question.

[CU026, CU027, CU028]
FU004: Customer Segment Distribution by Evidence Strength

Number of named, publicly evidenced customers per segment, illustrating the heavy skew toward government and research customers with independent documentary proof.

Counts reflect named customers with independent documentary evidence as of May 2026. Undisclosed enterprise accounts are excluded. The true customer count across all segments is unknown; SambaNova does not publicly report customer count.

[CU001, CU002, CU003, CU004, CU005]

6.5 Exhibits

Chapter 07

07Risks

7.1 Market & Competitive Risk

SambaNova operates in an AI chip landscape dominated by Nvidia, which held roughly 70–80% of datacenter GPU revenue in 2025. Nvidia's CUDA ecosystem—more than a decade of software investment and community lock-in—creates asymmetric switching costs: enterprises running PyTorch, NCCL, and RAPIDS on Nvidia hardware face significant re-engineering to adopt SambaNova's SambaFlow stack. This lock-in is structural rather than accidental; CUDA now encompasses compiler toolchains, profiling tools, pre-trained model registries, and MLOps integrations that no challenger has fully replicated. Groq secured a $1.5 billion sovereign AI partnership in 2024–25; Cerebras raised $1.1 billion at an $8.1 billion valuation. SambaNova claims the SN50 chip delivers "5× better performance per watt" versus a Nvidia B200 configuration but independent benchmarks remain unavailable. Forbes described the competitive dynamics as likely producing only one large winner among Groq, Cerebras, and SambaNova by 2026–2027, heightening existential stakes.

FR001: Risk heatmap
[CR001, CR008, CR021, CR028, CR032]

7.2 Operational, Supply Chain & Technical Risk

SambaNova's hardware roadmap depends entirely on TSMC for silicon manufacturing. TSMC's dominant market position means that any geopolitical disruption—most critically a Taiwan Strait conflict or a U.S.-China escalation affecting TSMC's access to ASML EUV equipment—could halt SambaNova's chip supply with no qualified fallback. SambaNova packages its RDU chips using HBM2E, sourcing from a concentrated set of DRAM suppliers. The company's stated mitigation is to evaluate Intel's foundry ecosystem as a secondary fab, but Intel Foundry Services has not demonstrated volume production of AI chip designs at competitive yields for any external customer. SN50 chip tape-out and volume ramp timelines have not been publicly disclosed; given typical TSMC advanced-node tape-out-to-volume schedules of 12–18 months, any delay would compress the competitive window against Nvidia's B200 and B300 successor roadmap. SambaFlow software stack reliability and AI-specific security certification also remain unverified by independent third parties.

Operational / quality / security risk register
Failure modeLikelihoodSeverityMitigation maturityResidual exposureUnresolved gap
TSMC single-source manufacturing haltMediumCriticalNone confirmed (no secondary fab)Total chip supply disruption 6–24 monthsQR012: secondary foundry unconfirmed
SN50 tape-out delay or yield failureMediumHighNone disclosed; typical ramp 12–18 monthsCompetitive window narrows vs. B200/B300QR004: delivery timeline not disclosed
HBM memory supply disruptionLowHighHBM2E selected over HBM3 (broader supply)Still concentrated: SK Hynix/Samsung/MicronDual-sourcing strategy not confirmed
SambaFlow software stack regression or incompatibilityMediumMediumCompany responsibility; not externally auditedCustomer inference pipeline interruptionNo third-party security audit disclosed
AI security adversarial attack on inference platformLowMediumNo public security certification disclosedGovernment lab inference integrity riskNo FedRAMP or NIST AI RMF certification confirmed
Intel foundry IP leakage post-acquisition diligenceLowHighNo public IP protection agreement disclosedCompetitive damage if RDU architecture sharedIntel due diligence scope not confirmed

Compiled from analyst reports, press coverage, and technical analyses. SN50 delivery and yield data are based on typical TSMC advanced-node timeline estimates; company has not publicly disclosed tape-out schedule or yield metrics.

7.3 Partner, Dependency & Customer Concentration Risk

SambaNova's primary disclosed customers—Argonne National Laboratory, Lawrence Livermore, Lawrence Berkeley, Oak Ridge, and Los Alamos National Laboratories—are all part of the DOE/NNSA system, creating a 60–80% estimated government revenue concentration. This extreme dependence means a single federal budget event or FOCI adverse determination could simultaneously affect multiple customer accounts. SoftBank is the most significant disclosed commercial customer, but contract terms and revenue volume were not disclosed, limiting confidence in commercial diversification claims. TSMC and HBM suppliers constitute structural manufacturing dependencies with no confirmed mitigation, while Vista Equity Partners provides the primary capital relationship as Series E lead. A DOE budget sequestration, FOCI adverse determination, or SoftBank contract termination could each individually cause a material revenue shock without near-term commercial offset.

Partner / dependency risk register
DependencyCounterpartyRoleConcentrationFailure scenarioSeverityResidual exposure
Silicon manufacturingTSMCSole foundry (N3/N4 node)Single-sourceTaiwan Strait disruption; TSMC capacity rationingCriticalNo qualified backup
HBM memory (HBM2E)SK Hynix / Samsung / MicronMemory stacking / packagingConcentrated (3 suppliers)DRAM supply shock; ASML restrictionHighHBM2E broader than HBM3 but still concentrated
Primary government revenueDOE / NNSA national labs (5 labs)Primary customer (60–80% est. revenue)High concentrationSequestration; FOCI adverse determinationHighSoftBank partial offset only
Commercial anchor customerSoftBank GroupAnchor commercial customer for SN50Medium concentrationSoftBank contract non-renewal or scope reductionMediumContract terms and size undisclosed
Capital providerVista Equity Partners (Series E lead)Lead investor / board relationshipModerateVista declining participation in future roundsMediumNo other disclosed lead for Series F
Potential secondary foundryIntel Foundry ServicesEvaluating as backup manufacturing partnerNone (evaluation stage only)Intel Foundry unable to reach competitive yieldsMediumLip-Bu Tan conflict complicates governance

Sourced from public press releases, official lab announcements, and regulatory disclosures. Contract values and terms for government customers are not publicly available. Foundry partnership terms with Intel are confidential.

FR003: Dependency map
[CR025, CR026, CR035, CR039]

7.4 Financial, Capital & Governance Risk

SambaNova's February 2026 Series E valued the company at approximately $2.4 billion—a 53% nominal markdown from the $5.1 billion peak Series D valuation in 2021. AI chip hardware startups are capital-intensive: SVB's 2025 research estimated median burn multiples of 2.0–3.0× for hardware-stage AI companies. SambaNova's total capital raised is approximately $1.49 billion with no publicly disclosed revenue. Governance risk is elevated by Intel CEO Lip-Bu Tan's dual role as a SambaNova board member, creating fiduciary conflicts in Intel foundry decisions and potential Intel acquisition discussions. If a subsequent round requires a down-round, liquidation preference structures could eliminate common equity for employees hired during the 2019–2022 peak era.

Regulatory / legal risk register
Risk IDRisk CategoryDescriptionStatusPrimary Evidence
RG-1Export ControlsBIS May 2026 rule: SN50 likely subject to TPP license threshold; restricts Gulf/SEA salesActive / materialSR001, SR002
RG-2FOCI ReviewNon-U.S. Series E investors may trigger DCSA FOCI review; WSGR NS&T practice engagedActive / unresolvedSR003, SR028
RG-3WARN ActCalifornia WARN Act notice filed for ~150-employee reduction in 2024–25Closed / resolvedSR004, SR013
RG-4Governance ConflictIntel CEO Lip-Bu Tan on SambaNova board; foundry and acquisition conflict unresolvedActive / materialSR006, SR007
RG-5IP OwnershipStanford co-founders; IP assignment and licensing terms not publicly confirmedOpen / unverifiedSR029
RG-6Down-round DilutionSeries E at $2.4B (–53% from 2021 peak); further down-round risk if SN50 missesActive / materialSR028, SR018
RG-7BIS License ComplianceNo confirmed BIS license applications for Gulf/SEA SN50 sales as of May 2026Open / monitoringSR001, SR002
RG-8LitigationNo active litigation or SEC enforcement actions confirmed in public records (May 2026)Low / unconfirmedSR029, SR018

Partial coverage: compiled from public legal analyses (Finnegan, Mayer Brown), regulatory filings (WARNScan), Wilson Sonsini engagement disclosures, and SEC Form D filings. Formal DCSA FOCI determination, Intel partnership IP terms, Stanford IP licensing, and litigation history are not publicly confirmed and are noted as open gaps.

[CR025, CR027, CR028, CR032, CR035, CR046]
FR002: Risk transmission map
[CR008, CR015, CR021, CR027, CR031]

7.5 People, Execution & Regulatory Risk

A workforce reduction of approximately 150 employees (roughly 15–20% of headcount) in 2024–2025 targeted engineering, sales, and operations roles, and a WARN Act notice filing confirms the reduction reached the statutory threshold. Key-person risk is significant: CEO Rodrigo Liang, CTO Kunle Olukotun, and Chief Scientist Christopher Ré are the public faces of technical differentiation. Ré's departure to an academic role or competitor would remove a visible signal of research credibility. In May 2026, BIS published revised export control rules targeting advanced AI chips; SambaNova's SN50 is likely subject to these controls, restricting sales to certain non-allied geographies. Wilson Sonsini's engagement of its national security and technology practice for the Series E closing signals FOCI compliance requirements.

People / execution risk register
Risk itemPerson / entityNature of conflict or exposurePublic confirmationSeverity
Board conflict – Intel CEOLip-Bu TanSits on SambaNova board; leads Intel (foundry + potential acquirer)SR006, SR007High
Key-person – CEORodrigo LiangSole public face; no disclosed succession planSR005, SR014High
Key-person – CTO / Chief ScientistOlukotun / RéStanford faculty inventors; departure = IP signal + customer confidence riskSR005High
WARN Act layoff~150 employees (2024–25)Engineering / sales / ops reduction confirmed via WARNScanSR004, SR013Medium
Intel IP exposureIntel CorporationIntel engineers reviewed RDU IP during failed acquisition due diligenceSR016Medium

Compiled from public news reporting and regulatory filings. Board composition and recusal policies are not publicly filed for private companies.

Mitigation and kill criteria table
RiskMonitorable triggerThreshold or eventAction implication
SN50 delivery slipVolume shipment delayed >6 months from announced timelineSambaNova press releases; customer confirmationReassess competitive moat; escalate to investment committee
Down-round Series FNew raise at post-money below $2.4BSEC Form D; press reportingReview liquidation preference stack; model common equity waterfall
DOE budget cutDOE AI infrastructure appropriation reduced >20% YoYFederal Register; DOE budget justificationsModel revenue impact; assess commercial diversification pace
Board governance crisisLip-Bu Tan recusal scope challenged or formally expandedSambaNova press releases; Intel disclosuresAssess foundry and M&A optionality impact
FOCI adverse determinationDCSA issues adverse finding; classified contract suspensionDCSA public notices; contract award databasesImmediate reassessment of government revenue at risk

Sentinel thresholds compiled by analyst inference from comparable hardware startup patterns; not company-disclosed. Monitoring recommended quarterly.

Chapter 08

08Valuation

8.1 Current Valuation Context and Series E Financing

SambaNova's February 24, 2026 Series E round raised more than $350M, led by Vista Equity Partners and Cambium Capital. The round was structured in at least two tranches: Series E-1 at $30.99 per share and Series E-2 at $21.70 per share, totalling approximately $307.13M and $42.87M respectively per Yahoo Finance/Forge data. The E-1 pricing implies a post-money valuation of approximately $2.34B; however, some third-party sources including Tracxn and user-context data cite a $4.8B post-money figure — a significant discrepancy that the company has not publicly resolved. The $2.34B (E-1) primary price represents a 54% decline from the 2021 Series D peak of $5.11B, meeting the standard definition of a down round. Before the Series E closed, BlackRock had already marked its SambaNova stake down by approximately 17% per Caplight analysis, implying an effective value of ~$2.4B. Intel was simultaneously in advanced acquisition talks at approximately $1.6B including debt — a price that would have wiped out most equity value held by 2021 Series D investors. As of May 26, 2026, Forge's secondary market estimate places SambaNova's implied valuation at $1.44B, with shares trading at $19.05 — 39% below the Series E-1 primary price. This persistent secondary discount signals skepticism among market participants about execution relative to the primary-round price. Despite this, the 3-month Forge return tracker shows SambaNova up 38.53% vs. the index's 9.62%, indicating secondary sentiment has improved materially since the Series E closed in February 2026. CEO Rodrigo Liang described the round as "grossly, grossly oversubscribed," and Intel Capital committed approximately $100–$150M strategically alongside Vista. The dual-tranche structure (E-1 at $30.99, E-2 at $21.70) creates a blended all-in cost below the E-1 headline, with the tiered economics suggesting preferred price protection for senior investors. This capital structure complexity, combined with the deep preference overhang from the $5.11B Series D, creates material risk for common equity and later-series investors at or below the 2021 watermark.[CV001, CV002, CV003, CV004, CV005, CV006]

Recommendation Summary
DimensionAssessmentBasisImplication
RecommendationResearch-moreEvidence is incomplete for high-confidence buy/avoidSeek audited ARR, NRR, margin data before committing
ConfidenceMediumPrivate company; competing valuation figures; no audited financialsTreat all multiples as estimates
Risk ratingHighCapital intensity, down-round history, competitive pressure, governanceRequires mitigants before large position
Valuation stanceStretchedE-1 at ~23x mid-2025 ARR; secondary market at ~8x implies execution skepticismEntry discipline critical; secondary entry preferable
Valuation (primary, USD B)$2.34B (E-1)Yahoo Finance/Forge primary round data; $4.8B aggregator figure unverifiedAnchor to $2.34B for modelling; note $4.8B risk
Valuation (secondary, USD B)$1.44B (Forge May 2026)39% below E-1; 80% below 2021 peakSecondary offers lower risk entry if liquidity acceptable

Valuation figures are estimates from secondary market data (Forge/Yahoo Finance) and third-party aggregators. The $4.8B figure cited in some sources conflicts with share-price-derived estimates of $2.24B–$2.34B; neither is audited. Use $2.34B (E-1 primary) as conservative baseline.

[CV002, CV003, CV004, CV005, CV006]
FV003: Valuation / Return Range — Bull, Base, Bear by Entry Point

Implied 2028 valuation range under three scenarios, anchored to the Series E-1 entry price of ~$2.34B. Secondary-market entry at ~$1.44B is shown as a separate reference band.

All scenario valuations are analyst estimates based on comparable multiples and projected ARR; they carry high uncertainty given private-company opacity. No probability weights are assigned. Returns are pre-dilution and do not account for preference waterfall or anti-dilution provisions.

[CV012, CV013, CV025, CV028, CV039]

8.2 Comparable Set — Public and Private AI Infrastructure

SambaNova's closest private analogues are Groq and Cerebras, both Nvidia challengers in the AI inference/accelerator space. Groq raised $750M in September 2025 at a $6.9B post-money valuation, having previously closed at $2.8B in August 2024 — more than doubling in roughly a year. With estimated 2024 revenue around $90M, Groq's $6.9B valuation implies approximately 76x trailing revenue, a premium justified by investor optimism over its LPU architecture and rapidly expanding developer ecosystem. Cerebras has diverged even more starkly. The company raised $1B in a Series H in February 2026 at a $23B valuation, filed to go public on Nasdaq at a range of $115–$125 per share targeting $26.6B, and reported $510M in revenue and $87.9M in GAAP net income for 2025. Cerebras's $24.6B in remaining performance obligations (backlog) provides revenue visibility that SambaNova cannot currently match. At its IPO valuation, Cerebras trades at approximately 45–52x 2025 revenue. Among public companies, Nvidia's fiscal 2026 revenue of $215.9B against a ~$4.6T market cap implies a price-to-sales ratio of approximately 21x. AMD, at $34.6B in 2025 revenue and ~$397B market cap, trades at approximately 11.5x revenue. These public multiples serve as a gravity anchor: at scale, AI chip companies are valued at 10–25x revenue. The premium for private pre-revenue-scale peers like Groq and Cerebras reflects hypergrowth expectations, scarcity, and strategic optionality. Finro's Q1 2026 dataset across 575 AI companies finds a median EV/Revenue multiple of approximately 21.2x for AI infrastructure, with an average of 31.3x and a 25th–75th percentile range of 10.2x–39.6x. At the Series E-1 implied $2.34B and ~$100M mid-2025 ARR, SambaNova's ~23x multiple sits at median — reasonable in isolation but pressured by Cerebras's superior revenue scale, profitability, and backlog, and by Groq's faster valuation accretion. Capital efficiency is the sharpest differentiator. SambaNova's valuation/total-raised ratio is approximately 1.6x ($2.34B / $1.49B), versus Groq at ~9.2x ($6.9B / $750M raised) and Cerebras at approximately 14x ($23B / ~$1.6B raised). SambaNova has consumed far more capital per dollar of current valuation, a structural concern that makes the bull case dependent on a step-change in revenue efficiency.[CV015, CV016, CV017, CV018, CV019, CV020]

Comparable Valuation Table
ComparableTypeValuation (USD)Revenue / ARREV/Revenue MultipleCapital RaisedCapital Efficiency (Val/Raised)Relevance to SambaNovaLimitation
Groq (Sept 2025)Private round$6.9B~$90M (est. 2024)~76x$750M (latest round)9.2xDirect inference-first hardware peer; LPU architecture; developer-first positioningRevenue unaudited; rapidly growing but earlier stage than SambaNova
Cerebras (Feb 2026 Series H)Private round$23B$510M (2025)~45x~$1.6B total~14xWafer-scale engine, IPO-bound; $24.6B backlog; GAAP profitable; direct AI chip rivalMaterially larger revenue base; OpenAI anchor contract provides atypical revenue visibility
Cerebras (IPO target, May 2026)IPO filing$22–26.6B$510M (2025)~45–52xN/A (public offer)N/APublic market price discovery for AI inference silicon; establishes comp for exitIPO market conditions and large secondary supply may compress multiples post-listing
Nvidia (public, FY2026)Public company~$4.6T$215.9B revenue~21xN/A (public)N/AAI chip market benchmark; dominant GPU/CUDA ecosystem; highest-margin playerScale and ecosystem moat not comparable to early-stage private; acts as multiple ceiling for infrastructure segment
AMD (public, 2025)Public company~$397B$34.6B revenue~11.5xN/A (public)N/AChallenger semiconductor company; data center AI revenue $16.6BDiversified business; AI chip revenue is a subset; lower-growth premium than pure-play inference startups
AI infrastructure median (Finro Q1 2026)Sector medianN/AN/A21.2x (median)N/AN/ASector-wide valuation floor for benchmarking SambaNova's primary-round multipleBroad basket; includes company types and stages less comparable to a late-stage private hardware company
SambaNova (Series E-1 primary, Feb 2026)Primary round~$2.34B~$100M ARR (mid-2025)~23x$1.49B total1.6xSubject company primary referenceMultiple based on estimated unaudited ARR; $4.8B figure from some aggregators unresolved
SambaNova (Forge secondary, May 2026)Secondary market~$1.44B~$180M ARR (est.)~8xN/AN/AMost current market price signal; reflects execution risk discountThin secondary volume; Forge Price is derived data, not a binding bid/ask

Revenue and ARR figures for private companies (Groq, SambaNova) are estimated from third-party data aggregators and are not audited. Multiples are computed from cited valuation and revenue estimates. Capital efficiency ratio = implied valuation / total capital raised. Sector median from Finro Financial Consulting Q1 2026 dataset of 575 AI companies; may include non-hardware companies.

[CV015, CV016, CV017, CV018, CV019, CV020]
FV002: Valuation Sensitivity — EV/ARR at Varying Revenue and Valuation Scenarios

EV/ARR multiples under five valuation scenarios, illustrating the gap between primary-round pricing, secondary pricing, and comparable public and private benchmarks.

SambaNova ARR figures are estimated from third-party sources; not audited. Groq revenue is analyst estimate. Cerebras revenue is from its IPO filing. Nvidia and AMD from public financial disclosures. Finro sector median from Q1 2026 dataset.

[CV012, CV013, CV015, CV016, CV017, CV019]

8.3 Revenue Multiples, Capital Structure, and Valuation Range

The valuation picture for SambaNova is complicated by three overlapping data layers. First, primary-round data: the Series E-1 at $30.99/share implies ~$2.34B. Second, aggregator data: sources including Tracxn and user-context cite $4.8B post-money, which at ~$100M mid-2025 ARR implies ~48x EV/ARR — well above sector median and inconsistent with the share price evidence. Third, secondary market data: Forge's $1.44B estimate (May 26, 2026) implies ~8x ARR at $180M — below sector median and reflecting execution risk discounting. Using the more supportable E-1 primary price of $2.34B: against $100M mid-2025 ARR the multiple is ~23x; against $180M early-2026 ARR it drops to ~13x. This range brackets fair value for a high-growth AI infrastructure company with revenue momentum but persistent opacity. The more conservative $13x reflects improving ARR trajectory and makes the E-1 price look more defensible, contingent on the $180M figure being auditable. The capital structure introduces additional complexity. The preference stack from the 2021 Series D ($677M at $5.11B) means common equity and employees are deeply out of the money at all valuations below $5.11B. Series E investors at $30.99/share (E-1) are at a 67% discount to the Series D price ($95.02), giving them a structural preference advantage over earlier tranches. However, the E-2 at $21.70/share signals a blended cost of capital to the new cohort that is lower than E-1 headline, and there may be anti-dilution protections or ratchets that further compress effective returns for earlier investors. A Lincoln Variable Insurance Products Trust filing as of June 30, 2025 reported holding SambaNova shares at $44.47/share — 43% above the subsequent E-1 issue price — indicating that institutional mark-to-market sentiment deteriorated materially between mid-2025 and the Series E close. This is consistent with the late-2025 acquisition process and the Caplight/BlackRock markdown.[CV010, CV011, CV012, CV013, CV014, CV033]

Thesis and Anti-Thesis
ArgumentSupporting EvidenceCounter-Evidence / RiskWhat Would Change the View
SN50 architecture delivers superior inference efficiency vs. GPUsCompany-claimed 5x compute, 3x lower TCO; SoftBank as first SN50 customerNo independent benchmark confirmation; production ramp not startedIndependent third-party benchmarks at production scale with major enterprise clients
ARR growth trajectory is exceptional and accelerating4x ARR growth in 2024; $100M ARR mid-2025; $180M+ early 2026 estimateRevenue is unaudited; mix between hardware (lumpy) and cloud (recurring) undisclosedAudited revenue with ARR/NRR breakdown showing >100% net retention
Intel partnership provides distribution moat and credibilityIntel Capital $100–150M; co-marketing, co-selling, CPU+SN50 reference architecturesIntel has conflicted governance role; partnership may create dependency riskIntel channel sales converting $50M+ incremental ARR annually by end-2026
Down-round and failed M&A signal financing stress, not product failureSeries E oversubscribed; Vista, QIA, Saudi First Data, T. Rowe Price joinedSale process and $1.6B acquisition talks suggest management flagged survival riskComparable SambaNova-scale ARR companies achieving funding at $3B+ without distress
Secondary discount creates attractive entry vs. primaryForge price $1.44B (~8x ARR) is below AI infrastructure median of ~21xIlliquid; governance opacity; no registered offering path disclosedAnnounced IPO plan or secondary liquidity program with timeline commitment

Thesis and anti-thesis items are supported by evidence cited in claims. Counter-evidence items represent risks that cannot be resolved from public sources and require direct diligence with company.

[CV005, CV007, CV009, CV010, CV011, CV026]
Bull / Base / Bear Scenario Analysis
ScenarioARR Assumption (2028)Revenue MultipleImplied ValuationReturn vs. E-1 ($2.34B)Key Risk / Assumption
Bull$500M+ ARR (80%+ CAGR)20x~$10B~4.3xSN50 ramp on schedule; Intel channel adds $100M+; sovereign AI 3+ new wins; Cerebras IPO raises AI infra multiples
Base$380M ARR (50% CAGR)15x~$5.7B~2.4xSteady SN50 ramp; Intel co-sell converts; 2–3 sovereign deals; market multiples remain ~15x for private pre-IPO
Bear$180M ARR (plateau / stall)8x~$1.4B~0.6x (loss)SN50 delays; Nvidia CUDA lock-in prevents enterprise conversion; Intel partnership delivers minimal revenue; secondary marks prevail

All scenarios are estimated based on observed ARR trajectory and sector comparable multiples. No audited financials are available; estimates carry high uncertainty. E-1 entry price of $30.99/share implies ~$2.34B valuation baseline. Probability weights are not assigned; probability-weight estimates would require audited ARR, NRR, and margin data.

[CV010, CV011, CV012, CV013, CV021, CV022]

8.4 Investment Thesis — Bull, Base, and Bear Cases

The bull case rests on three pillars: (1) the SN50's claimed 5x performance improvement over the SN40 and 3x cost advantage vs. GPUs enables SambaNova to capture a meaningful slice of the $200B+ AI accelerator TAM from Nvidia; (2) ARR growing 4x in 2024 and likely to surpass $180M in 2026 implies a trajectory toward $500M+ ARR by 2028 on a similar growth curve; (3) the Intel strategic partnership provides distribution, manufacturing scale, and co-marketing reach that could collapse go-to-market costs. If SambaNova achieves $500M ARR by 2028 and trades at 20x revenue, the exit valuation is $10B — a 4x return on Series E-1 investment. The base case assumes SambaNova grows ARR at 50–70% annually, reaching $400M by 2028. At 15x revenue the valuation is $6B, implying a 2.5x return on E-1 pricing. This requires successful SN50 ramp, continued sovereign AI contract wins, and execution on the Intel partnership — achievable but not guaranteed. The bear case assumes Nvidia deepens its software-hardware moat, customer pilots fail to convert at scale, and the SN50 faces manufacturing delays. Revenue plateaus at ~$150–200M ARR with no path to profitability. At 8–10x revenue (secondary market discount), the company is worth $1.2–$2B — implying breakeven to moderate loss for Series E-1 investors. The failed 2025 sale process, deep capital consumption, and Cerebras's superior execution profile all support this risk scenario. The key swing factors: SN50 deployment velocity (the Intel deal is meant to accelerate this), SoftBank Japan ramp (the first disclosed SN50 customer), and whether SambaNova can convert its sovereign AI pipeline into recurring cloud subscriptions at sufficient scale to demonstrate net revenue retention above 100%.[CV033, CV034, CV015, CV016, CV036, CV026]

FV001: Recommendation Logic Flow

Chain from SambaNova's key signals — revenue growth, valuation evidence, competitive position, and risk profile — to a research-more recommendation with stretched stance.

Flow nodes represent summary judgments derived from multiple claims; individual claim confidence varies and is documented in the localEvidence section.

[CV001, CV002, CV003, CV010, CV021, CV025]
FV004: Investment KPIs — IC-Ready Scorecard

Scoring across seven investment committee dimensions for SambaNova as of May 2026, reflecting evidence quality and valuation analysis.

Scores are qualitative assessments by the research analyst based on available evidence; they are not quantitative models. Dimensions follow the IC scoring framework used in this report series.

[CV010, CV011, CV012, CV021, CV024, CV025]

8.5 Recommendation, Stance, and Final Diligence Asks

The overall recommendation is research-more with a stretched valuation stance. SambaNova has demonstrated impressive ARR growth momentum, secured a credible strategic partner in Intel, and launched a technically differentiated next-generation chip. However, the evidence base for a confident investment decision at the Series E-1 price ($30.99/share, ~$2.34B implied) is incomplete: gross margins, NRR, audited revenue, preference overhang, and SN50 production ramp remain undisclosed or unverifiable from public sources. The valuation context is adverse in several respects. The secondary market is pricing shares at a 39% discount to the E-1 issue price. The company required a 54% down-round from 2021 and explored a distressed sale at a further 60% discount. The $4.8B aggregator-cited figure remains inconsistent with primary share price data and should be treated as unverified until SambaNova discloses official round documents. The preference stack from the $5.11B Series D creates overhang that will suppress common equity returns and limit employee retention incentives. Risk rating is high, driven by capital intensity, competitive pressure from Cerebras and Nvidia, governance concerns (Lip-Bu Tan's dual role as Intel CEO and SambaNova Chairman), and the gap between primary-round and secondary-market pricing. Confidence is medium, limited by private-company opacity and the absence of audited financials. For investors already at the table (Series E participants): the priority diligence asks are audited ARR composition and NRR, SN50 production schedule and committed orders, preference waterfall analysis, and gross margin by revenue stream. For those evaluating secondary entry at the Forge price (~$1.44B, ~8x ARR): the discount to primary provides a more attractive entry point that partially compensates for opacity, but the illiquidity and governance risks persist.[CV002, CV003, CV005, CV006, CV007, CV025]

Thesis-Break and Kill Criteria
TriggerThreshold / EventTransmission to ThesisAction Implication
SN50 production delaySN50 not shipping at volume by Q4 2026 per original scheduleBull case revenue ramp collapses; Intel partnership value erodes; Cerebras gap widens furtherReduce position; reevaluate if delay exceeds 6 months
ARR growth stallsARR growth below 30% YoY in 2026 (i.e. ARR < $240M by end-2026)Base case becomes bear case; multiple compression on slower growth; liquidity risk risesExit or materially reduce exposure
Secondary-primary divergence widensForge secondary price falls below $10/share (~$0.75B implied)Market implies bear-case scenario is pricing-in; preference waterfall underwater for E-1 holdersImmediate diligence escalation; consider secondary exit if available
Governance adverse eventLip-Bu Tan exits as Intel CEO or SambaNova Chairman; legal challenge to SambaNova-Intel transactionIntel partnership at risk; strategic funding rationale weakens; reputational signal to other investorsPause new capital deployment; assess partnership continuity
Competitive displacementMajor hyperscaler or Nvidia launches inference product that matches SN50 TCO claim within 12 monthsCore product differentiation erodes; pricing pressure increases across cloud API and enterprise salesReassess product moat; if addressable cost gap closes, move recommendation to avoid

Triggers and thresholds are indicative. Monitoring requires access to private company reporting, which is not publicly available. Investors should request contractual information rights as a condition of participation.

[CV007, CV025, CV026, CV033, CV039]
Final Diligence Asks
TopicMissing EvidenceWhy It MattersOwner / Diligence Path
Audited ARR and revenue mixAudited breakdown of cloud API vs. hardware vs. services ARR; NRR by cohortValuation depends entirely on ARR quality; without audit, $100M–$180M range is unverifiable and NRR could mask churnRequest from management; third-party revenue audit vendor
Official Series E post-money valuationCompany disclosure of official post-money valuation and share countResolves conflict between $2.34B (primary price) and $4.8B (aggregator); affects multiple calculation and preference waterfallSambaNova investor relations; legal counsel (WSGR) transaction documents
Preference waterfall and cap tableFull cap table with liquidation preference stack by series and seniorityAt sub-$5.1B exits, common equity is worthless; understanding Series D preferences determines E-1 investor recovery in M&A/distress scenariosRequest from management; secondary market data providers (Forge, EquityZen)
Gross margin by revenue streamHardware gross margin, cloud API gross margin, services gross marginBlended margin drives path to profitability; hardware margins may be negative at current scale, compressing total return potentialRequest from management; comparable hardware company benchmarks (Cerebras S-1 as reference)
SN50 customer pipeline and committed ordersNamed pipeline, committed orders, and revenue recognition schedule for SN50Bull case depends on SN50 ramp; without committed orders the timeline is speculativeSales pipeline review; reference check with Intel co-sell pipeline
Intel partnership terms and exclusivityCommercial terms, exclusivity clauses, and revenue-sharing structure of Intel partnershipPartnership is a key value driver; if Intel can exit or redirect, strategic value erodes rapidlyLegal review of partnership agreement; Intel IR for disclosure

All six diligence items are private-evidence-only gaps that cannot be resolved from public sources. Each is material to the investment recommendation and valuation stance.

[CV002, CV005, CV007, CV012, CV013, CV039]

8.6 Exhibits

Disclaimer

This diligence report was produced by an AI research agent using public sources available as of 2026-05-27. It is not investment advice. SambaNova is a private company with incomplete public financial and governance disclosure, so any investment decision should be validated against management, customer, and investor materials.

Evidence index

Claims
IDStatementConfidenceSources
CO001 SambaNova Systems is headquartered in Palo Alto, California, and was incorporated in 2017. High SO001, SO010, SO021
CO002 SambaNova Systems was co-founded by Rodrigo Liang, Kunle Olukotun, and Christopher Ré. High SO001, SO021, SO022
CO003 Rodrigo Liang (CEO) previously served as Senior Vice President at Oracle (formerly Sun Microsystems), where he led approximately 1,000 chip designers across 12 major processor generations. Medium SO021, SO016
CO004 Kunle Olukotun (CTO / Chief Technologist) is a Professor of Electrical Engineering and Computer Science at Stanford University and is renowned for pioneering multicore processor design, including the Niagara chip. Medium SO001, SO015, SO022
CO005 Christopher Ré (co-founder) is an Associate Professor of Computer Science at Stanford University and a machine learning systems expert; he co-founded Snorkel AI. Medium SO015, SO020
CO006 Lip-Bu Tan, CEO of Intel, has served as Executive Chairman of SambaNova since the company was founded in 2017, having first invested through his venture firm Walden International. Medium SO009, SO012, SO022
CO007 Walden International, the venture firm associated with Lip-Bu Tan, co-led SambaNova's $56 million Series A in March 2018 alongside GV. Medium SO021, SO012
CO008 Intel CEO Lip-Bu Tan's dual role as Intel CEO and SambaNova Executive Chairman created significant conflict-of-interest scrutiny during Intel's acquisition discussions with SambaNova in late 2025. Medium SO008, SO009, SO012, SO022
CO009 Intel CEO Lip-Bu Tan recused himself from the Intel-SambaNova deal discussions in late 2025 and early 2026; Intel's EVP and General Manager of Data Center Group Kevork Kechichian served as executive sponsor for the collaboration. Medium SO008, SO009
CO010 SambaNova's core chip is the Reconfigurable Dataflow Unit (RDU), an AI inference chip that uses a dataflow architecture to execute AI workloads while minimizing data movement compared to traditional GPU architectures. Medium SO003, SO007, SO014
CO011 SambaNova announced its fifth-generation RDU chip, the SN50, in February 2026 alongside the Series E close; SN50 is scheduled to ship to customers in the second half of 2026. High SO026, SO008, SO009
CO012 The SN50 delivers 2.5× more FP16 compute per chip than the SN40L and supports FP8 and FP4 precision; SambaNova claims it runs 5× faster than competitive chips. Medium SO026, SO008, SO019
CO013 The SN50 interconnect enables up to 256 chips to share the same memory space via a multi-terabyte-per-second proprietary Ethernet-based protocol, compared to 16 chips for the SN40L. Medium SO003, SO008, SO026
CO014 SambaNova launched the SN40L chip in September 2023; it features a three-tier memory hierarchy (64 GB HBM3 hot cache, up to 1.5 TB DDR5 DRAM, 520 MB SRAM), manufactured on TSMC 5nm. Medium SO014, SO024
CO015 The SN40L can serve models with up to 5 trillion parameters (as a mixture-of-experts model using Llama-2 as a router) from a single 8-socket SambaNova system. Medium SO014
CO016 SambaNova offers three main product lines: SambaRack (rack-scale hardware systems), SambaNova Cloud / SambaCloud (cloud inference API), and SambaManaged / SambaStack (on-premises or hybrid AI platform). Medium SO007, SO024, SO012
CO017 SambaNova raised a $56 million Series A in March 2018, led by GV (Google Ventures) and Walden International, with Atlantic Bridge Ventures and Redline Capital also participating. High SO021, SO015
CO018 In 2020, SambaNova raised approximately $250 million from BlackRock, Intel Capital, and GV, bringing the company's implied valuation to approximately $2.5 billion. Medium SO022, SO015, SO023
CO019 SambaNova raised a $676 million Series D in April 2021, led by SoftBank Vision Fund 2, at a post-money valuation of $5.1 billion. High SO025, SO015, SO009
CO020 Series D investors included SoftBank Vision Fund 2 (lead), Temasek, GIC, BlackRock, Intel Capital, GV, Walden International, and WRVI Capital. Medium SO025, SO015
CO021 SambaNova closed a $350 million Series E financing round in February 2026, led by Vista Equity Partners and Cambium Capital. High SO026, SO009, SO008
CO022 Additional Series E investors include Intel Capital, Qatar Investment Authority (QIA), GV, Battery Ventures, T. Rowe Price Associates, Seligman Ventures, Assam Ventures, and sovereign investors from Saudi Arabia. Medium SO009, SO008, SO017
CO023 SambaNova did not disclose a post-money valuation in its February 2026 Series E round; multiple reports indicate it is below the $5.1 billion Series D peak. Medium SO017, SO009, SO015
CO024 SambaNova's total capital raised across all publicly reported rounds is approximately $1.49–$1.5 billion as of the February 2026 Series E close. Medium SO015, SO023, SO017
CO025 SambaNova had approximately 500 employees before laying off approximately 77 people (roughly 15% of its workforce) in April 2025; post-layoff headcount is estimated at approximately 400–425. Medium SO013, SO012
CO026 The April 2025 layoffs were driven by SambaNova's strategic pivot away from model training workloads toward cloud-first AI inference services. Medium SO013, SO024
CO027 BlackRock marked down its SambaNova holdings by approximately 17% in 2024–2025, implying a company valuation of approximately $2.4 billion. Medium SO010, SO022
CO028 SambaNova began exploring a potential sale in late 2024 and hired an investment bank to manage the process after a fundraising round stalled. High SO010, SO011, SO028
CO029 Intel reportedly considered acquiring SambaNova for approximately $1.6 billion in late 2025, a price well below the $5.1 billion 2021 Series D valuation. Medium SO009, SO012, SO022
CO030 Intel and SambaNova signed a non-binding term sheet for an acquisition in late 2025, but the deal did not progress to a definitive agreement. Medium SO020, SO022, SO012
CO031 SambaNova abandoned the Intel acquisition in early 2026 and chose to remain independent by raising the Series E round. High SO008, SO009, SO011
CO032 CEO Rodrigo Liang described the Series E as 'grossly, grossly oversubscribed,' indicating strong investor demand despite the undisclosed valuation. Medium SO008
CO033 SambaNova's customers include Hugging Face, Meta, and major AI labs, per CNBC reporting based on a February 2026 interview with CEO Rodrigo Liang. Medium SO009
CO034 SoftBank Corp. will be the first customer to deploy SambaNova's SN50 chip in next-generation AI data centers in Japan. Medium SO026, SO009
CO035 SambaNova has announced sovereign AI partnerships in Germany, the United Kingdom, Australia, Japan, and France as of early 2026. Medium SO008, SO027
CO036 SambaNova and Intel entered a multi-year strategic collaboration in February 2026 for SambaNova to adopt Intel server chips and Intel to make a strategic investment as part of the Series E. High SO026, SO009, SO008
CO037 SambaNova launched SambaNova Cloud (SambaCloud) in September 2024 as a cloud-based AI inference service using the SN40L chip, providing token-level API access to open-source LLMs. Medium SO010, SO024
CO038 An enterprise financial services firm signed a multi-million-dollar contract with SambaNova within 40 days of first contact in 2023, illustrating enterprise sales velocity. Medium SO014
CO039 SambaNova's RDU chips achieve approximately 90% HBM memory bandwidth utilization for AI inference workloads, compared to lower utilization rates typical for Nvidia GPUs due to kernel launch overhead. Low SO008
CO040 The SN40L was manufactured on TSMC's 5nm process node and features 1,040 compute cores, compared to prior SambaNova generations on 7nm. Medium SO014
CO041 The Series E proceeds are intended by SambaNova to fund expansion of SN50 chip manufacturing, SambaNova Cloud capacity, and Intel-SambaNova AI infrastructure go-to-market. Medium SO026, SO009
CO042 SambaNova reported record bookings and revenue for full-year 2025, as cited in the February 2026 BusinessWire press release; no specific figures were provided. Low SO026
CM001 The AI chip and accelerator market encompasses specialized semiconductors for machine learning training and inference workloads in data centers, cloud platforms, and edge deployments. High SM001, SM002
CM002 Data center AI semiconductor revenue constitutes the largest sub-segment within the AI chip market by revenue in 2026, driven by hyperscaler and enterprise AI infrastructure investment. Medium SM002
CM003 IDC defines the "intelligent datacenter" segment—encompassing CPUs, AI accelerators, GPUs, custom ASICs, and networking silicon—at $281B in 2026, within total data center semiconductor revenues of $477.1B. High SM002, SM022
CM004 AI inference—running trained model outputs at production scale—is the fastest-growing operational segment within AI chip demand in 2026, as deployed AI applications scale to millions or billions of daily queries. Medium SM016, SM017
CM005 AI training is a periodic, compute-intensive workload that companies perform less frequently than inference; it is a high-cost one-time event compared to the ongoing operational cost of inference serving. Medium SM001, SM016
CM006 Deloitte estimates the global AI chip market at approximately $500B in 2026, revised upward from an initial $300B estimate after a December 2025 WSTS upward revision of $175B driven entirely by AI demand. High SM001, SM020
CM007 IDC (April 2026) forecasts global semiconductor revenues reaching $1.29T in 2026, a 52.8% year-over-year increase, with data center semiconductor revenues at $477.1B driven by AI infrastructure investment. High SM002, SM022
CM008 SiliconAnalysts estimates the total merchant AI accelerator market—excluding hyperscaler captive custom silicon—at over $200B in 2026, with Nvidia retaining approximately 75–80% share. Medium SM003
CM009 AllAboutAI reports the global AI chip market (merchant chips, narrowest definition) reached $118B in 2024 and projects $293B by 2030, representing a 33.2% CAGR. Medium SM004
CM010 AMD CEO Lisa Su projected in November 2025 that the AI data center chip market will exceed $500B by 2028 and grow to approximately $1 trillion by 2030. Low SM001
CM011 Analyst estimates for the AI chip market in 2026 range from approximately $120B (AI inference sub-market, Fortune Business Insights) to $500B (all AI chips including memory and networking, Deloitte), with the divergence explained by market boundary definitions rather than forecast error alone. High SM001, SM002, SM003, SM016
CM012 IDC projects global semiconductor revenues reaching $1.75T by 2030, with data center semiconductors accounting for approximately $843B—nearly half of the total semiconductor market. Medium SM002, SM022
CM013 The AI chip market CAGR is estimated at 29–36% for 2024–2030 across major analyst reports, with AllAboutAI reporting 33.2% for the narrowest merchant chip definition. Medium SM004, SM001
CM014 Nvidia holds approximately 75–80% of the AI accelerator market by revenue in 2026, declining from a peak of approximately 87% in 2024 as the total market expands faster than Nvidia's revenue growth. Medium SM003, SM004
CM015 Nvidia's FY2026 data center revenue is projected at $150B+, with the Blackwell architecture (B200, GB200) serving as the primary revenue driver. Medium SM003
CM016 Nvidia's CUDA software ecosystem, built over 20+ years with 4M+ developers, creates structural switching costs for GPU alternatives; every major ML framework is optimized for CUDA first, and migration requires months to years of software re-engineering. Medium SM003, SM010
CM017 AMD is the primary merchant alternative to Nvidia in AI accelerators, holding approximately 6–10% of AI accelerator revenue in 2026 with MI300X/MI355X products, but faces a structural production ceiling due to only 11% of TSMC's CoWoS advanced packaging allocation vs. Nvidia's 60%. Medium SM003
CM018 Custom silicon from hyperscalers (Google TPU v5p/Trillium, AWS Trainium 2, Microsoft Maia 200, Meta MTIA v2) collectively accounts for $25–50B+ in 2026 AI chip value but is not available for external merchant sale, reducing Nvidia's addressable market without creating a new merchant competitor. Medium SM003, SM002
CM019 In AI inference specifically, Nvidia's market share is estimated at 60–75%—lower than its 90%+ training dominance—because inference workloads are more tolerant of alternative architectures and economic optimization matters more in production deployments than raw training throughput. Medium SM003, SM010
CM020 SambaNova's SN50 chip (announced February 24, 2026) claims 5x maximum speed and 3x lower total cost of ownership compared to Nvidia's Blackwell B200 on agentic inference workloads, based on internal benchmarks of Llama 3.3 70B, GPT-OSS 120B, and DeepSeek 671B. Medium SM009, SM010
CM021 SoftBank Corp. is the first confirmed customer for SambaNova's SN50 chip, deploying it in sovereign AI data centers in Japan to power low-latency inference services for enterprise customers with domestic data residency requirements. High SM009, SM013
CM022 SambaNova and Intel entered a planned multi-year strategic collaboration announced February 24, 2026, integrating Xeon CPU infrastructure with SambaNova's RDU systems and providing joint go-to-market through Intel's global enterprise and partner channels. High SM011, SM013
CM023 SambaNova's Series E round ($350M+, February 2026) was led by Vista Equity Partners and Cambium Capital with Intel Capital participating; proceeds are designated for SN50 production ramp, SambaCloud scaling, and enterprise software integrations. High SM009, SM013
CM024 SambaNova's 5x speed and 3x TCO claims for the SN50 vs. Nvidia B200 are based on internal benchmarking and have not been independently validated in production deployments as of May 2026. High SM009, SM010
CM025 NTT DATA's 2026 Global AI Report (surveying approximately 5,000 senior decision-makers across 30+ markets) found that more than 95% of organizations consider private and sovereign AI important to their AI strategy. High SM005, SM006
CM026 Only 29% of organizations surveyed by NTT DATA are prioritizing sovereign AI in a concrete, near-term way, despite 95% recognizing its importance—revealing an execution gap between stated priority and active deployment. High SM005, SM024
CM027 Approximately 35% of Chief AI Officers (CAIOs) identify building, integrating, and managing AI models in private or sovereign environments as their top barrier to adoption, per NTT DATA 2026. High SM005, SM006
CM028 96% of organizations are considering relocating AI infrastructure to specific regions due to geopolitical pressures and supply chain concerns, per NTT DATA's 2026 Global AI Report. Medium SM006, SM024
CM029 Forrester (Forbes, November 2025) predicts that half of G20 nations will mandate domestically tuned AI models for public-sector services in 2026, driven by the EU AI Act, India's AI Mission, Japan's AI Promotion Act, and US Executive Order 14179. Medium SM007
CM030 Forrester predicts defense industry players will win approximately one-third of the largest civilian software deals in 2026, signaling that security posture and sovereignty requirements are reshaping enterprise AI procurement decisions. Medium SM007
CM031 Flexential's 2026 State of AI Infrastructure Survey (350+ enterprise IT leaders) found 89% say reliable grid power availability influences AI deployment decisions, and 55% rank power cost differences as the top factor in choosing AI workload locations. Medium SM008, SM023
CM032 AI data centers require approximately 4x more power than the electrical grid is adding annually, creating a physical deployment ceiling that power availability, not budget, now constrains. Medium SM008
CM033 The share of enterprises expecting measurable AI financial returns within one year dropped from 51% to 36% between the 2025 and 2026 Flexential surveys, reflecting lengthening ROI timelines as infrastructure costs rise and pilot-to-production complexity increases. Medium SM008, SM023
CM034 87% of large enterprises are implementing AI solutions in 2026, but only 9% have achieved full AI maturity, indicating most are in early deployment stages with significant infrastructure investment decisions still ahead. Medium SM012
CM035 62% of organizations have not moved AI projects beyond the pilot stage, creating uncertainty about which inference workloads will actually scale to production and warrant dedicated inference infrastructure investment. Medium SM012
CM036 Hyperscalers (Amazon, Microsoft, Google, Meta) are expected to increase AI infrastructure capital expenditure by approximately 40% in 2026 to approximately $600B, per CloudLatitude citing S&P Global data. Medium SM015
CM037 Building an on-premises GPU cluster for sovereign AI requires an initial investment ranging from $700K to $7M, representing a significant capital intensity barrier for small and medium enterprises seeking AI infrastructure independence. Low SM014
CM038 Fortune Business Insights estimates the global AI inference market at approximately $117.8B–$126.2B in 2026, growing at a CAGR of approximately 17–19% through 2030. Medium SM016, SM017
CM039 By 2026, AI inference workloads are projected to account for approximately two-thirds of all AI compute cycles globally, compared to one-third in 2023, and inference is estimated to represent 80–90% of the lifetime operating cost of a deployed AI system. Medium SM016, SM001
CM040 Enterprise AI infrastructure spending reached approximately $104B in 2026, with hardware representing approximately 18% of total AI spend, according to composite estimates from enterprise AI surveys. Low SM012, SM008
CM041 The vertical AI category—industry-specific AI solutions for healthcare, legal, and government—reached $3.5B in 2025, triple the prior year's level, indicating accelerating specialized vertical market growth. Medium SM012
CM042 Agentic AI workloads—involving sequential multi-turn reasoning loops, tool-calling, and multi-step planning—amplify latency penalties multiplicatively across model calls, creating demand for purpose-built low-latency inference architectures that differ from GPU systems optimized for training throughput. Medium SM010, SM009
CM043 By 2028, IDC projects 33% of enterprise software applications will include agentic AI—up from less than 1% in 2024—indicating significant near-term demand growth for inference capacity optimized for agentic workloads. Medium SM012
CM044 Estimates of the AI chip TAM in 2026 conflict materially: Deloitte cites approximately $500B (all AI chips), IDC cites $281B for its intelligent datacenter sub-segment, SiliconAnalysts cites $200B+ for merchant accelerators only, and AllAboutAI's 2024 baseline implies ~$165B for merchant chips in 2026—illustrating that different boundary definitions produce non-comparable figures that should not be directly compared without scope normalization. High SM001, SM002, SM003, SM004
CP001 In 2026, at least three distinct non-GPU inference hardware architectures compete with Nvidia in the enterprise AI inference market: wafer-scale (Cerebras WSE-3), language processing units (Groq LPU), and reconfigurable dataflow units (SambaNova RDU). Medium SP015, SP016, SP017
CP002 Inference accounted for roughly two-thirds of all AI compute workloads in 2026, having overtaken training as the dominant AI workload. Medium SP022
CP003 SambaCloud charges approximately $0.70–$4.50 per million output tokens for leading open-source LLMs (Llama, DeepSeek, MiniMax), with a limited free credit for new users. Medium SP016
CP004 Cloud inference API providers including SambaCloud, Groq, and Cerebras are subject to high multi-homing risk because buyers can switch providers with a single API key change and a vendor-neutral model name. Medium SP015, SP016
CP005 Cerebras' Wafer-Scale Engine 3 (WSE-3) achieves 1,000+ tokens per second on Llama 3.1 405B and 2,000+ tokens per second on Llama 3.3 70B, making it the fastest inference silicon in published benchmarks as of May 2026. High SP015, SP016, SP025
CP006 Cerebras IPO'd on Nasdaq under ticker CBRS in mid-May 2026, raising approximately $3.5 billion at a fully diluted valuation of $26–27 billion, making it the largest U.S. tech IPO of 2026. Medium SP013, SP023
CP007 Cerebras generated $510 million in revenue in 2025, a 76% year-over-year increase, with a 47% net profit margin ($238 million net income), funded by its flagship $20 billion compute deal with OpenAI. Medium SP023, SP013
CP008 Groq's Language Processing Unit (LPU) delivers 394–840 tokens per second on Llama 3.3 70B through GroqCloud, priced at $0.08–$0.79 per million output tokens with a generous free tier. High SP006, SP016
CP009 NVIDIA reportedly executed a $20 billion IP licensing deal with Groq in late 2025, incorporating LPU streaming inference technology into its Rubin GPU architecture, validating the purpose-built inference silicon thesis. Medium SP015, SP020
CP010 Following NVIDIA's acquisition of Groq's IP, Groq's independence as a competitive inference-silicon vendor is effectively removed; its LPU technology is being integrated into NVIDIA's Rubin GPU series. Medium SP015, SP020
CP011 SambaNova's SN50 RDU (fifth generation) delivers approximately 129 tokens per second per user on Llama 3.1 405B and claims approximately 5x faster decode performance versus the SN40L generation. Medium SP003, SP015
CP012 Cerebras' customer revenue is highly concentrated: G42 (Abu Dhabi) and OpenAI together represent the majority of projected Cerebras revenue through 2028, with OpenAI's $20 billion deal being the primary revenue driver. Medium SP013, SP023
CP013 NVIDIA's CUDA developer ecosystem—encompassing millions of trained models, pre-compiled libraries, and deep third-party tool integration—represents the highest switching cost barrier in AI infrastructure, protecting NVIDIA's ~80–90% market share. Medium SP015, SP022
CP014 NVIDIA DGX's Blackwell B200 GPU delivers up to 4x higher inference throughput than the H100 on FP4 workloads and supports 192 GB HBM3e per GPU; however, B200 requires liquid-cooling infrastructure not standard in existing enterprise data centers. High SP021, SP022, SP007
CP015 NVIDIA DGX platform serves 9 U.S. government institutions and 8 of the top 10 global telcos, providing NVIDIA with deep distribution entrenchment in enterprise and government accounts that SambaNova and Cerebras must displace. Medium SP007
CP016 AMD Instinct MI300X features 192 GB HBM3 memory with 5.3 TB/s bandwidth per GPU, enabling 70B+ parameter LLMs to run without model sharding—a hardware advantage for enterprise inference with large models. High SP010, SP019
CP017 AMD's ROCm software ecosystem, while significantly improved since 2024, still lags NVIDIA's CUDA stack in third-party library coverage, pre-trained model integrations, and developer familiarity. Medium SP019, SP022
CP018 Intel Gaudi 3 is available as an 8-chip system priced at approximately $125,000 including Ethernet networking, significantly below an equivalent NVIDIA DGX H100 node at $350,000+, making it the lowest-upfront-cost enterprise AI accelerator cluster available in 2026. Medium SP019, SP011
CP019 Intel Gaudi 3 uses native Ethernet interconnects (not InfiniBand), making it compatible with standard enterprise data center fabrics—a deployment characteristic it shares with SambaNova's air-cooled on-prem systems. Medium SP011, SP019
CP020 AWS Trainium3, built on 3nm process, delivers 4.4x the compute performance of Trainium2 and 30–40% better price-performance than GPU-based EC2 P5e/P5en instances for generative AI inference workloads. Medium SP008
CP021 AWS Trainium requires model porting to the proprietary AWS Neuron SDK and integration with SageMaker or EKS; there is no on-premises deployment option, making it structurally inaccessible to sovereign AI and classified government buyers. High SP008, SP024
CP022 Google Cloud TPU Ironwood (7th generation) delivers 42.5 ExaFlops per pod across 9,216 liquid-cooled chips with 4x better performance per chip over its predecessor Trillium, and powers Gemini and all Google consumer AI applications at over 1 billion users. Medium SP009
CP023 Google Cloud TPU requires XLA-compatible models (JAX, TensorFlow, or PyTorch with XLA path); the porting burden for PyTorch-native enterprise teams creates adoption friction and reinforces Google cloud lock-in. Medium SP009, SP024
CP024 SambaNova has announced sovereign AI deployments and partnerships in at least four countries outside the United States: Australia (SCX), Germany (Infecom), UK (Argyll), and Japan (SoftBank), deploying SambaCloud-based regional inference infrastructure. Medium SP001
CP025 The SambaNova and Intel heterogeneous inference blueprint, announced April 8, 2026, assigns GPU prefill, SambaNova RDU decode, and Intel Xeon 6 CPU agentic tool execution to distinct hardware roles, enabling deployment in existing standard air-cooled data centers. High SP002, SP018
CP026 SambaNova's SN50 RDU three-tier memory architecture (SRAM + HBM + DRAM) allows multiple large language models to remain resident in memory simultaneously, enabling near-zero model-switching latency that is architecturally difficult for GPU-only stacks to replicate. Medium SP003, SP015
CP027 SambaNova holds SOC 2 Type 2 and ISO/IEC 27001:2022 security certifications, enabling deployment in government and public sector environments with strict data residency and security requirements. Medium SP001
CP028 Argonne National Laboratory (U.S. Department of Energy) deployed a SambaNova DataScale SN40L cluster containing sixteen RDUs through the ALCF AI Testbed, available to DOE scientific research programs via the NAIRR Pilot. High SP004, SP001
CP029 SambaCloud provides inference access to Llama 4, DeepSeek R1, MiniMax M2.7, Qwen, and other leading open-source models, with SambaNova claiming particular strength on reasoning-focused workloads such as DeepSeek R1. Medium SP016, SP003
CP030 Open-source inference runtimes including vLLM and SGLang provide a hardware-agnostic middleware layer that abstracts over GPU, RDU, TPU, and LPU hardware, reducing the proprietary software stack advantage of any single inference hardware vendor including SambaNova. Medium SP015, SP022
CP031 In October 2025, The Information and multiple outlets reported that SambaNova Systems was exploring a sale after struggling to close a new funding round; the company had been last valued at $5 billion in its 2021 Series D. Medium SP012, SP017
CP032 SambaNova's inability to complete a funding round at or above its 2021 $5 billion valuation, as reported in late 2025, is an adverse signal that investor confidence in its standalone competitive position and path to liquidity has weakened. Medium SP012
CP033 SambaNova's low brand awareness among enterprise buyers relative to NVIDIA, AMD, and even Cerebras and Groq creates a slower sales cycle and higher competitive evaluation burden for enterprise IT buyers unfamiliar with the RDU architecture. Medium SP017
CP034 Together AI's cloud inference platform is built on NVIDIA H100 and B200 GPUs and offers the widest open-source model catalog in the inference-as-a-service segment, with token pricing of $0.03–$4.50 per million tokens but no free tier. Medium SP014
CP035 Groq GroqCloud lists Llama 3.3 70B Versatile at $0.79 per million output tokens and 394 tokens per second, and offers 500K–1M free tokens per day, making it the most accessible free-tier inference API among custom-silicon providers. Medium SP006
CP036 NVIDIA's CUDA software ecosystem encompasses millions of developer integrations, pre-trained model libraries, and third-party tools; SambaNova's SambaNova Suite provides a compiler abstraction layer but lacks equivalent third-party library coverage, creating a persistent adoption friction barrier. Medium SP017, SP015
CP037 Cerebras and NVIDIA are both pursuing U.S. national laboratory (DOE) and sovereign AI customers—the same buyer segment where SambaNova has its most established deployments—creating competitive re-procurement risk over multi-year periods. Medium SP013, SP007, SP004
CP038 The SambaNova-Intel heterogeneous inference blueprint is scheduled for general availability in H2 2026 and targets enterprises, cloud providers, and sovereign AI programs that require deployment in existing air-cooled data centers. High SP002, SP018
CP039 Cerebras' CS-3 system and SambaNova DataScale both target on-premises deployment for enterprise and sovereign buyers, creating direct competition for the same capital expenditure budget at national laboratories and government data centers. Medium SP005, SP004, SP013
CP040 Intel's Data Center Group, which is the organizational owner of Gaudi 3 accelerators and the Xeon 6 CPUs central to the SambaNova partnership, has been subject to restructuring pressure in 2025–2026, creating execution risk for the multi-year SambaNova-Intel blueprint. Medium SP019, SP002
CI001 SambaNova offers cloud API inference services (SambaNova Cloud, launched September 2024) priced on a per-million-token basis, with rates ranging from $0.10 to $4.50 per million tokens depending on model complexity. Medium SI007, SI022
CI002 SambaNova Cloud offers three tiers: a free tier for experimentation, a developer pay-as-you-go tier for latency-critical production workloads, and an enterprise tier with custom pricing for high-throughput deployments with dedicated capacity and SLAs. High SI007, SI010
CI003 SambaNova's three primary revenue streams are: (1) cloud API inference (SambaNova Cloud), (2) on-premise hardware sales (DataScale RDU systems), and (3) professional services including deployment, data preparation, and model optimization. Medium SI006, SI026
CI004 Professional services account for an estimated 25–33% of new customer engagements at SambaNova, according to Sacra's company analysis. Low SI026
CI005 SambaNova confirmed U.S. government and national laboratory customers including Lawrence Livermore National Laboratory (LLNL), Los Alamos National Laboratory (LANL), Argonne National Laboratory, and Oak Ridge National Laboratory. High SI001, SI018, SI014
CI006 The U.S. Department of Energy's NNSA established a strategic partnership with SambaNova Systems to deploy multiple DataScale systems at Lawrence Livermore and Los Alamos National Laboratories, confirming material government revenue. High SI018, SI001
CI007 SambaNova's hardware pricing for on-premise DataScale systems is not publicly disclosed and is estimated to be in the multi-million-dollar range per system based on industry references and the scale of national lab procurements. Low SI001, SI018, SI026
CI008 SambaNova's revenue model combines an API-based cloud subscription recurring revenue stream with lumpy hardware sales and project-based professional services, creating a mixed recurring/non-recurring revenue quality profile. Medium SI003, SI006, SI026
CI009 SambaNova reached approximately $100M in annual recurring revenue in June 2025, representing a significant top-line milestone after years of building its AI hardware and cloud platform. Medium SI011, SI012, SI025
CI010 SambaNova reported approximately 4x (fourfold) ARR growth during calendar year 2024, reflecting the acceleration of its cloud inference business following the September 2024 launch of SambaNova Cloud. Medium SI014, SI013, SI027
CI011 By February 2026, at the time of the Series E announcement, SambaNova's ARR was estimated by analysts at over $180M, representing approximately 80%+ year-over-year growth from the mid-2025 $100M milestone. Low SI021, SI025, SI013
CI012 SambaNova's official Series E press release confirmed "record bookings and revenue as they closed out 2025," indicating accelerating commercial momentum entering 2026. High SI002, SI008
CI013 SoftBank Corp. was publicly announced as the first customer to deploy SambaNova's new SN50 chip within its next-generation AI data centers in Japan, representing a significant marquee contract win for SambaNova's latest hardware generation. High SI002, SI008, SI023
CI014 SambaNova targets enterprise and sovereign AI deployments across financial services, telecommunications, energy, and government sectors, as cited in the Series E press release. High SI002, SI008
CI015 SambaNova raised a $350M Series E in February 2026, led by Vista Equity Partners and Cambium Capital, with Intel Capital, Qatar Investment Authority (QIA), GV, Battery Ventures, and accounts advised by T. Rowe Price Associates participating. High SI002, SI004, SI008
CI016 SambaNova's total cumulative capital raised as of the Series E close in February 2026 is approximately $1.48–1.49B across all rounds from seed through Series E. High SI009, SI017, SI006
CI017 SambaNova's Series D was a $676M round closed April 13, 2021, led by SoftBank Vision Fund 2, valuing the company at $5.1B post-money, with participation from Temasek, GIC, BlackRock, Intel Capital, GV, and Walden International. High SI001, SI016
CI018 SambaNova's SEC Form D filing (CIK 0001733073, dated 2021-08-09) confirms total Series D proceeds of $677,999,515 with an initial sale date of April 13, 2021. High SI016, SI017
CI019 BlackRock cut the value of its SambaNova shares by 17% in late 2025, implying a company valuation of approximately $2.4B — a 53% discount to the 2021 Series D peak of $5.1B. High SI005, SI024
CI020 In late 2025, SambaNova held acquisition discussions with Intel at a reported valuation of approximately $1.6B (including assumed debt), a further 33%+ discount to BlackRock's $2.4B mark and a 69% discount to the 2021 peak. Medium SI015, SI020, SI013
CI021 The Intel acquisition talks were complicated by CEO Lip-Bu Tan's dual role as SambaNova's executive chairman and major early investor via Walden International, which created a governance conflict and caused Intel to introduce new recusal policies. Medium SI020, SI013
CI022 The Series E was described by SambaNova CEO Rodrigo Liang as the "right decision" after the company "ended up having a record year last year," choosing standalone growth over acquisition. High SI003, SI002
CI023 SambaNova's official post-money Series E valuation is cited by third-party data sources (Tracxn) at approximately $4.8B; however, the company itself did not disclose a valuation in the Series E announcement. Medium SI025, SI004
CI024 The effective Series E valuation is lower than the 2021 Series D valuation of $5.1B, with secondary market and institutional markdown data suggesting a range of $2.24B–$4.8B, making the Series E a functional down round in practical value. Medium SI005, SI013, SI025
CI025 SambaNova's business model is highly capital intensive due to custom silicon R&D, chip manufacturing, datacentre build-out, and professional services delivery, requiring continuous large capital infusions with no clear path to near-term profitability disclosed. Medium SI002, SI006, SI009
CI026 SambaNova employed approximately 417 employees as of late 2025, down from prior peak levels, suggesting some cost discipline while R&D and engineering spend remains material. Medium SI011, SI014
CI027 Independent analysts estimate SambaNova's monthly burn rate at approximately $10–25M per month based on headcount (~417), R&D intensity, capex needs, and fundraising cadence; this figure is not officially disclosed. Low SI013, SI006, SI014
CI028 With $350M raised in February 2026 and estimated monthly burn of $10–25M, SambaNova's estimated cash runway is approximately 14–35 months from the Series E close, implying cash adequacy through at least mid-2027 in a downside scenario. Low SI009, SI013
CI029 The Series E proceeds are earmarked for expanding SN50 chip manufacturing capacity and scaling cloud infrastructure, confirming ongoing capex requirements rather than a transition to cash-generation mode. High SI002, SI008
CI030 SambaNova has gone approximately five years between primary capital raises (Series D April 2021 to Series E February 2026), suggesting the company exhausted earlier capital reserves before the Series E, a sign of sustained high burn over that period. Medium SI001, SI009, SI017
CI031 SambaNova's gross margin is not publicly disclosed; based on comparable AI chip and full-stack hardware-software companies, the blended gross margin is estimated in the 35–55% range, weighted down by hardware manufacturing costs. Low SI006, SI028
CI032 SambaNova's cash position immediately post-Series E close is not publicly disclosed; any runway estimate is uncertain and depends on undisclosed pre-existing cash balances and debt obligations. Low
CI033 SambaNova's revenue quality is mixed: cloud API revenue is recurring and scalable (high quality), while hardware sales are lumpy and dependent on large enterprise or government procurements (medium quality), creating revenue predictability risk. Medium SI003, SI006, SI026
CI034 SambaNova's valuation discrepancy — official $4.8B post-money versus BlackRock's implied $2.4B mark-down — creates material uncertainty about enterprise value for new investors considering entry at headline terms. Medium SI005, SI025, SI013
CI035 SambaNova's prior five-year fundraising gap and near-sale process in 2025 indicate the company's financial position weakened significantly relative to 2021, raising the risk of a repeat financing crisis if top-line growth does not materially accelerate. Medium SI005, SI014, SI020
CI036 At approximately $4.8B post-money against ~$180M ARR, SambaNova's headline valuation implies roughly 26x ARR multiple, which is elevated for a hardware-software company with unconfirmed gross margins and high capital intensity, though lower AI chip peers have commanded similar or higher multiples at comparable revenue stages. Low SI011, SI025, SI028
CI037 SambaNova's dependence on continuous external capital for manufacturing and cloud buildout means future financing events are highly probable, creating dilution risk for existing investors and potential for further down rounds if revenue growth slows. Medium SI002, SI006, SI009
CI038 Key financial diligence blockers that cannot be resolved from public sources include: audited gross margin by segment, actual cash and burn rate, revenue disaggregation by stream, government contract concentration, and CAC / NRR by customer cohort. Low
CE001 The SambaNova SN40L RDU chip integrates a three-tier memory architecture comprising 520 MB on-chip SRAM, 64 GB HBM per chip, and 768 GB DDR DRAM per chip in the SN40L-16 system configuration. High SE011, SE009, SE015
CE002 Argonne National Laboratory's ALCF AI Testbed includes sixteen SambaNova SN40L Reconfigurable Dataflow Units deployed as a new inference-optimized cluster alongside a pre-existing SN30 training cluster. Medium SE025
CE003 The SambaNova SN40L (4th generation) RDU chip uses TSMC 5 nm fabrication and was published at IEEE MICRO 2024; predecessor generations (SN30) were training-optimized while SN40L targets inference. Medium SE009, SE010
CE004 The SambaNova Reconfigurable Dataflow Architecture (RDA) creates continuous processing pipelines that map neural network computation graphs directly onto hardware, minimizing data movement versus the GPU ISA kernel-by-kernel model that requires repeated weight loading from memory. Medium SE004, SE009, SE010
CE005 The IEEE Micro paper on SN40L Composition of Experts reports that an eight-socket SN40L node achieves a 3.7× end-to-end speedup over DGX H100 and 6.6× over DGX A100 on CoE workloads, with model switching 15–31× faster than GPU baselines. High SE009, SE010
CE006 The SambaNova SN40L node can host and serve a 1 trillion-parameter Composition of Experts model on a single node with a 19× smaller machine footprint compared to GPU baselines. High SE009, SE010
CE007 The SambaNova SN50 (5th generation) chip, announced February 2026, delivers five times more compute per accelerator and four times more network bandwidth than the SN40L, and supports linking up to 256 accelerators over a multi-terabyte-per-second interconnect. Medium SE006, SE013, SE003
CE008 The SN50 supports models up to 10 trillion parameters with context lengths up to 10 million tokens, targeting agentic AI workloads requiring multi-model concurrency. Medium SE013, SE003, SE016
CE009 The SambaRack SN50 packs 16 SN50 accelerators per rack at approximately 20 kW power consumption and uses exclusively air cooling, requiring no liquid cooling infrastructure. Medium SE016, SE001, SE013
CE010 SambaNova's SambaStack claims four times the energy savings compared to GPU-based systems, with SambaRack operating at approximately 10 kW per rack compared to standard GPU rack configurations. Medium SE002, SE016
CE011 SambaFlow, the SambaNova software compiler, translates ML computation graphs to RDU dataflow programs using spatial programming and aggressive operator fusion, eliminating per-kernel overhead present in GPU CUDA execution. Medium SE004, SE007, SE009
CE012 SambaCloud, SambaNova's public inference API, launched September 2024 and is fully OpenAI-API-compatible, requiring only a base URL change from existing OpenAI client code. Medium SE012, SE008, SE021
CE013 SambaCloud's free tier requires no credit card, provides $5 in free credits valid for 30 days, and grants access without a waitlist. Medium SE021, SE008
CE014 SambaNova offers three commercial deployment modes: SambaCloud (public API), SambaStack (on-premises full-stack), and SambaManaged (turnkey managed service available via AWS Marketplace), all sharing the same underlying SambaFlow software layer. Medium SE007, SE002, SE021
CE015 SambaStack delivers a 90-day deployment commitment for on-premises installations, including hardware installation, SambaFlow software configuration, and pre-loaded model bundles. Medium SE021, SE002
CE016 The Accenture partnership (2023) positions SambaNova for regulated enterprise buyers requiring model ownership, exportable model weights, and data governance—use cases where GPU-cloud providers do not guarantee full data sovereignty. Medium SE024
CE017 SambaNova supports air-gapped and fully offline on-premises deployment through SambaStack, enabling deployments in environments that prohibit external network connectivity. Medium SE021, SE002, SE024
CE018 SambaCloud sovereign deployments are active in Japan (SoftBank), Australia (SouthernCrossAI), Germany/Luxembourg (Infercom), and the UK (Argyll), with Infercom claiming GDPR and EU AI Act compliance for the European deployment. Medium SE021, SE005, SE013
CE019 SambaCloud's model catalog as of May 2026 contains approximately 10 models: DeepSeek V3.1, DeepSeek V3.2, DeepSeek R1 Distill Llama 70B, Llama 4 Maverick, Llama 3.3 70B, gpt-oss-120b (high and low tiers), MiniMax M2.5, MiniMax M2.7, and Gemma 3 12B. Medium SE021, SE020
CE020 SambaCloud states explicitly that it 'never sees or collects any of your data or user prompts'; this claim has no third-party attestation or independent audit confirmation as of May 2026. Low SE021
CE021 SoftBank Corp. will be the first customer to deploy SN50 chips within its next-generation AI data centers in Japan, as confirmed by both companies in the February 2026 press release. Medium SE013, SE006
CE022 At SambaCloud's September 2024 public launch, Artificial Analysis independently benchmarked SambaNova at 132 output tokens/sec on Llama 3.1 405B—the fastest speed available for that model across all cloud endpoints tracked by Artificial Analysis at that time. High SE012, SE020, SE015
CE023 SambaNova Cloud delivered Llama 3.1 70B at 461 tokens/sec at full 16-bit precision at the September 2024 launch, versus Cerebras at 445 t/s and Groq at approximately 250 t/s on the same model at that benchmark window. Medium SE012, SE015, SE020
CE024 SambaNova reports DeepSeek R1 671B inference at 231–255 tokens/sec at full 16-bit precision; GPU-based providers on the same model average approximately 19 tokens/sec per user, constrained by HBM memory bandwidth and forced quantization. Medium SE021, SE020
CE025 The SN50 benchmark figures claim 895 tokens/sec/user on Llama 3.3 70B versus 184 tokens/sec on Nvidia B200; these figures cite SemiAnalysis InferenceX, a vendor-engaged commercial benchmarking firm, not an open-methodology independent benchmark. Low SE013, SE016
CE026 GPU LLM inference is memory-bandwidth-bound, with Databricks engineering data showing H100 tensor-parallel efficiency declining from approximately 60% with 2 GPUs to 25% with 8 GPUs, limiting batch-parallel scale-out for single-user latency. Medium SE015, SE022
CE027 SambaNova is the only production cloud provider offering inference on both Llama 3.1 405B and DeepSeek R1 671B at full 16-bit precision at production speed as of the run date; Groq and Cerebras do not offer these models. Medium SE021, SE020, SE022
CE028 SambaNova has no public fine-tuning API on SambaCloud; customers requiring model adaptation must use external providers for fine-tuning and then switch to SambaNova for inference, introducing pipeline friction. Medium SE021
CE029 All published SN50 performance claims as of May 2026 cite either SembaNova internal measurements or SemiAnalysis InferenceX commercial benchmarks; no open-methodology third-party independent validation of SN50 claims exists. Medium SE021, SE016, SE013
CE030 SambaCloud's standard DeepSeek V3.1 offering is capped at 131K input context with only 7K completion tokens; the extended-context variant (V3.1-cb) provides 32K completion tokens but reduces input context to 32K. Medium SE021
CE031 The SambaNova ai-starter-kit GitHub repository provides open-source examples in four categories (Data Ingestion, Model Development, Information Retrieval, Advanced AI Capabilities) and supports integration with both SambaCloud API and on-premises SambaStack endpoints. Medium SE017
CE032 The sambanova Python SDK on PyPI requires Python 3.9+ and exposes synchronous and asynchronous clients (httpx backend) with optional aiohttp support, streaming (SSE), typed Pydantic models, and automatic retry with exponential backoff. Medium SE018
CE033 The SambaNova HuggingFace organization (sambanovasystems) hosts 32 public model checkpoints including SambaLingo multilingual variants (Arabic, Turkish, Hungarian, Thai at 7B–70B scales) as of March 2026. Medium SE019
CE034 SambaCloud's developer integration ecosystem includes LangChain (langchain-sambanova), LlamaIndex, CrewAI, AutoGen, OpenRouter, n8n, AWS, and approximately 50 total third-party integrations. Medium SE021, SE017
CE035 SambaNova provides a Responses API (announced May 2026) targeted at coding agents, in addition to its standard Chat Completions API, expanding the product surface for agentic application development. Low SE007, SE008
CE036 The Intel-SambaNova heterogeneous inference blueprint—announced April 2026 for H2 2026 availability—specifies GPUs for prefill, SambaNova RDUs for decode, and Intel Xeon 6 CPUs as host and 'action CPU' for agentic tool execution. Medium SE014, SE013
CE037 SambaNova's SambaCloud model catalog contains approximately 10 models as of May 2026, compared to 50–200 models at GPU-cloud competitors such as Together AI and Fireworks AI; no image, video, or text-to-speech generation is offered. High SE021, SE020
CE038 SambaNova does not publicly list SOC 2, FedRAMP, or ISO 27001 certifications on its website as of May 2026; security attestations for regulated enterprise and government buyers must be obtained via direct audit engagement. Medium SE021, SE024
CE039 SambaNova's competitive position against Nvidia Blackwell B200 (192 GB HBM3e, 8 TB/s bandwidth) and AMD MI300X is not independently verified; as GPU HBM density and bandwidth improve, the memory-bandwidth advantage of RDU three-tier memory may narrow. Low SE022, SE023, SE015
CE040 The Composition of Experts (CoE) architecture implemented on SN40L enables a 1 trillion-parameter model to be deployed on a single node by composing multiple smaller expert models, with the DDR tier holding expert weights that are dynamically loaded to HBM based on routing decisions. High SE009, SE010
CE041 SambaNova's enterprise-focused sales model requires customers to undertake infosec reviews typically spanning multiple months before on-premises deployment can begin, limiting sales velocity versus cloud-native API competitors. Medium SE015, SE022
CU001 SambaNova's publicly documented customer base spans five segments: DOE/NNSA national laboratories, academic/government HPC centers, cloud and sovereign AI channel partners, enterprise and financial-services organizations, and developer API users. High SU010, SU021
CU002 The DOE/NNSA national-laboratory segment is the most credibly evidenced customer cohort, with five confirmed on-premises DataScale hardware deployments supported by official government announcements. High SU007, SU001, SU005
CU003 SambaNova Cloud is available on the AWS Marketplace with a pay-as-you-go consumption model at $0.01 per unit, targeting enterprise developers building custom AI applications. Medium SU016
CU004 The sovereign AI channel segment accelerated sharply in H2 2025, with three new sovereign cloud partnerships announced in October 2025 (SCX Australia, Argyll UK, Infercom Germany) and OVHcloud in November 2025. High SU022, SU014
CU005 CB Insights lists SambaNova's customers as including OTP Bank, Argyll Data Development, Blackbox.AI, TACC, OVHcloud, Los Alamos National Laboratory, Ascend, OTP Group, and Carahsoft, with SCX.ai as a partner as of April 2026. Medium SU010
CU006 Argonne National Laboratory (ALCF) deployed a new SambaNova SN40L inference cluster of 16 RDUs in November 2024, expanding its existing SN30 training cluster as part of the ALCF AI Testbed. High SU001, SU002
CU007 Oak Ridge National Laboratory (ORNL) deployed SambaNova Suite powered by SN40L and CoE framework in November 2024 for parallel multi-model scientific inferencing, with the stated goal of running models more efficiently than on the Frontier supercomputer. Medium SU005, SU021
CU008 Texas Advanced Computing Center (TACC, University of Texas at Austin) deployed SambaNova Suite in November 2024 as its dedicated inference platform and will use it as a foundational inference layer for the NSF Leadership-Class Computing Facility (LCCF) and National AI Research Resource (NAIRR). High SU003, SU004
CU009 Lawrence Livermore National Laboratory (LLNL) integrated SambaNova DataScale SN10 RDUs into its Corona supercomputing cluster under a formal DOE/NNSA strategic partnership, targeting cognitive simulation for inertial confinement fusion and COVID-19 drug discovery with claimed 5× speedup versus GPU-normalized comparisons. High SU006, SU007
CU010 Los Alamos National Laboratory (LANL) integrated SambaNova DataScale into its Darwin heterogeneous cluster for quantum-chemistry modeling using Density Functional Theory, under the same DOE/NNSA strategic partnership as LLNL. High SU007, SU006
CU011 RIKEN Center for Computational Science (Japan) adopted SambaNova DataScale in March 2023 to accelerate integration of the Fugaku supercomputer with AI workloads for digital twins and Society 5.0 research. High SU015, SU019
CU012 In May 2024, RIKEN's Fugaku-LLM (a Japanese LLM trained on the Fugaku supercomputer) was integrated into SambaNova's Samba-1 Composition of Experts platform, running optimally on the SN40L chip with its three-tier memory architecture. High SU019, SU015
CU013 Carahsoft lists SambaNova on a variety of federal, state, and local government contracts to facilitate agency procurement of SambaNova IT solutions. Medium SU017, SU010
CU014 Procurely's government contract database records 3 state-level contract awards for Sambanova Systems Inc. totaling approximately $2.5 million, likely underrepresenting total government revenue given that direct federal lab procurement is not captured. Low SU024
CU015 The Argonne ALCF AI Testbed, which includes both SambaNova DataScale SN30 (training) and SN40L (inference) systems, is accessible through the National Artificial Intelligence Research Resource (NAIRR) Pilot, extending SambaNova's reach to the broader U.S. AI research community. High SU001, SU002
CU016 Accenture deployed SambaNova Suite to provide enterprise and government customers with generative AI solutions including Contact Center Intelligence and Document Intelligence, with governance, auditability, and model-ownership controls designed for regulated industries including banking. Medium SU013, SU025
CU017 SoftBank Corp. (Japan) expanded its SambaNova Cloud deployment in March 2025 by adding SambaNova hardware racks to a new Japanese AI data center, offering fast inference to APAC developers via SambaNova Cloud with Japanese-language models including Swallow. High SU008, SU023
CU018 SoftBank Corp. was designated the first customer for SambaNova's SN50 chip in February 2026, deploying it within SoftBank's next-generation AI data centers in Japan for sovereign and enterprise AI inference across APAC. High SU009, SU008
CU019 OVHcloud selected SambaNova in November 2025 to power its flagship AI Endpoints service, with SambaStack RDU hardware offering a 99.8% uptime SLA, targeting financial trading, cybersecurity, industrial automation, and logistics use cases, with deployment in France by end of 2025. Medium SU014, SU023
CU020 OTP Bank (Hungary) partnered with OTP Group, ITM, and SambaNova to deploy an AI supercomputer capable of building GPT-3-level language models for Central and Eastern European regional languages. Medium SU010, SU025
CU021 Saudi Aramco signed a memorandum of understanding with SambaNova Systems to explore ways to accelerate AI capabilities and Kingdom-wide adoption, focused on supercomputing, infrastructure, and scalable industrial AI model deployment aligned with Vision 2030. Low SU025
CU022 In October 2025 SambaNova announced three sovereign AI cloud partnerships — SCX (Australia), Argyll (UK), and Infercom (Germany) — each deploying SambaNova SN40L systems at approximately 10 kW per rack with renewable energy for sovereign, EU-compliant, or Australia-onshore inference services. High SU022, SU010
CU023 SambaNova Cloud is used by Blackbox.AI (developer tools) and Argyll Data Development (energy sector hyperscale data centers) as documented by CB Insights, representing the developer-API customer cohort. Medium SU010
CU024 The SN50 press release (February 2026) states SambaNova achieved "record bookings and revenue" as it closed out 2025, with accelerating demand in financial services, telecommunications, energy, and sovereign deployments — the first self-reported revenue trajectory signal from the company. Low SU009
CU025 In April 2025, SambaNova laid off approximately 77 employees (about 15% of its workforce) to pivot from training-focused workloads to inference, indicating that the earlier training-oriented customer base was insufficient to sustain the company at scale. High SU018, SU020
CU026 In October 2025, reports from The Information (cited by WebProNews and Tech Startups) indicated that SambaNova had failed to close a new funding round and was exploring a potential sale, implying that commercial revenue from enterprise customers was insufficient to sustain operations at then-current burn rates. Medium SU011, SU012
CU027 All publicly named hardware-system customers (Argonne, ORNL, TACC, LLNL, LANL, RIKEN) are government or academic institutions funded by DOE, NNSA, or NSF; SambaNova has disclosed no independently named commercial Fortune 500 enterprise hardware customer. High SU007, SU001, SU003, SU005, SU025
CU028 SoftBank is simultaneously an investor in SambaNova (SoftBank Vision Fund 2) and a key channel customer for SambaNova Cloud and SN50, creating a related-party relationship that may inflate the perceived commercial-customer depth. Medium SU008, SU009, SU025
CU029 SambaNova has publicly disclosed no structured customer retention metrics — no NRR, GRR, churn rate, or customer satisfaction scores — for any customer segment. High SU010, SU016
CU030 The only observable retention signal in the public record is Argonne's hardware upgrade from SN30 to SN40L (multi-generation expansion) and SoftBank's designation as the first SN50 customer (channel expansion); no other customers have publicly confirmed renewed or expanded contracts. High SU001, SU009
CU031 SambaNova's February 2026 Series E financing of $350M+ (led by Vista Equity Partners and Cambium Capital) appears to have resolved the near-term liquidity crisis identified in October 2025, but the round's $350M+ size also suggests significant ongoing capital consumption. Medium SU009, SU011
CU032 SambaNova's claimed average enterprise contract value is above $5 million per deal with multi-year professional services components, per secondary business analysis; this is not confirmed by any official SambaNova disclosure. Low SU025
CU033 The Argonne ALCF deployment spans 16 RDUs in the new SN40L inference cluster, joining an existing SN30 training cluster in the AI Testbed — confirming multi-system, multi-generation presence at a single site. High SU001, SU002
CU034 SambaNova's NNSA/DOE partnership funding the LLNL and LANL deployments was provided through NNSA's Advanced Simulation and Computing program, confirming federal-budget-backed procurement rather than commercial purchase. High SU007, SU006
CU035 ORNL cited SambaNova's capability to run parallel inferencing across multiple models simultaneously as the key reason for adoption, specifically to run inference on the Frontier supercomputer's data more efficiently by offloading to the SambaNova platform. Medium SU005, SU021
CU036 SambaNova's sovereign AI deployment with SCX (Australia) achieves approximately 10 kW per rack power consumption — claimed to be one-twelfth of traditional GPU systems (120 kW/rack) — making it deployable in air-cooled, renewable-energy data center environments without liquid-cooling infrastructure. Medium SU022
CU037 SambaNova Cloud's developer API supports OpenAI-compatible endpoints, free developer tiers, and integration with AWS PrivateLink, targeting enterprise development teams who need high-throughput open-source LLM inference. Medium SU016
CR001 Nvidia held approximately 70–80% of global datacenter GPU revenue in 2025, creating a near-monopoly in AI training that SambaNova must displace to grow beyond inference-only workloads. Medium SR025, SR008
CR002 SambaNova's SN50 chip is claimed to deliver "5× better performance per watt" and "10× lower total cost of ownership" versus a comparable Nvidia B200 configuration, but these benchmarks have not been independently verified as of May 2026. Medium SR005, SR015
CR003 CUDA had over 5 million registered developers as of 2025 and represents more than a decade of software ecosystem investment, making migration to alternative AI chip stacks costly and uncertain for enterprise buyers. Medium SR024, SR025
CR004 Groq secured a $1.5 billion sovereign AI partnership in 2024–25, enabling large-scale LPU inference capacity deployment and substantially increasing its competitive position against SambaNova in the high-throughput inference segment. Medium SR026, SR008
CR005 Forbes / Moor Insights analysts described the AI chip startup competitive dynamic as likely producing only one large winner among Groq, Cerebras, and SambaNova by 2026–2027, framing the market as a winner-take-most race that heightens existential stakes for SambaNova. Medium SR008
CR006 Cerebras Systems raised $1.1 billion at an $8.1 billion valuation as of late 2025 and filed for IPO, publicly disclosing its wafer-scale AI chip architecture and underscoring a funding and technology trajectory that positions it as a direct SambaNova inference competitor with a substantially higher valuation. Medium SR020, SR008
CR007 DeployBase comparative benchmarks for 2025 showed SambaNova's RDU platform achieving competitive throughput on LLM inference for models in the 70B–405B parameter range, though Groq held an advantage in sub-70B latency-sensitive workloads. Low SR009
CR008 SambaNova's exclusive reliance on TSMC as a chip foundry creates a single point of supply failure: any Taiwan Strait geopolitical disruption or TSMC capacity rationing event would halt SambaNova's hardware production with no qualified alternative manufacturer in place. High SR012, SR005
CR009 HBR's 2025 analysis of AI chip supply chains identified TSMC's advanced node concentration (N3/N4/N5) as the most acute systemic risk in the global AI infrastructure stack, noting that Nvidia, AMD, Apple, and AI chip startups including SambaNova all depend on the same foundry for cutting-edge silicon. Medium SR012, SR019
CR010 Semiconductor Digest's 2026 geopolitical risk analysis estimated that a Taiwan Strait military conflict would halt advanced node semiconductor production for 6–24 months, with no viable near-term substitute for TSMC's EUV-based N3/N4 process at scale. Medium SR019, SR012
CR011 SambaNova's CEO Rodrigo Liang stated publicly that the SN50 chip uses HBM2E high-bandwidth memory to reduce supply concentration risk versus HBM3, but HBM2E remains sourced from the same three DRAM suppliers (SK Hynix, Samsung, Micron), leaving concentration risk intact. Medium SR005, SR009
CR012 Intel CEO Lip-Bu Tan holds a board seat at SambaNova Systems while simultaneously leading Intel Corporation, which operates Intel Foundry Services—a manufacturing partner SambaNova has evaluated—and was a reported acquisition candidate for SambaNova in 2025. Medium SR006, SR007
CR013 Ticker Report and TechZine reported that Lip-Bu Tan's dual role creates potential conflicts in at least two dimensions: Intel Foundry's competitive positioning versus TSMC for SambaNova's manufacturing, and Intel as an erstwhile acquirer whose engineers reviewed SambaNova's proprietary RDU architecture during failed due diligence. Medium SR006, SR007
CR014 Axios reported in August 2025 that Intel-SambaNova acquisition talks broke down with neither party disclosing final valuation or terms, leaving open whether Intel engineers who reviewed SambaNova IP during due diligence are now building competing architectures within Intel. Medium SR016, SR006
CR015 SambaNova's February 2026 Series E was valued at approximately $2.4 billion—a 53% nominal markdown from the $5.1 billion peak Series D valuation in 2021, confirming a material valuation reset driven by both market-wide AI chip re-rating and company-specific concerns. Medium SR005, SR014, SR018
CR016 SVB's 2025 AI hardware startup burn report estimated median burn multiples of 2.0–3.0× for hardware-stage AI companies, meaning for every dollar of revenue, hardware AI startups burn two to three dollars in cash—a structural capital efficiency deficit that exacerbates runway risk for SambaNova given its undisclosed revenue base. Medium SR010, SR027
CR017 SambaNova has raised approximately $1.49 billion in equity capital across six rounds with no publicly disclosed revenue, gross margin, or EBITDA figures, preventing external calculation of capital efficiency or burn runway. Medium SR018, SR028
CR018 The Wall Street Journal reported in 2025 that AI chip startups across the board faced a valuation reset, with institutional investors reducing portfolio marks by 30–60% from 2021 peaks, particularly for pre-revenue or early-revenue hardware companies. Medium SR017, SR018
CR019 Sacra's 2025 research on SambaNova estimated annual revenue at $50–80 million based on known government contracts and cloud inference pricing, implying a revenue multiple of 30–48× at the $2.4 billion Series E valuation—elevated relative to comparable hardware startups. Low SR029, SR018
CR020 Vista Equity Partners led the Series E; prior institutional investors including BlackRock reportedly marked down their SambaNova positions before the 2026 close, consistent with the $2.4B valuation re-rating. Low SR014, SR017
CR021 SambaNova's disclosed government customers include Argonne National Laboratory, Lawrence Livermore, Lawrence Berkeley, Oak Ridge, and Los Alamos National Laboratories—all part of the DOE/NNSA system—making DOE budget allocations the dominant revenue concentration risk. High SR022, SR023, SR011
CR022 Potomac Officers Club reported in 2025 that NNSA and LANL were actively procuring additional AI inference capacity for nuclear stockpile stewardship modeling, with SambaNova referenced as a priority vendor for inference-heavy scientific simulation workloads. Medium SR011, SR023
CR023 SambaNova's dependence on a handful of DOE national laboratories means that a sequestration event, continuing resolution budget freeze, or DOE AI infrastructure priority shift would disproportionately impact revenue without equivalent commercial enterprise substitution. Medium SR023, SR022
CR024 SoftBank's January 2025 commitment to SambaNova SN50 systems represents the most significant disclosed non-U.S.-government customer relationship in SambaNova's history, but financial terms and contract scope were not disclosed. Medium SR021, SR014
CR025 DCSA's FOCI review process requires companies holding classified government contracts to undergo national security assessments when foreign investors own a qualifying equity percentage; SambaNova's Series E cap table includes non-U.S. investors whose participation could trigger or extend a FOCI review under current DCSA guidelines. Medium SR003, SR001
CR026 Wilson Sonsini's national security and technology practice was specifically engaged for the SambaNova Series E closing, signaling that counsel identified FOCI-related risks requiring specialist review as a condition of the transaction. Medium SR003
CR027 The BIS May 2026 final rule introduces new license requirements for exports of AI accelerators exceeding defined Theoretical Peak Performance (TPP) thresholds; Finnegan and Mayer Brown legal analyses confirm that chips meeting SN50-class TPP levels would require license review for exports to certain non-allied country groups. Medium SR001, SR002
CR028 SambaNova filed a WARN Act notice under California Labor Code §1400 for a workforce reduction of approximately 150 employees in 2024–2025, confirming a layoff event that reached the statutory 50-employee threshold requiring 60 days' advance notice to affected workers. High SR004, SR013
CR029 Data Center Dynamics reported that the SambaNova workforce reduction targeted engineering, sales, and operations roles, consistent with a company rationalizing costs ahead of a challenging fundraising environment while preserving core chip design and R&D headcount. Medium SR013, SR014
CR030 SambaNova's three co-founders—CEO Rodrigo Liang, CTO Kunle Olukotun, and Chief Scientist Christopher Ré—represent concentrated key-person risk; Ré's academic standing in hardware-software co-design makes his departure a material signal risk for government and research customers. Medium SR013, SR005
CR031 SambaNova has no publicly disclosed secondary foundry relationship; Intel Foundry Services and GlobalFoundries lack demonstrated advanced AI chip volume production capability at TSMC N3/N4 parity, leaving SambaNova without a near-term alternative manufacturing path if TSMC production is disrupted. Medium SR012, SR019
CR032 Mayer Brown and Finnegan both confirm that BIS's revised AI chip export rules apply to advanced accelerator chips with TPP ≥ 100 TOPS (INT8) or comparable FLOPS thresholds, and that companies must obtain BIS licenses for restricted exports—a compliance burden SambaNova must manage for international SN50 sales. High SR001, SR002
CR033 SambaNova's SN50 performance claims rely on proprietary benchmarking; independent verification by third parties comparable to MLPerf has not been disclosed as of May 2026, making TCO and performance advantage claims difficult for prospective customers to validate. Medium SR005, SR009
CR034 The SN50 chip tape-out and volume ramp timeline has not been disclosed; given typical TSMC advanced-node tape-out-to-volume schedules of 12–18 months, a delay at any stage would push delivery timelines into 2027, compressing the competitive window against Nvidia's B200/B300 successor roadmap. Medium SR005, SR019
CR035 Wilson Sonsini's national security practice advises on FOCI mitigation strategies including Security Control Agreements (SCAs), Special Security Agreements (SSAs), and proxy arrangements; their involvement in the Series E implies SambaNova's cap table requires one or more such instruments to preserve classified contract eligibility. Medium SR003, SR001
CR036 If SambaNova requires a further equity raise without improved unit economics, a down-round below the $2.4 billion Series E valuation would trigger anti-dilution provisions in prior preferred-stock rounds, potentially eliminating common equity and option value for employees hired during the 2019–2022 peak valuation era. Medium SR017, SR018
CR037 SEC Form D filings corroborate SambaNova's $350M raise size in the Series E and the disclosed Vista Equity Partners lead, confirming the post-money valuation at approximately $2.4 billion. Medium SR028, SR014
CR038 Tapeout costs for advanced AI chips at TSMC's N3/N4 nodes are estimated at $100M–$500M per design pass; SambaNova's total capital raised of $1.49 billion across six rounds implies highly constrained capital reserves relative to multiple chip generations, limiting the number of NRE-intensive tape-outs fundable without fresh equity. Medium SR010, SR012
CR039 SambaNova's Series E investor list includes entities from multiple non-U.S. jurisdictions; Wilson Sonsini's engagement of FOCI-specialist counsel suggests at least one investor was reviewed under DCSA FOCI guidelines, which typically applies when a foreign person owns ≥5% of voting equity in a company holding classified contracts. Medium SR003, SR028
CR040 Argonne National Laboratory and LLNL have each publicly disclosed multi-year AI inference agreements with SambaNova; LLNL's press release specifically references SambaNova Cloud as the inference backend for nuclear stockpile stewardship modeling workloads. Medium SR022, SR030
CR041 LLNL's and Argonne's public announcements confirm that DOE national laboratory customers collectively represent SambaNova's primary disclosed revenue base, with no equivalent commercial enterprise contract disclosed at comparable scale. High SR030, SR022
CR042 The DOE FY2025–2026 budget included targeted AI infrastructure investments for national laboratories; however, a sequestration or continuing resolution scenario could delay or cancel multi-year SambaNova procurement cycles dependent on annual appropriations. Medium SR023, SR011
CR043 SoftBank's January 2025 commitment to SambaNova SN50 systems represents the company's most visible commercial enterprise win, providing a narrative of diversification beyond U.S. government customers, though the financial terms and contract size were not disclosed. Medium SR021, SR014
CR044 The Intel-SambaNova acquisition talks that began in 2024 and ended without agreement in mid-2025 created a period of strategic uncertainty; the breakdown was followed by SambaNova's workforce reduction and Series E raise, suggesting the failed acquisition compressed the company's operational options. Medium SR016, SR013
CR045 SEC Form D filings confirm SambaNova has conducted six equity raises with aggregate disclosed offering amounts consistent with reported $1.49 billion total; no debt instruments or convertible notes have been publicly disclosed as of May 2026. High SR028, SR018
CR046 No active litigation, SEC enforcement actions, or publicly disclosed regulatory proceedings against SambaNova Systems were confirmed in public records as of May 2026; the company's legal exposure is principally regulatory (FOCI, BIS export controls) rather than adversarial litigation. Low SR029, SR018
CR047 As of late 2025, Cerebras had raised $1.1 billion at an $8.1 billion valuation while Groq secured a $1.5 billion sovereign AI partnership—collectively raising capital substantially in excess of SambaNova's $1.49 billion total at a $2.4 billion valuation—suggesting SambaNova is the most capital-constrained of the three primary AI inference chip challengers. Medium SR008, SR020
CV001 SambaNova raised more than $350M in Series E financing on February 24, 2026, led by Vista Equity Partners and Cambium Capital with strong participation from Intel Capital. High SV001, SV018, SV002
CV002 The Series E-1 tranche was priced at $30.99 per share, implying a post-money valuation of approximately $2.34B based on Yahoo Finance/Forge primary round data. Medium SV010
CV003 Yahoo Finance/Forge secondary market estimate as of May 26, 2026 placed SambaNova's implied valuation at approximately $1.44B, with shares trading at $19.05 per share. Medium SV010
CV004 SambaNova's 2021 Series D was priced at $95.02 per share with $677.9M raised, implying a post-money valuation of approximately $5.11B per SEC Form D and Yahoo Finance data. High SV025, SV010
CV005 Third-party sources including Tracxn cite SambaNova's Series E post-money valuation at approximately $4.8B — a figure that conflicts with primary share-price data suggesting $2.24B–$2.34B. Low SV028, SV010
CV006 BlackRock marked down its SambaNova position by approximately 17% per Caplight analysis, implying an effective company value of approximately $2.4B before the Series E close. Medium SV020, SV021
CV007 Intel was in advanced acquisition talks to buy SambaNova for approximately $1.6B including debt in late 2025; those talks stalled and were replaced by a strategic investment and partnership. High SV007, SV019, SV021
CV008 The Series E was structured with at least two share classes: E-1 at $30.99 per share and E-2 at $21.70 per share, reflecting a tiered capital structure with different economic rights. Medium SV010
CV009 CEO Rodrigo Liang described the Series E as grossly oversubscribed, indicating strong investor demand at the discounted valuation vs. the 2021 peak. High SV001, SV018
CV010 SambaNova's ARR was estimated at approximately $100M as of mid-2025, reflecting approximately 4x ARR growth during calendar year 2024. Medium SV027, SV028
CV011 SambaNova's ARR was estimated at over $180M by February 2026 at the time of the Series E announcement, based on analyst estimates and company bookings commentary. Low SV027, SV028, SV018
CV012 At the Series E-1 implied valuation of approximately $2.34B and estimated ARR of ~$100M in mid-2025, the EV/ARR multiple is approximately 23x. Medium SV010, SV027
CV013 At the Series E-1 implied valuation of approximately $2.34B and estimated ARR of ~$180M in early 2026, the EV/ARR multiple falls to approximately 13x. Low SV010, SV028
CV014 If the $4.8B post-money figure cited in some sources were accurate, the EV/ARR multiple would be approximately 27–48x depending on which ARR base is used. Low SV028, SV027
CV015 Groq raised $750M in September 2025 at a post-money valuation of $6.9B, more than doubling from its $2.8B August 2024 valuation in approximately one year. High SV003, SV006
CV016 Cerebras Systems filed to go public on Nasdaq targeting a valuation of approximately $26.6B at an IPO price range of $115–$125 per share, aiming to raise approximately $3.5B. High SV004, SV005, SV014
CV017 Cerebras reported $510M in revenue and $87.9M in GAAP net income for full year 2025, with $24.6B in remaining performance obligations as of early 2026. High SV005, SV011, SV014
CV018 Cerebras raised $1B in a Series H in February 2026 at a $23B valuation, approximately 45x the company's 2025 revenue of $510M. High SV004, SV005
CV019 Nvidia's fiscal 2026 revenue reached $215.9B with a market capitalization of approximately $4.6T, implying a price-to-sales ratio of approximately 21x. Medium SV015
CV020 AMD's full-year 2025 revenue was $34.6B with a market capitalization of approximately $397B, implying a price-to-sales ratio of approximately 11.5x. Medium SV015
CV021 Finro's Q1 2026 AI valuation multiples dataset across 575 companies showed a median EV/Revenue multiple of approximately 21.2x for AI infrastructure, with an average of 31.3x and a 25th–75th percentile range of 10.2x–39.6x. Medium SV012
CV022 AI chip and inference infrastructure private rounds from 2025–2026 have commanded EV/Revenue multiples in the 10–40x range depending on growth rate, defensibility, and customer quality, per Aventis Advisors and Finro analysis. Medium SV009, SV012, SV013
CV023 SambaNova's Series E-1 implied EV/ARR of approximately 23x (using mid-2025 ARR of $100M) sits just above the AI infrastructure sector median of 21.2x, but the secondary market discount implies significantly lower execution-adjusted pricing. Medium SV010, SV012
CV024 SambaNova's total capital raised of approximately $1.49B compares to Cerebras's total of over $1.6B, yet Cerebras targets a 2026 IPO valuation of ~$23–26.6B — approximately 10–11x higher than SambaNova's Series E-1 implied valuation. Medium SV004, SV005, SV010, SV011
CV025 The Series E-1 implied valuation of approximately $2.34B represents a 54% decline from SambaNova's 2021 Series D peak of $5.11B, satisfying the standard definition of a down round. High SV010, SV025
CV026 SambaNova explored a potential sale to strategic and financial buyers in October–December 2025 after struggling to raise new funding on favorable terms, hiring an investment firm to manage the process. Medium SV020, SV022, SV008
CV027 A Lincoln Variable Insurance Products Trust reported holding SambaNova shares valued at $44.47 per share as of June 30, 2025 — 43% above the subsequent Series E-1 issue price of $30.99. Medium SV010
CV028 Forge secondary market data as of May 26, 2026 shows SambaNova shares at approximately $19.05, 39% below the Series E-1 issue price and approximately 80% below the 2021 Series D price of $95.02. Medium SV010
CV029 Forge's 3-month return tracker showed SambaNova shares at +38.53% versus the Forge Private Market Index at +9.62% for the period ending May 26, 2026, indicating improving secondary sentiment since the Series E close. Medium SV010
CV030 Intel Capital committed approximately $100–$150M in the Series E, indicating strategic motivation tied to the Intel-SambaNova partnership rather than purely financial return-seeking. Medium SV017, SV016
CV031 Wilson Sonsini Goodrich & Rosati served as legal counsel to SambaNova in the Series E transaction, advising on the oversubscribed round structure. Medium SV002
CV032 Vista Equity Partners, the Series E lead investor, manages more than $100B in assets and historically concentrates on enterprise software; the SambaNova investment represents a rare hardware sector allocation for the firm. Medium SV016, SV017
CV033 SambaNova's SN50 chip is claimed by the company to deliver 5x the compute performance and 4x the networking bandwidth of the previous SN40 generation, with 3x lower total cost of ownership vs. comparable GPU solutions. Medium SV001, SV018
CV034 SoftBank Corp. was announced as the first customer for the SambaNova SN50 chip at the time of the February 2026 Series E close. Medium SV018, SV001
CV035 SambaNova's capital efficiency ratio (implied valuation / total raised) at the Series E is approximately 1.6x versus approximately 9.2x for Groq ($6.9B / $750M) and approximately 14x for Cerebras ($23B / ~$1.6B), indicating SambaNova has consumed substantially more capital per dollar of current valuation. Medium SV003, SV010, SV004, SV005
CV036 Cerebras achieved $510M in 2025 revenue with GAAP profitability of $87.9M net income, while SambaNova's ARR is estimated at $100–$180M with no disclosed profitability, highlighting Cerebras's superior commercial scale at this stage. Medium SV005, SV011, SV027
CV037 Aventis Advisors analysis found that AI company valuations peaked in 2021–22 and have normalized downward since, with SambaNova's down-round consistent with the broader AI sector trend rather than being purely a company-specific failure. Medium SV009
CV038 Finro Q1 2025 analysis of 400+ AI companies found that AI startup private round median multiples ranged from approximately 25–30x EV/Revenue, with infrastructure commanding premium multiples relative to applied AI verticals. Medium SV013
CV039 SambaNova's preference overhang from the 2021 Series D ($676M raised at $95.02/share at a $5.11B post-money) creates significant liquidation preference risk: all exits below $5.11B result in partial or no recovery for Series D holders, and common equity is deeply underwater. High SV025, SV010
CV040 The dual-tranche Series E structure — E-1 at $30.99 and E-2 at $21.70 per share — implies a weighted average blended issue price below the E-1 headline, and may include anti-dilution provisions or ratchets that further compress effective returns for pre-Series E investors. Low SV010
CV041 Forge's secondary implied valuation of $1.44B as of May 26, 2026 represents approximately 8x estimated $180M early-2026 ARR, which is below the AI infrastructure sector median of 21.2x, indicating secondary market skepticism about execution trajectory relative to the primary-round price. Medium SV010, SV012
Sources
IDPublisherTitleQuote
SO001 SambaNova Systems About Us | SambaNova "Foundation models represent a paradigm shift in AI and deep learning – truly transforming the value organizations can derive from AI." — Kunle Olukotun, Co-founder & Chief Technologist
SO002 SambaNova Systems SambaNova | The Fastest AI Inference Platform
SO003 SambaNova Systems RDU | Next-Gen AI Chip for Inference at Scale "The SN50 RDU (Reconfigurable Dataflow Unit) is SambaNova's fifth-generation AI inference processor, designed specifically for large-scale, agentic workloads."
SO004 SambaNova Systems Press Releases
SO005 SambaNova Systems Resources | Blog
SO006 SambaNova Systems Careers | SambaNova
SO007 SambaNova Systems SambaStack | Full-Stack Enterprise AI Platform "SambaStack™ offers the industry's leading hardware and software stack, purpose-built for AI inference. With the flexibility to deploy on-premises or in the cloud."
SO008 EE Times SambaNova Abandons Intel Acquisition, Raises Funding Instead "SambaNova's $350 million Series E was 'grossly, grossly oversubscribed,' Liang said."
SO009 CNBC Intel partners with AI chip startup SambaNova after acquisition talks reportedly failed "Intel is participating in a $350 million investment in artificial intelligence chip startup SambaNova and is also partnering with the startup. Intel CEO Lip-Bu Tan has been SambaNova's chairman since 2017 and was an early financial backer."
SO010 Data Center Dynamics SambaNova exploring sale after struggling to secure further funding – report "SambaNova is exploring a sale, having struggled to complete a fundraising round... BlackRock has cut the value of its SambaNova shares by 17 percent, valuing the company at $2.4bn."
SO011 Bloomberg AI Startup SambaNova Seeks Up to $500 Million Funding After Intel Talks Stall "SambaNova Systems Inc. is considering raising up to $500 million after talks to sell to Intel Corp. stalled."
SO012 EE Times Intel Eyeing AI Catchup in Inference with SambaNova Acquisition "The Palo Alto-based startup laid off 77 employees—roughly 15% of its 500-strong workforce—while shifting its focus from training to inference design."
SO013 EE Times SambaNova Lays Off 15% of Workforce To Refocus on Inference "SambaNova laid off 77 people from its staff of around 500 this week, representing around 15% of its workforce... this round of layoffs comes at a time when the company is refocusing away from training workloads and towards being an AI cloud services provider."
SO014 EE Times SambaNova Adds HBM for LLM Inference Chip "SambaNova is bringing out new silicon specifically for large language model (LLM) fine-tuning and inference at scale... SambaNova said it can serve 5-trillion–parameter models with 256k+ sequence length from a single, eight-socket system."
SO015 Sacra SambaNova Systems valuation, funding & news "SambaNova Systems was valued at $5.1 billion following its $676 million Series D round led by SoftBank Vision Fund in April 2021. Founded in 2017, SambaNova has raised approximately $1.49 billion to date."
SO016 Forbes SambaNova | Company Overview & News "This year, SambaNova raised $350 million in Series E financing and launched its fifth-generation AI chip, the SN50. In addition, the company announced a multi-year partnership with Intel and Softbank, which will be the first to deploy SambaNova's new SN50 AI chip in data centers in Japan."
SO017 Yahoo Finance / PitchBook News SambaNova raises $350M as more upstarts take on Nvidia's dominance "Vista Equity Partners and Cambium Capital led the round, with Intel Capital, Qatar's sovereign wealth fund QIA, and GV also participating. Founded in 2017, SambaNova has focused on building AI-specific chips from the outset."
SO018 AI Tech Trend Vista and Intel Lead $350M Investment in SambaNova
SO019 Tech Company News SambaNova Raises $350 Million In Series E Financing
SO020 Hoodline Intel Circles Palo Alto's SambaNova in High-Stakes AI Chip Grab "SambaNova was founded in 2017 in Palo Alto by Stanford professors Kunle Olukotun and Christopher Ré and former Oracle executive Rodrigo Liang. The company had raised about $1.14 billion as of early 2025, according to PitchBook data cited by WIRED."
SO021 CNBC Google's parent company just made its first-ever investment in an A.I. chip start-up "The venture capital arm of Google-parent company Alphabet is leading a $56 million funding round in SambaNova Systems... SambaNova's CEO, Rodrigo Liang, ran a team of nearly 1,000 chip designers at Oracle before co-founding the start-up."
SO022 TechZine Intel nears SambaNova deal, where CEO Lip-Bu Tan is already chairman "It is noteworthy that Intel CEO Lip-Bu Tan currently serves as executive chairman at SambaNova Systems. In 2020, it raised $250 million from asset manager BlackRock, Intel Capital, and venture capital fund GV, among others, bringing its valuation to $2.5 billion."
SO023 Tracxn SambaNova Systems
SO024 EE Times SambaNova Shifts To Inference, Courts Cloud Customers
SO025 EE Times SambaNova Raises Eye-Popping Series D Funding "SambaNova has announced an enormous Series D funding round of $676 million, pushing the company's valuation above $5 billion... The Series D was led by SoftBank Vision Fund 2 with additional new investors Temasek and GIC."
SO026 Business Wire (SambaNova Systems) SambaNova Unveils Fastest Chip for Agentic AI, Collaborates with Intel, and Raises $350M+ "SambaNova today introduced their SN50 AI chip, which boasts a max speed that's 5X faster than competitive chips. The company also announced a planned collaboration with Intel to deliver high-performance, cost-efficient AI inference solutions, and more than $350M in investment from new and existing investors... The news follows SambaNova's record bookings and revenue as they closed out 2025."
SO027 SambaNova Systems AI Solutions for Government & Public Sector | SambaNova
SO028 TechStartups AI chip startup SambaNova, once valued at $4 billion, explores sale after failing to raise new funding "SambaNova Systems, once one of Silicon Valley's most promising AI hardware startups, is reportedly exploring a sale after struggling to secure new funding... The company had considered an IPO but shifted gears as market conditions worsened."
SM001 Deloitte Insights 2026 Global Semiconductor Industry Outlook A (spring 2026) Deloitte study on the AI chip market initially estimated that AI chips in 2026 would be about US$300B. Given the December 2025 upward revision of US$175B in the global chip market by the World Semiconductor Trade Statistics (all of which was driven by AI demand, with weakness in non-AI markets), Deloitte now estimates that the AI chip market in 2026 will be about US$500B.
SM002 IDC Semiconductor Market to Surge Past the Trillion-Dollar Threshold: AI Infrastructure Drives Market Growth IDC forecasts the global semiconductor market will reach $1.29T in 2026—a 52.8% surge led by AI infrastructure, memory, and hyperscaler CapEx. The $281 billion 'intelligent' datacenter segment, encompassing CPUs, AI accelerators, GPUs, custom ASICs, and networking silicon, now constitutes the largest identifiable category within non-memory semiconductors.
SM003 Silicon Analysts NVIDIA AI GPU Market Share 2026: ~80% of AI Accelerators NVIDIA's percentage share peaked at 87% in 2024 and is projected to decline to 75% by 2026. The total market is projected to exceed $200B by 2026. NVIDIA's floor is likely 65-70% share even in the most competitive scenario.
SM004 AllAboutAI AI Chip Market Statistics: The $118B Boom Reshaping Semiconductors In 2024, the global AI chip market reached $118 billion, and it's projected to surge to $293 billion by 2030, representing a remarkable CAGR of 33.2%. The top three AI chip vendors control 95–96% of global market revenue, making AI accelerators one of the most concentrated markets in modern technology.
SM005 NTT DATA Enterprise AI Hits the Wall: NTT DATA Research Reveals Growing Privacy and Sovereignty Barriers More than 95% of respondents say private and sovereign AI are important, but only 29% are prioritizing sovereign AI in a concrete, near-term way. About 35% of CAIOs identify building, integrating and managing complex AI models in private or sovereign environments as their top barrier to adoption.
SM006 Help Net Security AI infrastructure is cracking under sovereignty demands About 95% of organizations consider private or sovereign AI important to their AI strategy, and 96% are considering relocating AI infrastructure to specific regions because of geopolitical pressures and supply chain concerns.
SM007 Forbes (Forrester) 2026 Public Sector And Government Predictions We expect that half of the G20 will mandate domestically tuned AI models for public-sector services. Defense industry players will win a third of the biggest civilian software deals.
SM008 Flexential 2026 State of AI Infrastructure Report 89% say reliable grid power influences AI deployment decisions, while 55% rank power cost differences as the top factor influencing AI workload location. The share expecting measurable AI financial returns within one year dropped from 51% to 36%.
SM009 SambaNova Systems SambaNova Unveils Fastest Chip for Agentic AI, Collaborates with Intel, and Raises $350M+ The SN50 delivers five times more compute per accelerator and four times more network bandwidth than the previous generation. Positioned as the most efficient chip for agentic AI, the SN50 chip offers enterprises a 3X lower total cost of ownership.
SM010 Futurum Research Intel Bets on Agentic AI Economics with SambaNova Partnership NVIDIA's inference software ecosystem, hyperscaler platform integration, and workload optimization for reasoning models create switching costs and inertia that specialized inference chips must overcome through demonstrable economic advantages or performance gaps that justify infrastructure reconfiguration.
SM011 Intel Newsroom Intel, SambaNova Planning Multi-Year Collaboration for Xeon-Based AI Inference Together, Intel and SambaNova aim to help shape the next generation of heterogeneous AI data centers, integrating Intel Xeon processors, Intel GPUs, Intel networking and storage, and SambaNova systems—to unlock a multi-billion-dollar inference market opportunity.
SM012 Azumo 70 Enterprise AI Statistics for 2026: Adoption, ROI & Trends 87% of large enterprises are now implementing AI solutions. Only 9% have achieved full AI maturity. 62% of organizations have not moved AI projects beyond the pilot stage.
SM013 CRN Intel Inks 'Multiyear' AI Inference Deal With SambaNova After Acquisition Talks End Intel plans to tap into its 'enterprise, cloud and partner channels' for a new 'multiyear strategic collaboration' it has entered with AI chip startup SambaNova Systems after acquisition talks between the two companies recently ended.
SM014 Polyglotsoft The Sovereign AI Era: Data Sovereignty and Enterprise AI Infrastructure Independence Strategy On-premises GPU clusters: Building your own data center with NVIDIA H100/B200 GPUs. Initial investment ranges from $700K to $7M, but provides complete data control.
SM015 CloudLatitude The 2026 Cloud Landscape: AI Infrastructure, Sovereignty, and the New Race for Efficiency According to S&P Global, total hyperscaler capital spending will climb nearly 40% this year, far outpacing historic norms.
SM016 Fortune Business Insights AI Inference Market Size, Share | Global Growth Report [2034]
SM017 MarketsandMarkets AI Inference Market Size, Share & Growth, 2025 To 2030
SM018 Coherent Market Insights AI Chips Market Size, Share and Forecast, 2026-2033
SM019 Visual Capitalist Ranked: The Companies That Sell the Most AI Chips
SM020 Deloitte Insights AI Infrastructure Compute Strategy (TMT Tech Trends 2026)
SM021 Futurum Research AI Capex 2026: The $690B Infrastructure Sprint
SM022 IDC Semiconductor & Semiconductor Applications Forecast, April 2026
SM023 Flexential 2026 State of AI Infrastructure Report — Power & ROI Findings
SM024 NTT DATA (Help Net Security coverage) AI infrastructure is cracking under sovereignty demands — NTT DATA 2026 Global AI Report
SM025 CRN Intel SambaNova Collaboration — Channel and Enterprise Sales Details
SP001 SambaNova Systems AI Solutions for Government & Public Sector | SambaNova Empower government and public agencies with secure, sovereign AI solutions. SambaNova delivers full-stack infrastructure for mission-critical AI workloads.
SP002 BusinessWire (SambaNova press release) SambaNova and Intel Announce Blueprint for Heterogeneous Inference: GPUs for Prefill, SambaNova RDUs for Decode, and Intel® Xeon® 6 CPUs for Agentic Tools Agentic AI is moving into production — and the winning pattern we're seeing is GPUs to start the job, Intel Xeon 6 to run it, and SambaNova RDUs to finish it fast.
SP003 SambaNova Systems RDU | Next-Gen AI Chip for Inference at Scale The SN50 RDU is SambaNova's fifth-generation AI inference processor, designed specifically for large-scale, agentic workloads.
SP004 Argonne Leadership Computing Facility (DOE) Argonne National Laboratory deploys a new SambaNova inference-optimized cluster to support AI-driven science The Argonne deployment contains sixteen of SambaNova's Reconfigurable DataFlow Units (RDU).
SP005 Cerebras Systems CS-3 System — Product The Cerebras CS-3 delivers revolutionary AI performance, replacing hundreds of GPUs with a single wafer-scale chip.
SP006 Groq Groq On-demand Pricing for Tokens-as-a-Service Llama 3.3 70B Versatile 128k — 394 TPS — $0.79 per million output tokens
SP007 NVIDIA NVIDIA DGX Platform 9 U.S. Government Institutions [use NVIDIA DGX platform]
SP008 Amazon Web Services AWS Trainium Trainium2-based Amazon EC2 Trn2 instances and Trn2 UltraServers are purpose-built for generative AI and offer 30-40% better price performance than GPU-based EC2 P5e and P5en instances.
SP009 Google Cloud Tensor Processing Units (TPUs) Ironwood: 7th-generation energy-efficient TPU engineered for large-scale training, reasoning, and inference. Features 9,216 liquid-cooled chips per pod, provides 42.5 ExaFlops.
SP010 AMD AMD Instinct™ Accelerators The AMD Instinct™ MI350 Series GPUs set a new standard for Generative AI and high performance computing in data centers.
SP011 Intel Intel® Gaudi® AI Accelerator Products Intel® Gaudi® AI Accelerator Products
SP012 TechStartups AI chip startup SambaNova, once valued at $4 billion, explores sale after failing to raise new funding SambaNova Systems is exploring a sale, after it struggled to complete a fundraising round. The Palo Alto, Calif., startup was last valued at $5 billion in 2021.
SP013 ByteIota Cerebras IPO 2026: $26.6B Valuation Nvidia Challenger Cerebras Systems filed an amended S-1 with the SEC on May 4, 2026, targeting a $3.5 billion IPO at a $26.6 billion valuation.
SP014 Together AI Pricing | Together AI Most teams start with serverless inference and move to dedicated endpoints at scale.
SP015 James M (independent analyst blog) Cerebras, Groq, SambaNova: The Inference Hardware Insurgents The single most important industry event of the cycle was NVIDIA's reported $20 billion licensing deal with Groq's IP in late 2025, validating that purpose-built inference silicon is strategically essential even for incumbents.
SP016 CostBench Fastest LLM Inference 2026: Groq, Cerebras, SambaNova Ranked by Speed The fastest LLM inference in 2026 is Cerebras at 2,000+ tokens per second on Llama 3.3 70B — 18x faster than GPT-4o. Groq runs at 600-840 tok/s. SambaNova hits 400-580 tok/s with particular strength on reasoning models like DeepSeek R1.
SP017 IntuitionLabs Cerebras vs SambaNova vs Groq: AI Chip Comparison (2025) Cerebras raised $1.1 billion at an $8.1 billion valuation, Groq raised $750 million pushing its valuation to $6.9 billion, and SambaNova raised hundreds of millions (e.g. $676M Series D in 2021 with a $5.1B valuation).
SP018 AI Magazine How SambaNova and Intel are Scaling Inference for Agentic AI This architecture assigns specific roles to different compute types. GPUs handle the prefill phase, SambaNova RDUs take on high-speed decoding and Intel Xeon 6 CPUs orchestrate tasks while executing agent-driven workloads.
SP019 Chip.computer NVIDIA H100 vs. AMD MI300 vs. Intel Gaudi: AI Chip Showdown 2026 NVIDIA H100 vs. AMD MI300 vs. Intel Gaudi: AI Chip Showdown 2026
SP020 HashRate Index Three Independent AI Chip Companies Taking On NVIDIA Groq's $20 billion acquisition in December 2025 is the defining event of the independent AI chip era, validating that novel inference architectures can command large valuations.
SP021 CloudRift Blackwell Dominates. Benchmarking LLM Inference on NVIDIA B200, H200, H100, and RTX PRO 6000 NVIDIA Blackwell has landed in datacenters with the B200, promising major improvements in both performance and efficiency over the previous Hopper generation.
SP022 Spheron Network Best GPU for AI Inference in 2026: Benchmarks, Pricing, and Decision Guide Inference now accounts for roughly two-thirds of all AI compute in 2026, having overtaken training as the dominant workload.
SP023 TradingKey Cerebras Systems IPO 2026: Date, Price, Valuation, and Whether CBRS Is Worth Buying Cerebras generated $510 million in revenue in 2025 (up 76% year-over-year) and posted net income of $238 million, a 47% net margin.
SP024 CloudExpat Cloud AI Platforms Comparison: AWS Trainium vs Google TPU v5e vs NVIDIA AWS Trainium historically came in at around $1.34 per chip-hour, with inference cost expected to drop further with Trainium2. Google TPU v5e at roughly $1.20/hr per chip.
SP025 Cerebras Systems Product - Chip - Cerebras (WSE-3 Wafer Scale Engine) Four trillion transistors. 125 petaflops. One silicon wafer. The world's largest and most powerful processor for AI training and inference.
SI001 TechCrunch SambaNova raises $676M at a $5.1B valuation to double down on cloud-based AI software for enterprises "SambaNova … is announcing a huge round of funding today … The company has closed on $676 million in financing, a Series D that co-founder and CEO Rodrigo Liang has confirmed values the company at $5.1 billion."
SI002 BusinessWire SambaNova Unveils Fastest Chip for Agentic AI, Collaborates with Intel and Raises $350M "The news follows SambaNova's record bookings and revenue as they closed out 2025, reflecting accelerating demand for production-ready AI systems across financial services, telecommunications, energy, and sovereign deployments worldwide."
SI003 EE Times SambaNova Abandons Intel Acquisition, Raises Funding Instead "[SambaNova] ended up having a record year last year, which gave us a lot of confidence that the path we are on, selling infrastructure for service providers with the right economics, the right efficiency, and the right performance, allowed us to build a lot of momentum behind this [business] model."
SI004 Yahoo Finance / PitchBook SambaNova raises $350M as more upstarts take on Nvidia's dominance "Vista Equity Partners and Cambium Capital led the round, with Intel Capital, Qatar's sovereign wealth fund QIA, and GV also participating. … SambaNova did not disclose a valuation in its latest financing."
SI005 Data Center Dynamics SambaNova exploring sale after struggling to secure further funding — report "BlackRock has cut the value of its SambaNova shares by 17 percent, valuing the company at $2.4bn."
SI006 Sacra SambaNova Systems valuation, funding & news
SI007 Costbench SambaNova Cloud Pricing 2026: Plans & Hidden Costs "SambaNova Cloud offers 3 pricing tiers: Free tier, Developer (Pay-as-you-go), Enterprise. … Paid plans range from $0 to $4.50/per million tokens."
SI008 Intel Capital SambaNova Unveils Fastest Chip for Agentic AI, Collaborates with Intel and Raises $350M "SoftBank Corp. will be the first customer to deploy SN50 within its next-generation AI data centers in Japan."
SI009 TechCompanyNews SambaNova Raises $350 Million In Series E Financing "This round brings SambaNova's cumulative funding to over $1.48 billion, supporting expanded manufacturing and cloud capabilities."
SI010 SambaNova Systems SambaNova — Official Homepage
SI011 Latka SambaNova Systems Revenue 2025: $100M ARR, $5B Valuation "In 2025, SambaNova Systems's revenue reached $100M. … SambaNova Systems Hit $100m revenue in June 2025."
SI012 Compworth SambaNova Systems: Revenue, Worth, Valuation & Competitors 2026
SI013 AInvest SambaNova's $500M Lifeline: A Stalled Takeover Creates a Valuation Gap "BlackRock marked them down to $2.4B … some secondary markets and reporting peg the company as low as $792M (an 84% drop from peak)"
SI014 TechStartups AI chip startup SambaNova, once valued at $4 billion, explores sale after failing to raise new funding "SambaNova Systems … is reportedly exploring a sale after struggling to secure new funding … has held early talks with potential buyers including private equity firms and major tech companies."
SI015 Electronics Weekly Intel reported to be looking at acquiring SambaNova
SI016 U.S. Securities and Exchange Commission (SEC EDGAR) SambaNova Systems, Inc. — Form D Notice of Exempt Offering of Securities (Series D) "677999516 [total amount sold] … 2021-04-13 [date of first sale]"
SI017 U.S. Securities and Exchange Commission (SEC EDGAR) SambaNova Systems, Inc. — EDGAR Filing History (CIK 0001733073)
SI018 U.S. Department of Energy / NNSA NNSA establishes partnership to accelerate key artificial intelligence computing "The cornerstone of this partnership agreement is the acquisition of multiple SambaNova DataScale systems, deployed at each of the aforementioned NNSA Laboratory facilities."
SI019 Bloomberg SambaNova Raises $350 Million, Wins SoftBank Deal for New AI Chip
SI020 Data Center Dynamics SambaNova seeking $500m in funding after acquisition talks with Intel stall — report
SI021 Eboona SambaNova Systems Stock, Valuation, IPO, Careers & News
SI022 LLM Stats SambaNova: API Pricing, Performance & Model Catalog
SI023 Global Banking and Finance Review AI chip startup SambaNova raises $350 million in Vista-led round, signs SoftBank deal
SI024 PM Insights SambaNova Systems Valuation
SI025 Tracxn SambaNova Systems — 2026 Company Profile & Team
SI026 Sacra (PDF) SambaNova Systems — Research Report
SI027 Incfact Annual Report on Sambanova Systems' Revenue, Growth, SWOT Analysis
SI028 Intuition Labs Cerebras vs SambaNova vs Groq: AI Chip Comparison (2025)
SE001 SambaNova Systems SambaRack | Purpose-Built AI Rack for Model Deployment SambaRack is a high-performance AI rack system designed to deploy and run large AI models efficiently in data centers. It integrates hardware, networking, and software into a single self-contained system built around SambaNova RDU chips.
SE002 SambaNova Systems SambaStack | Full-Stack Enterprise AI Platform SambaStack allows your team to fully configure the workloads you want to run on SambaRack systems. Each rack can run pre-configured model bundles and hot-swap between model bundles at inference time.
SE003 SambaNova Systems RDU | Next-Gen AI Chip for Inference at Scale The SN50 RDU (Reconfigurable Dataflow Unit) is SambaNova's fifth-generation AI inference processor, designed specifically for large-scale, agentic workloads.
SE004 SambaNova Systems Sambanova vs Nvidia: AI Chipsets Compared The SambaNova Reconfigurable Dataflow Architecture (RDA) creates custom processing pipelines that allow data to flow through the complete computation graph. This minimizes data movement and results in extremely high hardware utilization.
SE005 SambaNova Systems AI Solutions for Government & Public Sector
SE006 SambaNova Systems SambaNova Unveils Fastest Chip for Agentic AI, Collaborates with Intel, and Raises $350M+ The SN50 delivers five times more compute per accelerator and four times more network bandwidth than the previous generation.
SE007 SambaNova Systems SambaNova Developer Guide - SambaNova Documentation
SE008 SambaNova Systems SambaNova API Reference - SambaNova Documentation The SambaNova Developer guide and API reference provide the tools you need to build applications using SambaNova as an inference service.
SE009 IEEE Composition of Experts on the SN40L Reconfigurable Dataflow Unit A single eight-socket SN40L node achieves speedups between 2 and 13× due to aggressive operator fusion over an optimized baseline. The SN40L node deploys Samba-CoE, a 1 trillion-parameter CoE with a 19× smaller machine footprint, speeds up model switching time by 15–31× and achieves an overall speedup of 3.7× over a DGX H100 and 6.6× over a DGX A100.
SE010 Mark Gottscho (MICRO 2024 author) SambaNova SN40L: Scaling the AI Memory Wall with Dataflow and Composition of Experts (MICRO 2024)
SE011 Weicloud (SambaNova reseller/partner) SambaNova DataScale SN40L Product Data Sheet DataScale SN40L-8 nodes, each with 8 x Cerulean SN40L Reconfigurable Dataflow Unit (RDU) chips, 512 GB -1 TB of high-bandwidth total memory, and 6 TB -12 TB DRAM total memory
SE012 Business Wire SambaNova Launches the World's Fastest AI Platform SambaNova Cloud runs Llama 3.1 70B at 461 tokens per second (t/s) and 405B at 132 t/s at full precision.
SE013 Business Wire SambaNova Unveils Fastest Chip for Agentic AI, Collaborates with Intel, and Raises $350M+ The SN50 delivers five times more compute per accelerator and four times more network bandwidth than the previous generation. It links up to 256 accelerators over a multi‑terabyte‑per‑second interconnect.
SE014 Business Wire SambaNova and Intel Announce Blueprint for Heterogeneous Inference: GPUs for Prefill, SambaNova RDUs for Decode, and Intel Xeon 6 CPUs for Agentic Tools GPUs handle the highly parallel prefill phase, turning long prompts into key-value caches efficiently. SambaNova RDUs sit alongside Xeon 6 as the dedicated inference fabric for high-throughput, low-latency decode.
SE015 EE Times 'Token Wars' Heats Up As Cerebras and SambaNova Enter The Fray Most of what we do is for a batch size of 1. Batch=1 is important because when individual users are asking questions, you want to get the response as quickly as possible.
SE016 BigGo News SambaNova's SN50 AI Chip Claims 5x Speed, 8x Efficiency Over Nvidia B200, Partners with Intel and SoftBank According to the company, the SN50 delivers five times the compute performance per accelerator compared to its predecessor. A single SN50 chip reportedly generated 895 tokens per second per user when running a Llama 3.3 70B model, compared to 184 tokens per second on an Nvidia B200.
SE017 GitHub / SambaNova Systems GitHub - sambanova/ai-starter-kit SambaNova AI Starter Kits are a collection of open-source examples and guides designed to facilitate the deployment of AI-driven use cases for both developers and enterprises.
SE018 PyPI sambanova — Python Package Index The Samba Nova Python library provides convenient access to the Samba Nova REST API from any Python 3.9+ application.
SE019 Hugging Face (via Wayback Machine archive) sambanovasystems (SambaNova) — Hugging Face Organization SambaNova provides an integrated generative AI platform, including SambaNova's leading RDU accelerator, software and model management, and pre-trained generative AI checkpoints. models 32
SE020 Artificial Analysis SambaNova — Intelligence, Performance & Price Analysis Analysis of SambaNova's models across key metrics including quality, price, output speed, latency, context window & more.
SE021 ChatForest SambaNova Review: Custom RDU Silicon for Full-Precision Large-Model Inference Limitations: No fine-tuning API. SambaNova has no public fine-tuning service. Narrow model catalog. Approximately 10 models vs 50–200 at competitors. Context window limitations. DeepSeek-V3.1's standard tier is capped at 131K context with only 7K completion tokens.
SE022 jamesm.blog Cerebras, Groq, SambaNova: The Inference Hardware Insurgents The SambaNova bet is on the enterprise stack. Multi-model serving, regulated industries, on-premises deployments where data residency matters — this is the segment that values the architectural choices SambaNova made and is willing to pay for them. The risk is that the enterprise segment is harder to scale than the developer segment and the sales cycles are longer.
SE023 Intuition Labs Cerebras vs SambaNova vs Groq: AI Chip Comparison (2025)
SE024 Accenture Our generative AI collab with SambaNova SambaNova's generative AI offerings enable customers to fine-tune models on their own data, to own and control their own AI models and to have visibility into the model weights and datasets that the model is trained on.
SE025 Argonne National Laboratory / DOE Argonne National Laboratory deploys a new SambaNova inference-optimized cluster to support AI-driven science The Argonne deployment contains sixteen of SambaNova's Reconfigurable DataFlow Units (RDU). The ALCF AI Testbed, which already includes a SambaNova DataScale SN30 training cluster, is a growing collection of advanced AI accelerators available to researchers for open science.
SU001 Argonne Leadership Computing Facility (DOE Office of Science) Argonne National Laboratory deploys a new SambaNova inference-optimized cluster to support AI-driven science The Argonne deployment contains sixteen of SambaNova's Reconfigurable DataFlow Units (RDU).
SU002 SambaNova Systems Argonne National Laboratory Deploys SambaNova Suite to Advance AI Inference In Science Research Argonne National Laboratory will expand its AI infrastructure by deploying SambaNova Suite...for use by the scientific community as part of its AI Testbed at the Argonne Leadership Computing Facility.
SU003 BusinessWire Texas Advanced Computing Center (TACC) Selects SambaNova AI to Accelerate Scientific Research SambaNova will be our platform for inference on scientific applications. We will use SambaNova Suite to host the models we've trained on traditional supercomputers to integrate AI inference into the science workflow.
SU004 SambaNova Systems Texas Advanced Computing Center Deploys SambaNova Suite, Enabling AI Inference for Science Today, we announced a new customer relationship with the Texas Advanced Computing Center (TACC), one of the world's leading supercomputing centers.
SU005 FinancialContent (via BusinessWire) Oak Ridge National Laboratory Selects SambaNova to Expand Its Research in Secure and Energy-Efficient AI SambaNova's platform will enable multiple models to be run and queried in parallel for inference time scaling so that answers can be combined to make better predictions.
SU006 Lawrence Livermore National Laboratory AI gets a boost via LLNL, SambaNova collaboration Lawrence Livermore National Laboratory (LLNL) has installed a state-of-the-art artificial intelligence (AI) accelerator from SambaNova Systems, the National Nuclear Security Administration (NNSA) announced today.
SU007 U.S. Department of Energy / NNSA NNSA establishes partnership to accelerate key artificial intelligence computing initiatives The Department of Energy's National Nuclear Security Administration (DOE/NNSA), Lawrence Livermore National Laboratory (LLNL), and Los Alamos National Laboratory (LANL) announced a strategic partnership agreement with SambaNova Systems.
SU008 BusinessWire SambaNova Expands Deployment with SoftBank Corp. to Offer Fast AI Inference Across APAC SambaNova today announces the expansion of its SambaNova Cloud deployment and partnership with SoftBank Corp. in Japan.
SU009 BusinessWire SambaNova Unveils Fastest Chip for Agentic AI, Collaborates with Intel, and Raises $350M+ The news follows SambaNova's record bookings and revenue as they closed out 2025, reflecting accelerating demand for production-ready AI systems across financial services, telecommunications, energy, and sovereign deployments worldwide.
SU010 CB Insights SambaNova Customers
SU011 WebProNews AI Chip Startup SambaNova Explores Sale Amid Funding Woes and Nvidia Competition the startup had aimed to raise hundreds of millions but faced skepticism over its market traction and competitive edge against giants like Nvidia and AMD.
SU012 Tech Startups AI chip startup SambaNova, once valued at $4 billion, explores sale after failing to raise new funding SambaNova and other AI chip startups have had difficulty growing sales in the face of giant competitors like Nvidia.
SU013 SambaNova Systems Accenture and SambaNova: Delivering Generative AI to the Enterprise
SU014 OVHcloud OVHcloud selects SambaNova to power flagship AI Endpoints in-ferencing service Choosing SambaNova was a deliberate decision to provide our customers with an unrivalled inference experience.
SU015 RIKEN Center for Computational Science SambaNova Systems' SambaNova DataScale Adopted — Accelerating the Integration of Fugaku and AI for research in the Society 5.0 era
SU016 eesel.ai My honest Sambanova Cloud review: Is it right for you?
SU017 Carahsoft Technology Corp. SambaNova Government IT Procurement Contracts
SU018 Data Center Dynamics SambaNova lays off 77 employees as company pivots focus from training to inference SambaNova lays off 77 employees as company pivots focus from training to inference
SU019 BusinessWire SambaNova Announces That Fugaku-LLM Is Now a Part of Samba-1 We are very pleased that Fugaku-LLM...is introduced into SambaNova's Samba-1 CoE, making the achievements of Fugaku available to many people.
SU020 EE Times SambaNova Lays Off 15% of Workforce To Refocus on Inference
SU021 Converge Digest SambaNova AI Suite Powers Oak Ridge, Argonne and Texas Computing Centers
SU022 HPCwire SambaNova Partners to Deliver Sovereign AI Clouds in Australia, Europe and the UK
SU023 Data Center Dynamics SambaNova partners with SoftBank to expand its AI cloud throughout APAC
SU024 Procurely Sambanova Systems Inc — Federal & State Contract Awards
SU025 CB Insights (via Tech Startups) SambaNova customer list — OTP Bank, LLNL, Blackbox.AI, OVHcloud, TACC, Ascend, Carahsoft
SR001 Finnegan Henderson Farabow Understanding the BIS Final Rule on Advanced AI Chips
SR002 Mayer Brown LLP New BIS Export Control Rules for Advanced AI Chips and Model Weights
SR003 Wilson Sonsini Goodrich & Rosati SambaNova Series E – National Security & Technology Practice Involvement
SR004 WARNScan (California EDD public filing) WARN Act Notice – SambaNova Systems (California)
SR005 The Register SambaNova raises $350M Series E, touts SN50 inference chip
SR006 Ticker Report Intel CEO Lip-Bu Tan's SambaNova Board Seat Raises Conflict Questions
SR007 TechZine Intel CEO on SambaNova board: dual-role conflict of interest
SR008 Forbes / Moor Insights The AI Chip Startup Race: Groq, Cerebras, and SambaNova
SR009 DeployBase SambaNova vs. Groq vs. Cerebras: Inference Performance Comparison 2025
SR010 Silicon Valley Bank AI Hardware Startup Burn Rate & Capital Efficiency Report 2025
SR011 Potomac Officers Club NNSA / LANL AI Supercomputer Procurement Update 2025
SR012 Harvard Business Review AI Chip Supply Chain Concentration: The TSMC Dependency Problem
SR013 Data Center Dynamics SambaNova Systems confirms workforce reduction in 2025
SR014 SiliconAngle SambaNova raises $350M Series E led by Vista Equity Partners
SR015 Tom's Hardware Nvidia H100 vs B200 Inference Benchmarks
SR016 Axios Intel acquisition talks with SambaNova reportedly fall through
SR017 Wall Street Journal AI Chip Startups Face Valuation Reset in 2025
SR018 Crunchbase SambaNova Systems – Funding Rounds (Crunchbase)
SR019 Semiconductor Digest TSMC Geopolitical Risk: Analysis for Chip Supply Chains 2026
SR020 TechCrunch Cerebras Systems files for IPO, discloses wafer-scale AI chip details
SR021 SoftBank Group SoftBank Announces Commitment to SambaNova SN50 AI Systems
SR022 Argonne National Laboratory Argonne National Laboratory AI Infrastructure Deployment 2025
SR023 U.S. Department of Energy DOE National Laboratory AI Supercomputing Investments 2025
SR024 CUDA Zone CUDA Ecosystem Developer Survey 2025
SR025 Reuters Nvidia's CUDA dominance creates deep enterprise lock-in
SR026 Bloomberg Groq Closes $1.5 Billion Sovereign AI Infrastructure Deal
SR027 Financial Times AI Hardware Startups Confront Capital Intensity Crunch in 2026
SR028 U.S. Securities and Exchange Commission SambaNova Systems – SEC Form D Filings
SR029 Sacra SambaNova Systems – Research Report 2025
SR030 Lawrence Livermore National Laboratory LLNL–SambaNova AI Inference Partnership 2025
SV001 SiliconAngle SambaNova steps up its challenge to Nvidia with new chip, $350M funding and a powerful ally in Intel Chipmaker SambaNova Systems Inc. unveiled its most advanced artificial intelligence processor today as it closed on a bumper $350 million late-stage round of funding from Vista Equity Partners, Cambium Capital and others.
SV002 Wilson Sonsini Goodrich & Rosati Wilson Sonsini Advises SambaNova on $350 Million Series E Financing On February 24, 2026, SambaNova, a leader in next-generation AI infrastructure, announced that it has raised more than $350 million in investment from new and existing investors.
SV003 TechCrunch Nvidia AI chip challenger Groq raises even more than expected, hits $6.9B valuation AI chip startup Groq confirmed Wednesday that it raised a fresh $750 million in funding at a post-money valuation of $6.9 billion.
SV004 CNBC AI chipmaker Cerebras targets $3.5 billion raise in IPO
SV005 TechCrunch OpenAI's cozy partner Cerebras is on track for a blockbuster IPO Cerebras is aiming to sell 28 million shares in the offering, priced between $115 and $125 apiece to raise $3.50 billion in the company's second attempt to go public.
SV006 Data Center Dynamics AI chip company Groq raises $750m at $6.9bn valuation
SV007 The Economic Times Intel nears $1.6 billion deal for AI startup SambaNova: Report Intel is in advanced talks to acquire artificial intelligence (AI) startup SambaNova Systems in a $1.6 billion deal that includes debt.
SV008 WebProNews AI Chip Startup SambaNova Explores Sale Amid Funding Woes and Nvidia Competition SambaNova Systems, a Palo Alto-based startup once hailed as a promising challenger to Nvidia's dominance, is now quietly exploring a sale amid difficulties in securing fresh capital.
SV009 Aventis Advisors AI Valuation Multiples in 2025
SV010 Yahoo Finance / Forge SambaNova (SANS.PVT) Valuation, History & News Forge Price as of May 26, 2026: $19.05. Estimated Valuation: 1.44B. Series E-1: $307.13M raised at $30.99/share implying $2.34B valuation.
SV011 TechFundingNews Cerebras files for IPO with $510M revenue and a $23B valuation: report
SV012 Finro Financial Consulting AI Valuation Multiples (Q1 2026) — 575 Company Dataset Median EV/Revenue multiple for AI infrastructure companies: approximately 21.2x average 31.3x, 25th–75th percentile 10.2x–39.6x.
SV013 Finro Financial Consulting AI Startup Valuations in 2025: Benchmarks Across 400+ Companies
SV014 Yahoo Finance (Reuters) Cerebras targets $26.6 billion valuation in US IPO as AI chip demand surges
SV015 Blockonomi Nvidia (NVDA) vs AMD: The 2026 AI Chip Showdown Reveals a Clear Leader Nvidia's fiscal 2026 revenue reached $215.9 billion. AMD achieved $34.6 billion in full-year 2025 revenue.
SV016 TheOutpost.ai Vista Equity Partners and Intel Lead $350M Funding Round for AI Chip Startup SambaNova Systems
SV017 Business Review Live SambaNova raises Series E funding to strengthen its position in AI hardware market Vista Equity Partners is leading a $350 million funding round for SambaNova Systems, marking a significant shift for the private equity giant. Intel Corp planning to invest about $100 million, with potential commitments of up to $150 million.
SV018 BusinessWire SambaNova Unveils Fastest Chip for Agentic AI, Collaborates with Intel and Raises $350M SambaNova has obtained $350 million in strategic Series E financing to expand manufacturing and cloud capacity.
SV019 CNBC Intel partners with AI chip startup SambaNova after acquisition talks reportedly failed Bloomberg said the company was mulling an offer in the region of $1.6 billion. It's not known if Intel actually tabled such an offer, but it seems unlikely SambaNova would have agreed, for the amount was only a third of what it was valued at following its previous funding round in 2021.
SV020 Data Center Dynamics SambaNova exploring sale after struggling to secure further funding - report A report from Caplight said that BlackRock has cut the value of its SambaNova shares by 17 percent, valuing the company at $2.4bn.
SV021 Data Center Dynamics SambaNova seeking $500m in funding after acquisition talks with Intel stall - report Intel's CEO, Lip-Bu Tan is chairman of the Palo Alto, California-based SambaNova.
SV022 TechStartups AI chip startup SambaNova, once valued at $4 billion, explores sale after failing to raise new funding
SV023 AInvest SambaNova's $500M Lifeline: A Stalled Takeover Creates a Valuation Gap
SV024 Bloomberg Intel-Backed SambaNova Raises Cash, Touts SoftBank Chip Contract
SV025 U.S. Securities and Exchange Commission SambaNova Systems Form D — Notice of Exempt Offering of Securities (Series D) Amount raised: $677,999,515. Date of first sale: 2021-04-13. Issuer: SambaNova Systems, Inc.
SV026 PM Insights SambaNova Systems Valuation — Market Intelligence Snapshot
SV027 Compworth SambaNova Systems: Revenue, Worth, Valuation & Competitors 2026
SV028 Tracxn SambaNova Systems — 2026 Funding Rounds & List of Investors
SV029 IntuitionLabs AI Cerebras vs SambaNova vs Groq: AI Chip Comparison (2025) As of 2025, all three firms have achieved multi-billion-dollar valuations: Cerebras raised $1.1 billion at an $8.1 billion valuation, Groq raised $750 million pushing its valuation to $6.9 billion, and SambaNova raised hundreds of millions with a $5.1B Series D in 2021.
SV030 TechCompanyNews SambaNova Raises $350 Million In Series E Financing