Startup Diligence
Diligence report AI inference infrastructure / custom silicon late-stage private 2026-05-26

d-Matrix

Full-stack inference hardware company with credible product progress but incomplete proof at a $2B private mark

d-Matrix has a differentiated inference architecture and credible commercialization momentum, but the $2B Series C still gets ahead of public proof on revenue, customer depth, and durable deployment economics.

Cover facts

Latest disclosed valuation 01
2000 USD M [CO015, CV001]
Total disclosed funding 02
450 USD M [CO015, CV001]
Founded 03
2019 [CO001]
Headquarters 04
Santa Clara, California [CO002]
Product stack 05
Corsair / JetStream / Aviator [CO010]
Latest disclosed headcount 06
250+ employees [CO018]

Company profile

d-Matrix is a Santa Clara-based AI inference infrastructure company founded in 2019 by Sid Sheth and Sudeep Bhoja. The company now presents a concrete full-stack product story around Corsair accelerator cards, JetStream networking, Aviator software, and open-standards rack deployment for hyperscale, enterprise, and sovereign datacenters. Public financing history is well supported, culminating in a $275 million Series C at a $2 billion valuation and $450 million of total disclosed funding, with 250+ employees and a multi-office footprint by late 2025. What remains missing is the operating denominator: public sources still do not disclose revenue, ARR, broad customer count, or control economics in enough detail to underwrite the business like a mature late-stage infrastructure company.

Website
www.d-matrix.ai
Founded
2019-01-01
Founders
Sid Sheth, Sudeep Bhoja
Headquarters
Santa Clara, California
Product
Corsair accelerator cards, JetStream transparent networking cards, Aviator software, and the SquadRack reference architecture together form a low-latency inference stack designed for standard PCIe servers, Ethernet fabrics, and brownfield datacenter deployment.
Customers
Hyperscale, enterprise, sovereign-cloud, and specialist AI-cloud operators running latency-sensitive inference workloads.
Business model
Hardware-plus-software platform sales routed through OEM, integrator, and cloud partners, with economics likely tied to multi-component deployments rather than standalone self-serve software.
Stage
Series C private company
Funding status
$110M Series B in 2023 followed by a $275M Series C in November 2025 at a $2B valuation, bringing total disclosed funding to $450M.
[CO001, CO002, CO004, CO008, CO010, CO015, CO018, CO019]

Executive summary

Top strengths

  • Concrete full-stack productization: Corsair, JetStream, Aviator, and SquadRack make d-Matrix more than an architecture concept and give buyers a testable deployment surface.
  • Differentiated latency thesis: the memory-centric DIMC design and open-standards deployment model target a real inference bottleneck in brownfield datacenters rather than requiring a full rack rebuild.
  • Strong capital and ecosystem support: $450M disclosed funding and visible backing from Temasek, Bullhound, Triatomic, QIA, EDBI, and M12 reduce near-term existential financing risk.
  • Early commercialization proof is real enough to matter: Supermicro channel availability, GigaIO scale-up integration, and Gimlet's heterogeneous-cloud work show a credible path from sampling to production.

Top risks

  • No public revenue, ARR, gross margin, pricing, or customer-count disclosure supports the current $2B mark, so valuation underwriting is denominator-blind.
  • Execution depends on synchronized maturity across Corsair, JetStream, Aviator, partner-qualified systems, and the pre-commercial 3DIMC/Raptor roadmap.
  • Supply-chain and ecosystem dependence on advanced packaging, foundry partners, and OEM/integration partners creates real qualification, cost, and timing risk.
  • Customer proof remains partner-led and pilot-like: public evidence still lacks a named production-customer roster, retention metrics, or concentration data.
  • Security, compliance, and export-control posture are thinner publicly than many enterprise or sovereign buyers may require.

Open gaps

  • Audited revenue or ARR, gross margin, pricing, cash runway, and realized unit economics remain undisclosed.
  • Named production deployments, active customer count, repeat-order cadence, retention, and concentration metrics are still missing from public sources.
  • Cap-table terms, liquidation preferences, secondary or 409A marks, and board-control economics are not public.
  • Production yield, cost curve, and delivery timing for JetStream ramp and the 3DIMC-to-Raptor roadmap remain under-evidenced.
  • Public security/compliance artifacts such as SOC 2, ISO 27001, trust-center documentation, and concrete export-control exposure analysis remain incomplete.

Contents

Chapter 01

01Company Overview

1.1 Identity, Footprint, and Business Model

d-Matrix should be treated as a private, Series C AI infrastructure company built specifically around inference rather than training. The official home page, about page, and 2025 Series C release all anchor the company in Santa Clara and frame its business as selling accelerated compute for hyperscale, enterprise, and sovereign data-center workloads. Public materials say the company was founded in 2019, and the current product stack combines Corsair inference accelerators, JetStream networking, and Aviator software. The home page further says the 3DIMC architecture uses a chiplet-based, PCIe-friendly design that can scale to models above 100 billion parameters. That makes the reusable ground truth for later chapters straightforward: d-Matrix is monetizing hardware-plus-software infrastructure for low-latency inference. What the public record does not yet provide is a companywide customer count or revenue figure that would let diligence treat current commercial scale as fully quantified.[CO001, CO002, CO003, CO008, CO009, CO010]

Snapshot KPI table
MetricValue / statusDateConfidenceGap / note
Founded20192019highOfficial and investor materials align on 2019 as the founding year.
HeadquartersSanta Clara, California2025-11-12highSeries C and Corsair materials keep Santa Clara as the canonical HQ.
Public office footprintToronto; Sydney; Bangalore; Belgrade2025-11-12highThese are the latest named non-HQ offices in the fetched pack.
StagePrivate Series C company2025-11-12highStage call is supported by the November 2025 financing and no public-listing evidence.
Core productsCorsair accelerators; JetStream networking; Aviator software2025-11-18highBy late 2025 the company publicly described a three-part hardware and software stack.
Total disclosed funding$450M2025-11-12highOfficial, investor, and independent coverage all corroborate the total.
Latest disclosed valuation$2B2025-11-12highValuation is tied specifically to the Series C close.
Latest disclosed headcount250+ worldwide2025-11-12mediumPublicly disclosed in Series C key facts, but not independently refreshed in 2026.
Public commercialization proofGigaIO partnership plus Gimlet Labs benchmark2026-03mediumNamed proof comes from partner and workload announcements rather than a broad customer roster.
Current customer countNot publicly disclosed2026-05-26lowRequest booked-customer, deployment, and repeat-order counts rather than inferring from partner rhetoric.
Current revenue or ARRNot publicly disclosed2026-05-26lowRequest monthly recurring or run-rate revenue directly from finance materials.

This table separates hard public identity and financing facts from material metrics that remain unavailable in the fetched pack, especially customer count and revenue or ARR.

[CO001, CO002, CO003, CO009, CO010, CO015]
FO002: Company snapshot logic

Founders and capital feed a full-stack inference platform, but commercialization evidence still trails the fundraising narrative.

[CO004, CO008, CO010, CO015, CO025, CO026]

1.2 Founders, Leadership, and Governance

The public leadership picture is unusually clear at the founder, executive, and board-name level but still incomplete on control economics. The about page identifies Sid Sheth as founder and CEO and Sudeep Bhoja as founder and CTO, while public investor and product releases continue to quote the same pair through 2025, suggesting stable technical and commercial leadership. The company also exposes an operating bench across software, product, finance, legal, and manufacturing. Governance visibility is more mixed. d-Matrix publicly names a board slate tied to Bullhound Capital, Playground Global, Triatomic Capital, M12, Nautilus Venture Partners, and Temasek, which aligns well with the financing history and gives later chapters a usable map of who likely influences strategy. But the fetched pack does not disclose board committees, voting thresholds, protective provisions, or current ownership percentages, so governance analysis should treat the board roster as real but incomplete rather than assume standard venture rights.[CO004, CO005, CO006, CO007, CO042, CO046]

Leadership and founder table
PersonPublic roleBackground or functional scopeWhy it mattersKey-person or governance note
Sid ShethFounder and Chief Executive OfficerPublic commercial and strategy voice across Series B, Corsair, and Series C releasesCentral owner of fundraising story, go-to-market posture, and inference-first thesisHigh key-person dependence because he anchors both investor messaging and external strategy
Sudeep BhojaFounder and Chief Technology OfficerCo-founder tied to architecture and compute-memory roadmapCore technical steward for the platform and successor products beyond CorsairTechnical continuity is strong, but roadmap execution remains concentrated
Peter BuckinghamSenior Vice President, SoftwareVisible software executive on the about pageShows the company is building software capability rather than only siliconPublic sources do not show deeper software-org breadth
Sree GanesanVice President, ProductPublic product leader on the about pageSupports functional coverage for productization and roadmap packagingNo public product P&L or commercial ownership details disclosed
PJ JamkhandiVice President, Finance & AccountingNamed finance executive on the about pageImportant for diligence because public fundraising outpaces financial disclosureNo public CFO-level KPI package appears in the fetched pack
Richard OgawaGeneral CounselNamed legal executive on the about pageRelevant because financing and IP-heavy partnerships rely on legal executionBoard committee and legal-risk oversight structure are not disclosed publicly
Jerry QubainVice President, Manufacturing, OperationsVisible operations leader on the about pageImportant in a semiconductor business that must scale manufacturing and deliveryManufacturing footprint and supplier dependencies remain mostly undisclosed
Per RomanBoard member via Bullhound CapitalInvestor-affiliated director disclosed on the about page and echoed by Series C investor materialsIllustrates that new capital providers have visible governance influencePublic materials do not disclose committee assignments or vote thresholds
Michael StewartBoard member via M12Investor-linked director and recurring public supporter in company releasesConnects long-running Microsoft adjacency with governance visibilityInvestor rights and ownership economics behind the seat are not public
Russell ThamBoard member via TemasekDirector tied to the Series B lead investor on the public board slateSignals continuity from Series B capital into current governanceNo fetched source discloses whether Temasek holds protective provisions

Coverage is intentionally partial because the public record exposes named founders, executives, and directors more clearly than committee structures, ownership percentages, or internal reporting lines.

[CO004, CO005, CO006, CO007, CO042, CO046]

1.3 Capital Formation, Stage, and Stakeholders

Capital formation is the strongest externally corroborated part of the company overview. VentureBeat reported a $44 million Series A in 2022 around the Microsoft Project Bonsai relationship. The official 2023 release then disclosed a $110 million Series B led by Temasek, and the official 2025 Series C release, Bullhound statement, QIA statement, and independent press all agree that d-Matrix raised $275 million at a $2 billion valuation, bringing total disclosed funding to $450 million. The same Series C materials also provide the clearest current scale markers in the fetched pack: 250+ employees worldwide, Santa Clara headquarters, and offices in Toronto, Sydney, Bangalore, and Belgrade. That combination supports a practical stage call of late-stage private infrastructure startup rather than early research project. Still, fundraising visibility exceeds operating disclosure. The company says the new capital will support global expansion and large-scale deployments, but no fetched source discloses current revenue, ARR, or a broad named-customer list, so those metrics should remain explicit diligence gaps.[CO011, CO012, CO013, CO014, CO015, CO016]

Stakeholder or investor map
StakeholderRolePublic signalControl or economic importanceDiligence ask
Bullhound CapitalSeries C co-lead and board-affiliated investorLed the 2025 round and is represented on the public board by Per RomanNew-money sponsor with visible governance presence in the latest roundClarify board seat terms, economics, and liquidation preferences
Triatomic CapitalSeries C co-leadNamed as a 2025 co-lead and represented on the public board by Jeff HuberImportant signaling investor in the current capital stackConfirm check size and any information or consent rights
TemasekSeries B lead and Series C co-leadBacked the 2023 Series B, reappeared in the 2025 Series C, and is represented by Russell Tham on the board slateMost visible repeat backer across the financing arcRequest current ownership percentage and any special governance rights
Qatar Investment AuthorityNew Series C investorQIA publicly announced participation in the 2025 roundAdds sovereign-capital credibility and possible long-duration supportClarify whether the position is purely financial or strategic
M12Repeat investor and board-linked Microsoft fundVisible in Series B and Series C materials and represented by Michael Stewart on the board slateSustains a long-running Microsoft adjacency with board-level visibilityClarify any commercial or cloud-channel expectations beyond venture ownership
Playground GlobalEarly investor with public board visibilityQuoted in Series B materials and represented on the board by Sasha OstojicLikely important to the early technical and strategic formation storyConfirm current stake after later financings
Nautilus Venture PartnersBoard-linked repeat investorNamed on the board slate and in later investor rostersProvides continuity between earlier rounds and current governanceClarify economics, board voting rights, and follow-on participation
GigaIODeployment and infrastructure partnerMay 2025 partnership integrated Corsair with SuperNODE for enterprise-scale inferenceMost visible system-level commercialization proof before the 2026 acquisition title on the newsroom indexRequest booked deployments, pipeline value, and any exclusivity terms
AlchipRoadmap and packaging partnerNovember 2025 collaboration targets 3D DRAM and the Raptor successor platformStrategically important to the next product generation rather than current revenue visibilityClarify commercialization timetable, manufacturing dependencies, and co-development obligations
Gimlet LabsPublic workload-validation partnerMarch 2026 benchmark used Corsair in a heterogeneous deployment modelProvides concrete external workload evidence even if it is not a broad customer rosterRequest whether the relationship has recurring production spend or only evaluation status

This is not a full cap table. It combines investors and strategic commercialization counterparties because both groups shape the company’s control story and near-term execution risk.

[CO015, CO016, CO020, CO026, CO027, CO028]
FO003: Snapshot KPIs

Publicly supportable KPIs concentrate in funding, footprint, and product capability rather than revenue or customer count.

Current customer count and revenue or ARR are intentionally excluded because the fetched pack does not disclose them.

[CO003, CO009, CO015, CO018, CO022, CO024]

1.4 Milestones, Commercial Proof, and Pressure Points

The milestone record shows a company moving from architecture thesis to productization, partner integration, and early public workload proof. The chronology starts with 2019 founding, a 2022 Series A and Microsoft-linked tooling relationship, and the 2023 Series B that funded commercialization. Corsair was unveiled in November 2024 with early-access sampling and a Q2 2025 broad-availability target. d-Matrix then expanded the stack with a May 2025 GigaIO partnership, a September 2025 JetStream launch, the October 2025 SquadRack reference architecture, the November 2025 Alchip roadmap announcement, and a March 2026 Gimlet Labs benchmark that claimed 2–10x latency benefits in a heterogeneous setup. The same record also exposes pressure points. Public materials still rely heavily on vendor claims and partner-led commercialization proof, while independent coverage emphasizes that startups like d-Matrix must overcome Nvidia's installed ecosystem, scale manufacturing, and prove durable customer adoption. That makes the chapter's biggest remaining gaps commercial disclosure and control visibility, not identity or capitalization.[CO021, CO022, CO023, CO024, CO025, CO026]

Milestone table
DateEventTypeAmount or statusParticipantsImplication
2019Company foundedfoundingFormationSid Sheth; Sudeep BhojaSets the start date and explains why later materials call the firm six years old in 2025
2022-04Series A and Microsoft Bonsai relationship publicizedfinancing$44M Series A; Microsoft tooling supportd-Matrix; VentureBeat; Microsoft Project BonsaiShows early ecosystem access before Corsair was public
2023-09-06Series B closedfinancing$110M led by Temasekd-Matrix; Temasek; Playground; M12; SK Hynix; Marvell; EntradaFunds commercialization and expands strategic-capital base
2024-11-19Corsair unveiledproductFirst public inference platform launchd-Matrix; M12; OEM and integrator collaboratorsMoves the company from architecture thesis to shipping product story
2025-05-01GigaIO partnership announcedpartnershipSuperNODE integration for rack-scale inferenced-Matrix; GigaIOProvides enterprise deployment proof and system-level distribution support
2025-09-08JetStream launchedproduct400Gbps Transparent NIC; production expected by year-endd-MatrixExpands d-Matrix into networking and makes the platform more complete
2025-10-14SquadRack announcedscaleOpen standards-based rack-scale reference architectured-Matrix; Arista; Broadcom; SupermicroShows ecosystem traction and a standards-based deployment strategy
2025-11-12Series C closedfinancing$275M at $2B valuation; total raised $450Md-Matrix; Bullhound Capital; Triatomic; Temasek; QIA; EDBI; M12Confirms late-stage private status and funds global expansion
2025-11-18Alchip collaboration announcedpartnership3DIMC and Raptor successor roadmapd-Matrix; AlchipPoints to the product roadmap beyond Corsair
2026-03Gimlet Labs benchmark publishedpartnership2–10x latency improvement claim in heterogeneous setupGimlet Labs; d-MatrixProvides public workload proof using Corsair in an external environment
2026-04GigaIO data center business acquisition appeared on the newsroom indexscalePost-Series C acquisition milestoned-Matrix; GigaIOSuggests continued system-level expansion after the 2025 financing

Dates are kept as visible in the fetched record. Month-only milestones remain month-level rather than being over-precision backfilled from memory or model assumptions.

[CO001, CO011, CO012, CO021, CO023, CO024]
FO001: Company milestone timeline

The company moved from funding and architecture positioning into product launch, partner integration, and public workload validation between 2019 and 2026.

Month-only milestones use the first day of the stated month so the renderer can place them on a timeline without implying a more precise day than the fetched source supports.

[CO001, CO008, CO011, CO012, CO013, CO021]

1.5 Exhibits

Chapter 02

02Market Analysis

2.1 Market Boundary and the Shift from Training to Inference

The relevant market boundary for d-Matrix is not 'AI' in the abstract and not even all accelerator spending. Public company materials, technical docs, and launch messaging consistently place the company in datacenter inference: serving already-trained generative models with low latency, high memory bandwidth, and better cost and energy efficiency. Included spend therefore covers accelerator cards, inference servers and racks, interconnect and NIC layers required to scale serving, and the software stack that makes those systems deployable in enterprise or neocloud environments. Excluded spend includes model training clusters, hyperscaler-custom silicon that never becomes merchant hardware, pure API spend, and edge-device inference. This distinction matters because the technical bottlenecks change after training. d-Matrix and third-party developer evidence both emphasize that prefill is compute-bound while decode and agentic serving become memory- and latency-bound, which is exactly the part of the stack where generic GPU economics become weakest and where a specialist accelerator can plausibly wedge in.[CM001, CM002, CM003, CM004, CM005, CM006]

Market definition table
Segment / categoryIncluded spendExcluded spendBuyer / payerRelevance to d-Matrix
Broad generative AI spendModel software, services, cloud consumption, infrastructure, and adjacent toolingNo clean separation between training, inference, software, and servicesCIO / CTO / line-of-business budgetsToo broad for underwriting d-Matrix
AI accelerator marketMerchant accelerator hardware for AI workloads across training and inferenceGeneral CPU spend and most software/servicesClouds, enterprises, model labs, OEMsUseful hardware outer bound, but still broader than d-Matrix
Datacenter generative inference infrastructureInference cards, servers, racks, memory, interconnect, and serving software for deployed LLMsModel training clusters, edge AI devices, unrelated HPCInfra teams, platform teams, AI product operatorsClosest direct category
Open-standards low-latency inference systemsPCIe and Ethernet based inference deployments using merchant accelerators and partner serversCaptive hyperscaler custom silicon and bespoke internal fabricsEnterprises, neoclouds, OEM/integrator channelsMost relevant d-Matrix wedge
Status-quo substitute stackNvidia/AMD GPU servers, Groq-style inference clouds, and external API inferenceSpecialist merchant accelerators not yet adoptedExisting cloud or platform ownersPrimary displacement pool

Boundary logic narrows from very broad AI spend to the smaller merchant datacenter inference layer that d-Matrix actually addresses. The fourth row is the operative underwriting boundary; rows above it are context, not SAM.

[CM001, CM002, CM003, CM004, CM005, CM007]

2.2 Demand Drivers and Structural Bottlenecks

Public evidence strongly supports the category thesis that inference demand is outrunning the infrastructure originally built for training. CNN, VentureBeat, and TechTarget all describe GPU scarcity, datacenter expansion pressure, and the need for more energy and networking capacity to keep up with AI demand. d-Matrix’s own architecture narrative is directionally consistent with that independent evidence: memory movement burns power, host-mediated networking adds latency, and batch size creates a direct trade-off between user experience and hardware ROI. Reasoning and agentic workflows amplify those pressures because they increase inference-time compute without letting operators hide latency behind a simple throughput metric. That is why the market is shifting from a training-first conversation to an operations-and-serving conversation. The constraint stack is not just raw FLOPS; it is power, memory bandwidth, cluster communication overhead, and the economics of delivering a fast answer at scale without leaving too much expensive hardware idle.[CM008, CM009, CM010, CM011, CM012, CM013]

Growth drivers and constraints table
FactorDirectionTimingImplication for d-MatrixDiligence ask
Reasoning and agentic workloads expand inference-time computeDriverCurrent and acceleratingHelps the category thesis because more serving work becomes latency-sensitive and memory-boundQuantify how much of pipeline demand is reasoning / agentic versus conventional chat
GPU scarcity and datacenter build pressureDriverCurrentCreates openness to alternatives and heterogeneous serving architecturesRequest actual buyer stories where lack of GPU supply pulled d-Matrix into evaluation
Memory-movement power taxDriverStructuralStrengthens d-Matrix's DIMC / SRAM narrative versus separate-compute HBM stacksBenchmark power per request on representative memory-bound workloads
Host-mediated multi-node networking overheadDriverStructural at scaleSupports JetStream and transparent-NIC positioning for rack-level inferenceVerify production latency and jitter under real multi-node traffic
Batch-size economics in interactive appsDriverCurrentFavors hardware that keeps utilization high at small batches instead of optimizing only peak throughputGet customer evidence on acceptable latency thresholds and revenue sensitivity
Nvidia software moat and benchmark leadershipConstraintCurrentLimits d-Matrix adoption outside very specific workload wedgesTest software maturity, tooling friction, and time-to-deploy versus GPU stacks
Captive hyperscaler custom silicon growthConstraintCurrent and risingShrinks the merchant share of the broad inference market available to startupsEstimate how much target demand is already committed to internal silicon or reserved GPU supply
Need for proof of production TCO, not just vendor claimsConstraintImmediateIndependent workload-level evidence is still thinner than the market narrativeRequest signed production deployments, reference architectures, and customer ROI evidence

This table mixes structural drivers and structural constraints because the same forces that expand inference demand also tighten the proof burden on a startup supplier. Every row is meant as an underwriting question, not as a claim that demand converts cleanly into revenue.

[CM008, CM009, CM010, CM011, CM012, CM013]

2.3 Buyer Segments, Budget Owners, and Adoption Path

The buyer map is narrower than generic 'enterprise AI' language implies. d-Matrix’s most realistic early segments are enterprises or private-cloud operators with latency-sensitive inference workloads, neocloud and managed-inference providers, AI product companies trying to control serving economics, and OEM or integrator partners who package the cards into systems. Hyperscalers still matter strategically, but they are structurally harder to win because they already control large GPU estates and are increasingly building captive silicon. d-Matrix’s open PCIe and Ethernet positioning, plus its air-cooled rack narrative, are more naturally aligned with customers who want to slot inference into existing datacenter footprints without redesigning facilities around liquid-cooled megaclusters. The adoption path is correspondingly infrastructure-heavy: identify a memory-bound or latency-sensitive workload, validate fit in a pilot or single-node deployment, then expand to server- or rack-level production once networking, model partitioning, and operating cost are proven. Integrators and OEMs matter because buyers are purchasing a deployable system, not just a chip.[CM016, CM017, CM018, CM019, CM020, CM021]

Segment / buyer map
SegmentBuyerUserPayerWorkflowBudget ownerAdoption trigger
Enterprise private cloud / regulated operatorsInfrastructure or platform engineering teamML platform, application, and operations teamsCorporate IT / infra budgetInternal copilots, support agents, private retrieval and reasoningCTO / VP Infrastructure / Head of AI platformNeed for low latency, data control, and predictable serving cost
Neocloud or managed inference providerCloud product and capacity planning teamsExternal developer customersCloud capex and service marginHosted model serving and inference API productsGM cloud / VP product / financeMargin pressure from GPU serving and desire for differentiated latency
AI product company / model API operatorApplied AI or platform teamEnd users of coding, support, or agent productsProduct P&LInteractive, memory-bound inference workloadsCPO / CTO / infra leadRapid usage spikes and loss-making inference economics
OEM / integrator channelServer OEM or system integratorSolution architects and deployment engineersEnd-customer project budgetPackaging cards, servers, networking, and support into deployable systemsBusiness unit lead / solution practice ownerCustomer demand for turnkey deployments
Hyperscaler / captive silicon buyerCloud infrastructure organizationInternal model and platform teamsMassive datacenter capexFoundation-model and cloud inference at hyperscaleSVP infrastructure / custom silicon orgUsually harder for d-Matrix to win because incumbent GPU and internal-silicon options are strong

The first four rows are realistic merchant-facing targets. The hyperscaler row is included because it matters strategically, but public evidence suggests it is the least directly accessible segment for a startup merchant accelerator.

[CM016, CM017, CM018, CM019, CM020, CM021]
FM003: Buyer / segment map

Realistic buying segments are mapped to the user profile, payer model, budget owner, and the trigger most likely to move a d-Matrix-style deployment into production.

[CM018, CM019, CM020, CM021, CM041, CM042]
FM004: Adoption funnel or value-chain map

Merchant inference adoption narrows from workload pain to pilot proof to production rollout, with incumbent software and capacity alternatives acting as a constant friction node.

The flow is qualitative. It reflects the infrastructure buying path implied by the source pack rather than a published conversion funnel, with the risk node showing where incumbents and API abstraction can arrest adoption before full rollout.

[CM018, CM020, CM021, CM037, CM043, CM044]

2.4 Sizing Lenses and What Is Actually Addressable

The public market numbers are large, but they are not directly additive and they do not all describe the same commercial surface. Gartner’s generative-AI spending forecast is the broadest adjacency and includes software, services, and infrastructure. MarketsandMarkets’ AI inference estimate is narrower, but still broader than merchant silicon alone. Silicon Analysts’ AI accelerator market is narrower again on the hardware axis but still includes training and multiple incumbent or captive architectures. The most decision-relevant lower-bound lens in the source pack is not even a TAM report at all: CNBC’s estimate that Bing AI alone needed at least $4 billion of infrastructure. That deployment-cost lens captures why the category exists. For d-Matrix, the accessible SAM is smaller than all of these figures because it excludes training, excludes much of captive hyperscaler custom silicon, and excludes buyers who are content to consume inference through external APIs. Public data is sufficient to show a large category, but not sufficient to publish a clean d-Matrix SOM without management disclosures.[CM022, CM023, CM024, CM025, CM026, CM027]

TAM / SAM / SOM or sizing lens table
Publisher / lensYearGeographyValue / indicatorGrowthMethodology lensConfidenceLimitation
Gartner via Forbes2025Global$644B generative AI spending76.4% YoYVery broad spend forecast across hardware, software, and serviceslowFar broader than inference hardware or merchant accelerators
MarketsandMarkets via Forbes2025-2030Global$106.15B to $254.98B AI inference marketImplied strong multi-year growthInference-market forecast cited in 2025 market commentarylowIncludes more than merchant datacenter hardware
Silicon Analysts2025E-2026EGlobal$160B to $200B+ total AI accelerator marketFast growth with share broadeningRevenue view of accelerator hardware marketlowMixes training and inference and includes incumbent/captive exposure
CNBC infrastructure lens2023Global / Bing reference caseAt least $4B infrastructure needed to serve Bing AI usersN/ADeployment-cost lens rather than TAMmediumSingle-service example, not market size
d-Matrix product architecture2025-2026Rack / datacenterUp to 100B models in performance mode; 1T+ in capacity modeScales with cards, servers, and racksTechnical deployment envelope for a merchant inference platformmediumCapability statement, not revenue TAM
Analyst synthesis2026Global merchant sliceNo clean public standalone SAM or SOMN/AEvidence-constrained conclusion from the source packmediumRequires management disclosures on deployments, cards per cluster, and buyer mix

These lenses are intentionally non-additive. The first three are market definitions with different denominators; the fourth and fifth are deployment economics / capability lenses; the last row records the still-missing public SAM/SOM rather than inventing one.

[CM022, CM023, CM024, CM025, CM026, CM027]
FM001: Market sizing lens

Layered lenses narrow from broad generative-AI spend to the much smaller deployment-cost and merchant-hardware wedge that is actually relevant to d-Matrix.

The layers are not a nested dataset. They represent progressively narrower commercial lenses in the same $B unit: broad spend, accelerator hardware, inference market, and a concrete large-deployment cost example. The purpose is to force boundary discipline, not to imply the values sum or cascade mechanically.

[CM022, CM023, CM024, CM025, CM026, CM040]
FM002: Market estimate range

Public $B lenses vary sharply depending on whether the denominator is broad GenAI spend, inference market revenue, accelerator hardware, or a single large deployment cost example.

All rows use the same $B unit, but they do not describe the same market surface. The midpoint for the AI inference row is arithmetic rather than publisher-provided and is included only to visualize dispersion between the 2025 and 2030 endpoints.

[CM022, CM023, CM024, CM025, CM026, CM045]

2.5 Contradictory Benchmark and Market-Share Narratives

The chapter’s core contradiction is that the inference opportunity is clearly expanding while the public evidence for share capture still overwhelmingly favors incumbents. d-Matrix’s own materials and a partner-backed Gimlet workload suggest real performance advantages on memory-bound or heterogeneous inference stages, especially where small-batch latency matters. But standardized benchmark coverage, mainstream GPU product pages, and market-share estimates still show Nvidia setting the visible frontier in most public comparisons. Even the market-share numbers disagree with one another because their denominators change: one source talks about AI chips, another about discrete GPUs, and another about total accelerators including custom silicon. That does not make the startup thesis wrong; it means the thesis must be much more specific. Investors should underwrite d-Matrix as a wedge into merchant, latency-sensitive, open-standards inference deployments where buyers care about predictable TCO and deployment fit—not as a generic claim that it will displace the GPU stack across all inference workloads.[CM028, CM029, CM030, CM031, CM032, CM033]

2.6 Exhibits

Chapter 03

03Competitors

3.1 Competitive Landscape Overview

d-Matrix sits inside a crowded but still-fluid AI inference market. The most important direct alternatives are not one homogeneous peer set: NVIDIA and AMD sell general-purpose accelerator stacks that can run both training and inference; Groq, Cerebras, and SambaNova pursue inference-first or tightly integrated alternatives; and the status quo for many buyers is still internal build on incumbent GPU clouds or private racks. Independent market coverage keeps circling the same structural pressures: inference workloads are growing faster than buyers can absorb power, cooling, memory, and GPU procurement costs, and that is why challengers such as d-Matrix get attention in the first place. d-Matrix's own pitch is that Corsair cards, JetStream NICs, and Aviator software attack the memory-movement penalty that makes large-model inference expensive. The company therefore competes less as a generic GPU replacement than as a purpose-built answer to the memory-bound, latency-sensitive slice of inference demand.[CP001, CP002, CP025, CP026, CP029, CP032]

Competitor Profile Table
CompetitorCategoryScale / FundingTarget SegmentKey DifferentiationPrimary Limitation
d-MatrixInference-only challenger$450M total raised per 2025 coverageEnterprise and datacenter LLM inferenceDigital in-memory compute; Corsair cards + JetStream NICs + Aviator software; standard-server insertion pointPublic benchmark proof and production customer depth remain limited
NVIDIA H100/H200/GB200Incumbent GPU stackDominant enterprise and cloud installed base; ~75-92% share estimates depending source and yearTraining plus inference from enterprise to hyperscaleCUDA/TensorRT/NIM, NVLink, InfiniBand, OEM and cloud reachHigh capex, power and cooling burden; buyers can still be overprovisioned for inference-only use cases
AMD MI300XMainstream GPU alternativeScaled through major OEM server channelsHyperscale and enterprise inference needing more memory per GPU192GB HBM3, 5.3TB/s bandwidth, ROCm software, same-server procurement motion as GPUsWeaker software incumbency than CUDA and less end-to-end platform mindshare
GroqInference service challengerPrivate company; public, private and co-cloud plans plus on-prem optionInteractive low-latency inference, regulated and air-gapped workloadsLPU architecture, predictable spend, GroqRack on-prem continuityDifferent compiler/runtime path than standard GPU stacks
CerebrasIntegrated large-model systemLarge private AI chip company with wafer-scale platformVery large model training and inferenceWafer-scale engine, enormous on-device compute, system-level memory strategySpecialized appliance model is less drop-in than standard racks
SambaNovaIntegrated enterprise platform$350M+ 2026 round per independent guideAgentic AI and enterprise inference deploymentsHardware, software, and models tuned together; SN50 positioned against Blackwell on price-performancePlatform adoption requires buying into a proprietary full stack
Internal build / custom siliconSubstitute / status quo alternativeHyperscaler and large-enterprise capex budgets rather than vendor fundingLargest buyers with data-governance, latency, or utilization reasons to own the stackCan optimize around proprietary workloads and avoid vendor dependenceRequires huge engineering, procurement, and operations capability

Coverage is partial and focused on the most decision-relevant options for 2026 enterprise inference buying: incumbent GPU stacks, inference-first challengers, and internal-build substitutes. Scale fields use public funding, share, or deployment context only where disclosed.

[CP001, CP013, CP014, CP018, CP021, CP023]
FP001: Competitive Positioning Map

Ordinal map of ecosystem leverage (x-axis) versus inference-specific specialization (y-axis) across d-Matrix and the most material alternatives.

Axes are ordinal estimates derived from public evidence, not a vendor-published numeric scale. X-axis measures ecosystem, channel, and deployment leverage; Y-axis measures how specialized each offer is for inference economics or latency-sensitive serving.

[CP018, CP021, CP023, CP029, CP032, CP040]

3.2 Incumbent GPU Stacks: NVIDIA and AMD

NVIDIA remains the benchmark d-Matrix must beat in practice, even when the buyer says it wants "better inference economics" rather than "the fastest raw cluster." H100, H200, and GB200 span the full ladder from single-GPU deployment to rack-scale NVLink domains, and NVIDIA layers software, networking, and enterprise support on top of the silicon. H100 anchors the installed base with NVLink, InfiniBand, Magnum IO, and AI Enterprise; H200 extends the moat by pushing memory capacity and bandwidth higher for large-model inference; GB200 raises the competitive bar again by collapsing 72 GPUs into a liquid-cooled rack-scale system that NVIDIA markets as a single massive inference machine. AMD is the clearest mainstream alternative because MI300X offers more memory per GPU, high bandwidth, and ROCm rather than CUDA. But AMD still competes inside the same OEM and datacenter procurement motion as NVIDIA, which means it pressures incumbent economics without fully changing how the buyer evaluates or deploys inference infrastructure.[CP006, CP007, CP008, CP009, CP010, CP011]

Feature / Capability Matrix
Buying Criteriond-MatrixNVIDIAAMDGroqCerebrasSambaNova
Small-batch interactive latency focusHighMediumMediumHighMediumMedium
Large-model memory footprint per unit / systemMediumHighHighMediumHighHigh
Mature software ecosystem and framework portabilityEmergingVery highModerateModerateModerateModerate
Rack-scale networking and OEM availabilityEmergingVery highHighModerateModerateModerate
Air-gapped / on-prem deployment pathPartner-ledHighHighHighHighHigh
Pricing transparency / usage-based packagingLowLowLowMediumLowLow
Training-ecosystem reuse for inference buyersLowVery highHighLowLowLow

Ratings are author assessments synthesized from official product pages and independent market analysis. They summarize buyer-relevant fit rather than a standardized benchmark score, and "Emerging" or "Low" often reflects public-proof gaps rather than proven technical absence.

[CP013, CP014, CP017, CP018, CP020, CP022]

3.3 Specialized Inference Challengers

The non-GPU challenger field is fragmented, which matters because d-Matrix is not competing against a single "inference chip" archetype. Groq packages low latency as a service with public, private, co-cloud, and on-prem options; it is the cleanest contrast to d-Matrix on deployment model and spend predictability. Cerebras goes in the opposite direction, using a wafer-scale engine and very large system memory so customers can keep far more of a model local to a specialized appliance. SambaNova bundles hardware, software, and tuned models into an integrated platform and positions that bundle directly against NVIDIA Blackwell on price-performance. Independent landscape surveys therefore frame the market as a set of different answers to the same question: should buyers solve inference with broad GPU ecosystems, with service-led low-latency specialists, with giant integrated systems, or with memory-local cards that fit more easily into standard enterprise servers? d-Matrix has a real seat at that table, but it does not yet own the default category narrative.[CP018, CP019, CP020, CP021, CP022, CP023]

Pricing / Packaging Comparison
Vendor / OptionPrice / Unit / Contract ModelIncluded CapabilitiesDiscount / UnknownsCompetitive Implication
d-MatrixNo public list price; company-quoted outcome is 3x lower cost versus GPU-based systemsCorsair accelerators, JetStream NICs, Aviator software, standard PCIe/server deploymentRealized pricing, utilization assumptions, and support terms are undisclosedStrong if validated in production, but public TCO proof is still thin
NVIDIA H100 / H200 stackHardware plus enterprise-software and OEM/cloud contracts; public list pricing generally opaqueCUDA, TensorRT, Triton, AI Enterprise/NIM, NVLink, InfiniBand, broad OEM availabilityBulk discounts and cloud pass-through pricing are opaqueDefault procurement path for buyers who value ecosystem certainty over inference-specific specialization
NVIDIA GB200 NVL72Rack-scale system sale rather than commodity card purchase72-GPU NVLink domain, liquid cooling, Mission Control, AI-factory toolingPublic transaction pricing not disclosed; facility upgrades can be materialRaises the performance bar but also narrows the buyer pool to very large deployments
AMD MI300XSold through OEM server programs and cloud contracts rather than public street pricing192GB HBM3, ROCm stack, OAM module packaging, OEM server availabilityNet pricing and support bundles are not publicCompetes by improving memory economics inside familiar GPU buying motions
GroqFree, Developer, and Enterprise plans; usage-based and custom enterprise contractsPublic/private/co-cloud access, regional endpoints, prompt caching, optional GroqRack on-premPer-model enterprise economics are negotiated and not fully publicMost transparent service-style packaging among the challengers
Cerebras / SambaNovaSystem-level enterprise contracts; pricing not publicly normalizedIntegrated hardware plus software stack, tuned large-model serving, enterprise supportActual token economics and appliance pricing are mostly privateCompete on turnkey integrated value rather than open component comparison
Internal build on incumbent GPU cloudHistorically can mean very high monthly bills plus datacenter upgrades or huge GPU reservationsMaximum workflow control and reuse of incumbent ML toolingTrue cost depends on model size, traffic, utilization, and reserved capacityThe substitute remains expensive, but it is familiar and therefore sticky

Public pricing is sparse across inference hardware, so this table emphasizes packaging, contract form, and what is visibly bundled. Where a row cites cost impact, it distinguishes vendor-authored claims from independent examples of GPU-era operating expense.

[CP001, CP018, CP019, CP027, CP028, CP035]
FP002: Feature Breadth / Capability Map

Buyer-fit view of where each vendor class is strongest for common inference workload requirements.

Cells summarize best-fit orientation from public materials and independent analysis; they should be read as comparative buyer-fit, not benchmarked performance scores.

[CP020, CP022, CP024, CP029, CP030, CP031]

3.4 Switching Costs, Distribution, and Internal Build

The central competitive question is not whether d-Matrix can make a strong silicon argument; it is whether it can overcome the software, networking, and channel inertia that keeps enterprises on incumbent platforms. NVIDIA's advantage is deeply systemic: buyers already know how to buy GPU clusters, cool them, connect them, program them with CUDA and TensorRT, and source them through familiar OEM and cloud channels. AMD weakens that moat at the margin with ROCm and more memory per GPU, but it still lives in the same rack and server ecosystem. Groq, Cerebras, and SambaNova attack the problem by changing the packaging model—API-led service, giant appliance, or fully integrated platform. d-Matrix sits between those camps. Its modular chiplet and card-based approach could lower insertion friction, but the public record still lacks the customer references, price sheets, and benchmark normalization that would prove buyers are willing to switch, multi-home, or standardize on it at production scale. Internal build is also a real substitute as hyperscalers and large enterprises increasingly design or co-design inference silicon around their own workloads.[CP027, CP028, CP030, CP033, CP034, CP035]

3.5 Moat Durability and Adverse Evidence

d-Matrix has a credible but still-provisional moat. The strongest pillar is technical fit: multiple sources tie inference cost and power pain to memory movement, and d-Matrix's digital-in-memory-compute plus chiplet interconnect strategy is explicitly built around that bottleneck. A second pillar is modularity; Jayhawk, cards, NICs, and software imply a heterogeneous deployment story that could be easier to insert than a proprietary super-appliance. But adverse evidence is meaningful. NVIDIA still owns the most mature benchmark record, software stack, and distribution footprint. AMD keeps narrowing the memory argument. Groq has the cleanest public packaging for predictable low-latency service. Cerebras and SambaNova sell more tightly integrated systems for very large models. Most importantly, d-Matrix's public performance and cost claims remain largely vendor-authored rather than normalized by an independent, apples-to-apples benchmark suite. That leaves moat durability at "moderate": differentiated enough to matter, but not yet durable enough to ignore incumbent or integrated-system counterattack risk.[CP037, CP038, CP039, CP040]

Moat Durability / Competitive Risk Register
Moat ClaimThreatSeverityMitigation / What Would HelpDiligence Ask
Digital in-memory compute better matches inference’s memory bottleneckH200/GB200/next-gen GPU memory and bandwidth gains narrow the efficiency gapHighIndependent same-model energy and latency benchmarks versus H100/H200/MI300XWhat public benchmark shows d-Matrix’s claimed gain under the same model, batch, and latency target?
Modular chiplet plus card/NIC architecture lowers deployment frictionBuyers may still prefer proven NVLink/NIM/OEM stacks or turnkey challenger appliancesHighMore named OEM systems and production customer referencesWhich server OEMs ship production-ready d-Matrix systems with supported software images today?
Inference-only focus gives d-Matrix a sharper product thesis than general GPUsTraining-plus-inference platforms still win procurement because they are multipurposeMediumProof that buyers separate inference budgets from general AI infrastructure budgetsHow often do enterprises buy d-Matrix specifically for inference instead of extending existing GPU clusters?
Claimed lower cost and energy improve ROI for scale inferenceClaims remain largely vendor-authored and may not survive apples-to-apples normalizationHighThird-party benchmark or public customer bill-of-materials evidenceWhat utilization, model size, and software assumptions sit behind the 3x-lower-cost claim?
Heterogeneous-system strategy could fit regulated or private deploymentsGroq already has clearer cloud-to-on-prem packaging, while Cerebras/SambaNova sell more integrated systemsMediumShow repeatable on-prem reference architecture and support modelDoes d-Matrix have a public deployment path as crisp as GroqRack or a turnkey appliance rival?
Inference market growth creates room for challengersInternal build and custom silicon can absorb the highest-value workloads before d-Matrix doesMediumFocus on customers lacking hyperscaler-scale design budgetsWhich customer segments are least likely to replace third-party inference hardware with internal silicon?

Severity is a qualitative judgment based on the public record available on 2026-05-26. This is a durability register, not a probability forecast, so it prioritizes the threats most likely to compress d-Matrix’s differentiation if left unanswered.

[CP028, CP033, CP035, CP037, CP038, CP039]
FP003: Moat / Readiness KPIs

Compact snapshot of the key numbers shaping d-Matrix’s competitive durability versus incumbent and challenger alternatives.

[CP001, CP003, CP008, CP014, CP032]

3.6 Exhibits

Chapter 04

04Financials

4.1 Revenue Model and Monetization Surfaces

The public record points to a hardware-platform revenue model rather than a software-only or API-only business. D-Matrix repeatedly sells the story as a full-stack inference platform built from Corsair accelerators, JetStream networking, and Aviator software, with commercialization routed through hyperscale, enterprise, sovereign, OEM, and system-integrator channels. That matters because revenue quality will likely depend on hardware shipments and multi-component deployment wins, not self-serve software adoption. The GTM evidence is also distinctly qualification-heavy. Series B was framed as the round that would let the company begin commercializing Corsair, while the 2024 and 2025 product announcements still leaned on sampling, early-access customers, OEM qualification, and partner-configured systems. SquadRack, SuperNODE, and the Gimlet cloud integration all imply larger contract values than a single card sale could support, but they also imply shared economics across server, networking, and cloud partners. Crucially, d-Matrix does not publish Corsair list pricing, JetStream pricing, software license pricing, or a booked-customer count. Even the ecosystem page warns that partner logos are illustrative and not actual customers. So the monetization surface is real and widening, but public evidence still shows channel formation and pilot conversion more clearly than realized revenue.[CI001, CI002, CI003, CI004, CI005, CI006]

Revenue streams table
StreamMechanismBuyer / routeCurrent public statusRevenue qualityDiligence ask
Corsair acceleratorsSale of PCIe accelerator cards and dual-card configurationsEnterprises, hyperscalers, sovereign clouds, OEMs, system integratorsSampling in late 2024; broad availability targeted for Q2 2025Likely high-ACV hardware revenue, but no public ASP, volume, or gross marginRequest shipped units, ASPs, gross margin, and pilot-to-production conversion
JetStream NICsSale of dedicated networking cards that scale Corsair clustersSold with multi-node deployments and rack solutionsSamples available in Sep 2025; full production targeted by year-end 2025Potential attach revenue, but attach rate and pricing are undisclosedRequest NIC attach rate, BOM, and realized pricing by deployment size
Aviator softwareInference software bundled with hardware and deployment toolsEnterprise and OEM deployment teamsRepeatedly marketed as part of the full-stack platform; no standalone price disclosedCould create recurring software revenue, but public evidence only supports bundle-level monetizationRequest license model, maintenance terms, and renewal or support revenue
Integrated rack and server solutionsPartner-configured systems such as SquadRack and GigaIO SuperNODEOEM and infrastructure channel partnersReference architectures and joint solutions are public; booked revenue is notHigher contract value is plausible, but economics may be shared with server and networking partnersRequest who invoices the customer, system ASP, and partner revenue split
Heterogeneous cloud accessSelect-customer access to Corsair through Gimlet Cloud alongside GPUsAI-native model providers and cloud buyersTargeted for select customers in 2H 2026Could become usage-based proof, but current evidence is still early-access and benchmark orientedRequest contract structure, revenue share, and pilot-to-production conversion rates

Rows are restricted to monetization surfaces that are explicitly visible in public product, partner, and deployment materials. No public source in the pack discloses realized pricing, revenue mix, or contract duration.

[CI001, CI006, CI007, CI008, CI009, CI010]
Pricing / monetization table
OfferPublic list pricePublic performance / cost claimRealized pricing visibilityMonetization implicationSource status
Corsair card hardwareNot publicly disclosedUp to 10x faster interactive speed and 3x better cost-performance versus GPUsNonePricing power cannot be tested without a price sheet or customer deal dataOfficial product claims exist; public pricing does not
JetStream networkingNot publicly disclosed400 Gbps NIC plus 3x cost-performance claim when paired with CorsairNoneMulti-node attach may lift ACV, but networking BOM also risesSpec sheet exists; price sheet does not
Aviator softwareNot publicly disclosedEnterprise-grade software stack that is part of the platform storyNoneRecurring software upside may exist, but standalone licensing is not publicly visibleBundled marketing only
GigaIO SuperNODE + CorsairNot publicly disclosed64+ Corsairs in a single node with lower deployment complexity and better TCONoneImplies system-level contract potential, but no public partner pricing or revenue splitPartner promotional
Gimlet Cloud accessNot publicly disclosed2x-10x latency improvement at equal energy efficiency in heterogeneous inference benchmarkNoneCould create usage-based revenue, but customer pricing and margin remain unannouncedSelect-customer early access only

This table intentionally separates availability and performance language from actual monetization. No reviewed source provides a public Corsair, JetStream, software, or integrated-system list price.

[CI008, CI017, CI019, CI024, CI036]
FI001: Revenue model bridge

Public evidence suggests a qualification-led revenue chain that starts with workload fit, moves through partner-enabled hardware deployment, and only then expands into software and larger system revenue.

The sequence is inferred from public commercialization language rather than management-reported conversion data. Pricing, attach rates, and contract duration remain undisclosed.

[CI001, CI006, CI007, CI008, CI010, CI011]

4.2 Unit Economics and Cost Drivers

D-Matrix’s unit-economics case is easy to describe but still only partially public. Official materials consistently claim up to 10x faster interactive speed, 3x better cost performance, and 3x to 5x better energy efficiency than GPU-based alternatives, while third-party Gimlet testing adds a more specific workload proxy: speculative decode offload to Corsair produced 2x to 10x lower end-to-end request latency at the same energy efficiency. That is meaningful evidence that the product can improve customer economics on memory-bound inference stages. But it is not the same as proving d-Matrix’s own gross margin or payback profile. The white paper and JetStream brief explicitly label cost and power figures as preliminary, and both documents reveal why system economics cannot be reduced to chip benchmarks alone. JetStream adds a dedicated 400 Gbps NIC, optics or DACs, power draw, switch ports, and rack-integration work. The Corsair server and rack specs imply larger average deal sizes, but they also imply server, networking, and support costs that sit outside the accelerator die itself. External market proxies reinforce why this matters: AI inference can consume extraordinary operating budgets, while GPU infrastructure remains expensive, power hungry, and supply constrained. D-Matrix may address those economics, but public proof still lives at the benchmark and architecture layer, not the P&L layer.[CI013, CI014, CI015, CI016, CI017, CI018]

Unit economics table
Metric / proxyPublic value / statusConfidenceWhy it mattersDiligence ask
Claimed cost-performance improvement3x better performance / TCO versus GPU-based alternativesmediumThis is the core ROI claim for enterprise buyersReproduce on customer workloads with full system BOM
Claimed energy efficiency improvement3x to 5x better energy efficiency versus GPU-based alternativesmediumPower and cooling are central to inference economicsCompare rack-level power including switches, optics, and idle overhead
Claimed interactive speed improvementUp to 10x faster interactive speed; 30K tokens/s at 2 ms/token for Llama 70BmediumSupports the thesis that latency-sensitive workloads justify specialized hardwareRequest benchmark methodology, prompt mix, and concurrency assumptions
JetStream networking overhead400 Gbps NIC, PCIe Gen5 x16, max 150W TDP, supplemental power inputhighNetworking is part of the cost stack, not a free add-onRequest node-level power budget and transceiver cost
Server and rack density proxy8-card server up to 2 TB capacity memory; 8-node rack for 100B models; GigaIO says 64+ Corsairs in a nodemediumPotential deal size is large, but deployment complexity and rack cost rise tooRequest typical cards-per-deal, node count, and installed rack configuration
Third-party heterogeneous benchmark proxyGimlet reports 2x-10x lower latency at equal energy efficiency for speculative decode offload; 1.6B draft model fits on 2 cardsmediumBest current public proof that specialized offload can improve real workload economicsRequest production throughput, utilization, and win-rate data by workload

The table mixes vendor claims, published specifications, and third-party benchmark proxies. It should not be read as d-Matrix reported gross margin, CAC, or payback data.

[CI014, CI015, CI016, CI017, CI018, CI019]
FI002: Unit economics bridge

D-Matrix’s public ROI case is built around moving memory-bound inference stages onto specialized hardware while keeping the rest of the system compatible with standard racks and networks.

This figure translates public architecture and benchmark evidence into a commercial logic chain. It is not a reported cost model and should not be read as a gross-margin bridge.

[CI017, CI018, CI024, CI025, CI033, CI050]
FI003: Financial estimate range

Fixed disclosed values and public benchmark ranges frame the financial discussion: capital is known, but most operating metrics remain undisclosed, so the usable public ranges are on claimed efficiency rather than company P&L.

The first two rows are fixed disclosed values. The latter rows are benchmark or vendor-claim ranges, not audited financial metrics. No public range exists for d-Matrix revenue, burn, or gross margin.

[CI002, CI017, CI024]

4.3 Capital Intensity, Partner Dependence, and Financing Dependency

Capital formation is the clearest part of the financial picture. D-Matrix has publicly disclosed $450 million of total funding, including a $110 million Series B in 2023 and a $275 million Series C at a $2 billion valuation in 2025. Both rounds were explicitly tied to commercialization, roadmap execution, hiring, global expansion, and large-scale deployments. That is positive because the company is not trying to scale a complex inference hardware stack on a shoestring. It is also a warning sign: the operating plan still appears funding-backed more than customer-funded. The roadmap itself is capital intensive. Corsair relies on chiplets, high-bandwidth memory integration, server qualification, and partner distribution. JetStream adds dedicated networking hardware. The next-generation 3DIMC roadmap adds an explicit dependency on Alchip for ASIC design, advanced packaging, and manufacturing management. Public comp filings show what that usually means in practice: inventory commitments, deposits, long lead times, board and memory costs, and margin pressure when demand timing shifts. D-Matrix may benefit from using partners rather than vertically owning every system component, but that simply trades some fixed cost for execution dependence on Supermicro, Arista, Broadcom, GigaIO, cloud partners, and advanced-packaging suppliers. Without public cash, burn, or runway, investors cannot tell whether the current balance sheet comfortably covers those scale-up requirements or merely bridges the company to its next financing event.[CI002, CI003, CI004, CI005, CI019, CI020]

Capital adequacy table
ItemPublic value / statusEvidenceUnderwriting read-throughDiligence ask
Total disclosed capital raised$450MSeries C official release, investor releases, and independent coverage alignStrong capitalization for a private inference hardware company, but not proof of self-funding operationsReconcile cap table, primary vs. secondary proceeds, and liquidation preferences
Latest round size / valuation$275M Series C at $2B valuationOfficial plus Bullhound, QIA, and news coverageShows continued investor access and valuation support in late 2025Request post-money ownership, board rights, and preference stack
Series B bridge capital$110M in 2023 to begin commercializing CorsairOfficial Series B releaseConfirms multiple years of pre-revenue or early-revenue buildout before the 2025 raiseRequest spend bridge from Series B to Series C
Disclosed use of fundsRoadmap, hiring, global expansion, and large-scale deploymentsOfficial Series B and Series C materialsCapital is earmarked for execution and scale-up rather than disclosed profitabilityRequest 24-month operating plan and deployment capex plan
Cash on hand / burn / runwayNot publicly disclosedNo figure in reviewed official, investor, or independent sourcesRunway and next-round timing cannot be tested from public materialsRequest current cash balance, monthly burn, and runway under base and downside scenarios
Debt / project finance / purchase commitmentsNot publicly disclosed for d-Matrix; public hardware comps show these items can affect marginsD-Matrix disclosure gap plus Nvidia filing proxyHardware scaling may hide working-capital needs even if fixed capex is partner-assistedRequest supplier deposits, NRE, inventory commitments, warranties, and debt facilities

The disclosed funding stack is factual, but capital adequacy remains only partially visible because public sources do not provide cash, burn, runway, or supplier-commitment detail.

[CI002, CI003, CI004, CI005, CI039, CI040]
FI004: Capital intensity / cash-flow map

The main financial burden sits at the system and roadmap level: silicon and packaging, server and rack integration, networking, and customer validation all require capital or partner execution before recurring operating proof appears.

This is an evidence-backed qualitative matrix rather than a reported cash-flow statement. It highlights where capital need and execution dependence are visible in the source pack.

[CI011, CI014, CI020, CI022, CI041, CI042]

4.4 Financial Verdict and Disclosure Gaps

The right financial verdict is cautious but not dismissive. Public evidence supports a serious capital base, a coherent revenue mechanism, and credible workload-level reasons customers might buy the product. It does not support a traditional underwriting view on current revenue quality, gross margin, sales efficiency, or runway. D-Matrix has disclosed funding, valuation, commercialization milestones, partner architectures, and benchmark-style efficiency claims, but it has not publicly disclosed revenue, ARR, gross margin, customer count, repeat orders, list pricing, realized ASPs, software attach rates, cash balance, burn, runway, debt, or supplier commitments. That disclosure gap matters more here than it would in a pure software startup, because this business depends on hardware manufacturing, packaging, networking, server integration, and deployment timing. The resulting investment case is therefore forward-looking and financing-dependent. If management can show booked revenue, repeat deployments, margin by hardware and software layer, and a runway that covers the 3DIMC and deployment roadmap, the funding stack looks like a real launchpad. Without those disclosures, the round history mainly proves that the company can raise capital, not that it has already crossed the line into repeatable, self-sustaining economics.[CI036, CI037, CI038, CI039, CI040, CI041]

Public financial gaps table
Missing metricWhat public evidence saysWhy it mattersExact diligence path
Revenue / ARRNo public figure found in official, investor, or independent sourcesImpossible to anchor underwriting on current commercial scale or valuation multipleRequest monthly and annual revenue by product line, geography, and customer cohort
Gross margin / contribution marginNo public figure foundCannot test whether hardware, networking, and software mix can support durable profitabilityRequest gross margin bridge by Corsair, JetStream, Aviator, and integrated systems
Realized ASP / discountsNo public Corsair or JetStream price sheet foundCannot translate performance claims into deal economics or discount strategyRequest price book, discount bands, and pilot versus production ASPs
Software attach / recurring revenueAviator is marketed as part of the platform, but its licensing model is undisclosedRecurring mix could materially change revenue quality and multipleRequest percentage of deals with paid software, annual contract value, and renewal terms
Customer count / repeat ordersPartners and benchmarks are visible, but broad production-customer evidence is notCannot judge demand depth, repeatability, or concentration riskRequest named production customers, repeat orders, deployed cards or nodes, and backlog
Cash / burn / runway / commitmentsNo public cash balance, burn, runway, debt, or supplier-commitment data foundFinancing dependency and insolvency timing cannot be underwritten from the public recordRequest cash waterfall, monthly burn, purchase commitments, and next-round trigger

This table is the main underwriting blocker list. The missing metrics are normal for a private company, but they are especially consequential in a hardware business with manufacturing, networking, and deployment dependencies.

[CI036, CI037, CI038, CI039, CI040, CI041]

4.5 Exhibits

Chapter 05

05Product & Technology

5.1 Product Surface and Workload Fit

d-Matrix is no longer just pitching an abstract chip architecture. The current customer-facing product surface is a full inference stack: Corsair accelerator cards, JetStream networking cards, Aviator software, and SquadRack as the rack-level reference blueprint. In customer workflow terms, the company is targeting interactive chat, coding, agentic, reasoning, and multimodal workloads where low latency matters more than brute-force benchmark throughput. The product page is unusually specific for a startup hardware vendor because it publishes single-card, dual-card, server, and rack configurations with concrete memory-bandwidth and capacity figures. That specificity is a real product-tech strength because it gives buyers and diligence teams something testable instead of a vague platform promise. The maturity caveat is that most public proof still sits in official briefs, partner announcements, and reference-architecture pages rather than in named production customer deployments. The architecture signal is strong; the customer-proof signal is still comparatively thin. Recent company essays also sharpen the workload thesis by arguing that longer context windows, reasoning chains, and agent loops turn inference into a context-and-latency problem, not just a raw-throughput contest.[CE001, CE002, CE003, CE004, CE005, CE036]

d-Matrix Product Module and Asset Matrix
Module / product linePrimary userStatus / maturityDifferentiationDiligence gap
Corsair single/dual-card acceleratorsPlatform architects and enterprise AI infrastructure teamsCurrent flagship product with public card, server, and rack specsDIMC plus on-chip SRAM in PCIe cards designed for low-latency decode and interactive inferenceNamed production customers and field reliability data are not public
JetStream transparent NICOperators scaling Corsair across nodes and racksCurrent networking product with samples announced and detailed spec publishedDevice-initiated accelerator-to-accelerator transfers over standard Ethernet rather than host-mediated RDMA flowBroad production proof and compatibility matrix are not public
Aviator software stackML engineers, infra engineers, and platform teamsCurrent software layer with documented build and execution flowsCo-designed stack spanning model factory, compression, compiler, runtime, Kubernetes plugin, and observability toolsSupported-model matrix, SLA posture, and customer support scope are not public
SquadRack reference architectureCloud, sovereign-cloud, and enterprise datacenter operatorsReference blueprint rather than a fully closed applianceAir-cooled rack design using Corsair, JetStream, standard PCIe servers, and standard Ethernet fabricsReference architecture is not the same thing as broad GA deployment
Pavehawk test silicon and Raptor roadmapFuture buyers needing larger-memory inference footprintsRoadmap stage; public validation is architectural, not commercial3DIMC roadmap adds 3D DRAM to extend the low-latency memory-centric architecture beyond CorsairIndependent benchmarks, production timing, package yield, and customer commitments are not public

This matrix separates what is currently productized from what is still roadmap. Status language reflects the public corpus as of 2026-05-26, not management aspiration.

[CE001, CE002, CE005, CE008, CE013, CE023]
FE001: d-Matrix Product Architecture Map

Layered view from target workload to software, networking, compute-memory hardware, and the future memory-scaling roadmap.

The layer boundaries are analytical rather than company-authored. The figure is meant to show how product, software, networking, and roadmap fit together as one platform.

[CE001, CE008, CE010, CE013, CE023, CE026]

5.2 DIMC, JetStream, and Aviator Architecture

The core technical thesis is that modern inference is dominated by memory movement, especially during decode, so d-Matrix optimized for memory-centric execution rather than trying to look like another GPU. Corsair pairs DIMC compute with large on-chip SRAM and capacity memory, then scales laterally through chiplets, DMX Link, and DMX Bridge. Aviator matters as much as the silicon because it handles model conversion, compression, compilation, runtime execution, cluster orchestration, and developer tooling. Recent technical essays extend that framing beyond the chip itself, arguing that context pressure, KV-cache reduction, and optimization across software, networking, and infrastructure are necessary complements to DIMC rather than side projects. JetStream extends the same low-latency philosophy to multi-node communication by separating data and control planes and enabling device-initiated accelerator-to-accelerator transfers over standard Ethernet. The most credible part of the technical story is that the hardware, networking, and software layers are described in enough detail to form a coherent operating model. The main diligence question is not whether there is an architecture, but how broadly that architecture has been validated outside d-Matrix and a small set of technical partners.[CE006, CE007, CE008, CE009, CE010, CE011]

Workflow and Use-Case Table
User jobCurrent workflow paind-Matrix solutionMeasurable benefitLimitation
Interactive chat or coding assistantGPU-only decode can become latency-sensitive and queue-sensitive at small batchesRun low-latency decode on Corsair while keeping the deployment in standard PCIe serversCompany cites up to 60,000 tokens/s at 1 ms/token for Llama3 8B in a single serverThis performance figure is company-reported and workload specific
70B-class on-prem inference serviceLarge models outgrow a single card and create rack-level memory and networking pressureUse eight-server Corsair racks and JetStream-based node-to-node scalingCompany cites 30,000 tokens/s at 2 ms/token for Llama3 70B and 100B-parameter scale targetsPublic proof is strongest in reference architectures and partner deployments, not in named customer rollouts
Heterogeneous pipeline with large verifier modelGPU-only systems must compromise between fast decode and high-quality large-model verificationUse Corsair for the memory-bound speculative draft stage and keep the larger verifier on GPUGimlet reports 2-5x request speedup at equal energy for one speculative-decoding setupIndependent validation is narrow and does not cover every inference phase
Enterprise retrofit into existing datacenterNew AI systems often require proprietary fabrics or heavy infrastructure changesDeploy PCIe cards plus Ethernet switching in existing datacenter footprintsSquadRack is air-cooled and built around standard servers and top-of-rack Ethernet switchesDeployment quality still depends on third-party server and switch integration
Single-node scale-up before multi-node complexityDistributed inference can add coordination overhead and operating complexityUse GigaIO SuperNODE to host dozens of Corsair cards inside one nodePartner claims lower multi-node complexity and cites 30,000 tokens/s at 2 ms/token for Llama3 70BSingle-node partnership claims are partner-reported rather than broadly benchmarked

Benefits mix official claims and one independent practitioner benchmark. The table is about workflow fit, not a comprehensive benchmark bake-off.

[CE005, CE012, CE015, CE016, CE017, CE018]
Technology and Operating Architecture Table
Layer / processRoleKey dependencyKey risk
DIMC cores with on-chip SRAMKeep memory close to compute for low-latency token generation and interactive decodeSuccessful mapping of inference kernels and data movement into the DIMC memory hierarchyArchitecture is differentiated, but broad third-party workload validation is still limited
Capacity memory tierExtend model and context footprint beyond on-chip SRAM without abandoning the memory-centric designAviator scheduling and runtime policies that decide what sits in fast versus capacity memoryPerformance depends on workload fit and on how much data must still move between tiers
Chiplets plus DMX Link and DMX BridgeScale from one chiplet pool to dual-card and rack-level configurationsPackaging, interconnect yield, and card qualification across OEM systemsChiplet and bridge complexity can become a packaging, signal-integrity, or qualification bottleneck
JetStream transparent NICProvide low-latency inter-node accelerator communication without host-mediated RDMA overheadStandard Ethernet optics, switches, PCIe Gen5 hosts, and JetStream firmwareProduction networking proof is newer than Corsair compute proof
Aviator software stackConvert models, compress graphs, compile binaries, orchestrate distributed execution, and expose toolingModel support, compiler maturity, runtime stability, and developer enablementSoftware quality is central because hardware value is only realized through the stack
Rack and server ecosystem layerTurn cards and NICs into deployable systems through Supermicro, GigaIO, Liqid, Arista, and BroadcomPartner qualification, support handoffs, and standard-server supplyOpen standards reduce lock-in but increase integration dependence
3DIMC on Pavehawk and RaptorPush the architecture toward larger-memory inference using stacked DRAMAlchip packaging and ASIC work, foundry capacity, and successful test-silicon translation into product siliconRoadmap value is high, but commercial timing and external validation remain unproven

This table focuses on the operating stack needed to turn Corsair into a usable platform. Dependencies are called out explicitly because d-Matrix is not selling a closed appliance today.

[CE006, CE007, CE008, CE010, CE011, CE020]
FE002: Customer Workflow and Operating Flow

How a model moves from developer input into a heterogeneous d-Matrix deployment and then serves low-latency inference.

This flow synthesizes the published Aviator build and execution model with d-Matrix's own heterogeneous prefill/decode narrative. It is specific to the reviewed corpus, not an official product diagram.

[CE008, CE010, CE017, CE018, CE045, CE046]

5.3 Deployment, Rack Architecture, and Supply Dependencies

d-Matrix is trying to make deployment easier by leaning into standard PCIe servers, standard Ethernet fabrics, and ecosystem partners instead of demanding a proprietary rack rebuild. SquadRack and GigaIO SuperNODE show the intended server and rack topologies: scale-up inside the node where possible, then scale out over Ethernet when model size or request volume grows. That should lower retrofit friction for enterprise and sovereign-cloud environments that want AI inference without hyperscaler-style liquid cooling. The tradeoff is dependency. d-Matrix depends on Supermicro, Arista, Broadcom, GigaIO, and Liqid to turn cards into deployable systems, and the roadmap depends on Alchip, foundry capacity, and advanced packaging to move from Corsair to 3DIMC-based products. Recent company and partner posts make that interdependence explicit: d-Matrix now frames open standards as a strategic choice, its own GigaIO announcement folds single-node scale-up into the official deployment story, and Alchip independently confirms the shared packaging role behind 3DIMC. The older Nighthawk and Jayhawk disclosures show this dependency pattern is not new; packaging and interconnect have always been central to the company thesis. The product can therefore be differentiated and still execution-sensitive: the architecture is open-standards friendly, but the supply chain is not self-contained.[CE013, CE014, CE015, CE021, CE022, CE023]

Roadmap and Development-Stage Table
Date / stageMilestoneStatusImplicationSource
2021 platform eraNighthawk chiplet platformHistorical predecessorShows chiplets and scale-out were part of the design thesis before Corsair launchedEE News Europe / HPCwire
January 2023Jayhawk chiplet platform announcedHistorical public milestoneIndicates a packaging and interconnect roadmap on TSMC 6nm before the commercial Corsair launchEE News Europe / HPCwire
November 2024Corsair launched with Q2 2025 broad-availability targetCurrent-generation product milestoneMarks the shift from platform thesis to shipping compute product with concrete card specsd-Matrix launch / whitepaper
May 2025GigaIO SuperNODE scale-up partnershipPartner deployment milestoneShows single-node scale-up path for dozens of Corsair cards before broader rack-level rolloutsGigaIO / Data Center Dynamics
September to October 2025JetStream launch and SquadRack blueprintCurrent-generation expansionTurns d-Matrix into a compute-plus-networking-plus-rack story rather than a card-only storyd-Matrix JetStream / SquadRack
November 20253DIMC, Pavehawk validation, and Raptor roadmapRoadmap stageExtends the company thesis toward larger-memory products but remains pre-commercial in this corpusd-Matrix Alchip announcement / Going vertical

The timeline mixes historical architecture milestones, current product milestones, and future roadmap steps. The key diligence distinction is current commercial product versus future memory-scaling roadmap.

[CE021, CE022, CE023, CE024, CE027, CE041]
FE003: Critical Dependency Map

The deployable d-Matrix platform depends on synchronized progress across silicon, packaging, software, networking, and ecosystem partners.

The dependency graph is analytical. It highlights why d-Matrix can have a differentiated architecture and still carry meaningful execution sensitivity across partners and packaging.

[CE026, CE027, CE028, CE029, CE030, CE034]

5.4 Roadmap, Differentiation, and Trust Gaps

The differentiation case is clear enough to describe: d-Matrix is attacking low-latency decode and inference-time memory pressure with a memory-centric architecture, open-standards rack design, and a future hybrid SRAM-plus-3D-DRAM roadmap. That stands in contrast to the incumbent GPU direction, which emphasizes HBM capacity, proprietary NVLink domains, and increasingly liquid-cooled rack systems. Recent company framing around context growth, agent latency, open standards, and enterprise-scale reasoning is directionally consistent with that thesis, but it still functions more as strategic argument than as independent proof. The weakness is not conceptual differentiation but underwriting confidence. The chapter corpus shows some independent practitioner validation via Gimlet, and it shows broader market appetite for specialized inference hardware, but it does not show broad third-party benchmark coverage, public reliability metrics, named production customers, or published compliance credentials. The 3DIMC, Pavehawk, and Raptor story is especially important because it is meant to extend the current architecture into larger-memory deployments; it is also still the least externally validated part of the roadmap. Investors can underwrite a concrete architecture today, but they still need diligence on scale, trust, and roadmap execution.[CE031, CE032, CE033, CE034, CE035, CE037]

Trust, Quality, and Compliance Table
Control / signalStatusScopeGap
Secure boot on JetStreamPublicly disclosedNIC-level boot integrity for the networking cardNo broader platform security architecture is published for Corsair plus Aviator plus JetStream as a system
Kubernetes plugin, metrics exporter, debugger, and profilerPublicly disclosedOperational tooling for deployment, monitoring, and developer diagnosisTooling existence is not the same as public uptime, MTBF, or support-performance data
Open-standards deployment modelPublicly disclosedPCIe cards, Ethernet networking, and standard top-of-rack switching reduce bespoke infrastructure riskOpen standards do not remove integration, firmware, or partner-qualification risk
Independent practitioner benchmark from GimletPublicly disclosed but narrowOne speculative-decoding workflow on a heterogeneous Corsair-plus-GPU setupThe validation surface is narrow and is not equivalent to broad production benchmarking
Public compliance credentialsNot disclosed in reviewed corpusEnterprise trust posture for regulated, sovereign-cloud, and security-sensitive buyersNo public SOC 2, ISO 27001, FedRAMP, or trust-center references were found in the reviewed official surfaces
Public fleet reliability metricsNot disclosed in reviewed corpusField reliability, service durability, and operator confidenceNo public MTBF, RMA, uptime, or failure-rate disclosure appears in the chapter corpus

This table distinguishes disclosed controls from missing trust artifacts. Absence refers only to the reviewed corpus and should be closed by diligence requests rather than assumed away.

[CE009, CE035, CE037, CE038, CE039, CE040]
FE004: Product Maturity and Capability Map

Maturity view across the current product stack and the future 3DIMC roadmap, emphasizing external proof and trust gaps.

This is an author judgement based on public evidence depth, not a company scorecard. The matrix is intentionally strict on independent validation and trust disclosure.

[CE013, CE016, CE023, CE035, CE038, CE040]

5.5 Exhibits

Chapter 06

06Customers

6.1 Buyer Segments and Buying Centers

Public materials give a reasonably consistent picture of who d-Matrix is trying to sell to, even if they do not reveal how many accounts have actually converted. The Series C release anchors the top-level target set as hyperscale, enterprise, and sovereign customers, while SquadRack narrows the operational buyer to cloud providers, sovereign clouds, and enterprises trying to serve agentic AI, reasoning, and video workloads. The recurring decision-maker profile is therefore infrastructure and platform leadership rather than application teams: data-center operators, AI platform architects, and cloud builders looking for lower-latency inference inside existing facilities. That targeting is reinforced by the sales narrative around standard PCIe servers, Ethernet, and air cooling. d-Matrix is not pitching a hyperscaler-only custom rack that requires greenfield infrastructure; it is pitching a retrofit path for operators who want to add inference capacity without taking on a proprietary fabric or liquid-cooling mandate. The strongest current fit appears to be enterprise and neocloud-style operators plus sovereign-cloud programs that value infrastructure optionality, while disclosed hyperscale production references remain absent.[CU001, CU002, CU003, CU004, CU005, CU006]

Customer segmentation table
SegmentBuyer / budget ownerPrimary usersPublic proofStrategic valueKey gap
Hyperscalers and AI factoriesPlatform and infrastructure leadershipAI infra teams serving frontier or high-volume inferenceSeries C release names hyperscale customers as a target, but no account is disclosed by nameWould validate very large-scale production adoption and silicon credibilityNo named hyperscale production customer is public
Enterprise datacentersCIO / CTO / AI platform budgetInfrastructure and platform teams retrofitting existing facilitiesGigaIO, Supermicro, and Liqid messaging all frame Corsair around enterprise deployment and TCONear-term commercial wedge because retrofit economics matterMost proof is partner-led rather than end-customer case-study led
Sovereign clouds and national programsState-backed cloud operators and infrastructure procurement teamsOperators serving regulated or national workloadsSeries C and SquadRack materials explicitly mention sovereign customers or sovereign cloudsCould be high-value accounts that care about infrastructure controlNo sovereign customer is named or quantified publicly
Specialist AI clouds / neocloudsCloud operator leadershipModel providers and AI-native companiesGimlet is the clearest named operator using heterogeneous infrastructure with CorsairProvides a route into high-usage workloads without needing a hyperscaler logo firstPublic availability is still select-customer in 2H 2026
OEM / channel ecosystemSystem integrator and OEM product teamsServer, rack, and networking integration teamsSupermicro, GigaIO, Arista, Broadcom, and Liqid appear in public commercialization materialsCritical force multiplier for reach, deployment speed, and support coverageChannel dependence can obscure true end-customer count and revenue mix

This table separates end-buyer segments from the OEM and channel layer that helps d-Matrix reach them. Public evidence is strongest on the route to market, not on disclosed account counts by segment.

[CU001, CU002, CU004, CU005, CU007, CU025]
FU001: Customer journey map

d-Matrix is trying to move buyers from retrofit evaluation to scale-up, rack-scale, and heterogeneous cloud expansion without forcing a greenfield datacenter rebuild.

Stages are analytical journey steps synthesized from public commercialization materials, not a company-authored funnel with disclosed volumes.

[CU003, CU019, CU020, CU021, CU022, CU041]

6.2 Named Proof and Deployment Maturity

The customer-evidence picture improves materially once the analysis separates named proof points from broad customer-count rhetoric. Publicly, d-Matrix can point to a genuine commercialization ladder: Corsair sampling to early-access customers in late 2024, JetStream samples in 2025, GigaIO's integrated SuperNODE offer in May 2025, SquadRack's Supermicro channel availability for Q1 2026, and the March 2026 Gimlet Labs partnership for heterogeneous cloud deployment. Those are meaningful milestones because they show the company moving from hardware announcement to partner-integrated systems and then to a named external operator. But the same corpus also defines the limit of the proof. The ecosystem page explicitly says the showcased logos are not actual customers, and the public GigaIO, Supermicro, and Liqid statements remain partner endorsements rather than end-customer case studies. Gimlet is the clearest named external operator in the set, yet even there the public record says select-customer availability is planned for the second half of 2026 rather than already broad production. In other words, deployment maturity is real, but it is still better evidenced by channel readiness and operator pilots than by disclosed production-account depth.[CU008, CU009, CU010, CU011, CU012, CU013]

Customer growth / adoption trajectory table
MilestonePublic statusDateSourceConfidenceImplicationMissing denominator
Corsair early-access samplingSampling to early-access customers2024-11-19d-Matrix / eeNews EuropehighShows the product moved beyond lab-only positioningNo count of early-access accounts or conversion rate
GigaIO scale-up partnershipIntegrated enterprise scale-up system announced2025-05-01d-Matrix / GigaIO / AIwiremediumDemonstrates a route to single-node enterprise deploymentNo installed-base or booked-customer figure
JetStream NICSamples available; full production expected by year-end2025-09-08d-MatrixmediumExpands the platform from accelerator cards into multi-node networkingNo shipment or attach-rate disclosure
SquadRack commercializationConfigurations available for purchase through Supermicro in Q1 20262025-10-14PRNewswire / d-MatrixhighMoves proof from reference architecture toward channel transactabilityNo disclosed order backlog or customer count
Series C customer expansionCompany says funding supports multiple large-scale deployments and rapid customer growth2025-11-12d-Matrix / independent coveragemediumSuggests the sales funnel progressed beyond pilotsNo account names, deployment counts, or revenue split
Gimlet cloud offerSolution planned for select customers in 2H 20262026-03-12d-Matrix / Data Center KnowledgemediumFreshest named external operator proof in the corpusSelect-customer plan is not the same as broad GA or renewal data

Trajectory milestones are operational commercialization markers, not customer-count metrics. Public materials show stage progression but do not disclose the denominator behind each step.

[CU008, CU009, CU010, CU011, CU014, CU015]
Named customer proof table
Proof pointSegmentDeployment / use caseProduction vs pilotOutcome / signalLimitation
Gimlet LabsSpecialist AI cloud / operatorCorsair deployed alongside GPUs for agentic inference and speculative decodingPre-broad-availability; select customers planned in 2H 2026Named external operator plus technical benchmark and planned cloud offerStill short of disclosed broad production customer count or renewals
GigaIOEnterprise scale-up infrastructure partnerCorsair integrated into SuperNODE for large single-node inference deploymentsIntegrated solution with early-access interest pathShows a credible enterprise deployment vehicle and scale-up expansion routePartner proof rather than named end-customer production logo
SupermicroOEM / channel partnerSquadRack and X14 AI server platform for rack-scale inferencePurchasable configuration path in Q1 2026Indicates channel transactability and rack-integration readinessPublic statement does not identify live end users or outcomes
LiqidComposable infrastructure partnerCorsair integrated for flexible inference deploymentsPartner endorsement onlySupports the story that Corsair can slot into alternate enterprise platformsNo named customer, deployment size, or usage metric is disclosed
Undisclosed early-access customersEvaluation cohortCorsair samples in late 2024Pilot / evaluationShows some pre-GA external testing before broader commercializationAccount count, conversion, and segment mix are not public

The strongest named proof in public is a mix of one cloud operator and multiple channel/OEM partners. Logos and endorsements improve confidence in deployment maturity, but they do not answer the core question of how many end customers are in production.

[CU008, CU012, CU014, CU015, CU016, CU019]
FU002: Adoption / deployment funnel

Public evidence shows d-Matrix progressing from early-access sampling to channel availability and then to select-customer cloud rollout, but not yet to a disclosed installed-base metric.

Values are qualitative stage ordering only; they do not represent customer counts or conversion rates.

[CU008, CU009, CU010, CU015, CU041, CU044]
FU003: Customer proof matrix

The proof base is strongest on named partner and operator validation, and weakest on end-customer specificity, retention visibility, and concentration disclosure.

Matrix labels are qualitative judgments derived from the public corpus and are meant to compare evidence quality rather than score account economics.

[CU015, CU016, CU033, CU040, CU045]

6.3 Expansion Path, Channel Dependence, and Procurement Friction

The expansion thesis is coherent and probably the chapter's biggest positive. d-Matrix is trying to land first where latency-sensitive inference can be inserted into existing datacenters, then expand from card evaluation to single-node scale-up, rack-scale rollout, and broader heterogeneous-cloud deployments. GigaIO represents the single-node scale-up step, SquadRack plus Supermicro represent the rack-scale and channel-sales step, and Gimlet represents a cloud-service-provider path for workload-specific acceleration alongside GPUs. The challenge is that every one of those steps depends on partner execution. Public materials repeatedly show that d-Matrix needs OEMs, system integrators, switching partners, and orchestration layers to translate technical differentiation into repeatable customer adoption. Independent sources make the friction explicit: enterprises still optimize procurement around Nvidia's trusted software and systems ecosystem, and heterogeneous deployments only become easy when the abstraction layer hides hardware complexity. That means d-Matrix's customer expansion path is plausible, but conversion risk still sits at the intersection of OEM qualification, developer workflow compatibility, and whether buyers trust the platform enough to move from pilot or evaluation into repeat production spend.[CU021, CU022, CU023, CU024, CU025, CU026]

Expansion and concentration risk table
Expansion driverHow it expandsConcentration or channel riskImpactDiligence path
Existing-datacenter retrofitLand via PCIe cards, standard Ethernet, and air-cooled deploymentsExpansion depends on buyers trusting non-Nvidia acceleration in existing workflowsCan shorten time to first deployment if compatibility claims holdValidate deployment effort and operator training burden on two live accounts
Single-node scale-up via GigaIOExpand from evaluation card count to dozens of Corsairs in one nodeRoute is partner-mediated and does not reveal end-customer concentrationHelpful bridge from POC to meaningful workload sizeRequest pipeline by partner and stage, plus attach rates
Rack-scale path via Supermicro / SquadRackExpand from node to rack and then multi-rack clustersOEM availability does not prove sell-through or support qualityCould unlock larger enterprise, sovereign, and cloud budgetsRequest bookings, deployments, and time-to-production through Supermicro
Cloud-operator path via GimletExpand into heterogeneous inference cloud workloads and AI-native operatorsSelect-customer launch creates dependency on a small number of early operator winsCould create high-usage reference accounts quicklyRequest current committed capacity, target customers, and revenue model with Gimlet
Direct hyperscale or sovereign winsWould expand average contract size materiallyPublic sources do not name any such account, leaving concentration upside and downside both opaqueLargest upside path but hardest one to underwrite publicly todayRequest named accounts or procurement-stage evidence with timing and scope

The table treats concentration risk as both upside and downside because undisclosed large accounts can accelerate growth while also masking dependency.

[CU020, CU021, CU022, CU023, CU024, CU032]
Procurement and validation friction table
Friction pointWhy it existsCurrent mitigationPublic evidenceRemaining ask
Entrenched Nvidia ecosystemCustomers already know the incumbent software, tooling, and deployment modeld-Matrix positions open standards, better latency, and lower power/TCOForbes, GFM Review, and IntuitionLabs all highlight incumbent ecosystem advantageRequest win-loss analysis against Nvidia-based alternatives
Heterogeneous deployment complexityMultiple accelerators only help if orchestration hides hardware complexityGimlet abstracts workload placement and d-Matrix markets Aviator plus JetStreamData Center Knowledge and Gimlet describe the abstraction layer as centralRequest operator evidence on deployment effort and code changes
OEM and channel dependenceCustomers buy systems, racks, and support packages rather than bare siliconSupermicro, GigaIO, Liqid, Arista, and Broadcom extend the commercial surfaceOfficial commercialization materials repeatedly rely on those partnersRequest support split, certification matrix, and partner revenue concentration
Proof depth versus proof qualityPartner quotes and architecture milestones are easier to publish than production cohort datad-Matrix now has one named operator in Gimlet and purchasable SquadRack configurationsThe corpus shows real progress but still lacks retention and installed-base disclosureRequest production reference calls and cohort metrics before underwriting durable demand

This table is intentionally qualitative. Public evidence is much stronger on what could make procurement easier than on how quickly those mitigations are converting into durable production revenue.

[CU024, CU034, CU035, CU036, CU037, CU038]

6.4 Durability, Concentration, and Disclosure Gaps

Durability evidence is the weak point in the public record. Official and independent sources say the company is seeing rapid customer growth and preparing multiple large-scale deployments, but none of the reviewed materials quantify active customer count, repeat-order frequency, contract length, gross retention, net revenue retention, or churn. The same goes for concentration risk: there is no public disclosure of top-customer exposure, partner-versus-direct channel mix, or what share of revenue comes from any one hyperscale, enterprise, or sovereign account. As a result, the chapter can underwrite that commercialization is moving forward, but not that it is already broad, sticky, or diversified. The freshest evidence in 2026 is architectural and partner-led, not cohort-led. That is the central underwriting gap investors should keep front and center: d-Matrix has enough named proof to show adoption is not purely conceptual, but not enough disclosure to prove that adoption is durable, repeatable, and diversified at production scale.[CU028, CU029, CU030, CU031, CU032, CU033]

Retention / repeat usage / satisfaction table
MetricValueSegmentConfidenceWhy it mattersDiligence ask
Customer countAll segmentslowWithout active-account count, deployment rhetoric cannot be translated into market share or adoption depthRequest current active production customers by hyperscale, enterprise, sovereign, and cloud-operator segment
Net revenue retentionAll recurring accountslowNRR is the cleanest way to test land-and-expand claims in infrastructureRequest trailing-12-month NRR by direct and channel-sourced cohort
Gross retention / churnAll customerslowGross retention shows whether deployments survive beyond proof-of-conceptRequest logo churn, gross retention, and reasons for non-renewal
Contract duration / repeat order cadenceEnterprise and sovereign accountslowHardware and infrastructure businesses can look healthy on pipeline while still lacking repeat ordersRequest average term length, renewal cadence, and backlog conversion data
Referenceable satisfaction / user advocacyOnly partner quotes and a named operator benchmark are publicChannel and operator proofmediumReference calls and user satisfaction matter because the product changes infrastructure design choicesRequest at least three production reference calls with deployment age, workload, and realized outcome

Nulls are intentional. The public record reviewed for this chapter does not disclose retention, renewal, or customer-count metrics in a way that can be underwritten quantitatively.

[CU028, CU029, CU030, CU031, CU040, CU045]

6.5 Exhibits

Chapter 07

07Risks

7.1 Roadmap and System-Integration Risk

The highest-ranked risk is that d-Matrix is not selling one isolated chip; it is trying to commercialize a synchronized stack spanning Corsair cards, JetStream networking, Aviator software, partner-qualified racks, and a next-generation 3DIMC roadmap. That architecture can be differentiated and still fragile. JetStream is still publicly framed as early commercialization rather than mature fleet deployment, and the Pavehawk-to-Raptor path is explicitly described in lab-validation terms instead of production yield or broad customer proof. The company also made major architecture pivots during its first five years, which shows adaptability but also means buyers and investors are underwriting a moving roadmap. Mitigation exists in the form of concrete product surfaces and partner disclosures, but mitigation maturity remains only moderate because the public record still relies heavily on company-authored milestones rather than broad independent production evidence.[CR001, CR003, CR004, CR005, CR006, CR007]

Operational / quality / security risk register
Failure modeLikelihoodSeverityMitigation maturityResidual exposureUnresolved gap
JetStream and multi-node deployment remain earlier-stage than mature incumbent networking stackshighhighmoderatehighPublic evidence is still early-commercial rather than long-running production proof
Pavehawk and Raptor remain roadmap assets, so next-generation memory scaling is not yet field validatedhighblockinglowhighNo public production-yield, GA timing, or external benchmark package was found
System efficiency claims may compress once networking, orchestration, and support overhead are includedmediumhighlowmediumThe reviewed corpus lacks rack-level TCO or fleet operations data from independent production users
Field reliability is under-evidenced because no public MTBF, uptime, or RMA metrics were surfacedmediumhighlowhighDurability remains unresolved outside company-authored deployment materials
Disaggregated inference adds more integration surfaces where firmware, switch, or software coordination can failmediumhighmoderatemediumThere is little public evidence on install time, operational toil, or rollback behavior
Rapid workload evolution can make today’s architectural assumptions less durable if models shift againmediummediummoderatemediumThe roadmap must keep pace with future reasoning, multimodal, and agentic inference patterns

Operational risk is dominated by maturity and integration rather than by a single disclosed defect. Public mitigations exist, but most are still conceptual or partner-led.

[CR001, CR003, CR004, CR005, CR006, CR008]
FR003: Dependency map

Maps the core dependencies that make d-Matrix’s differentiated architecture simultaneously open-standards friendly and execution-sensitive.

Nodes are grouped analytically to show underwriting dependencies rather than to mirror one official product diagram.

[CR002, CR004, CR005, CR009, CR013, CR014]

7.2 Supply, Manufacturing, and Ecosystem Dependence

The second-ranked risk is concentrated dependency across packaging, foundry choices, networking, and system integration. The next-generation memory roadmap depends on external advanced-packaging execution, and public third-party coverage ties d-Matrix’s chiplet path to TSMC-node choices and evolving interface standards. At the same time, the current go-to-market motion depends on an ecosystem of Arista, Broadcom, Supermicro, GigaIO, and similar partners to turn cards into deployable systems. That partner-heavy model reduces proprietary lock-in for the customer, but it shifts execution risk onto third-party qualification, support handoffs, and BOM availability. Macro supply conditions do not make this easier: advanced packaging bottlenecks and adjacent GPU shortages have already shown how quickly AI-hardware supply chains can tighten. Mitigation today is conceptual openness and partner breadth, not evidence of a self-contained or highly redundant supply chain.[CR002, CR009, CR010, CR011, CR012, CR013]

Partner / dependency risk register
DependencyCounterparty / clusterRoleConcentrationFailure scenarioSeverityMitigationResidual exposure
Advanced packaging and 3DIMC executionAlchip plus future foundry and OSAT chainNext-generation roadmap enablementhighRaptor slips or ships with weak economics because packaging, node, or qualification work lagsblockingHold next-gen valuation upside contingent on clear manufacturing milestoneshigh
Reference-rack and OEM stackArista, Broadcom, SupermicroTurns cards and NICs into deployable systemshighQualification or support gaps slow customer rollout even if silicon is availablehighDemand a validated BOM and support handoff modelhigh
Single-node scale-up partnerGigaIOProvides a scale-up deployment path for larger configurationsmediumPartner-led deployment proof does not translate into broad repeatable field adoptionmediumSeparate partner demos from repeatable customer demand in diligencemedium
Early external customer-validation channelGimlet LabsProvides cloud deployment and early customer visibilitymediumThe reference partnership fails to convert into diversified production-customer proofhighTrack conversion from select-customer availability to named production winshigh
Incumbent software and operations ecosystemCUDA-era buyer workflows and procurement normsShapes default enterprise buying behaviorhighCustomers keep defaulting to incumbent stacks because switching costs outweigh point-performance upsidehighRequire evidence of workload-level wins that survive full-system comparisonhigh

This register mixes supplier, integrator, and market-structure dependencies because all three can block commercialization even if the silicon itself performs.

[CR002, CR009, CR010, CR013, CR014, CR028]

7.3 Adoption Proof and Deployment Friction

The third-ranked risk is that adoption proof still trails the technical story. Public evidence is strongest around early-access partners, reference architectures, and select-customer availability windows rather than around a broad set of named production users. The Gimlet and GigaIO announcements help because they show real counterparties and deployment intent, but they still read as early commercialization surfaces, not a mature installed base. That matters because enterprise AI adoption remains constrained by networking, power, and operational complexity even when the chip-level economics look better. d-Matrix’s air-cooled, standard-server positioning is a real sales advantage, yet the public record still does not show whether customers can deploy the full stack without heavy solution-engineering effort or long qualification cycles. This is therefore both a customer-proof risk and an implementation-friction risk.[CR014, CR015, CR016, CR017, CR018, CR035]

7.4 Compliance, Security, and Regulatory Risk

The fourth-ranked risk is that compliance, export-control, and security proof is thinner than what many enterprise and sovereign buyers now expect from inference infrastructure vendors. The public legal surfaces that were easy to verify are the website privacy policy and terms of use; those are necessary, but they are not a substitute for product-security architecture, audit posture, or a dedicated trust center. That gap matters because regulated buyers increasingly prioritize data control, predictable governance, and on-prem assurance, and competing platforms market those qualities explicitly. Overlaying that is export-control risk. BIS has already tightened controls on advanced computing semiconductors and circumvention pathways, which means advanced-AI hardware supply chains operate under an active regulatory regime. D-Matrix may never be directly blocked by those rules, but investors still need to understand how packaging partners, manufacturing geographies, and customer targets interact with evolving trade controls.[CR023, CR024, CR025, CR026, CR027, CR041]

Regulatory / legal risk register
RiskJurisdiction / scopeCurrent public signalLikelihoodSeverityMitigation maturityResidual exposureDiligence path
Advanced-chip export controls and anti-circumvention rules can disrupt supply-chain planning or customer geographyU.S. and countries of concernBIS tightened controls on advanced computing semiconductors and related circumvention pathwaysmediumhighlowhighObtain the manufacturing geography map and export-control counsel memo covering packaging partners and target accounts
Public product-security and compliance disclosure is thin relative to regulated-buyer expectationsGlobal enterprise and sovereign buyersReviewed public legal pages did not surface a trust center or product assurance packagemediumhighlowhighRequest security architecture, audit posture, and regulated-customer references
Website privacy and terms coverage does not answer product-level data-governance or liability questionsCustomer contracts and deployment governancePublic legal pages describe site handling and disclaimers rather than model or fleet controlsmediummediumlowmediumReview customer contracts, DPA terms, indemnities, and security commitments
Trade-policy shifts can affect next-generation packaging or node transitions even if current products are shippingFoundry, packaging, and future roadmapThe roadmap points to more advanced packaging and node sensitivity over timemediumhighlowhighMap which future programs would be most exposed to rule changes or licensing reviews
Public compliance posture may be insufficient for sovereign or highly regulated deployment bidsGovernment, defense-adjacent, finance, and healthcare buyersCompetitors market stronger public security or on-prem assurance language than d-Matrix does todaymediumhighlowmediumRequest the product-security roadmap and list of blocked or delayed regulated opportunities

This register focuses on public legal, export, and trust signals. Residual exposure remains elevated because the current corpus is stronger on architecture than on auditable compliance evidence.

[CR023, CR024, CR025, CR026, CR027]

7.5 Competition, Financing, and Kill Triggers

The final ranked cluster is the combination of incumbent competition, ecosystem lock-in, and financing discipline. NVIDIA, AMD, and specialist alternatives such as Groq and Cerebras keep raising the bar on memory capacity, rack-scale integration, deployment model, and software expectations. Independent analysis also argues that switching costs are system-wide, not just chip-level, and that the market stays concentrated through 2026. That means d-Matrix must win specific workloads where its latency-per-watt or brownfield-deployment advantages survive full-system comparison. Financing risk compounds this challenge. The 2025 round bought time, but it also set a valuation benchmark before broad public revenue proof. Management’s own history includes a near-death cash moment, and the company is expanding footprint and headcount while public customer proof remains limited. The investment case should therefore stay milestone-based, with explicit thesis-break triggers tied to roadmap timing, production-customer conversion, and supply-chain stability.[CR019, CR020, CR021, CR022, CR028, CR029]

People / execution risk register
Role / functionDependency or gapLikelihoodSeverityMitigationDiligence path
Global engineering and operations footprintTeams now span multiple geographies and must coordinate silicon, software, and commercialization programsmediumhighProgram-management discipline and stronger release governanceRequest org chart, release cadence, and decision-rights model across sites
Commercialization leadership and field supportPublic proof of scaled customer-success and support operations is limitedmediumhighBuild customer engineering and post-sales capability before broad rolloutRequest support headcount, escalation model, and deployment staffing assumptions
Product-definition disciplineCompany history includes multiple architectural pivots as the market evolvedmediummediumMilestone-gated roadmap with explicit customer design-partner feedback loopsReview roadmap governance, stage-gates, and criteria for killing or adding programs
Capital allocation during scale-upHiring and footprint expansion can raise burn faster than validated revenue proofmediumhighTie hiring and global expansion to booked demand and support loadRequest burn bridge, hiring plan, and trigger points for expense control

Execution risk is less about founder credibility and more about whether the company can scale commercialization and release discipline fast enough to match its technical ambition.

[CR007, CR019, CR020, CR021, CR022, CR044]
Mitigation and kill criteria table
RiskMonitorable triggerThreshold / eventAction implication
Roadmap maturityJetStream or Raptor timelineMaterial slip versus customer qualification or procurement windowsMove from underwriting roadmap upside to valuing only currently deployable product surfaces
Customer proofNamed production deploymentsNo diversified production-customer set beyond partner or early-access references by 2026-2027Downgrade commercialization confidence and valuation support
Supply and packagingAdvanced packaging or export-control shockRule change, capacity miss, or cost spike that materially changes roadmap timing or unit economicsTreat next-gen revenue assumptions as delayed and revisit cash needs
Competitive differentiationFull-system latency-per-watt comparisonIncumbent or specialist alternatives close the gap without comparable integration burdenReduce willingness to pay for architecture premium and moat assumptions
Security and regulated adoptionTrust-center and compliance evidenceNo product-security assurance package while regulated opportunities remain part of the narrativeConstrain TAM assumptions and delay sovereign or regulated-market upside
Financing disciplineBurn versus proofHeadcount and global expansion continue while bookings or production evidence remain thinShift the thesis to capital preservation and downside underwriting

These triggers are designed to be observable from follow-on diligence or future refreshes rather than generic narrative concerns.

[CR037, CR038, CR039, CR040, CR041, CR045]
FR001: Risk heatmap

Residual-risk view showing roadmap and customer-proof risks clustering in the highest-impact cells, with compliance and financing risks close behind.

Likelihood and impact are qualitative rankings synthesized from the reviewed corpus; they are not statistical loss estimates.

[CR036, CR037, CR038, CR039, CR040, CR041]
FR002: Risk transmission map

Shows how roadmap, supply, compliance, and adoption risks transmit into bookings, burn, financing, and valuation.

This map is analytical rather than company-authored; it visualizes second-order effects that matter to underwriting.

[CR016, CR021, CR022, CR036, CR037, CR038]

7.6 Exhibits

Chapter 08

08Valuation

8.1 Verified marks, price discipline, and the core call

d-Matrix is not a paper company. The 2025 Series C was publicly disclosed at $275 million and a $2 billion valuation, the investor list is real, and the company has a visible product story around Corsair accelerators, JetStream networking, and Aviator software. The round also came after a 2023 $110 million Series B and years of technical positioning around digital in-memory compute for inference. That is enough to justify serious attention. It is not enough to justify blind acceptance of the last mark. The valuation problem is the denominator. The reviewed public corpus still does not disclose revenue, ARR, gross margin, free cash flow, customer count, or cap-table terms. The Series C announcement claims rapid customer growth and expanding partners, but it does not tell an outside investor what portion of that story is pilot activity, production deployment, or economically attractive recurring demand. Without those numbers, a conventional late-stage revenue multiple cannot be defended from public evidence. The right conclusion is therefore price-sensitive. The company may be strategically relevant to the inference stack, but at $2 billion the public record still looks incomplete. That is why the present-tense call is research-more rather than buy: the quality of the technical story is good enough to keep working, but the public proof is still too thin to treat the latest round as a clean fair-value anchor.[CV001, CV002, CV003, CV005, CV006, CV007]

Recommendation summary table
DimensionAssessmentEvidence anchorDecision implication
Recommendationresearch-moreReal financing and product proof, but missing operating denominator metricsKeep working only if diligence rights or pricing improve
ConfidencemediumFunding and technical facts are real, but revenue/ARR/margin remain undisclosedUse wide scenario bands and avoid precision
Risk ratinghighCommercialization, benchmark normalization, and next-price discovery are still unresolvedAssume downside asymmetry until private evidence closes the gaps
Valuation stancestretched at $2BSeries C price sits above the chapter's public-evidence midpointTreat the last round as a milestone mark, not a proven bargain
Indicative entry disciplinebelow ~$1.5B or with hard downside protectionBase case centers below the last round and no target return can be supported publicly at $2BRequire price improvement, structure, or unusually strong data-room proof
Likely exit pathstrategic sale or later secondary before IPOStrategic relevance is more visible than public-market disclosure readinessDo not underwrite near-term IPO liquidity from the public record

Recommendation is explicitly price-sensitive. It judges the 2026 evidence set and the latest disclosed round, not just the quality of the architecture.

[CV001, CV017, CV049, CV050, CV051, CV052]
Thesis / anti-thesis table
ArgumentBull readBear readWhat would change the view
Inference timingInvestors correctly anticipated inference becoming the economic battleground of AIThe theme may be right while this price and this company-specific capture are still too optimisticShow named production customers winning inference budgets today
ArchitectureMemory-local design and modular insertion could solve a real latency and power problemClaims remain too company-authored and may not survive normalized testingPublish a third-party same-model benchmark with throughput, latency, power, and cost together
CommercializationGigaIO, Gimlet, and sovereign/enterprise language suggest real market pullPublic evidence may still be mostly pilot, partner, and roadmap proof rather than scaled recurring demandDisclose booked revenue, customer count, and pilot-to-production conversion
Comparable contextScarce inference assets may deserve a premium to generic hardware or software namesNVIDIA and AMD already disclose enormous scale, while private challengers remain heterogeneous and opaqueProve d-Matrix belongs in a premium scarcity bucket instead of a speculative startup bucket
Price discipline$2B could be reasonable if bull-case milestones arrive quicklyAt present the mark already prices in success without public financial proofA lower entry or strong private data-room evidence would materially improve the call

The anti-thesis is not that d-Matrix lacks technical merit. It is that the latest price gets ahead of what the public corpus can actually support.

[CV013, CV014, CV015, CV016, CV017, CV030]
FV001: Recommendation logic

The call flows from real financing and technical credibility through missing denominator metrics to a price-sensitive research-more recommendation.

This is a decision-logic compression of the chapter, not a causal operating model.

[CV001, CV013, CV014, CV017, CV032, CV044]

8.2 What the public record supports — and what it does not

The strongest public evidence supports d-Matrix as an inference-first hardware company with real financing, a coherent architecture, and growing ecosystem ties. Company, investor, and partner materials consistently repeat the same operating narrative: inference is becoming the economic bottleneck in AI, d-Matrix designed around the memory-movement problem, and products such as Corsair can improve throughput, latency, power, and total cost versus GPU-heavy alternatives. GigaIO and Gimlet also provide at least some outside evidence that the hardware is being integrated into real systems and tested in heterogeneous inference workflows. But the evidence gets weaker exactly where valuation needs it to get stronger. Most numerical economic claims are company-authored or partner-amplified. The best independent technical support in the corpus is Gimlet's speculative-decoding analysis, which is useful but still narrower than a same-model, same-batch, same-latency benchmark versus incumbent GPU clusters. More importantly, no reviewed source discloses the recurring revenue base, margin structure, pricing curve, or customer concentration that would let an investor translate technical promise into a defensible enterprise value. That asymmetry matters. Public evidence today supports the proposition that d-Matrix could become important. It does not support the proposition that the current private mark is already cheap. In valuation terms, the chapter can underwrite strategic optionality and milestone progress; it cannot underwrite hidden unit economics as if they had been publicly proved.[CV004, CV005, CV006, CV009, CV010, CV011]

FV004: Investment KPIs

KPI dashboard highlights the few hard public anchors and the main missing denominator facts driving the recommendation.

Dashboard mixes disclosed facts and inferred valuation outputs; every inferred item is labeled as such in the detail field.

[CV001, CV004, CV017, CV049, CV050, CV053]

8.3 Comparable set and inferred underwriting logic

The comp set is useful, but only if it is used correctly. NVIDIA and AMD are the cleanest public reference points for scale, disclosure quality, and incumbent strength. Their official product pages define the technical bar on memory, bandwidth, and rack-scale integration, while their 2026 filings show what fully disclosed AI infrastructure businesses look like when they are already operating at enormous commercial scale. That makes them indispensable scale fences. It does not make them direct pricing comps for a private startup with no public revenue denominator. The private and specialized challenger set is also heterogeneous. Groq is easier to understand commercially because it exposes service tiers and on-prem optionality. Cerebras competes through wafer-scale integrated systems. Low-confidence secondary coverage suggests SambaNova belongs in yet another bucket: integrated enterprise AI infrastructure rather than modular card insertion. D-Matrix sits somewhere else again, trying to sell a memory-local inference stack that can slot into more standard server and rack environments. Those are not minor packaging differences; they change customer budgets, deployment friction, and the kind of valuation support that can be observed publicly. The most plausible inference is that Series C investors were underwriting category timing and architectural fit, not already-revealed financial scale. In other words, they likely paid for the belief that inference demand will keep rising, that the market will reward lower-latency and lower-power alternatives, and that d-Matrix can convert partner and sovereign or enterprise interest into a durable niche before incumbent stacks absorb the opportunity. That logic is understandable. It is still not the same thing as public valuation support.[CV018, CV019, CV020, CV021, CV022, CV023]

Comparable valuation table
ReferenceStatusObserved metricMultiple / valuation contextWhy it mattersKey limitation
d-Matrix Nov-2025 Series CVerified private mark$275M raise at $2B valuation; $450M total raisedLatest disclosed pricing anchorOnly clean private valuation datum in the reviewed corpusNo public revenue, ARR, or margin denominator
NVIDIA FY2026 + Q1 FY2027Public incumbent scale fence$215.9B FY2026 revenue; $193.737B Data Center revenue; $81.615B latest quarterUseful as disclosure and scale boundary, not a startup multiple compShows what fully disclosed AI infrastructure scale looks likeTraining and inference are mixed; business is far broader than d-Matrix
AMD FY2025 + Q1 2026Public alternative scale fence$34.639B FY2025 revenue; $16.635B Data Center revenue; $10.253B latest quarterAnother scale and disclosure boundary rather than a direct price compShows how even a challenger incumbent discloses denominator metrics d-Matrix does notAlso mixes broader portfolio exposure beyond inference
GroqPrivate inference-service referencePublic/private/co-cloud plans and GroqRack on-prem option are visibleCommercial packaging is clearer than d-Matrix public pricingUseful for business-model comparison and deployment-path clarityReviewed corpus does not provide high-confidence current valuation support
CerebrasPrivate integrated-system referenceWafer-scale inference and training platform; system-level architectureComparable as a differentiated architecture, not as a modular card compShows another path to inference differentiationReviewed corpus does not provide high-confidence current valuation support
SambaNova / other integrated inference hardwareLow-confidence private referenceSecondary coverage describes integrated enterprise platform and recent fundingIncluded as category context only, not a numeric pricing anchorConfirms the private comp set is fragmented, not standardizedSource quality is insufficient for high-confidence valuation transfer

This table intentionally mixes one verified private mark, two public scale fences, and three business-model references. It is a comparable set for underwriting logic, not a clean multiple screen.

[CV001, CV022, CV023, CV026, CV027, CV028]
FV002: Valuation sensitivity

Directional evidence adjustments versus the $2B Series C mark show why the current public midpoint stays below the last round.

Bars are directional USD-million-style adjustments against the latest private mark, used only to visualize which evidence pushes the call up or down. They are not management guidance or transaction data.

[CV013, CV015, CV017, CV025, CV039, CV040]

8.4 Bull, base, and bear scenarios

Because revenue and ARR are undisclosed, scenario work has to be milestone-based rather than multiple-based. The right question is not “what exact EV/revenue multiple should a private AI chip startup trade on?” The public record cannot answer that. The right question is what evidence state would justify paying below, around, or above the last disclosed $2 billion mark. The bear case assumes that commercialization remains mostly pilot- and partner-led, that independent benchmark normalization still does not emerge, and that the next financing or secondary event prices d-Matrix closer to IP value plus conditional commercial optionality than to proven platform leadership. That produces a wide but clearly sub-round range of about $700 million to $1.0 billion. The base case assumes continued deployment progress and financing access, but still no public financial disclosure strong enough to prove a premium over the Series C valuation; that supports roughly $1.2 billion to $1.8 billion. The bull case requires named large-scale deployments, validated economics, and clearer evidence that d-Matrix can coexist with or displace incumbent GPU infrastructure in production, supporting roughly $2.5 billion to $3.5 billion. That framing matters because the latest disclosed round sits above the chapter's midpoint. The Series C is not impossible to justify, but it already prices in an execution path closer to the bull case than to the base case. That is why the current stance is stretched rather than attractive.[CV041, CV042, CV043, CV044, CV045, CV046]

Bull / base / bear scenario table
ScenarioProof-state assumptionValuation logicImplied valuation rangeProbability signalKey trigger
BearPilots do not convert clearly, benchmark proof remains thin, and next money reprices riskAsset/IP plus conditional commercial optionality~$700M-$1.0B25%A financing reset or stalled deployment evidence
BaseDeployments progress and financing remains available, but denominator metrics stay privateMilestone progress around the last round but below a full premium endorsement~$1.2B-$1.8B50%Commercial progress without enough public proof to justify $2B+
BullNamed large deployments, validated economics, and durable niche proof beside incumbent GPUsStrategic scarcity plus de-risked commercialization~$2.5B-$3.5B25%Independent validation and repeatable production wins
Probability-weighted midpointMix of the three states aboveBlended scenario center of gravity from public evidence~$1.5B-$2.0B100%Useful discipline check versus the $2B Series C mark

These ranges are milestone-based scenario judgments in USD millions. They are not disclosed values, fair-value appraisals, or revenue-multiple outputs because the required denominator metrics are not public.

[CV041, CV042, CV043, CV045, CV046, CV047]
FV003: Valuation / return range

Milestone-based bear, base, and bull valuation ranges in USD millions versus the last disclosed round.

All figures are inferred scenario values in USD millions except the last line, which is the disclosed Series C valuation. The range is milestone-based because revenue and ARR are not public.

[CV001, CV045, CV046, CV047, CV048, CV049]

8.5 Exit readiness, thesis-break triggers, and final diligence asks

Public evidence supports strategic relevance more than exit readiness. The company is building into one of the most crowded but important layers of the AI stack, and that does create strategic-sale or later-secondary optionality if the platform becomes a must-have inference component. But current public disclosure does not look IPO-ready. Public comparables disclose revenue, margins, filing cadence, and segment performance; d-Matrix does not. That disclosure gap drives the kill criteria. A priced financing materially below $1 billion would strongly suggest the market has reset the Series C narrative. A failure to convert partner announcements and early-access programs into named production deployments would undercut the commercialization story. An independent benchmark that fails to support the claimed cost or efficiency edge would remove the central reason to pay a premium for a specialized inference architecture. The required diligence is therefore concrete rather than generic: audited revenue and ARR, margin history, customer concentration, bookings-to-deployment conversion, realized pricing and support terms, and the preference stack or any recent 409A or secondary marks. Without those, a new investor is underwriting the story that d-Matrix could matter, not the proposition that $2 billion is already a disciplined entry price.[CV017, CV050, CV053, CV054, CV055, CV056]

Thesis-break and kill triggers table
TriggerThreshold / eventTransmission to thesisAction implication
Reset financingAny priced round or secondary clearing well below ~$1BConfirms the Series C narrative did not hold and compresses upside from hereRebuild the investment case from the new market-clearing price
Commercialization stallNo named production deployments or weak pilot conversion by the next financing cycleUndercuts the argument that architecture is becoming a real revenue engineShift immediately toward the bear case
Benchmark failureIndependent same-workload tests fail to validate a material TCO or efficiency edgeRemoves the main reason to pay a premium for specialized inference hardwareStrip out premium assumptions and revisit willingness to invest
Weak operating disclosurePrivate diligence shows poor margin quality, concentrated customers, or low conversionConverts a technology story into a low-quality hardware commercialization storyDemand a deep markdown or walk
Incumbent absorptionCustomers standardize on NVIDIA/AMD or integrated alternatives instead of adding d-MatrixShrinks the niche the Series C appears to underwriteAssume slower growth and lower exit optionality

Kill criteria are monitorable and tied directly to valuation transmission rather than generic company risk.

[CV039, CV041, CV043, CV053, CV054, CV055]
Final diligence asks table
TopicMissing evidenceWhy it mattersOwner / diligence path
Audited revenue / ARRMonthly and annual revenue plus ARR for 2024-2026Core denominator missing from every public valuation methodRequest audited financial statements and board KPI packs
Gross margin and cash conversionGross margin, opex, burn, and working-capital profileNeeded to separate valuable infrastructure software leverage from expensive hardware deploymentRequest audited P&L, cash-flow history, and margin bridge
Customer proofNamed production customers, cohort expansion, and top-account concentrationNeeded to distinguish partner activity from durable recurring demandRequest customer list, concentration schedule, and pilot-to-production conversion data
Pricing and support economicsRealized ASPs, discounting, software attach, and support obligationsNeeded to test whether public TCO claims survive commercial realityRequest pricing schedules, pilot invoices, and support-term summaries
Cap table and preferencesFully diluted ownership, preference stack, side letters, and any recent 409A or tender marksNeeded because enterprise value and common-equity value can diverge sharply in markdown scenariosRequest cap table, waterfall model, and latest 409A or secondary data
Benchmark normalizationSame-model third-party benchmark versus H100/H200/MI300X and any integrated challenger actually in the shortlistNeeded to validate the whole chapter's premium-economics assumptionCommission or obtain normalized benchmark evidence before underwriting price

These asks are blocking rather than optional. Without them, an investor is underwriting a strategic narrative more than a defendable valuation.

[CV017, CV030, CV054, CV055, CV056]

8.6 Exhibits

Disclaimer

This report is a public-evidence diligence snapshot, not investment advice. Important financial, legal, technical, and contractual facts remain non-public and should be verified directly with management and primary documents before any investment decision.

Evidence index

Claims
IDStatementConfidenceSources
CO001 d-Matrix was founded in 2019. High SO002, SO004, SO006
CO002 d-Matrix is headquartered in Santa Clara, California. High SO004, SO005, SO006
CO003 The latest public office list in the fetched pack names Toronto, Sydney, Bangalore, and Belgrade in addition to the Santa Clara headquarters. High SO004, SO011
CO004 The founders publicly identified by d-Matrix are Sid Sheth and Sudeep Bhoja, with Sheth as CEO and Bhoja as CTO. High SO002, SO004
CO005 The public executive bench on the about page includes software, product, finance, legal, and manufacturing leaders. Medium SO002
CO006 The public board roster names Per Roman, Sasha Ostojic, Jeff Huber, Michael Stewart, Connie Sheng, and Russell Tham. Medium SO002
CO007 The fetched public record does not disclose board committees, ownership percentages, or explicit protective provisions behind the named board slate. Medium SO002, SO004, SO008
CO008 d-Matrix positions itself as an accelerated computing company built for AI inference in data centers rather than as a general-purpose training hardware vendor. High SO001, SO004
CO009 The home page says d-Matrix's 3DIMC architecture uses a chiplet-based, PCIe-friendly design that can scale to models up to 100 billion parameters. High SO001, SO004
CO010 By late 2025 d-Matrix publicly described a three-part product stack of Corsair accelerators, JetStream networking, and Aviator software. High SO004, SO022, SO024
CO011 VentureBeat reported that d-Matrix raised $44 million in a Series A round in April 2022. Medium SO014
CO012 d-Matrix closed a $110 million Series B funding round on September 6, 2023, led by Temasek. High SO006, SO012
CO013 The Series B release said the new capital would help d-Matrix commercialize Corsair and invest in recruitment and commercialization. Medium SO006
CO014 The Series B release listed Playground Global, M12, SK Hynix, Nautilus Venture Partners, Marvell Technology, and Entrada Ventures among d-Matrix backers and strategic partners. High SO006, SO012
CO015 d-Matrix said its November 2025 Series C raised $275 million at a $2 billion valuation and brought total disclosed funding to $450 million. High SO004, SO007, SO008, SO009
CO016 The 2025 Series C was co-led by Bullhound Capital, Triatomic Capital, and Temasek, with new participation from QIA and EDBI plus follow-on participation from M12, Nautilus Venture Partners, Industry Ventures, and Mirae Asset. High SO004, SO007, SO008
CO017 Independent coverage corroborated the $275 million raise, $2 billion valuation, and $450 million total funding figures announced in Series C. High SO009, SO010, SO011
CO018 The latest disclosed employee count in the fetched pack is 250+ worldwide from the November 2025 Series C key facts. Medium SO004
CO019 The November 2025 materials identify Santa Clara as headquarters and Toronto, Sydney, Bangalore, and Belgrade as global office locations. High SO004, SO011
CO020 The public record supports treating d-Matrix as a private Series C infrastructure startup rather than a public company or early research project. High SO004, SO009
CO021 d-Matrix unveiled Corsair on November 19, 2024 as a new computing platform designed for AI inference in modern datacenters. Medium SO005
CO022 The Corsair launch materials claimed 60,000 tokens per second at 1 millisecond per token for Llama3 8B in a single server and 30,000 tokens per second at 2 milliseconds per token for Llama3 70B in a single rack. Medium SO005
CO023 Corsair was sampling to early-access customers in November 2024 and was expected to be broadly available in Q2 2025. Medium SO005
CO024 d-Matrix launched JetStream in September 2025 as a custom I/O card and said the NICs deliver up to 400Gbps bandwidth with full production expected by year-end. Medium SO022
CO025 JetStream, paired with Corsair and Aviator, pushed d-Matrix toward a full compute-networking-software platform rather than a single-chip offering. High SO022, SO023
CO026 SquadRack was announced in October 2025 with Arista, Broadcom, and Supermicro as an open standards-based rack-scale reference architecture for AI inference. High SO023, SO004
CO027 GigaIO said in May 2025 that integrating Corsair with SuperNODE would support enterprise-scale inference and cited 30,000 tokens per second at 2 milliseconds per token on Llama3 70B. Medium SO020
CO028 The November 2025 Alchip collaboration pointed to a 3D DRAM roadmap and said the commercial debut would come on the Raptor accelerator as Corsair’s successor. Medium SO024
CO029 Gimlet Labs reported in March 2026 that offloading speculative decoding to d-Matrix Corsair produced 2–10x end-to-end latency improvements at similar energy efficiency versus a homogeneous GPU setup. Medium SO021
CO030 The d-Matrix news index lists a March 2026 Gimlet Labs collaboration and an April 2026 acquisition of GigaIO’s data center business as post-Series C milestones. Medium SO003
CO031 CRN included d-Matrix in its list of the 10 hottest semiconductor startups of 2023 after the company’s $110 million Series B. Medium SO012
CO032 Silicon Republic described d-Matrix as an AI startup to watch in 2024 because major product launches were scheduled after its $110 million funding round backed by Temasek and M12. Medium SO013
CO033 VentureBeat linked Microsoft Project Bonsai support to d-Matrix before Corsair was public, giving the company early ecosystem validation. Medium SO014
CO034 CNN reported in 2023 that GPU and interposer shortages were constraining AI infrastructure, with Sid Sheth arguing companies would need more efficient approaches. Medium SO015
CO035 CNBC described LLM inference costs as structurally high and quoted Sheth arguing that GPUs were not built for spiky inference workloads. Medium SO016, SO018
CO036 TechTarget likewise quoted Sheth saying inference economics are about dollars per inference, power efficiency, and latency rather than only raw compute. Medium SO017
CO037 Bloomberg described investor interest in chips that process data where it is stored, a market tailwind that aligns with d-Matrix's in-memory positioning. Medium SO019
CO038 d-Matrix and Bullhound both claimed up to 10x faster performance, 3x lower cost, and 3-5x better energy efficiency than GPU-based systems, but those ratios remain vendor claims rather than independently audited field benchmarks in the fetched pack. Medium SO004, SO007, SO011
CO039 Named customer disclosure remains thin because Series C and product releases mention rapid customer growth and large-scale deployments without identifying a broad hyperscale or enterprise customer roster. Medium SO004, SO009
CO040 The clearest public commercialization proof in the fetched corpus comes from partner or customer-adjacent materials from GigaIO and Gimlet rather than from a broad set of named end customers. Medium SO020, SO021
CO041 GFM Review’s 2024 competitive overview argued that startups like d-Matrix still must overcome Nvidia’s entrenched ecosystem, scale production, and win broader adoption. Medium SO025
CO042 The public board affiliations mirror the capital base by linking Bullhound, Playground, Triatomic, M12, Nautilus, and Temasek to visible governance seats. Medium SO002, SO004, SO007, SO008
CO043 Series C materials said the new capital would accelerate global expansion and support multiple large-scale deployments for hyperscale, enterprise, and sovereign customers. High SO004, SO007, SO009
CO044 No public revenue or ARR figure appears anywhere in the fetched company, investor, or independent source pack. Low SO001, SO004, SO009
CO045 No public companywide customer count appears in the fetched pack even though the company and partners refer to growth and deployments. Low SO004, SO009, SO021
CO046 Current control economics remain opaque because no fetched source discloses ownership percentages, board committees, or investor-protection mechanics. Low SO002, SO004, SO008
CM001 d-Matrix’s product and technical materials consistently place the company in the datacenter inference market rather than the training market. High SM002, SM003, SM014
CM002 d-Matrix’s commercial form factor is merchant infrastructure built from PCIe cards, servers, racks, and Ethernet networking rather than a bespoke end-to-end datacenter stack. High SM002, SM004, SM006, SM014
CM003 The most relevant market boundary for d-Matrix is merchant datacenter generative inference infrastructure, excluding model training clusters and broad software or services spend. Medium SM002, SM003, SM011
CM004 Public source material separates inference into compute-bound prefill and memory-bound decode, implying that the relevant hardware market cannot be analyzed with a training-style compute lens alone. High SM009, SM022
CM005 Reasoning and agentic workflows make end-to-end latency more decision-relevant than raw tokens-per-second because each extra inference step compounds user waiting time. Medium SM005, SM009, SM012
CM006 Distillation and better smaller models broaden the set of inference workloads that can be served by specialized merchant hardware rather than only frontier-scale GPU clusters. Medium SM005, SM012
CM007 d-Matrix’s own product envelope implies a narrower commercial wedge focused on low-latency and memory-efficient serving rather than universal coverage of every model size or all AI compute. Medium SM002, SM003, SM005
CM008 Independent coverage shows that AI demand has strained GPU supply and datacenter expansion plans, creating a category-level opening for alternative inference architectures. High SM017, SM018, SM020
CM009 Microsoft’s public disclosure that AI datacenters depend on predictable energy, networking supplies, and GPUs shows that inference scaling is constrained by infrastructure inputs beyond compute silicon alone. High SM018, SM020
CM010 Memory movement between processor and memory chips is itself a major source of AI power consumption, which strengthens the commercial importance of architectures that reduce that movement. High SM003, SM016
CM011 d-Matrix’s DIMC thesis is that HBM-centric GPU architectures are too power-hungry and too memory-bound for the best latency economics in interactive inference. Medium SM003, SM014
CM012 d-Matrix’s JetStream materials argue that host-mediated communication introduces avoidable latency in multi-node inference and that device-initiated communication is a meaningful scaling advantage. Medium SM004, SM007, SM008
CM013 Batch size acts as a direct trade-off knob between user experience and hardware ROI in interactive inference workloads. Medium SM010, SM020
CM014 Public examples from generative-AI applications show that inference can become a material operating-cost burden rather than a negligible variable cost. High SM019, SM020
CM015 The existence of API-based inference and incumbent GPU clouds means part of the real substitute set for d-Matrix is a service choice, not only a hardware choice. Medium SM019, SM026, SM027
CM016 The most realistic early buyer segments for d-Matrix are enterprise or private-cloud operators, neocloud or managed-inference providers, AI product companies, and OEM or integrator channels. Medium SM006, SM021, SM026, SM027
CM017 Hyperscalers are strategically important but structurally harder for d-Matrix to win because they already control large GPU estates and increasingly use captive silicon. Medium SM018, SM028, SM029
CM018 d-Matrix’s standard PCIe and Ethernet positioning is more naturally aligned with enterprises and partner-led deployments than with fully custom hyperscale rack architectures. Medium SM004, SM006, SM014, SM021
CM019 On-prem or private-cloud optionality matters because peer inference providers such as Groq explicitly sell predictable-spend cloud plus on-prem deployment choices. Medium SM026
CM020 Integrators and OEMs are part of the buying path because production inference deployments require server qualification, networking design, and operational support in addition to accelerator cards. High SM003, SM014, SM021
CM021 GigaIO’s SuperNODE positioning suggests that some enterprise buyers will prefer scale-up systems that avoid complex distributed inference before they adopt full rack-scale multi-node designs. Medium SM021, SM014
CM022 Broad generative-AI spending forecasts are too wide to use as d-Matrix’s TAM because they include software, services, and categories far beyond merchant inference hardware. Medium SM028
CM023 Forbes cites a MarketsandMarkets estimate that the AI inference market will grow from $106.15 billion in 2025 to $254.98 billion by 2030. Low SM028
CM024 Silicon Analysts estimates the total AI accelerator market at roughly $160 billion in 2025E and $200 billion plus in 2026E, which is broader than inference-only merchant hardware. Low SM029
CM025 CNBC’s estimate that Bing AI required at least $4 billion of infrastructure is a useful lower-bound deployment-cost lens even though it is not a market-size estimate. Medium SM019
CM026 d-Matrix’s accessible SAM is narrower than either the published AI inference market or the total accelerator market because it excludes training clusters, captive hyperscaler silicon, and much API-consumed inference. Medium SM003, SM006, SM018, SM029
CM027 No clean public standalone SAM exists for open-standards merchant inference systems optimized for low-latency multi-node serving, so any precise public SAM or SOM would be overstated. Medium SM027, SM028, SM029
CM028 d-Matrix’s official materials claim up to 10-20x advantages on latency, throughput, power, or TCO versus GPU comparisons, but those results are explicitly preliminary and workload-specific. Medium SM002, SM003, SM014
CM029 A third-party developer-signal benchmark from Gimlet reports that using d-Matrix for speculative decoding can improve end-to-end request speed by 2-10x versus a GPU-only speculative decoder at equal energy efficiency. Medium SM022
CM030 Forbes reports that MLPerf Inference 5.0 results still showed Nvidia winning every submitted benchmark and that GB200 met Nvidia’s rack-scale inference claims on the cited tests. Medium SM028, SM025
CM031 Nvidia’s public product pages show that the incumbent GPU stack continues to improve memory capacity, bandwidth, and rack-scale inference performance from H100 to H200 to GB200. High SM023, SM024, SM025
CM032 The substitute set for d-Matrix now includes not only Nvidia GPUs but also Groq’s inference cloud and partner-led scale-up systems built around different accelerator assumptions. Medium SM021, SM023, SM024, SM025, SM026
CM033 CNBC described Nvidia as having about 95% of the market for AI chips in early 2023. Medium SM019
CM034 CNN cited Nvidia as controlling 84% of the market for discrete GPUs during the 2023 AI chip shortage. Medium SM018
CM035 Silicon Analysts estimates Nvidia at roughly 80-90% of AI accelerator revenue in 2025 and about 75% by 2026 as custom silicon broadens. Low SM029
CM036 The public market-share numbers disagree because they use different denominators—AI chips, discrete GPUs, or total accelerators including custom silicon—so raw share rhetoric is directionally useful but not directly comparable. Medium SM018, SM019, SM029
CM037 d-Matrix’s standard-form-factor story reduces some retrofit friction relative to bespoke liquid-cooled racks, but it does not remove the need to prove cluster design, memory capacity, and operating economics at production scale. Medium SM004, SM006, SM014, SM021, SM025
CM038 Hyperscaler custom silicon growth and Nvidia’s software moat cap the share of the broad inference market that is realistically contestable by merchant startups. Medium SM018, SM028, SM029
CM039 Reasoning and agentic AI increase the amount of inference-time compute buyers may need, which helps the category thesis while simultaneously increasing sensitivity to serving cost and latency. Medium SM011, SM012, SM028
CM040 The most investable wedge for d-Matrix is latency-sensitive, memory-bound, open-standards inference serving where buyers care about predictable TCO and do not already own enough optimized GPU capacity. Medium SM006, SM009, SM010, SM021, SM022, SM026
CM041 d-Matrix’s current deployment narrative is strongest in enterprise and partner-led channels because public materials emphasize OEMs, integrators, scale-up partners, and standard datacenter compatibility. Medium SM006, SM014, SM015, SM021
CM042 d-Matrix’s product materials explicitly frame standard servers, racks, and existing datacenter infrastructure as a feature, which aligns with buyers who want incremental deployment rather than facility redesign. Medium SM002, SM004, SM006
CM043 Because prefill remains compute-bound and GPUs still excel there, the strongest public third-party evidence for d-Matrix today supports heterogeneous co-deployment rather than universal GPU replacement. Medium SM009, SM022, SM028
CM044 The same GPU shortages and power constraints that support d-Matrix’s market narrative can also push buyers to reserve incumbent capacity or stay with existing vendors instead of adopting a startup accelerator. Medium SM017, SM018, SM029
CM045 The public source pack is strong enough to prove a large inference category but not strong enough to justify a precise public SOM for d-Matrix without management disclosures on deployments, cards per cluster, and workload mix. Medium SM015, SM028, SM029
CM046 Vendor-authored heterogeneous-stage comparisons and standardized full-system leaderboards measure different things, so the public benchmark record does not establish one universally dominant inference architecture. Medium SM022, SM023, SM025, SM028
CP001 D-Matrix says its Corsair, JetStream, and Aviator stack can deliver 10x faster performance, 3x lower cost, and 3-5x better energy efficiency than GPU-based systems. Medium SP001, SP003
CP002 Independent reporting frames the core AI inference energy problem as repeated movement of data between memory and processors. Medium SP004
CP003 D-Matrix’s Jayhawk chiplet platform was publicly described with 16 Gbit/s per-wire bandwidth and less than 0.5 pJ/bit energy efficiency. Medium SP007, SP008
CP004 Jayhawk was also presented as a modular, pre-validated chiplet architecture that can refresh compute platforms faster and accommodate third-party chiplets. Medium SP007, SP008
CP005 Historical H100 scarcity and silicon-interposer bottlenecks pushed the market to look for more efficient second-source inference hardware. Medium SP009, SP010, SP012
CP006 NVIDIA markets H100 as combining 900 GB/s NVLink, NDR InfiniBand, and Magnum IO software to scale inference across clustered systems. Medium SP014
CP007 NVIDIA markets H100 as up to 30x faster for inference on the largest models and bundles enterprise deployment software around the platform. High SP014, SP021
CP008 NVIDIA says H200 increases memory to 141 GB of HBM3e at 4.8 TB/s, which it describes as nearly double H100 capacity and 1.4x more bandwidth. Medium SP015
CP009 NVIDIA says H200 can deliver up to 2x H100 inference speed on Llama2-class workloads while improving energy efficiency and total cost of ownership. Medium SP015
CP010 NVIDIA markets GB200 NVL72 as a liquid-cooled rack with 72 Blackwell GPUs, 36 Grace CPUs, and a 130 TB/s NVLink domain. Medium SP016
CP011 NVIDIA markets GB200 NVL72 as delivering 30x faster real-time trillion-parameter inference and Forbes says the NVL72 matched NVIDIA’s promised ~30x gain over an 8-GPU H200 benchmark. High SP016, SP022
CP012 AMD markets MI300X as a 192 GB HBM3, 5.3 TB/s, eight-Infinity-Fabric-link accelerator with ROCm software support. Medium SP017
CP013 AMD explicitly positions MI300X around higher memory capacity and bandwidth than competing accelerators, making memory-per-GPU its clearest attack on NVIDIA and d-Matrix alternatives. Medium SP017
CP014 Independent benchmark coverage compares MI300X directly with H100, H200, and B200 under current LLM inference workloads, showing AMD is treated as a real production competitor. High SP017, SP020
CP015 ROCm gives AMD a less locked-down software position than CUDA, but framework compatibility remains a major practical selection criterion for enterprise buyers. Medium SP017, SP021
CP016 Groq publicly sells inference as public, private, or co-cloud service with Free, Developer, and Enterprise plans plus optional on-prem GroqRack. High SP018, SP021
CP017 Groq’s public packaging emphasizes low latency, predictable spend, regional endpoints, and no-batching-required operation rather than training versatility. Medium SP018
CP018 Independent analysis describes Groq as an inference specialist with its own compiler and runtime path, which can improve latency but still requires ecosystem switching away from standard GPU tooling. Medium SP021, SP023
CP019 Cerebras markets WSE-3 as a wafer-scale processor with four trillion transistors and 125 petaflops that survives defects through redundant compute and routing. High SP019, SP021
CP020 Independent analysis says Cerebras now competes by serving very large models on specialized systems rather than sharding those workloads across many smaller GPUs. Medium SP021
CP021 Independent analysis describes SambaNova’s SN50 as a full-stack platform with 1.6 PFLOPS FP16, 3.2 PFLOPS FP8, and very large model and context-window support. Medium SP021
CP022 SambaNova’s Intel collaboration and hardware-software co-design show that major inference challengers increasingly bundle system integration with silicon. Medium SP021
CP023 Independent landscape coverage places d-Matrix, Groq, and Cerebras inside the same inference-challenger set even though they attack different technical bottlenecks. Medium SP005, SP023
CP024 The AI inference market is growing around low-latency and efficient deployment needs, but high power demand and supply chain risk still constrain adoption. Medium SP004, SP025
CP025 Historical deployment cost evidence shows why inference economics matter: one startup described nearly $200,000 in monthly model-serving bills and analysts estimated ChatGPT-scale infrastructure could require 10,000 Nvidia GPUs plus costly upgrades. Medium SP011, SP012, SP013
CP026 D-Matrix’s lower-cost and lower-energy pitch is strategically relevant because buyers are already conditioned to think of inference as an operating-cost problem. Medium SP001, SP003, SP011, SP012, SP013
CP027 D-Matrix is not trying to beat GB200 on absolute rack-scale throughput; it is offering a purpose-built inference card and NIC stack optimized for memory locality in standard servers. Medium SP001, SP003, SP008
CP028 Relative to Groq, d-Matrix competes less on public-cloud API distribution and more on hardware insertion into enterprise or OEM infrastructure. Medium SP001, SP018, SP021
CP029 Relative to Cerebras and SambaNova, d-Matrix is the less integrated but potentially easier-to-insert option because it fits around cards, NICs, and partner servers instead of a giant proprietary appliance. Medium SP001, SP019, SP021
CP030 NVIDIA remains the default incumbent because its hardware is ubiquitous and its software plus channel ecosystem create switching costs far beyond chip specifications. High SP014, SP016, SP021, SP026
CP031 Intuition Labs cites NVIDIA at roughly 92% of the discrete GPU market in H1 2025 while Silicon Analysts still shows a 75-87% share band through 2026, which is adverse evidence for any challenger moat story. Medium SP021, SP024
CP032 D-Matrix’s core switching-cost problem is therefore software and networking, not just silicon performance, because buyers already know how to buy, cool, connect, and operate incumbent GPU clusters. Medium SP014, SP016, SP021
CP033 AMD pressures NVIDIA on memory-per-GPU and open software posture, but it still competes inside the same OEM server procurement motion rather than changing the buyer workflow as sharply as Groq or Cerebras. Medium SP017, SP020, SP021
CP034 Groq has a clearer public story than d-Matrix for regulated or air-gapped deployments because it explicitly advertises GroqRack and cloud-to-local continuity. Medium SP018, SP021
CP035 Internal build is a credible substitute because large buyers increasingly want inference hardware that fits existing ML pipelines and can be co-designed around proprietary workloads. Medium SP021, SP025
CP036 D-Matrix’s strongest moat candidate is technical fit to the memory bottleneck, because both independent reporting and its chiplet disclosures link inference efficiency to keeping compute close to memory. Medium SP004, SP007, SP008
CP037 A second moat candidate is modularity, because Jayhawk and the broader card-plus-NIC architecture imply d-Matrix can partner into heterogeneous systems rather than forcing a clean-sheet deployment. Medium SP001, SP007, SP008
CP038 Adverse evidence is material because d-Matrix still lacks a public MLCommons-style apples-to-apples benchmark against H100, H200, MI300X, Groq, or Cerebras. Medium SP001, SP003, SP020, SP022
CP039 Without standardized public benchmarking, buyers still have to underwrite d-Matrix’s cost and performance claims primarily through vendor-authored comparisons. Medium SP001, SP003, SP022
CP040 Overall moat durability looks moderate rather than strong because d-Matrix is differentiated on inference-specific memory locality, but NVIDIA and AMD own broader ecosystems, Groq has cleaner service packaging, and Cerebras plus SambaNova sell more integrated systems. Medium SP021, SP023, SP024
CI001 D-Matrix publicly frames its monetizable platform as Corsair accelerators, JetStream NICs, and Aviator software. Medium SI001, SI022
CI002 D-Matrix disclosed a $275 million Series C at a $2 billion valuation, bringing total disclosed funding to $450 million. High SI001, SI007, SI008, SI009
CI003 Series C proceeds were earmarked for roadmap execution, global expansion, and multiple large-scale deployments. High SI001, SI008
CI004 D-Matrix disclosed a $110 million Series B in 2023 to begin commercializing Corsair after prior chiplet launches. Medium SI004
CI005 The Series B announcement said the company planned to use proceeds for recruitment and commercialization to meet demand for lower-cost inference infrastructure. Medium SI004
CI006 Public financing materials identify hyperscale, enterprise, and sovereign customers as the intended buying base. Medium SI001, SI008
CI007 Corsair was sampling to early-access customers in November 2024 and was targeted for broad availability in Q2 2025. Medium SI003
CI008 JetStream samples were available in September 2025 with full production expected by year-end 2025. Medium SI023, SI006
CI009 The combined Gimlet Cloud offering was targeted for select-customer availability in the second half of 2026. Medium SI026
CI010 Official materials say d-Matrix is collaborating with OEMs and system integrators to qualify Corsair-based solutions. Medium SI003, SI005
CI011 Public partner materials name Supermicro, Arista, Broadcom, GigaIO, and Liqid as deployment or integration partners around the d-Matrix stack. Medium SI021, SI022
CI012 The d-Matrix ecosystem page says partner logos are illustrative and not actual customers. Medium SI021
CI013 JetStream is designed as a standard PCIe card that connects to off-the-shelf top-of-rack Ethernet switches. Medium SI006, SI022
CI014 JetStream's published specification includes 400 Gbps bandwidth, PCIe Gen5 x16, and 150W max TDP with transceivers. Medium SI006
CI015 The Corsair white paper says an 8-card inference server can provide up to 2 TB of capacity memory. Medium SI005
CI016 The same white paper says an 8-card inference server provides 16 GB of performance memory and 1,200 TB/s of performance-memory bandwidth. Medium SI005
CI017 Official and partner materials repeatedly claim up to 10x faster speed, 3x better cost-performance, and 3x to 5x better energy efficiency versus GPU-based systems. Medium SI001, SI003, SI006, SI018
CI018 Both the Corsair white paper and the JetStream brief say performance, cost, and power estimates are preliminary and subject to change. High SI005, SI006
CI019 GigaIO says its SuperNODE can support dozens or 64+ Corsair accelerators in a single node and simplify deployment relative to multi-node setups. Medium SI018, SI021
CI020 SquadRack publicly combines Corsair, JetStream, Supermicro servers, Broadcom switches, Arista Ethernet, and Aviator software in one rack-scale reference architecture. Medium SI022
CI021 D-Matrix says scaling beyond SRAM into 3D-stacked DRAM is necessary to support larger reasoning models and higher token consumption. Medium SI025, SI027
CI022 The Alchip collaboration makes advanced ASIC design, packaging, and manufacturing management explicit dependencies of the next-generation roadmap. Medium SI025
CI023 D-Matrix's public workload story complements GPUs in heterogeneous inference stacks rather than replacing GPU training or all inference phases outright. Medium SI026, SI028, SI029
CI024 Gimlet reported that running the speculative decoder on Corsair improved end-to-end request latency by 2x to 10x at equivalent energy efficiency. Medium SI019
CI025 Gimlet reported that a 1.6B speculative decoder fits on two Corsair cards and that Corsair is air-cooled and rack-compatible. Medium SI019
CI026 The Microsoft Project Bonsai partnership provided early compiler and toolchain validation before broad commercial availability. Medium SI012
CI027 CNBC reported that Latitude spent nearly $200,000 per month on OpenAI and AWS at peak usage, showing how inference costs can swamp a small startup. Medium SI015
CI028 CNBC cited analyst estimates that OpenAI could spend about $40 million for one month of ChatGPT inference and Bing AI would need at least $4 billion of infrastructure. Medium SI015
CI029 The Washington Post reported that leading AI chatbots lose money on every chat because operating costs are so high. Medium SI017
CI030 TechTarget reported that ChatGPT-class deployments require not just GPUs but meaningful networking and power-management upgrades. Medium SI016
CI031 VentureBeat reported that the perceived GPU shortage often reflects board-level component and systems bottlenecks rather than a lack of GPU dies alone. Medium SI013
CI032 CNN reported that AI chip shortages are shaped by supply-chain bottlenecks such as advanced packaging and interposer constraints. Medium SI014
CI033 Bloomberg explained that repeatedly moving data between memory and processors is a major electricity cost in AI systems. Medium SI011
CI034 The Corsair white paper says chiplets improve yields and lower costs while moving the architectural problem toward interconnect design. Medium SI005
CI035 AIMultiple reports that cloud GPU list prices for the same model can differ several times over across providers. Medium SI020
CI036 No public Corsair or JetStream price sheet appears in the reviewed sources, so list price, discounting, and realized ASP remain undisclosed. Medium SI003, SI006, SI021, SI023
CI037 Reviewed official, investor, and independent materials disclose funding and deployment plans but no revenue or ARR figure. Medium SI001, SI007, SI008, SI009
CI038 Reviewed sources do not disclose gross margin, contribution margin, or gross profit dollars for d-Matrix. Medium SI001, SI003, SI005, SI006
CI039 No reviewed public source discloses d-Matrix cash on hand, monthly burn, or runway months. Medium SI001, SI004, SI007, SI008
CI040 No reviewed public source discloses debt facilities or project-finance obligations for d-Matrix. Medium SI001, SI004, SI007, SI008
CI041 Public customer evidence remains partner and benchmark oriented; the source pack does not provide a broad named-customer roster or booked-customer count. Medium SI001, SI019, SI021, SI026
CI042 GFM Review argues that startups like d-Matrix still need substantial capital investment, brand building, and proof at scale to challenge Nvidia's ecosystem. Low SI031
CI043 NVIDIA's 2026 annual report says competitive data-center AI platforms are sold as co-designed chips, networking, systems, software, and paid enterprise software licenses. Medium SI030
CI044 NVIDIA's 2026 annual report says the company may place non-cancellable component orders, pay premiums or deposits, and absorb inventory provisions when demand shifts. Medium SI030
CI045 NVIDIA's filing defines cost of revenue to include wafers, packaging, board and device costs, memory, shipping, warranty, and inventory charges. Medium SI030
CI046 NVIDIA reported fiscal 2026 capex of $6.1 billion and said gross margin fell to 71.1% partly because full-scale datacenter solutions and a $4.5 billion excess-inventory charge weighed on results. Medium SI030
CI047 D-Matrix's public financial case is strongest as a customer-economics thesis rather than a publicly proven d-Matrix margin profile. Medium SI017, SI018, SI019, SI030
CI048 With $450 million of disclosed financing but no public cash, burn, or revenue data, d-Matrix still appears financing-dependent for roadmap execution and deployment scaling. Medium SI001, SI004, SI007, SI008
CI049 The go-to-market motion appears account-driven and qualification-heavy because official materials emphasize sampling, OEM qualification, and select-customer deployment windows. Medium SI003, SI005, SI026
CI050 Because JetStream adds dedicated networking hardware, power draw, and switch infrastructure on top of accelerator cards, d-Matrix's TCO case depends on whole-cluster design rather than chip benchmarks alone. Medium SI006, SI022, SI029
CI051 In a 2024 retrospective, management said d-Matrix came within two weeks of running out of cash in 2020 before reframing its fundraising strategy, showing capital availability has been existential since the company's earliest phase. Medium SI032
CE001 d-Matrix's current product surface consists of Corsair accelerators, JetStream networking, Aviator software, and the SquadRack reference rack architecture. High SE003, SE005, SE006, SE023
CE002 Corsair is packaged as an industry-standard PCIe Gen5 full-height full-length card and is sold in single-card and dual-card configurations for standard datacenter servers. High SE003, SE007, SE009
CE003 A single Corsair card is specified with 2 GB of on-chip performance memory, up to 256 GB of off-chip capacity memory, 150 TB/s memory bandwidth, and 2400 TFLOPs of MXINT8 peak compute. High SE007, SE009
CE004 A dual-card Corsair configuration is specified with 4 GB of performance memory, 300 TB/s of bandwidth, up to 512 GB of capacity memory, and cross-card DMX Bridge connectivity. High SE003, SE009
CE005 The published rack configuration uses eight servers and 64 Corsair cards to reach 128 GB of performance memory and 9.6 PB/s aggregate bandwidth. High SE003, SE009
CE006 d-Matrix built Corsair around DIMC because it views generative inference as memory-bound during token generation rather than only compute-bound. High SE007, SE009, SE016
CE007 Within Corsair, DMX Link provides die-to-die connectivity inside the package and DMX Bridge extends that connectivity across two cards. High SE007, SE009
CE008 Aviator's build flow comprises Model Factory, Compressor, and Compiler, while execution runs through Inference Engine and Host Runtime. High SE003, SE009, SE021
CE009 Aviator integrates with PyTorch, MLIR, Triton DSL, Kubernetes device plugins, metrics export, debugging, and profiling workflows. Medium SE009
CE010 JetStream is a transparent NIC that extends device-initiated accelerator-to-accelerator communication across nodes by bypassing host-orchestrated transfers. High SE010, SE014, SE015
CE011 JetStream is specified as a PCIe Gen5 x16 Ethernet card with 400 Gbps maximum bandwidth, QSFP-DD optics, 150 W max TDP, and secure boot support. High SE006, SE010
CE012 d-Matrix claims that JetStream combined with Corsair and Aviator can scale beyond 100B-parameter models and deliver up to 10x speed, 3x cost-performance, and 3x energy-efficiency gains versus GPU-based alternatives. Medium SE006, SE010
CE013 SquadRack is an eight-node rack blueprint that combines Corsair accelerators, JetStream I/O, Supermicro servers, Broadcom PCIe switches, Arista leaf switches, and Aviator software. High SE005, SE013
CE014 SquadRack is designed to remain air-cooled and configurable by rack power or rack-height constraints rather than requiring special liquid-cooling infrastructure. Medium SE013
CE015 GigaIO's SuperNODE integration is positioned to host dozens of Corsair accelerators in one node and publicly cites 30,000 tokens per second at 2 milliseconds per token for Llama3 70B. Medium SE023, SE026
CE016 Gimlet Labs reported that using Corsair for a speculative-decoding draft model delivered 2-5x end-to-end request speedups at equal energy relative to a GPU-only draft-model configuration. Medium SE027
CE017 d-Matrix argues that prefill is compute-bound while decode is memory-bound, so heterogeneous pipelines should mix GPUs with memory-centric accelerators instead of forcing every phase onto one device class. Medium SE016, SE017, SE018
CE018 In d-Matrix's published deployment model, speculative decoding lets Corsair run the memory-bound speculator stage while a larger GPU-resident model performs verification. Medium SE017, SE027
CE019 Keyformer is d-Matrix's published KV-cache sparsification technique and its example reports baseline accuracy with 50% of prompt KV cache plus 2.1x latency improvement and up to 2.4x token-throughput improvement. Medium SE020
CE020 dmx.compressor is a torch.fx-based quantization toolkit for PTQ and QAT that is meant to feed compressed graphs into Aviator Compiler. Medium SE021, SE009
CE021 Before Corsair, d-Matrix publicly described Nighthawk and then Jayhawk as chiplet platforms for in-memory inference, with Jayhawk introduced in 2023. Medium SE024, SE025
CE022 Jayhawk was presented as a modular chiplet platform on TSMC 6nm with 16 Gbps per wire bandwidth and less than 0.5 pJ/bit energy efficiency over organic substrates. Medium SE024, SE025
CE023 d-Matrix says 3DIMC has been validated on Pavehawk test silicon and is planned to debut commercially in the Raptor accelerator, the successor to Corsair. Medium SE008, SE011
CE024 d-Matrix claims that the future Raptor 3DIMC product will use 3D-stacked DRAM and deliver up to 10x faster inference than HBM4-based solutions. Medium SE008, SE011
CE025 The strategic purpose of 3DIMC is to extend d-Matrix's low-latency SRAM-centric architecture to larger-memory models and more demanding agentic pipelines. Medium SE011, SE012
CE026 d-Matrix's deployment model depends on partner qualification across server, switch, and rack layers rather than shipping a fully closed vertical appliance on its own. Medium SE004, SE007, SE013, SE026
CE027 The 3DIMC roadmap depends specifically on Alchip for ASIC and packaging expertise, making manufacturing execution partly external to d-Matrix. Medium SE008
CE028 The earlier Jayhawk platform also depended on TSMC process technology and BoW or UCIe-style chiplet ecosystems, reinforcing foundry and packaging exposure. Medium SE024, SE025
CE029 d-Matrix's current deployment story is intentionally open-standards based, using PCIe cards, standard Ethernet, and off-the-shelf top-of-rack switches rather than proprietary interconnect domains. High SE003, SE010, SE013
CE030 That open approach should lower retrofit friction in existing datacenters, but it also makes system quality dependent on partners such as Supermicro, Arista, Broadcom, GigaIO, and Liqid. Medium SE004, SE005, SE013
CE031 The incumbent GPU roadmap emphasizes HBM, NVLink, and increasingly liquid-cooled rack-scale systems, which differs materially from d-Matrix's PCIe-plus-Ethernet design center. Medium SE028, SE029, SE030
CE032 NVIDIA publishes H100 with 188 GB HBM3, H200 with 141 GB HBM3e and 4.8 TB/s bandwidth, and GB200 NVL72 as a liquid-cooled 72-GPU rack with 130 TB/s rack communication. Medium SE028, SE029, SE030
CE033 Independent market commentary says enterprise buyers increasingly want specialized inference hardware for data control, latency, and predictable cost rather than only general-purpose GPUs. Medium SE031, SE032
CE034 Independent market commentary also says NVIDIA still dominates accelerator share and packaging access, which raises the bar for any challenger that depends on external foundry and packaging capacity. Medium SE032, SE034
CE035 d-Matrix has at least one external practitioner validation signal because Gimlet publicly benchmarked Corsair in a speculative-decoding workflow. Medium SE027
CE036 The strongest public proof of deployment maturity is still at the reference-architecture and partner-demo layer rather than in named production customer case studies. Medium SE004, SE005, SE023, SE026
CE037 JetStream's published specification lists secure boot support, which is the clearest explicit product-security control surfaced in the reviewed corpus. Medium SE010
CE038 The reviewed official homepage, about, technology, product, ecosystem, and news surfaces do not publish SOC 2, ISO 27001, FedRAMP, or a public trust-center or status-page reference. Medium SE001, SE002, SE003, SE004, SE005
CE039 The reviewed corpus does not provide public MTBF, uptime, RMA, or field-failure metrics for Corsair or JetStream. Medium SE001, SE003, SE005, SE009, SE010
CE040 Company performance claims for Corsair, JetStream, and 3DIMC remain either preliminary or self-reported, so the independent benchmark base is still narrow. Medium SE009, SE010, SE027, SE033
CE041 The Raptor and Pavehawk roadmap has public architecture claims but no independently verified production silicon, customer deployments, or broad benchmark suite in this chapter corpus as of runDate. Medium SE008, SE011, SE023, SE033
CE042 d-Matrix's public developer surface is improving because Aviator builds on open-source tools and dmx.compressor points users to a GitHub repo, but the ecosystem still looks thinner than NVIDIA's enterprise software stack. Low SE009, SE021, SE028
CE043 d-Matrix positions its hardware for reasoning, agents, and video-generation inference workloads where token growth and KV-cache pressure amplify memory bottlenecks. Medium SE007, SE012, SE019
CE044 The hybrid SRAM-plus-DRAM roadmap gives d-Matrix a technical middle ground between tiny SRAM-only deployments and massive HBM-heavy GPU clusters. Medium SE011, SE012, SE029
CE045 JetStream carries accelerator-to-accelerator traffic over standard Ethernet and standard optics, so rack networking depends on conventional datacenter switching rather than a bespoke fabric. High SE010, SE014
CE046 Aviator's decoupled host and on-card runtime is intended to reduce launch latency by letting the host enqueue work independently of the card's current execution state. Medium SE009
CE047 d-Matrix says it built JetStream because traditional RDMA NICs would not satisfy the latency target or preserve decoupled execution across nodes. Medium SE010, SE014
CE048 d-Matrix explicitly pitches SquadRack to cloud providers, sovereign clouds, and enterprises that want high-performance inference without redesigning the entire datacenter around proprietary infrastructure. Medium SE005, SE013
CE049 d-Matrix's recent technical framing argues that rising context lengths make inference a context-management problem as much as a compute problem, increasing pressure on memory capacity, KV-cache efficiency, and data movement. Medium SE020, SE036, SE041
CE050 d-Matrix argues that agentic and reasoning workloads magnify latency sensitivity because chained tool use, long contexts, and verifier steps make per-token delay matter more than peak batch throughput. Medium SE037, SE041
CE051 d-Matrix explicitly frames open standards as a prerequisite for broader AI adoption, reinforcing its use of PCIe, Ethernet, and standard datacenter components instead of proprietary rack domains. Medium SE013, SE038
CE052 d-Matrix's recent product messaging argues that competitive inference now depends on optimizing every layer from model compression and runtime to networking and infrastructure, not just the accelerator die. Medium SE022, SE039
CE053 d-Matrix published its own announcement of the GigaIO partnership, showing that the SuperNODE scale-up design is part of the company-authored deployment strategy rather than only a partner press claim. Medium SE026, SE035
CE054 Alchip's own announcement corroborates that the 3DIMC roadmap depends on external ASIC and advanced-packaging collaboration, reinforcing packaging execution as a real dependency rather than just company positioning. Medium SE008, SE040
CE055 d-Matrix's recent reasoning-at-enterprise-scale messaging treats context management, low latency, software-hardware co-design, and open deployment as one system recipe, which is consistent with but does not independently validate the broader product thesis. Medium SE037, SE039, SE041
CU001 d-Matrix says the Series C capital is meant to support hyperscale, enterprise, and sovereign customers. High SU002, SU016
CU002 SquadRack materials explicitly target cloud providers, sovereign clouds, and enterprises that need low-latency inference capacity. High SU003, SU030
CU003 The public sales narrative targets infrastructure buyers who want low-latency inference inside standard PCIe servers and Ethernet-based datacenters rather than bespoke GPU racks. Medium SU009, SU010, SU023
CU004 GigaIO frames the first scale-up commercialization path as enterprise deployment inside a single SuperNODE rather than as a named hyperscale production cluster. Medium SU007, SU020, SU029
CU005 Gimlet frames the joint offer around model providers, frontier AI labs, and AI-native cloud workloads that benefit from heterogeneous inference infrastructure. Medium SU008, SU021, SU027
CU006 d-Matrix says customer conversations with large cloud service providers helped shape its inference-first product thesis. Medium SU013
CU007 The disclosed proof base fits enterprise, sovereign, and specialist-cloud operators more clearly than it fits named hyperscale production accounts. Medium SU002, SU022, SU023, SU027
CU008 Corsair was being sampled to early-access customers in November 2024 and was targeted for broad availability in the second quarter of 2025. High SU006, SU028
CU009 JetStream samples were available by September 2025 with full production expected by the end of that year. Medium SU004
CU010 SquadRack configurations were slated to be available for purchase through Supermicro in the first quarter of 2026. High SU003, SU030
CU011 Series C materials say the new capital will support multiple large-scale deployments, but they do not identify those deployments by customer name. Medium SU002, SU016, SU017
CU012 The ecosystem page's Supermicro, GigaIO, and Liqid statements are partner endorsements rather than disclosed end-customer case studies. Medium SU001, SU006
CU013 The d-Matrix ecosystem page explicitly says the showcased partner logos are not actual customers. Medium SU001
CU014 GigaIO public materials include an early-access call to action for SuperNODEs running Corsair, indicating an evaluation funnel rather than a disclosed installed base. Medium SU020, SU029
CU015 The d-Matrix and Gimlet combined solution was planned for select customers in the second half of 2026, so it remains pre-broad-availability at the run date. High SU008, SU026, SU027
CU016 Gimlet is a named external operator planning to deploy Corsair alongside GPUs in Gimlet Cloud. High SU008, SU027
CU017 Gimlet's technical writeup shows a concrete evaluated workload using gpt-oss-120b with a 1.6B speculative decoder on Corsair and GPU. Medium SU021
CU018 Gimlet says two Corsair cards can host the speculative decoder and that the cards are air-cooled and rack-compatible, which lowers pilot friction inside existing datacenters. Medium SU021
CU019 GigaIO says a single SuperNODE can support dozens of Corsair accelerators, making scale-up inside one node the first enterprise expansion path before broader scale-out. Medium SU020, SU029
CU020 SquadRack extends the expansion path from one node to an eight-server rack and then to hundreds of nodes across multiple racks. High SU003, SU030
CU021 d-Matrix repeatedly frames commercialization as land-and-expand inside existing datacenters using standard PCIe servers and standard Ethernet networking. Medium SU009, SU010, SU030
CU022 Existing-datacenter compatibility, air cooling, and standards-based networking are central to the company's appeal for enterprise and sovereign buyers with infrastructure constraints. Medium SU008, SU009, SU027
CU023 Public commercialization depends on OEM and integrator partners including Supermicro, GigaIO, Arista, Broadcom, and Liqid rather than on a closed turnkey appliance sold only by d-Matrix. Medium SU001, SU003, SU023
CU024 That partner-led route can widen reach but also makes expansion sensitive to OEM qualification, support handoffs, and channel execution. Medium SU023, SU027, SU030
CU025 Enterprise private-inference buyers often choose on-premises or near-edge deployments for data control, latency, and predictable cost, which matches d-Matrix's positioning. Medium SU023
CU026 Major system integrators package accelerators into turnkey servers, so d-Matrix's OEM relationships are a core part of customer conversion rather than a side channel. Medium SU023, SU030
CU027 AIMultiple's 2026 cloud-provider tiering suggests that d-Matrix's disclosed proof today speaks more clearly to specialist AI clouds and enterprise operators than to named hyperscalers. Medium SU022, SU027
CU028 No reviewed public source discloses d-Matrix's current active customer count by account or by segment as of 2026-05-26. Medium SU002, SU016, SU017, SU019
CU029 No reviewed public source discloses d-Matrix's net revenue retention. Medium SU002, SU014, SU016
CU030 No reviewed public source discloses gross retention, churn, or renewal rates for d-Matrix deployments. Medium SU002, SU016, SU017
CU031 No reviewed public source discloses average contract length, backlog conversion, or repeat-order cadence for d-Matrix customers. Medium SU002, SU016, SU019
CU032 No reviewed public source discloses top-customer concentration, channel mix, or revenue contribution by partner versus end customer. Medium SU002, SU015, SU016
CU033 Customer evidence is materially fresher on partner integrations and deployment architecture than on paid production-account counts or cohort durability. Medium SU015, SU016, SU027, SU030
CU034 GFM Review argues that challengers like d-Matrix must persuade customers to adopt new technologies that may require significant changes to existing workflows, and the GroqCloud and Cerebras product pages show that buyers can compare d-Matrix against other non-Nvidia inference alternatives during that evaluation. Medium SU025, SU034, SU035
CU035 GFM Review also argues that widespread adoption depends on proving consistent, reliable performance at scale against Nvidia's trusted ecosystem. Medium SU025
CU036 IntuitionLabs says Nvidia still dominates enterprise inference hardware while OEMs package alternative accelerators into deployable systems, and AMD's MI350 page separately markets drop-in compatibility plus Kubernetes-friendly deployment, underscoring how much deployment polish d-Matrix must match to reduce procurement inertia. Medium SU023, SU033
CU037 Forbes argues Nvidia keeps lowering deployment friction through software, solutions, and datacenter-scale offerings, which raises the bar for d-Matrix customer adoption. Medium SU024
CU038 Data Center Knowledge says the value of the Gimlet approach depends on an abstraction layer that lets developers use heterogeneous chips without rewriting code. Medium SU027
CU039 Data Center Knowledge reports that initial Gimlet targets include frontier AI labs and that cloud and enterprise deployment at scale remain the strategic goal. Medium SU027
CU040 d-Matrix's Series C release claims rapid customer growth, but the absence of counts, cohort metrics, or named production accounts means that growth cannot be underwritten quantitatively. Medium SU002, SU014, SU016
CU041 Supermicro purchase availability in Q1 2026 shifts proof from reference architecture toward channel transactability, but still not toward a disclosed installed-base metric. Medium SU003, SU030
CU042 GigaIO calls the 2025 announcement the next phase of a strategic partnership, suggesting d-Matrix moved from component collaboration toward a more integrated system sales story. Medium SU020, SU029
CU043 Supermicro, GigaIO, and Liqid all pitch TCO, deployment speed, or inference flexibility, but none of the public statements quantify live end-customer outcomes by logo. Medium SU001, SU003, SU029
CU044 Both the Gimlet partnership page and GigaIO's AIwire press carry explicit interest or early-access calls to action, indicating that the funnel still includes invitation-driven evaluation motions. Medium SU026, SU029
CU045 The central customer diligence issue is the gap between credible partner validation and the still-undisclosed number of production customers, retention, and concentration metrics. Medium SU001, SU002, SU027, SU029
CR001 JetStream was announced with sample availability and a year-end production target, so public proof still sits at an early commercialization stage rather than a mature installed base. Medium SR002
CR002 SquadRack is presented as a reference blueprint built with Arista, Broadcom, and Supermicro rather than a closed appliance, so deployment readiness depends on partner qualification. Medium SR001
CR003 d-Matrix says it built a transparent NIC because standard networking approaches would not hit the latency targets needed for distributed inference. High SR005, SR006
CR004 JetStream adds accelerator-to-accelerator transfers, dedicated switching paths, and standard-Ethernet coordination that increase integration complexity across NIC, switch, server, and software layers. High SR002, SR005, SR006
CR005 The 3DIMC roadmap is still pre-commercial because Pavehawk is described as lab validated and Raptor as the planned first product use rather than a shipping platform. High SR003, SR004
CR006 The going-vertical post describes Pavehawk as d-Matrix’s first crack at the next memory problem and emphasizes lab stress testing instead of customer deployment or production-yield evidence. Medium SR004
CR007 d-Matrix’s five-year retrospective says the company pivoted from analog IMC to digital IMC and later from early transformer workloads to generative inference, showing technical flexibility but also roadmap volatility. Medium SR026
CR008 d-Matrix argues that reasoning-heavy workloads after the DeepSeek moment increase inference-time compute and memory pressure, which makes the thesis more relevant but also forces the roadmap to keep pace with rapidly changing model behavior. Medium SR008
CR009 eeNews Europe reported Jayhawk as a second-generation chiplet platform tied to TSMC 6nm and open chiplet interfaces, reinforcing that packaging and interconnect execution have long been core dependencies. Medium SR012
CR010 SiliconANGLE reported that a future Raptor generation is expected to move from 6nm to 4nm, which would raise qualification and supply-chain execution sensitivity if timing slips. Medium SR010
CR011 CNN reported that silicon interposers and advanced packaging are major AI-chip bottlenecks, suggesting smaller accelerator vendors can face supply risk even when end demand is healthy. Medium SR014
CR012 VentureBeat reported acute Nvidia GPU shortages during the AI boom, showing how upstream component shortages can distort pricing and lead times across adjacent inference ecosystems. Medium SR013
CR013 SquadRack and the GigaIO partnership show that d-Matrix depends on an external ecosystem of servers, switches, interconnects, and scale-up platforms rather than a vertically integrated manufacturing stack. High SR001, SR024
CR014 The Gimlet partnership describes select-customer availability in 2H 2026, which is encouraging external validation but still early compared with broad production deployment evidence. High SR025, SR031
CR015 d-Matrix’s public materials still emphasize reference architectures, early access, and partner launches more than a named production customer roster, leaving adoption depth under-evidenced. Medium SR001, SR025, SR031
CR016 TechTarget argued that enterprises still need significant compute, networking, and power investment for generative AI, implying sales cycles can stall even when accelerator efficiency is attractive on paper. Medium SR015
CR017 The Washington Post argued that chatbot providers were losing money on every chat in the early LLM buildout, so customer unit economics must improve for buyers to accept switching and integration risk. Medium SR016
CR018 d-Matrix says Corsair ships as an air-cooled standard PCIe card, which lowers retrofit friction but does not itself prove field reliability or supportability at scale. High SR025, SR031
CR019 The careers page shows d-Matrix operating across Santa Clara, Sydney, Bengaluru, Toronto, and Belgrade, broadening talent access but increasing coordination demands for a hardware-software platform. Medium SR030
CR020 The retrospective says d-Matrix nearly ran out of cash in 2020 before raising follow-on funding, showing that the company has already experienced financing fragility during a long hardware gestation period. Medium SR026
CR021 Data Center Dynamics and SiliconANGLE reported that d-Matrix raised $275 million in late 2025 at a $2 billion valuation, bringing total funding to roughly $450 million before broad commercialization. Medium SR009, SR010
CR022 The same financing coverage says the new capital is meant to support commercialization, global expansion, and large-scale deployments, which implies continued capital intensity ahead of mature revenue proof. Medium SR009, SR010
CR023 BIS said in October 2023 that the United States was strengthening export controls on advanced computing semiconductors and circumvention pathways to countries of concern, making trade policy an active risk variable for advanced AI chip supply chains. Medium SR029
CR024 d-Matrix’s privacy policy is a website data-handling disclosure that mentions reasonable safeguards and compliance-by-law, but it is not a published product-security assurance package for datacenter buyers. Medium SR027
CR025 d-Matrix’s terms of use include broad warranty disclaimers for the site, while the reviewed public corpus did not surface a dedicated trust center, SOC 2, ISO 27001, or similar enterprise-control page for the product stack. Medium SR027, SR028
CR026 Intuition Labs says private inference buyers care about data control, predictable cost, and deployment support, so public compliance and security gaps can directly slow adoption in regulated or sovereign environments. Medium SR022
CR027 Groq publicly markets zero-data-retention and on-prem air-gapped options, and NVIDIA markets enterprise security features, which raises the disclosure bar for d-Matrix on security posture. Medium SR020, SR017
CR028 NVIDIA H200 increases incumbent memory and bandwidth for inference, and GB200 pushes the competitive bar to rack-scale systems, so d-Matrix competes against a moving full-stack target rather than one static GPU baseline. High SR017, SR018
CR029 AMD MI300X offers 192GB of HBM3 and 5.3 TB/s bandwidth, blunting the argument that memory-hungry inference buyers have no mainstream alternatives to d-Matrix. Medium SR019
CR030 Groq and Cerebras both market purpose-built inference platforms with public deployment narratives, so d-Matrix is competing inside a crowded specialist category rather than a greenfield niche. Medium SR020, SR021
CR031 Intuition Labs describes switching costs as stack-wide across software, networking, cooling, and operations, which means d-Matrix must overcome ecosystem lock-in as well as raw chip performance. Medium SR022
CR032 Silicon Analysts estimates NVIDIA keeps dominant AI accelerator share through 2026, implying challengers like d-Matrix still fight for a relatively narrow share-of-wallet unless they win very specific workloads or power-constrained segments. Medium SR023
CR033 Bloomberg described the AI energy crisis as a data-movement problem, which validates d-Matrix’s architecture thesis but also means the company must prove rack-level efficiency rather than only chip-level superiority. High SR011, SR007
CR034 d-Matrix’s own disaggregation materials argue that prefill, decode, memory, and networking must be co-optimized, which means system-level overhead can erode theoretical efficiency gains if integration is poor. High SR006, SR007
CR035 SquadRack, the Gimlet materials, and d-Matrix’s brownfield-retrofit blog all stress air-cooled standard-server deployment and reuse of spare data-center capacity, so a key diligence question is whether those benefits persist once networking, orchestration, and support overhead are included in real customer environments. High SR001, SR025, SR036
CR036 The highest residual execution risk is synchronized delivery across Corsair, JetStream, Aviator, OEM partners, and the Raptor roadmap because no single layer is sufficient on its own to win production deployments. High SR001, SR002, SR003, SR024
CR037 A practical thesis-break trigger is a material slip in JetStream or Raptor readiness relative to customer procurement cycles because current proof is still early and partner-led. High SR002, SR003, SR025
CR038 A second thesis-break trigger is failure to convert early partner and reference deployments into named production customers by 2026 or 2027, which would leave the $2 billion valuation resting mostly on architecture promise. Medium SR009, SR010, SR031
CR039 A third thesis-break trigger is evidence that incumbent GPU or specialist inference alternatives close the latency-per-watt gap without d-Matrix’s integration burden, removing the switching rationale. Medium SR017, SR018, SR020, SR021, SR022
CR040 A fourth thesis-break trigger is a material export-control or advanced-packaging disruption that stalls next-generation product availability or worsens unit economics. Medium SR029, SR014, SR013
CR041 Public risk mitigation today is more mature around architectural positioning and partner announcements than around disclosed reliability metrics, compliance credentials, or customer concentration data. Medium SR001, SR002, SR025, SR027, SR028
CR042 No public MTBF, uptime, or field-failure metrics were identified in the reviewed risk corpus, leaving durability under production conditions unresolved. Low SR001, SR002, SR027, SR028
CR043 No public named production customer list or deployment count was surfaced in the reviewed risk corpus beyond partner, reference, and early-access announcements. Medium SR001, SR025, SR031
CR044 The 2026 careers footprint and global expansion narrative suggest d-Matrix is scaling organizationally before large public customer proof, which can raise burn and program-management pressure if bookings lag. Medium SR009, SR026, SR030
CR045 d-Matrix’s risk profile is dominated not by one binary technical flaw but by compounding dependencies across roadmap maturity, ecosystem integration, customer proof, compliance disclosure, and financing discipline. High SR001, SR003, SR009, SR022, SR029
CV001 The latest disclosed d-Matrix financing was a $275 million Series C at a $2 billion valuation, bringing total disclosed funding to $450 million. High SV001, SV002, SV003, SV004
CV002 The 2025 round was described as oversubscribed and co-led by Bullhound Capital, Triatomic Capital, and Temasek, with QIA, EDBI, M12, and other investors participating. High SV001, SV002, SV003, SV004
CV003 d-Matrix said the Series C proceeds would advance the roadmap, accelerate global expansion, and support large-scale deployments for hyperscale, enterprise, and sovereign customers. Medium SV001, SV002, SV003, SV004
CV004 The Series C key facts disclosed 250+ employees and offices in Toronto, Sydney, Bangalore, and Belgrade in addition to the Santa Clara headquarters. Medium SV001
CV005 The Series C materials described rapid customer growth but did not disclose revenue, ARR, or customer count. Medium SV001, SV002, SV003, SV004
CV006 The reviewed public materials also do not disclose gross margin, retention, free cash flow, or cap-table preference terms for d-Matrix. Medium SV001, SV008, SV009, SV010, SV011, SV012
CV007 d-Matrix's 2023 Series B raised $110 million to begin commercializing Corsair after earlier chiplet launches and a lower-TCO inference pitch. Medium SV008
CV008 The founder's retrospective says d-Matrix first raised a $40 million Series A after pivoting toward digital in-memory compute and later refocused Corsair around generative AI inference. Medium SV010
CV009 Official and partner GigaIO materials say the combined SuperNODE plus Corsair system targets around 30,000 tokens per second at 2 milliseconds per token while lowering TCO and power versus GPU-heavy approaches. Medium SV011, SV013
CV010 d-Matrix's March 2026 Gimlet announcement said select customers could access the joint solution in the second half of 2026. Medium SV012
CV011 Gimlet's technical writeup found a 2-10x end-to-end interactivity improvement versus a GPU-only speculative-decoding setup at comparable energy efficiency in the studied configuration. Medium SV014
CV012 d-Matrix's 2024 Cambrian or Forbes repost highlighted a company claim of 30,000 tokens per second at 2 milliseconds per token for Llama 70B in a single rack. Medium SV009
CV013 The investor and partner mix suggests the Series C underwrites inference demand growth, energy efficiency, and strategic infrastructure optionality more than disclosed financial scale. Medium SV001, SV002, SV003, SV011, SV012
CV014 The 2025 round looks more like milestone financing around architecture and deployment readiness than conventional ARR-backed late-stage software underwriting. Medium SV001, SV003, SV007, SV008, SV010, SV012, SV013, SV014
CV015 Public support for d-Matrix's economic claims remains heavily company-authored because 10x performance, 3x lower cost, and 3-5x better energy efficiency recur across company, investor, and partner materials. Medium SV001, SV002, SV003, SV004, SV007, SV011, SV012, SV013
CV016 Independent corroboration exists but is narrower because Gimlet evaluated a specific speculative-decoding configuration rather than a whole-platform same-model benchmark against incumbent clusters. Medium SV012, SV014
CV017 No source reviewed in this chapter discloses d-Matrix revenue, ARR, gross margin, or free cash flow, so an EV/revenue or EV/ARR underwriting model cannot be supported publicly. Medium SV001, SV004, SV005, SV006, SV007, SV008, SV009, SV010, SV011, SV012
CV018 NVIDIA markets H100 as an inference accelerator integrated with NVLink, InfiniBand, Magnum IO, and AI Enterprise. Medium SV018
CV019 NVIDIA says H200 adds 141 GB of HBM3e and 4.8 TB/s bandwidth for larger-model inference. Medium SV019
CV020 NVIDIA markets GB200 NVL72 as a liquid-cooled rack-scale system with 72 Blackwell GPUs, 36 Grace CPUs, and a 130 TB/s NVLink domain. Medium SV020
CV021 AMD markets MI300X as a 192 GB HBM3 accelerator with 5.3 TB/s bandwidth and ROCm software support for large-model AI. Medium SV021
CV022 Groq publicly exposes inference service tiers and an on-prem GroqRack option, making its commercial packaging more visible than d-Matrix's current public pricing. Medium SV022
CV023 Cerebras positions wafer-scale hardware for both training and inference, which is a very different deployment model from a modular PCIe-card strategy. Medium SV023
CV024 AIMultiple and Forbes both frame inference hardware as a market still led by NVIDIA but increasingly judged on cost, throughput, and energy efficiency. Medium SV024, SV025
CV025 Silicon Analysts estimates NVIDIA still holds about 80-90% of AI accelerator revenue in 2025, indicating a highly concentrated market. Low SV026
CV026 NVIDIA's FY2026 10-K reports $215.9 billion of revenue and $193.737 billion of Data Center revenue. High SV029, SV032, SV033, SV035
CV027 NVIDIA's Q1 FY2027 10-Q reports $81.615 billion of quarterly revenue and 74.9% gross margin for the quarter ended April 26, 2026. Medium SV032, SV033, SV035
CV028 AMD's FY2025 10-K reports $34.639 billion of total revenue, $16.635 billion of Data Center revenue, and 50% gross margin. High SV030, SV031, SV034, SV036
CV029 AMD's Q1 2026 10-Q reports $10.253 billion of quarterly revenue and 53% gross margin for the quarter ended March 28, 2026. Medium SV031, SV034, SV036
CV030 Because NVIDIA and AMD disclose multi-billion revenue and filing-backed margins while d-Matrix discloses none, public GPU-company valuation logic cannot be transferred mechanically onto d-Matrix. Medium SV001, SV029, SV030, SV031, SV032
CV031 Low-confidence secondary coverage portrays SambaNova as another integrated enterprise platform, reinforcing that private challengers span multiple business models rather than one clean comparable set. Low SV022, SV023, SV027
CV032 The best-supported use of NVIDIA and AMD in this chapter is as scale fences and disclosure benchmarks, not as direct pricing anchors for a private startup with hidden denominator metrics. Medium SV018, SV019, SV020, SV021, SV029, SV030
CV033 CRN and Silicon Republic showed d-Matrix had startup mindshare by 2023-2024, but press attention is not equivalent to named production-customer disclosure. Medium SV016, SV017
CV034 Partner references with GigaIO and Gimlet show ecosystem progress, but they still stop short of public recurring-revenue or customer-concentration disclosure. Medium SV011, SV012, SV013, SV014
CV035 Groq's packaging is more commercially legible than d-Matrix's because public plan tiers and on-prem optionality are visible while d-Matrix public pricing remains absent. Medium SV001, SV011, SV012, SV022
CV036 Cerebras and SambaNova-style integrated systems imply a different buyer motion from d-Matrix's modular PCIe-card and NIC approach. Low SV013, SV023, SV027
CV037 Bloomberg and the market-report sources reinforce that energy, power, and infrastructure cost are central to inference buying and help explain why investors may pay for efficiency narratives. Medium SV015, SV024, SV028
CV038 The same sources also show that high power requirements, data-security concerns, and supply-chain stress remain structural adoption risks for inference hardware. Low SV015, SV028
CV039 Concentrated incumbent share and stack lock-in argue against paying a startup scarcity premium unless d-Matrix proves repeatable production wins. Medium SV018, SV020, SV022, SV024, SV025, SV026
CV040 The Series C price can be rationalized only if investors believe inference demand will shift materially toward specialized hardware and d-Matrix will win a meaningful share of that spend. Medium SV001, SV002, SV003, SV015, SV024, SV025, SV026
CV041 A reasonable bull case requires named scale deployments, independent benchmark normalization, and evidence that customers choose d-Matrix for production inference rather than pilots. Medium SV003, SV010, SV011, SV012, SV014, SV017
CV042 The base case assumes deployments continue and the company avoids a down round, but public evidence remains insufficient to prove premium economics above the latest private mark. Medium SV003, SV009, SV010, SV011, SV012, SV017
CV043 The bear case is plausible because commercialization proof is still thin relative to the scale of NVIDIA and AMD incumbency and the fragmentation of private challengers. Medium SV017, SV024, SV025, SV026, SV029, SV030
CV044 Valuation therefore has to be milestone-based rather than multiple-based because the denominator is hidden and the comp set is structurally mismatched. Medium SV017, SV029, SV030, SV031, SV032
CV045 This chapter's valuation ranges are scenario judgments around the last disclosed round, not market-cleared fair values or implied revenue-multiple outputs. Medium SV001, SV017, SV029, SV030, SV031, SV032
CV046 A bear-case valuation range of roughly $700 million to $1.0 billion reflects IP and partner optionality but assumes pilots stall or the next financing resets materially below Series C. Medium SV009, SV011, SV012, SV013, SV017, SV024, SV026
CV047 A base-case valuation range of roughly $1.2 billion to $1.8 billion assumes commercialization progresses and financing remains available, but public disclosure still does not justify a clear premium above the latest round. Medium SV001, SV003, SV010, SV011, SV012, SV015, SV017
CV048 A bull-case valuation range of roughly $2.5 billion to $3.5 billion requires named large deployments, third-party validated economics, and evidence that d-Matrix can hold a differentiated niche beside incumbent GPU infrastructure. Medium SV003, SV011, SV012, SV013, SV014, SV018, SV020
CV049 A 25% bear, 50% base, and 25% bull weighting centers the public-evidence midpoint around roughly $1.5 billion to $2.0 billion, below the last disclosed $2 billion round. Medium SV001, SV017, SV029, SV030, SV031, SV032
CV050 Because the Series C price sits above the chapter's midpoint and still requires unproven bull milestones, the current public evidence supports a stretched valuation stance. Medium SV001, SV017, SV029, SV030, SV031, SV032
CV051 The recommendation is research-more rather than buy because the company may be strategically relevant, but the price is ahead of public proof. Medium SV001, SV017, SV029, SV030, SV031, SV032
CV052 Confidence is medium because financing and product facts are real, yet the most decision-critical operating metrics remain undisclosed. Medium SV001, SV003, SV017
CV053 Risk rating is high because the investment case still depends on commercialization proof, supply-chain execution, and a financing or exit path that public evidence does not yet de-risk. Medium SV013, SV014, SV015, SV017, SV026, SV027, SV028
CV054 Entry discipline should require a materially lower price or hard data-room evidence on revenue quality and cap-table terms. Medium SV017, SV030, SV031, SV032
CV055 Thesis-breakers include a down round below roughly $1 billion, failure to convert pilots into named deployments, or independent benchmarks that negate the claimed TCO edge. Medium SV011, SV012, SV013, SV014, SV017, SV028
CV056 Diligence should request audited revenue or ARR, gross margin, customer concentration, bookings-to-deployment conversion, realized pricing, and preference-stack details before underwriting the Series C mark. Medium SV017, SV030, SV031, SV032
CV057 The reviewed corpus does not provide high-confidence private-market valuation data for Groq, Cerebras, or SambaNova comparable to d-Matrix's disclosed $2 billion round, so they can inform business-model comparison but not numeric pricing. Low SV022, SV023, SV027
CV058 Public evidence supports strategic sale or later secondary optionality more than a near-term IPO because d-Matrix has not reached public-company disclosure quality in the reviewed corpus. Medium SV017, SV029, SV030
Sources
IDPublisherTitleQuote
SO001 d-Matrix d-Matrix - Ultra-low Latency Batched Inference for Generative AI Chiplet-based design enables scaling SRAM-based architecture to power models up to 100B parameters.
SO002 d-Matrix About d-Matrix | Innovators in AI Compute Solutions & Technology d-Matrix is led by a team of dedicated entrepreneurs with a 20+ year history in building businesses that have shipped over 100M chips and generated over $5B in revenue.
SO003 d-Matrix d-Matrix News
SO004 d-Matrix d-Matrix Raises $275 Million to Power the Age of AI Inference Founded: 2019 | HQ: Santa Clara, CA | Global Offices: Toronto (Canada); Sydney (Australia); Bangalore (India); Belgrade (Serbia) | Employees: 250+ worldwide.
SO005 d-Matrix d-Matrix Unveils Corsair, the World’s Most Efficient AI Computing Platform for Inference in Datacenters Corsair is sampling to early-access customers and will be broadly available in Q2’2025.
SO006 d-Matrix d-Matrix Announces $110 Million in Series B Funding to Make Generative AI Commercially Viable with First-of-Its-Kind Inference Compute Platform d-Matrix was founded in 2019 to solve the memory-compute integration problem, which is the final frontier in AI compute efficiency.
SO007 Bullhound Capital Bullhound Capital leads $275M investment into AI inference leader d-Matrix d-Matrix ... has closed $275 million in Series C funding, valuing the company at $2 billion and bringing the total raised to date to $450 million.
SO008 Qatar Investment Authority QIA joins d-Matrix’s Series C USD 275m funding round Valued at USD 2 billion and bringing the total raised to date to USD 450 million, d-Matrix will use the new capital to advance their roadmap and accelerate global expansion.
SO009 Data Center Dynamics Chip startup d-Matrix raises $275m against $2bn valuation
SO010 SiliconANGLE Chip startup d-Matrix raises $275M to speed up inference with in-memory compute
SO011 Converge Digest d-Matrix Raises $275 Million to Accelerate the Age of AI Inference
SO012 CRN 10 Hottest Semiconductor Startups 2023
SO013 Silicon Republic 24 tech start-ups to watch in 2024
SO014 VentureBeat New Microsoft partnership accelerates generative AI development
SO015 CNN AI chips supply chain bottlenecks
SO016 CNBC ChatGPT and generative AI are booming, but the costs can be extraordinary
SO017 TechTarget Infrastructure to support OpenAI’s ChatGPT could be costly
SO018 The Washington Post The hidden cost of ChatGPT is all the computing power
SO019 Bloomberg AI Energy Crisis Boosts Need for More Efficient Chips
SO020 GigaIO GigaIO Partners with d-Matrix to Deliver Ultra-Efficient Scale-Up AI Inference Platform The new GigaIO SuperNODE platform, capable of supporting dozens of d-Matrix Corsair accelerators in a single node, is now the industry’s most scalable AI inference platform.
SO021 Gimlet Labs Low-Latency Inference with Speculative Decoding on d-Matrix Corsair and GPU Compared to the same speculative decoder on GPU and equivalent energy consumption, we've found that the Corsair-based solution delivers 2-5X end-to-end request speedup and up to 10X end-to-end speedup for energy-optimized configurations.
SO022 d-Matrix d-Matrix Announces JetStream I/O Accelerators Enabling Ultra-Low Latency for AI Inference at Scale JetStream NICs are full-height PCIe Gen5 cards delivering a maximum 400Gpbs bandwidth. Samples are available now, with full production expected by year-end.
SO023 d-Matrix d-Matrix Announces SquadRack, Industry’s First Rack-Scale Solution Purpose-Built for AI Inference at Datacenter Scale SquadRack™, the industry’s first blueprint for disaggregated standards-based rack-scale solutions for ultra-low latency batched inference.
SO024 d-Matrix d-Matrix and Alchip Announce Collaboration on World’s First 3D DRAM Solution to Supercharge AI Inference 3DIMC will commercially debut on the d-Matrix Raptor™ inference accelerator, the successor to d-Matrix Corsair™.
SO025 GFM Review The New Wave of AI Chips: Can Cerebras, d-Matrix and Groq take on Nvidia? These new entrants face several hurdles, including the need for substantial capital investment to scale production, challenges in building brand recognition, and the difficulty of competing against Nvidia’s established ecosystem.
SM001 d-Matrix Technology
SM002 d-Matrix Product Corsair is the first-of-its-kind AI compute platform. It offers unparalleled performance and efficiency for generative AI inference in the datacenter.
SM003 d-Matrix d-Matrix Corsair Redefines Performance and Efficiency for AI Inference at Scale Traditional accelerators often use costly and power-hungry High Bandwidth Memory (HBM)... d-Matrix breaks this memory barrier by integrating a multiplier directly into memory bit cell using a logic process.
SM004 d-Matrix JetStream Product Brief d-Matrix JetStream is a purpose-built network interface card (NIC) enabling efficient scaling of AI workloads and delivering ultra-low latency inference with Corsair clusters.
SM005 d-Matrix The Power of the Middle Lane: Why a Hybridized Approach to Memory Gives the Best of Both Worlds The per-token compute for a given inference has become the “cheap” part of the inference process, while that movement to and from memory is a massive “tax” per token generation.
SM006 d-Matrix Scaling AI the Right Way: Introducing Our Rack-level Inference Solution SquadRack is a reference rack blueprint ... based on rack power or rack height constraints. And it is air-cooled and doesn’t have any special cooling infrastructure requirements.
SM007 d-Matrix Why We Needed a New Transparent NIC Solution We consider there to be three total barriers to fast, efficient, high-performance AI inference: compute, memory, and I/O.
SM008 d-Matrix Why We Decoupled Execution to Accelerate I/O Rack-scale and multi-rack solutions with standard RoCE v2 systems accrue a large penalty scaling linearly due to the inherent costs of a system where the control plane and data plane both require a series of handshakes.
SM009 d-Matrix Why Modern AI Workloads Demand a Disaggregated Approach During decode, however, each new token is dependent on information from the prior one... making the speed of the response governed by the speed at which those trips to and from memory happen.
SM010 d-Matrix Batching Just Right: How Interactive Apps Serve as a New Battleground Lowering a batch size means a snappier, lower-latency user experience; while raising it increases overall throughput (and improves the economics).
SM011 d-Matrix What Is AI Inference and Why It Matters in the Age of Generative AI Generative AI models are orders of magnitude bigger ... They need multiple cards or servers to perform inference, consuming significantly more computational resources and energy.
SM012 d-Matrix Impact of the DeepSeek Moment on Inference Compute Reasoning models are highly memory bound and end up underutilizing the GPUs that are optimized for training.
SM013 d-Matrix Introducing dmx.compressor Quantization plays a key role in reducing memory usage, speeding up inference, and lowering energy consumption at inference time.
SM014 d-Matrix d-Matrix Unveils Corsair, the World’s Most Efficient AI Computing Platform for Inference in Datacenters Corsair delivers up to 10x faster interactive speed, 3x better performance per total cost of ownership (TCO), and 3x greater energy efficiency.
SM015 Data Center Dynamics Chip startup d-Matrix raises $275m against $2bn valuation It brings the total raised by d-Matrix to $450 million, which the company said it will use to advance its roadmap, accelerate global expansion, and support multiple large-scale deployments of its data center inference platform.
SM016 Bloomberg AI Energy Crisis Boosts Need for More Efficient Chips Every time the algorithm needs data, it goes to the library, known as a memory chip, checks out the data and takes it to another chip, known as a processor... a process that burns through lots of electricity.
SM017 VentureBeat Nvidia GPU shortage is top gossip of Silicon Valley The problem with inference is if the workload spikes very rapidly ... There is no way your GPU capacity can keep up with that because it was not built for that. It was built for training.
SM018 CNN AI chips supply chain is straining under demand Our datacenters depend on the availability of permitted and buildable land, predictable energy, networking supplies, and servers, including graphics processing units.
SM019 CNBC ChatGPT and generative AI are booming, but the costs can be extraordinary Curran believes that it could have cost OpenAI $40 million to process the millions of prompts people fed into the software that month.
SM020 TechTarget Infrastructure to support OpenAI’s ChatGPT could be costly Users have to build up not just compute, but their networking and power management.
SM021 GigaIO GigaIO Partners with d-Matrix to Deliver Ultra-Efficient Scale-Up AI Inference Platform This integration enables enterprises to deploy ultra-low-latency batched inference workloads at scale without the complexity of traditional distributed computing approaches.
SM022 Gimlet Labs Low-Latency Inference with Speculative Decoding on d-Matrix Corsair and GPU Compared to the same speculative decoder on GPU and equivalent energy consumption, we've found that the Corsair-based solution delivers 2-5X end-to-end request speedup on configurations optimized for interactivity, and up to 10X end-to-end speedup for energy-optimized configurations.
SM023 NVIDIA H100 Tensor Core GPU H100 extends NVIDIA’s market-leading inference leadership with several advancements that accelerate inference by up to 30X and deliver the lowest latency.
SM024 NVIDIA H200 Tensor Core GPU The H200 boosts inference speed by up to 2X compared to H100 GPUs when handling LLMs like Llama2.
SM025 NVIDIA GB200 NVL72 The NVIDIA GB200 NVL72 ... delivers 30x faster real-time trillion-parameter large language model inference, with 10x greater performance for mixture-of-experts architectures.
SM026 Groq Products The AI inference platform built for developers. Fast responses, scalable performance, and costs you can plan for.
SM027 AIMultiple AI Hardware Hyperscalers run broad cloud platforms with GPU rental as one product among many. Specialist neoclouds focus on GPU and AI infrastructure as their core product.
SM028 Forbes AI Inference Is King: Do You Know Which Chip Is Best? MarketsandMarkets projects that the AI inference market is expected to grow from $106.15 billion in 2025 to $254.98 billion by 2030.
SM029 Silicon Analysts Nvidia AI Accelerator Market Share 2024-2026 NVIDIA commands approximately 80-90% of the AI accelerator market by revenue as of 2025.
SP001 Data Center Dynamics Chip startup d-Matrix raises $275m against $2bn valuation Its full-stack platform is powered by the company’s Corsair inference accelerators, JetStream NICs, and Aviator software, which it claims can deliver 10X faster performance, 3X lower cost, and 3–5X better energy efficiency than GPU-based systems.
SP002 SiliconANGLE Chip startup d-Matrix raises $275M to speed up inference with in-memory compute
SP003 TechStartups AI chip startup d-Matrix raises $275M at $2B valuation to challenge Nvidia’s grip on AI inference Traditional GPUs, including Nvidia’s, move data constantly between memory and processing cores, consuming huge amounts of energy.
SP004 Bloomberg AI Energy Crisis Boosts Need for More Efficient Chips AI requires massive amounts of data... a process that burns through lots of electricity.
SP005 CRN 10 Hottest Semiconductor Startups of 2023 Nvidia may dominate the AI computing space... but that isn’t stopping a slew of semiconductor startup companies, such as Cerebras Systems, d-Matrix, Lightmatter and Tenstorrent.
SP006 Silicon Republic Start-ups to Watch in 2024 d-Matrix claims its chips are optimised to support generative AI systems and is competing in the same space as Nvidia.
SP007 eeNews Europe Second-generation chiplet platform targets generative AI The Jayhawk chiplet platform features... 16 Gbit/s/wire bandwidth and less than 0.5 pJ/bit energy efficiency.
SP008 HPCwire d-Matrix Launches New Chiplet Connectivity Platform to Address Growing Compute Demand for Generative AI By using a modular chiplet-based approach, data center customers can refresh compute platforms on a much faster cadence using a pre-validated chiplet architecture.
SP009 VentureBeat Nvidia GPU shortage is top gossip of Silicon Valley For LLM training... there is no substitute for Nvidia’s H100.
SP010 CNN AI chips supply chain bottlenecks The shortage could force companies to find creative ways around the problem.
SP011 CNBC ChatGPT and generative AI are booming, but the costs can be extraordinary At its peak in 2021, Latitude was spending nearly $200,000 a month on OpenAI’s software and Amazon Web Services.
SP012 TechTarget Infrastructure to support OpenAI’s ChatGPT could be costly Reports surfacing earlier this month indicate that just to develop training models and inferencing alone for OpenAI’s ChatGPT can require 10,000 Nvidia GPUs.
SP013 The Washington Post ChatGPT’s hidden cost is GPU compute The enormous cost of running today’s large language models is limiting their quality and threatening to throttle the global AI boom.
SP014 NVIDIA NVIDIA H100 H100 extends NVIDIA’s market-leading inference leadership... with several advancements that accelerate inference by up to 30X and deliver the lowest latency.
SP015 NVIDIA NVIDIA H200 The NVIDIA H200 is the first GPU to offer 141GB of HBM3e memory at 4.8TB/s.
SP016 NVIDIA NVIDIA GB200 NVL72 The NVIDIA GB200 NVL72... delivers 30x faster real-time trillion-parameter LLM inference.
SP017 AMD AMD Instinct MI300X Accelerators Dedicated Memory Size 192 GB... Peak Memory Bandwidth 5.3 TB/s... Supported Technologies AMD ROCm.
SP018 Groq Groq Products Available in public, private, or co-cloud instances.
SP019 Cerebras Cerebras Product Chip Four trillion transistors. 125 petaflops. One silicon wafer.
SP020 AIMultiple AI Hardware We benchmarked NVIDIA’s B200, H200, H100, and AMD’s MI300X to assess how well they scale for LLM inference.
SP021 Intuition Labs LLM Inference Hardware Enterprise Guide NVIDIA’s ecosystem advantages (CUDA software stack, TensorRT, etc.) create high switching costs.
SP022 Forbes AI Inference Is King: Do You Know Which Chip Is Best? Now for the real test: is the NVL72 as fast as Nvidia promised at launch? Yes, it is thirty times faster than the 8-GPU H200.
SP023 GFM Review The New Wave of AI Chips: Can Cerebras, d-Matrix, and Groq Take on Nvidia The specialized nature of Cerebras, d-Matrix, and Groq’s technologies allows these companies to carve out niches where their solutions can outperform Nvidia’s more generalized approach.
SP024 Silicon Analysts NVIDIA AI Accelerator Market Share 2024-2026 NVIDIA commands approximately 80-90% of the AI accelerator market by revenue as of 2025... priority TSMC CoWoS allocation maintain dominance.
SP025 Research and Markets via GlobeNewswire / Yahoo Finance AI inference - Company Evaluation Report, 2025 The market faces constraints such as the high power requirements of AI chips... and concerns around data security and supply chain disruptions.
SP026 NVIDIA NVIDIA L40S GPU Experience breakthrough multi-workload performance with the NVIDIA L40S GPU. Combining powerful AI compute with best-in-class graphics and media acceleration, the L40S GPU is built to power the next generation of data center workloads—from generative AI and large language model (LLM) inference and training to 3D graphics, rendering, and video.
SI001 d-Matrix d-Matrix Raises $275 Million to Power the Age of AI Inference D-Matrix has closed $275 million in Series C funding, valuing the company at $2 billion and bringing the total raised to date to $450 million.
SI002 d-Matrix d-Matrix launches Corsair for AI inference without GPUs / HBM Corsair offers performance of 60,000 tokens/second at 1 ms/token for Llama3 8B in a single server and 30,000 tokens/second at 2 ms/token for Llama3 70B in a single rack.
SI003 d-Matrix d-Matrix Unveils Corsair, the World's Most Efficient AI Computing Platform for Inference in Datacenters Corsair is sampling to early-access customers and will be broadly available in Q2'2025.
SI004 d-Matrix d-Matrix Announces $110 Million in Series B Funding to Make Generative AI Commercially Viable The goal of the fundraise is to enable d-Matrix to begin commercializing Corsair.
SI005 d-Matrix d-Matrix Corsair Redefines Performance and Efficiency for AI Inference at Scale Performance, cost and power estimates are preliminary and subject to change. Results may vary.
SI006 d-Matrix JetStream Product Brief JetStream Product Specification ... Maximum bandwidth 400 Gbps ... Max TDP (w/ transceivers) 150 W.
SI007 Bullhound Capital Bullhound Capital leads $275M investment into AI inference leader d-Matrix D-Matrix ... has closed $275 million in Series C funding, valuing the company at $2 billion and bringing the total raised to date to $450 million.
SI008 Qatar Investment Authority QIA joins d-Matrix's Series C USD 275m funding round Valued at USD 2 billion and bringing the total raised to date to USD 450 million, d-Matrix will use the new capital to advance their roadmap, accelerate global expansion and support multiple large-scale deployments.
SI009 Data Center Dynamics Chip startup d-Matrix raises $275m against $2bn valuation Its single-card configuration provides up to 256GB of off-chip capacity memory and 2GB of performance memory at 150Tbps.
SI010 SiliconANGLE Chip startup d-Matrix raises $275M to speed up inference with in-memory compute
SI011 Bloomberg AI Energy Crisis Boosts Need for More Efficient Chips AI requires massive amounts of data ... a process that burns through lots of electricity.
SI012 VentureBeat New Microsoft partnership accelerates generative AI development Project Bonsai is a platform ... and the early results are very encouraging.
SI013 VentureBeat Nvidia GPU shortage is top gossip of Silicon Valley When people use the word GPU shortage, they're really talking about a shortage of ... some component on the board, not the GPU itself.
SI014 CNN AI chips supply chain bottlenecks The shortage could force companies to find creative ways around the problem.
SI015 CNBC ChatGPT and generative AI are booming, but the costs can be extraordinary At its peak in 2021, Latitude was spending nearly $200,000 a month on OpenAI's generative AI software and Amazon Web Services.
SI016 TechTarget Infrastructure to support OpenAI's ChatGPT could be costly Users have to build up not just compute, but their networking and power management.
SI017 The Washington Post The hidden cost of ChatGPT is all the computing power AI chatbots have a problem: They lose money on every chat.
SI018 GigaIO GigaIO Partners with d-Matrix to Deliver Ultra-Efficient Scale-Up AI Inference Platform Our single-node server eliminates complex multi-node configurations and simplifies deployment, enabling enterprises to quickly adapt to evolving AI workloads while significantly improving their TCO and operational efficiency.
SI019 Gimlet Labs Low-Latency Inference with Speculative Decoding on d-Matrix Corsair and GPU Compared to the same speculative decoder on GPU and equivalent energy consumption, we've found that the Corsair-based solution delivers 2-5X end-to-end request speedup ... and up to 10X ... for energy-optimized configurations.
SI020 AIMultiple AI Hardware Cloud GPU list prices for the same model can differ several times over from one provider to another.
SI021 d-Matrix d-Matrix Ecosystem Not actual customers.
SI022 d-Matrix d-Matrix Announces SquadRack Combined with Supermicro's AI servers, Arista's ethernet switches, and Broadcom's PCIe and ethernet switch chips, we're delivering an AI inference rack that speeds up time to deployment.
SI023 d-Matrix d-Matrix Announces JetStream I/O Accelerators JetStream NICs are full-height PCIe Gen5 cards delivering a maximum 400Gpbs bandwidth. Samples are available now, with full production expected by year-end.
SI024 d-Matrix d-Matrix Emerges from Stealth with Strong AI Performance and Efficiency d-Matrix uses a hybrid approach to memory ... using SRAM as Performance Memory and a larger DRAM store for Capacity Memory.
SI025 d-Matrix d-Matrix and Alchip Announce Collaboration on World's First 3D DRAM Solution The collaboration has already enabled a key technology, d-Matrix 3DIMC, that is featured on d-Matrix Pavehawk test silicon and has been successfully validated in d-Matrix's labs.
SI026 d-Matrix Gimlet Cloud to Deploy d-Matrix Corsair Alongside GPUs The companies plan to make their combined solution available to select customers through Gimlet Cloud in 2H 2026.
SI027 d-Matrix Why We Created a 3D DRAM Solution to Advance Low-Latency AI Inference Our next task is meeting the incredible demand required by emerging AI workloads with high user expectations, and that starts with Pavehawk.
SI028 d-Matrix How Speculative Decoding Supercharged AI Inference in Disaggregated Pipelines Rather than reducing the batch size on a GPU and wasting compute to generate a low-latency experience, it reduces the total number of sequential inferences the powerful model handles.
SI029 d-Matrix Democratizing AI Through Hardware-Software Codesign for LLM Inference The team looks at the associated software design of modern systems including collective communication algorithms and the distributed inference serving stack.
SI030 NVIDIA via Stocklight NVIDIA Corporation Annual Report 2026 These Data Center systems are extreme co-designed with the GPU, CPU, NVLink switch, DPU, NIC, and scale-out networking along with software stacks and algorithms to deliver data center-scale computing solutions.
SI031 GFM Review The New Wave of AI Chips: Can Cerebras, d-Matrix, and Groq Take on Nvidia? These new entrants face several hurdles, including the need for substantial capital investment to scale production, challenges in building brand recognition, and the difficulty of competing against Nvidia's established ecosystem.
SI032 d-Matrix Transforming AI: d-Matrix's Pivotal Moments in Pursuit of GenAI Inference at Scale After coming within two weeks of running out of cash, we had an a-ha moment ... why don't we go ask for $40 million and show a big vision to match the size of the opportunity.
SE001 d-Matrix d-Matrix homepage
SE002 d-Matrix Technology
SE003 d-Matrix Product
SE004 d-Matrix Ecosystem
SE005 d-Matrix SquadRack announcement Corsair delivers the compute-memory acceleration, while JetStream delivers I/O acceleration.
SE006 d-Matrix JetStream announcement JetStream NICs are full-height PCIe Gen5 cards delivering a maximum 400Gpbs bandwidth.
SE007 d-Matrix Corsair launch announcement Each Corsair card is powered by DIMC compute cores with 2400 TFLOPs of 8-bit peak compute, 2 GB of integrated Performance Memory, and up to 256 GB of off-chip Capacity Memory.
SE008 d-Matrix d-Matrix and Alchip announce collaboration on 3D DRAM solution 3DIMC will commercially debut on the d-Matrix Raptor inference accelerator, the successor to d-Matrix Corsair.
SE009 d-Matrix d-Matrix Corsair Redefines Performance and Efficiency for AI Inference at Scale d-Matrix Aviator is an enterprise-grade software stack co-designed with d-Matrix hardware.
SE010 d-Matrix JetStream product brief Maximum bandwidth 400 Gbps
SE011 d-Matrix Going vertical: Why we created a 3D DRAM solution to advance low-latency AI inference
SE012 d-Matrix The power of the middle lane: why a hybridized approach to memory gives the best of both worlds
SE013 d-Matrix Scaling AI the right way: introducing our rack-level inference solution
SE014 d-Matrix Why we needed a new transparent NIC solution
SE015 d-Matrix Why we decoupled execution to accelerate I/O
SE016 d-Matrix Why modern AI workloads demand a disaggregated approach
SE017 d-Matrix How speculative decoding supercharged AI inference in disaggregated pipelines
SE018 d-Matrix Batching just right: how interactive apps serve as a new battleground
SE019 d-Matrix Impact of the DeepSeek moment on inference compute
SE020 d-Matrix Keyformer: KV cache reduction through attention sparsification for efficient generative inference
SE021 d-Matrix Introducing dmx.compressor
SE022 d-Matrix Democratizing AI through hardware-software codesign for LLM inference
SE023 Data Center Dynamics Chip startup d-Matrix raises $275m against $2bn valuation
SE024 EE News Europe Second generation chiplet platform targets generative AI
SE025 HPCwire d-Matrix launches new chiplet connectivity platform to address growing compute demand for generative AI
SE026 GigaIO GigaIO partners with d-Matrix to deliver ultra-efficient scale-up AI inference platform
SE027 Gimlet Labs Low-Latency Inference with Speculative Decoding on d-Matrix Corsair and GPU Compared to the same speculative decoder on GPU and equivalent energy consumption, we've found that the Corsair-based solution delivers 2-5X end-to-end request speedup.
SE028 NVIDIA NVIDIA H100
SE029 NVIDIA NVIDIA H200
SE030 NVIDIA NVIDIA GB200 NVL72
SE031 AIMultiple AI Hardware
SE032 Intuition Labs Private LLM Inference: Key Hardware and Integrators for Enterprise
SE033 Forbes AI Inference Is King: Do You Know Which Chip Is Best?
SE034 Silicon Analysts NVIDIA AI accelerator market share 2024-2026
SE035 d-Matrix GigaIO partners with d-Matrix to deliver ultra-efficient scale-up AI inference platform
SE036 d-Matrix AI is a context problem
SE037 d-Matrix The fight for latency: why agents have changed the game
SE038 d-Matrix Open standards are the path to the next AI breakthrough
SE039 d-Matrix Why optimizing every layer of AI workloads from software to infrastructure is now critical as apps take off
SE040 Alchip d-Matrix and Alchip announce collaboration on world's first 3D DRAM solution to supercharge AI inference
SE041 d-Matrix The complete recipe to unlock AI reasoning at enterprise scale
SU001 d-Matrix AI Computing Solutions for Cloud & Enterprise Markets *Not actual customers.
SU002 d-Matrix d-Matrix Raises $275 Million to Power the Age of AI Inference
SU003 d-Matrix SquadRack
SU004 d-Matrix JetStream
SU005 d-Matrix d-Matrix launches Corsair for AI inference without GPUs, HBM
SU006 d-Matrix d-Matrix Unveils Corsair, the World's Most Efficient AI Computing Platform for Inference in Datacenters
SU007 d-Matrix GigaIO Partners with d-Matrix to Deliver Ultra-Efficient Scale-Up AI Inference Platform
SU008 d-Matrix Gimlet Cloud, built for running agentic AI inference, to deploy d-Matrix Corsair low latency, memory-optimized accelerators alongside GPUs
SU009 d-Matrix Scaling AI the Right Way: Introducing Our Rack-Level Inference Solution
SU010 d-Matrix Why We Needed a New Transparent NIC Solution
SU011 d-Matrix How Speculative Decoding Supercharged AI Inference in Disaggregated Pipelines
SU012 d-Matrix Impact of the DeepSeek Moment on Inference Compute
SU013 d-Matrix Transforming AI: d-Matrix's Pivotal Moments in Pursuit of Gen AI Inference at Scale
SU014 Bullhound Capital Bullhound Capital leads $275M investment into AI inference leader d-Matrix
SU015 QIA QIA joins d-Matrix's Series C USD 275m funding round
SU016 Data Center Dynamics Chip startup d-Matrix raises $275m against $2bn valuation
SU017 SiliconANGLE Chip startup d-Matrix raises $275M to speed inference with memory-centric compute
SU018 TechStartups AI chip startup d-Matrix raises $275M at $2B valuation to challenge Nvidia's grip on AI inference
SU019 Converge Digest d-Matrix Raises $275 Million to Accelerate the Age of AI Inference
SU020 GigaIO GigaIO Partners with d-Matrix to Deliver Ultra-Efficient Scale-Up AI Inference Platform
SU021 Gimlet Labs Low-Latency Inference with Speculative Decoding on d-Matrix Corsair and GPU
SU022 AIMultiple AI hardware
SU023 IntuitionLabs Private LLM Inference: Key Hardware and Integrators for Enterprise
SU024 Forbes AI Inference Is King. Do You Know Which Chip Is Best?
SU025 GFM Review The New Wave of AI Chips: Can Cerebras, d-Matrix and Groq Take on Nvidia?
SU026 d-Matrix d-Matrix and Gimlet Join Forces in Strategic Partnership
SU027 Data Center Knowledge Gimlet Labs, d-Matrix Partner to Boost Agentic AI Inference
SU028 eeNews Europe d-Matrix launches Corsair for AI inference without GPUs, HBM
SU029 AIwire GigaIO Partners with d-Matrix to Deliver Ultra-Efficient Scale-Up AI Inference Platform
SU030 PR Newswire d-Matrix Announces SquadRack, Industry's First Rack-Scale Solution Purpose-Built for AI Inference at Datacenter Scale
SU031 d-Matrix Request Access or Buy
SU032 d-Matrix The fight for latency: why agents have changed the game
SU033 AMD AMD Instinct MI350 Series GPUs AMD Instinct™ MI350 Series GPUs help enable frictionless adoption with drop-in compatibility, while the AMD GPU Operator simplifies deployment and workload configuration in Kubernetes.
SU034 Groq GroqCloud The AI inference platform built for developers. Fast responses, scalable performance, and costs you can plan for. Available in public, private, or co-cloud instances.
SU035 Cerebras Chip The world’s largest and most powerful processor for AI training and inference.
SR001 d-Matrix SquadRack announcement
SR002 d-Matrix JetStream announcement
SR003 d-Matrix d-Matrix and Alchip announce collaboration on 3D DRAM solution
SR004 d-Matrix Going vertical: Why we created a 3D DRAM solution to advance low-latency AI inference
SR005 d-Matrix Why We Needed a New Transparent NIC Solution
SR006 d-Matrix Why We Decoupled Execution to Accelerate I/O
SR007 d-Matrix Why Modern AI Workloads Demand a Disaggregated Approach
SR008 d-Matrix Impact of the DeepSeek Moment on Inference Compute
SR009 Data Center Dynamics Chip startup d-Matrix raises $275m against $2bn valuation
SR010 SiliconANGLE Chip startup d-Matrix raises $275M to speed up inference with in-memory compute
SR011 Bloomberg AI Energy Crisis Boosts Need for More Efficient Chips
SR012 eeNews Europe Second-generation chiplet platform targets generative AI
SR013 VentureBeat Nvidia GPU shortage is top gossip of Silicon Valley
SR014 CNN AI chips supply chain bottlenecks
SR015 TechTarget Infrastructure to support OpenAI’s ChatGPT could be costly
SR016 The Washington Post ChatGPT’s hidden cost is GPU compute
SR017 NVIDIA NVIDIA H200
SR018 NVIDIA NVIDIA GB200 NVL72
SR019 AMD AMD Instinct MI300X Accelerators
SR020 Groq Groq Products
SR021 Cerebras Cerebras Product Chip
SR022 Intuition Labs LLM Inference Hardware Enterprise Guide
SR023 Silicon Analysts NVIDIA AI Accelerator Market Share 2024-2026
SR024 d-Matrix GigaIO partners with d-Matrix to deliver ultra-efficient scale-up AI inference platform
SR025 d-Matrix d-Matrix and Gimlet Labs accelerate agentic AI inference
SR026 d-Matrix Transforming AI: d-Matrix’s pivotal moments in pursuit of Gen AI inference at scale
SR027 d-Matrix Privacy Policy - d-Matrix
SR028 d-Matrix Terms of Use - d-Matrix
SR029 Bureau of Industry and Security Commerce Strengthens Restrictions on Advanced Computing Semiconductors, Semiconductor Manufacturing Equipment, and Supercomputing Items to Countries of Concern
SR030 d-Matrix Join d-Matrix | Careers in AI Innovation and Technology
SR031 d-Matrix d-Matrix and Gimlet Join Forces in Strategic Partnership - d-Matrix
SR032 Reuters AI chip startup d-Matrix raises $110 mln with backing from Microsoft
SR033 Supermicro Supermicro expands AI solutions and adds d-Matrix inference and rack-scale systems
SR034 Alchip d-Matrix and Alchip announce collaboration on world’s first 3D DRAM solution to supercharge AI inference
SR035 NVIDIA NVIDIA AI Enterprise
SR036 d-Matrix Using what’s on hand: spare data center space is an untapped gold mine - d-Matrix
SV001 d-Matrix d-Matrix raises $275 million to power the age of AI inference d-Matrix has closed $275 million in Series C funding, valuing the company at $2 billion and bringing the total raised to date to $450 million.
SV002 Bullhound Capital Bullhound Capital leads $275M investment into AI inference leader d-Matrix
SV003 Qatar Investment Authority QIA joins d-Matrix’s Series C USD 275m funding round
SV004 Data Center Dynamics Chip startup d-Matrix raises $275m against $2bn valuation It brings the total raised by d-Matrix to $450 million, which the company said it will use to advance its roadmap, accelerate global expansion, and support multiple large-scale deployments of its data center inference platform.
SV005 SiliconANGLE Chip startup d-Matrix raises $275M to speed up inference with in-memory compute
SV006 TechStartups AI chip startup d-Matrix raises $275M at $2B valuation to challenge Nvidia’s grip on AI inference
SV007 Converge Digest d-Matrix raises $275 million to accelerate the age of AI inference
SV008 d-Matrix d-Matrix announces $110 million in Series B funding
SV009 d-Matrix d-Matrix emerges from stealth with strong AI performance and efficiency
SV010 d-Matrix Transforming AI: d-Matrix’s pivotal moments in pursuit of GenAI inference at scale
SV011 d-Matrix GigaIO partners with d-Matrix to deliver ultra-efficient scale-up AI inference platform
SV012 d-Matrix Gimlet Cloud to deploy d-Matrix Corsair accelerators alongside GPUs
SV013 GigaIO GigaIO partners with d-Matrix to deliver ultra-efficient scale-up AI inference platform
SV014 Gimlet Labs Low-Latency Inference with Speculative Decoding on d-Matrix Corsair and GPU By offloading the speculative decoder to d-Matrix Corsair, we achieved 2-10X interactivity improvements over a homogeneous speculative decoding setup for the same energy efficiency.
SV015 Bloomberg AI Energy Crisis Boosts Need for More Efficient Chips
SV016 CRN 10 Hottest Semiconductor Startups of 2023
SV017 Silicon Republic Start-ups to Watch in 2024
SV018 NVIDIA NVIDIA H100 Tensor Core GPU
SV019 NVIDIA NVIDIA H200 Tensor Core GPU
SV020 NVIDIA NVIDIA GB200 NVL72
SV021 AMD AMD Instinct MI300X
SV022 Groq Groq products
SV023 Cerebras The Future of AI is Wafer Scale
SV024 AIMultiple Top 25+ AI Chip Makers: NVIDIA & Its Competitors
SV025 Forbes AI Inference Is King: Do You Know Which Chip Is Best?
SV026 Silicon Analysts NVIDIA AI accelerator market share 2024-2026
SV027 Intuition Labs Private LLM Inference: Key Hardware and Integrators for Enterprise
SV028 Yahoo Finance / GlobeNewswire AI inference - Company Evaluation Report, 2025 However, concerns around data security and supply chain disruptions remain persistent challenges for companies operating in the AI inference space.
SV029 Securities and Exchange Commission NVIDIA Corporation Form 10-K for fiscal year ended January 25, 2026
SV030 Securities and Exchange Commission Advanced Micro Devices, Inc. Form 10-K for fiscal year ended December 27, 2025
SV031 Securities and Exchange Commission Advanced Micro Devices, Inc. Form 10-Q for quarter ended March 28, 2026
SV032 Securities and Exchange Commission NVIDIA Corporation Form 10-Q for quarter ended April 26, 2026
SV033 Securities and Exchange Commission NVIDIA SEC submissions JSON
SV034 Securities and Exchange Commission AMD SEC submissions JSON
SV035 Securities and Exchange Commission NVIDIA SEC companyfacts JSON
SV036 Securities and Exchange Commission AMD SEC companyfacts JSON