Diligence report AI inference infrastructure / custom silicon late-stage private 2026-05-26

d-Matrix

Full-stack inference hardware company with credible product progress but incomplete proof at a $2B private mark

d-Matrix has a differentiated inference architecture and credible commercialization momentum, but the $2B Series C still gets ahead of public proof on revenue, customer depth, and durable deployment economics.

Cover facts

Latest disclosed valuation 01

2000 USD M [CO015, CV001]

Total disclosed funding 02

450 USD M [CO015, CV001]

Founded 03

2019 [CO001]

Headquarters 04

Santa Clara, California [CO002]

Product stack 05

Corsair / JetStream / Aviator [CO010]

Latest disclosed headcount 06

250+ employees [CO018]

Company profile

d-Matrix is a Santa Clara-based AI inference infrastructure company founded in 2019 by Sid Sheth and Sudeep Bhoja. The company now presents a concrete full-stack product story around Corsair accelerator cards, JetStream networking, Aviator software, and open-standards rack deployment for hyperscale, enterprise, and sovereign datacenters. Public financing history is well supported, culminating in a $275 million Series C at a $2 billion valuation and $450 million of total disclosed funding, with 250+ employees and a multi-office footprint by late 2025. What remains missing is the operating denominator: public sources still do not disclose revenue, ARR, broad customer count, or control economics in enough detail to underwrite the business like a mature late-stage infrastructure company.

Website: www.d-matrix.ai
Founded: 2019-01-01
Founders: Sid Sheth, Sudeep Bhoja
Headquarters: Santa Clara, California
Product: Corsair accelerator cards, JetStream transparent networking cards, Aviator software, and the SquadRack reference architecture together form a low-latency inference stack designed for standard PCIe servers, Ethernet fabrics, and brownfield datacenter deployment.
Customers: Hyperscale, enterprise, sovereign-cloud, and specialist AI-cloud operators running latency-sensitive inference workloads.
Business model: Hardware-plus-software platform sales routed through OEM, integrator, and cloud partners, with economics likely tied to multi-component deployments rather than standalone self-serve software.
Stage: Series C private company
Funding status: $110M Series B in 2023 followed by a $275M Series C in November 2025 at a $2B valuation, bringing total disclosed funding to $450M.

[CO001, CO002, CO004, CO008, CO010, CO015, CO018, CO019]

Executive summary

Top strengths

Concrete full-stack productization: Corsair, JetStream, Aviator, and SquadRack make d-Matrix more than an architecture concept and give buyers a testable deployment surface.
Differentiated latency thesis: the memory-centric DIMC design and open-standards deployment model target a real inference bottleneck in brownfield datacenters rather than requiring a full rack rebuild.
Strong capital and ecosystem support: $450M disclosed funding and visible backing from Temasek, Bullhound, Triatomic, QIA, EDBI, and M12 reduce near-term existential financing risk.
Early commercialization proof is real enough to matter: Supermicro channel availability, GigaIO scale-up integration, and Gimlet's heterogeneous-cloud work show a credible path from sampling to production.

Top risks

No public revenue, ARR, gross margin, pricing, or customer-count disclosure supports the current $2B mark, so valuation underwriting is denominator-blind.
Execution depends on synchronized maturity across Corsair, JetStream, Aviator, partner-qualified systems, and the pre-commercial 3DIMC/Raptor roadmap.
Supply-chain and ecosystem dependence on advanced packaging, foundry partners, and OEM/integration partners creates real qualification, cost, and timing risk.
Customer proof remains partner-led and pilot-like: public evidence still lacks a named production-customer roster, retention metrics, or concentration data.
Security, compliance, and export-control posture are thinner publicly than many enterprise or sovereign buyers may require.

Open gaps

Audited revenue or ARR, gross margin, pricing, cash runway, and realized unit economics remain undisclosed.
Named production deployments, active customer count, repeat-order cadence, retention, and concentration metrics are still missing from public sources.
Cap-table terms, liquidation preferences, secondary or 409A marks, and board-control economics are not public.
Production yield, cost curve, and delivery timing for JetStream ramp and the 3DIMC-to-Raptor roadmap remain under-evidenced.
Public security/compliance artifacts such as SOC 2, ISO 27001, trust-center documentation, and concrete export-control exposure analysis remain incomplete.

Chapter 01

01Company Overview

1.1 Identity, Footprint, and Business Model

d-Matrix should be treated as a private, Series C AI infrastructure company built specifically around inference rather than training. The official home page, about page, and 2025 Series C release all anchor the company in Santa Clara and frame its business as selling accelerated compute for hyperscale, enterprise, and sovereign data-center workloads. Public materials say the company was founded in 2019, and the current product stack combines Corsair inference accelerators, JetStream networking, and Aviator software. The home page further says the 3DIMC architecture uses a chiplet-based, PCIe-friendly design that can scale to models above 100 billion parameters. That makes the reusable ground truth for later chapters straightforward: d-Matrix is monetizing hardware-plus-software infrastructure for low-latency inference. What the public record does not yet provide is a companywide customer count or revenue figure that would let diligence treat current commercial scale as fully quantified.[CO001, CO002, CO003, CO008, CO009, CO010]

Snapshot KPI table
Metric	Value / status	Date	Confidence	Gap / note
Founded	2019	2019	high	Official and investor materials align on 2019 as the founding year.
Headquarters	Santa Clara, California	2025-11-12	high	Series C and Corsair materials keep Santa Clara as the canonical HQ.
Public office footprint	Toronto; Sydney; Bangalore; Belgrade	2025-11-12	high	These are the latest named non-HQ offices in the fetched pack.
Stage	Private Series C company	2025-11-12	high	Stage call is supported by the November 2025 financing and no public-listing evidence.
Core products	Corsair accelerators; JetStream networking; Aviator software	2025-11-18	high	By late 2025 the company publicly described a three-part hardware and software stack.
Total disclosed funding	$450M	2025-11-12	high	Official, investor, and independent coverage all corroborate the total.
Latest disclosed valuation	$2B	2025-11-12	high	Valuation is tied specifically to the Series C close.
Latest disclosed headcount	250+ worldwide	2025-11-12	medium	Publicly disclosed in Series C key facts, but not independently refreshed in 2026.
Public commercialization proof	GigaIO partnership plus Gimlet Labs benchmark	2026-03	medium	Named proof comes from partner and workload announcements rather than a broad customer roster.
Current customer count	Not publicly disclosed	2026-05-26	low	Request booked-customer, deployment, and repeat-order counts rather than inferring from partner rhetoric.
Current revenue or ARR	Not publicly disclosed	2026-05-26	low	Request monthly recurring or run-rate revenue directly from finance materials.

This table separates hard public identity and financing facts from material metrics that remain unavailable in the fetched pack, especially customer count and revenue or ARR.

[CO001, CO002, CO003, CO009, CO010, CO015]

FO002: Company snapshot logic

Founders and capital feed a full-stack inference platform, but commercialization evidence still trails the fundraising narrative.

[CO004, CO008, CO010, CO015, CO025, CO026]

1.2 Founders, Leadership, and Governance

The public leadership picture is unusually clear at the founder, executive, and board-name level but still incomplete on control economics. The about page identifies Sid Sheth as founder and CEO and Sudeep Bhoja as founder and CTO, while public investor and product releases continue to quote the same pair through 2025, suggesting stable technical and commercial leadership. The company also exposes an operating bench across software, product, finance, legal, and manufacturing. Governance visibility is more mixed. d-Matrix publicly names a board slate tied to Bullhound Capital, Playground Global, Triatomic Capital, M12, Nautilus Venture Partners, and Temasek, which aligns well with the financing history and gives later chapters a usable map of who likely influences strategy. But the fetched pack does not disclose board committees, voting thresholds, protective provisions, or current ownership percentages, so governance analysis should treat the board roster as real but incomplete rather than assume standard venture rights.[CO004, CO005, CO006, CO007, CO042, CO046]

Leadership and founder table
Person	Public role	Background or functional scope	Why it matters	Key-person or governance note
Sid Sheth	Founder and Chief Executive Officer	Public commercial and strategy voice across Series B, Corsair, and Series C releases	Central owner of fundraising story, go-to-market posture, and inference-first thesis	High key-person dependence because he anchors both investor messaging and external strategy
Sudeep Bhoja	Founder and Chief Technology Officer	Co-founder tied to architecture and compute-memory roadmap	Core technical steward for the platform and successor products beyond Corsair	Technical continuity is strong, but roadmap execution remains concentrated
Peter Buckingham	Senior Vice President, Software	Visible software executive on the about page	Shows the company is building software capability rather than only silicon	Public sources do not show deeper software-org breadth
Sree Ganesan	Vice President, Product	Public product leader on the about page	Supports functional coverage for productization and roadmap packaging	No public product P&L or commercial ownership details disclosed
PJ Jamkhandi	Vice President, Finance & Accounting	Named finance executive on the about page	Important for diligence because public fundraising outpaces financial disclosure	No public CFO-level KPI package appears in the fetched pack
Richard Ogawa	General Counsel	Named legal executive on the about page	Relevant because financing and IP-heavy partnerships rely on legal execution	Board committee and legal-risk oversight structure are not disclosed publicly
Jerry Qubain	Vice President, Manufacturing, Operations	Visible operations leader on the about page	Important in a semiconductor business that must scale manufacturing and delivery	Manufacturing footprint and supplier dependencies remain mostly undisclosed
Per Roman	Board member via Bullhound Capital	Investor-affiliated director disclosed on the about page and echoed by Series C investor materials	Illustrates that new capital providers have visible governance influence	Public materials do not disclose committee assignments or vote thresholds
Michael Stewart	Board member via M12	Investor-linked director and recurring public supporter in company releases	Connects long-running Microsoft adjacency with governance visibility	Investor rights and ownership economics behind the seat are not public
Russell Tham	Board member via Temasek	Director tied to the Series B lead investor on the public board slate	Signals continuity from Series B capital into current governance	No fetched source discloses whether Temasek holds protective provisions

Coverage is intentionally partial because the public record exposes named founders, executives, and directors more clearly than committee structures, ownership percentages, or internal reporting lines.

[CO004, CO005, CO006, CO007, CO042, CO046]

1.3 Capital Formation, Stage, and Stakeholders

Capital formation is the strongest externally corroborated part of the company overview. VentureBeat reported a $44 million Series A in 2022 around the Microsoft Project Bonsai relationship. The official 2023 release then disclosed a $110 million Series B led by Temasek, and the official 2025 Series C release, Bullhound statement, QIA statement, and independent press all agree that d-Matrix raised $275 million at a $2 billion valuation, bringing total disclosed funding to $450 million. The same Series C materials also provide the clearest current scale markers in the fetched pack: 250+ employees worldwide, Santa Clara headquarters, and offices in Toronto, Sydney, Bangalore, and Belgrade. That combination supports a practical stage call of late-stage private infrastructure startup rather than early research project. Still, fundraising visibility exceeds operating disclosure. The company says the new capital will support global expansion and large-scale deployments, but no fetched source discloses current revenue, ARR, or a broad named-customer list, so those metrics should remain explicit diligence gaps.[CO011, CO012, CO013, CO014, CO015, CO016]

Stakeholder or investor map
Stakeholder	Role	Public signal	Control or economic importance	Diligence ask
Bullhound Capital	Series C co-lead and board-affiliated investor	Led the 2025 round and is represented on the public board by Per Roman	New-money sponsor with visible governance presence in the latest round	Clarify board seat terms, economics, and liquidation preferences
Triatomic Capital	Series C co-lead	Named as a 2025 co-lead and represented on the public board by Jeff Huber	Important signaling investor in the current capital stack	Confirm check size and any information or consent rights
Temasek	Series B lead and Series C co-lead	Backed the 2023 Series B, reappeared in the 2025 Series C, and is represented by Russell Tham on the board slate	Most visible repeat backer across the financing arc	Request current ownership percentage and any special governance rights
Qatar Investment Authority	New Series C investor	QIA publicly announced participation in the 2025 round	Adds sovereign-capital credibility and possible long-duration support	Clarify whether the position is purely financial or strategic
M12	Repeat investor and board-linked Microsoft fund	Visible in Series B and Series C materials and represented by Michael Stewart on the board slate	Sustains a long-running Microsoft adjacency with board-level visibility	Clarify any commercial or cloud-channel expectations beyond venture ownership
Playground Global	Early investor with public board visibility	Quoted in Series B materials and represented on the board by Sasha Ostojic	Likely important to the early technical and strategic formation story	Confirm current stake after later financings
Nautilus Venture Partners	Board-linked repeat investor	Named on the board slate and in later investor rosters	Provides continuity between earlier rounds and current governance	Clarify economics, board voting rights, and follow-on participation
GigaIO	Deployment and infrastructure partner	May 2025 partnership integrated Corsair with SuperNODE for enterprise-scale inference	Most visible system-level commercialization proof before the 2026 acquisition title on the newsroom index	Request booked deployments, pipeline value, and any exclusivity terms
Alchip	Roadmap and packaging partner	November 2025 collaboration targets 3D DRAM and the Raptor successor platform	Strategically important to the next product generation rather than current revenue visibility	Clarify commercialization timetable, manufacturing dependencies, and co-development obligations
Gimlet Labs	Public workload-validation partner	March 2026 benchmark used Corsair in a heterogeneous deployment model	Provides concrete external workload evidence even if it is not a broad customer roster	Request whether the relationship has recurring production spend or only evaluation status

This is not a full cap table. It combines investors and strategic commercialization counterparties because both groups shape the company’s control story and near-term execution risk.

[CO015, CO016, CO020, CO026, CO027, CO028]

FO003: Snapshot KPIs

Publicly supportable KPIs concentrate in funding, footprint, and product capability rather than revenue or customer count.

Current customer count and revenue or ARR are intentionally excluded because the fetched pack does not disclose them.

[CO003, CO009, CO015, CO018, CO022, CO024]

1.4 Milestones, Commercial Proof, and Pressure Points

The milestone record shows a company moving from architecture thesis to productization, partner integration, and early public workload proof. The chronology starts with 2019 founding, a 2022 Series A and Microsoft-linked tooling relationship, and the 2023 Series B that funded commercialization. Corsair was unveiled in November 2024 with early-access sampling and a Q2 2025 broad-availability target. d-Matrix then expanded the stack with a May 2025 GigaIO partnership, a September 2025 JetStream launch, the October 2025 SquadRack reference architecture, the November 2025 Alchip roadmap announcement, and a March 2026 Gimlet Labs benchmark that claimed 2–10x latency benefits in a heterogeneous setup. The same record also exposes pressure points. Public materials still rely heavily on vendor claims and partner-led commercialization proof, while independent coverage emphasizes that startups like d-Matrix must overcome Nvidia's installed ecosystem, scale manufacturing, and prove durable customer adoption. That makes the chapter's biggest remaining gaps commercial disclosure and control visibility, not identity or capitalization.[CO021, CO022, CO023, CO024, CO025, CO026]

Milestone table
Date	Event	Type	Amount or status	Participants	Implication
2019	Company founded	founding	Formation	Sid Sheth; Sudeep Bhoja	Sets the start date and explains why later materials call the firm six years old in 2025
2022-04	Series A and Microsoft Bonsai relationship publicized	financing	$44M Series A; Microsoft tooling support	d-Matrix; VentureBeat; Microsoft Project Bonsai	Shows early ecosystem access before Corsair was public
2023-09-06	Series B closed	financing	$110M led by Temasek	d-Matrix; Temasek; Playground; M12; SK Hynix; Marvell; Entrada	Funds commercialization and expands strategic-capital base
2024-11-19	Corsair unveiled	product	First public inference platform launch	d-Matrix; M12; OEM and integrator collaborators	Moves the company from architecture thesis to shipping product story
2025-05-01	GigaIO partnership announced	partnership	SuperNODE integration for rack-scale inference	d-Matrix; GigaIO	Provides enterprise deployment proof and system-level distribution support
2025-09-08	JetStream launched	product	400Gbps Transparent NIC; production expected by year-end	d-Matrix	Expands d-Matrix into networking and makes the platform more complete
2025-10-14	SquadRack announced	scale	Open standards-based rack-scale reference architecture	d-Matrix; Arista; Broadcom; Supermicro	Shows ecosystem traction and a standards-based deployment strategy
2025-11-12	Series C closed	financing	$275M at $2B valuation; total raised $450M	d-Matrix; Bullhound Capital; Triatomic; Temasek; QIA; EDBI; M12	Confirms late-stage private status and funds global expansion
2025-11-18	Alchip collaboration announced	partnership	3DIMC and Raptor successor roadmap	d-Matrix; Alchip	Points to the product roadmap beyond Corsair
2026-03	Gimlet Labs benchmark published	partnership	2–10x latency improvement claim in heterogeneous setup	Gimlet Labs; d-Matrix	Provides public workload proof using Corsair in an external environment
2026-04	GigaIO data center business acquisition appeared on the newsroom index	scale	Post-Series C acquisition milestone	d-Matrix; GigaIO	Suggests continued system-level expansion after the 2025 financing

Dates are kept as visible in the fetched record. Month-only milestones remain month-level rather than being over-precision backfilled from memory or model assumptions.

[CO001, CO011, CO012, CO021, CO023, CO024]

FO001: Company milestone timeline

The company moved from funding and architecture positioning into product launch, partner integration, and public workload validation between 2019 and 2026.

Month-only milestones use the first day of the stated month so the renderer can place them on a timeline without implying a more precise day than the fetched source supports.

[CO001, CO008, CO011, CO012, CO013, CO021]

1.5 Exhibits

Chapter 02

02Market Analysis

2.1 Market Boundary and the Shift from Training to Inference

The relevant market boundary for d-Matrix is not 'AI' in the abstract and not even all accelerator spending. Public company materials, technical docs, and launch messaging consistently place the company in datacenter inference: serving already-trained generative models with low latency, high memory bandwidth, and better cost and energy efficiency. Included spend therefore covers accelerator cards, inference servers and racks, interconnect and NIC layers required to scale serving, and the software stack that makes those systems deployable in enterprise or neocloud environments. Excluded spend includes model training clusters, hyperscaler-custom silicon that never becomes merchant hardware, pure API spend, and edge-device inference. This distinction matters because the technical bottlenecks change after training. d-Matrix and third-party developer evidence both emphasize that prefill is compute-bound while decode and agentic serving become memory- and latency-bound, which is exactly the part of the stack where generic GPU economics become weakest and where a specialist accelerator can plausibly wedge in.[CM001, CM002, CM003, CM004, CM005, CM006]

Market definition table
Segment / category	Included spend	Excluded spend	Buyer / payer	Relevance to d-Matrix
Broad generative AI spend	Model software, services, cloud consumption, infrastructure, and adjacent tooling	No clean separation between training, inference, software, and services	CIO / CTO / line-of-business budgets	Too broad for underwriting d-Matrix
AI accelerator market	Merchant accelerator hardware for AI workloads across training and inference	General CPU spend and most software/services	Clouds, enterprises, model labs, OEMs	Useful hardware outer bound, but still broader than d-Matrix
Datacenter generative inference infrastructure	Inference cards, servers, racks, memory, interconnect, and serving software for deployed LLMs	Model training clusters, edge AI devices, unrelated HPC	Infra teams, platform teams, AI product operators	Closest direct category
Open-standards low-latency inference systems	PCIe and Ethernet based inference deployments using merchant accelerators and partner servers	Captive hyperscaler custom silicon and bespoke internal fabrics	Enterprises, neoclouds, OEM/integrator channels	Most relevant d-Matrix wedge
Status-quo substitute stack	Nvidia/AMD GPU servers, Groq-style inference clouds, and external API inference	Specialist merchant accelerators not yet adopted	Existing cloud or platform owners	Primary displacement pool

Boundary logic narrows from very broad AI spend to the smaller merchant datacenter inference layer that d-Matrix actually addresses. The fourth row is the operative underwriting boundary; rows above it are context, not SAM.

[CM001, CM002, CM003, CM004, CM005, CM007]

2.2 Demand Drivers and Structural Bottlenecks

Public evidence strongly supports the category thesis that inference demand is outrunning the infrastructure originally built for training. CNN, VentureBeat, and TechTarget all describe GPU scarcity, datacenter expansion pressure, and the need for more energy and networking capacity to keep up with AI demand. d-Matrix’s own architecture narrative is directionally consistent with that independent evidence: memory movement burns power, host-mediated networking adds latency, and batch size creates a direct trade-off between user experience and hardware ROI. Reasoning and agentic workflows amplify those pressures because they increase inference-time compute without letting operators hide latency behind a simple throughput metric. That is why the market is shifting from a training-first conversation to an operations-and-serving conversation. The constraint stack is not just raw FLOPS; it is power, memory bandwidth, cluster communication overhead, and the economics of delivering a fast answer at scale without leaving too much expensive hardware idle.[CM008, CM009, CM010, CM011, CM012, CM013]

Growth drivers and constraints table
Factor	Direction	Timing	Implication for d-Matrix	Diligence ask
Reasoning and agentic workloads expand inference-time compute	Driver	Current and accelerating	Helps the category thesis because more serving work becomes latency-sensitive and memory-bound	Quantify how much of pipeline demand is reasoning / agentic versus conventional chat
GPU scarcity and datacenter build pressure	Driver	Current	Creates openness to alternatives and heterogeneous serving architectures	Request actual buyer stories where lack of GPU supply pulled d-Matrix into evaluation
Memory-movement power tax	Driver	Structural	Strengthens d-Matrix's DIMC / SRAM narrative versus separate-compute HBM stacks	Benchmark power per request on representative memory-bound workloads
Host-mediated multi-node networking overhead	Driver	Structural at scale	Supports JetStream and transparent-NIC positioning for rack-level inference	Verify production latency and jitter under real multi-node traffic
Batch-size economics in interactive apps	Driver	Current	Favors hardware that keeps utilization high at small batches instead of optimizing only peak throughput	Get customer evidence on acceptable latency thresholds and revenue sensitivity
Nvidia software moat and benchmark leadership	Constraint	Current	Limits d-Matrix adoption outside very specific workload wedges	Test software maturity, tooling friction, and time-to-deploy versus GPU stacks
Captive hyperscaler custom silicon growth	Constraint	Current and rising	Shrinks the merchant share of the broad inference market available to startups	Estimate how much target demand is already committed to internal silicon or reserved GPU supply
Need for proof of production TCO, not just vendor claims	Constraint	Immediate	Independent workload-level evidence is still thinner than the market narrative	Request signed production deployments, reference architectures, and customer ROI evidence

This table mixes structural drivers and structural constraints because the same forces that expand inference demand also tighten the proof burden on a startup supplier. Every row is meant as an underwriting question, not as a claim that demand converts cleanly into revenue.

[CM008, CM009, CM010, CM011, CM012, CM013]

2.3 Buyer Segments, Budget Owners, and Adoption Path

The buyer map is narrower than generic 'enterprise AI' language implies. d-Matrix’s most realistic early segments are enterprises or private-cloud operators with latency-sensitive inference workloads, neocloud and managed-inference providers, AI product companies trying to control serving economics, and OEM or integrator partners who package the cards into systems. Hyperscalers still matter strategically, but they are structurally harder to win because they already control large GPU estates and are increasingly building captive silicon. d-Matrix’s open PCIe and Ethernet positioning, plus its air-cooled rack narrative, are more naturally aligned with customers who want to slot inference into existing datacenter footprints without redesigning facilities around liquid-cooled megaclusters. The adoption path is correspondingly infrastructure-heavy: identify a memory-bound or latency-sensitive workload, validate fit in a pilot or single-node deployment, then expand to server- or rack-level production once networking, model partitioning, and operating cost are proven. Integrators and OEMs matter because buyers are purchasing a deployable system, not just a chip.[CM016, CM017, CM018, CM019, CM020, CM021]

Segment / buyer map
Segment	Buyer	User	Payer	Workflow	Budget owner	Adoption trigger
Enterprise private cloud / regulated operators	Infrastructure or platform engineering team	ML platform, application, and operations teams	Corporate IT / infra budget	Internal copilots, support agents, private retrieval and reasoning	CTO / VP Infrastructure / Head of AI platform	Need for low latency, data control, and predictable serving cost
Neocloud or managed inference provider	Cloud product and capacity planning teams	External developer customers	Cloud capex and service margin	Hosted model serving and inference API products	GM cloud / VP product / finance	Margin pressure from GPU serving and desire for differentiated latency
AI product company / model API operator	Applied AI or platform team	End users of coding, support, or agent products	Product P&L	Interactive, memory-bound inference workloads	CPO / CTO / infra lead	Rapid usage spikes and loss-making inference economics
OEM / integrator channel	Server OEM or system integrator	Solution architects and deployment engineers	End-customer project budget	Packaging cards, servers, networking, and support into deployable systems	Business unit lead / solution practice owner	Customer demand for turnkey deployments
Hyperscaler / captive silicon buyer	Cloud infrastructure organization	Internal model and platform teams	Massive datacenter capex	Foundation-model and cloud inference at hyperscale	SVP infrastructure / custom silicon org	Usually harder for d-Matrix to win because incumbent GPU and internal-silicon options are strong

The first four rows are realistic merchant-facing targets. The hyperscaler row is included because it matters strategically, but public evidence suggests it is the least directly accessible segment for a startup merchant accelerator.

[CM016, CM017, CM018, CM019, CM020, CM021]

FM003: Buyer / segment map

Realistic buying segments are mapped to the user profile, payer model, budget owner, and the trigger most likely to move a d-Matrix-style deployment into production.

[CM018, CM019, CM020, CM021, CM041, CM042]

FM004: Adoption funnel or value-chain map

Merchant inference adoption narrows from workload pain to pilot proof to production rollout, with incumbent software and capacity alternatives acting as a constant friction node.

The flow is qualitative. It reflects the infrastructure buying path implied by the source pack rather than a published conversion funnel, with the risk node showing where incumbents and API abstraction can arrest adoption before full rollout.

[CM018, CM020, CM021, CM037, CM043, CM044]

2.4 Sizing Lenses and What Is Actually Addressable

The public market numbers are large, but they are not directly additive and they do not all describe the same commercial surface. Gartner’s generative-AI spending forecast is the broadest adjacency and includes software, services, and infrastructure. MarketsandMarkets’ AI inference estimate is narrower, but still broader than merchant silicon alone. Silicon Analysts’ AI accelerator market is narrower again on the hardware axis but still includes training and multiple incumbent or captive architectures. The most decision-relevant lower-bound lens in the source pack is not even a TAM report at all: CNBC’s estimate that Bing AI alone needed at least $4 billion of infrastructure. That deployment-cost lens captures why the category exists. For d-Matrix, the accessible SAM is smaller than all of these figures because it excludes training, excludes much of captive hyperscaler custom silicon, and excludes buyers who are content to consume inference through external APIs. Public data is sufficient to show a large category, but not sufficient to publish a clean d-Matrix SOM without management disclosures.[CM022, CM023, CM024, CM025, CM026, CM027]

TAM / SAM / SOM or sizing lens table
Publisher / lens	Year	Geography	Value / indicator	Growth	Methodology lens	Confidence	Limitation
Gartner via Forbes	2025	Global	$644B generative AI spending	76.4% YoY	Very broad spend forecast across hardware, software, and services	low	Far broader than inference hardware or merchant accelerators
MarketsandMarkets via Forbes	2025-2030	Global	$106.15B to $254.98B AI inference market	Implied strong multi-year growth	Inference-market forecast cited in 2025 market commentary	low	Includes more than merchant datacenter hardware
Silicon Analysts	2025E-2026E	Global	$160B to $200B+ total AI accelerator market	Fast growth with share broadening	Revenue view of accelerator hardware market	low	Mixes training and inference and includes incumbent/captive exposure
CNBC infrastructure lens	2023	Global / Bing reference case	At least $4B infrastructure needed to serve Bing AI users	N/A	Deployment-cost lens rather than TAM	medium	Single-service example, not market size
d-Matrix product architecture	2025-2026	Rack / datacenter	Up to 100B models in performance mode; 1T+ in capacity mode	Scales with cards, servers, and racks	Technical deployment envelope for a merchant inference platform	medium	Capability statement, not revenue TAM
Analyst synthesis	2026	Global merchant slice	No clean public standalone SAM or SOM	N/A	Evidence-constrained conclusion from the source pack	medium	Requires management disclosures on deployments, cards per cluster, and buyer mix

These lenses are intentionally non-additive. The first three are market definitions with different denominators; the fourth and fifth are deployment economics / capability lenses; the last row records the still-missing public SAM/SOM rather than inventing one.

[CM022, CM023, CM024, CM025, CM026, CM027]

FM001: Market sizing lens

Layered lenses narrow from broad generative-AI spend to the much smaller deployment-cost and merchant-hardware wedge that is actually relevant to d-Matrix.

The layers are not a nested dataset. They represent progressively narrower commercial lenses in the same $B unit: broad spend, accelerator hardware, inference market, and a concrete large-deployment cost example. The purpose is to force boundary discipline, not to imply the values sum or cascade mechanically.

[CM022, CM023, CM024, CM025, CM026, CM040]

FM002: Market estimate range

Public $B lenses vary sharply depending on whether the denominator is broad GenAI spend, inference market revenue, accelerator hardware, or a single large deployment cost example.

All rows use the same $B unit, but they do not describe the same market surface. The midpoint for the AI inference row is arithmetic rather than publisher-provided and is included only to visualize dispersion between the 2025 and 2030 endpoints.

[CM022, CM023, CM024, CM025, CM026, CM045]

2.5 Contradictory Benchmark and Market-Share Narratives

The chapter’s core contradiction is that the inference opportunity is clearly expanding while the public evidence for share capture still overwhelmingly favors incumbents. d-Matrix’s own materials and a partner-backed Gimlet workload suggest real performance advantages on memory-bound or heterogeneous inference stages, especially where small-batch latency matters. But standardized benchmark coverage, mainstream GPU product pages, and market-share estimates still show Nvidia setting the visible frontier in most public comparisons. Even the market-share numbers disagree with one another because their denominators change: one source talks about AI chips, another about discrete GPUs, and another about total accelerators including custom silicon. That does not make the startup thesis wrong; it means the thesis must be much more specific. Investors should underwrite d-Matrix as a wedge into merchant, latency-sensitive, open-standards inference deployments where buyers care about predictable TCO and deployment fit—not as a generic claim that it will displace the GPU stack across all inference workloads.[CM028, CM029, CM030, CM031, CM032, CM033]

2.6 Exhibits

Chapter 03

03Competitors

3.1 Competitive Landscape Overview

d-Matrix sits inside a crowded but still-fluid AI inference market. The most important direct alternatives are not one homogeneous peer set: NVIDIA and AMD sell general-purpose accelerator stacks that can run both training and inference; Groq, Cerebras, and SambaNova pursue inference-first or tightly integrated alternatives; and the status quo for many buyers is still internal build on incumbent GPU clouds or private racks. Independent market coverage keeps circling the same structural pressures: inference workloads are growing faster than buyers can absorb power, cooling, memory, and GPU procurement costs, and that is why challengers such as d-Matrix get attention in the first place. d-Matrix's own pitch is that Corsair cards, JetStream NICs, and Aviator software attack the memory-movement penalty that makes large-model inference expensive. The company therefore competes less as a generic GPU replacement than as a purpose-built answer to the memory-bound, latency-sensitive slice of inference demand.[CP001, CP002, CP025, CP026, CP029, CP032]

Competitor Profile Table
Competitor	Category	Scale / Funding	Target Segment	Key Differentiation	Primary Limitation
d-Matrix	Inference-only challenger	$450M total raised per 2025 coverage	Enterprise and datacenter LLM inference	Digital in-memory compute; Corsair cards + JetStream NICs + Aviator software; standard-server insertion point	Public benchmark proof and production customer depth remain limited
NVIDIA H100/H200/GB200	Incumbent GPU stack	Dominant enterprise and cloud installed base; ~75-92% share estimates depending source and year	Training plus inference from enterprise to hyperscale	CUDA/TensorRT/NIM, NVLink, InfiniBand, OEM and cloud reach	High capex, power and cooling burden; buyers can still be overprovisioned for inference-only use cases
AMD MI300X	Mainstream GPU alternative	Scaled through major OEM server channels	Hyperscale and enterprise inference needing more memory per GPU	192GB HBM3, 5.3TB/s bandwidth, ROCm software, same-server procurement motion as GPUs	Weaker software incumbency than CUDA and less end-to-end platform mindshare
Groq	Inference service challenger	Private company; public, private and co-cloud plans plus on-prem option	Interactive low-latency inference, regulated and air-gapped workloads	LPU architecture, predictable spend, GroqRack on-prem continuity	Different compiler/runtime path than standard GPU stacks
Cerebras	Integrated large-model system	Large private AI chip company with wafer-scale platform	Very large model training and inference	Wafer-scale engine, enormous on-device compute, system-level memory strategy	Specialized appliance model is less drop-in than standard racks
SambaNova	Integrated enterprise platform	$350M+ 2026 round per independent guide	Agentic AI and enterprise inference deployments	Hardware, software, and models tuned together; SN50 positioned against Blackwell on price-performance	Platform adoption requires buying into a proprietary full stack
Internal build / custom silicon	Substitute / status quo alternative	Hyperscaler and large-enterprise capex budgets rather than vendor funding	Largest buyers with data-governance, latency, or utilization reasons to own the stack	Can optimize around proprietary workloads and avoid vendor dependence	Requires huge engineering, procurement, and operations capability

Coverage is partial and focused on the most decision-relevant options for 2026 enterprise inference buying: incumbent GPU stacks, inference-first challengers, and internal-build substitutes. Scale fields use public funding, share, or deployment context only where disclosed.

[CP001, CP013, CP014, CP018, CP021, CP023]

FP001: Competitive Positioning Map

Ordinal map of ecosystem leverage (x-axis) versus inference-specific specialization (y-axis) across d-Matrix and the most material alternatives.

Axes are ordinal estimates derived from public evidence, not a vendor-published numeric scale. X-axis measures ecosystem, channel, and deployment leverage; Y-axis measures how specialized each offer is for inference economics or latency-sensitive serving.

[CP018, CP021, CP023, CP029, CP032, CP040]

3.2 Incumbent GPU Stacks: NVIDIA and AMD

NVIDIA remains the benchmark d-Matrix must beat in practice, even when the buyer says it wants "better inference economics" rather than "the fastest raw cluster." H100, H200, and GB200 span the full ladder from single-GPU deployment to rack-scale NVLink domains, and NVIDIA layers software, networking, and enterprise support on top of the silicon. H100 anchors the installed base with NVLink, InfiniBand, Magnum IO, and AI Enterprise; H200 extends the moat by pushing memory capacity and bandwidth higher for large-model inference; GB200 raises the competitive bar again by collapsing 72 GPUs into a liquid-cooled rack-scale system that NVIDIA markets as a single massive inference machine. AMD is the clearest mainstream alternative because MI300X offers more memory per GPU, high bandwidth, and ROCm rather than CUDA. But AMD still competes inside the same OEM and datacenter procurement motion as NVIDIA, which means it pressures incumbent economics without fully changing how the buyer evaluates or deploys inference infrastructure.[CP006, CP007, CP008, CP009, CP010, CP011]

Feature / Capability Matrix
Buying Criterion	d-Matrix	NVIDIA	AMD	Groq	Cerebras	SambaNova
Small-batch interactive latency focus	High	Medium	Medium	High	Medium	Medium
Large-model memory footprint per unit / system	Medium	High	High	Medium	High	High
Mature software ecosystem and framework portability	Emerging	Very high	Moderate	Moderate	Moderate	Moderate
Rack-scale networking and OEM availability	Emerging	Very high	High	Moderate	Moderate	Moderate
Air-gapped / on-prem deployment path	Partner-led	High	High	High	High	High
Pricing transparency / usage-based packaging	Low	Low	Low	Medium	Low	Low
Training-ecosystem reuse for inference buyers	Low	Very high	High	Low	Low	Low

Ratings are author assessments synthesized from official product pages and independent market analysis. They summarize buyer-relevant fit rather than a standardized benchmark score, and "Emerging" or "Low" often reflects public-proof gaps rather than proven technical absence.

[CP013, CP014, CP017, CP018, CP020, CP022]

3.3 Specialized Inference Challengers

The non-GPU challenger field is fragmented, which matters because d-Matrix is not competing against a single "inference chip" archetype. Groq packages low latency as a service with public, private, co-cloud, and on-prem options; it is the cleanest contrast to d-Matrix on deployment model and spend predictability. Cerebras goes in the opposite direction, using a wafer-scale engine and very large system memory so customers can keep far more of a model local to a specialized appliance. SambaNova bundles hardware, software, and tuned models into an integrated platform and positions that bundle directly against NVIDIA Blackwell on price-performance. Independent landscape surveys therefore frame the market as a set of different answers to the same question: should buyers solve inference with broad GPU ecosystems, with service-led low-latency specialists, with giant integrated systems, or with memory-local cards that fit more easily into standard enterprise servers? d-Matrix has a real seat at that table, but it does not yet own the default category narrative.[CP018, CP019, CP020, CP021, CP022, CP023]

Pricing / Packaging Comparison
Vendor / Option	Price / Unit / Contract Model	Included Capabilities	Discount / Unknowns	Competitive Implication
d-Matrix	No public list price; company-quoted outcome is 3x lower cost versus GPU-based systems	Corsair accelerators, JetStream NICs, Aviator software, standard PCIe/server deployment	Realized pricing, utilization assumptions, and support terms are undisclosed	Strong if validated in production, but public TCO proof is still thin
NVIDIA H100 / H200 stack	Hardware plus enterprise-software and OEM/cloud contracts; public list pricing generally opaque	CUDA, TensorRT, Triton, AI Enterprise/NIM, NVLink, InfiniBand, broad OEM availability	Bulk discounts and cloud pass-through pricing are opaque	Default procurement path for buyers who value ecosystem certainty over inference-specific specialization
NVIDIA GB200 NVL72	Rack-scale system sale rather than commodity card purchase	72-GPU NVLink domain, liquid cooling, Mission Control, AI-factory tooling	Public transaction pricing not disclosed; facility upgrades can be material	Raises the performance bar but also narrows the buyer pool to very large deployments
AMD MI300X	Sold through OEM server programs and cloud contracts rather than public street pricing	192GB HBM3, ROCm stack, OAM module packaging, OEM server availability	Net pricing and support bundles are not public	Competes by improving memory economics inside familiar GPU buying motions
Groq	Free, Developer, and Enterprise plans; usage-based and custom enterprise contracts	Public/private/co-cloud access, regional endpoints, prompt caching, optional GroqRack on-prem	Per-model enterprise economics are negotiated and not fully public	Most transparent service-style packaging among the challengers
Cerebras / SambaNova	System-level enterprise contracts; pricing not publicly normalized	Integrated hardware plus software stack, tuned large-model serving, enterprise support	Actual token economics and appliance pricing are mostly private	Compete on turnkey integrated value rather than open component comparison
Internal build on incumbent GPU cloud	Historically can mean very high monthly bills plus datacenter upgrades or huge GPU reservations	Maximum workflow control and reuse of incumbent ML tooling	True cost depends on model size, traffic, utilization, and reserved capacity	The substitute remains expensive, but it is familiar and therefore sticky

Public pricing is sparse across inference hardware, so this table emphasizes packaging, contract form, and what is visibly bundled. Where a row cites cost impact, it distinguishes vendor-authored claims from independent examples of GPU-era operating expense.

[CP001, CP018, CP019, CP027, CP028, CP035]

FP002: Feature Breadth / Capability Map

Buyer-fit view of where each vendor class is strongest for common inference workload requirements.

Cells summarize best-fit orientation from public materials and independent analysis; they should be read as comparative buyer-fit, not benchmarked performance scores.

[CP020, CP022, CP024, CP029, CP030, CP031]

3.4 Switching Costs, Distribution, and Internal Build

The central competitive question is not whether d-Matrix can make a strong silicon argument; it is whether it can overcome the software, networking, and channel inertia that keeps enterprises on incumbent platforms. NVIDIA's advantage is deeply systemic: buyers already know how to buy GPU clusters, cool them, connect them, program them with CUDA and TensorRT, and source them through familiar OEM and cloud channels. AMD weakens that moat at the margin with ROCm and more memory per GPU, but it still lives in the same rack and server ecosystem. Groq, Cerebras, and SambaNova attack the problem by changing the packaging model—API-led service, giant appliance, or fully integrated platform. d-Matrix sits between those camps. Its modular chiplet and card-based approach could lower insertion friction, but the public record still lacks the customer references, price sheets, and benchmark normalization that would prove buyers are willing to switch, multi-home, or standardize on it at production scale. Internal build is also a real substitute as hyperscalers and large enterprises increasingly design or co-design inference silicon around their own workloads.[CP027, CP028, CP030, CP033, CP034, CP035]

3.5 Moat Durability and Adverse Evidence

d-Matrix has a credible but still-provisional moat. The strongest pillar is technical fit: multiple sources tie inference cost and power pain to memory movement, and d-Matrix's digital-in-memory-compute plus chiplet interconnect strategy is explicitly built around that bottleneck. A second pillar is modularity; Jayhawk, cards, NICs, and software imply a heterogeneous deployment story that could be easier to insert than a proprietary super-appliance. But adverse evidence is meaningful. NVIDIA still owns the most mature benchmark record, software stack, and distribution footprint. AMD keeps narrowing the memory argument. Groq has the cleanest public packaging for predictable low-latency service. Cerebras and SambaNova sell more tightly integrated systems for very large models. Most importantly, d-Matrix's public performance and cost claims remain largely vendor-authored rather than normalized by an independent, apples-to-apples benchmark suite. That leaves moat durability at "moderate": differentiated enough to matter, but not yet durable enough to ignore incumbent or integrated-system counterattack risk.[CP037, CP038, CP039, CP040]

Moat Durability / Competitive Risk Register
Moat Claim	Threat	Severity	Mitigation / What Would Help	Diligence Ask
Digital in-memory compute better matches inference’s memory bottleneck	H200/GB200/next-gen GPU memory and bandwidth gains narrow the efficiency gap	High	Independent same-model energy and latency benchmarks versus H100/H200/MI300X	What public benchmark shows d-Matrix’s claimed gain under the same model, batch, and latency target?
Modular chiplet plus card/NIC architecture lowers deployment friction	Buyers may still prefer proven NVLink/NIM/OEM stacks or turnkey challenger appliances	High	More named OEM systems and production customer references	Which server OEMs ship production-ready d-Matrix systems with supported software images today?
Inference-only focus gives d-Matrix a sharper product thesis than general GPUs	Training-plus-inference platforms still win procurement because they are multipurpose	Medium	Proof that buyers separate inference budgets from general AI infrastructure budgets	How often do enterprises buy d-Matrix specifically for inference instead of extending existing GPU clusters?
Claimed lower cost and energy improve ROI for scale inference	Claims remain largely vendor-authored and may not survive apples-to-apples normalization	High	Third-party benchmark or public customer bill-of-materials evidence	What utilization, model size, and software assumptions sit behind the 3x-lower-cost claim?
Heterogeneous-system strategy could fit regulated or private deployments	Groq already has clearer cloud-to-on-prem packaging, while Cerebras/SambaNova sell more integrated systems	Medium	Show repeatable on-prem reference architecture and support model	Does d-Matrix have a public deployment path as crisp as GroqRack or a turnkey appliance rival?
Inference market growth creates room for challengers	Internal build and custom silicon can absorb the highest-value workloads before d-Matrix does	Medium	Focus on customers lacking hyperscaler-scale design budgets	Which customer segments are least likely to replace third-party inference hardware with internal silicon?

Severity is a qualitative judgment based on the public record available on 2026-05-26. This is a durability register, not a probability forecast, so it prioritizes the threats most likely to compress d-Matrix’s differentiation if left unanswered.

[CP028, CP033, CP035, CP037, CP038, CP039]

FP003: Moat / Readiness KPIs

Compact snapshot of the key numbers shaping d-Matrix’s competitive durability versus incumbent and challenger alternatives.

[CP001, CP003, CP008, CP014, CP032]

3.6 Exhibits

Chapter 04

04Financials

4.1 Revenue Model and Monetization Surfaces

The public record points to a hardware-platform revenue model rather than a software-only or API-only business. D-Matrix repeatedly sells the story as a full-stack inference platform built from Corsair accelerators, JetStream networking, and Aviator software, with commercialization routed through hyperscale, enterprise, sovereign, OEM, and system-integrator channels. That matters because revenue quality will likely depend on hardware shipments and multi-component deployment wins, not self-serve software adoption. The GTM evidence is also distinctly qualification-heavy. Series B was framed as the round that would let the company begin commercializing Corsair, while the 2024 and 2025 product announcements still leaned on sampling, early-access customers, OEM qualification, and partner-configured systems. SquadRack, SuperNODE, and the Gimlet cloud integration all imply larger contract values than a single card sale could support, but they also imply shared economics across server, networking, and cloud partners. Crucially, d-Matrix does not publish Corsair list pricing, JetStream pricing, software license pricing, or a booked-customer count. Even the ecosystem page warns that partner logos are illustrative and not actual customers. So the monetization surface is real and widening, but public evidence still shows channel formation and pilot conversion more clearly than realized revenue.[CI001, CI002, CI003, CI004, CI005, CI006]

Revenue streams table
Stream	Mechanism	Buyer / route	Current public status	Revenue quality	Diligence ask
Corsair accelerators	Sale of PCIe accelerator cards and dual-card configurations	Enterprises, hyperscalers, sovereign clouds, OEMs, system integrators	Sampling in late 2024; broad availability targeted for Q2 2025	Likely high-ACV hardware revenue, but no public ASP, volume, or gross margin	Request shipped units, ASPs, gross margin, and pilot-to-production conversion
JetStream NICs	Sale of dedicated networking cards that scale Corsair clusters	Sold with multi-node deployments and rack solutions	Samples available in Sep 2025; full production targeted by year-end 2025	Potential attach revenue, but attach rate and pricing are undisclosed	Request NIC attach rate, BOM, and realized pricing by deployment size
Aviator software	Inference software bundled with hardware and deployment tools	Enterprise and OEM deployment teams	Repeatedly marketed as part of the full-stack platform; no standalone price disclosed	Could create recurring software revenue, but public evidence only supports bundle-level monetization	Request license model, maintenance terms, and renewal or support revenue
Integrated rack and server solutions	Partner-configured systems such as SquadRack and GigaIO SuperNODE	OEM and infrastructure channel partners	Reference architectures and joint solutions are public; booked revenue is not	Higher contract value is plausible, but economics may be shared with server and networking partners	Request who invoices the customer, system ASP, and partner revenue split
Heterogeneous cloud access	Select-customer access to Corsair through Gimlet Cloud alongside GPUs	AI-native model providers and cloud buyers	Targeted for select customers in 2H 2026	Could become usage-based proof, but current evidence is still early-access and benchmark oriented	Request contract structure, revenue share, and pilot-to-production conversion rates

Rows are restricted to monetization surfaces that are explicitly visible in public product, partner, and deployment materials. No public source in the pack discloses realized pricing, revenue mix, or contract duration.

[CI001, CI006, CI007, CI008, CI009, CI010]

Pricing / monetization table
Offer	Public list price	Public performance / cost claim	Realized pricing visibility	Monetization implication	Source status
Corsair card hardware	Not publicly disclosed	Up to 10x faster interactive speed and 3x better cost-performance versus GPUs	None	Pricing power cannot be tested without a price sheet or customer deal data	Official product claims exist; public pricing does not
JetStream networking	Not publicly disclosed	400 Gbps NIC plus 3x cost-performance claim when paired with Corsair	None	Multi-node attach may lift ACV, but networking BOM also rises	Spec sheet exists; price sheet does not
Aviator software	Not publicly disclosed	Enterprise-grade software stack that is part of the platform story	None	Recurring software upside may exist, but standalone licensing is not publicly visible	Bundled marketing only
GigaIO SuperNODE + Corsair	Not publicly disclosed	64+ Corsairs in a single node with lower deployment complexity and better TCO	None	Implies system-level contract potential, but no public partner pricing or revenue split	Partner promotional
Gimlet Cloud access	Not publicly disclosed	2x-10x latency improvement at equal energy efficiency in heterogeneous inference benchmark	None	Could create usage-based revenue, but customer pricing and margin remain unannounced	Select-customer early access only

This table intentionally separates availability and performance language from actual monetization. No reviewed source provides a public Corsair, JetStream, software, or integrated-system list price.

[CI008, CI017, CI019, CI024, CI036]

FI001: Revenue model bridge

Public evidence suggests a qualification-led revenue chain that starts with workload fit, moves through partner-enabled hardware deployment, and only then expands into software and larger system revenue.

The sequence is inferred from public commercialization language rather than management-reported conversion data. Pricing, attach rates, and contract duration remain undisclosed.

[CI001, CI006, CI007, CI008, CI010, CI011]

4.2 Unit Economics and Cost Drivers

D-Matrix’s unit-economics case is easy to describe but still only partially public. Official materials consistently claim up to 10x faster interactive speed, 3x better cost performance, and 3x to 5x better energy efficiency than GPU-based alternatives, while third-party Gimlet testing adds a more specific workload proxy: speculative decode offload to Corsair produced 2x to 10x lower end-to-end request latency at the same energy efficiency. That is meaningful evidence that the product can improve customer economics on memory-bound inference stages. But it is not the same as proving d-Matrix’s own gross margin or payback profile. The white paper and JetStream brief explicitly label cost and power figures as preliminary, and both documents reveal why system economics cannot be reduced to chip benchmarks alone. JetStream adds a dedicated 400 Gbps NIC, optics or DACs, power draw, switch ports, and rack-integration work. The Corsair server and rack specs imply larger average deal sizes, but they also imply server, networking, and support costs that sit outside the accelerator die itself. External market proxies reinforce why this matters: AI inference can consume extraordinary operating budgets, while GPU infrastructure remains expensive, power hungry, and supply constrained. D-Matrix may address those economics, but public proof still lives at the benchmark and architecture layer, not the P&L layer.[CI013, CI014, CI015, CI016, CI017, CI018]

Unit economics table
Metric / proxy	Public value / status	Confidence	Why it matters	Diligence ask
Claimed cost-performance improvement	3x better performance / TCO versus GPU-based alternatives	medium	This is the core ROI claim for enterprise buyers	Reproduce on customer workloads with full system BOM
Claimed energy efficiency improvement	3x to 5x better energy efficiency versus GPU-based alternatives	medium	Power and cooling are central to inference economics	Compare rack-level power including switches, optics, and idle overhead
Claimed interactive speed improvement	Up to 10x faster interactive speed; 30K tokens/s at 2 ms/token for Llama 70B	medium	Supports the thesis that latency-sensitive workloads justify specialized hardware	Request benchmark methodology, prompt mix, and concurrency assumptions
JetStream networking overhead	400 Gbps NIC, PCIe Gen5 x16, max 150W TDP, supplemental power input	high	Networking is part of the cost stack, not a free add-on	Request node-level power budget and transceiver cost
Server and rack density proxy	8-card server up to 2 TB capacity memory; 8-node rack for 100B models; GigaIO says 64+ Corsairs in a node	medium	Potential deal size is large, but deployment complexity and rack cost rise too	Request typical cards-per-deal, node count, and installed rack configuration
Third-party heterogeneous benchmark proxy	Gimlet reports 2x-10x lower latency at equal energy efficiency for speculative decode offload; 1.6B draft model fits on 2 cards	medium	Best current public proof that specialized offload can improve real workload economics	Request production throughput, utilization, and win-rate data by workload

The table mixes vendor claims, published specifications, and third-party benchmark proxies. It should not be read as d-Matrix reported gross margin, CAC, or payback data.

[CI014, CI015, CI016, CI017, CI018, CI019]

FI002: Unit economics bridge

D-Matrix’s public ROI case is built around moving memory-bound inference stages onto specialized hardware while keeping the rest of the system compatible with standard racks and networks.

This figure translates public architecture and benchmark evidence into a commercial logic chain. It is not a reported cost model and should not be read as a gross-margin bridge.

[CI017, CI018, CI024, CI025, CI033, CI050]

FI003: Financial estimate range

Fixed disclosed values and public benchmark ranges frame the financial discussion: capital is known, but most operating metrics remain undisclosed, so the usable public ranges are on claimed efficiency rather than company P&L.

The first two rows are fixed disclosed values. The latter rows are benchmark or vendor-claim ranges, not audited financial metrics. No public range exists for d-Matrix revenue, burn, or gross margin.

[CI002, CI017, CI024]

4.3 Capital Intensity, Partner Dependence, and Financing Dependency

Capital formation is the clearest part of the financial picture. D-Matrix has publicly disclosed $450 million of total funding, including a $110 million Series B in 2023 and a $275 million Series C at a $2 billion valuation in 2025. Both rounds were explicitly tied to commercialization, roadmap execution, hiring, global expansion, and large-scale deployments. That is positive because the company is not trying to scale a complex inference hardware stack on a shoestring. It is also a warning sign: the operating plan still appears funding-backed more than customer-funded. The roadmap itself is capital intensive. Corsair relies on chiplets, high-bandwidth memory integration, server qualification, and partner distribution. JetStream adds dedicated networking hardware. The next-generation 3DIMC roadmap adds an explicit dependency on Alchip for ASIC design, advanced packaging, and manufacturing management. Public comp filings show what that usually means in practice: inventory commitments, deposits, long lead times, board and memory costs, and margin pressure when demand timing shifts. D-Matrix may benefit from using partners rather than vertically owning every system component, but that simply trades some fixed cost for execution dependence on Supermicro, Arista, Broadcom, GigaIO, cloud partners, and advanced-packaging suppliers. Without public cash, burn, or runway, investors cannot tell whether the current balance sheet comfortably covers those scale-up requirements or merely bridges the company to its next financing event.[CI002, CI003, CI004, CI005, CI019, CI020]

Capital adequacy table
Item	Public value / status	Evidence	Underwriting read-through	Diligence ask
Total disclosed capital raised	$450M	Series C official release, investor releases, and independent coverage align	Strong capitalization for a private inference hardware company, but not proof of self-funding operations	Reconcile cap table, primary vs. secondary proceeds, and liquidation preferences
Latest round size / valuation	$275M Series C at $2B valuation	Official plus Bullhound, QIA, and news coverage	Shows continued investor access and valuation support in late 2025	Request post-money ownership, board rights, and preference stack
Series B bridge capital	$110M in 2023 to begin commercializing Corsair	Official Series B release	Confirms multiple years of pre-revenue or early-revenue buildout before the 2025 raise	Request spend bridge from Series B to Series C
Disclosed use of funds	Roadmap, hiring, global expansion, and large-scale deployments	Official Series B and Series C materials	Capital is earmarked for execution and scale-up rather than disclosed profitability	Request 24-month operating plan and deployment capex plan
Cash on hand / burn / runway	Not publicly disclosed	No figure in reviewed official, investor, or independent sources	Runway and next-round timing cannot be tested from public materials	Request current cash balance, monthly burn, and runway under base and downside scenarios
Debt / project finance / purchase commitments	Not publicly disclosed for d-Matrix; public hardware comps show these items can affect margins	D-Matrix disclosure gap plus Nvidia filing proxy	Hardware scaling may hide working-capital needs even if fixed capex is partner-assisted	Request supplier deposits, NRE, inventory commitments, warranties, and debt facilities

The disclosed funding stack is factual, but capital adequacy remains only partially visible because public sources do not provide cash, burn, runway, or supplier-commitment detail.

[CI002, CI003, CI004, CI005, CI039, CI040]

FI004: Capital intensity / cash-flow map

The main financial burden sits at the system and roadmap level: silicon and packaging, server and rack integration, networking, and customer validation all require capital or partner execution before recurring operating proof appears.

This is an evidence-backed qualitative matrix rather than a reported cash-flow statement. It highlights where capital need and execution dependence are visible in the source pack.

[CI011, CI014, CI020, CI022, CI041, CI042]

4.4 Financial Verdict and Disclosure Gaps

The right financial verdict is cautious but not dismissive. Public evidence supports a serious capital base, a coherent revenue mechanism, and credible workload-level reasons customers might buy the product. It does not support a traditional underwriting view on current revenue quality, gross margin, sales efficiency, or runway. D-Matrix has disclosed funding, valuation, commercialization milestones, partner architectures, and benchmark-style efficiency claims, but it has not publicly disclosed revenue, ARR, gross margin, customer count, repeat orders, list pricing, realized ASPs, software attach rates, cash balance, burn, runway, debt, or supplier commitments. That disclosure gap matters more here than it would in a pure software startup, because this business depends on hardware manufacturing, packaging, networking, server integration, and deployment timing. The resulting investment case is therefore forward-looking and financing-dependent. If management can show booked revenue, repeat deployments, margin by hardware and software layer, and a runway that covers the 3DIMC and deployment roadmap, the funding stack looks like a real launchpad. Without those disclosures, the round history mainly proves that the company can raise capital, not that it has already crossed the line into repeatable, self-sustaining economics.[CI036, CI037, CI038, CI039, CI040, CI041]

Public financial gaps table
Missing metric	What public evidence says	Why it matters	Exact diligence path
Revenue / ARR	No public figure found in official, investor, or independent sources	Impossible to anchor underwriting on current commercial scale or valuation multiple	Request monthly and annual revenue by product line, geography, and customer cohort
Gross margin / contribution margin	No public figure found	Cannot test whether hardware, networking, and software mix can support durable profitability	Request gross margin bridge by Corsair, JetStream, Aviator, and integrated systems
Realized ASP / discounts	No public Corsair or JetStream price sheet found	Cannot translate performance claims into deal economics or discount strategy	Request price book, discount bands, and pilot versus production ASPs
Software attach / recurring revenue	Aviator is marketed as part of the platform, but its licensing model is undisclosed	Recurring mix could materially change revenue quality and multiple	Request percentage of deals with paid software, annual contract value, and renewal terms
Customer count / repeat orders	Partners and benchmarks are visible, but broad production-customer evidence is not	Cannot judge demand depth, repeatability, or concentration risk	Request named production customers, repeat orders, deployed cards or nodes, and backlog
Cash / burn / runway / commitments	No public cash balance, burn, runway, debt, or supplier-commitment data found	Financing dependency and insolvency timing cannot be underwritten from the public record	Request cash waterfall, monthly burn, purchase commitments, and next-round trigger

This table is the main underwriting blocker list. The missing metrics are normal for a private company, but they are especially consequential in a hardware business with manufacturing, networking, and deployment dependencies.

[CI036, CI037, CI038, CI039, CI040, CI041]

4.5 Exhibits

Chapter 05

05Product & Technology

5.1 Product Surface and Workload Fit

d-Matrix is no longer just pitching an abstract chip architecture. The current customer-facing product surface is a full inference stack: Corsair accelerator cards, JetStream networking cards, Aviator software, and SquadRack as the rack-level reference blueprint. In customer workflow terms, the company is targeting interactive chat, coding, agentic, reasoning, and multimodal workloads where low latency matters more than brute-force benchmark throughput. The product page is unusually specific for a startup hardware vendor because it publishes single-card, dual-card, server, and rack configurations with concrete memory-bandwidth and capacity figures. That specificity is a real product-tech strength because it gives buyers and diligence teams something testable instead of a vague platform promise. The maturity caveat is that most public proof still sits in official briefs, partner announcements, and reference-architecture pages rather than in named production customer deployments. The architecture signal is strong; the customer-proof signal is still comparatively thin. Recent company essays also sharpen the workload thesis by arguing that longer context windows, reasoning chains, and agent loops turn inference into a context-and-latency problem, not just a raw-throughput contest.[CE001, CE002, CE003, CE004, CE005, CE036]

d-Matrix Product Module and Asset Matrix
Module / product line	Primary user	Status / maturity	Differentiation	Diligence gap
Corsair single/dual-card accelerators	Platform architects and enterprise AI infrastructure teams	Current flagship product with public card, server, and rack specs	DIMC plus on-chip SRAM in PCIe cards designed for low-latency decode and interactive inference	Named production customers and field reliability data are not public
JetStream transparent NIC	Operators scaling Corsair across nodes and racks	Current networking product with samples announced and detailed spec published	Device-initiated accelerator-to-accelerator transfers over standard Ethernet rather than host-mediated RDMA flow	Broad production proof and compatibility matrix are not public
Aviator software stack	ML engineers, infra engineers, and platform teams	Current software layer with documented build and execution flows	Co-designed stack spanning model factory, compression, compiler, runtime, Kubernetes plugin, and observability tools	Supported-model matrix, SLA posture, and customer support scope are not public
SquadRack reference architecture	Cloud, sovereign-cloud, and enterprise datacenter operators	Reference blueprint rather than a fully closed appliance	Air-cooled rack design using Corsair, JetStream, standard PCIe servers, and standard Ethernet fabrics	Reference architecture is not the same thing as broad GA deployment
Pavehawk test silicon and Raptor roadmap	Future buyers needing larger-memory inference footprints	Roadmap stage; public validation is architectural, not commercial	3DIMC roadmap adds 3D DRAM to extend the low-latency memory-centric architecture beyond Corsair	Independent benchmarks, production timing, package yield, and customer commitments are not public

This matrix separates what is currently productized from what is still roadmap. Status language reflects the public corpus as of 2026-05-26, not management aspiration.

[CE001, CE002, CE005, CE008, CE013, CE023]

FE001: d-Matrix Product Architecture Map

Layered view from target workload to software, networking, compute-memory hardware, and the future memory-scaling roadmap.

The layer boundaries are analytical rather than company-authored. The figure is meant to show how product, software, networking, and roadmap fit together as one platform.

[CE001, CE008, CE010, CE013, CE023, CE026]

5.2 DIMC, JetStream, and Aviator Architecture

The core technical thesis is that modern inference is dominated by memory movement, especially during decode, so d-Matrix optimized for memory-centric execution rather than trying to look like another GPU. Corsair pairs DIMC compute with large on-chip SRAM and capacity memory, then scales laterally through chiplets, DMX Link, and DMX Bridge. Aviator matters as much as the silicon because it handles model conversion, compression, compilation, runtime execution, cluster orchestration, and developer tooling. Recent technical essays extend that framing beyond the chip itself, arguing that context pressure, KV-cache reduction, and optimization across software, networking, and infrastructure are necessary complements to DIMC rather than side projects. JetStream extends the same low-latency philosophy to multi-node communication by separating data and control planes and enabling device-initiated accelerator-to-accelerator transfers over standard Ethernet. The most credible part of the technical story is that the hardware, networking, and software layers are described in enough detail to form a coherent operating model. The main diligence question is not whether there is an architecture, but how broadly that architecture has been validated outside d-Matrix and a small set of technical partners.[CE006, CE007, CE008, CE009, CE010, CE011]

Workflow and Use-Case Table
User job	Current workflow pain	d-Matrix solution	Measurable benefit	Limitation
Interactive chat or coding assistant	GPU-only decode can become latency-sensitive and queue-sensitive at small batches	Run low-latency decode on Corsair while keeping the deployment in standard PCIe servers	Company cites up to 60,000 tokens/s at 1 ms/token for Llama3 8B in a single server	This performance figure is company-reported and workload specific
70B-class on-prem inference service	Large models outgrow a single card and create rack-level memory and networking pressure	Use eight-server Corsair racks and JetStream-based node-to-node scaling	Company cites 30,000 tokens/s at 2 ms/token for Llama3 70B and 100B-parameter scale targets	Public proof is strongest in reference architectures and partner deployments, not in named customer rollouts
Heterogeneous pipeline with large verifier model	GPU-only systems must compromise between fast decode and high-quality large-model verification	Use Corsair for the memory-bound speculative draft stage and keep the larger verifier on GPU	Gimlet reports 2-5x request speedup at equal energy for one speculative-decoding setup	Independent validation is narrow and does not cover every inference phase
Enterprise retrofit into existing datacenter	New AI systems often require proprietary fabrics or heavy infrastructure changes	Deploy PCIe cards plus Ethernet switching in existing datacenter footprints	SquadRack is air-cooled and built around standard servers and top-of-rack Ethernet switches	Deployment quality still depends on third-party server and switch integration
Single-node scale-up before multi-node complexity	Distributed inference can add coordination overhead and operating complexity	Use GigaIO SuperNODE to host dozens of Corsair cards inside one node	Partner claims lower multi-node complexity and cites 30,000 tokens/s at 2 ms/token for Llama3 70B	Single-node partnership claims are partner-reported rather than broadly benchmarked

Benefits mix official claims and one independent practitioner benchmark. The table is about workflow fit, not a comprehensive benchmark bake-off.

[CE005, CE012, CE015, CE016, CE017, CE018]

Technology and Operating Architecture Table
Layer / process	Role	Key dependency	Key risk
DIMC cores with on-chip SRAM	Keep memory close to compute for low-latency token generation and interactive decode	Successful mapping of inference kernels and data movement into the DIMC memory hierarchy	Architecture is differentiated, but broad third-party workload validation is still limited
Capacity memory tier	Extend model and context footprint beyond on-chip SRAM without abandoning the memory-centric design	Aviator scheduling and runtime policies that decide what sits in fast versus capacity memory	Performance depends on workload fit and on how much data must still move between tiers
Chiplets plus DMX Link and DMX Bridge	Scale from one chiplet pool to dual-card and rack-level configurations	Packaging, interconnect yield, and card qualification across OEM systems	Chiplet and bridge complexity can become a packaging, signal-integrity, or qualification bottleneck
JetStream transparent NIC	Provide low-latency inter-node accelerator communication without host-mediated RDMA overhead	Standard Ethernet optics, switches, PCIe Gen5 hosts, and JetStream firmware	Production networking proof is newer than Corsair compute proof
Aviator software stack	Convert models, compress graphs, compile binaries, orchestrate distributed execution, and expose tooling	Model support, compiler maturity, runtime stability, and developer enablement	Software quality is central because hardware value is only realized through the stack
Rack and server ecosystem layer	Turn cards and NICs into deployable systems through Supermicro, GigaIO, Liqid, Arista, and Broadcom	Partner qualification, support handoffs, and standard-server supply	Open standards reduce lock-in but increase integration dependence
3DIMC on Pavehawk and Raptor	Push the architecture toward larger-memory inference using stacked DRAM	Alchip packaging and ASIC work, foundry capacity, and successful test-silicon translation into product silicon	Roadmap value is high, but commercial timing and external validation remain unproven

This table focuses on the operating stack needed to turn Corsair into a usable platform. Dependencies are called out explicitly because d-Matrix is not selling a closed appliance today.

[CE006, CE007, CE008, CE010, CE011, CE020]

FE002: Customer Workflow and Operating Flow

How a model moves from developer input into a heterogeneous d-Matrix deployment and then serves low-latency inference.

This flow synthesizes the published Aviator build and execution model with d-Matrix's own heterogeneous prefill/decode narrative. It is specific to the reviewed corpus, not an official product diagram.

[CE008, CE010, CE017, CE018, CE045, CE046]

5.3 Deployment, Rack Architecture, and Supply Dependencies

d-Matrix is trying to make deployment easier by leaning into standard PCIe servers, standard Ethernet fabrics, and ecosystem partners instead of demanding a proprietary rack rebuild. SquadRack and GigaIO SuperNODE show the intended server and rack topologies: scale-up inside the node where possible, then scale out over Ethernet when model size or request volume grows. That should lower retrofit friction for enterprise and sovereign-cloud environments that want AI inference without hyperscaler-style liquid cooling. The tradeoff is dependency. d-Matrix depends on Supermicro, Arista, Broadcom, GigaIO, and Liqid to turn cards into deployable systems, and the roadmap depends on Alchip, foundry capacity, and advanced packaging to move from Corsair to 3DIMC-based products. Recent company and partner posts make that interdependence explicit: d-Matrix now frames open standards as a strategic choice, its own GigaIO announcement folds single-node scale-up into the official deployment story, and Alchip independently confirms the shared packaging role behind 3DIMC. The older Nighthawk and Jayhawk disclosures show this dependency pattern is not new; packaging and interconnect have always been central to the company thesis. The product can therefore be differentiated and still execution-sensitive: the architecture is open-standards friendly, but the supply chain is not self-contained.[CE013, CE014, CE015, CE021, CE022, CE023]

Roadmap and Development-Stage Table
Date / stage	Milestone	Status	Implication	Source
2021 platform era	Nighthawk chiplet platform	Historical predecessor	Shows chiplets and scale-out were part of the design thesis before Corsair launched	EE News Europe / HPCwire
January 2023	Jayhawk chiplet platform announced	Historical public milestone	Indicates a packaging and interconnect roadmap on TSMC 6nm before the commercial Corsair launch	EE News Europe / HPCwire
November 2024	Corsair launched with Q2 2025 broad-availability target	Current-generation product milestone	Marks the shift from platform thesis to shipping compute product with concrete card specs	d-Matrix launch / whitepaper
May 2025	GigaIO SuperNODE scale-up partnership	Partner deployment milestone	Shows single-node scale-up path for dozens of Corsair cards before broader rack-level rollouts	GigaIO / Data Center Dynamics
September to October 2025	JetStream launch and SquadRack blueprint	Current-generation expansion	Turns d-Matrix into a compute-plus-networking-plus-rack story rather than a card-only story	d-Matrix JetStream / SquadRack
November 2025	3DIMC, Pavehawk validation, and Raptor roadmap	Roadmap stage	Extends the company thesis toward larger-memory products but remains pre-commercial in this corpus	d-Matrix Alchip announcement / Going vertical

The timeline mixes historical architecture milestones, current product milestones, and future roadmap steps. The key diligence distinction is current commercial product versus future memory-scaling roadmap.

[CE021, CE022, CE023, CE024, CE027, CE041]

FE003: Critical Dependency Map

The deployable d-Matrix platform depends on synchronized progress across silicon, packaging, software, networking, and ecosystem partners.

The dependency graph is analytical. It highlights why d-Matrix can have a differentiated architecture and still carry meaningful execution sensitivity across partners and packaging.

[CE026, CE027, CE028, CE029, CE030, CE034]

5.4 Roadmap, Differentiation, and Trust Gaps

The differentiation case is clear enough to describe: d-Matrix is attacking low-latency decode and inference-time memory pressure with a memory-centric architecture, open-standards rack design, and a future hybrid SRAM-plus-3D-DRAM roadmap. That stands in contrast to the incumbent GPU direction, which emphasizes HBM capacity, proprietary NVLink domains, and increasingly liquid-cooled rack systems. Recent company framing around context growth, agent latency, open standards, and enterprise-scale reasoning is directionally consistent with that thesis, but it still functions more as strategic argument than as independent proof. The weakness is not conceptual differentiation but underwriting confidence. The chapter corpus shows some independent practitioner validation via Gimlet, and it shows broader market appetite for specialized inference hardware, but it does not show broad third-party benchmark coverage, public reliability metrics, named production customers, or published compliance credentials. The 3DIMC, Pavehawk, and Raptor story is especially important because it is meant to extend the current architecture into larger-memory deployments; it is also still the least externally validated part of the roadmap. Investors can underwrite a concrete architecture today, but they still need diligence on scale, trust, and roadmap execution.[CE031, CE032, CE033, CE034, CE035, CE037]

Trust, Quality, and Compliance Table
Control / signal	Status	Scope	Gap
Secure boot on JetStream	Publicly disclosed	NIC-level boot integrity for the networking card	No broader platform security architecture is published for Corsair plus Aviator plus JetStream as a system
Kubernetes plugin, metrics exporter, debugger, and profiler	Publicly disclosed	Operational tooling for deployment, monitoring, and developer diagnosis	Tooling existence is not the same as public uptime, MTBF, or support-performance data
Open-standards deployment model	Publicly disclosed	PCIe cards, Ethernet networking, and standard top-of-rack switching reduce bespoke infrastructure risk	Open standards do not remove integration, firmware, or partner-qualification risk
Independent practitioner benchmark from Gimlet	Publicly disclosed but narrow	One speculative-decoding workflow on a heterogeneous Corsair-plus-GPU setup	The validation surface is narrow and is not equivalent to broad production benchmarking
Public compliance credentials	Not disclosed in reviewed corpus	Enterprise trust posture for regulated, sovereign-cloud, and security-sensitive buyers	No public SOC 2, ISO 27001, FedRAMP, or trust-center references were found in the reviewed official surfaces
Public fleet reliability metrics	Not disclosed in reviewed corpus	Field reliability, service durability, and operator confidence	No public MTBF, RMA, uptime, or failure-rate disclosure appears in the chapter corpus

This table distinguishes disclosed controls from missing trust artifacts. Absence refers only to the reviewed corpus and should be closed by diligence requests rather than assumed away.

[CE009, CE035, CE037, CE038, CE039, CE040]

FE004: Product Maturity and Capability Map

Maturity view across the current product stack and the future 3DIMC roadmap, emphasizing external proof and trust gaps.

This is an author judgement based on public evidence depth, not a company scorecard. The matrix is intentionally strict on independent validation and trust disclosure.

[CE013, CE016, CE023, CE035, CE038, CE040]

5.5 Exhibits

Chapter 06

06Customers

6.1 Buyer Segments and Buying Centers

Public materials give a reasonably consistent picture of who d-Matrix is trying to sell to, even if they do not reveal how many accounts have actually converted. The Series C release anchors the top-level target set as hyperscale, enterprise, and sovereign customers, while SquadRack narrows the operational buyer to cloud providers, sovereign clouds, and enterprises trying to serve agentic AI, reasoning, and video workloads. The recurring decision-maker profile is therefore infrastructure and platform leadership rather than application teams: data-center operators, AI platform architects, and cloud builders looking for lower-latency inference inside existing facilities. That targeting is reinforced by the sales narrative around standard PCIe servers, Ethernet, and air cooling. d-Matrix is not pitching a hyperscaler-only custom rack that requires greenfield infrastructure; it is pitching a retrofit path for operators who want to add inference capacity without taking on a proprietary fabric or liquid-cooling mandate. The strongest current fit appears to be enterprise and neocloud-style operators plus sovereign-cloud programs that value infrastructure optionality, while disclosed hyperscale production references remain absent.[CU001, CU002, CU003, CU004, CU005, CU006]

Customer segmentation table
Segment	Buyer / budget owner	Primary users	Public proof	Strategic value	Key gap
Hyperscalers and AI factories	Platform and infrastructure leadership	AI infra teams serving frontier or high-volume inference	Series C release names hyperscale customers as a target, but no account is disclosed by name	Would validate very large-scale production adoption and silicon credibility	No named hyperscale production customer is public
Enterprise datacenters	CIO / CTO / AI platform budget	Infrastructure and platform teams retrofitting existing facilities	GigaIO, Supermicro, and Liqid messaging all frame Corsair around enterprise deployment and TCO	Near-term commercial wedge because retrofit economics matter	Most proof is partner-led rather than end-customer case-study led
Sovereign clouds and national programs	State-backed cloud operators and infrastructure procurement teams	Operators serving regulated or national workloads	Series C and SquadRack materials explicitly mention sovereign customers or sovereign clouds	Could be high-value accounts that care about infrastructure control	No sovereign customer is named or quantified publicly
Specialist AI clouds / neoclouds	Cloud operator leadership	Model providers and AI-native companies	Gimlet is the clearest named operator using heterogeneous infrastructure with Corsair	Provides a route into high-usage workloads without needing a hyperscaler logo first	Public availability is still select-customer in 2H 2026
OEM / channel ecosystem	System integrator and OEM product teams	Server, rack, and networking integration teams	Supermicro, GigaIO, Arista, Broadcom, and Liqid appear in public commercialization materials	Critical force multiplier for reach, deployment speed, and support coverage	Channel dependence can obscure true end-customer count and revenue mix

This table separates end-buyer segments from the OEM and channel layer that helps d-Matrix reach them. Public evidence is strongest on the route to market, not on disclosed account counts by segment.

[CU001, CU002, CU004, CU005, CU007, CU025]

FU001: Customer journey map

d-Matrix is trying to move buyers from retrofit evaluation to scale-up, rack-scale, and heterogeneous cloud expansion without forcing a greenfield datacenter rebuild.

Stages are analytical journey steps synthesized from public commercialization materials, not a company-authored funnel with disclosed volumes.

[CU003, CU019, CU020, CU021, CU022, CU041]

6.2 Named Proof and Deployment Maturity

The customer-evidence picture improves materially once the analysis separates named proof points from broad customer-count rhetoric. Publicly, d-Matrix can point to a genuine commercialization ladder: Corsair sampling to early-access customers in late 2024, JetStream samples in 2025, GigaIO's integrated SuperNODE offer in May 2025, SquadRack's Supermicro channel availability for Q1 2026, and the March 2026 Gimlet Labs partnership for heterogeneous cloud deployment. Those are meaningful milestones because they show the company moving from hardware announcement to partner-integrated systems and then to a named external operator. But the same corpus also defines the limit of the proof. The ecosystem page explicitly says the showcased logos are not actual customers, and the public GigaIO, Supermicro, and Liqid statements remain partner endorsements rather than end-customer case studies. Gimlet is the clearest named external operator in the set, yet even there the public record says select-customer availability is planned for the second half of 2026 rather than already broad production. In other words, deployment maturity is real, but it is still better evidenced by channel readiness and operator pilots than by disclosed production-account depth.[CU008, CU009, CU010, CU011, CU012, CU013]

Customer growth / adoption trajectory table
Milestone	Public status	Date	Source	Confidence	Implication	Missing denominator
Corsair early-access sampling	Sampling to early-access customers	2024-11-19	d-Matrix / eeNews Europe	high	Shows the product moved beyond lab-only positioning	No count of early-access accounts or conversion rate
GigaIO scale-up partnership	Integrated enterprise scale-up system announced	2025-05-01	d-Matrix / GigaIO / AIwire	medium	Demonstrates a route to single-node enterprise deployment	No installed-base or booked-customer figure
JetStream NIC	Samples available; full production expected by year-end	2025-09-08	d-Matrix	medium	Expands the platform from accelerator cards into multi-node networking	No shipment or attach-rate disclosure
SquadRack commercialization	Configurations available for purchase through Supermicro in Q1 2026	2025-10-14	PRNewswire / d-Matrix	high	Moves proof from reference architecture toward channel transactability	No disclosed order backlog or customer count
Series C customer expansion	Company says funding supports multiple large-scale deployments and rapid customer growth	2025-11-12	d-Matrix / independent coverage	medium	Suggests the sales funnel progressed beyond pilots	No account names, deployment counts, or revenue split
Gimlet cloud offer	Solution planned for select customers in 2H 2026	2026-03-12	d-Matrix / Data Center Knowledge	medium	Freshest named external operator proof in the corpus	Select-customer plan is not the same as broad GA or renewal data

Trajectory milestones are operational commercialization markers, not customer-count metrics. Public materials show stage progression but do not disclose the denominator behind each step.

[CU008, CU009, CU010, CU011, CU014, CU015]

Named customer proof table
Proof point	Segment	Deployment / use case	Production vs pilot	Outcome / signal	Limitation
Gimlet Labs	Specialist AI cloud / operator	Corsair deployed alongside GPUs for agentic inference and speculative decoding	Pre-broad-availability; select customers planned in 2H 2026	Named external operator plus technical benchmark and planned cloud offer	Still short of disclosed broad production customer count or renewals
GigaIO	Enterprise scale-up infrastructure partner	Corsair integrated into SuperNODE for large single-node inference deployments	Integrated solution with early-access interest path	Shows a credible enterprise deployment vehicle and scale-up expansion route	Partner proof rather than named end-customer production logo
Supermicro	OEM / channel partner	SquadRack and X14 AI server platform for rack-scale inference	Purchasable configuration path in Q1 2026	Indicates channel transactability and rack-integration readiness	Public statement does not identify live end users or outcomes
Liqid	Composable infrastructure partner	Corsair integrated for flexible inference deployments	Partner endorsement only	Supports the story that Corsair can slot into alternate enterprise platforms	No named customer, deployment size, or usage metric is disclosed
Undisclosed early-access customers	Evaluation cohort	Corsair samples in late 2024	Pilot / evaluation	Shows some pre-GA external testing before broader commercialization	Account count, conversion, and segment mix are not public

The strongest named proof in public is a mix of one cloud operator and multiple channel/OEM partners. Logos and endorsements improve confidence in deployment maturity, but they do not answer the core question of how many end customers are in production.

[CU008, CU012, CU014, CU015, CU016, CU019]

FU002: Adoption / deployment funnel

Public evidence shows d-Matrix progressing from early-access sampling to channel availability and then to select-customer cloud rollout, but not yet to a disclosed installed-base metric.

Values are qualitative stage ordering only; they do not represent customer counts or conversion rates.

[CU008, CU009, CU010, CU015, CU041, CU044]

FU003: Customer proof matrix

The proof base is strongest on named partner and operator validation, and weakest on end-customer specificity, retention visibility, and concentration disclosure.

Matrix labels are qualitative judgments derived from the public corpus and are meant to compare evidence quality rather than score account economics.

[CU015, CU016, CU033, CU040, CU045]

6.3 Expansion Path, Channel Dependence, and Procurement Friction

The expansion thesis is coherent and probably the chapter's biggest positive. d-Matrix is trying to land first where latency-sensitive inference can be inserted into existing datacenters, then expand from card evaluation to single-node scale-up, rack-scale rollout, and broader heterogeneous-cloud deployments. GigaIO represents the single-node scale-up step, SquadRack plus Supermicro represent the rack-scale and channel-sales step, and Gimlet represents a cloud-service-provider path for workload-specific acceleration alongside GPUs. The challenge is that every one of those steps depends on partner execution. Public materials repeatedly show that d-Matrix needs OEMs, system integrators, switching partners, and orchestration layers to translate technical differentiation into repeatable customer adoption. Independent sources make the friction explicit: enterprises still optimize procurement around Nvidia's trusted software and systems ecosystem, and heterogeneous deployments only become easy when the abstraction layer hides hardware complexity. That means d-Matrix's customer expansion path is plausible, but conversion risk still sits at the intersection of OEM qualification, developer workflow compatibility, and whether buyers trust the platform enough to move from pilot or evaluation into repeat production spend.[CU021, CU022, CU023, CU024, CU025, CU026]

Expansion and concentration risk table
Expansion driver	How it expands	Concentration or channel risk	Impact	Diligence path
Existing-datacenter retrofit	Land via PCIe cards, standard Ethernet, and air-cooled deployments	Expansion depends on buyers trusting non-Nvidia acceleration in existing workflows	Can shorten time to first deployment if compatibility claims hold	Validate deployment effort and operator training burden on two live accounts
Single-node scale-up via GigaIO	Expand from evaluation card count to dozens of Corsairs in one node	Route is partner-mediated and does not reveal end-customer concentration	Helpful bridge from POC to meaningful workload size	Request pipeline by partner and stage, plus attach rates
Rack-scale path via Supermicro / SquadRack	Expand from node to rack and then multi-rack clusters	OEM availability does not prove sell-through or support quality	Could unlock larger enterprise, sovereign, and cloud budgets	Request bookings, deployments, and time-to-production through Supermicro
Cloud-operator path via Gimlet	Expand into heterogeneous inference cloud workloads and AI-native operators	Select-customer launch creates dependency on a small number of early operator wins	Could create high-usage reference accounts quickly	Request current committed capacity, target customers, and revenue model with Gimlet
Direct hyperscale or sovereign wins	Would expand average contract size materially	Public sources do not name any such account, leaving concentration upside and downside both opaque	Largest upside path but hardest one to underwrite publicly today	Request named accounts or procurement-stage evidence with timing and scope

The table treats concentration risk as both upside and downside because undisclosed large accounts can accelerate growth while also masking dependency.

[CU020, CU021, CU022, CU023, CU024, CU032]

Procurement and validation friction table
Friction point	Why it exists	Current mitigation	Public evidence	Remaining ask
Entrenched Nvidia ecosystem	Customers already know the incumbent software, tooling, and deployment model	d-Matrix positions open standards, better latency, and lower power/TCO	Forbes, GFM Review, and IntuitionLabs all highlight incumbent ecosystem advantage	Request win-loss analysis against Nvidia-based alternatives
Heterogeneous deployment complexity	Multiple accelerators only help if orchestration hides hardware complexity	Gimlet abstracts workload placement and d-Matrix markets Aviator plus JetStream	Data Center Knowledge and Gimlet describe the abstraction layer as central	Request operator evidence on deployment effort and code changes
OEM and channel dependence	Customers buy systems, racks, and support packages rather than bare silicon	Supermicro, GigaIO, Liqid, Arista, and Broadcom extend the commercial surface	Official commercialization materials repeatedly rely on those partners	Request support split, certification matrix, and partner revenue concentration
Proof depth versus proof quality	Partner quotes and architecture milestones are easier to publish than production cohort data	d-Matrix now has one named operator in Gimlet and purchasable SquadRack configurations	The corpus shows real progress but still lacks retention and installed-base disclosure	Request production reference calls and cohort metrics before underwriting durable demand

This table is intentionally qualitative. Public evidence is much stronger on what could make procurement easier than on how quickly those mitigations are converting into durable production revenue.

[CU024, CU034, CU035, CU036, CU037, CU038]

6.4 Durability, Concentration, and Disclosure Gaps

Durability evidence is the weak point in the public record. Official and independent sources say the company is seeing rapid customer growth and preparing multiple large-scale deployments, but none of the reviewed materials quantify active customer count, repeat-order frequency, contract length, gross retention, net revenue retention, or churn. The same goes for concentration risk: there is no public disclosure of top-customer exposure, partner-versus-direct channel mix, or what share of revenue comes from any one hyperscale, enterprise, or sovereign account. As a result, the chapter can underwrite that commercialization is moving forward, but not that it is already broad, sticky, or diversified. The freshest evidence in 2026 is architectural and partner-led, not cohort-led. That is the central underwriting gap investors should keep front and center: d-Matrix has enough named proof to show adoption is not purely conceptual, but not enough disclosure to prove that adoption is durable, repeatable, and diversified at production scale.[CU028, CU029, CU030, CU031, CU032, CU033]

Retention / repeat usage / satisfaction table
Metric	Value	Segment	Confidence	Why it matters	Diligence ask
Customer count		All segments	low	Without active-account count, deployment rhetoric cannot be translated into market share or adoption depth	Request current active production customers by hyperscale, enterprise, sovereign, and cloud-operator segment
Net revenue retention		All recurring accounts	low	NRR is the cleanest way to test land-and-expand claims in infrastructure	Request trailing-12-month NRR by direct and channel-sourced cohort
Gross retention / churn		All customers	low	Gross retention shows whether deployments survive beyond proof-of-concept	Request logo churn, gross retention, and reasons for non-renewal
Contract duration / repeat order cadence		Enterprise and sovereign accounts	low	Hardware and infrastructure businesses can look healthy on pipeline while still lacking repeat orders	Request average term length, renewal cadence, and backlog conversion data
Referenceable satisfaction / user advocacy	Only partner quotes and a named operator benchmark are public	Channel and operator proof	medium	Reference calls and user satisfaction matter because the product changes infrastructure design choices	Request at least three production reference calls with deployment age, workload, and realized outcome

Nulls are intentional. The public record reviewed for this chapter does not disclose retention, renewal, or customer-count metrics in a way that can be underwritten quantitatively.

[CU028, CU029, CU030, CU031, CU040, CU045]

6.5 Exhibits

Chapter 07

07Risks

7.1 Roadmap and System-Integration Risk

The highest-ranked risk is that d-Matrix is not selling one isolated chip; it is trying to commercialize a synchronized stack spanning Corsair cards, JetStream networking, Aviator software, partner-qualified racks, and a next-generation 3DIMC roadmap. That architecture can be differentiated and still fragile. JetStream is still publicly framed as early commercialization rather than mature fleet deployment, and the Pavehawk-to-Raptor path is explicitly described in lab-validation terms instead of production yield or broad customer proof. The company also made major architecture pivots during its first five years, which shows adaptability but also means buyers and investors are underwriting a moving roadmap. Mitigation exists in the form of concrete product surfaces and partner disclosures, but mitigation maturity remains only moderate because the public record still relies heavily on company-authored milestones rather than broad independent production evidence.[CR001, CR003, CR004, CR005, CR006, CR007]

Operational / quality / security risk register
Failure mode	Likelihood	Severity	Mitigation maturity	Residual exposure	Unresolved gap
JetStream and multi-node deployment remain earlier-stage than mature incumbent networking stacks	high	high	moderate	high	Public evidence is still early-commercial rather than long-running production proof
Pavehawk and Raptor remain roadmap assets, so next-generation memory scaling is not yet field validated	high	blocking	low	high	No public production-yield, GA timing, or external benchmark package was found
System efficiency claims may compress once networking, orchestration, and support overhead are included	medium	high	low	medium	The reviewed corpus lacks rack-level TCO or fleet operations data from independent production users
Field reliability is under-evidenced because no public MTBF, uptime, or RMA metrics were surfaced	medium	high	low	high	Durability remains unresolved outside company-authored deployment materials
Disaggregated inference adds more integration surfaces where firmware, switch, or software coordination can fail	medium	high	moderate	medium	There is little public evidence on install time, operational toil, or rollback behavior
Rapid workload evolution can make today’s architectural assumptions less durable if models shift again	medium	medium	moderate	medium	The roadmap must keep pace with future reasoning, multimodal, and agentic inference patterns

Operational risk is dominated by maturity and integration rather than by a single disclosed defect. Public mitigations exist, but most are still conceptual or partner-led.

[CR001, CR003, CR004, CR005, CR006, CR008]

FR003: Dependency map

Maps the core dependencies that make d-Matrix’s differentiated architecture simultaneously open-standards friendly and execution-sensitive.

Nodes are grouped analytically to show underwriting dependencies rather than to mirror one official product diagram.

[CR002, CR004, CR005, CR009, CR013, CR014]

7.2 Supply, Manufacturing, and Ecosystem Dependence

The second-ranked risk is concentrated dependency across packaging, foundry choices, networking, and system integration. The next-generation memory roadmap depends on external advanced-packaging execution, and public third-party coverage ties d-Matrix’s chiplet path to TSMC-node choices and evolving interface standards. At the same time, the current go-to-market motion depends on an ecosystem of Arista, Broadcom, Supermicro, GigaIO, and similar partners to turn cards into deployable systems. That partner-heavy model reduces proprietary lock-in for the customer, but it shifts execution risk onto third-party qualification, support handoffs, and BOM availability. Macro supply conditions do not make this easier: advanced packaging bottlenecks and adjacent GPU shortages have already shown how quickly AI-hardware supply chains can tighten. Mitigation today is conceptual openness and partner breadth, not evidence of a self-contained or highly redundant supply chain.[CR002, CR009, CR010, CR011, CR012, CR013]

Partner / dependency risk register
Dependency	Counterparty / cluster	Role	Concentration	Failure scenario	Severity	Mitigation	Residual exposure
Advanced packaging and 3DIMC execution	Alchip plus future foundry and OSAT chain	Next-generation roadmap enablement	high	Raptor slips or ships with weak economics because packaging, node, or qualification work lags	blocking	Hold next-gen valuation upside contingent on clear manufacturing milestones	high
Reference-rack and OEM stack	Arista, Broadcom, Supermicro	Turns cards and NICs into deployable systems	high	Qualification or support gaps slow customer rollout even if silicon is available	high	Demand a validated BOM and support handoff model	high
Single-node scale-up partner	GigaIO	Provides a scale-up deployment path for larger configurations	medium	Partner-led deployment proof does not translate into broad repeatable field adoption	medium	Separate partner demos from repeatable customer demand in diligence	medium
Early external customer-validation channel	Gimlet Labs	Provides cloud deployment and early customer visibility	medium	The reference partnership fails to convert into diversified production-customer proof	high	Track conversion from select-customer availability to named production wins	high
Incumbent software and operations ecosystem	CUDA-era buyer workflows and procurement norms	Shapes default enterprise buying behavior	high	Customers keep defaulting to incumbent stacks because switching costs outweigh point-performance upside	high	Require evidence of workload-level wins that survive full-system comparison	high

This register mixes supplier, integrator, and market-structure dependencies because all three can block commercialization even if the silicon itself performs.

[CR002, CR009, CR010, CR013, CR014, CR028]

7.3 Adoption Proof and Deployment Friction

The third-ranked risk is that adoption proof still trails the technical story. Public evidence is strongest around early-access partners, reference architectures, and select-customer availability windows rather than around a broad set of named production users. The Gimlet and GigaIO announcements help because they show real counterparties and deployment intent, but they still read as early commercialization surfaces, not a mature installed base. That matters because enterprise AI adoption remains constrained by networking, power, and operational complexity even when the chip-level economics look better. d-Matrix’s air-cooled, standard-server positioning is a real sales advantage, yet the public record still does not show whether customers can deploy the full stack without heavy solution-engineering effort or long qualification cycles. This is therefore both a customer-proof risk and an implementation-friction risk.[CR014, CR015, CR016, CR017, CR018, CR035]

7.4 Compliance, Security, and Regulatory Risk

The fourth-ranked risk is that compliance, export-control, and security proof is thinner than what many enterprise and sovereign buyers now expect from inference infrastructure vendors. The public legal surfaces that were easy to verify are the website privacy policy and terms of use; those are necessary, but they are not a substitute for product-security architecture, audit posture, or a dedicated trust center. That gap matters because regulated buyers increasingly prioritize data control, predictable governance, and on-prem assurance, and competing platforms market those qualities explicitly. Overlaying that is export-control risk. BIS has already tightened controls on advanced computing semiconductors and circumvention pathways, which means advanced-AI hardware supply chains operate under an active regulatory regime. D-Matrix may never be directly blocked by those rules, but investors still need to understand how packaging partners, manufacturing geographies, and customer targets interact with evolving trade controls.[CR023, CR024, CR025, CR026, CR027, CR041]

Regulatory / legal risk register
Risk	Jurisdiction / scope	Current public signal	Likelihood	Severity	Mitigation maturity	Residual exposure	Diligence path
Advanced-chip export controls and anti-circumvention rules can disrupt supply-chain planning or customer geography	U.S. and countries of concern	BIS tightened controls on advanced computing semiconductors and related circumvention pathways	medium	high	low	high	Obtain the manufacturing geography map and export-control counsel memo covering packaging partners and target accounts
Public product-security and compliance disclosure is thin relative to regulated-buyer expectations	Global enterprise and sovereign buyers	Reviewed public legal pages did not surface a trust center or product assurance package	medium	high	low	high	Request security architecture, audit posture, and regulated-customer references
Website privacy and terms coverage does not answer product-level data-governance or liability questions	Customer contracts and deployment governance	Public legal pages describe site handling and disclaimers rather than model or fleet controls	medium	medium	low	medium	Review customer contracts, DPA terms, indemnities, and security commitments
Trade-policy shifts can affect next-generation packaging or node transitions even if current products are shipping	Foundry, packaging, and future roadmap	The roadmap points to more advanced packaging and node sensitivity over time	medium	high	low	high	Map which future programs would be most exposed to rule changes or licensing reviews
Public compliance posture may be insufficient for sovereign or highly regulated deployment bids	Government, defense-adjacent, finance, and healthcare buyers	Competitors market stronger public security or on-prem assurance language than d-Matrix does today	medium	high	low	medium	Request the product-security roadmap and list of blocked or delayed regulated opportunities

This register focuses on public legal, export, and trust signals. Residual exposure remains elevated because the current corpus is stronger on architecture than on auditable compliance evidence.

[CR023, CR024, CR025, CR026, CR027]

7.5 Competition, Financing, and Kill Triggers

The final ranked cluster is the combination of incumbent competition, ecosystem lock-in, and financing discipline. NVIDIA, AMD, and specialist alternatives such as Groq and Cerebras keep raising the bar on memory capacity, rack-scale integration, deployment model, and software expectations. Independent analysis also argues that switching costs are system-wide, not just chip-level, and that the market stays concentrated through 2026. That means d-Matrix must win specific workloads where its latency-per-watt or brownfield-deployment advantages survive full-system comparison. Financing risk compounds this challenge. The 2025 round bought time, but it also set a valuation benchmark before broad public revenue proof. Management’s own history includes a near-death cash moment, and the company is expanding footprint and headcount while public customer proof remains limited. The investment case should therefore stay milestone-based, with explicit thesis-break triggers tied to roadmap timing, production-customer conversion, and supply-chain stability.[CR019, CR020, CR021, CR022, CR028, CR029]

People / execution risk register
Role / function	Dependency or gap	Likelihood	Severity	Mitigation	Diligence path
Global engineering and operations footprint	Teams now span multiple geographies and must coordinate silicon, software, and commercialization programs	medium	high	Program-management discipline and stronger release governance	Request org chart, release cadence, and decision-rights model across sites
Commercialization leadership and field support	Public proof of scaled customer-success and support operations is limited	medium	high	Build customer engineering and post-sales capability before broad rollout	Request support headcount, escalation model, and deployment staffing assumptions
Product-definition discipline	Company history includes multiple architectural pivots as the market evolved	medium	medium	Milestone-gated roadmap with explicit customer design-partner feedback loops	Review roadmap governance, stage-gates, and criteria for killing or adding programs
Capital allocation during scale-up	Hiring and footprint expansion can raise burn faster than validated revenue proof	medium	high	Tie hiring and global expansion to booked demand and support load	Request burn bridge, hiring plan, and trigger points for expense control

Execution risk is less about founder credibility and more about whether the company can scale commercialization and release discipline fast enough to match its technical ambition.

[CR007, CR019, CR020, CR021, CR022, CR044]

Mitigation and kill criteria table
Risk	Monitorable trigger	Threshold / event	Action implication
Roadmap maturity	JetStream or Raptor timeline	Material slip versus customer qualification or procurement windows	Move from underwriting roadmap upside to valuing only currently deployable product surfaces
Customer proof	Named production deployments	No diversified production-customer set beyond partner or early-access references by 2026-2027	Downgrade commercialization confidence and valuation support
Supply and packaging	Advanced packaging or export-control shock	Rule change, capacity miss, or cost spike that materially changes roadmap timing or unit economics	Treat next-gen revenue assumptions as delayed and revisit cash needs
Competitive differentiation	Full-system latency-per-watt comparison	Incumbent or specialist alternatives close the gap without comparable integration burden	Reduce willingness to pay for architecture premium and moat assumptions
Security and regulated adoption	Trust-center and compliance evidence	No product-security assurance package while regulated opportunities remain part of the narrative	Constrain TAM assumptions and delay sovereign or regulated-market upside
Financing discipline	Burn versus proof	Headcount and global expansion continue while bookings or production evidence remain thin	Shift the thesis to capital preservation and downside underwriting

These triggers are designed to be observable from follow-on diligence or future refreshes rather than generic narrative concerns.

[CR037, CR038, CR039, CR040, CR041, CR045]

FR001: Risk heatmap

Residual-risk view showing roadmap and customer-proof risks clustering in the highest-impact cells, with compliance and financing risks close behind.

Likelihood and impact are qualitative rankings synthesized from the reviewed corpus; they are not statistical loss estimates.

[CR036, CR037, CR038, CR039, CR040, CR041]

FR002: Risk transmission map

Shows how roadmap, supply, compliance, and adoption risks transmit into bookings, burn, financing, and valuation.

This map is analytical rather than company-authored; it visualizes second-order effects that matter to underwriting.

[CR016, CR021, CR022, CR036, CR037, CR038]

7.6 Exhibits

Chapter 08

08Valuation

8.1 Verified marks, price discipline, and the core call

d-Matrix is not a paper company. The 2025 Series C was publicly disclosed at $275 million and a $2 billion valuation, the investor list is real, and the company has a visible product story around Corsair accelerators, JetStream networking, and Aviator software. The round also came after a 2023 $110 million Series B and years of technical positioning around digital in-memory compute for inference. That is enough to justify serious attention. It is not enough to justify blind acceptance of the last mark. The valuation problem is the denominator. The reviewed public corpus still does not disclose revenue, ARR, gross margin, free cash flow, customer count, or cap-table terms. The Series C announcement claims rapid customer growth and expanding partners, but it does not tell an outside investor what portion of that story is pilot activity, production deployment, or economically attractive recurring demand. Without those numbers, a conventional late-stage revenue multiple cannot be defended from public evidence. The right conclusion is therefore price-sensitive. The company may be strategically relevant to the inference stack, but at $2 billion the public record still looks incomplete. That is why the present-tense call is research-more rather than buy: the quality of the technical story is good enough to keep working, but the public proof is still too thin to treat the latest round as a clean fair-value anchor.[CV001, CV002, CV003, CV005, CV006, CV007]

Recommendation summary table
Dimension	Assessment	Evidence anchor	Decision implication
Recommendation	research-more	Real financing and product proof, but missing operating denominator metrics	Keep working only if diligence rights or pricing improve
Confidence	medium	Funding and technical facts are real, but revenue/ARR/margin remain undisclosed	Use wide scenario bands and avoid precision
Risk rating	high	Commercialization, benchmark normalization, and next-price discovery are still unresolved	Assume downside asymmetry until private evidence closes the gaps
Valuation stance	stretched at $2B	Series C price sits above the chapter's public-evidence midpoint	Treat the last round as a milestone mark, not a proven bargain
Indicative entry discipline	below ~$1.5B or with hard downside protection	Base case centers below the last round and no target return can be supported publicly at $2B	Require price improvement, structure, or unusually strong data-room proof
Likely exit path	strategic sale or later secondary before IPO	Strategic relevance is more visible than public-market disclosure readiness	Do not underwrite near-term IPO liquidity from the public record

Recommendation is explicitly price-sensitive. It judges the 2026 evidence set and the latest disclosed round, not just the quality of the architecture.

[CV001, CV017, CV049, CV050, CV051, CV052]

Thesis / anti-thesis table
Argument	Bull read	Bear read	What would change the view
Inference timing	Investors correctly anticipated inference becoming the economic battleground of AI	The theme may be right while this price and this company-specific capture are still too optimistic	Show named production customers winning inference budgets today
Architecture	Memory-local design and modular insertion could solve a real latency and power problem	Claims remain too company-authored and may not survive normalized testing	Publish a third-party same-model benchmark with throughput, latency, power, and cost together
Commercialization	GigaIO, Gimlet, and sovereign/enterprise language suggest real market pull	Public evidence may still be mostly pilot, partner, and roadmap proof rather than scaled recurring demand	Disclose booked revenue, customer count, and pilot-to-production conversion
Comparable context	Scarce inference assets may deserve a premium to generic hardware or software names	NVIDIA and AMD already disclose enormous scale, while private challengers remain heterogeneous and opaque	Prove d-Matrix belongs in a premium scarcity bucket instead of a speculative startup bucket
Price discipline	$2B could be reasonable if bull-case milestones arrive quickly	At present the mark already prices in success without public financial proof	A lower entry or strong private data-room evidence would materially improve the call

The anti-thesis is not that d-Matrix lacks technical merit. It is that the latest price gets ahead of what the public corpus can actually support.

[CV013, CV014, CV015, CV016, CV017, CV030]

FV001: Recommendation logic

The call flows from real financing and technical credibility through missing denominator metrics to a price-sensitive research-more recommendation.

This is a decision-logic compression of the chapter, not a causal operating model.

[CV001, CV013, CV014, CV017, CV032, CV044]

8.2 What the public record supports — and what it does not

The strongest public evidence supports d-Matrix as an inference-first hardware company with real financing, a coherent architecture, and growing ecosystem ties. Company, investor, and partner materials consistently repeat the same operating narrative: inference is becoming the economic bottleneck in AI, d-Matrix designed around the memory-movement problem, and products such as Corsair can improve throughput, latency, power, and total cost versus GPU-heavy alternatives. GigaIO and Gimlet also provide at least some outside evidence that the hardware is being integrated into real systems and tested in heterogeneous inference workflows. But the evidence gets weaker exactly where valuation needs it to get stronger. Most numerical economic claims are company-authored or partner-amplified. The best independent technical support in the corpus is Gimlet's speculative-decoding analysis, which is useful but still narrower than a same-model, same-batch, same-latency benchmark versus incumbent GPU clusters. More importantly, no reviewed source discloses the recurring revenue base, margin structure, pricing curve, or customer concentration that would let an investor translate technical promise into a defensible enterprise value. That asymmetry matters. Public evidence today supports the proposition that d-Matrix could become important. It does not support the proposition that the current private mark is already cheap. In valuation terms, the chapter can underwrite strategic optionality and milestone progress; it cannot underwrite hidden unit economics as if they had been publicly proved.[CV004, CV005, CV006, CV009, CV010, CV011]

FV004: Investment KPIs

KPI dashboard highlights the few hard public anchors and the main missing denominator facts driving the recommendation.

Dashboard mixes disclosed facts and inferred valuation outputs; every inferred item is labeled as such in the detail field.

[CV001, CV004, CV017, CV049, CV050, CV053]

8.3 Comparable set and inferred underwriting logic

The comp set is useful, but only if it is used correctly. NVIDIA and AMD are the cleanest public reference points for scale, disclosure quality, and incumbent strength. Their official product pages define the technical bar on memory, bandwidth, and rack-scale integration, while their 2026 filings show what fully disclosed AI infrastructure businesses look like when they are already operating at enormous commercial scale. That makes them indispensable scale fences. It does not make them direct pricing comps for a private startup with no public revenue denominator. The private and specialized challenger set is also heterogeneous. Groq is easier to understand commercially because it exposes service tiers and on-prem optionality. Cerebras competes through wafer-scale integrated systems. Low-confidence secondary coverage suggests SambaNova belongs in yet another bucket: integrated enterprise AI infrastructure rather than modular card insertion. D-Matrix sits somewhere else again, trying to sell a memory-local inference stack that can slot into more standard server and rack environments. Those are not minor packaging differences; they change customer budgets, deployment friction, and the kind of valuation support that can be observed publicly. The most plausible inference is that Series C investors were underwriting category timing and architectural fit, not already-revealed financial scale. In other words, they likely paid for the belief that inference demand will keep rising, that the market will reward lower-latency and lower-power alternatives, and that d-Matrix can convert partner and sovereign or enterprise interest into a durable niche before incumbent stacks absorb the opportunity. That logic is understandable. It is still not the same thing as public valuation support.[CV018, CV019, CV020, CV021, CV022, CV023]

Comparable valuation table
Reference	Status	Observed metric	Multiple / valuation context	Why it matters	Key limitation
d-Matrix Nov-2025 Series C	Verified private mark	$275M raise at $2B valuation; $450M total raised	Latest disclosed pricing anchor	Only clean private valuation datum in the reviewed corpus	No public revenue, ARR, or margin denominator
NVIDIA FY2026 + Q1 FY2027	Public incumbent scale fence	$215.9B FY2026 revenue; $193.737B Data Center revenue; $81.615B latest quarter	Useful as disclosure and scale boundary, not a startup multiple comp	Shows what fully disclosed AI infrastructure scale looks like	Training and inference are mixed; business is far broader than d-Matrix
AMD FY2025 + Q1 2026	Public alternative scale fence	$34.639B FY2025 revenue; $16.635B Data Center revenue; $10.253B latest quarter	Another scale and disclosure boundary rather than a direct price comp	Shows how even a challenger incumbent discloses denominator metrics d-Matrix does not	Also mixes broader portfolio exposure beyond inference
Groq	Private inference-service reference	Public/private/co-cloud plans and GroqRack on-prem option are visible	Commercial packaging is clearer than d-Matrix public pricing	Useful for business-model comparison and deployment-path clarity	Reviewed corpus does not provide high-confidence current valuation support
Cerebras	Private integrated-system reference	Wafer-scale inference and training platform; system-level architecture	Comparable as a differentiated architecture, not as a modular card comp	Shows another path to inference differentiation	Reviewed corpus does not provide high-confidence current valuation support
SambaNova / other integrated inference hardware	Low-confidence private reference	Secondary coverage describes integrated enterprise platform and recent funding	Included as category context only, not a numeric pricing anchor	Confirms the private comp set is fragmented, not standardized	Source quality is insufficient for high-confidence valuation transfer

This table intentionally mixes one verified private mark, two public scale fences, and three business-model references. It is a comparable set for underwriting logic, not a clean multiple screen.

[CV001, CV022, CV023, CV026, CV027, CV028]

FV002: Valuation sensitivity

Directional evidence adjustments versus the $2B Series C mark show why the current public midpoint stays below the last round.

Bars are directional USD-million-style adjustments against the latest private mark, used only to visualize which evidence pushes the call up or down. They are not management guidance or transaction data.

[CV013, CV015, CV017, CV025, CV039, CV040]

8.4 Bull, base, and bear scenarios

Because revenue and ARR are undisclosed, scenario work has to be milestone-based rather than multiple-based. The right question is not “what exact EV/revenue multiple should a private AI chip startup trade on?” The public record cannot answer that. The right question is what evidence state would justify paying below, around, or above the last disclosed $2 billion mark. The bear case assumes that commercialization remains mostly pilot- and partner-led, that independent benchmark normalization still does not emerge, and that the next financing or secondary event prices d-Matrix closer to IP value plus conditional commercial optionality than to proven platform leadership. That produces a wide but clearly sub-round range of about $700 million to $1.0 billion. The base case assumes continued deployment progress and financing access, but still no public financial disclosure strong enough to prove a premium over the Series C valuation; that supports roughly $1.2 billion to $1.8 billion. The bull case requires named large-scale deployments, validated economics, and clearer evidence that d-Matrix can coexist with or displace incumbent GPU infrastructure in production, supporting roughly $2.5 billion to $3.5 billion. That framing matters because the latest disclosed round sits above the chapter's midpoint. The Series C is not impossible to justify, but it already prices in an execution path closer to the bull case than to the base case. That is why the current stance is stretched rather than attractive.[CV041, CV042, CV043, CV044, CV045, CV046]

Bull / base / bear scenario table
Scenario	Proof-state assumption	Valuation logic	Implied valuation range	Probability signal	Key trigger
Bear	Pilots do not convert clearly, benchmark proof remains thin, and next money reprices risk	Asset/IP plus conditional commercial optionality	~$700M-$1.0B	25%	A financing reset or stalled deployment evidence
Base	Deployments progress and financing remains available, but denominator metrics stay private	Milestone progress around the last round but below a full premium endorsement	~$1.2B-$1.8B	50%	Commercial progress without enough public proof to justify $2B+
Bull	Named large deployments, validated economics, and durable niche proof beside incumbent GPUs	Strategic scarcity plus de-risked commercialization	~$2.5B-$3.5B	25%	Independent validation and repeatable production wins
Probability-weighted midpoint	Mix of the three states above	Blended scenario center of gravity from public evidence	~$1.5B-$2.0B	100%	Useful discipline check versus the $2B Series C mark

These ranges are milestone-based scenario judgments in USD millions. They are not disclosed values, fair-value appraisals, or revenue-multiple outputs because the required denominator metrics are not public.

[CV041, CV042, CV043, CV045, CV046, CV047]

FV003: Valuation / return range

Milestone-based bear, base, and bull valuation ranges in USD millions versus the last disclosed round.

All figures are inferred scenario values in USD millions except the last line, which is the disclosed Series C valuation. The range is milestone-based because revenue and ARR are not public.

[CV001, CV045, CV046, CV047, CV048, CV049]

8.5 Exit readiness, thesis-break triggers, and final diligence asks

Public evidence supports strategic relevance more than exit readiness. The company is building into one of the most crowded but important layers of the AI stack, and that does create strategic-sale or later-secondary optionality if the platform becomes a must-have inference component. But current public disclosure does not look IPO-ready. Public comparables disclose revenue, margins, filing cadence, and segment performance; d-Matrix does not. That disclosure gap drives the kill criteria. A priced financing materially below $1 billion would strongly suggest the market has reset the Series C narrative. A failure to convert partner announcements and early-access programs into named production deployments would undercut the commercialization story. An independent benchmark that fails to support the claimed cost or efficiency edge would remove the central reason to pay a premium for a specialized inference architecture. The required diligence is therefore concrete rather than generic: audited revenue and ARR, margin history, customer concentration, bookings-to-deployment conversion, realized pricing and support terms, and the preference stack or any recent 409A or secondary marks. Without those, a new investor is underwriting the story that d-Matrix could matter, not the proposition that $2 billion is already a disciplined entry price.[CV017, CV050, CV053, CV054, CV055, CV056]

Thesis-break and kill triggers table
Trigger	Threshold / event	Transmission to thesis	Action implication
Reset financing	Any priced round or secondary clearing well below ~$1B	Confirms the Series C narrative did not hold and compresses upside from here	Rebuild the investment case from the new market-clearing price
Commercialization stall	No named production deployments or weak pilot conversion by the next financing cycle	Undercuts the argument that architecture is becoming a real revenue engine	Shift immediately toward the bear case
Benchmark failure	Independent same-workload tests fail to validate a material TCO or efficiency edge	Removes the main reason to pay a premium for specialized inference hardware	Strip out premium assumptions and revisit willingness to invest
Weak operating disclosure	Private diligence shows poor margin quality, concentrated customers, or low conversion	Converts a technology story into a low-quality hardware commercialization story	Demand a deep markdown or walk
Incumbent absorption	Customers standardize on NVIDIA/AMD or integrated alternatives instead of adding d-Matrix	Shrinks the niche the Series C appears to underwrite	Assume slower growth and lower exit optionality

Kill criteria are monitorable and tied directly to valuation transmission rather than generic company risk.

[CV039, CV041, CV043, CV053, CV054, CV055]

Final diligence asks table
Topic	Missing evidence	Why it matters	Owner / diligence path
Audited revenue / ARR	Monthly and annual revenue plus ARR for 2024-2026	Core denominator missing from every public valuation method	Request audited financial statements and board KPI packs
Gross margin and cash conversion	Gross margin, opex, burn, and working-capital profile	Needed to separate valuable infrastructure software leverage from expensive hardware deployment	Request audited P&L, cash-flow history, and margin bridge
Customer proof	Named production customers, cohort expansion, and top-account concentration	Needed to distinguish partner activity from durable recurring demand	Request customer list, concentration schedule, and pilot-to-production conversion data
Pricing and support economics	Realized ASPs, discounting, software attach, and support obligations	Needed to test whether public TCO claims survive commercial reality	Request pricing schedules, pilot invoices, and support-term summaries
Cap table and preferences	Fully diluted ownership, preference stack, side letters, and any recent 409A or tender marks	Needed because enterprise value and common-equity value can diverge sharply in markdown scenarios	Request cap table, waterfall model, and latest 409A or secondary data
Benchmark normalization	Same-model third-party benchmark versus H100/H200/MI300X and any integrated challenger actually in the shortlist	Needed to validate the whole chapter's premium-economics assumption	Commission or obtain normalized benchmark evidence before underwriting price

These asks are blocking rather than optional. Without them, an investor is underwriting a strategic narrative more than a defendable valuation.

[CV017, CV030, CV054, CV055, CV056]

8.6 Exhibits

Disclaimer

This report is a public-evidence diligence snapshot, not investment advice. Important financial, legal, technical, and contractual facts remain non-public and should be verified directly with management and primary documents before any investment decision.

Evidence index

Claims
ID	Statement	Confidence	Sources
CO001	d-Matrix was founded in 2019.	High	SO002, SO004, SO006
CO002	d-Matrix is headquartered in Santa Clara, California.	High	SO004, SO005, SO006
CO003	The latest public office list in the fetched pack names Toronto, Sydney, Bangalore, and Belgrade in addition to the Santa Clara headquarters.	High	SO004, SO011
CO004	The founders publicly identified by d-Matrix are Sid Sheth and Sudeep Bhoja, with Sheth as CEO and Bhoja as CTO.	High	SO002, SO004
CO005	The public executive bench on the about page includes software, product, finance, legal, and manufacturing leaders.	Medium	SO002
CO006	The public board roster names Per Roman, Sasha Ostojic, Jeff Huber, Michael Stewart, Connie Sheng, and Russell Tham.	Medium	SO002
CO007	The fetched public record does not disclose board committees, ownership percentages, or explicit protective provisions behind the named board slate.	Medium	SO002, SO004, SO008
CO008	d-Matrix positions itself as an accelerated computing company built for AI inference in data centers rather than as a general-purpose training hardware vendor.	High	SO001, SO004
CO009	The home page says d-Matrix's 3DIMC architecture uses a chiplet-based, PCIe-friendly design that can scale to models up to 100 billion parameters.	High	SO001, SO004
CO010	By late 2025 d-Matrix publicly described a three-part product stack of Corsair accelerators, JetStream networking, and Aviator software.	High	SO004, SO022, SO024
CO011	VentureBeat reported that d-Matrix raised $44 million in a Series A round in April 2022.	Medium	SO014
CO012	d-Matrix closed a $110 million Series B funding round on September 6, 2023, led by Temasek.	High	SO006, SO012
CO013	The Series B release said the new capital would help d-Matrix commercialize Corsair and invest in recruitment and commercialization.	Medium	SO006
CO014	The Series B release listed Playground Global, M12, SK Hynix, Nautilus Venture Partners, Marvell Technology, and Entrada Ventures among d-Matrix backers and strategic partners.	High	SO006, SO012
CO015	d-Matrix said its November 2025 Series C raised $275 million at a $2 billion valuation and brought total disclosed funding to $450 million.	High	SO004, SO007, SO008, SO009
CO016	The 2025 Series C was co-led by Bullhound Capital, Triatomic Capital, and Temasek, with new participation from QIA and EDBI plus follow-on participation from M12, Nautilus Venture Partners, Industry Ventures, and Mirae Asset.	High	SO004, SO007, SO008
CO017	Independent coverage corroborated the $275 million raise, $2 billion valuation, and $450 million total funding figures announced in Series C.	High	SO009, SO010, SO011
CO018	The latest disclosed employee count in the fetched pack is 250+ worldwide from the November 2025 Series C key facts.	Medium	SO004
CO019	The November 2025 materials identify Santa Clara as headquarters and Toronto, Sydney, Bangalore, and Belgrade as global office locations.	High	SO004, SO011
CO020	The public record supports treating d-Matrix as a private Series C infrastructure startup rather than a public company or early research project.	High	SO004, SO009
CO021	d-Matrix unveiled Corsair on November 19, 2024 as a new computing platform designed for AI inference in modern datacenters.	Medium	SO005
CO022	The Corsair launch materials claimed 60,000 tokens per second at 1 millisecond per token for Llama3 8B in a single server and 30,000 tokens per second at 2 milliseconds per token for Llama3 70B in a single rack.	Medium	SO005
CO023	Corsair was sampling to early-access customers in November 2024 and was expected to be broadly available in Q2 2025.	Medium	SO005
CO024	d-Matrix launched JetStream in September 2025 as a custom I/O card and said the NICs deliver up to 400Gbps bandwidth with full production expected by year-end.	Medium	SO022
CO025	JetStream, paired with Corsair and Aviator, pushed d-Matrix toward a full compute-networking-software platform rather than a single-chip offering.	High	SO022, SO023
CO026	SquadRack was announced in October 2025 with Arista, Broadcom, and Supermicro as an open standards-based rack-scale reference architecture for AI inference.	High	SO023, SO004
CO027	GigaIO said in May 2025 that integrating Corsair with SuperNODE would support enterprise-scale inference and cited 30,000 tokens per second at 2 milliseconds per token on Llama3 70B.	Medium	SO020
CO028	The November 2025 Alchip collaboration pointed to a 3D DRAM roadmap and said the commercial debut would come on the Raptor accelerator as Corsair’s successor.	Medium	SO024
CO029	Gimlet Labs reported in March 2026 that offloading speculative decoding to d-Matrix Corsair produced 2–10x end-to-end latency improvements at similar energy efficiency versus a homogeneous GPU setup.	Medium	SO021
CO030	The d-Matrix news index lists a March 2026 Gimlet Labs collaboration and an April 2026 acquisition of GigaIO’s data center business as post-Series C milestones.	Medium	SO003
CO031	CRN included d-Matrix in its list of the 10 hottest semiconductor startups of 2023 after the company’s $110 million Series B.	Medium	SO012
CO032	Silicon Republic described d-Matrix as an AI startup to watch in 2024 because major product launches were scheduled after its $110 million funding round backed by Temasek and M12.	Medium	SO013
CO033	VentureBeat linked Microsoft Project Bonsai support to d-Matrix before Corsair was public, giving the company early ecosystem validation.	Medium	SO014
CO034	CNN reported in 2023 that GPU and interposer shortages were constraining AI infrastructure, with Sid Sheth arguing companies would need more efficient approaches.	Medium	SO015
CO035	CNBC described LLM inference costs as structurally high and quoted Sheth arguing that GPUs were not built for spiky inference workloads.	Medium	SO016, SO018
CO036	TechTarget likewise quoted Sheth saying inference economics are about dollars per inference, power efficiency, and latency rather than only raw compute.	Medium	SO017
CO037	Bloomberg described investor interest in chips that process data where it is stored, a market tailwind that aligns with d-Matrix's in-memory positioning.	Medium	SO019
CO038	d-Matrix and Bullhound both claimed up to 10x faster performance, 3x lower cost, and 3-5x better energy efficiency than GPU-based systems, but those ratios remain vendor claims rather than independently audited field benchmarks in the fetched pack.	Medium	SO004, SO007, SO011
CO039	Named customer disclosure remains thin because Series C and product releases mention rapid customer growth and large-scale deployments without identifying a broad hyperscale or enterprise customer roster.	Medium	SO004, SO009
CO040	The clearest public commercialization proof in the fetched corpus comes from partner or customer-adjacent materials from GigaIO and Gimlet rather than from a broad set of named end customers.	Medium	SO020, SO021
CO041	GFM Review’s 2024 competitive overview argued that startups like d-Matrix still must overcome Nvidia’s entrenched ecosystem, scale production, and win broader adoption.	Medium	SO025
CO042	The public board affiliations mirror the capital base by linking Bullhound, Playground, Triatomic, M12, Nautilus, and Temasek to visible governance seats.	Medium	SO002, SO004, SO007, SO008
CO043	Series C materials said the new capital would accelerate global expansion and support multiple large-scale deployments for hyperscale, enterprise, and sovereign customers.	High	SO004, SO007, SO009
CO044	No public revenue or ARR figure appears anywhere in the fetched company, investor, or independent source pack.	Low	SO001, SO004, SO009
CO045	No public companywide customer count appears in the fetched pack even though the company and partners refer to growth and deployments.	Low	SO004, SO009, SO021
CO046	Current control economics remain opaque because no fetched source discloses ownership percentages, board committees, or investor-protection mechanics.	Low	SO002, SO004, SO008
CM001	d-Matrix’s product and technical materials consistently place the company in the datacenter inference market rather than the training market.	High	SM002, SM003, SM014
CM002	d-Matrix’s commercial form factor is merchant infrastructure built from PCIe cards, servers, racks, and Ethernet networking rather than a bespoke end-to-end datacenter stack.	High	SM002, SM004, SM006, SM014
CM003	The most relevant market boundary for d-Matrix is merchant datacenter generative inference infrastructure, excluding model training clusters and broad software or services spend.	Medium	SM002, SM003, SM011
CM004	Public source material separates inference into compute-bound prefill and memory-bound decode, implying that the relevant hardware market cannot be analyzed with a training-style compute lens alone.	High	SM009, SM022
CM005	Reasoning and agentic workflows make end-to-end latency more decision-relevant than raw tokens-per-second because each extra inference step compounds user waiting time.	Medium	SM005, SM009, SM012
CM006	Distillation and better smaller models broaden the set of inference workloads that can be served by specialized merchant hardware rather than only frontier-scale GPU clusters.	Medium	SM005, SM012
CM007	d-Matrix’s own product envelope implies a narrower commercial wedge focused on low-latency and memory-efficient serving rather than universal coverage of every model size or all AI compute.	Medium	SM002, SM003, SM005
CM008	Independent coverage shows that AI demand has strained GPU supply and datacenter expansion plans, creating a category-level opening for alternative inference architectures.	High	SM017, SM018, SM020
CM009	Microsoft’s public disclosure that AI datacenters depend on predictable energy, networking supplies, and GPUs shows that inference scaling is constrained by infrastructure inputs beyond compute silicon alone.	High	SM018, SM020
CM010	Memory movement between processor and memory chips is itself a major source of AI power consumption, which strengthens the commercial importance of architectures that reduce that movement.	High	SM003, SM016
CM011	d-Matrix’s DIMC thesis is that HBM-centric GPU architectures are too power-hungry and too memory-bound for the best latency economics in interactive inference.	Medium	SM003, SM014
CM012	d-Matrix’s JetStream materials argue that host-mediated communication introduces avoidable latency in multi-node inference and that device-initiated communication is a meaningful scaling advantage.	Medium	SM004, SM007, SM008
CM013	Batch size acts as a direct trade-off knob between user experience and hardware ROI in interactive inference workloads.	Medium	SM010, SM020
CM014	Public examples from generative-AI applications show that inference can become a material operating-cost burden rather than a negligible variable cost.	High	SM019, SM020
CM015	The existence of API-based inference and incumbent GPU clouds means part of the real substitute set for d-Matrix is a service choice, not only a hardware choice.	Medium	SM019, SM026, SM027
CM016	The most realistic early buyer segments for d-Matrix are enterprise or private-cloud operators, neocloud or managed-inference providers, AI product companies, and OEM or integrator channels.	Medium	SM006, SM021, SM026, SM027
CM017	Hyperscalers are strategically important but structurally harder for d-Matrix to win because they already control large GPU estates and increasingly use captive silicon.	Medium	SM018, SM028, SM029
CM018	d-Matrix’s standard PCIe and Ethernet positioning is more naturally aligned with enterprises and partner-led deployments than with fully custom hyperscale rack architectures.	Medium	SM004, SM006, SM014, SM021
CM019	On-prem or private-cloud optionality matters because peer inference providers such as Groq explicitly sell predictable-spend cloud plus on-prem deployment choices.	Medium	SM026
CM020	Integrators and OEMs are part of the buying path because production inference deployments require server qualification, networking design, and operational support in addition to accelerator cards.	High	SM003, SM014, SM021
CM021	GigaIO’s SuperNODE positioning suggests that some enterprise buyers will prefer scale-up systems that avoid complex distributed inference before they adopt full rack-scale multi-node designs.	Medium	SM021, SM014
CM022	Broad generative-AI spending forecasts are too wide to use as d-Matrix’s TAM because they include software, services, and categories far beyond merchant inference hardware.	Medium	SM028
CM023	Forbes cites a MarketsandMarkets estimate that the AI inference market will grow from $106.15 billion in 2025 to $254.98 billion by 2030.	Low	SM028
CM024	Silicon Analysts estimates the total AI accelerator market at roughly $160 billion in 2025E and $200 billion plus in 2026E, which is broader than inference-only merchant hardware.	Low	SM029
CM025	CNBC’s estimate that Bing AI required at least $4 billion of infrastructure is a useful lower-bound deployment-cost lens even though it is not a market-size estimate.	Medium	SM019
CM026	d-Matrix’s accessible SAM is narrower than either the published AI inference market or the total accelerator market because it excludes training clusters, captive hyperscaler silicon, and much API-consumed inference.	Medium	SM003, SM006, SM018, SM029
CM027	No clean public standalone SAM exists for open-standards merchant inference systems optimized for low-latency multi-node serving, so any precise public SAM or SOM would be overstated.	Medium	SM027, SM028, SM029
CM028	d-Matrix’s official materials claim up to 10-20x advantages on latency, throughput, power, or TCO versus GPU comparisons, but those results are explicitly preliminary and workload-specific.	Medium	SM002, SM003, SM014
CM029	A third-party developer-signal benchmark from Gimlet reports that using d-Matrix for speculative decoding can improve end-to-end request speed by 2-10x versus a GPU-only speculative decoder at equal energy efficiency.	Medium	SM022
CM030	Forbes reports that MLPerf Inference 5.0 results still showed Nvidia winning every submitted benchmark and that GB200 met Nvidia’s rack-scale inference claims on the cited tests.	Medium	SM028, SM025
CM031	Nvidia’s public product pages show that the incumbent GPU stack continues to improve memory capacity, bandwidth, and rack-scale inference performance from H100 to H200 to GB200.	High	SM023, SM024, SM025
CM032	The substitute set for d-Matrix now includes not only Nvidia GPUs but also Groq’s inference cloud and partner-led scale-up systems built around different accelerator assumptions.	Medium	SM021, SM023, SM024, SM025, SM026
CM033	CNBC described Nvidia as having about 95% of the market for AI chips in early 2023.	Medium	SM019
CM034	CNN cited Nvidia as controlling 84% of the market for discrete GPUs during the 2023 AI chip shortage.	Medium	SM018
CM035	Silicon Analysts estimates Nvidia at roughly 80-90% of AI accelerator revenue in 2025 and about 75% by 2026 as custom silicon broadens.	Low	SM029
CM036	The public market-share numbers disagree because they use different denominators—AI chips, discrete GPUs, or total accelerators including custom silicon—so raw share rhetoric is directionally useful but not directly comparable.	Medium	SM018, SM019, SM029
CM037	d-Matrix’s standard-form-factor story reduces some retrofit friction relative to bespoke liquid-cooled racks, but it does not remove the need to prove cluster design, memory capacity, and operating economics at production scale.	Medium	SM004, SM006, SM014, SM021, SM025
CM038	Hyperscaler custom silicon growth and Nvidia’s software moat cap the share of the broad inference market that is realistically contestable by merchant startups.	Medium	SM018, SM028, SM029
CM039	Reasoning and agentic AI increase the amount of inference-time compute buyers may need, which helps the category thesis while simultaneously increasing sensitivity to serving cost and latency.	Medium	SM011, SM012, SM028
CM040	The most investable wedge for d-Matrix is latency-sensitive, memory-bound, open-standards inference serving where buyers care about predictable TCO and do not already own enough optimized GPU capacity.	Medium	SM006, SM009, SM010, SM021, SM022, SM026
CM041	d-Matrix’s current deployment narrative is strongest in enterprise and partner-led channels because public materials emphasize OEMs, integrators, scale-up partners, and standard datacenter compatibility.	Medium	SM006, SM014, SM015, SM021
CM042	d-Matrix’s product materials explicitly frame standard servers, racks, and existing datacenter infrastructure as a feature, which aligns with buyers who want incremental deployment rather than facility redesign.	Medium	SM002, SM004, SM006
CM043	Because prefill remains compute-bound and GPUs still excel there, the strongest public third-party evidence for d-Matrix today supports heterogeneous co-deployment rather than universal GPU replacement.	Medium	SM009, SM022, SM028
CM044	The same GPU shortages and power constraints that support d-Matrix’s market narrative can also push buyers to reserve incumbent capacity or stay with existing vendors instead of adopting a startup accelerator.	Medium	SM017, SM018, SM029
CM045	The public source pack is strong enough to prove a large inference category but not strong enough to justify a precise public SOM for d-Matrix without management disclosures on deployments, cards per cluster, and workload mix.	Medium	SM015, SM028, SM029
CM046	Vendor-authored heterogeneous-stage comparisons and standardized full-system leaderboards measure different things, so the public benchmark record does not establish one universally dominant inference architecture.	Medium	SM022, SM023, SM025, SM028
CP001	D-Matrix says its Corsair, JetStream, and Aviator stack can deliver 10x faster performance, 3x lower cost, and 3-5x better energy efficiency than GPU-based systems.	Medium	SP001, SP003
CP002	Independent reporting frames the core AI inference energy problem as repeated movement of data between memory and processors.	Medium	SP004
CP003	D-Matrix’s Jayhawk chiplet platform was publicly described with 16 Gbit/s per-wire bandwidth and less than 0.5 pJ/bit energy efficiency.	Medium	SP007, SP008
CP004	Jayhawk was also presented as a modular, pre-validated chiplet architecture that can refresh compute platforms faster and accommodate third-party chiplets.	Medium	SP007, SP008
CP005	Historical H100 scarcity and silicon-interposer bottlenecks pushed the market to look for more efficient second-source inference hardware.	Medium	SP009, SP010, SP012
CP006	NVIDIA markets H100 as combining 900 GB/s NVLink, NDR InfiniBand, and Magnum IO software to scale inference across clustered systems.	Medium	SP014
CP007	NVIDIA markets H100 as up to 30x faster for inference on the largest models and bundles enterprise deployment software around the platform.	High	SP014, SP021
CP008	NVIDIA says H200 increases memory to 141 GB of HBM3e at 4.8 TB/s, which it describes as nearly double H100 capacity and 1.4x more bandwidth.	Medium	SP015
CP009	NVIDIA says H200 can deliver up to 2x H100 inference speed on Llama2-class workloads while improving energy efficiency and total cost of ownership.	Medium	SP015
CP010	NVIDIA markets GB200 NVL72 as a liquid-cooled rack with 72 Blackwell GPUs, 36 Grace CPUs, and a 130 TB/s NVLink domain.	Medium	SP016
CP011	NVIDIA markets GB200 NVL72 as delivering 30x faster real-time trillion-parameter inference and Forbes says the NVL72 matched NVIDIA’s promised ~30x gain over an 8-GPU H200 benchmark.	High	SP016, SP022
CP012	AMD markets MI300X as a 192 GB HBM3, 5.3 TB/s, eight-Infinity-Fabric-link accelerator with ROCm software support.	Medium	SP017
CP013	AMD explicitly positions MI300X around higher memory capacity and bandwidth than competing accelerators, making memory-per-GPU its clearest attack on NVIDIA and d-Matrix alternatives.	Medium	SP017
CP014	Independent benchmark coverage compares MI300X directly with H100, H200, and B200 under current LLM inference workloads, showing AMD is treated as a real production competitor.	High	SP017, SP020
CP015	ROCm gives AMD a less locked-down software position than CUDA, but framework compatibility remains a major practical selection criterion for enterprise buyers.	Medium	SP017, SP021
CP016	Groq publicly sells inference as public, private, or co-cloud service with Free, Developer, and Enterprise plans plus optional on-prem GroqRack.	High	SP018, SP021
CP017	Groq’s public packaging emphasizes low latency, predictable spend, regional endpoints, and no-batching-required operation rather than training versatility.	Medium	SP018
CP018	Independent analysis describes Groq as an inference specialist with its own compiler and runtime path, which can improve latency but still requires ecosystem switching away from standard GPU tooling.	Medium	SP021, SP023
CP019	Cerebras markets WSE-3 as a wafer-scale processor with four trillion transistors and 125 petaflops that survives defects through redundant compute and routing.	High	SP019, SP021
CP020	Independent analysis says Cerebras now competes by serving very large models on specialized systems rather than sharding those workloads across many smaller GPUs.	Medium	SP021
CP021	Independent analysis describes SambaNova’s SN50 as a full-stack platform with 1.6 PFLOPS FP16, 3.2 PFLOPS FP8, and very large model and context-window support.	Medium	SP021
CP022	SambaNova’s Intel collaboration and hardware-software co-design show that major inference challengers increasingly bundle system integration with silicon.	Medium	SP021
CP023	Independent landscape coverage places d-Matrix, Groq, and Cerebras inside the same inference-challenger set even though they attack different technical bottlenecks.	Medium	SP005, SP023
CP024	The AI inference market is growing around low-latency and efficient deployment needs, but high power demand and supply chain risk still constrain adoption.	Medium	SP004, SP025
CP025	Historical deployment cost evidence shows why inference economics matter: one startup described nearly $200,000 in monthly model-serving bills and analysts estimated ChatGPT-scale infrastructure could require 10,000 Nvidia GPUs plus costly upgrades.	Medium	SP011, SP012, SP013
CP026	D-Matrix’s lower-cost and lower-energy pitch is strategically relevant because buyers are already conditioned to think of inference as an operating-cost problem.	Medium	SP001, SP003, SP011, SP012, SP013
CP027	D-Matrix is not trying to beat GB200 on absolute rack-scale throughput; it is offering a purpose-built inference card and NIC stack optimized for memory locality in standard servers.	Medium	SP001, SP003, SP008
CP028	Relative to Groq, d-Matrix competes less on public-cloud API distribution and more on hardware insertion into enterprise or OEM infrastructure.	Medium	SP001, SP018, SP021
CP029	Relative to Cerebras and SambaNova, d-Matrix is the less integrated but potentially easier-to-insert option because it fits around cards, NICs, and partner servers instead of a giant proprietary appliance.	Medium	SP001, SP019, SP021
CP030	NVIDIA remains the default incumbent because its hardware is ubiquitous and its software plus channel ecosystem create switching costs far beyond chip specifications.	High	SP014, SP016, SP021, SP026
CP031	Intuition Labs cites NVIDIA at roughly 92% of the discrete GPU market in H1 2025 while Silicon Analysts still shows a 75-87% share band through 2026, which is adverse evidence for any challenger moat story.	Medium	SP021, SP024
CP032	D-Matrix’s core switching-cost problem is therefore software and networking, not just silicon performance, because buyers already know how to buy, cool, connect, and operate incumbent GPU clusters.	Medium	SP014, SP016, SP021
CP033	AMD pressures NVIDIA on memory-per-GPU and open software posture, but it still competes inside the same OEM server procurement motion rather than changing the buyer workflow as sharply as Groq or Cerebras.	Medium	SP017, SP020, SP021
CP034	Groq has a clearer public story than d-Matrix for regulated or air-gapped deployments because it explicitly advertises GroqRack and cloud-to-local continuity.	Medium	SP018, SP021
CP035	Internal build is a credible substitute because large buyers increasingly want inference hardware that fits existing ML pipelines and can be co-designed around proprietary workloads.	Medium	SP021, SP025
CP036	D-Matrix’s strongest moat candidate is technical fit to the memory bottleneck, because both independent reporting and its chiplet disclosures link inference efficiency to keeping compute close to memory.	Medium	SP004, SP007, SP008
CP037	A second moat candidate is modularity, because Jayhawk and the broader card-plus-NIC architecture imply d-Matrix can partner into heterogeneous systems rather than forcing a clean-sheet deployment.	Medium	SP001, SP007, SP008
CP038	Adverse evidence is material because d-Matrix still lacks a public MLCommons-style apples-to-apples benchmark against H100, H200, MI300X, Groq, or Cerebras.	Medium	SP001, SP003, SP020, SP022
CP039	Without standardized public benchmarking, buyers still have to underwrite d-Matrix’s cost and performance claims primarily through vendor-authored comparisons.	Medium	SP001, SP003, SP022
CP040	Overall moat durability looks moderate rather than strong because d-Matrix is differentiated on inference-specific memory locality, but NVIDIA and AMD own broader ecosystems, Groq has cleaner service packaging, and Cerebras plus SambaNova sell more integrated systems.	Medium	SP021, SP023, SP024
CI001	D-Matrix publicly frames its monetizable platform as Corsair accelerators, JetStream NICs, and Aviator software.	Medium	SI001, SI022
CI002	D-Matrix disclosed a $275 million Series C at a $2 billion valuation, bringing total disclosed funding to $450 million.	High	SI001, SI007, SI008, SI009
CI003	Series C proceeds were earmarked for roadmap execution, global expansion, and multiple large-scale deployments.	High	SI001, SI008
CI004	D-Matrix disclosed a $110 million Series B in 2023 to begin commercializing Corsair after prior chiplet launches.	Medium	SI004
CI005	The Series B announcement said the company planned to use proceeds for recruitment and commercialization to meet demand for lower-cost inference infrastructure.	Medium	SI004
CI006	Public financing materials identify hyperscale, enterprise, and sovereign customers as the intended buying base.	Medium	SI001, SI008
CI007	Corsair was sampling to early-access customers in November 2024 and was targeted for broad availability in Q2 2025.	Medium	SI003
CI008	JetStream samples were available in September 2025 with full production expected by year-end 2025.	Medium	SI023, SI006
CI009	The combined Gimlet Cloud offering was targeted for select-customer availability in the second half of 2026.	Medium	SI026
CI010	Official materials say d-Matrix is collaborating with OEMs and system integrators to qualify Corsair-based solutions.	Medium	SI003, SI005
CI011	Public partner materials name Supermicro, Arista, Broadcom, GigaIO, and Liqid as deployment or integration partners around the d-Matrix stack.	Medium	SI021, SI022
CI012	The d-Matrix ecosystem page says partner logos are illustrative and not actual customers.	Medium	SI021
CI013	JetStream is designed as a standard PCIe card that connects to off-the-shelf top-of-rack Ethernet switches.	Medium	SI006, SI022
CI014	JetStream's published specification includes 400 Gbps bandwidth, PCIe Gen5 x16, and 150W max TDP with transceivers.	Medium	SI006
CI015	The Corsair white paper says an 8-card inference server can provide up to 2 TB of capacity memory.	Medium	SI005
CI016	The same white paper says an 8-card inference server provides 16 GB of performance memory and 1,200 TB/s of performance-memory bandwidth.	Medium	SI005
CI017	Official and partner materials repeatedly claim up to 10x faster speed, 3x better cost-performance, and 3x to 5x better energy efficiency versus GPU-based systems.	Medium	SI001, SI003, SI006, SI018
CI018	Both the Corsair white paper and the JetStream brief say performance, cost, and power estimates are preliminary and subject to change.	High	SI005, SI006
CI019	GigaIO says its SuperNODE can support dozens or 64+ Corsair accelerators in a single node and simplify deployment relative to multi-node setups.	Medium	SI018, SI021
CI020	SquadRack publicly combines Corsair, JetStream, Supermicro servers, Broadcom switches, Arista Ethernet, and Aviator software in one rack-scale reference architecture.	Medium	SI022
CI021	D-Matrix says scaling beyond SRAM into 3D-stacked DRAM is necessary to support larger reasoning models and higher token consumption.	Medium	SI025, SI027
CI022	The Alchip collaboration makes advanced ASIC design, packaging, and manufacturing management explicit dependencies of the next-generation roadmap.	Medium	SI025
CI023	D-Matrix's public workload story complements GPUs in heterogeneous inference stacks rather than replacing GPU training or all inference phases outright.	Medium	SI026, SI028, SI029
CI024	Gimlet reported that running the speculative decoder on Corsair improved end-to-end request latency by 2x to 10x at equivalent energy efficiency.	Medium	SI019
CI025	Gimlet reported that a 1.6B speculative decoder fits on two Corsair cards and that Corsair is air-cooled and rack-compatible.	Medium	SI019
CI026	The Microsoft Project Bonsai partnership provided early compiler and toolchain validation before broad commercial availability.	Medium	SI012
CI027	CNBC reported that Latitude spent nearly $200,000 per month on OpenAI and AWS at peak usage, showing how inference costs can swamp a small startup.	Medium	SI015
CI028	CNBC cited analyst estimates that OpenAI could spend about $40 million for one month of ChatGPT inference and Bing AI would need at least $4 billion of infrastructure.	Medium	SI015
CI029	The Washington Post reported that leading AI chatbots lose money on every chat because operating costs are so high.	Medium	SI017
CI030	TechTarget reported that ChatGPT-class deployments require not just GPUs but meaningful networking and power-management upgrades.	Medium	SI016
CI031	VentureBeat reported that the perceived GPU shortage often reflects board-level component and systems bottlenecks rather than a lack of GPU dies alone.	Medium	SI013
CI032	CNN reported that AI chip shortages are shaped by supply-chain bottlenecks such as advanced packaging and interposer constraints.	Medium	SI014
CI033	Bloomberg explained that repeatedly moving data between memory and processors is a major electricity cost in AI systems.	Medium	SI011
CI034	The Corsair white paper says chiplets improve yields and lower costs while moving the architectural problem toward interconnect design.	Medium	SI005
CI035	AIMultiple reports that cloud GPU list prices for the same model can differ several times over across providers.	Medium	SI020
CI036	No public Corsair or JetStream price sheet appears in the reviewed sources, so list price, discounting, and realized ASP remain undisclosed.	Medium	SI003, SI006, SI021, SI023
CI037	Reviewed official, investor, and independent materials disclose funding and deployment plans but no revenue or ARR figure.	Medium	SI001, SI007, SI008, SI009
CI038	Reviewed sources do not disclose gross margin, contribution margin, or gross profit dollars for d-Matrix.	Medium	SI001, SI003, SI005, SI006
CI039	No reviewed public source discloses d-Matrix cash on hand, monthly burn, or runway months.	Medium	SI001, SI004, SI007, SI008
CI040	No reviewed public source discloses debt facilities or project-finance obligations for d-Matrix.	Medium	SI001, SI004, SI007, SI008
CI041	Public customer evidence remains partner and benchmark oriented; the source pack does not provide a broad named-customer roster or booked-customer count.	Medium	SI001, SI019, SI021, SI026
CI042	GFM Review argues that startups like d-Matrix still need substantial capital investment, brand building, and proof at scale to challenge Nvidia's ecosystem.	Low	SI031
CI043	NVIDIA's 2026 annual report says competitive data-center AI platforms are sold as co-designed chips, networking, systems, software, and paid enterprise software licenses.	Medium	SI030
CI044	NVIDIA's 2026 annual report says the company may place non-cancellable component orders, pay premiums or deposits, and absorb inventory provisions when demand shifts.	Medium	SI030
CI045	NVIDIA's filing defines cost of revenue to include wafers, packaging, board and device costs, memory, shipping, warranty, and inventory charges.	Medium	SI030
CI046	NVIDIA reported fiscal 2026 capex of $6.1 billion and said gross margin fell to 71.1% partly because full-scale datacenter solutions and a $4.5 billion excess-inventory charge weighed on results.	Medium	SI030
CI047	D-Matrix's public financial case is strongest as a customer-economics thesis rather than a publicly proven d-Matrix margin profile.	Medium	SI017, SI018, SI019, SI030
CI048	With $450 million of disclosed financing but no public cash, burn, or revenue data, d-Matrix still appears financing-dependent for roadmap execution and deployment scaling.	Medium	SI001, SI004, SI007, SI008
CI049	The go-to-market motion appears account-driven and qualification-heavy because official materials emphasize sampling, OEM qualification, and select-customer deployment windows.	Medium	SI003, SI005, SI026
CI050	Because JetStream adds dedicated networking hardware, power draw, and switch infrastructure on top of accelerator cards, d-Matrix's TCO case depends on whole-cluster design rather than chip benchmarks alone.	Medium	SI006, SI022, SI029
CI051	In a 2024 retrospective, management said d-Matrix came within two weeks of running out of cash in 2020 before reframing its fundraising strategy, showing capital availability has been existential since the company's earliest phase.	Medium	SI032
CE001	d-Matrix's current product surface consists of Corsair accelerators, JetStream networking, Aviator software, and the SquadRack reference rack architecture.	High	SE003, SE005, SE006, SE023
CE002	Corsair is packaged as an industry-standard PCIe Gen5 full-height full-length card and is sold in single-card and dual-card configurations for standard datacenter servers.	High	SE003, SE007, SE009
CE003	A single Corsair card is specified with 2 GB of on-chip performance memory, up to 256 GB of off-chip capacity memory, 150 TB/s memory bandwidth, and 2400 TFLOPs of MXINT8 peak compute.	High	SE007, SE009
CE004	A dual-card Corsair configuration is specified with 4 GB of performance memory, 300 TB/s of bandwidth, up to 512 GB of capacity memory, and cross-card DMX Bridge connectivity.	High	SE003, SE009
CE005	The published rack configuration uses eight servers and 64 Corsair cards to reach 128 GB of performance memory and 9.6 PB/s aggregate bandwidth.	High	SE003, SE009
CE006	d-Matrix built Corsair around DIMC because it views generative inference as memory-bound during token generation rather than only compute-bound.	High	SE007, SE009, SE016
CE007	Within Corsair, DMX Link provides die-to-die connectivity inside the package and DMX Bridge extends that connectivity across two cards.	High	SE007, SE009
CE008	Aviator's build flow comprises Model Factory, Compressor, and Compiler, while execution runs through Inference Engine and Host Runtime.	High	SE003, SE009, SE021
CE009	Aviator integrates with PyTorch, MLIR, Triton DSL, Kubernetes device plugins, metrics export, debugging, and profiling workflows.	Medium	SE009
CE010	JetStream is a transparent NIC that extends device-initiated accelerator-to-accelerator communication across nodes by bypassing host-orchestrated transfers.	High	SE010, SE014, SE015
CE011	JetStream is specified as a PCIe Gen5 x16 Ethernet card with 400 Gbps maximum bandwidth, QSFP-DD optics, 150 W max TDP, and secure boot support.	High	SE006, SE010
CE012	d-Matrix claims that JetStream combined with Corsair and Aviator can scale beyond 100B-parameter models and deliver up to 10x speed, 3x cost-performance, and 3x energy-efficiency gains versus GPU-based alternatives.	Medium	SE006, SE010
CE013	SquadRack is an eight-node rack blueprint that combines Corsair accelerators, JetStream I/O, Supermicro servers, Broadcom PCIe switches, Arista leaf switches, and Aviator software.	High	SE005, SE013
CE014	SquadRack is designed to remain air-cooled and configurable by rack power or rack-height constraints rather than requiring special liquid-cooling infrastructure.	Medium	SE013
CE015	GigaIO's SuperNODE integration is positioned to host dozens of Corsair accelerators in one node and publicly cites 30,000 tokens per second at 2 milliseconds per token for Llama3 70B.	Medium	SE023, SE026
CE016	Gimlet Labs reported that using Corsair for a speculative-decoding draft model delivered 2-5x end-to-end request speedups at equal energy relative to a GPU-only draft-model configuration.	Medium	SE027
CE017	d-Matrix argues that prefill is compute-bound while decode is memory-bound, so heterogeneous pipelines should mix GPUs with memory-centric accelerators instead of forcing every phase onto one device class.	Medium	SE016, SE017, SE018
CE018	In d-Matrix's published deployment model, speculative decoding lets Corsair run the memory-bound speculator stage while a larger GPU-resident model performs verification.	Medium	SE017, SE027
CE019	Keyformer is d-Matrix's published KV-cache sparsification technique and its example reports baseline accuracy with 50% of prompt KV cache plus 2.1x latency improvement and up to 2.4x token-throughput improvement.	Medium	SE020
CE020	dmx.compressor is a torch.fx-based quantization toolkit for PTQ and QAT that is meant to feed compressed graphs into Aviator Compiler.	Medium	SE021, SE009
CE021	Before Corsair, d-Matrix publicly described Nighthawk and then Jayhawk as chiplet platforms for in-memory inference, with Jayhawk introduced in 2023.	Medium	SE024, SE025
CE022	Jayhawk was presented as a modular chiplet platform on TSMC 6nm with 16 Gbps per wire bandwidth and less than 0.5 pJ/bit energy efficiency over organic substrates.	Medium	SE024, SE025
CE023	d-Matrix says 3DIMC has been validated on Pavehawk test silicon and is planned to debut commercially in the Raptor accelerator, the successor to Corsair.	Medium	SE008, SE011
CE024	d-Matrix claims that the future Raptor 3DIMC product will use 3D-stacked DRAM and deliver up to 10x faster inference than HBM4-based solutions.	Medium	SE008, SE011
CE025	The strategic purpose of 3DIMC is to extend d-Matrix's low-latency SRAM-centric architecture to larger-memory models and more demanding agentic pipelines.	Medium	SE011, SE012
CE026	d-Matrix's deployment model depends on partner qualification across server, switch, and rack layers rather than shipping a fully closed vertical appliance on its own.	Medium	SE004, SE007, SE013, SE026
CE027	The 3DIMC roadmap depends specifically on Alchip for ASIC and packaging expertise, making manufacturing execution partly external to d-Matrix.	Medium	SE008
CE028	The earlier Jayhawk platform also depended on TSMC process technology and BoW or UCIe-style chiplet ecosystems, reinforcing foundry and packaging exposure.	Medium	SE024, SE025
CE029	d-Matrix's current deployment story is intentionally open-standards based, using PCIe cards, standard Ethernet, and off-the-shelf top-of-rack switches rather than proprietary interconnect domains.	High	SE003, SE010, SE013
CE030	That open approach should lower retrofit friction in existing datacenters, but it also makes system quality dependent on partners such as Supermicro, Arista, Broadcom, GigaIO, and Liqid.	Medium	SE004, SE005, SE013
CE031	The incumbent GPU roadmap emphasizes HBM, NVLink, and increasingly liquid-cooled rack-scale systems, which differs materially from d-Matrix's PCIe-plus-Ethernet design center.	Medium	SE028, SE029, SE030
CE032	NVIDIA publishes H100 with 188 GB HBM3, H200 with 141 GB HBM3e and 4.8 TB/s bandwidth, and GB200 NVL72 as a liquid-cooled 72-GPU rack with 130 TB/s rack communication.	Medium	SE028, SE029, SE030
CE033	Independent market commentary says enterprise buyers increasingly want specialized inference hardware for data control, latency, and predictable cost rather than only general-purpose GPUs.	Medium	SE031, SE032
CE034	Independent market commentary also says NVIDIA still dominates accelerator share and packaging access, which raises the bar for any challenger that depends on external foundry and packaging capacity.	Medium	SE032, SE034
CE035	d-Matrix has at least one external practitioner validation signal because Gimlet publicly benchmarked Corsair in a speculative-decoding workflow.	Medium	SE027
CE036	The strongest public proof of deployment maturity is still at the reference-architecture and partner-demo layer rather than in named production customer case studies.	Medium	SE004, SE005, SE023, SE026
CE037	JetStream's published specification lists secure boot support, which is the clearest explicit product-security control surfaced in the reviewed corpus.	Medium	SE010
CE038	The reviewed official homepage, about, technology, product, ecosystem, and news surfaces do not publish SOC 2, ISO 27001, FedRAMP, or a public trust-center or status-page reference.	Medium	SE001, SE002, SE003, SE004, SE005
CE039	The reviewed corpus does not provide public MTBF, uptime, RMA, or field-failure metrics for Corsair or JetStream.	Medium	SE001, SE003, SE005, SE009, SE010
CE040	Company performance claims for Corsair, JetStream, and 3DIMC remain either preliminary or self-reported, so the independent benchmark base is still narrow.	Medium	SE009, SE010, SE027, SE033
CE041	The Raptor and Pavehawk roadmap has public architecture claims but no independently verified production silicon, customer deployments, or broad benchmark suite in this chapter corpus as of runDate.	Medium	SE008, SE011, SE023, SE033
CE042	d-Matrix's public developer surface is improving because Aviator builds on open-source tools and dmx.compressor points users to a GitHub repo, but the ecosystem still looks thinner than NVIDIA's enterprise software stack.	Low	SE009, SE021, SE028
CE043	d-Matrix positions its hardware for reasoning, agents, and video-generation inference workloads where token growth and KV-cache pressure amplify memory bottlenecks.	Medium	SE007, SE012, SE019
CE044	The hybrid SRAM-plus-DRAM roadmap gives d-Matrix a technical middle ground between tiny SRAM-only deployments and massive HBM-heavy GPU clusters.	Medium	SE011, SE012, SE029
CE045	JetStream carries accelerator-to-accelerator traffic over standard Ethernet and standard optics, so rack networking depends on conventional datacenter switching rather than a bespoke fabric.	High	SE010, SE014
CE046	Aviator's decoupled host and on-card runtime is intended to reduce launch latency by letting the host enqueue work independently of the card's current execution state.	Medium	SE009
CE047	d-Matrix says it built JetStream because traditional RDMA NICs would not satisfy the latency target or preserve decoupled execution across nodes.	Medium	SE010, SE014
CE048	d-Matrix explicitly pitches SquadRack to cloud providers, sovereign clouds, and enterprises that want high-performance inference without redesigning the entire datacenter around proprietary infrastructure.	Medium	SE005, SE013
CE049	d-Matrix's recent technical framing argues that rising context lengths make inference a context-management problem as much as a compute problem, increasing pressure on memory capacity, KV-cache efficiency, and data movement.	Medium	SE020, SE036, SE041
CE050	d-Matrix argues that agentic and reasoning workloads magnify latency sensitivity because chained tool use, long contexts, and verifier steps make per-token delay matter more than peak batch throughput.	Medium	SE037, SE041
CE051	d-Matrix explicitly frames open standards as a prerequisite for broader AI adoption, reinforcing its use of PCIe, Ethernet, and standard datacenter components instead of proprietary rack domains.	Medium	SE013, SE038
CE052	d-Matrix's recent product messaging argues that competitive inference now depends on optimizing every layer from model compression and runtime to networking and infrastructure, not just the accelerator die.	Medium	SE022, SE039
CE053	d-Matrix published its own announcement of the GigaIO partnership, showing that the SuperNODE scale-up design is part of the company-authored deployment strategy rather than only a partner press claim.	Medium	SE026, SE035
CE054	Alchip's own announcement corroborates that the 3DIMC roadmap depends on external ASIC and advanced-packaging collaboration, reinforcing packaging execution as a real dependency rather than just company positioning.	Medium	SE008, SE040
CE055	d-Matrix's recent reasoning-at-enterprise-scale messaging treats context management, low latency, software-hardware co-design, and open deployment as one system recipe, which is consistent with but does not independently validate the broader product thesis.	Medium	SE037, SE039, SE041
CU001	d-Matrix says the Series C capital is meant to support hyperscale, enterprise, and sovereign customers.	High	SU002, SU016
CU002	SquadRack materials explicitly target cloud providers, sovereign clouds, and enterprises that need low-latency inference capacity.	High	SU003, SU030
CU003	The public sales narrative targets infrastructure buyers who want low-latency inference inside standard PCIe servers and Ethernet-based datacenters rather than bespoke GPU racks.	Medium	SU009, SU010, SU023
CU004	GigaIO frames the first scale-up commercialization path as enterprise deployment inside a single SuperNODE rather than as a named hyperscale production cluster.	Medium	SU007, SU020, SU029
CU005	Gimlet frames the joint offer around model providers, frontier AI labs, and AI-native cloud workloads that benefit from heterogeneous inference infrastructure.	Medium	SU008, SU021, SU027
CU006	d-Matrix says customer conversations with large cloud service providers helped shape its inference-first product thesis.	Medium	SU013
CU007	The disclosed proof base fits enterprise, sovereign, and specialist-cloud operators more clearly than it fits named hyperscale production accounts.	Medium	SU002, SU022, SU023, SU027
CU008	Corsair was being sampled to early-access customers in November 2024 and was targeted for broad availability in the second quarter of 2025.	High	SU006, SU028
CU009	JetStream samples were available by September 2025 with full production expected by the end of that year.	Medium	SU004
CU010	SquadRack configurations were slated to be available for purchase through Supermicro in the first quarter of 2026.	High	SU003, SU030
CU011	Series C materials say the new capital will support multiple large-scale deployments, but they do not identify those deployments by customer name.	Medium	SU002, SU016, SU017
CU012	The ecosystem page's Supermicro, GigaIO, and Liqid statements are partner endorsements rather than disclosed end-customer case studies.	Medium	SU001, SU006
CU013	The d-Matrix ecosystem page explicitly says the showcased partner logos are not actual customers.	Medium	SU001
CU014	GigaIO public materials include an early-access call to action for SuperNODEs running Corsair, indicating an evaluation funnel rather than a disclosed installed base.	Medium	SU020, SU029
CU015	The d-Matrix and Gimlet combined solution was planned for select customers in the second half of 2026, so it remains pre-broad-availability at the run date.	High	SU008, SU026, SU027
CU016	Gimlet is a named external operator planning to deploy Corsair alongside GPUs in Gimlet Cloud.	High	SU008, SU027
CU017	Gimlet's technical writeup shows a concrete evaluated workload using gpt-oss-120b with a 1.6B speculative decoder on Corsair and GPU.	Medium	SU021
CU018	Gimlet says two Corsair cards can host the speculative decoder and that the cards are air-cooled and rack-compatible, which lowers pilot friction inside existing datacenters.	Medium	SU021
CU019	GigaIO says a single SuperNODE can support dozens of Corsair accelerators, making scale-up inside one node the first enterprise expansion path before broader scale-out.	Medium	SU020, SU029
CU020	SquadRack extends the expansion path from one node to an eight-server rack and then to hundreds of nodes across multiple racks.	High	SU003, SU030
CU021	d-Matrix repeatedly frames commercialization as land-and-expand inside existing datacenters using standard PCIe servers and standard Ethernet networking.	Medium	SU009, SU010, SU030
CU022	Existing-datacenter compatibility, air cooling, and standards-based networking are central to the company's appeal for enterprise and sovereign buyers with infrastructure constraints.	Medium	SU008, SU009, SU027
CU023	Public commercialization depends on OEM and integrator partners including Supermicro, GigaIO, Arista, Broadcom, and Liqid rather than on a closed turnkey appliance sold only by d-Matrix.	Medium	SU001, SU003, SU023
CU024	That partner-led route can widen reach but also makes expansion sensitive to OEM qualification, support handoffs, and channel execution.	Medium	SU023, SU027, SU030
CU025	Enterprise private-inference buyers often choose on-premises or near-edge deployments for data control, latency, and predictable cost, which matches d-Matrix's positioning.	Medium	SU023
CU026	Major system integrators package accelerators into turnkey servers, so d-Matrix's OEM relationships are a core part of customer conversion rather than a side channel.	Medium	SU023, SU030
CU027	AIMultiple's 2026 cloud-provider tiering suggests that d-Matrix's disclosed proof today speaks more clearly to specialist AI clouds and enterprise operators than to named hyperscalers.	Medium	SU022, SU027
CU028	No reviewed public source discloses d-Matrix's current active customer count by account or by segment as of 2026-05-26.	Medium	SU002, SU016, SU017, SU019
CU029	No reviewed public source discloses d-Matrix's net revenue retention.	Medium	SU002, SU014, SU016
CU030	No reviewed public source discloses gross retention, churn, or renewal rates for d-Matrix deployments.	Medium	SU002, SU016, SU017
CU031	No reviewed public source discloses average contract length, backlog conversion, or repeat-order cadence for d-Matrix customers.	Medium	SU002, SU016, SU019
CU032	No reviewed public source discloses top-customer concentration, channel mix, or revenue contribution by partner versus end customer.	Medium	SU002, SU015, SU016
CU033	Customer evidence is materially fresher on partner integrations and deployment architecture than on paid production-account counts or cohort durability.	Medium	SU015, SU016, SU027, SU030
CU034	GFM Review argues that challengers like d-Matrix must persuade customers to adopt new technologies that may require significant changes to existing workflows, and the GroqCloud and Cerebras product pages show that buyers can compare d-Matrix against other non-Nvidia inference alternatives during that evaluation.	Medium	SU025, SU034, SU035
CU035	GFM Review also argues that widespread adoption depends on proving consistent, reliable performance at scale against Nvidia's trusted ecosystem.	Medium	SU025
CU036	IntuitionLabs says Nvidia still dominates enterprise inference hardware while OEMs package alternative accelerators into deployable systems, and AMD's MI350 page separately markets drop-in compatibility plus Kubernetes-friendly deployment, underscoring how much deployment polish d-Matrix must match to reduce procurement inertia.	Medium	SU023, SU033
CU037	Forbes argues Nvidia keeps lowering deployment friction through software, solutions, and datacenter-scale offerings, which raises the bar for d-Matrix customer adoption.	Medium	SU024
CU038	Data Center Knowledge says the value of the Gimlet approach depends on an abstraction layer that lets developers use heterogeneous chips without rewriting code.	Medium	SU027
CU039	Data Center Knowledge reports that initial Gimlet targets include frontier AI labs and that cloud and enterprise deployment at scale remain the strategic goal.	Medium	SU027
CU040	d-Matrix's Series C release claims rapid customer growth, but the absence of counts, cohort metrics, or named production accounts means that growth cannot be underwritten quantitatively.	Medium	SU002, SU014, SU016
CU041	Supermicro purchase availability in Q1 2026 shifts proof from reference architecture toward channel transactability, but still not toward a disclosed installed-base metric.	Medium	SU003, SU030
CU042	GigaIO calls the 2025 announcement the next phase of a strategic partnership, suggesting d-Matrix moved from component collaboration toward a more integrated system sales story.	Medium	SU020, SU029
CU043	Supermicro, GigaIO, and Liqid all pitch TCO, deployment speed, or inference flexibility, but none of the public statements quantify live end-customer outcomes by logo.	Medium	SU001, SU003, SU029
CU044	Both the Gimlet partnership page and GigaIO's AIwire press carry explicit interest or early-access calls to action, indicating that the funnel still includes invitation-driven evaluation motions.	Medium	SU026, SU029
CU045	The central customer diligence issue is the gap between credible partner validation and the still-undisclosed number of production customers, retention, and concentration metrics.	Medium	SU001, SU002, SU027, SU029
CR001	JetStream was announced with sample availability and a year-end production target, so public proof still sits at an early commercialization stage rather than a mature installed base.	Medium	SR002
CR002	SquadRack is presented as a reference blueprint built with Arista, Broadcom, and Supermicro rather than a closed appliance, so deployment readiness depends on partner qualification.	Medium	SR001
CR003	d-Matrix says it built a transparent NIC because standard networking approaches would not hit the latency targets needed for distributed inference.	High	SR005, SR006
CR004	JetStream adds accelerator-to-accelerator transfers, dedicated switching paths, and standard-Ethernet coordination that increase integration complexity across NIC, switch, server, and software layers.	High	SR002, SR005, SR006
CR005	The 3DIMC roadmap is still pre-commercial because Pavehawk is described as lab validated and Raptor as the planned first product use rather than a shipping platform.	High	SR003, SR004
CR006	The going-vertical post describes Pavehawk as d-Matrix’s first crack at the next memory problem and emphasizes lab stress testing instead of customer deployment or production-yield evidence.	Medium	SR004
CR007	d-Matrix’s five-year retrospective says the company pivoted from analog IMC to digital IMC and later from early transformer workloads to generative inference, showing technical flexibility but also roadmap volatility.	Medium	SR026
CR008	d-Matrix argues that reasoning-heavy workloads after the DeepSeek moment increase inference-time compute and memory pressure, which makes the thesis more relevant but also forces the roadmap to keep pace with rapidly changing model behavior.	Medium	SR008
CR009	eeNews Europe reported Jayhawk as a second-generation chiplet platform tied to TSMC 6nm and open chiplet interfaces, reinforcing that packaging and interconnect execution have long been core dependencies.	Medium	SR012
CR010	SiliconANGLE reported that a future Raptor generation is expected to move from 6nm to 4nm, which would raise qualification and supply-chain execution sensitivity if timing slips.	Medium	SR010
CR011	CNN reported that silicon interposers and advanced packaging are major AI-chip bottlenecks, suggesting smaller accelerator vendors can face supply risk even when end demand is healthy.	Medium	SR014
CR012	VentureBeat reported acute Nvidia GPU shortages during the AI boom, showing how upstream component shortages can distort pricing and lead times across adjacent inference ecosystems.	Medium	SR013
CR013	SquadRack and the GigaIO partnership show that d-Matrix depends on an external ecosystem of servers, switches, interconnects, and scale-up platforms rather than a vertically integrated manufacturing stack.	High	SR001, SR024
CR014	The Gimlet partnership describes select-customer availability in 2H 2026, which is encouraging external validation but still early compared with broad production deployment evidence.	High	SR025, SR031
CR015	d-Matrix’s public materials still emphasize reference architectures, early access, and partner launches more than a named production customer roster, leaving adoption depth under-evidenced.	Medium	SR001, SR025, SR031
CR016	TechTarget argued that enterprises still need significant compute, networking, and power investment for generative AI, implying sales cycles can stall even when accelerator efficiency is attractive on paper.	Medium	SR015
CR017	The Washington Post argued that chatbot providers were losing money on every chat in the early LLM buildout, so customer unit economics must improve for buyers to accept switching and integration risk.	Medium	SR016
CR018	d-Matrix says Corsair ships as an air-cooled standard PCIe card, which lowers retrofit friction but does not itself prove field reliability or supportability at scale.	High	SR025, SR031
CR019	The careers page shows d-Matrix operating across Santa Clara, Sydney, Bengaluru, Toronto, and Belgrade, broadening talent access but increasing coordination demands for a hardware-software platform.	Medium	SR030
CR020	The retrospective says d-Matrix nearly ran out of cash in 2020 before raising follow-on funding, showing that the company has already experienced financing fragility during a long hardware gestation period.	Medium	SR026
CR021	Data Center Dynamics and SiliconANGLE reported that d-Matrix raised $275 million in late 2025 at a $2 billion valuation, bringing total funding to roughly $450 million before broad commercialization.	Medium	SR009, SR010
CR022	The same financing coverage says the new capital is meant to support commercialization, global expansion, and large-scale deployments, which implies continued capital intensity ahead of mature revenue proof.	Medium	SR009, SR010
CR023	BIS said in October 2023 that the United States was strengthening export controls on advanced computing semiconductors and circumvention pathways to countries of concern, making trade policy an active risk variable for advanced AI chip supply chains.	Medium	SR029
CR024	d-Matrix’s privacy policy is a website data-handling disclosure that mentions reasonable safeguards and compliance-by-law, but it is not a published product-security assurance package for datacenter buyers.	Medium	SR027
CR025	d-Matrix’s terms of use include broad warranty disclaimers for the site, while the reviewed public corpus did not surface a dedicated trust center, SOC 2, ISO 27001, or similar enterprise-control page for the product stack.	Medium	SR027, SR028
CR026	Intuition Labs says private inference buyers care about data control, predictable cost, and deployment support, so public compliance and security gaps can directly slow adoption in regulated or sovereign environments.	Medium	SR022
CR027	Groq publicly markets zero-data-retention and on-prem air-gapped options, and NVIDIA markets enterprise security features, which raises the disclosure bar for d-Matrix on security posture.	Medium	SR020, SR017
CR028	NVIDIA H200 increases incumbent memory and bandwidth for inference, and GB200 pushes the competitive bar to rack-scale systems, so d-Matrix competes against a moving full-stack target rather than one static GPU baseline.	High	SR017, SR018
CR029	AMD MI300X offers 192GB of HBM3 and 5.3 TB/s bandwidth, blunting the argument that memory-hungry inference buyers have no mainstream alternatives to d-Matrix.	Medium	SR019
CR030	Groq and Cerebras both market purpose-built inference platforms with public deployment narratives, so d-Matrix is competing inside a crowded specialist category rather than a greenfield niche.	Medium	SR020, SR021
CR031	Intuition Labs describes switching costs as stack-wide across software, networking, cooling, and operations, which means d-Matrix must overcome ecosystem lock-in as well as raw chip performance.	Medium	SR022
CR032	Silicon Analysts estimates NVIDIA keeps dominant AI accelerator share through 2026, implying challengers like d-Matrix still fight for a relatively narrow share-of-wallet unless they win very specific workloads or power-constrained segments.	Medium	SR023
CR033	Bloomberg described the AI energy crisis as a data-movement problem, which validates d-Matrix’s architecture thesis but also means the company must prove rack-level efficiency rather than only chip-level superiority.	High	SR011, SR007
CR034	d-Matrix’s own disaggregation materials argue that prefill, decode, memory, and networking must be co-optimized, which means system-level overhead can erode theoretical efficiency gains if integration is poor.	High	SR006, SR007
CR035	SquadRack, the Gimlet materials, and d-Matrix’s brownfield-retrofit blog all stress air-cooled standard-server deployment and reuse of spare data-center capacity, so a key diligence question is whether those benefits persist once networking, orchestration, and support overhead are included in real customer environments.	High	SR001, SR025, SR036
CR036	The highest residual execution risk is synchronized delivery across Corsair, JetStream, Aviator, OEM partners, and the Raptor roadmap because no single layer is sufficient on its own to win production deployments.	High	SR001, SR002, SR003, SR024
CR037	A practical thesis-break trigger is a material slip in JetStream or Raptor readiness relative to customer procurement cycles because current proof is still early and partner-led.	High	SR002, SR003, SR025
CR038	A second thesis-break trigger is failure to convert early partner and reference deployments into named production customers by 2026 or 2027, which would leave the $2 billion valuation resting mostly on architecture promise.	Medium	SR009, SR010, SR031
CR039	A third thesis-break trigger is evidence that incumbent GPU or specialist inference alternatives close the latency-per-watt gap without d-Matrix’s integration burden, removing the switching rationale.	Medium	SR017, SR018, SR020, SR021, SR022
CR040	A fourth thesis-break trigger is a material export-control or advanced-packaging disruption that stalls next-generation product availability or worsens unit economics.	Medium	SR029, SR014, SR013
CR041	Public risk mitigation today is more mature around architectural positioning and partner announcements than around disclosed reliability metrics, compliance credentials, or customer concentration data.	Medium	SR001, SR002, SR025, SR027, SR028
CR042	No public MTBF, uptime, or field-failure metrics were identified in the reviewed risk corpus, leaving durability under production conditions unresolved.	Low	SR001, SR002, SR027, SR028
CR043	No public named production customer list or deployment count was surfaced in the reviewed risk corpus beyond partner, reference, and early-access announcements.	Medium	SR001, SR025, SR031
CR044	The 2026 careers footprint and global expansion narrative suggest d-Matrix is scaling organizationally before large public customer proof, which can raise burn and program-management pressure if bookings lag.	Medium	SR009, SR026, SR030
CR045	d-Matrix’s risk profile is dominated not by one binary technical flaw but by compounding dependencies across roadmap maturity, ecosystem integration, customer proof, compliance disclosure, and financing discipline.	High	SR001, SR003, SR009, SR022, SR029
CV001	The latest disclosed d-Matrix financing was a $275 million Series C at a $2 billion valuation, bringing total disclosed funding to $450 million.	High	SV001, SV002, SV003, SV004
CV002	The 2025 round was described as oversubscribed and co-led by Bullhound Capital, Triatomic Capital, and Temasek, with QIA, EDBI, M12, and other investors participating.	High	SV001, SV002, SV003, SV004
CV003	d-Matrix said the Series C proceeds would advance the roadmap, accelerate global expansion, and support large-scale deployments for hyperscale, enterprise, and sovereign customers.	Medium	SV001, SV002, SV003, SV004
CV004	The Series C key facts disclosed 250+ employees and offices in Toronto, Sydney, Bangalore, and Belgrade in addition to the Santa Clara headquarters.	Medium	SV001
CV005	The Series C materials described rapid customer growth but did not disclose revenue, ARR, or customer count.	Medium	SV001, SV002, SV003, SV004
CV006	The reviewed public materials also do not disclose gross margin, retention, free cash flow, or cap-table preference terms for d-Matrix.	Medium	SV001, SV008, SV009, SV010, SV011, SV012
CV007	d-Matrix's 2023 Series B raised $110 million to begin commercializing Corsair after earlier chiplet launches and a lower-TCO inference pitch.	Medium	SV008
CV008	The founder's retrospective says d-Matrix first raised a $40 million Series A after pivoting toward digital in-memory compute and later refocused Corsair around generative AI inference.	Medium	SV010
CV009	Official and partner GigaIO materials say the combined SuperNODE plus Corsair system targets around 30,000 tokens per second at 2 milliseconds per token while lowering TCO and power versus GPU-heavy approaches.	Medium	SV011, SV013
CV010	d-Matrix's March 2026 Gimlet announcement said select customers could access the joint solution in the second half of 2026.	Medium	SV012
CV011	Gimlet's technical writeup found a 2-10x end-to-end interactivity improvement versus a GPU-only speculative-decoding setup at comparable energy efficiency in the studied configuration.	Medium	SV014
CV012	d-Matrix's 2024 Cambrian or Forbes repost highlighted a company claim of 30,000 tokens per second at 2 milliseconds per token for Llama 70B in a single rack.	Medium	SV009
CV013	The investor and partner mix suggests the Series C underwrites inference demand growth, energy efficiency, and strategic infrastructure optionality more than disclosed financial scale.	Medium	SV001, SV002, SV003, SV011, SV012
CV014	The 2025 round looks more like milestone financing around architecture and deployment readiness than conventional ARR-backed late-stage software underwriting.	Medium	SV001, SV003, SV007, SV008, SV010, SV012, SV013, SV014
CV015	Public support for d-Matrix's economic claims remains heavily company-authored because 10x performance, 3x lower cost, and 3-5x better energy efficiency recur across company, investor, and partner materials.	Medium	SV001, SV002, SV003, SV004, SV007, SV011, SV012, SV013
CV016	Independent corroboration exists but is narrower because Gimlet evaluated a specific speculative-decoding configuration rather than a whole-platform same-model benchmark against incumbent clusters.	Medium	SV012, SV014
CV017	No source reviewed in this chapter discloses d-Matrix revenue, ARR, gross margin, or free cash flow, so an EV/revenue or EV/ARR underwriting model cannot be supported publicly.	Medium	SV001, SV004, SV005, SV006, SV007, SV008, SV009, SV010, SV011, SV012
CV018	NVIDIA markets H100 as an inference accelerator integrated with NVLink, InfiniBand, Magnum IO, and AI Enterprise.	Medium	SV018
CV019	NVIDIA says H200 adds 141 GB of HBM3e and 4.8 TB/s bandwidth for larger-model inference.	Medium	SV019
CV020	NVIDIA markets GB200 NVL72 as a liquid-cooled rack-scale system with 72 Blackwell GPUs, 36 Grace CPUs, and a 130 TB/s NVLink domain.	Medium	SV020
CV021	AMD markets MI300X as a 192 GB HBM3 accelerator with 5.3 TB/s bandwidth and ROCm software support for large-model AI.	Medium	SV021
CV022	Groq publicly exposes inference service tiers and an on-prem GroqRack option, making its commercial packaging more visible than d-Matrix's current public pricing.	Medium	SV022
CV023	Cerebras positions wafer-scale hardware for both training and inference, which is a very different deployment model from a modular PCIe-card strategy.	Medium	SV023
CV024	AIMultiple and Forbes both frame inference hardware as a market still led by NVIDIA but increasingly judged on cost, throughput, and energy efficiency.	Medium	SV024, SV025
CV025	Silicon Analysts estimates NVIDIA still holds about 80-90% of AI accelerator revenue in 2025, indicating a highly concentrated market.	Low	SV026
CV026	NVIDIA's FY2026 10-K reports $215.9 billion of revenue and $193.737 billion of Data Center revenue.	High	SV029, SV032, SV033, SV035
CV027	NVIDIA's Q1 FY2027 10-Q reports $81.615 billion of quarterly revenue and 74.9% gross margin for the quarter ended April 26, 2026.	Medium	SV032, SV033, SV035
CV028	AMD's FY2025 10-K reports $34.639 billion of total revenue, $16.635 billion of Data Center revenue, and 50% gross margin.	High	SV030, SV031, SV034, SV036
CV029	AMD's Q1 2026 10-Q reports $10.253 billion of quarterly revenue and 53% gross margin for the quarter ended March 28, 2026.	Medium	SV031, SV034, SV036
CV030	Because NVIDIA and AMD disclose multi-billion revenue and filing-backed margins while d-Matrix discloses none, public GPU-company valuation logic cannot be transferred mechanically onto d-Matrix.	Medium	SV001, SV029, SV030, SV031, SV032
CV031	Low-confidence secondary coverage portrays SambaNova as another integrated enterprise platform, reinforcing that private challengers span multiple business models rather than one clean comparable set.	Low	SV022, SV023, SV027
CV032	The best-supported use of NVIDIA and AMD in this chapter is as scale fences and disclosure benchmarks, not as direct pricing anchors for a private startup with hidden denominator metrics.	Medium	SV018, SV019, SV020, SV021, SV029, SV030
CV033	CRN and Silicon Republic showed d-Matrix had startup mindshare by 2023-2024, but press attention is not equivalent to named production-customer disclosure.	Medium	SV016, SV017
CV034	Partner references with GigaIO and Gimlet show ecosystem progress, but they still stop short of public recurring-revenue or customer-concentration disclosure.	Medium	SV011, SV012, SV013, SV014
CV035	Groq's packaging is more commercially legible than d-Matrix's because public plan tiers and on-prem optionality are visible while d-Matrix public pricing remains absent.	Medium	SV001, SV011, SV012, SV022
CV036	Cerebras and SambaNova-style integrated systems imply a different buyer motion from d-Matrix's modular PCIe-card and NIC approach.	Low	SV013, SV023, SV027
CV037	Bloomberg and the market-report sources reinforce that energy, power, and infrastructure cost are central to inference buying and help explain why investors may pay for efficiency narratives.	Medium	SV015, SV024, SV028
CV038	The same sources also show that high power requirements, data-security concerns, and supply-chain stress remain structural adoption risks for inference hardware.	Low	SV015, SV028
CV039	Concentrated incumbent share and stack lock-in argue against paying a startup scarcity premium unless d-Matrix proves repeatable production wins.	Medium	SV018, SV020, SV022, SV024, SV025, SV026
CV040	The Series C price can be rationalized only if investors believe inference demand will shift materially toward specialized hardware and d-Matrix will win a meaningful share of that spend.	Medium	SV001, SV002, SV003, SV015, SV024, SV025, SV026
CV041	A reasonable bull case requires named scale deployments, independent benchmark normalization, and evidence that customers choose d-Matrix for production inference rather than pilots.	Medium	SV003, SV010, SV011, SV012, SV014, SV017
CV042	The base case assumes deployments continue and the company avoids a down round, but public evidence remains insufficient to prove premium economics above the latest private mark.	Medium	SV003, SV009, SV010, SV011, SV012, SV017
CV043	The bear case is plausible because commercialization proof is still thin relative to the scale of NVIDIA and AMD incumbency and the fragmentation of private challengers.	Medium	SV017, SV024, SV025, SV026, SV029, SV030
CV044	Valuation therefore has to be milestone-based rather than multiple-based because the denominator is hidden and the comp set is structurally mismatched.	Medium	SV017, SV029, SV030, SV031, SV032
CV045	This chapter's valuation ranges are scenario judgments around the last disclosed round, not market-cleared fair values or implied revenue-multiple outputs.	Medium	SV001, SV017, SV029, SV030, SV031, SV032
CV046	A bear-case valuation range of roughly $700 million to $1.0 billion reflects IP and partner optionality but assumes pilots stall or the next financing resets materially below Series C.	Medium	SV009, SV011, SV012, SV013, SV017, SV024, SV026
CV047	A base-case valuation range of roughly $1.2 billion to $1.8 billion assumes commercialization progresses and financing remains available, but public disclosure still does not justify a clear premium above the latest round.	Medium	SV001, SV003, SV010, SV011, SV012, SV015, SV017
CV048	A bull-case valuation range of roughly $2.5 billion to $3.5 billion requires named large deployments, third-party validated economics, and evidence that d-Matrix can hold a differentiated niche beside incumbent GPU infrastructure.	Medium	SV003, SV011, SV012, SV013, SV014, SV018, SV020
CV049	A 25% bear, 50% base, and 25% bull weighting centers the public-evidence midpoint around roughly $1.5 billion to $2.0 billion, below the last disclosed $2 billion round.	Medium	SV001, SV017, SV029, SV030, SV031, SV032
CV050	Because the Series C price sits above the chapter's midpoint and still requires unproven bull milestones, the current public evidence supports a stretched valuation stance.	Medium	SV001, SV017, SV029, SV030, SV031, SV032
CV051	The recommendation is research-more rather than buy because the company may be strategically relevant, but the price is ahead of public proof.	Medium	SV001, SV017, SV029, SV030, SV031, SV032
CV052	Confidence is medium because financing and product facts are real, yet the most decision-critical operating metrics remain undisclosed.	Medium	SV001, SV003, SV017
CV053	Risk rating is high because the investment case still depends on commercialization proof, supply-chain execution, and a financing or exit path that public evidence does not yet de-risk.	Medium	SV013, SV014, SV015, SV017, SV026, SV027, SV028
CV054	Entry discipline should require a materially lower price or hard data-room evidence on revenue quality and cap-table terms.	Medium	SV017, SV030, SV031, SV032
CV055	Thesis-breakers include a down round below roughly $1 billion, failure to convert pilots into named deployments, or independent benchmarks that negate the claimed TCO edge.	Medium	SV011, SV012, SV013, SV014, SV017, SV028
CV056	Diligence should request audited revenue or ARR, gross margin, customer concentration, bookings-to-deployment conversion, realized pricing, and preference-stack details before underwriting the Series C mark.	Medium	SV017, SV030, SV031, SV032
CV057	The reviewed corpus does not provide high-confidence private-market valuation data for Groq, Cerebras, or SambaNova comparable to d-Matrix's disclosed $2 billion round, so they can inform business-model comparison but not numeric pricing.	Low	SV022, SV023, SV027
CV058	Public evidence supports strategic sale or later secondary optionality more than a near-term IPO because d-Matrix has not reached public-company disclosure quality in the reviewed corpus.	Medium	SV017, SV029, SV030

Sources
ID	Publisher	Title	Quote
SO001	d-Matrix	d-Matrix - Ultra-low Latency Batched Inference for Generative AI	Chiplet-based design enables scaling SRAM-based architecture to power models up to 100B parameters.
SO002	d-Matrix	About d-Matrix \| Innovators in AI Compute Solutions & Technology	d-Matrix is led by a team of dedicated entrepreneurs with a 20+ year history in building businesses that have shipped over 100M chips and generated over $5B in revenue.
SO003	d-Matrix	d-Matrix News
SO004	d-Matrix	d-Matrix Raises $275 Million to Power the Age of AI Inference	Founded: 2019 \| HQ: Santa Clara, CA \| Global Offices: Toronto (Canada); Sydney (Australia); Bangalore (India); Belgrade (Serbia) \| Employees: 250+ worldwide.
SO005	d-Matrix	d-Matrix Unveils Corsair, the World’s Most Efficient AI Computing Platform for Inference in Datacenters	Corsair is sampling to early-access customers and will be broadly available in Q2’2025.
SO006	d-Matrix	d-Matrix Announces $110 Million in Series B Funding to Make Generative AI Commercially Viable with First-of-Its-Kind Inference Compute Platform	d-Matrix was founded in 2019 to solve the memory-compute integration problem, which is the final frontier in AI compute efficiency.
SO007	Bullhound Capital	Bullhound Capital leads $275M investment into AI inference leader d-Matrix	d-Matrix ... has closed $275 million in Series C funding, valuing the company at $2 billion and bringing the total raised to date to $450 million.
SO008	Qatar Investment Authority	QIA joins d-Matrix’s Series C USD 275m funding round	Valued at USD 2 billion and bringing the total raised to date to USD 450 million, d-Matrix will use the new capital to advance their roadmap and accelerate global expansion.
SO009	Data Center Dynamics	Chip startup d-Matrix raises $275m against $2bn valuation
SO010	SiliconANGLE	Chip startup d-Matrix raises $275M to speed up inference with in-memory compute
SO011	Converge Digest	d-Matrix Raises $275 Million to Accelerate the Age of AI Inference
SO012	CRN	10 Hottest Semiconductor Startups 2023
SO013	Silicon Republic	24 tech start-ups to watch in 2024
SO014	VentureBeat	New Microsoft partnership accelerates generative AI development
SO015	CNN	AI chips supply chain bottlenecks
SO016	CNBC	ChatGPT and generative AI are booming, but the costs can be extraordinary
SO017	TechTarget	Infrastructure to support OpenAI’s ChatGPT could be costly
SO018	The Washington Post	The hidden cost of ChatGPT is all the computing power
SO019	Bloomberg	AI Energy Crisis Boosts Need for More Efficient Chips
SO020	GigaIO	GigaIO Partners with d-Matrix to Deliver Ultra-Efficient Scale-Up AI Inference Platform	The new GigaIO SuperNODE platform, capable of supporting dozens of d-Matrix Corsair accelerators in a single node, is now the industry’s most scalable AI inference platform.
SO021	Gimlet Labs	Low-Latency Inference with Speculative Decoding on d-Matrix Corsair and GPU	Compared to the same speculative decoder on GPU and equivalent energy consumption, we've found that the Corsair-based solution delivers 2-5X end-to-end request speedup and up to 10X end-to-end speedup for energy-optimized configurations.
SO022	d-Matrix	d-Matrix Announces JetStream I/O Accelerators Enabling Ultra-Low Latency for AI Inference at Scale	JetStream NICs are full-height PCIe Gen5 cards delivering a maximum 400Gpbs bandwidth. Samples are available now, with full production expected by year-end.
SO023	d-Matrix	d-Matrix Announces SquadRack, Industry’s First Rack-Scale Solution Purpose-Built for AI Inference at Datacenter Scale	SquadRack™, the industry’s first blueprint for disaggregated standards-based rack-scale solutions for ultra-low latency batched inference.
SO024	d-Matrix	d-Matrix and Alchip Announce Collaboration on World’s First 3D DRAM Solution to Supercharge AI Inference	3DIMC will commercially debut on the d-Matrix Raptor™ inference accelerator, the successor to d-Matrix Corsair™.
SO025	GFM Review	The New Wave of AI Chips: Can Cerebras, d-Matrix and Groq take on Nvidia?	These new entrants face several hurdles, including the need for substantial capital investment to scale production, challenges in building brand recognition, and the difficulty of competing against Nvidia’s established ecosystem.
SM001	d-Matrix	Technology
SM002	d-Matrix	Product	Corsair is the first-of-its-kind AI compute platform. It offers unparalleled performance and efficiency for generative AI inference in the datacenter.
SM003	d-Matrix	d-Matrix Corsair Redefines Performance and Efficiency for AI Inference at Scale	Traditional accelerators often use costly and power-hungry High Bandwidth Memory (HBM)... d-Matrix breaks this memory barrier by integrating a multiplier directly into memory bit cell using a logic process.
SM004	d-Matrix	JetStream Product Brief	d-Matrix JetStream is a purpose-built network interface card (NIC) enabling efficient scaling of AI workloads and delivering ultra-low latency inference with Corsair clusters.
SM005	d-Matrix	The Power of the Middle Lane: Why a Hybridized Approach to Memory Gives the Best of Both Worlds	The per-token compute for a given inference has become the “cheap” part of the inference process, while that movement to and from memory is a massive “tax” per token generation.
SM006	d-Matrix	Scaling AI the Right Way: Introducing Our Rack-level Inference Solution	SquadRack is a reference rack blueprint ... based on rack power or rack height constraints. And it is air-cooled and doesn’t have any special cooling infrastructure requirements.
SM007	d-Matrix	Why We Needed a New Transparent NIC Solution	We consider there to be three total barriers to fast, efficient, high-performance AI inference: compute, memory, and I/O.
SM008	d-Matrix	Why We Decoupled Execution to Accelerate I/O	Rack-scale and multi-rack solutions with standard RoCE v2 systems accrue a large penalty scaling linearly due to the inherent costs of a system where the control plane and data plane both require a series of handshakes.
SM009	d-Matrix	Why Modern AI Workloads Demand a Disaggregated Approach	During decode, however, each new token is dependent on information from the prior one... making the speed of the response governed by the speed at which those trips to and from memory happen.
SM010	d-Matrix	Batching Just Right: How Interactive Apps Serve as a New Battleground	Lowering a batch size means a snappier, lower-latency user experience; while raising it increases overall throughput (and improves the economics).
SM011	d-Matrix	What Is AI Inference and Why It Matters in the Age of Generative AI	Generative AI models are orders of magnitude bigger ... They need multiple cards or servers to perform inference, consuming significantly more computational resources and energy.
SM012	d-Matrix	Impact of the DeepSeek Moment on Inference Compute	Reasoning models are highly memory bound and end up underutilizing the GPUs that are optimized for training.
SM013	d-Matrix	Introducing dmx.compressor	Quantization plays a key role in reducing memory usage, speeding up inference, and lowering energy consumption at inference time.
SM014	d-Matrix	d-Matrix Unveils Corsair, the World’s Most Efficient AI Computing Platform for Inference in Datacenters	Corsair delivers up to 10x faster interactive speed, 3x better performance per total cost of ownership (TCO), and 3x greater energy efficiency.
SM015	Data Center Dynamics	Chip startup d-Matrix raises $275m against $2bn valuation	It brings the total raised by d-Matrix to $450 million, which the company said it will use to advance its roadmap, accelerate global expansion, and support multiple large-scale deployments of its data center inference platform.
SM016	Bloomberg	AI Energy Crisis Boosts Need for More Efficient Chips	Every time the algorithm needs data, it goes to the library, known as a memory chip, checks out the data and takes it to another chip, known as a processor... a process that burns through lots of electricity.
SM017	VentureBeat	Nvidia GPU shortage is top gossip of Silicon Valley	The problem with inference is if the workload spikes very rapidly ... There is no way your GPU capacity can keep up with that because it was not built for that. It was built for training.
SM018	CNN	AI chips supply chain is straining under demand	Our datacenters depend on the availability of permitted and buildable land, predictable energy, networking supplies, and servers, including graphics processing units.
SM019	CNBC	ChatGPT and generative AI are booming, but the costs can be extraordinary	Curran believes that it could have cost OpenAI $40 million to process the millions of prompts people fed into the software that month.
SM020	TechTarget	Infrastructure to support OpenAI’s ChatGPT could be costly	Users have to build up not just compute, but their networking and power management.
SM021	GigaIO	GigaIO Partners with d-Matrix to Deliver Ultra-Efficient Scale-Up AI Inference Platform	This integration enables enterprises to deploy ultra-low-latency batched inference workloads at scale without the complexity of traditional distributed computing approaches.
SM022	Gimlet Labs	Low-Latency Inference with Speculative Decoding on d-Matrix Corsair and GPU	Compared to the same speculative decoder on GPU and equivalent energy consumption, we've found that the Corsair-based solution delivers 2-5X end-to-end request speedup on configurations optimized for interactivity, and up to 10X end-to-end speedup for energy-optimized configurations.
SM023	NVIDIA	H100 Tensor Core GPU	H100 extends NVIDIA’s market-leading inference leadership with several advancements that accelerate inference by up to 30X and deliver the lowest latency.
SM024	NVIDIA	H200 Tensor Core GPU	The H200 boosts inference speed by up to 2X compared to H100 GPUs when handling LLMs like Llama2.
SM025	NVIDIA	GB200 NVL72	The NVIDIA GB200 NVL72 ... delivers 30x faster real-time trillion-parameter large language model inference, with 10x greater performance for mixture-of-experts architectures.
SM026	Groq	Products	The AI inference platform built for developers. Fast responses, scalable performance, and costs you can plan for.
SM027	AIMultiple	AI Hardware	Hyperscalers run broad cloud platforms with GPU rental as one product among many. Specialist neoclouds focus on GPU and AI infrastructure as their core product.
SM028	Forbes	AI Inference Is King: Do You Know Which Chip Is Best?	MarketsandMarkets projects that the AI inference market is expected to grow from $106.15 billion in 2025 to $254.98 billion by 2030.
SM029	Silicon Analysts	Nvidia AI Accelerator Market Share 2024-2026	NVIDIA commands approximately 80-90% of the AI accelerator market by revenue as of 2025.
SP001	Data Center Dynamics	Chip startup d-Matrix raises $275m against $2bn valuation	Its full-stack platform is powered by the company’s Corsair inference accelerators, JetStream NICs, and Aviator software, which it claims can deliver 10X faster performance, 3X lower cost, and 3–5X better energy efficiency than GPU-based systems.
SP002	SiliconANGLE	Chip startup d-Matrix raises $275M to speed up inference with in-memory compute
SP003	TechStartups	AI chip startup d-Matrix raises $275M at $2B valuation to challenge Nvidia’s grip on AI inference	Traditional GPUs, including Nvidia’s, move data constantly between memory and processing cores, consuming huge amounts of energy.
SP004	Bloomberg	AI Energy Crisis Boosts Need for More Efficient Chips	AI requires massive amounts of data... a process that burns through lots of electricity.
SP005	CRN	10 Hottest Semiconductor Startups of 2023	Nvidia may dominate the AI computing space... but that isn’t stopping a slew of semiconductor startup companies, such as Cerebras Systems, d-Matrix, Lightmatter and Tenstorrent.
SP006	Silicon Republic	Start-ups to Watch in 2024	d-Matrix claims its chips are optimised to support generative AI systems and is competing in the same space as Nvidia.
SP007	eeNews Europe	Second-generation chiplet platform targets generative AI	The Jayhawk chiplet platform features... 16 Gbit/s/wire bandwidth and less than 0.5 pJ/bit energy efficiency.
SP008	HPCwire	d-Matrix Launches New Chiplet Connectivity Platform to Address Growing Compute Demand for Generative AI	By using a modular chiplet-based approach, data center customers can refresh compute platforms on a much faster cadence using a pre-validated chiplet architecture.
SP009	VentureBeat	Nvidia GPU shortage is top gossip of Silicon Valley	For LLM training... there is no substitute for Nvidia’s H100.
SP010	CNN	AI chips supply chain bottlenecks	The shortage could force companies to find creative ways around the problem.
SP011	CNBC	ChatGPT and generative AI are booming, but the costs can be extraordinary	At its peak in 2021, Latitude was spending nearly $200,000 a month on OpenAI’s software and Amazon Web Services.
SP012	TechTarget	Infrastructure to support OpenAI’s ChatGPT could be costly	Reports surfacing earlier this month indicate that just to develop training models and inferencing alone for OpenAI’s ChatGPT can require 10,000 Nvidia GPUs.
SP013	The Washington Post	ChatGPT’s hidden cost is GPU compute	The enormous cost of running today’s large language models is limiting their quality and threatening to throttle the global AI boom.
SP014	NVIDIA	NVIDIA H100	H100 extends NVIDIA’s market-leading inference leadership... with several advancements that accelerate inference by up to 30X and deliver the lowest latency.
SP015	NVIDIA	NVIDIA H200	The NVIDIA H200 is the first GPU to offer 141GB of HBM3e memory at 4.8TB/s.
SP016	NVIDIA	NVIDIA GB200 NVL72	The NVIDIA GB200 NVL72... delivers 30x faster real-time trillion-parameter LLM inference.
SP017	AMD	AMD Instinct MI300X Accelerators	Dedicated Memory Size 192 GB... Peak Memory Bandwidth 5.3 TB/s... Supported Technologies AMD ROCm.
SP018	Groq	Groq Products	Available in public, private, or co-cloud instances.
SP019	Cerebras	Cerebras Product Chip	Four trillion transistors. 125 petaflops. One silicon wafer.
SP020	AIMultiple	AI Hardware	We benchmarked NVIDIA’s B200, H200, H100, and AMD’s MI300X to assess how well they scale for LLM inference.
SP021	Intuition Labs	LLM Inference Hardware Enterprise Guide	NVIDIA’s ecosystem advantages (CUDA software stack, TensorRT, etc.) create high switching costs.
SP022	Forbes	AI Inference Is King: Do You Know Which Chip Is Best?	Now for the real test: is the NVL72 as fast as Nvidia promised at launch? Yes, it is thirty times faster than the 8-GPU H200.
SP023	GFM Review	The New Wave of AI Chips: Can Cerebras, d-Matrix, and Groq Take on Nvidia	The specialized nature of Cerebras, d-Matrix, and Groq’s technologies allows these companies to carve out niches where their solutions can outperform Nvidia’s more generalized approach.
SP024	Silicon Analysts	NVIDIA AI Accelerator Market Share 2024-2026	NVIDIA commands approximately 80-90% of the AI accelerator market by revenue as of 2025... priority TSMC CoWoS allocation maintain dominance.
SP025	Research and Markets via GlobeNewswire / Yahoo Finance	AI inference - Company Evaluation Report, 2025	The market faces constraints such as the high power requirements of AI chips... and concerns around data security and supply chain disruptions.
SP026	NVIDIA	NVIDIA L40S GPU	Experience breakthrough multi-workload performance with the NVIDIA L40S GPU. Combining powerful AI compute with best-in-class graphics and media acceleration, the L40S GPU is built to power the next generation of data center workloads—from generative AI and large language model (LLM) inference and training to 3D graphics, rendering, and video.
SI001	d-Matrix	d-Matrix Raises $275 Million to Power the Age of AI Inference	D-Matrix has closed $275 million in Series C funding, valuing the company at $2 billion and bringing the total raised to date to $450 million.
SI002	d-Matrix	d-Matrix launches Corsair for AI inference without GPUs / HBM	Corsair offers performance of 60,000 tokens/second at 1 ms/token for Llama3 8B in a single server and 30,000 tokens/second at 2 ms/token for Llama3 70B in a single rack.
SI003	d-Matrix	d-Matrix Unveils Corsair, the World's Most Efficient AI Computing Platform for Inference in Datacenters	Corsair is sampling to early-access customers and will be broadly available in Q2'2025.
SI004	d-Matrix	d-Matrix Announces $110 Million in Series B Funding to Make Generative AI Commercially Viable	The goal of the fundraise is to enable d-Matrix to begin commercializing Corsair.
SI005	d-Matrix	d-Matrix Corsair Redefines Performance and Efficiency for AI Inference at Scale	Performance, cost and power estimates are preliminary and subject to change. Results may vary.
SI006	d-Matrix	JetStream Product Brief	JetStream Product Specification ... Maximum bandwidth 400 Gbps ... Max TDP (w/ transceivers) 150 W.
SI007	Bullhound Capital	Bullhound Capital leads $275M investment into AI inference leader d-Matrix	D-Matrix ... has closed $275 million in Series C funding, valuing the company at $2 billion and bringing the total raised to date to $450 million.
SI008	Qatar Investment Authority	QIA joins d-Matrix's Series C USD 275m funding round	Valued at USD 2 billion and bringing the total raised to date to USD 450 million, d-Matrix will use the new capital to advance their roadmap, accelerate global expansion and support multiple large-scale deployments.
SI009	Data Center Dynamics	Chip startup d-Matrix raises $275m against $2bn valuation	Its single-card configuration provides up to 256GB of off-chip capacity memory and 2GB of performance memory at 150Tbps.
SI010	SiliconANGLE	Chip startup d-Matrix raises $275M to speed up inference with in-memory compute
SI011	Bloomberg	AI Energy Crisis Boosts Need for More Efficient Chips	AI requires massive amounts of data ... a process that burns through lots of electricity.
SI012	VentureBeat	New Microsoft partnership accelerates generative AI development	Project Bonsai is a platform ... and the early results are very encouraging.
SI013	VentureBeat	Nvidia GPU shortage is top gossip of Silicon Valley	When people use the word GPU shortage, they're really talking about a shortage of ... some component on the board, not the GPU itself.
SI014	CNN	AI chips supply chain bottlenecks	The shortage could force companies to find creative ways around the problem.
SI015	CNBC	ChatGPT and generative AI are booming, but the costs can be extraordinary	At its peak in 2021, Latitude was spending nearly $200,000 a month on OpenAI's generative AI software and Amazon Web Services.
SI016	TechTarget	Infrastructure to support OpenAI's ChatGPT could be costly	Users have to build up not just compute, but their networking and power management.
SI017	The Washington Post	The hidden cost of ChatGPT is all the computing power	AI chatbots have a problem: They lose money on every chat.
SI018	GigaIO	GigaIO Partners with d-Matrix to Deliver Ultra-Efficient Scale-Up AI Inference Platform	Our single-node server eliminates complex multi-node configurations and simplifies deployment, enabling enterprises to quickly adapt to evolving AI workloads while significantly improving their TCO and operational efficiency.
SI019	Gimlet Labs	Low-Latency Inference with Speculative Decoding on d-Matrix Corsair and GPU	Compared to the same speculative decoder on GPU and equivalent energy consumption, we've found that the Corsair-based solution delivers 2-5X end-to-end request speedup ... and up to 10X ... for energy-optimized configurations.
SI020	AIMultiple	AI Hardware	Cloud GPU list prices for the same model can differ several times over from one provider to another.
SI021	d-Matrix	d-Matrix Ecosystem	Not actual customers.
SI022	d-Matrix	d-Matrix Announces SquadRack	Combined with Supermicro's AI servers, Arista's ethernet switches, and Broadcom's PCIe and ethernet switch chips, we're delivering an AI inference rack that speeds up time to deployment.
SI023	d-Matrix	d-Matrix Announces JetStream I/O Accelerators	JetStream NICs are full-height PCIe Gen5 cards delivering a maximum 400Gpbs bandwidth. Samples are available now, with full production expected by year-end.
SI024	d-Matrix	d-Matrix Emerges from Stealth with Strong AI Performance and Efficiency	d-Matrix uses a hybrid approach to memory ... using SRAM as Performance Memory and a larger DRAM store for Capacity Memory.
SI025	d-Matrix	d-Matrix and Alchip Announce Collaboration on World's First 3D DRAM Solution	The collaboration has already enabled a key technology, d-Matrix 3DIMC, that is featured on d-Matrix Pavehawk test silicon and has been successfully validated in d-Matrix's labs.
SI026	d-Matrix	Gimlet Cloud to Deploy d-Matrix Corsair Alongside GPUs	The companies plan to make their combined solution available to select customers through Gimlet Cloud in 2H 2026.
SI027	d-Matrix	Why We Created a 3D DRAM Solution to Advance Low-Latency AI Inference	Our next task is meeting the incredible demand required by emerging AI workloads with high user expectations, and that starts with Pavehawk.
SI028	d-Matrix	How Speculative Decoding Supercharged AI Inference in Disaggregated Pipelines	Rather than reducing the batch size on a GPU and wasting compute to generate a low-latency experience, it reduces the total number of sequential inferences the powerful model handles.
SI029	d-Matrix	Democratizing AI Through Hardware-Software Codesign for LLM Inference	The team looks at the associated software design of modern systems including collective communication algorithms and the distributed inference serving stack.
SI030	NVIDIA via Stocklight	NVIDIA Corporation Annual Report 2026	These Data Center systems are extreme co-designed with the GPU, CPU, NVLink switch, DPU, NIC, and scale-out networking along with software stacks and algorithms to deliver data center-scale computing solutions.
SI031	GFM Review	The New Wave of AI Chips: Can Cerebras, d-Matrix, and Groq Take on Nvidia?	These new entrants face several hurdles, including the need for substantial capital investment to scale production, challenges in building brand recognition, and the difficulty of competing against Nvidia's established ecosystem.
SI032	d-Matrix	Transforming AI: d-Matrix's Pivotal Moments in Pursuit of GenAI Inference at Scale	After coming within two weeks of running out of cash, we had an a-ha moment ... why don't we go ask for $40 million and show a big vision to match the size of the opportunity.
SE001	d-Matrix	d-Matrix homepage
SE002	d-Matrix	Technology
SE003	d-Matrix	Product
SE004	d-Matrix	Ecosystem
SE005	d-Matrix	SquadRack announcement	Corsair delivers the compute-memory acceleration, while JetStream delivers I/O acceleration.
SE006	d-Matrix	JetStream announcement	JetStream NICs are full-height PCIe Gen5 cards delivering a maximum 400Gpbs bandwidth.
SE007	d-Matrix	Corsair launch announcement	Each Corsair card is powered by DIMC compute cores with 2400 TFLOPs of 8-bit peak compute, 2 GB of integrated Performance Memory, and up to 256 GB of off-chip Capacity Memory.
SE008	d-Matrix	d-Matrix and Alchip announce collaboration on 3D DRAM solution	3DIMC will commercially debut on the d-Matrix Raptor inference accelerator, the successor to d-Matrix Corsair.
SE009	d-Matrix	d-Matrix Corsair Redefines Performance and Efficiency for AI Inference at Scale	d-Matrix Aviator is an enterprise-grade software stack co-designed with d-Matrix hardware.
SE010	d-Matrix	JetStream product brief	Maximum bandwidth 400 Gbps
SE011	d-Matrix	Going vertical: Why we created a 3D DRAM solution to advance low-latency AI inference
SE012	d-Matrix	The power of the middle lane: why a hybridized approach to memory gives the best of both worlds
SE013	d-Matrix	Scaling AI the right way: introducing our rack-level inference solution
SE014	d-Matrix	Why we needed a new transparent NIC solution
SE015	d-Matrix	Why we decoupled execution to accelerate I/O
SE016	d-Matrix	Why modern AI workloads demand a disaggregated approach
SE017	d-Matrix	How speculative decoding supercharged AI inference in disaggregated pipelines
SE018	d-Matrix	Batching just right: how interactive apps serve as a new battleground
SE019	d-Matrix	Impact of the DeepSeek moment on inference compute
SE020	d-Matrix	Keyformer: KV cache reduction through attention sparsification for efficient generative inference
SE021	d-Matrix	Introducing dmx.compressor
SE022	d-Matrix	Democratizing AI through hardware-software codesign for LLM inference
SE023	Data Center Dynamics	Chip startup d-Matrix raises $275m against $2bn valuation
SE024	EE News Europe	Second generation chiplet platform targets generative AI
SE025	HPCwire	d-Matrix launches new chiplet connectivity platform to address growing compute demand for generative AI
SE026	GigaIO	GigaIO partners with d-Matrix to deliver ultra-efficient scale-up AI inference platform
SE027	Gimlet Labs	Low-Latency Inference with Speculative Decoding on d-Matrix Corsair and GPU	Compared to the same speculative decoder on GPU and equivalent energy consumption, we've found that the Corsair-based solution delivers 2-5X end-to-end request speedup.
SE028	NVIDIA	NVIDIA H100
SE029	NVIDIA	NVIDIA H200
SE030	NVIDIA	NVIDIA GB200 NVL72
SE031	AIMultiple	AI Hardware
SE032	Intuition Labs	Private LLM Inference: Key Hardware and Integrators for Enterprise
SE033	Forbes	AI Inference Is King: Do You Know Which Chip Is Best?
SE034	Silicon Analysts	NVIDIA AI accelerator market share 2024-2026
SE035	d-Matrix	GigaIO partners with d-Matrix to deliver ultra-efficient scale-up AI inference platform
SE036	d-Matrix	AI is a context problem
SE037	d-Matrix	The fight for latency: why agents have changed the game
SE038	d-Matrix	Open standards are the path to the next AI breakthrough
SE039	d-Matrix	Why optimizing every layer of AI workloads from software to infrastructure is now critical as apps take off
SE040	Alchip	d-Matrix and Alchip announce collaboration on world's first 3D DRAM solution to supercharge AI inference
SE041	d-Matrix	The complete recipe to unlock AI reasoning at enterprise scale
SU001	d-Matrix	AI Computing Solutions for Cloud & Enterprise Markets	*Not actual customers.
SU002	d-Matrix	d-Matrix Raises $275 Million to Power the Age of AI Inference
SU003	d-Matrix	SquadRack
SU004	d-Matrix	JetStream
SU005	d-Matrix	d-Matrix launches Corsair for AI inference without GPUs, HBM
SU006	d-Matrix	d-Matrix Unveils Corsair, the World's Most Efficient AI Computing Platform for Inference in Datacenters
SU007	d-Matrix	GigaIO Partners with d-Matrix to Deliver Ultra-Efficient Scale-Up AI Inference Platform
SU008	d-Matrix	Gimlet Cloud, built for running agentic AI inference, to deploy d-Matrix Corsair low latency, memory-optimized accelerators alongside GPUs
SU009	d-Matrix	Scaling AI the Right Way: Introducing Our Rack-Level Inference Solution
SU010	d-Matrix	Why We Needed a New Transparent NIC Solution
SU011	d-Matrix	How Speculative Decoding Supercharged AI Inference in Disaggregated Pipelines
SU012	d-Matrix	Impact of the DeepSeek Moment on Inference Compute
SU013	d-Matrix	Transforming AI: d-Matrix's Pivotal Moments in Pursuit of Gen AI Inference at Scale
SU014	Bullhound Capital	Bullhound Capital leads $275M investment into AI inference leader d-Matrix
SU015	QIA	QIA joins d-Matrix's Series C USD 275m funding round
SU016	Data Center Dynamics	Chip startup d-Matrix raises $275m against $2bn valuation
SU017	SiliconANGLE	Chip startup d-Matrix raises $275M to speed inference with memory-centric compute
SU018	TechStartups	AI chip startup d-Matrix raises $275M at $2B valuation to challenge Nvidia's grip on AI inference
SU019	Converge Digest	d-Matrix Raises $275 Million to Accelerate the Age of AI Inference
SU020	GigaIO	GigaIO Partners with d-Matrix to Deliver Ultra-Efficient Scale-Up AI Inference Platform
SU021	Gimlet Labs	Low-Latency Inference with Speculative Decoding on d-Matrix Corsair and GPU
SU022	AIMultiple	AI hardware
SU023	IntuitionLabs	Private LLM Inference: Key Hardware and Integrators for Enterprise
SU024	Forbes	AI Inference Is King. Do You Know Which Chip Is Best?
SU025	GFM Review	The New Wave of AI Chips: Can Cerebras, d-Matrix and Groq Take on Nvidia?
SU026	d-Matrix	d-Matrix and Gimlet Join Forces in Strategic Partnership
SU027	Data Center Knowledge	Gimlet Labs, d-Matrix Partner to Boost Agentic AI Inference
SU028	eeNews Europe	d-Matrix launches Corsair for AI inference without GPUs, HBM
SU029	AIwire	GigaIO Partners with d-Matrix to Deliver Ultra-Efficient Scale-Up AI Inference Platform
SU030	PR Newswire	d-Matrix Announces SquadRack, Industry's First Rack-Scale Solution Purpose-Built for AI Inference at Datacenter Scale
SU031	d-Matrix	Request Access or Buy
SU032	d-Matrix	The fight for latency: why agents have changed the game
SU033	AMD	AMD Instinct MI350 Series GPUs	AMD Instinct™ MI350 Series GPUs help enable frictionless adoption with drop-in compatibility, while the AMD GPU Operator simplifies deployment and workload configuration in Kubernetes.
SU034	Groq	GroqCloud	The AI inference platform built for developers. Fast responses, scalable performance, and costs you can plan for. Available in public, private, or co-cloud instances.
SU035	Cerebras	Chip	The world’s largest and most powerful processor for AI training and inference.
SR001	d-Matrix	SquadRack announcement
SR002	d-Matrix	JetStream announcement
SR003	d-Matrix	d-Matrix and Alchip announce collaboration on 3D DRAM solution
SR004	d-Matrix	Going vertical: Why we created a 3D DRAM solution to advance low-latency AI inference
SR005	d-Matrix	Why We Needed a New Transparent NIC Solution
SR006	d-Matrix	Why We Decoupled Execution to Accelerate I/O
SR007	d-Matrix	Why Modern AI Workloads Demand a Disaggregated Approach
SR008	d-Matrix	Impact of the DeepSeek Moment on Inference Compute
SR009	Data Center Dynamics	Chip startup d-Matrix raises $275m against $2bn valuation
SR010	SiliconANGLE	Chip startup d-Matrix raises $275M to speed up inference with in-memory compute
SR011	Bloomberg	AI Energy Crisis Boosts Need for More Efficient Chips
SR012	eeNews Europe	Second-generation chiplet platform targets generative AI
SR013	VentureBeat	Nvidia GPU shortage is top gossip of Silicon Valley
SR014	CNN	AI chips supply chain bottlenecks
SR015	TechTarget	Infrastructure to support OpenAI’s ChatGPT could be costly
SR016	The Washington Post	ChatGPT’s hidden cost is GPU compute
SR017	NVIDIA	NVIDIA H200
SR018	NVIDIA	NVIDIA GB200 NVL72
SR019	AMD	AMD Instinct MI300X Accelerators
SR020	Groq	Groq Products
SR021	Cerebras	Cerebras Product Chip
SR022	Intuition Labs	LLM Inference Hardware Enterprise Guide
SR023	Silicon Analysts	NVIDIA AI Accelerator Market Share 2024-2026
SR024	d-Matrix	GigaIO partners with d-Matrix to deliver ultra-efficient scale-up AI inference platform
SR025	d-Matrix	d-Matrix and Gimlet Labs accelerate agentic AI inference
SR026	d-Matrix	Transforming AI: d-Matrix’s pivotal moments in pursuit of Gen AI inference at scale
SR027	d-Matrix	Privacy Policy - d-Matrix
SR028	d-Matrix	Terms of Use - d-Matrix
SR029	Bureau of Industry and Security	Commerce Strengthens Restrictions on Advanced Computing Semiconductors, Semiconductor Manufacturing Equipment, and Supercomputing Items to Countries of Concern
SR030	d-Matrix	Join d-Matrix \| Careers in AI Innovation and Technology
SR031	d-Matrix	d-Matrix and Gimlet Join Forces in Strategic Partnership - d-Matrix
SR032	Reuters	AI chip startup d-Matrix raises $110 mln with backing from Microsoft
SR033	Supermicro	Supermicro expands AI solutions and adds d-Matrix inference and rack-scale systems
SR034	Alchip	d-Matrix and Alchip announce collaboration on world’s first 3D DRAM solution to supercharge AI inference
SR035	NVIDIA	NVIDIA AI Enterprise
SR036	d-Matrix	Using what’s on hand: spare data center space is an untapped gold mine - d-Matrix
SV001	d-Matrix	d-Matrix raises $275 million to power the age of AI inference	d-Matrix has closed $275 million in Series C funding, valuing the company at $2 billion and bringing the total raised to date to $450 million.
SV002	Bullhound Capital	Bullhound Capital leads $275M investment into AI inference leader d-Matrix
SV003	Qatar Investment Authority	QIA joins d-Matrix’s Series C USD 275m funding round
SV004	Data Center Dynamics	Chip startup d-Matrix raises $275m against $2bn valuation	It brings the total raised by d-Matrix to $450 million, which the company said it will use to advance its roadmap, accelerate global expansion, and support multiple large-scale deployments of its data center inference platform.
SV005	SiliconANGLE	Chip startup d-Matrix raises $275M to speed up inference with in-memory compute
SV006	TechStartups	AI chip startup d-Matrix raises $275M at $2B valuation to challenge Nvidia’s grip on AI inference
SV007	Converge Digest	d-Matrix raises $275 million to accelerate the age of AI inference
SV008	d-Matrix	d-Matrix announces $110 million in Series B funding
SV009	d-Matrix	d-Matrix emerges from stealth with strong AI performance and efficiency
SV010	d-Matrix	Transforming AI: d-Matrix’s pivotal moments in pursuit of GenAI inference at scale
SV011	d-Matrix	GigaIO partners with d-Matrix to deliver ultra-efficient scale-up AI inference platform
SV012	d-Matrix	Gimlet Cloud to deploy d-Matrix Corsair accelerators alongside GPUs
SV013	GigaIO	GigaIO partners with d-Matrix to deliver ultra-efficient scale-up AI inference platform
SV014	Gimlet Labs	Low-Latency Inference with Speculative Decoding on d-Matrix Corsair and GPU	By offloading the speculative decoder to d-Matrix Corsair, we achieved 2-10X interactivity improvements over a homogeneous speculative decoding setup for the same energy efficiency.
SV015	Bloomberg	AI Energy Crisis Boosts Need for More Efficient Chips
SV016	CRN	10 Hottest Semiconductor Startups of 2023
SV017	Silicon Republic	Start-ups to Watch in 2024
SV018	NVIDIA	NVIDIA H100 Tensor Core GPU
SV019	NVIDIA	NVIDIA H200 Tensor Core GPU
SV020	NVIDIA	NVIDIA GB200 NVL72
SV021	AMD	AMD Instinct MI300X
SV022	Groq	Groq products
SV023	Cerebras	The Future of AI is Wafer Scale
SV024	AIMultiple	Top 25+ AI Chip Makers: NVIDIA & Its Competitors
SV025	Forbes	AI Inference Is King: Do You Know Which Chip Is Best?
SV026	Silicon Analysts	NVIDIA AI accelerator market share 2024-2026
SV027	Intuition Labs	Private LLM Inference: Key Hardware and Integrators for Enterprise
SV028	Yahoo Finance / GlobeNewswire	AI inference - Company Evaluation Report, 2025	However, concerns around data security and supply chain disruptions remain persistent challenges for companies operating in the AI inference space.
SV029	Securities and Exchange Commission	NVIDIA Corporation Form 10-K for fiscal year ended January 25, 2026
SV030	Securities and Exchange Commission	Advanced Micro Devices, Inc. Form 10-K for fiscal year ended December 27, 2025
SV031	Securities and Exchange Commission	Advanced Micro Devices, Inc. Form 10-Q for quarter ended March 28, 2026
SV032	Securities and Exchange Commission	NVIDIA Corporation Form 10-Q for quarter ended April 26, 2026
SV033	Securities and Exchange Commission	NVIDIA SEC submissions JSON
SV034	Securities and Exchange Commission	AMD SEC submissions JSON
SV035	Securities and Exchange Commission	NVIDIA SEC companyfacts JSON
SV036	Securities and Exchange Commission	AMD SEC companyfacts JSON