VAST Data
High-growth AI data-platform vendor with a differentiated stack and a $30B valuation that outruns current public disclosure
VAST Data looks like a genuine AI infrastructure winner with real scale, strong ecosystem proof, and differentiated data-platform technology, but the current $30 billion valuation still requires private diligence on revenue quality, concentration, and cap-table terms before new money can be underwritten confidently.
Cover facts
Company profile
VAST Data is a private AI infrastructure company founded in 2016 that has expanded from a high-performance all-flash storage platform into what it now calls an AI Operating System. The platform combines DataStore, DataBase, DataEngine, and DataSpace under the DASE architecture to unify storage, database, and compute for enterprise, neocloud, and sovereign-AI workloads. Public evidence supports meaningful scale: VAST disclosed more than $4 billion in cumulative bookings, more than $500 million in committed ARR exiting the prior fiscal year, and positive operating margin plus free cash flow before its 2026 Series F. The company now sits at a $30 billion private valuation, with strong ecosystem validation from customers, OEMs, and NVIDIA-linked deployments, but still provides limited public disclosure on concentration, exact revenue definitions, and fully diluted economics.
- Website
- vastdata.com
- Founded
- 2016-01-01
- Founders
- Renen Hallak, Jeff Denworth
- Founding location
- Tel Aviv, Israel
- Headquarters
- New York, NY, USA
- Product
- VAST sells an AI Operating System built on its DASE architecture, combining universal file/object/block storage, a transactional and analytical data lake, streaming and event-processing services, a global namespace, and partner-packaged AI infrastructure for large-scale enterprise and neocloud environments.
- Customers
- Large enterprises, AI clouds, research institutions, sovereign-AI operators, and other organizations running data-intensive AI, analytics, media, healthcare, and HPC workloads.
- Business model
- Software subscriptions and platform licenses sold through direct and OEM-led enterprise channels, paired with validated hardware bundles and long-duration infrastructure deployments that expand over time.
- Stage
- late-stage private
- Funding status
- Closed an approximately $1 billion Series F transaction in April 2026 at a $30 billion valuation after a $118 million Series E in 2023 at $9.1 billion; public evidence suggests the latest round mixed primary capital and secondary liquidity.
Executive summary
Top strengths
- Unified AI Operating System / DASE architecture gives VAST a differentiated full-stack data layer rather than a commodity array story.
- Publicly disclosed traction is unusually strong for a private infrastructure vendor: more than $4B of bookings, more than $500M of committed ARR, positive operating margin, positive free cash flow, and a $1.17B CoreWeave agreement.
- Customer, OEM, and NVIDIA-adjacent ecosystem proof spans AI clouds, telecom, media, healthcare, research, and sovereign-AI deployments.
Top risks
- At $30B, valuation remains expensive versus public storage comps unless true recurring revenue is much closer to high-end third-party estimates than to the company’s own disclosed ARR floor.
- Public evidence still does not reconcile ARR definitions or disclose top-customer concentration, renewal quality, or fully diluted Series F terms.
- VAST’s platform expansion increases dependence on NVIDIA, Cisco, OEM hardware partners, and flagship AI-cloud accounts such as CoreWeave.
- Legal/IP overhang around Red Stapler plus Israel-linked operating continuity and privacy/regulatory exposure remain real diligence items.
Open gaps
- Audited bridge among bookings, committed ARR, projected ARR, and any broader ARR-like revenue construct.
- Top-customer ARR, renewal cohorts, and concentration exposure after the CoreWeave and neocloud buildout.
- Post-Series-F cap table, liquidation preferences, option-pool impact, and exact primary-versus-secondary split.
- Current headcount, security/compliance certifications, and independently verifiable reliability/SLA metrics.
Contents
01Company Overview
1.1 Identity, Stage, and Product Scope
VAST Data is a private AI infrastructure company founded in 2016 that now presents itself as the “AI Operating System” for data-intensive applications rather than as a point-storage vendor. The core company story in retained sources is consistent on direction: VAST says it unifies storage, database, and compute in a single data platform, while independent coverage describes the same transition as a three-phase move from flash-oriented storage into database services and then into broader AI operating-system orchestration. Product materials describe VAST DataStore as a universal data store that removes traditional storage tiers and scales from terabytes to exabytes, while partner materials from Supermicro frame the platform as the data-management layer for enterprise AI and GPU cloud deployments. Headquarters is less cleanly disclosed than product scope. Official 2023–2026 press materials alternately use “No-Headquarters” or “Remote-First Company” while anchoring announcements to New York City, and Reuters-described coverage calls the company New York-headquartered. For later chapters, the safest ground truth is that VAST operates as a remote-first, New York-anchored private company with a US-facing commercial surface and Israeli operational roots reflected in later location reporting.[CO001, CO002, CO003, CO004, CO005, CO006]
| Metric | Value / Status | Date | Confidence | Gap / Caveat |
|---|---|---|---|---|
| Founded | 2016 | 2016 | Medium | Stable fact; full founder roster is less cleanly corroborated than the founding year |
| Headquarters / operating model | Remote-first with New York City anchoring in official releases; Reuters-described coverage says New York-headquartered | 2025-08 to 2026-04 | Medium | Official materials do not disclose a conventional single-HQ statement |
| Current stage | Private, late-stage, post-Series F | 2026-04-22 | Medium | IPO preparation discussed publicly, but no formal IPO process started |
| One-line product / model | AI Operating System that unifies storage, database, and compute; sold through software, OEM, partner, and channel routes | 2026-05-27 | Medium | No public pricing schedule in retained sources |
| Series F valuation | $30B | 2026-04-22 | Medium | Private valuation; not externally audited |
| Approximate total financing | ~$1.38B implied after Series F | 2026-04-22 | Medium | Derived from pre-F total plus reported F transaction size |
| Current ARR / CARR disclosure | > $500M prior-year committed ARR / CARR | 2026-04-22 | Medium | Company-claimed; no audited revenue statement |
| Profitability signal | Positive operating margin, free cash flow, and GAAP profitability reported | 2026-04-22 | Medium | Private-company self-disclosure, not audited |
| Employee scale | >700 employees disclosed in late 2023 | 2023-12-06 | Medium | Exact 2026 headcount unsupported in retained sources |
| Customer scale | Thousands of organizations claimed; named logos include CoreWeave, Booking, Zoom, Pixar, U.S. Air Force, JPMorganChase, and others | 2023-12 to 2026-04 | Medium | Exact paying-customer count not publicly disclosed |
| Locations | US, Israel, and London reported; expansion into APAC, Middle East, and Europe reported in 2023 | 2023-12-08 | Low | Current 2026 office list is not fully published in retained sources |
Mixes company disclosures, independent reporting, and one derived financing total; unsupported 2026 headcount, board, debt, and exact customer-count fields are shown as gaps rather than invented values.
[CO001, CO003, CO006, CO007, CO008, CO015]How VAST's identity, platform architecture, customer proof, capital stack, and partner routes connect to the current investment narrative.
[CO002, CO003, CO005, CO014, CO015, CO035]A disclosure-quality lens on which headline KPIs are current, historical, estimated, or still missing despite VAST's large private-market valuation.
[CO020, CO023, CO024, CO031, CO032, CO034]1.2 Founders, Leadership, and Governance Signals
Renen Hallak is the clearest single point of leadership continuity across the retained record. He appears as founder and CEO across official financing releases, product positioning, and press interviews, making him the most important reusable identity anchor for later diligence. Jeff Denworth is also directly corroborated as a co-founder and remains a visible product spokesperson in both company and third-party materials, especially around the evolution from unified data platform to full AI operating system. Finance leadership is also now material: Amy Shapero, VAST’s first CFO and formerly Shopify’s CFO, is explicitly tied to IPO readiness in both CRN and Reuters-reported coverage. That combination suggests a governance stack that is maturing toward public-market discipline even though the company has not started a formal IPO process. Cloud leadership adds a more complicated signal. Jonsi Stefansson joined via the Red Stapler acqui-hire and was publicly cited by VAST during the Polaris launch, but the same executive remains tied to NetApp litigation risk. Public governance disclosure remains thinner than funding disclosure: the retained sources confirm a Fidelity board observer role, but they do not provide a clean, current full board list, which is a carry-forward diligence gap rather than a reason to invent governance detail.[CO009, CO010, CO011, CO012, CO013, CO014]
| Person | Role / status | Evidence-backed background | Why it matters | Key-person dependency |
|---|---|---|---|---|
| Renen Hallak | Founder and CEO | Appears across official financing releases and long-form press interviews as founder/CEO and chief strategy voice | Owns company identity, capital narrative, and product positioning | Critical — public face of funding, platform scope, and IPO readiness |
| Jeff Denworth | Co-founder and public product spokesperson | Quoted by third-party coverage on the unified platform and Thinking Machine vision | Connects architecture story to go-to-market narrative and product evolution | High — recurring co-founder interpreter of the platform roadmap |
| Amy Shapero | First CFO | Joined from Shopify and is explicitly tied to IPO preparation | Financial controls and public-company readiness are now strategic topics | High — finance leadership is central if VAST moves toward public markets |
| Jonsi Stefansson | GM, Cloud Solutions | Joined through the Red Stapler acqui-hire after serving as NetApp CTO | Represents cloud-service expansion and multicloud operational capability | Medium — strategically useful, but cloud role is shadowed by litigation context |
Partial public leadership map only; retained sources do not provide a complete current board list or a full executive org chart.
[CO009, CO010, CO011, CO012, CO013, CO014]1.3 Funding History, Valuation, and Public Scale Metrics
VAST’s financing arc is unusually steep even by AI-infrastructure standards. The company went from an $83 million Series D at a $3.7 billion valuation in May 2021 to a $118 million Series E at a $9.1 billion valuation in December 2023, and then to a roughly $1 billion Series F transaction at a $30 billion valuation in April 2026. The Series F round matters for three reasons beyond price. First, it included both primary and secondary capital, so some of the round monetized existing holders rather than simply funding balance-sheet growth. Second, the syndicate now blends financial investors such as Drive Capital, Access Industries, Fidelity, and NEA with NVIDIA as a strategic participant. Third, the company paired the valuation step-up with unusually strong company-claimed operating metrics: more than $4 billion in cumulative bookings, more than $500 million in prior-year committed ARR, and positive operating margin and free cash flow. Those are powerful signals but still not substitutes for audited financial statements. Public scale disclosure is uneven. Late-2023 sources support more-than-700 employees and offices in the US, Israel, and London, while 2026 exact headcount, customer count, and any debt or credit instruments remain unsupported in the retained source pack.[CO016, CO017, CO018, CO019, CO020, CO021]
| Stakeholder | Role | Control or economic importance | Diligence ask |
|---|---|---|---|
| Drive Capital | Series F lead investor | Led the 2026 round that set the current $30B valuation marker | What rights, governance influence, or performance milestones came with the F round? |
| Access Industries | Series F co-lead | Co-led the 2026 round and helps validate the scale-up syndicate | How active is Access in governance versus purely financial participation? |
| Fidelity Management & Research | Series E lead; Series F participant; board observer at Series E | Repeated capital provider and one of the few public governance signals | Has Fidelity's observer status changed as the company approaches IPO readiness? |
| NEA | Series E participant; quoted strategic backer | Large brand-name growth investor with public endorsement of AI GPU thesis | Does NEA hold a formal board seat or other governance rights not publicly disclosed? |
| NVIDIA | Strategic investor and technical partner | Appears in funding syndicates and partner architecture narratives across AI data pipelines | How much of VAST's market pull depends on NVIDIA ecosystem alignment? |
| Tiger Global Management | Series D lead investor | Backed the 2021 valuation reset to $3.7B | What remains of Tiger's position after subsequent rounds and secondaries? |
| Employees and early investors | Secondary liquidity beneficiaries | Series F and some Series E reporting point to liquidity alongside primary financing | How much of headline funding was balance-sheet capital versus liquidity for prior holders? |
Publicly disclosed stakeholder map only; omits undisclosed investors, ownership percentages, liquidation terms, and debt because retained sources do not provide them.
[CO016, CO020, CO021, CO022, CO027, CO028]1.4 Milestones, Partnerships, and Adverse Context
The milestone record shows why later chapters should treat VAST as more than a storage vendor. After founding in 2016, the company spent its first phase establishing a high-performance storage base, then layered database and compute functions on top, and by 2026 was launching control-plane and agentic-computing features rather than only hardware-adjacent software. The partner map is also central to the story. VAST’s customer and channel footprint runs through AI clouds, OEMs, and strategic technology partners, with named relationships across CoreWeave, Google Cloud, Lambda, Core42, Supermicro, HPE, Cisco, and NVIDIA-linked deployments. That ecosystem expands reach, but it also creates dependence on partner routes and hyperscale AI-capex cycles. The main adverse item worth preserving is the Red Stapler/NetApp dispute. VAST was not named as a defendant, and NetApp’s Florida case was dismissed on venue grounds, but the episode still introduced reputational and integration risk precisely while VAST was raising another mega-round. For ground-truth purposes, the legal issue is not thesis-breaking on current evidence; it is a real diligence memory item that should stay live until related Icelandic or appeal processes are clearly resolved.[CO035, CO036, CO037, CO038, CO039, CO040]
| Date | Event | Type | Amount / valuation / status | Participants | Implication |
|---|---|---|---|---|---|
| 2016 | VAST Data founded | founding | Company formation | Renen Hallak and other early founders | Start of the company's transition from storage thesis to broader data-platform ambition |
| 2021-05-04 | Series D closes | financing | $83M at $3.7B post-money | Tiger Global, NVIDIA, existing investors | Established VAST as a high-value independent infrastructure company |
| 2023-04 | Strategic partnership with HPE highlighted in later Series E materials | partnership | HPE GreenLake file-storage integration | VAST Data and HPE | Marked a route into enterprise file and AI data infrastructure via partner distribution |
| 2023-05 | NVIDIA DGX SuperPOD certification highlighted in later Series E materials | partnership | Certification milestone | VAST Data and NVIDIA | Strengthened technical credibility inside GPU-centric AI deployments |
| 2023-12-06 | Series E closes | financing | $118M at $9.1B valuation | Fidelity, NEA, BOND, Drive | Tripled valuation versus Series D and funded expansion into broader AI platform scope |
| 2025-06-10 | Independent reporting surfaces a new valuation target before the next round | scale | Targeting ~ $25B valuation | TechCrunch sources, VAST management context | Showed investor appetite and repricing before the eventual Series F |
| 2025-11-19 | Red Stapler / NetApp dispute becomes public | adverse | Litigation and integration overhang | NetApp, Jonsi Stefansson, VAST Data | Created reputational diligence risk while VAST was fundraising |
| 2026-01-26 | VAST Amplify launches | product | Up to 6x effective SSD capacity claimed | VAST Data | Showed VAST broadening from storage hardware economics into AI-infrastructure efficiency programs |
| 2026-02-25 | Polaris launched at VAST Forward | product | Global control plane available with expanded capabilities planned | VAST Data | Extended the company narrative from storage/data platform into multicloud AI orchestration |
| 2026-04-22 | Series F closes | financing | ~$1B transaction at $30B valuation | Drive Capital, Access, Fidelity, NEA, NVIDIA and others | Reset the company into the top tier of private AI infrastructure valuations |
This is the public chronology of record, not an exhaustive internal history; exact dates for some 2023 partnership milestones come from later retrospectives rather than standalone retained press releases.
[CO001, CO016, CO020, CO022, CO027, CO028]Public chronology from founding through the April 2026 Series F close, showing the company's shift from storage infrastructure into an AI operating system platform with one notable legal overhang.
[CO001, CO016, CO020, CO022, CO027, CO028]1.5 Exhibits
02Market Analysis
2.1 Market boundary, adjacencies, and substitutes
VAST is not best framed as a generic AI-infrastructure company. Its own positioning and white paper place it in the shared data layer that sits between raw storage media and higher-level AI application software: a unified platform for file, object, block, tabular, vector, and streaming data that can serve training, inference, analytics, and HPC workloads from a common control plane. That means the core market boundary includes high-performance shared storage, metadata and vector retrieval, global namespace and governance, and the software needed to keep those data services available across on-prem, cloud, and edge environments. It does not include GPU compute, networking fabric, general-purpose application software, or large swaths of generic IT services spend. The practical competitive frame is therefore narrower than total AI infrastructure but broader than a traditional NAS array. VAST’s own solution pages push into adjacent analytics, compliance, backup, container, and data-lakehouse use cases, while the white paper describes replacing separate object stores, analytics clusters, and orchestration layers with one data-centric platform. Status-quo substitutes remain incumbent all-flash arrays, scale-out NAS, object stores, and multi-product stacks assembled from discrete storage, database, and orchestration tools. The market-definition question matters because using a broader AI-infrastructure headline as TAM would overstate the spend pool VAST can realistically capture, while using only classic external-storage categories understates the software and orchestration layer VAST is trying to sell.[CM001, CM003, CM004, CM005, CM007, CM008]
| Segment / boundary | Included spend | Excluded spend | Buyer / payer | Relevance |
|---|---|---|---|---|
| Core served market | Shared AI data infrastructure: high-performance file, object, block, metadata, vector retrieval, governance, and global namespace services | GPU compute, networking, generic AI software, consulting services | CIO / infrastructure or platform budget owner; users are storage, ML, and data-platform teams | Direct |
| AI factory adjacency | Integrated storage plus OEM compute bundles, deployment software, reference architectures, and NVIDIA-aligned data services | Raw accelerator silicon spend and broad data-center construction | Central AI platform or infrastructure sponsor | Adjacent / partner-led |
| Data platform adjacency | Lakehouse, analytics, event, compliance, and backup workloads that land-and-expand into the same flash data fabric | Generic SaaS analytics seats and non-data application software | CDO / data platform leader or security / resilience owner | Adjacent |
| Excluded broad AI infrastructure | None beyond the shared data layer | Hyperscaler capex, servers, networking, power, cooling, and general AI services | Broader enterprise or provider capex pools | Too broad for TAM |
| Status-quo substitutes | Incumbent all-flash arrays, scale-out NAS, object stores, and stitched multi-product stacks | Not a separate spend pool; these are replacement paths | Storage, HPC, and platform operators | Compete for same workload |
Boundary logic combines VAST official positioning with public market-report definitions; relevance distinguishes direct spend from adjacency and explicit exclusion.
[CM001, CM003, CM005, CM007, CM008, CM021]2.2 Sizing lenses and methodology conflict
Public market data supports a range of defensible lenses, not a single clean TAM. At the narrow end, IDC’s external OEM enterprise storage forecast reaches about $37.7 billion in 2026 and is explicitly tied to branded storage systems, with all-flash arrays benefiting from AI and analytics demand. Fortune Business Insights places AI-powered storage at $44.94 billion in 2026, a broader cut that includes storage architectures optimized for AI workloads and separates file- and object-based approaches. Research and Markets and Coherent Market Insights both put broad AI infrastructure at roughly $90 billion in 2026, but those reports bundle much more than storage, spanning offerings, functions, technologies, deployment models, and end users. Gartner’s $1.366 trillion AI-infrastructure number is broader still and is only useful as context for how much total ecosystem capex can surround VAST’s category. For valuation work, the correct conclusion is not that one of these numbers is right and the rest are wrong. The right conclusion is that VAST sits inside several nested spend pools, and each publisher chooses a different boundary. The most defensible public framing is an evidence-constrained core lens bounded by enterprise external storage on the low side and AI-powered-storage categories on the high side, with broader AI-infrastructure figures treated as adjacency rather than TAM. Public sources do not disclose enough VAST-specific pricing, customer, or win-rate data to isolate a clean SAM or SOM, so contradictory estimates and methodology limits should be preserved rather than averaged away.[CM009, CM011, CM013, CM017, CM018, CM019]
| Publisher / lens | Year / geography | Value | CAGR / growth | Methodology | Confidence | Limitation |
|---|---|---|---|---|---|---|
| IDC external OEM enterprise storage | 2026 / global | $37.7B | 6.3% YoY | Branded external enterprise storage systems; AI and analytics demand, all-flash mix | Medium | Excludes hyperscaler self-build and non-OEM infrastructure |
| Gartner AI infrastructure | 2026 / global | $1,366.4B | 44% total AI spending growth | All AI infrastructure spend in Table 1 context; not storage-only | Medium | Far too broad to use as VAST TAM |
| Fortune AI-powered storage | 2026 / global | $44.94B | 25.2% CAGR to 2034 | AI-optimized storage systems across SAN, NAS, file, object, and end-user segments | Medium | Publisher scope is broader than shared enterprise AI storage |
| Research and Markets AI infrastructure | 2026 / global | $90.91B | 25.7% CAGR to 2030 | AI infrastructure segmented by offerings, function, technology, deployment, and end user | Medium | Bundles storage with broader infrastructure categories |
| Coherent AI infrastructure | 2026 / global | $90.0B | 24.0% CAGR to 2033 | Broad AI infrastructure with 54% hardware, 46% on-prem, 48% enterprise, and 40% North America | Medium | Category is infrastructure-wide, not VAST-specific |
| Evidence-constrained VAST core lens | 2026 / global | ≈$38B-$45B | n/a | Bounded by IDC storage floor and Fortune AI-storage ceiling for shared data infrastructure | Low | Still not a company-specific SAM or SOM because pricing, mix, and win rates are private |
All values are publisher-stated 2026 spend lenses except the final bounded estimate, which is an analytical bridge between narrow storage and broader AI-storage scopes rather than a true SAM.
[CM009, CM011, CM013, CM017, CM018, CM019]Nested spend pools from total AI infrastructure to the narrower shared AI data layer VAST plausibly serves.
The bottom layer is intentionally qualitative because no public source isolates VAST-specific SAM or SOM.
[CM009, CM011, CM013, CM017, CM018, CM020]Published 2025-2026 spend ranges, all in USD billions, showing why VAST market sizing must preserve scope differences.
Rows share a common unit but not a common market boundary; the figure is a scope comparison, not an additive TAM stack.
[CM009, CM011, CM013, CM018, CM019]2.3 Buyer, user, payer, and adoption path
The buyer map is segment-dependent, but the pattern is consistent: technical champions emerge from infrastructure, data-platform, HPC, or AI-engineering teams, while budget approval and risk review move upward into CIO, procurement, finance, and broader buying committees. Market reports and vendor reference architectures show why. Fortune’s end-user cuts point to enterprises, CSPs, telecom, and government as distinct demand pools. Coherent’s on-prem bias for sensitive sectors, plus VAST’s own enterprise AI-factory framing, suggests regulated enterprises and public-sector or HPC users remain important for buyers that need control, governance, and data locality. In these accounts, storage administrators, ML-platform teams, or research-computing leaders may specify the architecture, but they rarely buy alone. IDC says procurement, finance, and revenue-operations functions have become central decision-makers, while Forrester says the typical complex purchase now involves 13 internal stakeholders and nine external influencers. That governance-heavy buying process also explains the adoption path. Trials and referenceability matter because buyers must prove business value before scaling. Forrester says more than 60% of buyers use a trial and 78% of buyers above $10 million do so. Supermicro and VAST sell a turnkey route from design to deployment, while VAST’s reference architecture emphasizes starting small and scaling with demand. The practical funnel is therefore problem discovery, architecture validation, proof of value, initial production cluster, and then broader rollout across more workloads or sites. What is still missing publicly is the actual conversion rate through that funnel for VAST accounts.[CM016, CM018, CM022, CM026, CM029, CM031]
| Segment | Buyer | User | Payer / budget owner | Workflow | Adoption trigger |
|---|---|---|---|---|---|
| Enterprise AI factory | CIO, infrastructure VP, or central AI platform sponsor | ML platform, storage, and data engineering teams | Central infrastructure or transformation budget | Unified training / inference / RAG stack | Need to keep GPUs productive and shorten time-to-token |
| Regulated enterprise / sovereign data | CIO plus security, compliance, and procurement | Platform engineering and governed data teams | Core IT plus risk-reviewed capex | On-prem or hybrid AI deployment | Need control, auditability, and data-locality governance |
| Cloud service provider / GPU cloud | Platform or infrastructure GM | SRE, storage, and ML serving teams | Infrastructure capex owner | Shared multi-cluster data layer for AI services | Need scalable shared storage and fast rollout economics |
| Research / HPC / public sector | Research computing director or institutional IT lead | HPC admins, scientists, and data managers | Project, grant, or institutional infrastructure budget | Checkpoint-heavy training and analytics | Need high throughput, consistency, and manageable scale |
| Data / analytics expansion | CDO or data platform leader | Data engineering and analytics teams | Central data-platform budget | Lakehouse, event, compliance, and analytics workloads | Need one flash data fabric across structured and unstructured data |
Buyer and payer roles combine public 2026 buying-group research with VAST and partner deployment framing; public sources do not disclose VAST’s actual segment revenue mix.
[CM016, CM018, CM026, CM029, CM031, CM032]Role, deployment-bias, and governance-readiness matrix across the main VAST-adjacent buyer segments.
Cells are qualitative role assignments synthesized from segment definitions plus public evidence on governance-sensitive deployment bias and hybrid control-plane needs rather than from VAST-disclosed customer mix.
[CM018, CM026, CM029, CM031, CM032, CM043]Qualitative enterprise buying path from problem recognition to scaled production deployment.
The figure is a qualitative value-chain map because VAST does not publicly disclose stage-conversion rates.
[CM026, CM027, CM029, CM031, CM032, CM033]2.4 Growth drivers, adoption constraints, and diligence gaps
Demand drivers are real and multi-layered. IDC’s storage forecast, VAST’s own AI-factory messaging, and the reference architecture all point to a common theme: more AI production workloads mean more shared data infrastructure, more all-flash demand, and higher sensitivity to time-to-token and GPU utilization. Deloitte shows enterprise AI activity moving from pilot to production, while Cloudera describes 2026 as the shift from experimentation to operational intelligence orchestration. NetApp and Pure both underline that incumbents are already monetizing the same need for AI-ready, accessible enterprise data. In other words, VAST is selling into a real category, not a purely theoretical one. But the constraints are equally material. Gartner says enterprises remain ROI-sensitive and often prefer incumbent providers during the current AI cycle. Informatica and Deloitte both show governance, data quality, and skills gaps slowing scale-out. Electronic Design and SNIA treat storage as a first-order design constraint because it affects performance, power, cooling, and deployment risk, while Avnet shows that SSD and DRAM tightness can push up cost and extend supply risk through 2027. INFUSE adds a buyer-trust problem: more information does not necessarily produce more purchase confidence. The combination means VAST’s valuation should not assume frictionless adoption. The biggest unresolved diligence questions are company-specific rather than market-wide: pricing, customer count, workload mix, trial-to-production conversion, and how much of the broad AI capex wave actually lands in the shared data layer VAST serves.[CM010, CM012, CM024, CM028, CM034, CM036]
| Driver / constraint | Direction | Timing | Implication | Diligence ask |
|---|---|---|---|---|
| AI data growth and all-flash demand | Driver | Current | Supports persistent demand for high-throughput shared storage | How much of VAST bookings come from AI production workloads versus legacy storage refresh? |
| GPU utilization and time-to-token pressure | Driver | Current | Favors platforms that reduce data bottlenecks and keep accelerators busy | Show benchmark or customer proof on time-to-token and GPU utilization gains |
| Operational AI moving beyond pilots | Driver | 2026 | Expands demand from PoCs into standardized data-platform purchases | What percentage of VAST pipeline is production expansion versus first-time pilot? |
| Procurement and finance ROI scrutiny | Constraint | Current | Lengthens sales cycles and shifts power away from technical champions alone | What ROI framework and payback period does VAST use in live deals? |
| Governance and data-quality gaps | Constraint | Current | Raises adoption friction in regulated and agentic-AI use cases | What governance, lineage, and policy controls are used most often to win regulated accounts? |
| Buyer trust gap and trial dependence | Constraint | Current | Increases need for references, external validation, and PoC conversion support | What is VAST’s referenceability and trial-to-production conversion by segment? |
| Memory / SSD tightness and capex inflation | Constraint | 2026-2027 | Can compress ROI or delay deployment timing even when demand is present | How exposed are VAST BOMs and pricing to DRAM / SSD inflation? |
| Incumbent and channel power | Constraint | Current | Large buyers may default to NetApp, Pure, Dell, or bundled OEM stacks unless differentiation is proven | Where has VAST displaced incumbents at scale and what workload won the deal? |
Rows tie public 2026 demand signals to concrete adoption friction and due-diligence questions; several constraints require company-specific internal data to quantify precisely.
[CM010, CM012, CM024, CM028, CM033, CM034]2.5 Exhibits
03Competitors
3.1 Landscape, Peer Classes, and Substitute Paths
VAST’s competitive field is wider than “other flash array companies.” Coldago’s 2025 file-storage maps and StorageNewsletter’s methodology summary place VAST in overlapping enterprise and high-performance storage frames with Pure, NetApp, IBM, DDN, Qumulo, Hammerspace, and WEKA. The direct peer set is therefore AI-first file/object or high-performance data-platform vendors, not just one vendor with a similar appliance shape. The next ring is public-storage incumbents. Pure and NetApp now sell consumption-oriented commercial models on top of broad product portfolios, while IBM still positions Storage Scale as a software-defined AI/HPC and hybrid-data platform. The adjacent ring matters too: Hammerspace and Qumulo push orchestration, global namespace, and hybrid-cloud mobility, which can let a buyer de-emphasize VAST even without copying its DASE architecture. Substitutes are real rather than theoretical. AWS FSx for Lustre, S3, Azure Managed Lustre, and Google Filestore expose usage-based pricing, native billing, and existing cloud contracts. Internal-build and status-quo options also persist because Ceph and MinIO can unify key storage semantics without handing more leverage to a single proprietary vendor. The net result is a market where VAST sells into direct product rivalry, incumbent bundling, orchestration alternatives, hyperscaler-native substitutes, and open-source anti-lock-in narratives at the same time.[CP001, CP002, CP003, CP006, CP011, CP015]
| Competitor / class | Category | Scale / funding signal | Target customer | Product scope / strategic direction | Limitation / risk |
|---|---|---|---|---|---|
| Pure Storage | Public incumbent | FY2025 revenue surpassed $3B | Enterprises standardizing on file/object and AI data pipelines | FlashBlade plus Evergreen//One combine unified file/object storage with as-a-service packaging | Incumbent breadth can dilute AI-specific differentiation and pricing remains negotiated |
| NetApp | Public incumbent | Public incumbent; retained pages emphasize broad hybrid-cloud portfolio and Keystone STaaS | Hybrid-cloud enterprises wanting ONTAP continuity and contract flexibility | Unified storage portfolio across on-prem and public cloud with pay-as-you-go Keystone | Public list pricing is still limited and product sprawl can weaken a focused AI-performance story |
| WEKA | Direct AI-performance peer | $140M Series E at $1.6B valuation; 2026 partner and NVIDIA push | AI factories, HPC, and multi-tenant GPU environments | Software-defined NeuralMesh architecture focused on AI performance, multi-tenancy, and partner-led AI factories | Private-company economics stay partly opaque and broader enterprise distribution is still being built |
| DDN | Direct AI-performance peer | Private scale claims include 8/10 leading automotive firms and 7/10 top banking/securities firms | AI factories, sovereign AI, hyperscalers, regulated and HPC-heavy accounts | Data intelligence platform with Infinia and other AI-focused offerings across inference, sovereign AI, and scale-out workloads | Some strongest claims are self-authored and public pricing or capital structure is not refreshed in retained evidence |
| IBM Storage | Public incumbent / software-defined alternative | Global public incumbent; Storage Scale System 6000 is NVIDIA-certified and throughput-heavy | Regulated enterprises wanting software-defined AI/HPC data services with existing IBM relationships | Storage Scale plus Scale System sell file/object unification, content-aware intelligence, and GPU-ready performance | IBM’s broader portfolio can slow sharp AI-only messaging and pricing remains quote-led |
| Hammerspace | Adjacent orchestration challenger | $100M round at $500M+ valuation; TechCrunch names Meta and DoD as customers | GPU-rich environments wanting data locality, orchestration, and hybrid-cloud movement | Tier 0 local-NVMe architecture plus global namespace and policy-driven placement | Distribution is smaller than incumbents and its most aggressive VAST critiques are competitor-authored |
| Qumulo | Adjacent hybrid-cloud challenger | Private; retained pages emphasize NPS, exabyte cloud scale, and managed Azure route rather than refreshed funding | File-heavy enterprise and cloud workloads needing global namespace and managed cloud options | Modern file/object platform with Cloud Data Fabric, Azure Native managed service, and hybrid-cloud deployment paths | Scale, funding, and realized pricing remain less visible than for public incumbents |
| Hyperscaler-native substitutes | Substitute / status-quo extension | Existing cloud contracts and published usage constructs | Teams already anchored in AWS, Azure, or Google Cloud | Managed filesystems and object services with transparent usage billing and native procurement rails | Data movement, idle-capacity costs, and performance tuning can still make cloud “cheap” storage expensive in practice |
| Internal build / open-source | Substitute / anti-lock-in path | No vendor equity value; economics shift to hardware, labor, and operations | Platform teams willing to own storage engineering and integration | Ceph and MinIO show unified or exabyte-scale semantics without proprietary vendor lock-in | Operational burden, tuning complexity, and slower time-to-value raise execution risk |
Scale/funding cells use only what the retained public corpus exposed; private-company rows preserve when funding, revenue, or pricing remain undisclosed.
[CP003, CP005, CP006, CP007, CP010, CP012]VAST sits high on AI-performance specialization, but incumbents and hyperscalers still dominate procurement leverage while orchestration challengers pressure the control-plane narrative.
Scores are ordinal synthesis from retained evidence, not benchmark output. The x-axis weights AI-performance and hot-data specialization; the y-axis weights installed-base, contract leverage, and route-to-market power.
[CP005, CP010, CP015, CP020, CP027, CP035]3.2 Competitor Profiles and Capability Tradeoffs
The retained profiles show three distinct ways to attack the same budget. Pure and NetApp compete from the incumbent end by combining multiprotocol storage breadth with packaging that looks safer to procurement teams already standardizing on public-company vendors. WEKA, DDN, and IBM compete by arguing that AI performance, GPU efficiency, or scale-out data services matter more than appliance familiarity. Hammerspace and Qumulo instead lean into orchestration and data locality: Hammerspace’s Tier 0 story is that local NVMe plus shared access beats external storage bottlenecks, while Qumulo emphasizes global namespace, cloud-native deployment modes, and managed Azure routes. VAST’s public story is broader than raw storage hardware because it now claims an AI operating system spanning storage, database, and compute. That broader story is helpful, but the competitive downside is obvious: once VAST says it is the data and execution substrate for agentic AI, it is no longer compared only with filers. It is compared with every vendor claiming unified data services, every orchestrator promising data-local execution, and every cloud substitute offering transparent metering plus existing contracts. Capability therefore matters on two axes at once: the depth of the hot data plane for AI workloads, and the breadth of workflow, namespace, deployment, and procurement options around it.[CP001, CP003, CP006, CP008, CP011, CP013]
| Buying criterion | VAST | Pure | NetApp | WEKA | DDN | IBM | Hammerspace | Qumulo | Hyperscaler / internal build |
|---|---|---|---|---|---|---|---|---|---|
| Unified file + object posture | Strong | Strong | Strong | Moderate | Moderate-Strong | Strong | Moderate | Strong | Mixed |
| AI / HPC performance specialization | Very strong | Strong | Moderate | Very strong | Very strong | Strong | Strong on locality | Moderate | Mixed |
| Global namespace / orchestration | Moderate | Moderate | Moderate-Strong | Moderate | Moderate | Moderate | Strong | Strong | Mixed |
| GPU-locality / Tier 0 angle | Moderate | Weak-Moderate | Weak-Moderate | Strong | Strong | Moderate | Very strong | Weak | Weak-Mixed |
| Managed-service / cloud route | Unknown / sales-led | Moderate via Evergreen | Strong via Keystone | Moderate | Unknown | Moderate | Unknown | Strong in Azure | Very strong |
| Public pricing transparency | Unknown | Partial | Partial | Unknown | Unknown | Unknown | Unknown | Unknown | Strong |
| Procurement / compliance surface | Moderate | Strong | Strong | Moderate | Moderate-Strong | Strong | Moderate | Moderate-Strong | Strong |
| Anti-lock-in narrative | Moderate | Weak | Weak | Moderate | Moderate | Moderate | Strong | Moderate | Strong |
Cells are evidence-backed qualitative ratings synthesized from retained public sources; “Unknown” means the public corpus did not expose enough supportable detail.
[CP001, CP003, CP006, CP008, CP013, CP015]The main split is not raw feature count alone; it is whether a vendor emphasizes hot-data performance, orchestration breadth, cloud procurement ease, or anti-lock-in positioning.
[CP017, CP025, CP028, CP034, CP035, CP037]3.3 Pricing Transparency, Switching Friction, and Distribution Power
Pricing visibility is one of the clearest structural differences in the corpus. AWS, Azure, and Google all publish explicit or calculator-driven usage constructs for file or object services, while specialist storage vendors mostly route buyers to sales, demos, or negotiated subscription constructs. That does not make hyperscaler-native substitutes cheaper in every real workload, but it does lower procurement friction and makes them easier to compare inside an existing cloud contract. Pure and NetApp narrow the gap further by wrapping their platforms in STaaS models rather than only capex appliance sales. VAST’s switching costs still exist because petabyte-scale datasets, tuned AI pipelines, and operational familiarity are sticky. But the evidence suggests those costs are softer than monopoly-style lock-in: buyers can multi-home across cloud object, managed filesystems, orchestration layers, and open-source tiers, and several alternatives sell explicit anti-lock-in or portability stories. Distribution magnifies the point. Public incumbents and hyperscalers already have large account control, while WEKA and Hammerspace are still investing to expand sales or partnerships. VAST therefore needs more than speed claims to stay differentiated; it needs the product, migration path, and commercial model to remain convincing once buyers compare them against cloud billing convenience and incumbent account leverage.[CP004, CP007, CP018, CP021, CP022, CP023]
| Offer | Commercial model | Public pricing visibility | Meter / unit | Implication |
|---|---|---|---|---|
| VAST Data | Custom enterprise quote / consumption options referenced in product navigation | Low | Negotiated | Pricing opacity raises diligence burden and reduces easy side-by-side comparisons |
| Pure Evergreen//One | Storage as a service with SLAs | Partial | Used capacity, service tier, performance | Consumption model narrows capex disadvantage versus specialists |
| NetApp Keystone | Hybrid-cloud STaaS | Partial | Usage tier, term, performance / capacity | Lets NetApp answer AI-storage bids with opex language and cloud bursting |
| WEKA | Software subscription / partner-led AI factory packaging | Low | Negotiated | High-performance story is strong, but buyers still need custom quotes |
| DDN / Infinia | Enterprise platform quote | Low | Negotiated | DDN competes on AI ROI and density rather than transparent list pricing |
| IBM Storage Scale | Software or appliance quote | Low | Negotiated | IBM can package software-defined and appliance options but public pricing is sparse |
| Hammerspace | Software-layer enterprise quote | Low | Negotiated | Can reduce external flash needs, but realized savings require validation |
| Qumulo | Managed Azure plus self-hosted AWS/GCP models | Low to partial | Managed service or self-hosted cloud footprint | Cloud routes improve procurement flexibility even though list pricing is still sparse |
| AWS FSx for Lustre | Pay as you go | High | Storage, throughput, metadata IOPS, backups, transfer | Transparent pricing plus existing contract makes it an easy benchmark substitute |
| AWS S3 | Pay as you go | High | Storage, requests, transfer and related features | Cheap colder tier but not a complete replacement for high-performance shared filesystems |
| Azure Managed Lustre | Pay as you go / quote calculator | High | Per GiB per month and hour, performance tier | Managed service lowers ops burden for HPC and AI teams already in Azure |
| Google Filestore | Provisioned cloud filesystem | High | Provisioned capacity and IOPS characteristics | Provisioned billing makes idle-capacity cost explicit |
| Ceph / MinIO internal build | Software plus self-operated infrastructure | Mixed | Hardware, cloud, and labor rather than vendor subscription | Avoids vendor lock-in but externalizes engineering and support burden |
Rates are not normalized into a common TCO model here; the table compares packaging and public visibility, not negotiated net price after discounting.
[CP004, CP007, CP022, CP023, CP024, CP026]3.4 Moat Durability, Multi-Homing, and Adverse Read-Throughs
VAST’s moat is real, but the retained evidence says it is conditional rather than absolute. Coldago still treats VAST as a leader, and VAST’s own scale narrative now extends to a $30 billion valuation and an AI-operating-system pitch. That supports the bull case: a vendor with credible scale, strong AI positioning, and a shared data plane can consolidate storage, metadata, and parts of the AI workflow. The adverse evidence matters because it attacks the exact same thesis. theCUBE argues that VAST’s AI-OS ambition is ahead of current platform maturity, while StorageMath says VAST’s data-reduction narrative overstates reality and nudges customers toward deeper ecosystem lock-in. Hammerspace’s competitive brief attacks VAST on Tier 0 and cloud-scale orchestration. Meanwhile, Pure and NetApp are not standing still; they already pair technical breadth with safer commercial packaging. Cloud substitutes keep pressuring the cost and convenience end of the decision, and internal-build options keep the anti-lock-in alternative alive. The realistic read is that VAST’s moat depends on staying meaningfully better on hot AI data performance and integrated workflow value while preventing buyers from deciding that incumbent packaging, orchestration overlays, or cloud-native substitutes are “good enough.” Multi-homing is the key release valve: if customers can keep VAST only for the hottest tier and solve the rest elsewhere, moat durability weakens.[CP018, CP027, CP028, CP029, CP030, CP031]
| Moat claim | Threat | Severity | Evidence / implication | Mitigation / diligence ask |
|---|---|---|---|---|
| AI-first shared data plane | WEKA and DDN also sell AI-native performance narratives | High | Workload overlap is broad across VAST, WEKA, DDN, IBM, and Pure | Test win rates by workload class rather than assuming one generalized AI-storage lead |
| AI Operating System consolidation story | Independent analysts say product maturity still lags lakehouse / hyperscaler equivalents | High | theCUBE treats the AI-OS ambition as ahead of current platform maturity | Validate production references for database, vector, and agent-runtime layers, not only storage |
| Data reduction and efficiency marketing | Adverse sources say Flash Reclaim / Amplify claims are overstated and increase lock-in | High | StorageMath directly attacks VAST’s data-reduction math and lock-in incentives | Demand workload-level proofs, not portfolio averages |
| Hot-data performance and density | Hyperscaler and incumbent packaging can still win if “good enough” is cheaper or easier to buy | High | Cloud substitutes publish pricing while incumbents add STaaS wrappers | Benchmark against total workflow cost including idle GPUs and migration, not just raw capacity |
| Namespace and multi-site control | Hammerspace and Qumulo emphasize orchestration and placement more explicitly | Medium-High | Adjacent challengers can sit beside VAST instead of replacing it outright | Check whether VAST can own the control plane, not only the hottest storage tier |
| Regulated-enterprise credibility | Public incumbents and hyperscalers expose broader procurement rails | Medium | Azure Government procurement language and DDN sovereign-AI messaging make trust a selection axis | Map certifications and public-sector buying paths side by side |
| Installed-base leverage | Pure and NetApp can cross-sell into existing accounts | High | Public incumbents pair broad portfolio coverage with STaaS packaging | Quantify displacement inside incumbent-heavy accounts before underwriting expansion |
| Low switching probability after land | Multi-homing and open-source alternatives keep exit paths open | Medium-High | Ceph, MinIO, cloud filesystems, and orchestration overlays reduce all-or-nothing dependency | Interview customers on how much data stays on VAST versus moves to secondary tiers over time |
This register is a judgment layer rather than a benchmark; severity reflects how directly each threat can compress VAST’s differentiation or pricing power.
[CP018, CP029, CP031, CP033, CP035, CP036]Public numeric markers show that VAST faces competitors and substitutes with meaningful capital, scale, or cost signals even before negotiated discounts enter the model.
Values mix different units and are decision aids, not a valuation model. Each item is included because it is public and directly relevant to competitive durability.
[CP005, CP010, CP014, CP016, CP019, CP044]3.5 Exhibits
04Financials
4.1 Revenue model, pricing, and recognition issues
VAST's monetization is best described as infrastructure software sold through enterprise and partner channels rather than as a pure self-serve storage product. The company markets a unified AI Operating System that combines storage, database, and compute, and older company material explicitly says Gemini is a software consumption model that disaggregates hardware from software economics. Public commercial pages emphasize contact-sales and demo motions, not self-serve pricing. That means there is enough evidence to identify the revenue mechanism, but not enough to observe realized price cards or discounting behavior directly. Cisco's Global Price List provides one verified partner procurement path, while Polaris being included at no extra cost suggests at least part of the control plane is bundled into the platform. The CoreWeave agreement demonstrates that VAST can land very large partner-linked commitments, but the public record still does not bridge how bookings, committed recurring revenue, and any usage or system-delivery components are recognized into a single revenue-definition stack.[CI001, CI002, CI003, CI004, CI005, CI006]
| stream | mechanism | unit | current value / status | quality | diligence ask |
|---|---|---|---|---|---|
| AI Operating System software | Unified storage, database, and compute software sold through enterprise contracts | contract / annual commitment | Core recurring layer is confirmed; exact software-only revenue mix is undisclosed | High for existence, medium for mix | Request product-level revenue mix and recognized recurring revenue by module. |
| OEM / channel-attached deployments | VAST AI OS sold with Cisco and other OEM hardware stacks | partner-procured deployment | Cisco GPL confirms channel procurement path; realized partner markup is undisclosed | Medium | Request partner price books, take-rates, and channel-sourced bookings share. |
| Neocloud / hyperscaler commitments | Large strategic contracts anchored in AI-cloud deployments | multi-year commercial agreement | CoreWeave agreement valued at $1.17B; concentration share is undisclosed | Medium | Request top-customer bookings, ARR, and renewal schedule. |
| Bundled control-plane services | Platform services such as Polaris included inside AI OS bundle | bundled feature | Included at no extra cost; no standalone service revenue disclosure | Medium for existence, low for margin effect | Request attach-rate and support-cost allocation for bundled services. |
| Installed-base expansion / optimization | Programs such as Amplify monetize better use of customer-owned flash rather than only new hardware | capacity optimization engagement | Commercial path exists, but pricing and revenue contribution are undisclosed | Low | Request revenue contribution and gross margin from installed-base expansion programs. |
| Support / services | Support, deployment, and delivery economics likely exist but are not separately disclosed | unknown | No public segment breakout found | Low | Request support revenue, service-delivery costs, and attach rates by customer tier. |
Revenue mechanisms are supportable, but public mix by stream is not. Recognition appears to blend partner-linked system economics, committed recurring software, and broader non-committed revenue constructs, so each row is a mechanism statement rather than an audited mix statement.
[CI001, CI003, CI004, CI005, CI006, CI018]| price / unit / contract | list vs realized pricing | discounts / unknowns | source / supportable signal |
|---|---|---|---|
| Gemini software consumption model | No public numeric list price | Hardware/software split and discount schedule undisclosed | Series D release and About page describe software consumption model. |
| Enterprise direct contract via contact sales | No public list price | All realized ASPs undisclosed | Public commercial pages route buyers to contact sales and demos. |
| Cisco GPL channel procurement | Available through partner price list, but public unit pricing not visible | Cisco markup and rebate structure undisclosed | StorageNewsletter confirms GPL availability and Cisco support. |
| Historical initial land size | Average initial investment about $1M in 2021 | Current ACV may be materially higher; no current floor disclosed | Series D release. |
| Top-100 new customer commitments | More than $1.2M on average | Distribution around average is not disclosed | TNW and Sacra reporting. |
| Large strategic agreements | $1.17B CoreWeave commercial agreement | Recognition cadence, duration, and backlog treatment undisclosed | Official CoreWeave announcement and CRN coverage. |
| Contract duration | Typically five to seven years in TNW reporting | Renewal structure and termination rights undisclosed | TNW reporting. |
VAST clearly has monetization mechanisms, but almost all realized price points remain private. The table therefore distinguishes supportable commercial constructs from missing realized-pricing data rather than pretending a public list card exists.
[CI002, CI003, CI004, CI007, CI017, CI024]How VAST converts AI infrastructure demand into booked and recurring revenue through software, partner hardware, and strategic cloud contracts.
The bridge is directionally source-backed but not fully reconciled. Public sources do not break out the exact share of system-linked, committed recurring, and broader non-committed revenue.
[CI001, CI003, CI004, CI005, CI006, CI012]4.2 GTM motion and sales-efficiency proxies
VAST's public GTM evidence points to a high-touch enterprise motion with unusually large average land sizes and strong expansion dynamics. In 2021 the company said initial investments averaged about $1 million and expansion averaged 328 percent; in FY22 it said NRR exceeded 300 percent. More recent third-party reporting suggests top-100 new customers spend more than $1.2 million on average and that contract duration often runs five to seven years. Those are strong proxies for durable enterprise revenue, especially when paired with Cisco GPL access and cloud-partner distribution into AI infrastructure budgets. The company also appears to benefit from giant lighthouse accounts: CoreWeave alone signed a $1.17 billion agreement and multiple outlets reference xAI and other AI-cloud customers. What the public record does not disclose is CAC, CAC payback, the average sales cycle, or partner take-rates. So the chapter can argue that VAST likely has very efficient land-and-expand dynamics, but it cannot convert those demand signals into a closed sales-efficiency model.[CI006, CI007, CI008, CI017, CI018, CI024]
Publicly visible unit-economics proxies from large initial lands and expansion through margin and cash-generation claims.
This figure uses public proxies, not a closed unit-economic model. CAC, payback, and cycle-time nodes remain qualitative because VAST does not disclose them publicly.
[CI007, CI008, CI017, CI021, CI024, CI033]4.3 Cost structure, gross margin drivers, and capital intensity
The strongest public margin signals point toward a software-led cost structure even though VAST sells into a hardware-heavy market. Company disclosures from 2022 and 2023 described the business as built on software gross margins and later cited nearly 90 percent gross margin with positive cash flow. The architectural and commercial model matters here: Gemini separates hardware and software economics, the white paper says OEMs such as Cisco and SuperMicro can sell standard-server VAST clusters, and Cisco now carries the platform on its GPL. Those factors imply that some inventory, manufacturing, and balance-sheet working-capital burden may sit with OEM partners rather than with VAST itself. At the same time, bundled features such as Polaris and installed-base programs such as Amplify mean that support, control-plane delivery, and optimization services are embedded in the platform economics but not separately disclosed. The result is a favorable-looking gross-margin path with lower apparent capital intensity than a proprietary-array vendor, but one that still lacks a current segment-margin bridge and working-capital disclosure.[CI003, CI019, CI020, CI021, CI022, CI023]
| metric | value / status | confidence | why it matters | diligence ask |
|---|---|---|---|---|
| Committed recurring revenue floor | > $500M CARR exiting prior fiscal year | medium | Most solid current recurring-revenue floor in public record | Provide ARR/CARR bridge and quarterly cohort view. |
| Historical ARR anchor | $200M ARR at end-2023 | medium | Provides base point for trajectory analysis | Confirm audited ARR for FY24 and FY25. |
| Expansion / retention proxy | 328% average expansion in 2021; >300% NRR in FY22 | medium | Shows exceptional historical land-and-expand behavior | Provide current NRR and GRR by cohort. |
| Gross margin signal | Nearly 90% gross margin disclosed in 2023; software-gross-margin framing repeated | medium | Supports software-like economics if mix has not deteriorated | Provide current blended and segment gross margins. |
| Average commitment size | About $1M initial land historically; >$1.2M average for top-100 new customers | low to medium | Large ACVs can offset long enterprise sales cycles | Provide ACV distribution and median deal size by segment. |
| Contract duration proxy | Often five to seven years in 2026 reporting | low | Long duration improves revenue durability and payback tolerance | Provide term distribution, renewal rights, and early termination data. |
| CAC / CAC payback | Not publicly disclosed | low | Critical for judging whether AI-demand growth is efficient or merely expansive | Provide CAC, payback, and quota productivity by channel. |
| Sales cycle / partner economics | Not publicly disclosed | low | Needed to understand procurement friction and channel leverage | Provide sales-cycle length, partner take-rate, and attach-rate data. |
This table intentionally mixes confirmed metrics with missing ones. VAST's public evidence strongly supports expansion quality and high gross-margin potential, but not a closed CAC/payback model.
[CI007, CI008, CI009, CI010, CI012, CI017]Qualitative map of which parts of the VAST model appear software-like versus where public disclosure still leaves capital exposure unclear.
This is a qualitative exposure map, not a cash-flow statement. Public sources reveal where economics likely sit, but not the exact size of each exposure.
[CI019, CI020, CI022, CI023, CI030, CI031]4.4 Public traction and capital adequacy
Public traction is unusually strong for a private infrastructure company, but capital adequacy still has to be read through partial disclosures. Official sources show a staircase from $200 million ARR at the end of 2023 to more than $500 million of committed ARR at the exit of the prior fiscal year, alongside $4 billion of cumulative bookings, positive operating margin, and free cash flow. Independent outlets stretch that picture much further, with Reuters-linked reporting discussing $600 million of ARR and Calcalist/TNW discussing up to roughly $2 billion of ARR-like revenue including non-committed components. That gap is not just noise; it tells investors that VAST's publicly quoted revenue metrics are using different definitions. On capital adequacy, the 2026 Series F valued the company at $30 billion and brought in about $1 billion of total transaction value, but public reporting indicates that a large share of that headline number was secondary liquidity. Fidelity's SEC marks independently corroborate the valuation step-up, yet the company still has not publicly disclosed cash on hand, burn, runway, debt, or financing triggers.[CI010, CI011, CI012, CI014, CI015, CI016]
| item | current value / status | implication | diligence ask |
|---|---|---|---|
| Series F transaction value | Approximately $1B total at $30B valuation | Shows deep investor appetite and ample headline financing capacity | Provide signed financing documents and total primary cash received by the company. |
| Primary vs secondary split | Officially mixed primary and secondary; outside reporting says >$500M and possibly most was secondary | Headline funding likely overstates fresh operating cash | Provide exact primary proceeds, secondary proceeds, and seller list. |
| Use of proceeds | Primary proceeds earmarked for global growth and strategic transactions | Signals expansion rather than rescue financing | Provide budgeted use-of-funds schedule and M&A reserve assumptions. |
| Profitability / capital dependence | Positive operating margin and free cash flow reported; CEO says external capital not required to operate | Supports capital adequacy, but still not a substitute for treasury data | Provide monthly cash flow statements and current cash balance. |
| Third-party valuation marks | Fidelity N-PORT marks rose from ~$1.08M to ~$3.51M for the same preferred-share basket between 2024 and 2026 | Independent portfolio marks corroborate valuation uplift | Provide latest investor marks and cap-table reconciliation. |
| Cash / burn / runway | Not publicly disclosed | Prevents a true downside runway analysis | Provide treasury pack and 12/24-month runway scenarios. |
| Debt / project finance obligations | Not publicly disclosed in reviewed sources | Cannot rule out hidden covenant or equipment-finance pressure | Provide debt schedule, covenant package, and any equipment finance agreements. |
This table focuses on forward capital adequacy rather than round chronology. The central issue is not whether VAST can raise capital; it is how much of the 2026 headline round actually strengthened the balance sheet and what undisclosed obligations still sit beneath the surface.
[CI026, CI027, CI028, CI029, CI030, CI031]Publicly visible numeric band for VAST's recurring-revenue and valuation signals, showing why underwriting still needs a metric-definition bridge.
The figure intentionally mixes committed ARR, third-party ARR-like estimates, and valuation marks because the core diligence problem is definitional inconsistency, not lack of directional growth.
[CI012, CI014, CI015, CI016, CI026, CI027]4.5 Financial verdict and diligence blockers
The financial read-through is favorable but not fully underwriteable. On the positive side, VAST combines software-like margin signals, multi-year contract evidence, very large average commitments, positive free cash flow claims, and exceptional demand from AI-cloud customers. That is a stronger revenue-quality profile than most late-stage infrastructure startups show publicly. The caution is that several of the most important inputs remain private: customer concentration, actual cash balance and runway, CAC and payback, realized pricing, debt obligations, and a reconciled bridge across bookings, CARR, and any broader ARR or revenue constructs. Large AI-cloud accounts also introduce concentration and renewal-timing risk precisely because they can move results so dramatically. The right underwriting stance is therefore not to dismiss the business model, but to separate what the public record can support from what still needs company disclosure. VAST looks financially strong enough to justify serious diligence, yet the evidence set still stops short of a closing-grade financial model.[CI030, CI036, CI037, CI038, CI039, CI040]
| missing private metric | impact on underwriting | exact diligence path |
|---|---|---|
| Bookings-to-revenue-to-ARR bridge | Without a metric-definition bridge, public revenue quality cannot be normalized against peers | Request quarterly bridge from bookings to recognized revenue to CARR / ARR, including non-committed revenue. |
| Current cash balance and runway | No true downside liquidity view is possible | Request treasury reports and board runway scenarios. |
| Debt / covenant package | Cannot assess hidden financing pressure or equipment-finance obligations | Request debt schedule, covenant summary, and legal confirmation of secured debt. |
| Customer concentration and renewals | Large AI-cloud deals could dominate risk and revenue timing | Request top-10 customer ARR/bookings and renewal calendar. |
| CAC, payback, and sales-cycle data | Sales efficiency remains qualitative, not model-ready | Request funnel conversion, CAC, payback, and sales-cycle metrics by channel. |
| Realized pricing and discount schedules | Public pricing mechanisms cannot be converted into realized ASPs or margin assumptions | Request price books, sample order forms, and partner pricing schedules. |
| Segment gross margins and support / services costs | Current margin path cannot be audited by stream | Request product-line and channel gross margin bridge plus support-cost allocations. |
These are the minimum missing inputs required to turn a strong public narrative into a real investment-grade financial model. The company looks commercially impressive, but the data room still has to supply the underwriting core.
[CI031, CI032, CI033, CI034, CI037, CI040]4.6 Exhibits
05Product & Technology
5.1 Product scope in customer workflow terms
VAST sells the platform as an AI Operating System rather than as a single storage SKU. In customer workflow terms, the retained record says the buyer is trying to collapse multiple stages of the data path onto one shared substrate: ingest data through file, object, streaming, or Kubernetes interfaces; catalog and query it through DataBase; trigger event-driven processing in DataEngine; and keep the same data visible across sites through DataSpace rather than copying it into separate silos. The AI reference architecture is explicit that the company wants one flash-native platform to cover data capture, refinement, model training, model serving, audit logging, and RAG-style retrieval. Kubernetes material makes that concrete at operator level, because the same cluster can be exposed either as NFS or as CSI-provisioned persistent volumes. The workflow value proposition is therefore not merely speed; it is reducing data movement, avoiding separate storage/database/streaming stacks, and presenting one control model to infrastructure teams.[CE001, CE002, CE003, CE004, CE005, CE007]
| module / asset | primary user / buyer | public status / maturity | what it does | differentiation | diligence gap |
|---|---|---|---|---|---|
| Universal Storage / DataStore | Storage architects, AI infrastructure teams, platform ops | Mature core; repeatedly documented since before the 2026 AI OS reframing | Shared file and object data plane presented over NFS, SMB, and S3 with the same underlying elements | One write path can be surfaced through multiple protocols rather than isolated silos | Need current public benchmark and reference-deployment detail by workload, not just architectural claims. |
| VAST DataBase | Data engineers, analytics teams, RAG pipeline owners | Documented module; materially expanded in 2024-2026 messaging | ACID tabular engine tied to catalog, pushdown, SQL analytics, and provenance workloads | Couples structured querying to the same platform that stores unstructured data | Need public GA docs and customer references for the newest GPU-accelerated database paths. |
| VAST DataEngine | Pipeline builders, platform engineers, AI application teams | Visible but still earlier than storage core in public documentation depth | Event-driven processing environment for functions, triggers, pipelines, topics, and compute resources | Turns the platform from passive storage into an execution surface | Need more public production examples and operational SLOs for DataEngine workloads. |
| DataSpace / Polaris control layer | Multi-site operators, cloud teams, governance owners | DataSpace is documented; Polaris is newer and more announcement-driven | Global namespace plus hybrid-control orchestration for distributed VAST environments | Makes hybrid and multi-site operation part of the product story rather than an external overlay | Need public admin docs, customer references, and GA-state clarity for Polaris. |
| AI OS accelerators (CNode-X, PolicyEngine, TuningEngine) | Enterprise AI platform teams, agentic-AI builders | 2026 launch-stage surface; not yet as deeply documented as platform core | GPU-accelerated compute path, inline policy enforcement, and closed-loop model tuning | Pushes VAST above storage into full-stack AI workflow control and acceleration | Need proof of customer production use, actual availability, and support boundaries. |
| Developer / operator tooling | SREs, automation teams, Kubernetes operators | Active and public | CSI driver, Helm chart, Python SDK, Go client, Terraform provider, and DataEngine CLI | Lowers integration friction and gives the product a visible practitioner surface | Need usage telemetry or download/install evidence beyond repository existence and activity. |
Status/maturity reflects depth of retained public evidence rather than internal shipment data. The core storage platform is better documented than the newest AI-OS control and acceleration layers.
[CE001, CE003, CE005, CE007, CE008, CE009]| user job | current workflow / pain | VAST solution | measurable / public benefit | limitation |
|---|---|---|---|---|
| Consolidate AI data ingestion and model-serving data paths | Teams copy data across separate archive, training, and inference stacks | Single flash-native platform for capture, refinement, model serving, and RAG retrieval | Public architecture frames reduced copying and one shared platform across pipeline stages | Public corpus is architecture-heavy; customer-specific ROI data is limited here. |
| Provide persistent storage to Kubernetes workloads | Native container storage is host-local or manually provisioned | Use VAST as NFS-backed storage or CSI-based dynamic provisioning | Official documentation shows both modes and official CSI distribution paths | No public broad-scale deployment metrics for CSI adoption were retained. |
| Run real-time analytics and vector retrieval on the same data plane | Query engines and vector systems often sit beside rather than inside storage | DataBase plus GPU-accelerated SQL and cuVS-backed vector retrieval on CNode-X | 2026 materials claim lower query time/cost and retrieval acceleration | The strongest numbers are company-issued benchmark claims, not independent tests. |
| Add block and streaming without adding a separate SAN or Kafka estate | Enterprises keep block workloads or event streams on separate islands | NVMe/TCP block plus Event Broker / Kafka-compatible ingest on the common platform | Public reports say block inherits snapshots, clones, replication, and QoS | Public evidence for production scale of the newest surfaces is still thinner than for file/object core. |
| Build cyber-resilient backup or governed data-retention workflows | Backup platforms often need separate immutable targets and added storage tiers | Commvault validates VAST as NFS or S3 target with WORM immutability options | Public partner doc supports disk-library and cloud-library usage with immutable copies | Validation exists, but this chapter did not retain broader multi-vendor backup references. |
Benefits are limited to what retained sources support directly. Independent deployment outcome data is strongest for integration existence, not for a broad public ROI benchmark set.
[CE001, CE002, CE009, CE021, CE023, CE024]How VAST is publicly positioned to carry data from ingestion through query, event processing, and governed AI use.
[CE001, CE002, CE005, CE007, CE008, CE009]5.2 Architecture and operating model
The architectural center of gravity is DASE, VAST's disaggregated shared-everything model. Public technical documents describe stateless compute nodes handling protocol and software services while persistent data and system state sit on NVMe-backed storage enclosures, which allows performance and capacity to scale independently. That matters because the platform pitch depends on all protocols seeing the same underlying data plane: DataStore exposes file and object interfaces, DataBase adds tabular and query services, Event Broker extends the platform toward Kafka-style streaming, and block access is now being added through NVMe/TCP. The architecture is also presented as deployment-flexible. The older reference architecture still shows classic CBox and DBox layouts, while current partner material from Supermicro and Cisco shows packaged solutions that keep VAST's software abstractions but adapt them to OEM hardware and AI-factory reference designs. The net read-through is that VAST's product is not a loose bundle of separate products; it is one layered software platform whose value depends on the shared data plane actually behaving consistently across multiple access methods and node topologies.[CE003, CE014, CE015, CE016, CE017, CE018]
| layer / component | role | dependency | key risk |
|---|---|---|---|
| Access protocols and clients | Present shared data through NFS, SMB, S3, SQL/table, CSI, and newer NVMe/TCP block interfaces | Client compatibility, protocol translation, and stable common metadata semantics | Newer block and streaming surfaces may lag the maturity of legacy file/object interfaces. |
| CNodes / CBoxes / CNode-X | Run VAST software services, protocol handling, AI accelerations, and management functions | CPU or GPU server supply plus OEM partner packaging from Cisco and Supermicro | Accelerated roadmap execution becomes tied to partner hardware and NVIDIA enablement. |
| DNodes / DBoxes / NVMe fabric | Hold persistent metadata and data state while exposing shared access across the cluster | NVMe flash economics, fabric design, and data-protection algorithms | Hardware topology and flash assumptions remain central to performance and rebuild claims. |
| DataBase and catalog layer | Provide ACID tables, catalog metadata, pushdown, analytics, and provenance support | Query-engine integrations, GPU libraries, and consistent object/file metadata capture | Public deployment proof is thinner for newest accelerated SQL claims than for architecture diagrams. |
| DataEngine and Event Broker | Run event-driven functions, triggers, pipelines, topics, and streaming workflows | Developer tooling, runtime support, and pipeline authoring ergonomics | DataEngine is visible publicly, but runtime maturity is harder to underwrite than storage maturity. |
| DataSpace and Polaris | Extend visibility, control, and orchestration across hybrid and multi-site deployments | Agent-based fleet management, cloud integrations, and consistent policy propagation | Public docs for Polaris still trail the ambition of the launch narrative. |
This is a qualitative operating-model map, not a packet-level spec. Risk cells focus on where public evidence is thinner than the product claim, especially for the newest layers.
[CE014, CE015, CE016, CE017, CE018, CE020]Publicly supportable layers of the VAST platform from multiprotocol access through shared data services, orchestration, and underlying DASE infrastructure.
[CE003, CE014, CE015, CE017, CE021, CE024]5.3 Deployment, integration, and roadmap maturity
VAST has enough public integration surface to look operable in modern infrastructure environments rather than merely demoable. The Juniper validated design exposes concrete cluster topology, VIP rebalancing behavior, GUI and CLI management, and a REST API under the hood. The Kubernetes knowledge-base article plus the CSI repo and Helm documentation show that container platforms are not an afterthought. The GitHub organization, Python SDK, Terraform provider, DataEngine CLI, and Go client together create a public operator toolchain that can be used for provisioning, querying, automation, and pipeline management. The roadmap, however, is moving faster than the deepest public documentation. Block storage and Event Broker were surfaced in 2025, and the 2026 narrative adds Polaris, CNode-X, PolicyEngine, TuningEngine, and open DataEngine blueprints. Those moves expand the buyer story from storage into orchestration and agentic-AI operations, but they also mean the newest layers are backed more by launch materials and commentary than by the kind of mature administrator documentation visible for the core platform and Kubernetes integrations.[CE009, CE010, CE011, CE012, CE013, CE020]
| date / stage | feature / milestone | public status | implication | source signal |
|---|---|---|---|---|
| 2025 launch | Native block storage via NVMe/TCP | Publicly announced and described in news coverage | Extends VAST from file/object/table into SAN-adjacent workloads and remote boot scenarios | DBTA and The Next Platform coverage |
| 2025 launch | Event Broker / Kafka-compatible streaming | Publicly announced and described in news coverage | Makes streamed data a first-class peer of file/object/table data on the same platform | DBTA and The Next Platform coverage |
| 2026 announcement | CNode-X accelerated server path | Announced with OEM route to market | Deepens NVIDIA dependence while materially broadening the AI-compute story | VAST press release, StorageNewsletter, Cisco/Supermicro references |
| 2026 announcement | Polaris global control plane | Announced and described by independent conference coverage | Expands VAST from storage/data layer into fleet orchestration across on-prem, neocloud, and public cloud | SiliconANGLE and TechArena |
| 2026 roadmap | PolicyEngine | Slated to roll out through / by end of 2026 | Inline governance could become a key differentiator for regulated agentic-AI workflows | SiliconANGLE and TechArena |
| 2026 roadmap | TuningEngine | Slated to roll out through / by end of 2026 | Moves VAST toward closed-loop model improvement inside the enterprise boundary | SiliconANGLE and TechArena |
| 2026 ecosystem push | Open DataEngine blueprints | Publicly announced | Suggests VAST wants reusable reference workloads, not just infrastructure primitives | VAST press release and StorageNewsletter |
2025 items are better grounded as released capability extensions, while several 2026 control-plane and agent-governance items still read as roadmap-stage or early-rollout features rather than deeply documented general-availability surfaces.
[CE022, CE024, CE025, CE027, CE029, CE030]5.4 Differentiation, dependencies, and technical risks
The supportable differentiation thesis is stronger on shared data-plane breadth than on every new AI-OS layer being fully mature today. VAST does have a real product distinction in the retained corpus: one platform claims multiprotocol file, object, table, streaming, and now block access; DASE is repeatedly positioned as the reason the platform can scale without the classical trade-offs of shared-nothing or tiered storage; and the NVIDIA collaboration brings GPU acceleration into SQL, vector search, retrieval, and long-context inference paths. That said, the same evidence surfaces meaningful dependencies. Accelerated deployments depend on NVIDIA libraries, DPUs, and networking assumptions; Cisco and Supermicro are explicit go-to-market and hardware routes for CNode-X; and the newest governance and tuning layers still read as announced capabilities rather than widely evidenced production defaults. The product therefore looks technologically differentiated, but the risk register sits at the seams: partner hardware availability, execution on forward-looking agentic-AI features, and the challenge of making a much broader AI-OS claim feel as mature as the underlying storage core.[CE016, CE018, CE021, CE025, CE026, CE027]
VAST's newer product layers depend materially on partner hardware, NVIDIA software, and surrounding operator ecosystems.
[CE018, CE020, CE025, CE027, CE030, CE037]Public-evidence maturity is highest for the core platform and operator tooling, and lower for the newest AI-OS control and acceleration layers.
Ratings are ordinal judgments derived from documentation depth, repository visibility, and launch-stage evidence. They are not internal shipment, adoption, or support metrics.
[CE003, CE009, CE012, CE021, CE029, CE039]5.5 Trust, compliance, and control posture
Public trust evidence is real but uneven. The strongest artifacts are not a polished public trust-center package; they are deployment and hardening materials. VAST's Security Configuration Guide says the platform is a STIG-hardened appliance, documents federal procurement orientation, and states the product is listed on the DoDIN Approved Products List. It also lays out concrete control surfaces: HTTPS-based VMS management, CLI access, predefined roles including CSI-specific permissions, identity integration with Active Directory, LDAP, and NIS, and trusted TLS prerequisites before SSO and MFA are configured. The AI reference architecture adds the broader design claim of zero trust, RBAC, ABAC, encryption, auditing, network isolation, and tenant-specific key management. Commvault's integration guide adds cyber-resilience evidence by documenting WORM-mode immutable backup copies on VAST-backed cloud-library workflows. The caution is that this chapter's retained corpus still does not provide a broad public certification catalog, clear SLA or incident-history evidence, or strong public reference proof that the newest Polaris and Policy/Tuning layers are already widely deployed in production.[CE032, CE033, CE034, CE035, CE036, CE037]
| control / artifact | status | scope | supporting evidence | gap / risk |
|---|---|---|---|---|
| STIG-hardened appliance posture | Publicly documented | Federal deployment and hardening baseline | Security Configuration Guide says the platform is STIG-hardened and on the DoDIN APL | Strong hardening signal, but not the same as a broad public enterprise trust pack. |
| Zero-trust and tenant controls | Publicly claimed and partially documented | RBAC, ABAC, auditing, tenant isolation, QoS, network isolation, tenant-specific encryption keys | AI architecture and security guide describe these controls | Need wider independent validation and current certification mapping. |
| Identity and admin control plane | Publicly documented | HTTPS VMS, CLI, roles, AD/LDAP/NIS/local users, TLS before SSO/MFA | Security guide provides concrete access-method and role detail | Need current public evidence on password policy defaults, incident response SLAs, and external audits. |
| Immutable protection and cyber-resilience | Publicly supported | Snapshots and WORM-mode immutable backup copies through partner workflows | AI architecture plus Commvault integration guide | Retained corpus did not include broader public restore-test or ransomware-recovery case studies. |
| AI-lifecycle security extension | Announced in 2026 | CrowdStrike telemetry and coordinated detection/response across ingestion, training, runtime, inference | CrowdStrike press release and VAST Forward coverage | Release explicitly warns that some referenced functionality may not yet be generally available. |
| Public trust visibility | Partial | Compliance marketing and hardening docs exist | Compliance page plus security docs reference governance, security, and regulated contexts | No broad public catalog of current audit reports, SLA commitments, or incident-history evidence was retained here. |
This table distinguishes between documented security controls and the broader trust posture an enterprise buyer may still expect. The main gap is not zero evidence; it is uneven public packaging of that evidence.
[CE032, CE033, CE034, CE035, CE036, CE037]5.6 Exhibits
06Customers
6.1 Customer segmentation and buyer / user / payer map
The public record supports a broad but not fully enumerated VAST customer base. Official customer-proof pages and roster disclosures show VAST reaching AI-cloud operators, telecom sovereign-AI builders, financial-services institutions, healthcare and life-sciences teams, government research bodies, and media organizations. The visible buyer-user-payer pattern is segment specific rather than uniform. In media, the NHL frames the purchase around production and archive workflows run by media-operations teams for downstream editors, producers, broadcasters, and eventually AI-assisted content discovery. In sovereign AI and AI-cloud contexts, the technical operator is a platform or infrastructure team, but the platform is procured so enterprise, research, and public-sector tenants can consume GPU and data services. In healthcare, genomics, and quant trading, the end users are researchers, scientists, and quants, while central infrastructure budgets fund the platform. That means VAST is not selling to a single universal persona; it is selling a deep infrastructure layer whose commercial sponsor is usually central IT or platform engineering, while the economic justification comes from higher-level data teams and business workflows.[CU001, CU002, CU003, CU004, CU024, CU037]
| Segment | Representative buyers / payers | Primary users | Named proof | Use case / scale | Gap |
|---|---|---|---|---|---|
| AI cloud / neocloud | Cloud infrastructure leadership and platform budgets | Model builders and tenant AI teams | CoreWeave; 2023 roster also names Lambda and Core42 | Primary data foundation for AI cloud, training, inference, shared-customer delivery | Direct end-customer revenue share and top-account mix are undisclosed |
| Telecom / sovereign AI | SK Telecom infrastructure and national AI program budgets | Government, research, and enterprise AI tenants | SK Telecom | Sovereign GPUaaS / AI cloud, 1,000 Blackwell GPUs, 5–10 minute provisioning | No public contract value or renewal structure |
| Financial services | Central data / innovation / trading technology budgets | Banking AI teams, asset-servicing teams, quant researchers | HSBC, CACEIS, Jump Trading | AI-driven banking, client conversations, algorithmic trading, low-latency HPC | Most retained pages are title-level or lightly quantified |
| Healthcare / life sciences | Research IT and bioinformatics infrastructure budgets | Scientists, genetic analysts, genomic R&D teams | Invitae, PacBio, HHS, Boston Children’s roster mention | Genomics, sequencing, medical research, public-health science workflows | Customer count by subsegment is undisclosed |
| Government / research | Public-sector and lab infrastructure budgets | Researchers and HPC / ML teams | HHS, LLNL, U.S. Air Force, DOE roster mention, Chan Zuckerberg Initiative | Drug efficacy, public-health research, government AI and science workloads | Production depth varies sharply by named logo |
| Media / sports | League media-operations and content-platform budgets | Editors, producers, archivists, broadcasters | NHL | 20+ PB archive, all 32 arenas, real-time replication, outdoor-event pilot | No public revenue or contract-length disclosure |
Segment map combines direct case-study proof with broader roster disclosures. Buyer, user, and payer are inferred from named deployment descriptions rather than from a single company segmentation disclosure.
[CU001, CU002, CU003, CU004, CU013, CU024]Typical VAST customer path from problem recognition through technical codesign, production rollout, and later expansion into adjacent workflows.
[CU004, CU006, CU011, CU015, CU043]6.2 Production adoption trajectory and named customer proof
The strongest proof is where VAST or the customer describes a live deployment with scope, chronology, or measurable outcomes. The NHL relationship is the clearest long-duration example: VAST says the league first adopted the platform about six years ago, moved more than 20 PB of archive content onto it, extended the footprint to all 32 arenas plus headquarters, and in 2026 successfully piloted a portable node for outdoor events that securely replicated more than 2 TB of footage. SK Telecom provides the clearest sovereign-AI and telco example, with VAST integrated into Petasus AI Cloud so GPU environments can be provisioned in about 5–10 minutes and scaled across a 1,000-Blackwell-GPU stack. CoreWeave is the highest-stakes AI-cloud example: VAST announced a $1.17 billion commercial agreement in late 2025, and both official and third-party sources describe VAST as a key data foundation inside CoreWeave’s cloud. Beyond those flagship cases, the retained set includes quantified healthcare outcomes from Invitae and PacBio and title-level case pages across HSBC, CACEIS, Jump Trading, HHS, LLNL, and Chan Zuckerberg Initiative, which together show real vertical spread even when not every page has the same evidentiary depth.[CU005, CU006, CU007, CU008, CU009, CU010]
| Metric / milestone | Value / status | Date | Source basis | Confidence | Implication / missing denominator |
|---|---|---|---|---|---|
| Average initial land size | About $1M initial investment | 2021-05-04 | Series D official release | Medium | Shows enterprise ACV floor historically, but not current mix or median |
| Installed-base expansion | 328% average net-revenue expansion; several customers >$10M | 2021-05-04 | Series D official release | Medium | Very strong land-and-expand signal, but no current NRR/GRR disclosure |
| Breadth of named roster | 2023 official roster spans Booking, U.S. Air Force, DOE, Verizon, Boston Children’s, Pixar, Zoom plus AI-cloud partners | 2023-12-06 | Series E official release | Medium | Confirms vertical breadth, but not paying-customer count |
| CoreWeave expansion | $1.17B commercial agreement and expanded partnership | 2025-11-06 | Official press release plus independent news | High | Largest public expansion proof also increases concentration risk |
| SK Telecom deployment maturity | GPU environments provisioned in 5–10 minutes on sovereign AI cloud | 2025-08-14 / 2025 | Customer page, blog, and press release | High | Shows production readiness and operational benefit |
| NHL production rollout | All 32 arenas plus headquarters on VAST; 2026 outdoor pilot moved >2 TB successfully | 2025-2026 | Customer page and 2026 blog | Medium | Strong repeat-usage signal, but no commercial spend disclosed |
| Healthcare outcome proof | Invitae reports 6x data-access improvement and 30x IOPS improvement; PacBio added 2 PB with no issues | 2026 | Customer-proof pages | Medium | Quantified outcomes exist for select customers, not the full base |
Trajectory table mixes historical expansion metrics, named deployment milestones, and outcome disclosures. It highlights what is supportable publicly while preserving missing denominators such as total customers, retention by cohort, and revenue share by account.
[CU008, CU009, CU012, CU015, CU020, CU021]| Customer | Segment | Deployment / use case | Production vs pilot | Outcome / proof quality | Limitation |
|---|---|---|---|---|---|
| NHL | Media / sports | Archive modernization, 32-arena replication, real-time media workflows, outdoor-event node | Production with 2026 pilot extension | 6-year relationship, >20 PB archive, all 32 arenas, >2 TB replicated in 2026 pilot | Commercial value and renewal terms are not public |
| SK Telecom | Telecom / sovereign AI | Petasus AI Cloud / GPUaaS for sovereign AI workloads | Production-grade deployment | 5–10 minute provisioning, 1,000 Blackwell GPUs, near-bare-metal throughput claims | No public contract value or tenant count |
| CoreWeave | AI cloud / neocloud | Primary data foundation for AI cloud and shared-customer workloads | Production / expansion account | $1.17B agreement, 23 data centers, 250,000+ GPUs, 500+ PB, 99.9999% uptime claim | Direct VAST revenue share and end-customer attribution remain unclear |
| Invitae | Healthcare / life sciences | Genetics analysis on VAST Universal Storage | Production deployment | 6x faster data access and 30x IOPS improvement cited | Single-page outcome proof; no spend or retention data |
| PacBio | Healthcare / life sciences | Sequencing and genomics data platform | Production deployment | Added another 2 PB with no issues; cites scalable, affordable HPC fit | Still company-hosted proof rather than independent customer testimony |
Enumeration is intentionally partial and limited to the strongest named public proofs retained for this chapter. Many additional logos exist, but these rows were chosen because they contain production detail, measurable outcomes, or fresh evidence rather than logo-only marketing.
[CU005, CU006, CU007, CU008, CU009, CU011]Observed path from evaluation to scaled production for the clearest named VAST customer proofs.
[CU006, CU009, CU012, CU015, CU020, CU043]Public customer proof is not uniform; the best evidence combines fresh dates, quantified outcomes, and clear production maturity.
Qualitative cells reflect the quality of public evidence retained for this chapter, not internal customer stages. They compare freshness and specificity of retained proof only.
[CU023, CU024, CU025, CU026, CU042]6.3 Retention, repeat usage, and satisfaction signals
Durability is supportable, but only indirectly. VAST’s own 2021 Series D release remains the cleanest historical expansion datapoint: average initial customer investment of about $1 million, average net-revenue expansion of 328%, and several customers already above $10 million of spend. More recent public proof shows continued repeat usage and expansion in named accounts rather than disclosed portfolio-wide retention metrics. The NHL relationship expanded from archive modernization into real-time arena-to-headquarters workflows and then into a 2026 outdoor-event pilot. CoreWeave evolved from an existing partnership into a disclosed $1.17 billion agreement. PacBio explicitly says it added another 2 PB to its VAST cluster with no issues, and Invitae cites 6x faster data access plus 30x IOPS improvement. Independent satisfaction evidence is directionally positive but thin: FeaturedCustomers aggregates a large body of testimonials and reference ratings, while PeerSpot reports 5.0/5 and 100% willingness to recommend from a small sample. The caution is that no retained source discloses current GRR, NRR, churn, cohort renewal, or top-customer share, so the public durability case is positive but incomplete.[CU020, CU021, CU022, CU029, CU030, CU031]
| Signal | Value / status | Segment / account | Confidence | What it says | Diligence ask |
|---|---|---|---|---|---|
| Historical expansion | 328% average net-revenue expansion | Broad installed base (2021) | Medium | Strong early land-and-expand behavior | Request current NRR / GRR by cohort and segment |
| Large repeat spend | Several customers >$10M by 2021 | Broad installed base | Medium | Some customers expanded far beyond the first land | Request current top-20 expansion curve and logo vintage |
| Long-lived named relationship | About six years of collaboration | NHL | Medium | Supports durability beyond a pilot logo | Request contract term, renewal cadence, and annual spend |
| Expansion from partner to megadeal | $1.17B disclosed agreement | CoreWeave | High | Proof of major expansion but also concentration risk | Request share of ARR / bookings and renewal schedule |
| Independent review sentiment | 4.8/5 from 1,417 reference ratings; 65 testimonials; 37 case studies | Broad review / reference base | Medium | Directionally positive third-party satisfaction signal | Request matched customer references by cohort and vertical |
| Current portfolio retention metrics | Not publicly disclosed | Company-wide | Low | Major public gap on GRR, NRR, churn, and renewals | Request current NRR, GRR, churn, renewal, and cohort bridge |
Public durability evidence is a mix of historical company metrics, named-account repeat usage, and third-party review sentiment. It is not a substitute for current cohort-level retention disclosure.
[CU006, CU015, CU020, CU021, CU022, CU029]Durability looks positive on public evidence, but the strongest signals are indirect and the core cohort metrics remain undisclosed.
This scorecard mixes disclosed metrics, named-account repeat usage, and review sentiment. It is a public-evidence lens, not a company-reported retention dashboard.
[CU021, CU030, CU032, CU040, CU044]6.4 Expansion, concentration, channel dependence, and unresolved gaps
The same public evidence that proves adoption also sharpens risk. CoreWeave is simultaneously VAST’s best public expansion case and clearest concentration flag because one disclosed partner-linked agreement is worth $1.17 billion, and third-party reporting says VAST underpins workloads CoreWeave delivers to Meta, OpenAI, and Microsoft. Public VAST materials also frame adoption through partners such as NVIDIA, Supermicro, HPE, Lambda, and Core42, which suggests partner-led distribution matters structurally for customer acquisition and deployment. What public sources do not show is equally important: there is no retained disclosure of current total customer count, channel mix, sales-cycle length, partner take-rates, cohort retention, or top-customer revenue share. The user-specified names BMW, Nasdaq, Goldman Sachs, and Dell are also revealing. The retained NVIDIA pages on BMW and Nasdaq prove adjacent AI infrastructure credibility, but they do not mention VAST. Sacra supports Goldman Sachs and Dell Technologies Capital as investors, not paying customers. So the chapter can support real adoption, real expansion, and real partner leverage, but it cannot close the loop on direct proof for every marquee name or on the concentration math needed for underwriting-grade customer durability.[CU018, CU019, CU033, CU034, CU035, CU036]
| Expansion driver / risk | Public evidence | Potential impact | Confidence | Diligence path |
|---|---|---|---|---|
| Land-and-expand motion | 2021 official metric of 328% net-revenue expansion and several customers >$10M | Positive indicator for account growth and wallet-share capture | Medium | Request current expansion by cohort and product module |
| CoreWeave concentration | $1.17B agreement with one disclosed AI-cloud partner/customer | Could create revenue and renewal concentration in a single neocloud account | High | Request top-customer ARR / bookings concentration and contract timing |
| Shared-customer dependence | SDxCentral says VAST under CoreWeave supports Meta, OpenAI, and Microsoft workloads | Exposure can be indirect and partner-mediated rather than directly owned | Medium | Request split between direct end customers and partner-routed customers |
| Partner-led route to market | Official sources emphasize CoreWeave, Lambda, Core42, NVIDIA, HPE, Supermicro | Improves reach but raises dependence on ecosystem priorities and certifications | Medium | Request partner-sourced bookings mix, pipeline share, and take-rates |
| Procurement friction visibility | No public list price, sales-cycle disclosure, or partner economics retained | Hard to judge how scalable procurement is outside marquee accounts | Low | Request average sales cycle, discounting norms, and procurement blockers by segment |
| Marquee-name proof gap | BMW and Nasdaq are adjacent NVIDIA references; Goldman and Dell show up as investors, not retained customer proof | Logo inflation risk if ecosystem references are mistaken for paying customers | Medium | Request direct customer-reference pack with production status, spend band, and reference contacts |
This table separates true public proof of expansion from the gaps that matter for underwriting concentration and channel dependence. The most important missing private package is customer concentration by ARR/bookings plus partner-sourced mix.
[CU018, CU019, CU033, CU034, CU035, CU036]6.5 Exhibits
07Risks
7.1 Ranked risk stack and investment implication
VAST is not a scandal or obvious product-failure case; the risk is mostly underwriting risk at a peak-cycle AI infrastructure price before public disclosure catches up. The company does have real proof points: it closed a $30 billion Series F, says it has more than $4 billion of cumulative bookings and more than $500 million of committed ARR, landed a $1.17 billion CoreWeave partnership, and now ships through validated NVIDIA, Cisco, and Supermicro architectures. But those same facts create the top residual stack. First is valuation-reset risk, because the price assumes durable AI build-out and continued acceptance of private metrics. Second is concentration risk across Nvidia, Cisco, OEM hardware, and a small set of marquee AI-cloud customers. Third is execution breadth as VAST stretches from storage into a full AI operating system and cloud control plane. Fourth is legal and compliance exposure around Red Stapler and new data rules. Fifth is Israel-linked continuity and talent exposure. Investment implication: the deal is still diligencable, but only if legal provenance, concentration, and compliance evidence become bounded rather than open-ended risks.[CR001, CR002, CR005, CR006, CR007, CR011]
Qualitative severity map across the risks that matter most to underwriting VAST at a 2026 private-market price.
This is a qualitative matrix built from retained evidence rather than a probabilistic model.
[CR006, CR018, CR021, CR030, CR036, CR045]How VAST's major risks transmit into revenue, margin, financing options, and exit value.
The DAG is directional rather than mathematical and is meant to show where small upstream shocks become larger underwriting problems.
[CR006, CR021, CR025, CR030, CR036, CR038]7.2 Legal, regulatory, privacy, and sovereignty risk
The clearest live legal overhang is the Red Stapler chain. NetApp sued former CTO Jonsi Stefansson over alleged trade-secret misuse tied to the startup VAST acquired, and although the Florida complaint was dismissed without prejudice, the dismissal was venue-based and NetApp appealed. That means VAST is not yet facing a proven liability event, but the matter is also not fully extinguished. Separately, the public legal surface shows a company that processes contact, device, and candidate data through a website and cloud stack while pushing deeper into regulated sectors through Cisco reference designs for healthcare and financial-services use cases. That makes the 2025 bulk-data regime and broader U.S. privacy enforcement more relevant than they would be for a simple storage vendor. Israel adds another layer. VAST publicly tied itself to a sovereign Israeli AI cloud and operates a large Tel Aviv-centered development footprint, while Israeli legal commentary highlights export-control, dual-use, and IIA-grant constraints that can matter in financings, IP transfers, and exits. Public evidence is enough to rank these as real legal and regulatory risks, but not enough to close them without document review.[CR021, CR022, CR023, CR024, CR025, CR026]
| Rank | Rule / license / case | Jurisdiction | Status | Likelihood | Severity | Mitigation | Residual exposure | Diligence path |
|---|---|---|---|---|---|---|---|---|
| 1 | NetApp / Red Stapler IP dispute | U.S. / Iceland | Florida complaint dismissed without prejudice; appeal continues; VAST not named defendant | Medium | High | Keep acquired code and personnel provenance ring-fenced; rely on SPA indemnities and outside counsel review | Medium-High | Obtain acquisition docs, code provenance memo, indemnity package, and any Iceland action updates |
| 2 | DOJ bulk-data rule plus expanding U.S. privacy enforcement | United States | Effective from 2025; applies to certain vendor, investor, employee, and data-access flows | Medium | High | Contractual controls, data-mapping, least-privilege access, and customer-specific security programs can mitigate | Medium | Request data-flow map, subprocessors, cross-border access controls, and privacy compliance matrix |
| 3 | Israel export-control / dual-use / IIA-grant constraints | Israel | Sector-level obligation is clear; VAST-specific grant or license status not public | Low-Medium | Medium-High | Early legal structuring and local-compliance review can contain transaction risk | Medium | Confirm any IIA grants, defense-linked customers, export licenses, or know-how transfer restrictions |
| 4 | Regulated-workload compliance obligations | Multi-jurisdiction | Financial-services and healthcare use cases are marketed, but public attestation set is incomplete | Medium | Medium-High | Cisco zero-trust framing and enterprise architectures help, but public proof remains thin | Medium-High | Request current SOC 2, ISO 27001, penetration-test summaries, and regulated-customer approval references |
Partial public register of the main legal and regulatory items touching VAST as of runDate; private customer approvals, grants, and internal controls remain outside public view.
[CR021, CR022, CR023, CR024, CR025, CR026]7.3 Operational, quality, and dependency risk
Operational risk is less about a publicly documented outage history than about how many critical layers VAST does not fully control. The current product story relies on NVIDIA acceleration, Cisco distribution, Supermicro reference systems, and broader OEM hardware from vendors that also work with VAST competitors. The CoreWeave contract and SDS sovereign-cloud build prove that VAST can win very large deployments, but they also show how revenue and roadmap execution can become tied to a handful of capital-intensive counterparties whose own GPU budgets, data-center timing, and procurement priorities can shift abruptly. Competitive pressure sharpens this problem. Dell is not a partner in the evidence reviewed; it is an active rival publicly attacking VAST on power, space, and benchmark framing, while StorageMath has criticized one of VAST's optimization claims. That does not prove product weakness, but it does mean VAST must keep both technical credibility and partner alignment high while scaling across mission-critical environments.[CR007, CR008, CR009, CR011, CR012, CR013]
| Rank | Failure mode | Likelihood | Severity | Mitigation maturity | Residual exposure | Unresolved gap |
|---|---|---|---|---|---|---|
| 1 | NVIDIA roadmap or allocation disruption slows AI OS deployments | Medium | High | Medium | High | No public disclosure of allocation protection, alternative accelerator strategy, or supply commitments |
| 2 | Cloud control-plane integration from Red Stapler slips under legal or engineering pressure | Medium | High | Low-Medium | Medium-High | No public code provenance review or post-acquisition integration KPI pack |
| 3 | Security and compliance proof lags regulated-customer ambition | Medium | High | Low-Medium | Medium-High | Retained sources do not show a current public certification or audit set |
| 4 | Large sovereign or neocloud rollouts create reliability and multi-tenant complexity | Medium | Medium-High | Medium | Medium | No public SLA, incident-log, or postmortem package was found in retained sources |
| 5 | Marketing and benchmark disputes erode technical credibility in competitive deals | Medium | Medium | Medium | Medium | Independent technical validation of disputed claims is limited in the public record |
This register focuses on externally observable operational modes; absence of public outage history is not treated as proof of absence and is carried as a diligence gap.
[CR013, CR014, CR015, CR017, CR019, CR020]| Rank | Dependency | Counterparty | Role | Concentration | Failure scenario | Severity | Mitigation | Residual exposure |
|---|---|---|---|---|---|---|---|---|
| 1 | Accelerated AI stack | NVIDIA | GPU, libraries, certification, and reference architecture backbone | Critical | NVIDIA roadmap or allocation changes weaken VAST's current AI OS positioning | High | Certified designs and deep partnership | High |
| 2 | Marquee neocloud revenue | CoreWeave | Primary disclosed mega-deal and cloud distribution anchor | High | Capex pause, renewal stress, or repricing hits growth optics and bookings concentration | High | Multi-year partnership and technical integration | High |
| 3 | Enterprise channel and procurement | Cisco | GPL route, validated stack, and enterprise access | Medium-High | Cisco prioritizes incumbent alternatives or the joint stack underperforms in field execution | High | Direct GPL placement and joint solution support | Medium-High |
| 4 | Validated hardware platforms | Supermicro plus OEM stack | Reference systems and deployment hardware | Medium-High | Hardware constraints, spec shifts, or OEM conflict slow deployments | Medium-High | Multi-OEM posture rather than single-box dependency | Medium |
| 5 | Israel sovereign cloud footprint | SDS and Israel-based operations | Regional flagship deployment and local execution base | Medium | Geopolitical disruption or local labor stress slows delivery and support | Medium-High | Remote-first posture and global customer base | Medium |
Counterparty concentration is ranked by disclosed strategic importance, not by undisclosed revenue share; top-customer and top-partner economics remain private.
[CR007, CR008, CR011, CR012, CR013, CR014]The current VAST stack depends on external GPU, channel, hardware, cloud, and regional execution nodes rather than on a single self-contained platform.
This dependency map is an exposure map, not a full partner list; it highlights the nodes with the strongest evidence-backed risk transmission.
[CR007, CR011, CR013, CR014, CR017, CR027]7.4 Financial, commercial, and competitive risk
The financial-model risk is not an imminent cash crunch in the public record; it is valuation fragility and concentration fragility. VAST's financing access is clearly strong, but the round structure mixed primary and secondary capital, so the headline size exaggerates fresh operating cash. At the same time, the strongest scale metrics remain company-claimed rather than publicly reconciled. That matters because macro context is no longer uniformly supportive. Gartner still forecasts huge AI infrastructure spend in 2026, but it also says the sector is in a trough of disillusionment and that incumbents will capture much of enterprise AI demand. Goldman frames the build-out debate as assumption-sensitive, not guaranteed. Those two facts can coexist: spend can grow while the market becomes less forgiving of premium private valuations. For VAST, that means the downside case is not just slower top-line growth; it is slower top-line growth combined with harder competition from Dell and other incumbents and a public-market multiple reset before VAST closes its disclosure gap.[CR001, CR002, CR004, CR005, CR006, CR018]
7.5 People, execution, mitigations, and kill criteria
People risk is meaningful because VAST's market narrative still runs heavily through founder CEO Renen Hallak, while the cloud-control-plane expansion vector depends on the Red Stapler team whose arrival is precisely the area touched by the NetApp dispute. There are also classic late-stage private-company diligence limits: the public source set still does not provide top-customer concentration, renewal schedules, debt, current security certifications, or a clean treasury view. The good news is that some mitigations are real. VAST has deep financing access, partner validation from Cisco and Supermicro, and credible adoption in large AI environments. The bad news is that those mitigations do not answer the most thesis-sensitive questions on their own. Investors should therefore treat several items as explicit post-signing monitors rather than vague watchpoints: any new litigation step that drags VAST directly into the Red Stapler fight, evidence of customer concentration worse than expected, inability to produce current compliance attestations for regulated workloads, or any financing event that implies a valuation reset. Those are clearer kill criteria than generic concerns about competition or macro noise.[CR022, CR037, CR038, CR039, CR042, CR048]
| Rank | Role / function | Dependency or gap | Likelihood | Severity | Mitigation | Diligence path |
|---|---|---|---|---|---|---|
| 1 | Founder / CEO | Commercial, product, and capital narrative remain strongly centered on Renen Hallak | Medium | High | Add senior bench depth and board process | Request succession plan, board committee structure, and escalation map |
| 2 | Cloud-control-plane leadership | Red Stapler team is central to hyperscale cloud ambition and is adjacent to the live legal narrative | Medium | High | Ring-fence IP review and integration governance | Request post-acquisition roadmap milestones and code ownership review |
| 3 | Compliance and security go-to-market | Regulated-sector push appears ahead of publicly visible compliance attestation | Medium | Medium-High | Use Cisco and partner validation while building direct control set | Request current certifications, customer questionnaires, and security staffing plan |
| 4 | Board and disclosure depth | Public disclosures still leave board composition, debt, and treasury detail thin for a $30B company | Medium | Medium-High | Late-stage CFO discipline and investor processes | Request board list, debt schedule, cash policy, and governance calendar |
| 5 | Israel R&D and talent concentration | Large Israel engineering footprint faces macro labor and geopolitical stress | Medium | Medium | Remote-first posture and global commercial surface | Request org chart by geography, attrition trends, and business continuity plan |
This table ranks execution dependence by how directly a single team or control gap can affect roadmap delivery, fundability, or regulated-customer conversion.
[CR021, CR022, CR029, CR030, CR037, CR038]| Risk | Monitorable trigger | Threshold / event | Action implication |
|---|---|---|---|
| Valuation reset | Financing or IPO signal | Down-round, structured rescue financing, or IPO pricing materially below recent private marks | Pause new capital deployment and re-underwrite on public-comp multiples |
| Customer concentration | Diligence disclosure | Top-customer or top-partner share materially exceeds underwriting assumptions, especially around CoreWeave | Cut position size or require stronger downside protections |
| Legal / IP overhang | Case status | VAST is named directly, faces injunction risk, or cannot evidence clean code provenance | Treat as thesis-break until legal risk is quantified |
| Security / compliance gap | Certification evidence | Company cannot produce current attestation set for regulated workloads inside diligence window | Limit underwriting to non-regulated growth case or stop |
| Partner dependency | Roadmap / channel slippage | NVIDIA, Cisco, or OEM roadmap slip pushes major deployments by more than one planning cycle | Revise growth and margin expectations downward |
| Israel continuity risk | People and operating resilience | Meaningful Israel leadership attrition or sovereign-cloud disruption with no fast fallback | Reprice execution risk and require continuity plan before closing |
These triggers are designed to be monitorable inside diligence and the first post-close year rather than to rely on vague macro sentiment.
[CR036, CR042, CR045, CR046, CR047, CR048]7.6 Exhibits
08Valuation
8.1 Price-sensitive recommendation, not a company-quality verdict
VAST Data looks like a real category leader rather than a hype-only asset. The company has an unusually strong combination of AI-infrastructure relevance, marquee customers, rapid growth, and positive margin signals for a private storage-and-data platform. The problem is not whether the business matters; the problem is whether fresh capital at a $30 billion valuation can still earn an attractive return using only public evidence. At the official floor of more than $500 million of committed ARR, the round price implies roughly 60x. Even the older Reuters-style $600 million ARR projection still implies about 50x. Only the most aggressive external estimate of roughly $2 billion of ARR pulls the entry price down to about 15x, which is still rich versus public storage peers. That makes the recommendation price-sensitive and evidence-sensitive: track or research more for new money, hold only if the basis is materially lower, and do not confuse a strong company with a justified entry price.[CV001, CV007, CV009, CV019, CV020, CV021]
| dimension | current read | evidence basis | decision implication |
|---|---|---|---|
| Recommendation | Track / research-more for fresh capital; hold only for existing low-basis holders | Fresh-money underwriting clears only in the bull case while company-quality signals remain strong | Do not initiate at the round price without a private diligence package or a better entry point. |
| Confidence | Medium | Core financing facts are real, but revenue-definition and preference-stack gaps remain unresolved | Treat the call as evidence-sensitive rather than final. |
| Risk rating | High | Price requires premium growth assumptions plus continued AI-capex support and clean concentration dynamics | Underwrite downside first. |
| Valuation stance | Expensive | The round implies ~60x official CARR, ~50x a Reuters-style ARR projection, and ~15x an aggressive external ARR estimate | Current price embeds success that public evidence has not fully proven. |
| Target return / hold / exit | New money needs ~$60B for 2x gross or ~$90B for 3x gross; existing holders can hold for IPO optionality | Return hurdle is difficult to support versus public comp multiples today | Fresh buyers need a price reset or much stronger evidence. |
This table is explicitly price-sensitive. Return hurdles are arithmetic from the $30 billion round valuation and do not adjust for future dilution or financing terms.
[CV019, CV020, CV021, CV044, CV050, CV051]The recommendation follows from real product and demand proof flowing into an unresolved revenue bridge and then into an expensive price test.
This flow is analytical rather than mechanistic. It shows how the recommendation is derived from evidence quality and price support, not how VAST's product technically works.
[CV007, CV014, CV024, CV027, CV030, CV044]8.2 Financing context, marks, and entry discipline
The financing record is directionally supportive but still incomplete where underwriting matters most. March 2026 reporting surfaced the raise early, while the company officially disclosed the closed Series F on April 22, 2026. Management confirmed that the roughly $1 billion transaction combined primary and secondary capital, and outside reporting suggests that liquidity for early holders may have represented around half the headline amount. That matters because a secondary-heavy round is a weaker signal on fresh balance-sheet strength than a pure primary raise. Fidelity portfolio filings show an independent mark-up in VAST preferred shares from roughly $18.16 per share in August 2024 to roughly $59.22 by February 2026, so the valuation step-up did not come from nowhere. But those same sources still do not disclose the exact preference stack, dilution, or revenue-definition bridge. Entry discipline therefore has to be simple: assume the public record supports a valuation below the round unless diligence proves that recurring revenue is much closer to the high-end estimates and that late-stage financing terms are clean.[CV004, CV005, CV006, CV017, CV018, CV019]
| lens | thesis | anti-thesis | what would change the view |
|---|---|---|---|
| Market | AI infrastructure demand is real and VAST sits in a large buildout cycle. | AI-capex multiples can compress sharply if the bubble narrative wins or utilization disappoints. | Show durable demand through customer renewals and resilient growth through a weaker AI-spend tape. |
| Product | A unified AI Operating System can justify a premium over legacy storage vendors. | The premium is not investable if product breadth does not translate into monetized recurring revenue. | Provide module-level monetization and attach-rate disclosure. |
| Customers | Marquee customers and partners support strategic relevance. | Public sources do not quantify concentration, so one or two large neoclouds may dominate economics. | Share top-customer ARR and renewal schedule. |
| Financials | Official signals show >$500M CARR, >$4B bookings, and positive margin/cash-flow claims. | Those facts still leave a wide gap to the ~$2B ARR estimate needed for comfort at this price. | Bridge bookings, CARR, ARR, and recognized revenue in audited form. |
| Competition | VAST should trade above NetApp and Pure if it is truly an AI-infrastructure platform rather than a storage box company. | Even premium public comps trade well below VAST's implied multiple unless the high-end ARR estimate is right. | Prove software-like durability and margin quality at materially larger scale. |
| Financing / governance | Independent Fidelity marks support a real valuation step-up. | Secondary-heavy structure and undisclosed preference terms weaken price confidence. | Provide primary/secondary split and full cap-table waterfall. |
Each row pairs a real positive with the corresponding underwriting objection. The chapter recommendation only improves if the “what would change the view” column is satisfied.
[CV018, CV024, CV027, CV030, CV035, CV036]The valuation debate is most sensitive to revenue-definition clarity and public-market multiple selection, with concentration and multiple compression acting as the main negative offsets.
Values are directional dollar-billion deltas from a mid-teens base case, designed to show which assumptions matter most rather than to present a precise DCF.
[CV019, CV021, CV024, CV027, CV030, CV038]8.3 Scenarios and comparable valuation anchors
Public comparables frame the valuation problem clearly. Everpure, the rebranded Pure Storage, trades around 7.7x EV to trailing revenue; NetApp trades around 4.1x; and CoreWeave, the clearest public AI-infrastructure premium reference, trades around 10.9x EV to annualized first-quarter revenue while carrying much heavier leverage. Private AI infrastructure rounds such as Lambda, Crusoe, and Together AI confirm that investors are paying up for GPU-cloud and AI-platform assets, but none of those milestones close the gap between VAST's official $500 million-plus CARR floor and a $30 billion price. The base case therefore sits below the round, because a mid-point revenue estimate combined with a public AI-infrastructure premium still lands in the low-to-high teens. The bull case reaches the high twenties to mid-thirties only if the roughly $2 billion ARR estimate is directionally right, concentration is manageable, and public markets remain open to premium AI infrastructure multiples. The bear case collapses quickly toward public-storage bands if AI spending slows or the revenue bridge fails.[CV024, CV027, CV030, CV031, CV032, CV033]
| scenario | assumptions | valuation range (USD B) | implied result vs $30B entry | probability signal / triggers |
|---|---|---|---|---|
| Bull | Public high-end ARR estimate (~$2B) is directionally right; concentration is manageable; VAST earns a 14x-18x premium multiple into an open IPO market. | 28-36 | Flat to moderately positive for fresh buyers; attractive for low-basis holders | Needs private diligence to validate revenue scale and exit readiness. |
| Base | Normalized recurring-revenue power is about $1.3-$1.5B; VAST earns a premium to Pure/NetApp but not a bubble multiple. | 13-21 | Below the current round; fresh entry unattractive | Best fit with today's public record. |
| Bear | Durable recurring revenue is closer to $0.8B; AI multiples compress toward public-storage bands; concentration or exit timing worsens. | 6.4-8.0 | Severe markdown from the round price | Triggered by failed revenue bridge, customer concentration, or a shut IPO window. |
Scenario values are estimated equity ranges, not negotiated term-sheet outcomes. They purposely use broad bands because the public record does not support point precision.
[CV041, CV042, CV043, CV044, CV045, CV051]| comparable | status | current value | revenue anchor | implied multiple / milestone | relevance | limitation |
|---|---|---|---|---|---|---|
| Pure Storage / Everpure | Public | EV ~$28.0B on 2026-05-27 | TTM revenue ~$3.66B | ~7.7x EV/revenue | Best listed storage-platform premium comp with good margins and cash | Still more mature and less AI-native than VAST. |
| NetApp | Public | EV ~$27.1B on 2026-05-27 | TTM revenue ~$6.70B | ~4.1x EV/revenue | Useful lower-bound storage incumbent comp | Legacy mix and slower growth likely understate AI-platform upside. |
| CoreWeave | Public | EV ~$90.7B on 2026-05-27 | Annualized Q1'26 revenue ~$8.3B | ~10.9x EV/annualized revenue | Closest public AI-infrastructure premium reference | Heavily levered and power-intensive; not a clean apples-to-apples comp. |
| Lambda | Private round | $480M Series D; Reuters-valued at ~$2.5B | Public revenue undisclosed | Milestone reference only | Shows investors will pay up for AI cloud capacity | No current revenue or margin disclosure. |
| Crusoe | Private round | $1.375B Series E at >$10B valuation | Public revenue undisclosed | Milestone reference only | Large capital raise for vertically integrated AI infrastructure | Valuation is not paired with public recurring-revenue detail. |
| Together AI | Private round | $305M Series B at $3.3B valuation | Public revenue undisclosed | Milestone reference only | Another cloud/AI-platform valuation marker | Model mix and financial profile are not storage-infrastructure comparable. |
Public rows use runDate market data; CoreWeave revenue is annualized from Q1 2026 for directional comparison; private rows are milestone references because current revenue is not publicly disclosed. This is a partial enumeration of the most relevant observable comps, not an exhaustive late-stage AI infrastructure universe.
[CV024, CV027, CV030, CV031, CV032, CV033]Scenario ranges show that public evidence supports a base case below the round and that fresh-money return hurdles are demanding from a $30 billion entry.
Scenario ranges are estimated equity values. The return hurdle row is pure arithmetic from the round price and is shown to highlight why valuation discipline matters.
[CV041, CV042, CV043, CV051, CV052]8.4 Exit readiness, thesis-break triggers, and final diligence asks
The most supportable exit path is an eventual IPO, but management itself says that VAST has not started a formal IPO process or hired bankers yet. That means investors should not model a near-term public exit as though it were already underway. The public record also leaves open the questions that most often break late-stage deals: what fraction of revenue comes from a handful of hyperscaler or neocloud accounts, how repeatable those contracts are, what the post-Series-F cap table looks like, and whether audited reporting systems are ready for public-market scrutiny. Those are not cosmetic details at this valuation. If ARR definitions fail to reconcile, if customer concentration is high, or if the company remains far from IPO readiness despite demanding public-style pricing, the thesis should break immediately. The diligence agenda is therefore straightforward: ask for the private metric bridge, the concentration schedule, the financing waterfall, and the IPO-readiness roadmap before treating the $30 billion round as investable rather than merely impressive.[CV039, CV040, CV045, CV046, CV048, CV049]
| trigger | threshold | why it breaks thesis | action implication |
|---|---|---|---|
| Revenue bridge fails | Audited ARR or recurring revenue lands well below ~$1.5B | The current round price only works if scale is far above the official $500M CARR floor | Stop or reprice the deal. |
| Customer concentration too high | Top one or two customers dominate recurring revenue or upcoming renewals | Late-stage premium multiples should compress if demand is narrow or lumpy | Increase downside weighting or walk away. |
| AI multiple compression | Public AI infrastructure comps derate toward legacy storage bands | VAST is already priced for premium multiples | Avoid fresh entry until price resets. |
| IPO readiness stalls | No bankers, controls, or reporting roadmap despite premium public-style pricing | Holding period extends while valuation support weakens | Treat as longer-duration private asset or decline. |
| Preference overhang surprises | Series F waterfall or anti-dilution terms materially reduce common-equity value | Headline valuation no longer maps to investor economics | Rebuild return model from the cap table, not from the headline valuation. |
Kill triggers are intentionally specific and deal-oriented. They focus on conditions that would invalidate the current pricing rather than conditions that merely slow growth.
[CV036, CV038, CV046, CV048, CV049]| topic | missing evidence | why it matters | owner / diligence path |
|---|---|---|---|
| Revenue bridge | Bookings-to-CARR-to-ARR-to-recognized-revenue reconciliation by year and by top customer | This is the single biggest swing factor in fair value | Finance diligence / CFO package. |
| Primary vs secondary mix | Signed financing summary showing exact fresh cash and seller liquidity | Determines dilution and how much runway actually improved | Legal + investor-relations diligence. |
| Cap table / preferences | Series F term sheet, liquidation waterfall, and fully diluted ownership | Headline valuation may overstate common-equity value | Legal diligence / board materials. |
| Customer concentration | Top-20 ARR, renewal dates, margin profile, and hyperscaler exposure | Concentration can collapse the bull case quickly | Commercial diligence / revenue-ops export. |
| IPO readiness | Controls, audit status, board composition, and banker-readiness milestones | IPO is the most supportable exit path but not yet formally started | CEO/CFO diligence + governance review. |
These are the shortest path to changing the recommendation. Without them, the public record is good enough to admire the company but not to underwrite a fresh $30 billion entry.
[CV037, CV039, CV048, CV049]The IC-ready scorecard is strong on market and product proof, middling on evidence quality, and weak on valuation support at the current entry price.
Scores are committee-style heuristics on a 0-10 scale intended to summarize evidence quality and price support, not to present a weighted algorithm.
[CV035, CV036, CV044, CV045, CV047, CV049]8.5 Exhibits
Disclaimer
This report is for informational purposes only and is based on public sources available as of 2026-05-27. VAST Data is a private company with limited public disclosure, so several financial, governance, and customer-concentration conclusions rely on management statements and third-party reporting rather than audited filings. Scenario analysis and valuation views are illustrative and not investment advice.
Evidence index
| ID | Statement | Confidence | Sources |
|---|---|---|---|
| CO001 | VAST Data was founded in 2016. | Medium | SO007, SO026 |
| CO002 | VAST describes itself as an AI Operating System company and a global data platform for agentic and data-intensive applications. | Medium | SO001, SO029 |
| CO003 | VAST says its platform natively unifies storage, database, and compute in one system. | Medium | SO011, SO029 |
| CO004 | VAST DataStore is marketed as a universal data store that eliminates storage tiers and scales from terabytes to exabytes. | Medium | SO005, SO018 |
| CO005 | VAST attributes its platform economics and scale claims to its DASE architecture. | Medium | SO007, SO008, SO025 |
| CO006 | Recent official materials anchor VAST to New York City while also branding the company as remote-first or no-headquarters. | Medium | SO007, SO011, SO008 |
| CO007 | Reuters-described coverage characterized VAST as New York-headquartered in 2025. | Medium | SO028 |
| CO008 | After closing its Series F financing, VAST remained a private, late-stage company rather than a public issuer. | Medium | SO007, SO016 |
| CO009 | Renen Hallak is VAST's founder and CEO or co-founder and CEO in retained public sources. | Medium | SO007, SO011, SO016 |
| CO010 | Jeff Denworth is a co-founder who remains a recurring public spokesperson for VAST's platform strategy. | Medium | SO011, SO017, SO023 |
| CO011 | Amy Shapero is VAST's first CFO and joined from Shopify as the company prepared for public-company readiness. | Medium | SO016, SO028 |
| CO012 | Jonsi Stefansson became VAST's general manager of cloud solutions after VAST bought Red Stapler and hired its team in 2025. | Medium | SO020, SO021, SO009 |
| CO013 | Fidelity's 2023 investment gave it a board observer role rather than a publicly disclosed board seat. | Low | SO025 |
| CO014 | Hallak is the main public face across VAST's financing, product-positioning, and IPO-readiness disclosures in the retained source set. | Medium | SO007, SO016, SO028 |
| CO015 | VAST appears to use multiyear licensing, OEM relationships, resellers, and cloud partners as major routes to market rather than a purely direct sales model. | Medium | SO016, SO018, SO026 |
| CO016 | VAST raised $83 million in Series D in May 2021 at a $3.7 billion post-money valuation led by Tiger Global Management. | Medium | SO010, SO019 |
| CO017 | VAST said NVIDIA also participated in the Series D round. | Medium | SO010, SO019 |
| CO018 | VAST said its balance sheet was $230 million strong after adding the Series D proceeds to prior rounds. | Medium | SO010, SO019 |
| CO019 | VAST said it exited its second year with nearly $100 million of annualized software run rate and cash-flow positivity around the Series D announcement. | Medium | SO010, SO019 |
| CO020 | VAST raised $118 million in Series E in December 2023 at a $9.1 billion valuation led by Fidelity Management & Research Company. | Medium | SO011, SO023, SO024 |
| CO021 | NEA, BOND Capital, and Drive Capital also participated in the Series E round. | Medium | SO011, SO024 |
| CO022 | Independent coverage said part of the Series E proceeds would support a stock buyback or secondary sale for existing shareholders. | Medium | SO023, SO024 |
| CO023 | At the end of fiscal Q3 in 2023, VAST said cumulative software bookings had surpassed $1 billion. | Medium | SO011, SO023, SO024 |
| CO024 | VAST said late-2023 growth was 3.3 times year over year with nearly 90 percent gross margin and positive cash flow. | Medium | SO011, SO023, SO025 |
| CO025 | VAST disclosed more than 700 employees worldwide in late 2023. | Medium | SO011, SO025 |
| CO026 | Enterprise Times reported that VAST had offices in the US and Israel plus an office in London in late 2023. | Low | SO025 |
| CO027 | VAST closed its Series F financing in April 2026 at a $30 billion valuation. | Medium | SO007, SO014, SO016 |
| CO028 | The Series F transaction value was approximately $1 billion and included both primary and secondary capital. | Medium | SO007, SO014, SO015 |
| CO029 | Drive Capital led the Series F round and Access Industries acted as co-lead. | Medium | SO007, SO014, SO016 |
| CO030 | Fidelity, NEA, and NVIDIA were named as participating investors in the Series F financing. | Medium | SO007, SO014, SO016 |
| CO031 | Combining the roughly $381 million raised before Series F with the approximately $1 billion 2026 transaction implies total capital raised of about $1.38 billion. | Medium | SO024, SO026, SO028, SO007 |
| CO032 | By April 2026, VAST said it had surpassed $4 billion in cumulative bookings and exited the prior fiscal year with more than $500 million in committed annual recurring revenue. | Medium | SO007, SO014, SO016 |
| CO033 | Hallak told CRN that VAST was GAAP profitable and cash-flow positive. | Medium | SO016 |
| CO034 | Reuters-reported coverage said VAST had reached $200 million in ARR by January 2025 and had internal projections to reach $600 million the following year. | Low | SO028 |
| CO035 | Official 2026 financing materials said thousands of organizations rely on VAST, including CoreWeave, Lowe's, the U.S. Air Force, Cursor, JPMorganChase, Crusoe, and Mistral AI. | Medium | SO007, SO013 |
| CO036 | Official late-2023 materials listed Booking Holdings, the U.S. Air Force, the U.S. Department of Energy, Verizon, Boston Children's Hospital, Pixar, and Zoom as customer examples. | Medium | SO011, SO025 |
| CO037 | TechCrunch reported that VAST's customers include Pixar, ServiceNow, xAI, CoreWeave, and Lambda. | Medium | SO022 |
| CO038 | VAST's customer stories page highlights the NHL, SK Telecom, NVIDIA-linked cybersecurity work, and animated-feature production use cases. | Medium | SO004 |
| CO039 | Fierce Network reported VAST cloud partnerships across AWS, Google Cloud, Microsoft Azure, CoreWeave, Crusoe, Lambda, and Core42. | Medium | SO026 |
| CO040 | Supermicro markets the VAST platform as a high-performance, scalable data management solution optimized for AI and GPU cloud markets. | Medium | SO018 |
| CO041 | StorageReview reported that VAST planned to deliver CNode-X servers through OEM partners such as Cisco and Supermicro. | Medium | SO017 |
| CO042 | VAST launched the Amplify program in January 2026 and said it can deliver up to six times more effective capacity from existing SSD estates depending on workload. | Medium | SO008 |
| CO043 | At VAST Forward in February 2026, VAST launched Polaris as a global control plane for distributed AI infrastructure across cloud and on-premises environments. | Medium | SO009, SO017 |
| CO044 | StorageReview said the PolicyEngine and TuningEngine were slated for release by the end of 2026. | Medium | SO017 |
| CO045 | The NetApp and Red Stapler dispute created reputational and integration risk for VAST even though VAST itself was not named as a defendant in the Florida complaint. | Medium | SO020, SO021 |
| CO046 | NetApp's Florida lawsuit against Stefansson was dismissed on jurisdictional grounds, but NetApp said it had appealed and was pursuing related action in Iceland. | Medium | SO020 |
| CO047 | Calcalist reported that the Red Stapler dispute arrived while VAST was trying to raise a new round at roughly $28 billion to $30 billion. | Medium | SO021 |
| CO048 | Hallak said in April 2026 that VAST had not started an IPO process even though internal preparation was underway. | Medium | SO016, SO028 |
| CO049 | The retained source pack does not disclose an exact current customer count despite company language about thousands of organizations and multiple named logos. | Medium | SO007, SO013, SO022 |
| CO050 | The retained source pack does not disclose a precise 2026 employee count beyond the more-than-700 figure published in late 2023. | Medium | SO011, SO025 |
| CM001 | VAST positions its AI Operating System as a unified layer for storage, database, and compute aimed at agentic and other data-intensive applications. | Medium | SM001 |
| CM002 | VAST says AI adoption is constrained by exploding data volume, outdated compute architectures, and the high cost of effective AI. | Medium | SM002 |
| CM003 | VAST markets the platform as enterprise scale-out all-flash infrastructure for both structured and unstructured workloads that removes storage tiering and HDD dependence. | Medium | SM015 |
| CM004 | VAST says modern AI workloads require storage, networking, and compute to operate as a tightly coupled data-centric fabric across datacenters. | Medium | SM016 |
| CM005 | VAST says its platform covers file, object, block, tabular, vector, and streaming data through the DataStore, DataBase, and DataSpace services. | Medium | SM016 |
| CM006 | VAST says its DASE architecture disaggregates compute from physical storage and scales from terabytes to exabytes over NVMe-over-Fabrics. | Medium | SM016, SM018 |
| CM007 | VAST argues that a unified control plane can replace separate object stores, analytics clusters, orchestration layers, and message brokers in large AI environments. | Medium | SM016 |
| CM008 | VAST explicitly frames AI, compliance, containers, data analytics, backup and recovery, and HPC as solution areas around the platform. | Medium | SM015 |
| CM009 | IDC projected the external OEM enterprise storage market to reach about $37.7 billion in 2026 after roughly $35.4 billion in 2025 and $33.6 billion in 2024. | Medium | SM004 |
| CM010 | IDC highlighted all-flash arrays as a key growth driver because enterprises are shifting toward high-performance flash for AI, analytics, and high-throughput processing. | Medium | SM004 |
| CM011 | Gartner forecast total AI spending of $2.528 trillion in 2026, including $1.366 trillion for AI infrastructure and $3.119 billion for AI Data. | Medium | SM003 |
| CM012 | Gartner said AI would sit in the Trough of Disillusionment through 2026 and that enterprise scale depends on improved predictability of ROI. | Medium | SM003 |
| CM013 | Fortune Business Insights sized the AI-powered storage market at $44.94 billion in 2026 after valuing it at $35.90 billion in 2025. | Medium | SM005 |
| CM014 | Fortune Business Insights attributed AI-powered storage growth to high-throughput data access, large datasets, HPC demand, and cloud-service adoption. | Medium | SM005 |
| CM015 | Fortune Business Insights said file-based architectures historically dominated the category while object-based storage is expected to grow faster for AI and ML data sets. | Medium | SM005 |
| CM016 | Fortune Business Insights grouped AI-powered-storage buyers into enterprises, telecom companies, cloud service providers, and government bodies, with enterprises accounting for 30% share in 2026. | Medium | SM005 |
| CM017 | Research and Markets valued the broader AI infrastructure market at $90.91 billion in 2026 and structured it by offerings, function, technology, deployment type, and end user. | Medium | SM006 |
| CM018 | Coherent Market Insights valued AI infrastructure at $90 billion in 2026 and said the category is 54% hardware, 46% on-prem, 48% enterprise, and 40% North America. | Medium | SM007 |
| CM019 | Public 2026 market estimates conflict mainly because they measure incompatible scopes ranging from OEM storage systems to AI-powered storage to full AI infrastructure. | Medium | SM003, SM004, SM005, SM006, SM007 |
| CM020 | No public source cleanly isolates a VAST-specific SAM or SOM, so the defensible public framing is a multi-lens range rather than a single TAM headline. | Medium | SM004, SM005, SM006, SM007 |
| CM021 | VAST’s closest public-market fit is the shared data-infrastructure layer inside enterprise AI factories rather than total AI infrastructure or general AI software spend. | Medium | SM001, SM016, SM018 |
| CM022 | VAST and NVIDIA position the platform for accelerated analytics, generative AI, agentic AI, and small-scale HPC inside enterprise AI factories. | Medium | SM014, SM018 |
| CM023 | The VAST reference architecture says enterprises can begin with four-node scalable units and expand as AI demand grows. | Medium | SM018 |
| CM024 | The VAST reference architecture lists faster time-to-token, resource efficiency, lower TCO, and reduced deployment risk as core business benefits. | Medium | SM018 |
| CM025 | The VAST reference architecture says DASE can deliver enough parallelism to challenge the assumption that NFS is inadequate for AI and HPC workloads. | Medium | SM018 |
| CM026 | Supermicro and VAST market CNode-X as a turnkey enterprise AI data platform that bundles compute, storage, and software for AI factory deployment. | Medium | SM017 |
| CM027 | Supermicro says the joint platform is designed to keep GPUs fed with data and move teams faster from deployment to first token. | Medium | SM017 |
| CM028 | IDC buyer-shift research says 2026 buyers demand precise, measurable ROI and time-to-impact early in the buying process. | Medium | SM019 |
| CM029 | IDC buyer-shift research says procurement, finance, and revenue operations have become central decision-makers rather than supporting stakeholders. | Medium | SM019 |
| CM030 | IDC buyer-shift research says boards and budget owners increasingly expect scenario-based planning and dynamic TAM-SAM-SOM-style forecasting before qualifying vendors. | Medium | SM019 |
| CM031 | Forrester says the typical 2026 business buying decision includes 13 internal stakeholders and nine external influencers. | Medium | SM020 |
| CM032 | Forrester says procurement is a decision-maker in 53% of business buying cycles. | Medium | SM020 |
| CM033 | Forrester says more than 60% of buyers use a trial and 78% of buyers spending more than $10 million engage in a trial first. | Medium | SM020 |
| CM034 | Deloitte says worker access to AI rose 50% in 2025 and the share of companies with at least 40% of projects in production is set to double within six months. | Medium | SM009 |
| CM035 | Deloitte says the AI skills gap remains the biggest barrier to integration and only one in five companies has mature governance for autonomous AI agents. | Medium | SM009 |
| CM036 | Informatica says 57% cite data reliability as a top barrier, three out of four say governance has not kept pace, and 86% plan to increase data-management investment. | Medium | SM012 |
| CM037 | Electronic Design and SNIA say storage now directly shapes AI performance, efficiency, power, cooling, rack density, and time-to-market. | Medium | SM011 |
| CM038 | Avnet says enterprise SSD demand is expected to grow 41% and AI data centers could consume about 70% of high-end DRAM in 2026. | Medium | SM008 |
| CM039 | Avnet says DRAM and NAND tightness could persist through 2027 as capacity expansion lags demand. | Medium | SM008 |
| CM040 | NetApp says organizations are increasingly choosing AI-ready data infrastructure and that its all-flash business reached a $3.6 billion annualized run rate in Q1 FY26. | Medium | SM021 |
| CM041 | Pure says AI-era competitive advantage depends on data accessibility and breaking data free from application silos. | Medium | SM023 |
| CM042 | Pure’s subscription ARR rose from $1.7 billion in Q1 FY26 to $1.8 billion in Q3 FY26, showing incumbents are monetizing AI-era data-platform demand. | Medium | SM022, SM023 |
| CM043 | Cloudera says 2026 is when enterprises move from experimentation to operational intelligence orchestration and need unified control planes, governance, and hybrid execution. | Medium | SM024 |
| CM044 | MinIO argues legacy storage architectures can become the primary bottleneck for enterprise AI and that scalable object storage is well-suited to distributed AI workloads. | Low | SM013 |
| CM045 | INFUSE says buyer confidence continues to erode despite AI-assisted research and that vendors must demonstrate value quickly and precisely. | Medium | SM025 |
| CM046 | Public evidence still lacks VAST pricing, customer count, conversion rates from trial to production, and workload-level revenue mix, which limits precise SOM modeling. | Medium | SM019, SM020 |
| CP001 | VAST publicly presents itself as an AI Operating System that unifies storage, database, and compute for agentic and data-intensive applications. | Medium | SP028 |
| CP002 | VAST’s platform overview says its architecture is meant for petabyte- and exabyte-scale datasets with high-performance random I/O. | Medium | SP001 |
| CP003 | Pure FlashBlade//S is marketed as a scale-out unstructured storage platform with native file and object protocols for analytics, AI/ML, and HPC workloads. | Medium | SP002 |
| CP004 | Pure’s Evergreen//One commercial model is consumption-based, SLA-backed, and built around paying for used capacity rather than provisioned capacity. | Medium | SP003 |
| CP005 | Pure said fiscal 2025 revenue surpassed $3 billion and grew 12% year over year, giving it public-company scale beyond most private specialists. | Medium | SP004 |
| CP006 | NetApp markets a unified storage environment spanning on premises, public clouds, and hybrid cloud under an ONTAP-led portfolio. | Medium | SP005 |
| CP007 | NetApp Keystone sells block, file, and object storage through pay-as-you-go terms and cloud bursting rather than only fixed hardware purchases. | Medium | SP006 |
| CP008 | WEKA markets NeuralMesh as software-defined AI infrastructure built on containerized microservices, true multi-tenancy, and independent service scaling. | Medium | SP007 |
| CP009 | WEKA’s 2026 newsroom highlights NVIDIA-linked AI data platform solutions and regional partners, implying strategic emphasis on AI factory delivery and channel expansion. | Medium | SP008 |
| CP010 | Globes reported that WEKA raised $140 million at a $1.6 billion valuation in 2024 and had raised about $400 million in total. | Medium | SP009 |
| CP011 | DDN positions itself as a data intelligence platform for AI factories, inference, sovereign AI, hyperscalers, and regulated workloads, not only classic HPC storage. | Medium | SP010 |
| CP012 | DDN says 8 of 10 leading automotive companies and 7 of 10 top banking and securities firms trust it, signaling enterprise reach in demanding verticals. | Low | SP010 |
| CP013 | DDN markets Infinia as an end-to-end AI data platform across cloud, core, and edge and says it delivers five nines of availability. | Medium | SP011 |
| CP014 | DDN says Infinia can fit 100 PB of storage in a single rack, which is a scale and density claim aimed squarely at AI factory economics. | Low | SP011 |
| CP015 | IBM Storage Scale is software-defined storage for AI, HPC, and analytics and unifies file and object data across data centers, cloud, and edge. | Medium | SP012 |
| CP016 | IBM’s Storage Scale System 6000 page lists NVIDIA-certified positioning and up to 340 GB/s throughput, reinforcing IBM’s enterprise-AI performance story. | Medium | SP013 |
| CP017 | Hammerspace Tier 0 uses GPU server-local NVMe as shared storage and says it can be activated without proprietary clients or agents. | Medium | SP014 |
| CP018 | Hammerspace’s own competitive brief says VAST has no Tier 0 equivalent and that VAST cloud clusters remain limited on performance and scale. | Low | SP014 |
| CP019 | TechCrunch reported that Hammerspace raised $100 million at a valuation above $500 million and named Meta and the U.S. Department of Defense as customers. | Medium | SP015 |
| CP020 | Qumulo describes itself as a modern file and object data platform spanning edge, data center, and cloud with a global namespace and AI-assisted visibility. | Medium | SP016 |
| CP021 | Qumulo’s cloud packaging mixes a fully managed Azure Native offer with self-hosted AWS and Google Cloud deployments. | Medium | SP017 |
| CP022 | AWS FSx for Lustre uses usage-based pricing with separate charges for storage, throughput, metadata IOPS, backups, and some data transfer. | Medium | SP018 |
| CP023 | AWS S3 exposes explicit usage-based pricing for storage, requests, and related data-movement line items, making it a transparent substitute for colder tiers. | Medium | SP019 |
| CP024 | Azure Managed Lustre is a pay-as-you-go managed filesystem for HPC and AI workloads with per-GiB and per-hour pricing constructs. | Medium | SP020 |
| CP025 | Azure’s pricing page explicitly notes Azure Government procurement eligibility, showing that trust and regulated purchasing rails are part of the substitute story. | Medium | SP020 |
| CP026 | Google Filestore bills on provisioned capacity and, for custom performance, on provisioned IOPS, so cloud simplicity can still come with idle-capacity cost. | Medium | SP021 |
| CP027 | Ceph documentation shows that internal-build paths can combine object, block, and file storage with POSIX semantics and S3- or Swift-compatible APIs in one system. | Medium | SP022 |
| CP028 | MinIO AIStor markets exabyte-scale AI storage with hyperscaler economics and no vendor lock-in, directly challenging proprietary AI storage vendors on TCO messaging. | Medium | SP023 |
| CP029 | Coldago’s 2025 map places VAST, Pure, NetApp, IBM, DDN, Qumulo, Hammerspace, and WEKA inside the same enterprise, high-performance, or cloud file-storage competitive frame. | Medium | SP024 |
| CP030 | StorageNewsletter says Coldago ranks companies on portfolio breadth, installed base, recent performance, and product availability rather than on a single benchmark result. | Medium | SP025 |
| CP031 | theCUBE Research argues that VAST’s AI OS ambition is ahead of its current maturity because it is not yet equivalent to a hyperscaler data platform, Databricks lakehouse, or Snowflake-grade database. | Medium | SP026 |
| CP032 | theCUBE says VAST is trying to extend from storage into distributed scheduling, event streaming, vector search, database indexing, and agent execution. | Medium | SP026, SP028 |
| CP033 | StorageMath argues that VAST’s Amplify and Flash Reclaim claims overstate effective capacity and turn data reduction into a lock-in narrative. | Low | SP027 |
| CP034 | Most specialist vendor pages in the retained set route buyers toward sales-led or calculator-led procurement rather than durable public list prices. | Medium | SP002, SP003, SP005, SP006, SP007, SP010, SP011, SP012, SP013, SP014, SP016, SP017 |
| CP035 | Hyperscaler substitutes have a procurement advantage because pricing, metering, and billing are already integrated into existing cloud contracts. | Medium | SP018, SP019, SP020, SP021 |
| CP036 | Pure and NetApp both counter specialist vendors with as-a-service packaging, which narrows any specialist advantage that depends only on moving capex to opex. | Medium | SP003, SP006 |
| CP037 | Hammerspace and Qumulo foreground global namespace, data movement, or multi-cloud placement more explicitly than VAST’s retained public platform pages do. | Medium | SP014, SP016, SP017, SP001 |
| CP038 | DDN, IBM, Pure, Qumulo, WEKA, and VAST all publicly target AI, HPC, or large unstructured-data workloads, so VAST does not enjoy a narrow uncontested wedge. | Medium | SP001, SP002, SP007, SP010, SP012, SP016 |
| CP039 | Internal-build substitutes reduce vendor lock-in but move integration, lifecycle management, and operational burden back onto the buyer’s platform team. | Medium | SP022, SP023 |
| CP040 | Qumulo’s managed Azure route plus self-hosted AWS and Google Cloud variants show that some challengers can land through hyperscaler channels rather than only direct appliance sales. | Medium | SP017 |
| CP041 | WEKA’s funding, ARR estimates, and 2026 partner-heavy newsroom indicate a better-capitalized and more ecosystem-oriented rival than an early-stage niche point product. | Medium | SP008, SP009 |
| CP042 | Hammerspace’s plan to use fresh capital for sales and marketing implies differentiated architecture but smaller current distribution reach than public incumbents and hyperscalers. | Medium | SP015 |
| CP043 | DDN’s sovereign-AI narrative and Azure Government procurement language show that trust and regulatory posture are a competitive dimension, not just a feature checklist. | Medium | SP010, SP020 |
| CP044 | VAST’s homepage says the company tripled valuation to $30 billion in a new Series F financing round, which supports its scale but also raises the bar for moat durability. | Medium | SP028 |
| CP045 | Multi-homing remains practical because buyers can combine a primary AI storage layer with Qumulo or Hammerspace orchestration, cloud filesystems, object stores, and open-source tiers. | Medium | SP014, SP017, SP018, SP019, SP022, SP023 |
| CP046 | VAST’s switching costs appear to come more from data gravity, tuned pipelines, and operational familiarity than from uniquely transparent pricing or a uniquely dominant compliance surface. | Medium | SP001, SP018, SP020, SP022, SP023 |
| CI001 | VAST positions the AI Operating System as a unified platform combining storage, database, and compute rather than a single-purpose storage product. | Medium | SI002, SI004 |
| CI002 | Across the reviewed public VAST commercial surfaces, buyers are routed to contact sales or demos rather than a self-serve list price. | Medium | SI001, SI002, SI004 |
| CI003 | VAST describes Gemini as a software consumption model that disaggregates hardware from software economics. | Medium | SI001, SI011 |
| CI004 | Cisco made the VAST AI Operating System available through its Global Price List and supports the joint stack as part of Cisco procurement. | Medium | SI025 |
| CI005 | Polaris is included at no extra cost, implying that control-plane functionality is bundled into the AI OS rather than separately monetized. | Medium | SI007 |
| CI006 | VAST signed a commercial agreement valued at $1.17 billion with CoreWeave, making VAST the primary data foundation for CoreWeave's AI cloud. | Medium | SI014, SI030 |
| CI007 | VAST said in 2021 that customers made initial investments averaging about $1 million, that expansion averaged 328 percent, and that several customers had already invested more than $10 million. | Medium | SI011 |
| CI008 | In FY22 VAST said it more than doubled its customer roster, managed more than three exabytes, averaged roughly 12PB per customer, and posted net revenue retention above 300 percent. | Medium | SI010 |
| CI009 | VAST's 2023 Series E release said the company had surpassed $1 billion in cumulative software bookings, grown 3.3x year over year, maintained positive cash flow for 12 quarters, and reached nearly 90 percent gross margin. | Medium | SI009 |
| CI010 | VAST's 2024 Forbes Cloud 100 release said the company ended 2023 at $200 million of ARR, remained cash-flow positive for three years, and posted a Rule of X of 162 percent. | Medium | SI012 |
| CI011 | VAST's 2025 Forbes Cloud 100 release said the company had remained cash-flow positive for more than four years and was experiencing 5x year-over-year sales growth in FY25 with a Rule of X of 551 percent. | Medium | SI013 |
| CI012 | VAST's 2026 Series F announcement said the company had surpassed $4 billion in cumulative bookings and exited the prior fiscal year with more than $500 million in committed annual recurring revenue, plus positive operating margin and free cash flow. | Medium | SI006, SI015 |
| CI013 | VAST says thousands of organizations rely on its platform and that it supports environments powering millions of GPUs globally. | Medium | SI006, SI002 |
| CI014 | Independent 2025 reporting said VAST reached $200 million of ARR by January 2025 and projected around $600 million of ARR in the following year. | Low | SI021, SI026 |
| CI015 | Independent 2026 reporting estimated that VAST's ARR-like revenue could be roughly $2 billion by the end of 2025, far above the company's disclosed committed-ARR floor. | Low | SI022, SI024 |
| CI016 | Public recurring-revenue signals are not directly comparable because they mix committed ARR, projected ARR, and broader ARR-like or non-committed revenue constructs. | Medium | SI006, SI021, SI022, SI024 |
| CI017 | TNW and Sacra reported that top-100 new customers spent more than $1.2 million on average and that contracts typically ran five to seven years, while Sacra added that three customers represented more than $100 million of total commitments. | Low | SI024, SI026 |
| CI018 | VAST's go-to-market combines direct enterprise selling with partner and OEM distribution, as shown by contact-sales surfaces, Cisco GPL availability, and large cloud-partner agreements. | Medium | SI001, SI014, SI025 |
| CI019 | VAST's white paper says OEMs such as SuperMicro, Lenovo, and Cisco wanted to sell VAST clusters on standard servers, reinforcing the company's partner-hardware model. | Medium | SI005 |
| CI020 | Because Gemini separates software from hardware and OEMs can sell VAST on standard servers, VAST likely carries less inventory and manufacturing working capital than a vertically integrated appliance vendor. | Medium | SI005, SI011, SI025 |
| CI021 | VAST repeatedly frames its economics as software-gross-margin oriented, and the last explicit public gross-margin datapoint was nearly 90 percent in 2023. | Medium | SI009, SI010 |
| CI022 | VAST Amplify shows the company can monetize installed-base flash capacity and capacity optimization even when customers delay new hardware procurement. | Medium | SI008 |
| CI023 | Bundled services such as Polaris imply that at least some service-delivery and control-plane costs sit inside platform margin rather than in separately disclosed SKUs. | Medium | SI002, SI007 |
| CI024 | Cisco GPL availability and Cisco support materially lower procurement friction and improve the odds of channel-assisted expansion into enterprise AI budgets. | Medium | SI025 |
| CI025 | Fierce reported that Google Cloud, CoreWeave, AWS, Azure, Crusoe, Lambda, and Core42 appear in VAST's cloud-partner ecosystem, suggesting broad partner-led distribution into AI clouds. | Low | SI026 |
| CI026 | VAST's 2026 financing valued the company at $30 billion, more than tripling the $9.1 billion Series E valuation from late 2023. | Medium | SI006, SI015 |
| CI027 | The Series F was a mixed primary-and-secondary transaction, and outside coverage suggested more than $500 million and possibly most of the round went to secondary liquidity for employees and early investors. | Low | SI006, SI019, SI022, SI024 |
| CI028 | VAST said primary proceeds from the Series F would be used for global growth and strategic transactions that expand its technology footprint and partnerships. | Medium | SI006 |
| CI029 | SiliconANGLE reported that Hallak said VAST has been cash-flow positive for several years and does not require external capital to operate. | Medium | SI023 |
| CI030 | Because a meaningful portion of the headline Series F appears secondary, the gross $1 billion transaction value likely overstates net new balance-sheet cash available to fund operations. | Medium | SI006, SI019, SI024 |
| CI031 | No reviewed source publicly disclosed VAST's cash on hand, monthly burn, cash runway, or a board-defined next-round trigger as of runDate. | Low | SI006, SI015, SI023, SI026 |
| CI032 | No reviewed source publicly disclosed debt facilities, project-finance obligations, or covenant packages for VAST. | Low | SI006, SI015, SI027, SI028, SI029 |
| CI033 | No reviewed source disclosed CAC, CAC payback, or a quantified sales cycle for VAST's enterprise and partner channels. | Low | SI001, SI006, SI015, SI026 |
| CI034 | No reviewed source disclosed realized ASPs, partner take-rates, discount schedules, or separate support and services gross margins. | Low | SI001, SI002, SI025 |
| CI035 | Fidelity's N-PORT filings marked the same basket of VAST preferred shares in one portfolio at roughly $1.08 million in August 2024, $1.34 million in February 2025, and $3.51 million in February 2026. | Medium | SI027, SI028, SI029 |
| CI036 | Public traction supports strong revenue quality because the company reports positive free cash flow, positive operating margin, long-duration contracts, high average land sizes, and multi-year customer expansion. | Medium | SI006, SI013, SI024, SI026 |
| CI037 | A full financial underwrite is blocked by unreconciled recurring-revenue definitions, no public customer-concentration schedule, and missing treasury and debt detail. | Medium | SI006, SI021, SI022, SI023 |
| CI038 | Public customer references indicate meaningful exposure to AI-cloud and hyperscaler-style accounts such as CoreWeave and xAI, which can accelerate growth but create concentration and renewal-timing risk. | Medium | SI014, SI024, SI026 |
| CI039 | VAST increasingly resembles an infrastructure-software company sold alongside partner hardware, but the public record does not reconcile what share of revenue is software, services, or system-linked economics. | Medium | SI003, SI005, SI011, SI025 |
| CI040 | The financial verdict is positive on revenue quality and margin potential but incomplete for underwriting because cash, burn, CAC, pricing realization, and concentration remain private. | Medium | SI006, SI023, SI024, SI026 |
| CI041 | The gap between bookings, committed ARR, and higher ARR-like estimates suggests that VAST's public metrics mix delivery-timed system economics with recurring software constructs that are not yet bridged publicly. | Medium | SI006, SI011, SI024 |
| CE001 | VAST publicly defines the AI Operating System as software that unifies storage, database, and compute. | High | SE001, SE017 |
| CE002 | VAST's 2026 product narrative frames the platform as a single layer for ingestion, retrieval, analytics, and inference rather than as a storage-only product. | Medium | SE004, SE023 |
| CE003 | Public technical and partner materials present the platform as a set of named modules that includes DataStore, DataBase, DataSpace, and DataEngine. | High | SE006, SE017 |
| CE004 | VAST says DataStore can present the same underlying data over NFS, SMB, and S3 simultaneously. | Medium | SE006 |
| CE005 | VAST describes DataBase as an ACID-compliant high-speed data lake for transactional and analytical workloads at exabyte scale. | Medium | SE006 |
| CE006 | VAST says DataBase supports query engines such as Trino and Spark with pushdown capabilities. | Medium | SE006, SE010 |
| CE007 | VAST describes DataEngine as a distributed processing environment for event-driven AI workflows that initially writes streaming events directly into the VAST DataBase. | Medium | SE006 |
| CE008 | VAST says DataSpace creates a global namespace across data centers, cloud, and edge using global lease management and intelligent data movement. | Medium | SE006 |
| CE009 | VAST's public Kubernetes guidance supports both direct NFS-backed usage and CSI-based dynamic provisioning for persistent volumes. | High | SE009, SE011, SE015 |
| CE010 | VAST distributes the CSI integration through an official GitHub repository and Helm chart instructions. | Medium | SE011, SE015 |
| CE011 | Public operator tooling includes a RESTful VMS API, Swagger exposure, Python SDK, Terraform provider, and DataEngine CLI. | High | SE019, SE012, SE013, SE014 |
| CE012 | The public VAST GitHub organization page showed 55 repositories and several repositories updated during May 2026 when reviewed for this run. | Medium | SE010 |
| CE013 | The public vast-csi repository showed 844 commits when reviewed for this run. | Medium | SE011 |
| CE014 | DASE is documented as separating computational resources from persistent data and system state. | High | SE006, SE019 |
| CE015 | VAST says any CNode can access all data, metadata, and system state directly while persistent state resides on NVMe-backed storage enclosures. | Medium | SE006 |
| CE016 | VAST's architecture is positioned as allowing compute and capacity to scale independently rather than being locked together per node. | High | SE006, SE018, SE019 |
| CE017 | The retained public architecture shows workload-optimized deployments with CBoxes and DBoxes and a BlueField-optimized mode with CNodes on NVIDIA DPUs. | Medium | SE006, SE008 |
| CE018 | Cisco and Supermicro both publicly package VAST AI OS and DASE-based services into partner infrastructure offers. | High | SE017, SE018 |
| CE019 | Supermicro says its EBox runs CNode and DNode containers on the same server while preserving DASE-style independent scaling across the NVMe fabric. | Medium | SE018 |
| CE020 | Juniper's validated design exposes concrete VMS GUI and CLI management, REST API usage, VIP rebalancing, and a specific AI-lab cluster topology. | Medium | SE019 |
| CE021 | The retained corpus supports a multiprotocol platform spanning NFS, SMB, S3, SQL or table access, and native block via NVMe/TCP by 2025. | High | SE006, SE021, SE022 |
| CE022 | The Next Platform reported that VAST expected iSCSI support later, implying the block story was newer and not yet protocol-complete versus legacy SAN expectations. | Medium | SE022 |
| CE023 | The Next Platform reported that VAST block volumes inherit snapshots, clones, replication, and QoS from the common platform. | Medium | SE022 |
| CE024 | DBTA and The Next Platform reported that Event Broker adds Kafka-compatible streaming so streamed data can land on the shared platform and become queryable through SQL. | Medium | SE021, SE022 |
| CE025 | VAST's 2026 launch materials say CNode-X embeds NVIDIA technologies including cuDF through Sirius, cuVS, NIM microservices, and CMX support into core VAST services. | High | SE004, SE023, SE024 |
| CE026 | VAST's 2026 materials cite early Sirius benchmarks of up to 44 percent lower query time and up to 80 percent lower query cost. | Medium | SE004, SE023, SE024 |
| CE027 | Polaris is described as a Kubernetes-based global control plane for public cloud, neocloud, and on-prem VAST deployments. | Medium | SE023, SE025 |
| CE028 | PolicyEngine is described as an inline zero-trust enforcement layer with tamper-proof audit logs for agentic workflows. | Medium | SE023, SE025 |
| CE029 | TuningEngine is described as a closed-loop model-improvement service and both TuningEngine and PolicyEngine were presented as rolling out through 2026 rather than as long-proven defaults. | Medium | SE023, SE025 |
| CE030 | VAST says CNode-X will come to market through OEM partners including Cisco and Supermicro. | Medium | SE004, SE024 |
| CE031 | VAST says it is open-sourcing DataEngine blueprints for video search, enterprise document RAG, and genomics workflows. | Medium | SE004, SE024 |
| CE032 | VAST's AI reference architecture and security guide describe zero-trust controls including RBAC, ABAC, encryption, auditing, network isolation, tenant isolation, and tenant-specific key management. | Medium | SE006, SE007 |
| CE033 | The Security Configuration Guide says the platform is a STIG-hardened appliance and is listed on the DoDIN Approved Products List. | Medium | SE007 |
| CE034 | The Security Configuration Guide documents HTTPS-based VMS management, CLI management, and predefined roles including Administrators, Read Only, Configuration, CSI, and Debug Metrics. | Medium | SE007 |
| CE035 | The Security Configuration Guide says VAST supports Active Directory, LDAP, NIS, and local users and requires trusted TLS before configuring SSO and MFA. | Medium | SE007 |
| CE036 | Commvault documents VAST as a validated NFS disk library or S3 cloud library target and says WORM mode can create immutable backup copies. | Medium | SE020 |
| CE037 | CrowdStrike and VAST say their 2026 partnership extends detection and response across data ingestion, model training, runtime operation, and inference, but the release also warns that described features may not yet be generally available. | Medium | SE026 |
| CE038 | VAST's compliance page markets the platform around governance, security, and regulated-data contexts including SEC, GDPR, and FINRA. | Medium | SE005 |
| CE039 | This chapter's retained public corpus does not show a broad public trust-center style catalog of current audit reports or certifications for VAST Data beyond compliance marketing and deployment documentation. | Low | SE005, SE007 |
| CE040 | Cisco's public data sheet says VAST software licensing is subscription-based, portable, and sold across components that include DataStore, DataBase, DataEngine, and DataSpace. | Medium | SE017 |
| CE041 | VASTPY publicly supports API-token authentication on VAST 5.3 and later and exposes protocol metrics for NFS, SMB, and S3. | Medium | SE013 |
| CE042 | The Terraform provider publicly supports registry installation and importing existing VAST resources. | Medium | SE012 |
| CE043 | The retained corpus shows a visible public developer surface across CSI, SDKs, Terraform, CLI, and Go tooling, but it does not show usage telemetry strong enough to quantify practitioner adoption. | Medium | SE010, SE011, SE012, SE013, SE014, SE016 |
| CE044 | Public 2026 materials describe Polaris, PolicyEngine, and TuningEngine in detail, but the retained product-tech corpus does not show broad public customer reference deployments or mature admin documentation for them yet. | Medium | SE023, SE025, SE026 |
| CE045 | VAST's most supportable differentiation is broad shared-data-plane scope, while its most visible technical risk is making the AI Operating System claim feel as mature as the underlying storage core. | Medium | SE006, SE022, SE023 |
| CE046 | The Security Configuration Guide says the product should be deployed inside a hardened customer environment, implying secure operation depends on customer-side hardening as well as product defaults. | Medium | SE007 |
| CE047 | This chapter's retained public corpus does not provide a clear current public SLA, uptime dashboard, or incident-history source for VAST Data reliability. | Low | SE001, SE002, SE005, SE007, SE017 |
| CU001 | Official and customer-proof pages show VAST adoption across AI cloud, financial services, healthcare and life sciences, government research, media, and telecom segments. | Medium | SU001, SU007, SU010, SU013, SU017 |
| CU002 | Retained customer-proof pages name NHL, SK Telecom, CoreWeave, HSBC, CACEIS, Jump Trading, HHS, Invitae, PacBio, LLNL, and Chan Zuckerberg Initiative. | Medium | SU002, SU007, SU010, SU013, SU014, SU015, SU016, SU017, SU018, SU019, SU020 |
| CU003 | The official 2023 Series E release also lists Booking Holdings, U.S. Air Force, U.S. Department of Energy, Verizon, Boston Children’s Hospital, Pixar, and Zoom as customer-roster examples. | Medium | SU022 |
| CU004 | VAST’s buyer-user-payer pattern is segment specific, with central infrastructure budgets sponsoring deployments for downstream researchers, creators, analysts, or cloud tenants. | Medium | SU007, SU010, SU013, SU015, SU017 |
| CU005 | The NHL says it originally selected VAST as a production platform for media workflows rather than as a generic storage refresh. | Medium | SU007 |
| CU006 | The NHL-VAST relationship has lasted roughly six years and expanded from archive management into live media operations. | Medium | SU007, SU008 |
| CU007 | The NHL says it moved more than 20 PB of archive footage onto the VAST platform. | Medium | SU007 |
| CU008 | By 2025 the NHL had implemented VAST in all 32 arenas plus headquarters to create a single logical cluster or namespace. | Medium | SU007, SU008 |
| CU009 | In a 2026 outdoor-game pilot, the NHL’s portable VAST node replicated more than 2 TB of footage with stable operation and minimal turnaround time. | Medium | SU009 |
| CU010 | The NHL says the next phase is to operationalize its distributed VAST model across major events and league-wide arenas. | Medium | SU009 |
| CU011 | SK Telecom deployed VAST AI OS as a core component of its sovereign AI cloud in 2025. | High | SU010, SU012 |
| CU012 | SK Telecom says VAST-backed GPU environments can be provisioned in roughly 5–10 minutes. | High | SU010, SU011, SU012 |
| CU013 | SK Telecom positions the deployment as infrastructure for government, research, and enterprise AI customers inside South Korea. | Medium | SU010, SU012 |
| CU014 | SK Telecom’s VAST-enabled sovereign AI stack uses 1,000 NVIDIA Blackwell GPUs and claims throughput approaching 14 GB/s via RDMA passthrough. | Medium | SU011 |
| CU015 | VAST announced a $1.17 billion commercial agreement with CoreWeave in November 2025. | High | SU003, SU004, SU005, SU006 |
| CU016 | Official and independent coverage describe CoreWeave as a long-standing or expanded partner using VAST as a primary data foundation for its AI cloud. | High | SU002, SU003, SU004, SU006 |
| CU017 | VAST’s CoreWeave customer story claims 23 global data centers, more than 250,000 GPUs, more than 500 PB of capacity, up to 2.3 GB/s per GPU, and 99.9999% uptime. | Medium | SU002 |
| CU018 | CoreWeave evidence is partner mediated because VAST’s growth is tied not only to CoreWeave as an account but also to workloads CoreWeave delivers to shared end customers. | Medium | SU003, SU004, SU005 |
| CU019 | SDxCentral says VAST’s platform under CoreWeave supports AI training and inference for CoreWeave customers such as Meta, OpenAI, and Microsoft. | Medium | SU005 |
| CU020 | In 2021 VAST said average initial customer investment was about $1 million. | Medium | SU021 |
| CU021 | In 2021 VAST said net revenue within existing customers had expanded 328% on average. | Medium | SU021 |
| CU022 | In 2021 VAST said several customers had already invested more than $10 million. | Medium | SU021 |
| CU023 | Customer-proof freshness is uneven because NHL, SKT, and CoreWeave have 2025-2026 proof while many other named logos appear only in 2023 roster disclosures or title-level case pages. | Medium | SU007, SU009, SU010, SU012, SU022 |
| CU024 | HSBC, CACEIS, and Jump Trading extend proof into financial services, but the retained public pages provide less quantified outcome detail than the AI-cloud and media references. | Medium | SU016, SU017, SU018 |
| CU025 | Invitae says VAST improved data access 6x and IOPS 30x for genetics workflows. | Medium | SU013 |
| CU026 | PacBio says it added another 2 PB to its VAST cluster with no issues and uses VAST to scale genomics workflows. | Medium | SU014 |
| CU027 | HHS testimony on FeaturedCustomers says the department deployed scalable flash-based architectures with VAST to modernize science workflows and improve public health research. | Medium | SU015, SU025 |
| CU028 | LLNL’s case page presents VAST as the platform underpinning HPC and machine-learning work for drug-efficacy research. | Medium | SU019 |
| CU029 | FeaturedCustomers lists 65 testimonials, 37 case studies, 36 customer videos, and a 4.8/5 score based on 1,417 reference ratings for VAST. | Medium | SU025 |
| CU030 | PeerSpot shows VAST at 5.0/5 from 2 reviews with 100% willing to recommend as of May 2026. | Medium | SU026 |
| CU031 | PeerSpot also cites write-performance and read-write-ratio weaknesses and says VAST is not the cheapest option. | Medium | SU026 |
| CU032 | Public durability proof is indirect because the retained set shows historical expansion, long-lived named relationships, and review sentiment rather than current cohort metrics. | Medium | SU007, SU021, SU025, SU026 |
| CU033 | The 2023 official release says VAST customer partnerships with CoreWeave, Lambda, and Core42 are central to next-generation AI-cloud infrastructure. | Medium | SU022 |
| CU034 | SK Telecom evidence shows VAST adoption routed through NVIDIA and Supermicro ecosystem partners rather than only through direct enterprise procurement. | Medium | SU010, SU012 |
| CU035 | The retained NVIDIA Nasdaq customer story documents Nasdaq AI outcomes, but it does not mention VAST, so it is adjacent ecosystem evidence rather than retained VAST customer proof. | Medium | SU027 |
| CU036 | The retained NVIDIA BMW case study documents BMW AI-production outcomes, but it does not mention VAST, so BMW remains unverified as a VAST customer in this chapter. | Medium | SU028 |
| CU037 | Sacra identifies Goldman Sachs and Dell Technologies Capital as investors in VAST rather than as customer references. | Medium | SU024 |
| CU038 | The retained public proof set skews toward large enterprises, public-sector research bodies, AI clouds, and infrastructure-heavy buyers rather than SMB or self-serve usage. | Medium | SU001, SU022, SU026 |
| CU039 | No retained source disclosed current total customer count, channel mix, sales-cycle length, partner take-rates, or a public list-price schedule. | Medium | SU001, SU022, SU024, SU026 |
| CU040 | No retained source disclosed current GRR, NRR, churn, or renewal rates by cohort as of the run date. | Medium | SU021, SU022, SU023, SU024 |
| CU041 | Official proof shows use cases spanning media production, sovereign AI cloud, AI-cloud infrastructure, genomics, quantitative trading, and financial-services AI. | Medium | SU007, SU010, SU013, SU014, SU016, SU017, SU018 |
| CU042 | The strongest production-grade proof in the retained set comes from NHL, SKT, CoreWeave, Invitae, and PacBio because those references include deployment detail or measured outcomes rather than just logos. | Medium | SU002, SU007, SU009, SU010, SU013, SU014 |
| CU043 | VAST’s customer journey often includes evaluation or codesign, production deployment, multisite rollout, and later expansion into adjacent workflows. | Medium | SU007, SU009, SU011, SU021 |
| CU044 | CoreWeave is simultaneously VAST’s strongest public expansion proof and clearest public concentration risk because one disclosed partner-linked agreement is worth $1.17 billion. | High | SU003, SU004, SU005, SU006 |
| CU045 | PeerSpot’s review traffic shows interest concentrated in large enterprises and led by financial-services researchers, reinforcing that VAST is sold into heavyweight environments. | Medium | SU026 |
| CU046 | The official 2023 customer roster is qualitative and broad, but it does not disclose a denominator for total customers or the revenue mix behind those logos. | Medium | SU022 |
| CU047 | The main unresolved underwriting asks are current customer count, cohort retention, top-customer concentration, and partner-sourced revenue mix. | Medium | SU022, SU023, SU024, SU026 |
| CR001 | VAST closed its Series F at a $30 billion valuation, more than tripling the $9.1 billion valuation disclosed for the 2023 Series E. | Medium | SR001, SR002 |
| CR002 | The Series F included both primary and secondary capital and carried total transaction value of about $1 billion. | Medium | SR001, SR002, SR003 |
| CR003 | VAST said primary proceeds from the Series F will fund global growth, technology expansion, and strategic transactions or partnerships. | Medium | SR001 |
| CR004 | TechCrunch reported in June 2025 that VAST was already targeting a roughly $25 billion valuation before the 2026 round closed. | Medium | SR003 |
| CR005 | VAST's strongest public operating proof still comes from company-claimed metrics such as more than $4 billion of cumulative bookings, more than $500 million of committed ARR, and positive free cash flow. | Medium | SR001 |
| CR006 | At a $30 billion valuation, the public underwriting case still leans heavily on company-claimed scale metrics rather than audited public disclosure. | Medium | SR001, SR002, SR003 |
| CR007 | VAST disclosed a $1.17 billion commercial partnership with CoreWeave. | Medium | SR004, SR005, SR006 |
| CR008 | The CoreWeave agreement makes VAST's AI OS the primary data platform underneath CoreWeave's compute infrastructure. | Medium | SR004, SR006 |
| CR009 | SDxCentral reported that the CoreWeave environment VAST supports feeds workloads for Meta, OpenAI, and Microsoft. | Medium | SR006 |
| CR010 | Globes reported that VAST's customer base includes xAI, CoreWeave, AWS, and other neocloud or enterprise accounts. | Medium | SR018 |
| CR011 | VAST's AI Operating System became available through Cisco's Global Price List with Cisco support for the joint stack. | Medium | SR009, SR010 |
| CR012 | Cisco, NVIDIA, and VAST published a validated enterprise AI data platform architecture inside Cisco Secure AI Factory with NVIDIA. | Medium | SR010 |
| CR013 | Supermicro and VAST launched the CNode-X integrated AI data platform with NVIDIA in February 2026. | Medium | SR008 |
| CR014 | VAST says its AI OS runs directly on NVIDIA-powered servers and uses NVIDIA libraries to accelerate compute and data services. | Medium | SR007, SR022 |
| CR015 | VAST obtained NVIDIA-Certified Storage status in 2025, tightening its product story to NVIDIA's ecosystem and validation path. | Medium | SR022 |
| CR016 | VAST's partner ecosystem page says deployments rely on a global network of resellers, integrators, service providers, technology leaders, and cloud providers. | Medium | SR026 |
| CR017 | CRN reported that VAST's storage fabric is delivered on top of OEM hardware from Lenovo, Dell, HPE, SuperMicro, and Cisco. | Medium | SR021 |
| CR018 | Dell publicly identified VAST as a direct rival in enterprise AI storage and claimed better power and rack-efficiency economics than VAST. | Medium | SR021 |
| CR019 | VAST responded to Dell's comparison by calling it misrepresentation and marketing math, showing that benchmark credibility is now a live competitive battleground. | Medium | SR021 |
| CR020 | StorageMath published a direct adverse critique arguing that VAST Amplify's six-times-capacity claim relied on fake math and lock-in framing. | Low | SR025 |
| CR021 | NetApp sued former CTO Jonsi Stefansson alleging he built Red Stapler while still employed at NetApp and that VAST later acquired the resulting company. | Medium | SR011 |
| CR022 | A Florida court dismissed NetApp's complaint without prejudice on forum grounds and NetApp appealed rather than accepting a merits loss. | Medium | SR012 |
| CR023 | The retained litigation reporting says VAST is not named as a defendant in the NetApp complaint. | Medium | SR011, SR012 |
| CR024 | VAST's public privacy policy says its website processing includes IP addresses, device identifiers, contact data, job-candidate submissions, cookies, and named third-party tools, with storage in the United States and cloud infrastructure. | Medium | SR015 |
| CR025 | Wilson Sonsini says the DOJ bulk-data rules effective April 8, 2025 can restrict vendor, investor, and employee access to covered datasets and can carry material civil and criminal penalties. | Medium | SR013 |
| CR026 | White & Case says the 2025-2026 privacy landscape layers the bulk-data rule, new state laws, and more aggressive enforcement onto companies handling sensitive or cross-border data. | Medium | SR014 |
| CR027 | VAST's September 2025 SDS announcement was issued from Tel Aviv and positioned VAST inside a sovereign AI cloud buildout for Israel. | Medium | SR017 |
| CR028 | The SDS sovereign-cloud build relies on thousands of NVIDIA Blackwell GPUs and dozens of petabytes of VAST infrastructure. | Medium | SR017 |
| CR029 | Globes reported in 2025 that VAST had about 1,000 employees, roughly half in Israel, with its main development center in Tel Aviv. | Medium | SR018 |
| CR030 | The Israel Innovation Authority says Israeli high-tech output has been stagnant for two years, R&D roles fell 6.5% year over year, venture fundraising fell sharply, and foreign capital dominates deep-tech. | Medium | SR019 |
| CR031 | JD Supra says Israeli export-control scrutiny, dual-use rules, and IIA grants can constrain IP transfer, manufacturing, and exit structuring. | Medium | SR020 |
| CR032 | Goldman Sachs frames the AI build-out as assumption-sensitive rather than fixed, keeping open the possibility that spending enthusiasm outruns durable demand. | Medium | SR023 |
| CR033 | Gartner says AI is in a 2026 trough of disillusionment and will often be sold by incumbent providers rather than attached to moonshot projects. | Medium | SR024 |
| CR034 | Gartner still forecasts AI infrastructure spending of about $1.366 trillion in 2026, up roughly $401 billion year over year. | Medium | SR024 |
| CR035 | Supermicro says CNode-X follows the NVIDIA AI Data Platform reference architecture and wraps validated hardware, networking, and management around VAST software. | Medium | SR008 |
| CR036 | On residual severity, the most material underwriting risks are valuation reset, partner and customer concentration, execution breadth, legal or IP overhang, and Israel-linked continuity. | Medium | SR001, SR004, SR011, SR019, SR021, SR023, SR024 |
| CR037 | VAST's public announcements still center founder and CEO Renen Hallak across financing, Cisco, and Israel sovereign-cloud launches, indicating meaningful leadership concentration. | Medium | SR001, SR009, SR017 |
| CR038 | The Red Stapler acquisition was part of VAST's push into hyperscale cloud and control-plane capability, so any injunction or code-provenance issue would hit an active expansion vector rather than a legacy side project. | Medium | SR011, SR012 |
| CR039 | Globes described VAST president Mike Wing as a former Dell executive, showing added commercial depth beyond the founders without removing founder dependence. | Medium | SR018 |
| CR040 | VAST's public contract surface routes services through a public EULA plus separate authorized-distributor or written contracts rather than through website terms alone. | Medium | SR016 |
| CR041 | Cisco's solution overview places VAST inside use cases spanning financial services, healthcare, retail, manufacturing, and enterprise IT, pushing the product into regulated or mission-critical workloads. | Medium | SR010 |
| CR042 | The retained public source set does not disclose VAST's top-customer concentration, renewal timing, burn or runway, debt, or current security certifications. | Low | SR001, SR010, SR018 |
| CR043 | Gartner's incumbent-led 2026 AI adoption view increases go-to-market risk for challengers even if total infrastructure spend keeps rising. | Medium | SR021, SR024 |
| CR044 | CRN says Dell remains a storage market leader while VAST is gaining traction mainly in AI and unstructured-data segments, underscoring competitive pressure from better-capitalized incumbents. | Medium | SR021 |
| CR045 | VAST's current delivery stack is multi-layer dependent: NVIDIA for accelerated architecture, Cisco for enterprise channel, Supermicro and other OEMs for validated hardware, and CoreWeave or SDS for flagship cloud deployments. | Medium | SR004, SR007, SR008, SR009, SR010, SR017, SR021, SR026 |
| CR046 | Because the Series F included major secondary liquidity, the headline $1 billion round overstates fresh balance-sheet cash available for execution. | Medium | SR001, SR002, SR003 |
| CR047 | Because VAST's flagship disclosed wins are GPU-cluster and neocloud centric, demand remains exposed to AI capex timing and model-lab spending concentration rather than to a fully diversified enterprise base. | Medium | SR004, SR006, SR018, SR023, SR024 |
| CR048 | Mitigation maturity is strongest where VAST has financing access and validated partner architectures, and weakest where investors still need private disclosure or external legal and compliance clearance. | Medium | SR001, SR008, SR009, SR010, SR011, SR014, SR019 |
| CR049 | The cleanest thesis-break triggers are a down-round or failed IPO pricing event, VAST being pulled directly into Red Stapler litigation or an injunction, inability to evidence security certifications for regulated workloads, or diligence showing concentration materially worse than the public record suggests. | Medium | SR001, SR011, SR012, SR014, SR018 |
| CR050 | The ranked risk stack is mitigated enough for continued diligence, but not enough for blind underwriting without concentration, legal provenance, and compliance evidence. | Medium | SR001, SR006, SR011, SR014, SR019, SR021 |
| CR051 | Supermicro's solution page says the EBox platform is purpose-built for VAST's DASE architecture and optimized for AI and GPU cloud markets. | Medium | SR027 |
| CR052 | VAST's public Terms of Use say website terms do not govern storage software or services, which instead fall under the EULA or separate written contracts. | Medium | SR028, SR016 |
| CR053 | Israel's April 2025 national AI report frames AI as a national priority, which can support local infrastructure demand while increasing policy involvement in the ecosystem. | Medium | SR029 |
| CR054 | Deloitte and F2 say the foundation layer of Israel's AI stack captured only about 13 percent of AI funding and that hardware, infrastructure, and cloud are only moderate-potential hubs, reinforcing the capital-intensity and ecosystem limits around infrastructure vendors. | Medium | SR030 |
| CR055 | Partner breadth across Cisco, Supermicro, OEMs, and the wider ecosystem softens single-vendor dependence but does not remove system-level reliance on external channels and hardware partners. | Medium | SR009, SR021, SR026, SR027 |
| CV001 | VAST Data said on 2026-04-22 that it closed a Series F financing at a $30 billion valuation with approximately $1 billion of total transaction value. | High | SV001, SV003 |
| CV002 | The Series F valuation represented more than a threefold increase from VAST's $9.1 billion Series E valuation in late 2023. | High | SV001, SV011, SV003 |
| CV003 | Drive Capital led the Series F with Access Industries as co-lead, and Fidelity, NEA, and NVIDIA also participated. | Medium | SV001, SV003 |
| CV004 | March 2026 trade-press reports described the $30 billion raise before the company officially disclosed the closed round on 2026-04-22, so the public record contains both an early-reporting window and a later close date. | Medium | SV004, SV006, SV001 |
| CV005 | The company said the financing mixed primary and secondary capital rather than being a purely primary growth round. | High | SV001, SV003 |
| CV006 | StorageNewsletter reported that outside databases described the transaction as roughly half new financing and half secondary liquidity, but VAST did not publicly confirm an exact split. | Medium | SV006, SV008 |
| CV007 | VAST said it exited the previous fiscal year with more than $500 million of committed annual recurring revenue. | High | SV001, SV005 |
| CV008 | VAST said it has surpassed $4 billion in cumulative bookings. | High | SV001, SV005 |
| CV009 | VAST said it exited that same fiscal year with positive operating margin and positive free cash flow. | High | SV001, SV005 |
| CV010 | VAST said its most recent fiscal year produced a Rule of X score of 228 percent. | Medium | SV001, SV005 |
| CV011 | VAST's CEO said the company often sells three-year and five-year licenses, which makes annual recurring revenue smaller than total multi-year contract value. | Medium | SV016, SV010 |
| CV012 | Reuters reported in June 2025 that VAST had reached roughly $200 million of ARR by January 2025 and was projected to reach about $600 million the following year. | Medium | SV012 |
| CV013 | CTech reported that VAST's ARR rose from about $350 million in early 2024 to above $500 million by July 2024, around $700 million by March 2025, and roughly $2 billion by the end of 2025. | Medium | SV009 |
| CV014 | The public revenue record therefore spans at least three non-equivalent constructs: more than $500 million of committed ARR, about $600 million of sourced ARR projection, and roughly $2 billion of third-party ARR-like estimate. | Medium | SV001, SV012, SV009 |
| CV015 | A Fidelity Select Tech Hardware filing for 2024-08-31 marked one VAST Series A preferred line at about $18.16 per share. | Medium | SV015 |
| CV016 | A Fidelity filing for 2025-02-28 marked the same VAST Series A preferred line at about $22.54 per share. | Medium | SV014 |
| CV017 | A Fidelity filing for 2026-02-28 marked that same VAST Series A preferred line at about $59.22 per share, roughly 3.3 times the August 2024 mark. | Medium | SV013, SV015 |
| CV018 | Fidelity's filing progression independently corroborates a steep private-market step-up in VAST's mark ahead of the Series F announcement. | Medium | SV013, SV014, SV015 |
| CV019 | A $30 billion price tag implies roughly 60 times VAST's official $500 million CARR floor. | Medium | SV001 |
| CV020 | A $30 billion valuation implies roughly 50 times the Reuters-cited $600 million ARR projection. | Medium | SV012 |
| CV021 | A $30 billion valuation implies roughly 15 times the CTech estimate of about $2 billion of ARR. | Medium | SV009 |
| CV022 | Everpure, the rebranded Pure Storage, reported fiscal 2026 revenue above $3.6 billion and total cash, cash equivalents, and marketable securities of about $1.5 billion. | High | SV017, SV019 |
| CV023 | Stock Analysis showed Everpure trading at about $29.35 billion of market capitalization and $28.02 billion of enterprise value on 2026-05-27. | Medium | SV036 |
| CV024 | Everpure therefore traded at roughly 7.7 times EV to trailing revenue at runDate. | Medium | SV017, SV019, SV036 |
| CV025 | NetApp reported $1.71 billion of Q3 FY2026 revenue and guided fiscal 2026 revenue to $6.772-$6.922 billion. | High | SV020, SV022 |
| CV026 | Analyst-market-data sources showed NetApp at about $27.42 billion of market capitalization and $27.14 billion of enterprise value with $6.70 billion of trailing revenue on 2026-05-27. | Medium | SV022, SV035 |
| CV027 | NetApp therefore traded at roughly 4.1 times EV to trailing revenue at runDate. | Medium | SV022, SV035 |
| CV028 | CoreWeave reported Q1 2026 revenue of $2.078 billion, revenue backlog of $99.4 billion, cash of about $3.32 billion, and debt of about $24.9 billion. | Medium | SV023 |
| CV029 | Stock Analysis showed CoreWeave at about $57.77 billion of market capitalization and $90.65 billion of enterprise value on 2026-05-27. | Medium | SV034, SV025 |
| CV030 | CoreWeave's enterprise value equated to about 10.9 times annualized Q1 2026 revenue, illustrating the high end of public AI infrastructure multiples but with materially heavier leverage than VAST discloses. | Medium | SV023, SV034 |
| CV031 | Lambda officially announced a $480 million Series D round, and Reuters said sources valued that round at about $2.5 billion post-money. | Medium | SV026, SV037 |
| CV032 | Crusoe announced a $1.375 billion Series E at a valuation above $10 billion to scale its AI infrastructure platform. | Medium | SV027 |
| CV033 | Together AI said its $305 million Series B valued the company at $3.3 billion to expand AI acceleration cloud capacity. | High | SV028, SV029 |
| CV034 | Relative to these private AI infrastructure rounds, VAST's $30 billion price is a far larger absolute bet and only looks comparable if its true recurring revenue base is much closer to $2 billion than to the disclosed $500-$600 million band. | Medium | SV009, SV027, SV029, SV037 |
| CV035 | VAST merits some premium to legacy storage peers because the public record still shows AI Operating System positioning, marquee AI and enterprise customers, rapid growth, and profitability. | Medium | SV001, SV005, SV016 |
| CV036 | Public evidence does not yet prove enough revenue scale or metric consistency to justify paying a larger premium than the market already assigns to faster-growing AI infrastructure names. | Medium | SV005, SV009, SV030, SV034 |
| CV037 | Because the round included material secondary liquidity, the headline $1 billion does not equal $1 billion of fresh operating cash for VAST. | Medium | SV001, SV006, SV008 |
| CV038 | CNBC's January 2026 survey captured widespread concern that record AI valuations and deal activity could be vulnerable to bubble-like multiple compression. | Medium | SV030 |
| CV039 | VAST's CEO said the company has not started an IPO process and has not hired bankers yet. | Medium | SV016 |
| CV040 | The most supportable exit path is eventual IPO rather than strategic sale, but timing remains uncommitted and therefore cannot be priced with precision. | Medium | SV016, SV004 |
| CV041 | A bear case that assumes only about $0.8 billion of durable recurring revenue, public-storage-style 8x-10x multiples, and weaker AI-capex sentiment implies roughly $6.4-$8.0 billion of value. | Low | SV001, SV017, SV020, SV030 |
| CV042 | A base case that assumes about $1.3-$1.5 billion of normalized recurring revenue power and 10x-14x AI infrastructure multiples implies roughly $13-$21 billion of value. | Low | SV001, SV009, SV034, SV036 |
| CV043 | A bull case that assumes the roughly $2 billion ARR estimate is directionally right and that VAST earns 14x-18x premium multiples implies roughly $28-$36 billion of value. | Low | SV001, SV009, SV034, SV036 |
| CV044 | The public record supports a track or research-more stance for fresh capital at $30 billion, with medium confidence, high risk, and an expensive valuation stance. | Medium | SV009, SV030, SV034, SV036 |
| CV045 | The recommendation would improve only if private diligence proves at least about $1.5-$2.0 billion of durable recurring revenue, limited customer concentration, and a cleaner preference stack than public evidence reveals. | Medium | SV006, SV009, SV016 |
| CV046 | Immediate thesis-break triggers are failure to reconcile ARR definitions, evidence of outsized customer concentration, or an IPO-ready revenue profile that still cannot support public-comp multiples. | Medium | SV009, SV016, SV030 |
| CV047 | The company-quality case survives better than the entry-price case: product, customer, and demand signals remain strong even though fresh-money underwriting does not clear at the round price. | Medium | SV001, SV005, SV016 |
| CV048 | The reviewed public sources do not disclose exact Series F liquidation preferences, participation rights, option-pool effects, or fully diluted ownership. | Medium | SV001, SV006, SV016 |
| CV049 | The highest-value diligence asks are an audited revenue bridge, top-customer ARR and renewals, the post-Series-F cap table, and concrete IPO-readiness milestones. | Medium | SV006, SV016, SV023 |
| CV050 | Existing low-basis investors can rationally hold for IPO optionality, but new investors at $30 billion need either a price reset or materially stronger evidence before adding capital. | Medium | SV016, SV030, SV036 |
| CV051 | A $30 billion entry requires roughly a $60 billion exit for 2x gross and a $90 billion exit for 3x gross before dilution. | Medium | SV001 |
| CV052 | At Pure-like 7.7x EV/revenue, a $60 billion exit would require roughly $7.8 billion of revenue, and even at CoreWeave-like 10.9x it would still require roughly $5.5 billion. | Medium | SV017, SV019, SV023, SV034, SV036 |