Cast AI
Autonomous Cloud Efficiency: Cast AI's Kubernetes Cost Platform
Cast AI delivers measurable cloud savings and AI-infrastructure optionality, but undisclosed revenue and round economics keep the unicorn valuation from looking clearly underwritten.
Cover facts
Company profile
Cast AI is a Miami-headquartered, Vilnius-centered infrastructure software company that sells a Kubernetes automation platform spanning cost monitoring, rightsizing, autoscaling, spot management, and newer multicloud GPU orchestration through OMNI Compute. The company was founded in 2019 by repeat founders who previously experienced cloud-cost pain at Zenedge, and it now targets mid-to-large enterprises that run meaningful cloud-native and AI workloads across public clouds.
- Website
- cast.ai
- Founded
- 2019-01-01
- Founders
- Yuri Frayman, Leon Kuperman, Laurent Gil
- Founding location
- Lithuania / Miami, FL
- Headquarters
- Miami, FL
- Product
- Automated Kubernetes rightsizing, autoscaling, bin packing, and spot-instance management, plus OMNI Compute for multicloud GPU and external capacity provisioning.
- Customers
- Mid-to-large enterprises running Kubernetes, cloud-native, and AI workloads on public cloud
- Business model
- Freemium monitoring plus enterprise software / usage-based automation pricing
- Stage
- Series C / Growth
- Funding status
- >$1B valuation in January 2026; >$180M disclosed funding after the 2025 Series C, with the 2026 strategic round amount undisclosed
Executive summary
Top strengths
- Proven cloud savings and automation outcomes in public enterprise case studies
- Cross-cloud Kubernetes optimization plus GPU / OMNI Compute adjacency
- Strong customer and investor signaling around the 2025-2026 growth phase
Top risks
- Native cloud tools and broader FinOps suites can compress pricing power
- ARR, gross margin, NRR, and 2026 round economics remain undisclosed
- Product breadth and deep infrastructure access create execution and trust risk
Open gaps
- Current ARR and revenue growth are not publicly disclosed
- Exact amount and economics of the January 2026 strategic investment remain undisclosed
- Gross margin, NRR, burn, and customer concentration data are unavailable
Contents
01Company Overview
1.1 Identity and Platform Narrative
Cast AI should be understood as more than a dashboarding or cost-visibility vendor. Official materials consistently position the company as an Application Performance Automation platform that began with Kubernetes cost optimization and has expanded into broader infrastructure automation, AI workload efficiency, and cross-cloud GPU provisioning. The origin story is unusually coherent: the founders say Cast AI was created after the team struggled with runaway cloud bills while scaling Zenedge before Oracle acquired that business in 2018. Public company and investor materials anchor legal founding in 2019 and link the business model to automated rightsizing, autoscaling, bin packing, spot management, and continuous performance-aware infrastructure decisions. The January 2026 OMNI Compute launch pushed the narrative another step forward by framing Cast AI as a control plane for external compute and GPU capacity, not just a savings layer inside one Kubernetes cluster. That matters because it broadens the company from FinOps tooling into AI infrastructure orchestration while preserving the core promise of better performance with less manual work and no application rewrites.[CO001, CO002, CO003, CO004, CO005, CO006]
| Metric | Value / Status | Date | Confidence | Gap / Caveat |
|---|---|---|---|---|
| Founded | 2019 | 2019 | High | Corroborated by TechCrunch and Cota founder materials |
| Headquarters / operating model | Miami HQ with major Vilnius engineering center | 2026 | Medium | Corporate base and engineering center are clear, but exact legal-entity structure is not publicly detailed |
| 2025 financing | Series C: $108M at about $850M valuation | 2025-04-30 | High | Valuation comes from Reuters-syndicated reporting rather than a company filing |
| 2026 financing | Strategic investment from Pacific Alliance Ventures; valuation >$1B | 2026-01-12 | High | Investment amount and round mechanics were not disclosed publicly |
| Customers | 2,100+ organizations globally / over 2,000 companies | 2025-2026 | High | Official materials use both phrasings across adjacent disclosures |
| Named customer proof | Akamai, BMW, Cisco, FICO, HuggingFace, NielsenIQ, Swisscom, TGS, Samsung | 2025-2026 | Medium | Not every named logo has a standalone public case study |
| Workforce signal | ~200 employees to 300+ employees across 34 countries | 2025-2026 | Medium | Public headcount signals conflict by source and methodology |
| Official platform metrics | 6.46B CPUs provisioned; 372.4M nodes provisioned | 2026 | Medium | Marketing counters are current-site claims without independent audit |
| Value proposition | Roughly 40% waste reduction on site metrics; 50-80% savings in flagship case studies | 2025-2026 | Medium | Savings vary by use case and are partly company-reported |
Public customer count, workforce, and savings figures come from mixed official and independent sources; the table preserves the most supportable current range rather than forcing a single precise number.
[CO001, CO007, CO009, CO010, CO016, CO019]Cast AI links Kubernetes automation, AI workload orchestration, enterprise customers, and investor capital into one infrastructure-automation thesis.
[CO001, CO003, CO005, CO006, CO008, CO016]1.2 Founders, Leadership, and Geography
The founder bench is unusually stable and clearly identified in official and investor materials. Cast AI publicly lists Yuri Frayman as CEO, Leon Kuperman as CTO, and Laurent Gil as President, with all three described as co-founders. Third-party interview material adds useful context: the trio previously built Viewdle and Zenedge together, giving them a long operating history before launching Cast AI. The present-day executive bench appears broader than a simple founder-led startup, with leadership roles spanning finance, people operations, customer success, and global sales. Geography is equally important to the diligence narrative. Independent reporting consistently calls Cast AI Miami-based for corporate positioning, but multiple sources also describe Vilnius as a core engineering center and the source of the company's Lithuanian-unicorn identity. That mix is strategically useful for recruiting and narrative positioning, but public governance disclosure remains thin. Reviewed sources do not clearly disclose board composition, voting control, or protective investor rights, so founder influence looks strong but cannot be cleanly quantified from public evidence alone.[CO009, CO010, CO011, CO012, CO013, CO014]
| Person | Role | Background / context | Functional coverage | Key-person dependency |
|---|---|---|---|---|
| Yuri Frayman | CEO & Co-Founder | Repeat founder from Viewdle and Zenedge; public face of the capital and category narrative | Corporate strategy, fundraising, partner ecosystem, market narrative | High |
| Leon Kuperman | CTO & Co-Founder | Repeat founder and technical co-architect of the platform | Architecture, automation engine, product reliability, GPU orchestration | High |
| Laurent Gil | President & Co-Founder | Repeat founder with strong product and go-to-market voice | Product vision, strategic partnerships, category framing, field positioning | High |
| Ferréol Hoppenot | EVP Global Sales | Officially listed senior GTM leader | Enterprise sales execution and regional expansion | Medium |
| Pierre Liduena | Chief Financial Officer | Officially listed finance leader | Capital planning, budget discipline, disclosure readiness | Medium |
| Gabija Marganavičė | Chief People Officer | Officially listed people leader | Hiring, culture, and distributed-team scaling | Medium |
| Moti Gabay | EVP Customer Success | Officially listed post-sales leader | Implementation quality, renewal support, and enterprise adoption | Medium |
Founder roles are directly disclosed by Cast AI; wider executive biographies are thinner in public materials than titles and functions.
[CO001, CO012, CO013, CO014, CO015]1.3 Capital Base and Unicorn Milestone
The best-documented financing event in the public record is the April 2025 Series C. Official, Reuters-syndicated, and tech press sources align on the round size at $108M and on the lead investors: G2 Venture Partners and SoftBank Vision Fund 2, with Aglaé Ventures joining existing backers such as Hedosophia, Cota Capital, Vintage Investment Partners, Creandum, and Uncorrelated Ventures. Reuters reported that the round valued Cast AI at about $850M and pushed total disclosed capital above $180M. The January 2026 milestone is different. Cast AI and BusinessWire confirmed a strategic investment from Pacific Alliance Ventures, Shinsegae Group's U.S. corporate venture arm, and said the company's valuation had crossed $1B. But public materials did not disclose the amount of the investment, whether it was primary or mixed with secondaries, or how the financing affected total raised and preference-stack dynamics. The result is a credible unicorn milestone with incomplete cap-table transparency: the valuation event is real, the strategic investor is known, but the economics of the 2026 round remain materially under-disclosed.[CO016, CO017, CO018, CO019, CO020, CO021]
| Stakeholder | Role | Why it matters | Diligence ask |
|---|---|---|---|
| G2 Venture Partners | Series C co-lead | Validation from an infrastructure-focused growth investor | Confirm ownership level and governance rights after Series C |
| SoftBank Vision Fund 2 | Series C co-lead | Adds signaling power and AI-infrastructure network access | Clarify board seat or information-right package |
| Aglaé Ventures | New Series C investor | Adds luxury-family-office capital and headline validation | Assess follow-on appetite and ownership |
| Hedosophia | Existing investor | Longer-tenured cap-table participant | Reconstruct preference stack and pro-rata rights |
| Cota Capital | Existing investor and vocal supporter | Provides founder-history context and public sponsorship | Confirm current stake and any board influence |
| Vintage Investment Partners | Existing investor | Part of the multi-round growth syndicate | Check ownership and mark policy |
| Creandum | Existing investor | European VC signal and earlier-stage continuity | Confirm whether it still holds material influence |
| Uncorrelated Ventures | Existing investor | Part of the recurring backer set across rounds | Review pro-rata participation rights |
| Pacific Alliance Ventures / Shinsegae Group | 2026 strategic investor | Brings Asia market access and the unicorn step-up narrative | Disclose amount invested, round structure, and any commercial rights |
Public sources identify the participants in the best-known rounds but do not disclose the full current cap table, preference terms, board composition, or secondary mix.
[CO017, CO018, CO021, CO022, CO023]1.4 Traction, Milestones, and Risk Signals
Customer proof is one of the strongest parts of Cast AI's public file. Across official funding and launch materials, the company names Akamai, BMW, Cisco, FICO, HuggingFace, NielsenIQ, Swisscom, Samsung, and TGS, while case studies provide deeper evidence that the platform is used in production rather than only tested in pilots. NielsenIQ reported 60-80% savings on non-production clusters and payback within two months; project44 reported 50% compute-cost savings within one month on its initial rollout; and Branch described a path to lower EC2 spend without spot-related incidents. These studies reinforce management's claim that Cast AI increasingly sells reliability and automation, not only lower bills. Milestone quality is also decent: the company ties its 2025 Series C to Application Performance Automation, its 2026 launch to OMNI Compute, and its geographic expansion to India, Singapore, and additional regional offices. The risk side is subtler but real. Cybernews flagged setup complexity, reporting limitations, and higher pricing for smaller teams, while StatusGator showed that service-health incidents can still surface around Kubernetes node provisioning. Headcount and capital totals also remain imprecise in public sources.[CO007, CO008, CO021, CO028, CO030, CO031]
| Date | Event | Type | Amount / status | Participants | Implication |
|---|---|---|---|---|---|
| 2018 | Oracle acquires Zenedge, the founders' prior company | governance | Exit / predecessor event | Oracle; Frayman; Gil; Kuperman | Creates the origin story for Cast AI's cloud-cost problem statement |
| 2019 | Cast AI founded | founding | Company formation | Yuri Frayman; Laurent Gil; Leon Kuperman | Launches the company around Kubernetes automation and cloud efficiency |
| 2024 | AI Enabler launched for LLM deployment optimization | product | Launch | Cast AI | Extends the platform into model selection and GPU-heavy AI workloads |
| 2024 | Futuriom 50 / IDC / G2 recognition highlighted in company materials | scale | Recognition | Cast AI; Futuriom; IDC; G2 | Signals category visibility but not audited financial performance |
| 2025-04-30 | Series C closed | financing | $108M at roughly $850M valuation | G2 Venture Partners; SoftBank Vision Fund 2; Aglaé; existing investors | Provides capital to expand APA and pushes the company near unicorn status |
| 2025 | India and Singapore offices opened after Series C | scale | Expansion | Cast AI | Demonstrates push into high-growth markets |
| 2026-01-12 | Pacific Alliance Ventures strategic investment announced | financing | Amount undisclosed; valuation >$1B | PAV; Shinsegae Group | Confirms unicorn milestone while leaving round economics opaque |
| 2026-01-12 | OMNI Compute launched | product | Unified compute / GPU control plane | Cast AI; Oracle; customers such as Uniphore | Repositions Cast AI toward multi-cloud GPU orchestration |
| 2026-01 | Cast AI publicly framed as Lithuania's fifth unicorn | scale | Milestone | Lithuanian startup ecosystem media | Improves regional brand power and recruiting narrative |
| 2026-06-05 | StatusGator showed a partial outage involving Azure AKS node provisioning failures | adverse | Partial outage | StatusGator; Cast AI status feed | Shows that infrastructure-automation services still carry operational incident risk |
This is the best-supported public chronology across founding, financing, expansion, product, and adverse-service milestones through the 2026 run date; several dates are month- or year-level because official day-level dating is not consistently disclosed.
[CO002, CO016, CO019, CO021, CO022, CO024]Key public milestones from founder origin story through the unicorn milestone, OMNI Compute launch, and current operating risk signals.
[CO002, CO016, CO019, CO021, CO024, CO029]Current public metrics that most directly frame maturity, traction, and diligence opacity.
Headcount and savings data preserve public ranges and case-specific outcomes instead of implying one normalized company-wide benchmark.
[CO007, CO016, CO019, CO021, CO026, CO027]1.5 Exhibits
02Market Analysis
2.1 Market Boundary and Status-Quo Substitutes
The cleanest way to frame Cast AI’s market is as an overlap problem. The company is not simply a cloud-cost dashboard vendor, a generic observability tool, or a pure GPU cloud provider. Instead it sits at the intersection of cloud FinOps, Kubernetes cost management, and AI/GPU workload optimization. The broadest lens comes from CloudOps and cloud FinOps software, where vendors promise financial visibility, optimization, governance, and increasingly automated execution across hybrid and multi-cloud estates. The narrower and more direct lens is Kubernetes cost management, which focuses on rightsizing, autoscaling, cost allocation, and governance for containerized workloads. A third adjacency has become much more important as AI workloads scale: GPU allocation, hybrid infrastructure placement, and the economics of inference. Market boundary also depends on substitutes. Google’s GKE Autopilot, Microsoft’s AKS optimization guidance, Red Hat OpenShift cost management, IBM’s hybrid cloud optimization stack, and open-source Karpenter all cover pieces of the same problem. That means Cast AI’s real addressable market is not every dollar of cloud spend but the subset of organizations that want cross-platform optimization beyond native controls.[CM006, CM007, CM008, CM009, CM024, CM025]
| Segment / Category | Included Spend | Excluded Spend | Primary Buyer / Payer | Cast AI Relevance |
|---|---|---|---|---|
| Cloud FinOps / cloud financial management | Spend visibility, governance, optimization, chargeback, forecasting across cloud estates | Generic ERP spend management or non-technology procurement | CFO / CIO / FinOps leader | Broad outer boundary; relevant but too wide alone |
| Kubernetes cost management | Cluster cost allocation, rightsizing, autoscaling, showback, optimization for containerized workloads | Non-container application monitoring and generic observability budgets | Platform engineering, SRE, infrastructure | Core — closest direct market lens |
| AI / GPU workload optimization | GPU provisioning, workload placement, hybrid inference economics, utilization improvements | Standalone model training SaaS or chip manufacturing economics | AI infrastructure and platform teams | High-growth adjacency increasingly bundled into the same buying motion |
| Managed Kubernetes native controls | Autopilot / AKS / provider-native automation and billing controls | Third-party cross-cloud governance layers | Cloud platform team | Status-quo substitute, not full third-party TAM |
| Hybrid / multicloud cost governance | Cross-cloud visibility, tagging, policy, budget controls, showback | Broad data-center CapEx programs | FinOps, finance, central IT | Relevant because Cast competes when native tools fragment |
| Generic observability / APM | Telemetry, traces, performance monitoring | Cost optimization execution and financial governance | SRE / observability team | Adjacent but mostly excluded from Cast-specific SAM |
The boundary intentionally excludes raw public-cloud spend and generic IT software because Cast AI monetizes only the portion of spend that requires optimization, governance, or automation for Kubernetes-heavy and AI-intensive workloads.
[CM006, CM007, CM019, CM024, CM025, CM026]The buying motion spans platform teams, FinOps, finance, and AI infrastructure leaders, all reacting to the same waste-and-complexity loop.
[CM019, CM020, CM022, CM025, CM026, CM028]2.2 Sizing Lenses and Constrained TAM/SAM/SOM
Public market sizing for Cast AI’s space is directionally useful but definition-sensitive. IDC’s April 2026 keynote on intelligent CloudOps software provides the broadest software-market lens: $23.4B in 2024 growing to $45.0B by 2029 at a 14% CAGR. MarketsandMarkets offers a somewhat narrower cloud FinOps lens of $14.88B in 2025 growing to $26.91B by 2030, emphasizing cost management and optimization as the largest application and multi-cloud as the largest deployment environment. The most relevant direct lens comes from The Business Research Company, which sizes Kubernetes cost management at $1.75B in 2025, $2.23B in 2026, and $5.78B in 2030. Verified Market Reports and Business Research Insights publish even broader cloud cost management figures in the $9.2B-$11.01B 2026 range, again showing that boundary choices matter. The practical conclusion is that Cast AI’s true serviceable market is narrower than broad CloudOps and somewhat wider than strict Kubernetes cost management because AI/GPU optimization and hybrid-placement decisions increasingly belong in the same buying motion. Public evidence supports a range rather than a single precise TAM story.[CM001, CM002, CM003, CM004, CM005, CM006]
| Publisher | Year | Geography | Value / Range (USD B) | CAGR | Methodology | Confidence | Limitation |
|---|---|---|---|---|---|---|---|
| IDC Intelligent CloudOps Software | 2024-2029 | Global | 23.4 → 45.0 | 14.0% | Broad CloudOps software revenue forecast | High | Too broad for Cast AI; includes adjacent automation categories |
| MarketsandMarkets Cloud FinOps | 2025-2030 | Global | 14.88 → 26.91 | 12.6% | Cloud FinOps market forecast by capability and deployment model | Medium | Covers broader governance and services beyond Kubernetes-native automation |
| The Business Research Company Kubernetes Cost Management | 2025-2030 | Global | 1.75 → 2.23 → 5.78 | 26.9% to 27.1% | Direct market lens for Kubernetes cost management software and services | Medium | Narrower than Cast’s AI/GPU and hybrid optimization adjacency |
| Verified Market Reports CCMO | 2026-2034 | Global | 9.2 → 35.4 | 14.1% | Cloud cost management and optimization market snapshot | Low | Methodology is vendor-generated and likely broader than rigorous FinOps definitions |
| Business Research Insights CCMO | 2026-2035 | Global | 11.01 → 38.4 | 14.8% | Cloud cost optimization market forecast | Low | Definition overlaps with Verified and contains obvious copy-edit noise |
| Constrained Cast-relevant SAM (author estimate) | 2026 | Global | 2.0 → 4.0 | n/a | Anchored on Kubernetes cost management plus AI/GPU optimization adjacency | Low | Derived estimate; no public source sizes Cast’s exact overlap market |
| Constrained third-party SOM (author estimate) | 2026-2030 | Global | 0.3 → 0.8 | n/a | Assumes modest share of high-spend Kubernetes and AI-platform buyers | Low | Depends on native-tool penetration and conversion to third-party automation |
The table preserves multiple incompatible sizing lenses because public market research houses define the category differently; Cast AI should be evaluated using a constrained overlap market, not any single headline forecast.
[CM001, CM002, CM003, CM004, CM005, CM043]Cast AI’s opportunity narrows from broad cloud FinOps into Kubernetes-focused optimization and then into a third-party slice that also values AI/GPU automation.
[CM002, CM003, CM038, CM043, CM044]Public market estimates differ widely because research houses define the category from broad CloudOps software down to narrow Kubernetes cost management.
Midpoints for multi-year research-house estimates are analytical waypoints, not vendor-published annual values; the final row is an author range anchored to the narrower Kubernetes cost-management lens plus AI/GPU adjacency.
[CM001, CM002, CM003, CM043]2.3 Buyers, Users, and Budget Owners
FinOps and cloud cost optimization are inherently cross-functional, which matters because Cast AI’s buyer map is not a single budget line. The FinOps Foundation explicitly frames the discipline around shared ownership among engineering, finance, product, operations, procurement, and executives. Google’s cost-optimization guidance names CTOs, CIOs, CFOs, architects, developers, administrators, and operators as relevant stakeholders, while Microsoft describes FinOps as an operating bridge between financial management and cloud engineering. In practice, the hands-on users are usually platform engineering, SRE, DevOps, or AI infrastructure teams responsible for cluster behavior and resource efficiency. The paying centers are typically central cloud platforms, infrastructure engineering budgets, or executive-controlled optimization programs tied to finance and procurement. The CNCF Kubernetes FinOps microsurvey shows why these buyers care: for many organizations Kubernetes already consumes a meaningful share of cloud budget, from up to a quarter for half of respondents to above one million dollars per month for a large-spend cohort. That spending concentration makes even modest utilization gains material to finance and platform teams.[CM010, CM011, CM012, CM013, CM019, CM020]
| Segment | Primary Buyer | Primary User | Payer / Budget Owner | Workflow | Adoption Trigger | Why Cast AI Can Matter |
|---|---|---|---|---|---|---|
| Platform engineering / SRE | Head of platform engineering | SREs, DevOps, cluster operators | Engineering infrastructure budget | Rightsizing, node provisioning, showback, reliability optimization | Kubernetes spend outgrows manual tuning | Automation can improve efficiency without forcing developers to micromanage nodes |
| Central FinOps / cloud economics | FinOps lead or cloud economics manager | FinOps analysts and engineering partners | CFO / CIO shared governance budget | Chargeback, visibility, forecasting, optimization governance | Budget variance or board pressure on cloud spend | Cross-team accountability turns waste reduction into a finance-plus-engineering motion |
| AI infrastructure team | VP / director of platform or AI infrastructure | ML platform engineers, infra engineers | Executive AI budget or central platform budget | GPU allocation, placement, hybrid cost control | Inference bills and GPU scarcity spike | Cast AI’s OMNI and GPU optimization narrative directly fits this buyer |
| Regulated hybrid enterprise | CIO / CTO | Platform architects and security operators | Central IT and finance | Hybrid workload placement with cost and compliance guardrails | Data sovereignty, security, and cost pressure | Needs cross-environment optimization rather than single-cloud tooling |
| Mid-market Kubernetes adopter | Engineering manager | Small DevOps team | Engineering budget owner | Basic cost visibility and autoscaling | Unexpected monthly cloud spikes | May adopt only if third-party automation saves more than native tools |
| Procurement-backed optimization program | CFO / procurement | FinOps + engineering working group | Procurement and finance | Commitments, vendor rationalization, chargeback standards | Renewal cycle or vendor-consolidation push | Can favor vendors that unify optimization across providers |
Buyer mapping is synthesized from FinOps Foundation personas, cloud-provider cost documentation, and CNCF’s Kubernetes FinOps survey; actual ownership varies materially by company maturity and vertical.
[CM010, CM011, CM012, CM019, CM020, CM022]The market matures when organizations move from raw spend pressure to measurement, ownership, and finally automated optimization.
[CM014, CM019, CM021, CM022, CM037, CM041]2.4 Growth Drivers and Adoption Constraints
The demand case for Cast AI’s category is straightforward. CNCF survey data and Cast AI’s own benchmark work both show that overprovisioning remains structural rather than incidental, with utilization far below what finance teams expect and what cloud-native teams assume they are buying. AI compounds the problem: Deloitte argues that inference costs have collapsed on a per-unit basis while aggregate AI spending keeps rising because usage growth overwhelms efficiency gains. That pushes more organizations into serious workload-placement decisions across public cloud, private infrastructure, and edge environments, and it also increases the value of automation around GPU provisioning, region choice, and mixed-resource scheduling. At the same time, constraints are real. Kubernetes remains hard to operate, toolchains are fragmented, skills are scarce, and many enterprises can solve a meaningful percentage of their problem with native services such as GKE Autopilot, AKS autoscaling, Karpenter-based node provisioning, or Red Hat showback. As a result, adoption favors organizations with enough cloud complexity, enough spend, and enough operational pain that third-party automation meaningfully outperforms native controls.[CM014, CM015, CM016, CM017, CM018, CM030]
| Driver / Constraint | Direction | Timing | Implication | Diligence Ask |
|---|---|---|---|---|
| Structural overprovisioning in Kubernetes | Driver | Current / ongoing | Creates clear ROI for rightsizing and automation | Measure how much customer waste is still addressable versus already optimized |
| Multi-cloud and hybrid deployment growth | Driver | Current / ongoing | Increases complexity and weakens single-provider optimization strategies | Confirm whether buyers need cross-cloud visibility or mostly single-cloud controls |
| AI inference economics and GPU scarcity | Driver | Near-term 2026-2028 | Raises urgency for GPU placement, sharing, and hybrid compute choices | Test whether Cast’s GPU capabilities are production-grade or still mostly narrative |
| Chargeback / showback and FinOps maturity | Driver | Current / ongoing | Pushes buyers toward granular cost allocation and ownership models | Check how often Cast replaces spreadsheets or native billing exports |
| Native cloud tools improving | Constraint | Current / ongoing | Shrinks third-party urgency for lower-complexity accounts | Benchmark Cast against GKE Autopilot, AKS, Karpenter, and Red Hat workflows |
| Kubernetes complexity and skills shortage | Constraint | Current / ongoing | Can both create demand and slow implementation success | Quantify required onboarding effort and customer-success load |
| Fragmented billing and tagging data | Constraint | Current / ongoing | Makes FinOps adoption harder and can delay value realization | Assess how much data cleanup customers must do before Cast creates insight |
| Market-definition ambiguity | Constraint | Ongoing | Complicates TAM storytelling and valuation comparables | Use constrained overlap markets rather than vendor-sized headline categories |
The same factor can be both a growth driver and an adoption brake; complexity creates demand for automation but also raises deployment friction and elevates customer-success costs.
[CM008, CM009, CM014, CM015, CM017, CM018]2.5 Exhibits
03Competitors
3.1 Competitive Landscape and Solution Classes
The buyer does not face a single monolithic competitor set when evaluating Cast AI. Instead the market breaks into at least four credible solution classes that solve overlapping versions of the same job. First are automation-first commercial products, where Cast AI and Flexera Ocean both promise continuous infrastructure optimization, node selection, and cost savings through autonomous actions rather than reporting alone. Second are visibility-first FinOps and cost-allocation products such as IBM Kubecost and OpenCost, which help teams attribute spend, identify waste, and govern Kubernetes economics but often stop short of replacing the cluster control plane. Third are rightsizing specialists such as StormForge and Kubex, which focus on pod, node, and workload tuning with machine-learning assistance. Fourth are native or open-source substitutes like GKE Autopilot, Karpenter, and AKS node auto-provisioning. This framing matters because Cast AI is rarely competing only against one startup. It is usually competing against a buyer's willingness to combine native tooling, open source, and process change instead of paying for a dedicated cross-cloud optimization platform.[CP001, CP002, CP004, CP006, CP008, CP010]
| Competitor / class | Category | Scale / ownership signal | Target segment | Differentiation | Limitation |
|---|---|---|---|---|---|
| Cast AI | Automation-first Kubernetes optimization | Private unicorn after January 2026 strategic round | Mid-to-large enterprises running Kubernetes across AWS, Azure, GCP, and adjacent AI workloads | Cross-cloud control layer spanning monitoring, optimization, autoscaling, spot automation, and newer GPU positioning | Pricing is sales-led and public reviewer evidence still flags onboarding and reporting friction |
| Flexera Ocean / former Spot | Automation-first FinOps and container optimization | Spot portfolio now owned by Flexera | Enterprises and MSPs seeking automated workload-cost reduction inside a broader FinOps suite | AI/ML-driven container optimization tied to a wider cloud-financial-management portfolio | Less clearly positioned as a standalone cross-cloud Kubernetes control plane than Cast AI |
| IBM Kubecost | Visibility and governance | Acquired by IBM in 2024 and distributed via Apptio | FinOps, platform, and engineering teams needing attribution and governance | Strong allocation, showback, governance, and fast deployment | Primary value proposition is visibility first; autonomous infrastructure control is less central |
| StormForge | Rightsizing specialist | CloudBolt-owned optimization product | Teams that want guardrailed workload-level tuning | Autonomous vertical rightsizing that works with HPA and GitOps-friendly workflows | Narrower than a full multicloud optimization suite |
| Kubex | Rightsizing plus AI / GPU optimization | Rebranded Densify / private enterprise software vendor | Large enterprises optimizing Kubernetes, nodes, and GPU-heavy workloads | Predictive pod, node, and pre-warming optimization with policy guardrails | Enterprise-heavy positioning and less evidence of broad self-serve adoption |
| Native cloud tools | Provider-native substitute | Built into AWS, Google Cloud, and Azure platform contracts | Single-cloud platform teams | Can be included or preconfigured inside the cloud stack with strong native integration | Usually single-cloud and fragmented across providers |
| OpenCost / internal build | Open-source and process substitute | Vendor-neutral open source plus in-house engineering effort | Cost-conscious teams with sufficient platform talent | Low-cost visibility floor and flexible internal composition | Does not provide one-click autonomous execution out of the box |
Rows cover the main direct, adjacent, and substitute classes a buyer can realistically consider as of the 2026 run date; ownership signals are used where current financing detail is not public.
[CP004, CP005, CP006, CP007, CP008, CP010]Evidence-backed ordinal map of the main solution classes by automation depth and scope breadth.
Axes are ordinal judgments derived from reviewed capability pages and substitute coverage, not source-reported market scores.
[CP027, CP028, CP029, CP030, CP031, CP035]3.2 Direct Vendor Profiles and Buyer Tradeoffs
Cast AI's strongest direct overlap is with vendors that promise an operational outcome, not just a dashboard. Flexera Ocean sits closest on that dimension because it markets AI- and ML-driven Kubernetes infrastructure optimization, scaling, and cost control as a managed product inside a broader FinOps suite. IBM Kubecost is a major alternative for buyers who prioritize allocation, governance, and internal showback over autonomous execution. StormForge and Kubex overlap in a narrower way: both emphasize rightsizing and workload-level efficiency, which can reduce waste materially, but neither source set presents them as replacing multi-cloud cluster operations as fully as Cast AI tries to. IBM Turbonomic enters from above, offering application resource management across hybrid and multicloud infrastructure rather than only Kubernetes economics. The result is a buyer tradeoff matrix rather than a single winner-take-all market. Buyers leaning toward finance visibility and broad enterprise suites may prefer IBM-led options, while teams optimizing engineering automation and spot-led savings are more likely to compare Cast AI against Flexera Ocean and selective native-cloud substitutes.[CP003, CP004, CP005, CP006, CP007, CP008]
| Buying criterion | Cast AI | IBM Kubecost | Flexera Ocean | StormForge / Kubex | Native cloud + OpenCost |
|---|---|---|---|---|---|
| Real-time cost allocation | Yes | Yes, core strength | Partial | Partial | OpenCost yes; native cloud varies |
| Autonomous node provisioning | Yes | Limited / secondary | Yes | Limited | Yes in GKE, AKS NAP, or Karpenter contexts |
| Workload rightsizing | Yes | Recommendation-oriented | Yes | Core strength | Partial |
| Spot / cheaper-capacity orchestration | Yes | Not core | Yes | Not core | Provider specific |
| Multicloud control surface | Yes | Yes for visibility | Broader FinOps suite, but container tooling position varies | Yes but enterprise-policy oriented | No, usually provider specific |
| GPU / AI optimization narrative | Yes, increasingly explicit | Not central in reviewed pages | AI budget pressure mentioned at suite level | Kubex explicit; StormForge less explicit | Usually separate service families |
Cells reflect only capabilities evidenced on reviewed product, documentation, or comparison pages; where the source set was ambiguous, the cell is marked partial or limited rather than inferred as full support.
[CP002, CP004, CP006, CP008, CP010, CP014]Capability lens showing where the market separates into visibility, execution, and native-cloud substitutes.
[CP023, CP024, CP030, CP031, CP032, CP033]3.3 Native Cloud, Open-Source, and Switching-Cost Dynamics
The most serious displacement risk does not come only from venture-backed peers. It comes from the fact that hyperscalers and open-source projects now solve large chunks of the problem directly inside the stack. Karpenter already gives AWS-oriented teams open-source just-in-time node provisioning and cost-aware consolidation logic. Google markets GKE Autopilot as a mode where Google manages nodes, scaling, security, and infrastructure choices, while also describing cost-optimization features as included in GKE pricing. Microsoft is pushing the same direction by making AKS node auto-provisioning a Karpenter-based native capability and keeping the classic cluster autoscaler available for lighter use cases. OpenCost establishes a low-cost floor under cost allocation and showback, and the FinOps Foundation reinforces that usage optimization can be treated as a broad internal operating practice rather than a purchased product. These facts lower switching costs and encourage multi-homing. A buyer can combine visibility from Kubecost or OpenCost with native provisioning from Karpenter, GKE, or AKS instead of standardizing fully on Cast AI.[CP012, CP013, CP014, CP015, CP016, CP017]
| Vendor / class | Pricing signal | Contract model | Included capabilities | Unknowns / discounting | Implication |
|---|---|---|---|---|---|
| Cast AI | Free monitoring entry point plus sales-led paid automation | Usage-based / negotiated enterprise contract | Monitoring, optimization, autoscaling, spot automation, multicloud operations | Realized enterprise pricing and percentage-of-savings terms are not public | Strong ROI story for large clusters; harder for SMB buyers to underwrite |
| IBM Kubecost | Free to install and free tier messaging is explicit | Freemium to enterprise subscription inside IBM / Apptio motion | Cost allocation, governance, visibility, optimization recommendations | Discounting and bundle terms are private | Appeals to finance visibility buyers with lower initial commitment |
| Flexera Ocean | No public list price on reviewed pages | Enterprise FinOps suite / negotiated | Container optimization, cost visibility, AI/ML automation, partner ecosystem | Seat, cluster, or savings-share economics not public | Procurement usually rides a wider FinOps suite sale |
| StormForge | Free trial / demo oriented | Enterprise software sale | Autonomous rightsizing, HPA alignment, guardrails, GitOps compatibility | Public list pricing not visible | Most attractive where buyers want narrow workload efficiency first |
| Kubex | No public list price on reviewed product pages | Enterprise software / policy-driven automation sale | Pod, node, pre-warming, and GPU-aware optimization | Realized pricing and deployment minimums are unclear | Likely strongest in large regulated or AI-heavy estates |
| Native cloud + OpenCost | Often included or open source | Cloud consumption plus internal engineering effort | Provisioning, autoscaling, and visibility building blocks | Hidden cost is people time and fragmented tooling | Sets the price floor under third-party vendors |
Public pricing transparency is limited across the category, so the table distinguishes explicit free or included signals from unknown negotiated enterprise economics instead of implying comparability that the sources do not support.
[CP003, CP015, CP021, CP022, CP026, CP031]3.4 Moat Durability and Competitive Risks
Cast AI still has a credible wedge, but the moat is conditional rather than absolute. Its best-supported differentiation is packaging multiple value levers into one control layer: cross-cloud support, optimization recommendations, autonomous scaling, spot or capacity orchestration, and an increasingly explicit GPU or AI-efficiency narrative. That broader product story, combined with fresh capital and unicorn status, gives Cast AI more room to invest than a smaller point tool. But the durability question is whether these features remain uniquely valuable once native clouds improve, OpenCost keeps visibility commoditized, and enterprise suites like IBM and Flexera bundle adjacent FinOps capabilities into wider contracts. The adverse evidence matters here. Cybernews and competitor-authored comparisons both suggest that onboarding clarity, IAM complexity, and affordability for smaller teams remain points of friction. That means Cast AI's competitive position is strongest in complex multi-cloud estates or GPU-heavy Kubernetes environments where native tools are fragmented. It is weakest when the customer is single-cloud, price sensitive, or already standardized on a broad enterprise FinOps vendor.[CP021, CP022, CP030, CP031, CP034, CP035]
| Moat claim | Primary threat | Severity | Current evidence | Mitigation / diligence ask |
|---|---|---|---|---|
| Cross-cloud automation is harder to replicate than single-cloud tooling | Hyperscalers keep adding native node provisioning and cost controls | High | GKE Autopilot and AKS NAP already automate meaningful parts of the workflow | Ask for win rates against native tools in single-cloud accounts |
| One platform can unify visibility and execution | OpenCost plus native cloud tooling can be assembled modularly | High | OpenCost, Karpenter, and AKS/GKE features lower switching costs | Request proof that unified execution materially outperforms modular alternatives |
| GPU / AI optimization creates new differentiation | Broad FinOps suites may bundle AI budget controls faster than Cast scales distribution | Medium | Flexera and IBM both market broader FinOps expansion around AI-era cloud budgets | Verify current revenue and customer adoption from GPU-related modules |
| Fresh capital improves product velocity | Large incumbents have wider enterprise channels and contract leverage | Medium | IBM, Flexera, and Turbonomic all sit inside broader enterprise motions | Test whether Cast still wins when bundled into wider FinOps RFPs |
| Reviewer love signals product-market fit | Onboarding complexity and smaller-team pricing hurt expansion at the low end | Medium | Cybernews and competitor-authored comparisons both cite setup or cost concerns | Ask for gross retention by customer-size band and implementation timeline data |
Severity reflects competitive risk to selling and renewal quality rather than existential risk to category demand; the table focuses on the threats that most directly compress Cast AI pricing power or win rates.
[CP030, CP031, CP032, CP034, CP037, CP038]Compact scorecard of the traits that currently strengthen or weaken Cast AI’s defensibility.
Values are qualitative judgments synthesized from reviewed source evidence rather than reported third-party benchmark scores.
[CP034, CP035, CP036, CP038, CP040]3.5 Exhibits
04Financials
4.1 Revenue Model and Pricing Signals
The public evidence points to a software-first land-and-expand model rather than a services-heavy business. Cast AI's documentation centers on cost monitoring, optimization recommendations, autoscaling, bin packing, and spot automation delivered through one platform. Review surfaces add the clearest monetization clues: G2 shows a free Kubernetes cost monitoring tier, while Software Advice lists a starting price of $200 per month and describes automation features such as autoscaling, rightsizing, and spot instance management. Cast's own pricing page is notably sales-led and does not publish a transparent enterprise rate card, which implies meaningful contract-level variation by cluster size, module mix, and support needs. This is consistent with an ROI-based enterprise sale in which the product proves savings first and monetizes deeper automation after trust is established. The emerging OMNI Compute and GPU-control-plane narrative also suggests that the company may be broadening beyond classic optimization subscription revenue, but the public file does not disclose whether newer AI or GPU features are monetized separately, bundled into the platform, or tied to external compute marketplace economics.[CI001, CI002, CI003, CI004, CI005, CI006]
| Revenue stream | Mechanism | Unit | Current value / status | Quality | Diligence ask |
|---|---|---|---|---|---|
| Free monitoring land motion | Free tier used to generate usage data, savings reports, and product adoption | Free / lead-gen | Publicly visible on G2 and pricing surfaces | High | Quantify conversion from free monitoring to paid automation |
| Core optimization automation | Paid platform for autoscaling, rightsizing, bin packing, and spot automation | Likely subscription or usage-based | Clearly core to the product, but realized contract terms are not public | Medium | Provide actual pricing cards and contract archetypes |
| AI / GPU optimization and OMNI Compute | Incremental monetization from GPU and external-capacity control plane | Unknown | Strategically important in 2026, but monetization design is undisclosed | Low | Break out attach rate and revenue contribution from AI/GPU modules |
| Enterprise onboarding / support | Implementation, premium support, and admin features for larger buyers | Service or add-on fee | Enterprise support is implied by review pages but not priced publicly | Low | Clarify what onboarding and support are included versus separately billed |
| Partner / ecosystem motion | Cloud or strategic-partner influenced selling and co-marketing | Unknown | Partnership evidence is visible, direct channel economics are not | Low | Disclose referral or marketplace contribution to pipeline and bookings |
Public evidence supports the existence of these monetization layers, but only the free entry point and entry paid pricing signal are directly visible; most realized economics remain undisclosed.
[CI001, CI002, CI003, CI005, CI007, CI009]| Source / plan signal | Price / unit / contract | List vs realized | Included capabilities | Discounts / unknowns | Implication |
|---|---|---|---|---|---|
| G2 product page | Free Kubernetes cost monitoring | List signal only | Monitoring and initial savings visibility | No paid contract detail | Supports low-friction top-of-funnel motion |
| Software Advice listing | Starting at $200 per month | Third-party list signal | Automation, autoscaling, rightsizing, bin packing, spot automation | May not reflect current enterprise pricing or module mix | Indicates paid entry can start small relative to enterprise cloud budgets |
| Cast pricing page | Sales-led / contact-oriented | List pricing not disclosed | Broader platform and automation positioning | No public enterprise rate card | Pricing likely varies by cluster scale and features |
| Customer savings cases | Value framed as 50-80%+ savings or millions annually | Outcome proxy, not price | Cloud cost reduction and operational efficiency | Savings are customer-specific and partly company-reported | Suggests ROI-led pricing conversations |
| OMNI Compute / GPU launch | New monetization likely adjacent to core platform | Unknown | GPU and external capacity orchestration | No public SKU, fee, or take-rate disclosure | Potentially changes revenue mix and gross-margin profile |
The table distinguishes public price signals from customer ROI outcomes so that value proof is not mistaken for realized revenue or margin.
[CI002, CI003, CI004, CI013, CI014, CI027]Public evidence suggests Cast AI monetizes by converting free monitoring and savings proof into paid automation and expansion into AI / GPU modules.
[CI001, CI002, CI003, CI005, CI007, CI009]4.2 Traction and Sales-Efficiency Proxies
Because Cast AI does not disclose ARR or cohort metrics, the best public traction evidence comes from customer count, logo quality, third-party growth commentary, and time-to-value case studies. The strongest signals are unusually concrete for a private infrastructure company. Public materials tied to the Series C say Cast doubled its customer base between 2023 and 2024 and reached more than 2,100 organizations. Reuters-linked coverage said total funding exceeded $180 million after the 2025 round and described sharply rising demand as AI adoption increased Kubernetes automation needs. Customer outcomes reinforce that the value proposition is not hypothetical: NielsenIQ reported savings of up to 80 percent, project44 reported 50 percent savings on GKE in one month, and Branch highlighted several million dollars of annual AWS savings. Those are customer-finance outcomes, not just technical benchmarks. G2-based recognition and a large review base also suggest that Cast has enough installed base and usage breadth to support efficient proof-driven selling. Still, the public file does not expose win rates, CAC payback, average contract value, or net retention, so sales efficiency remains a proxy judgment rather than a measured fact.[CI011, CI012, CI013, CI014, CI015, CI016]
| Metric | Value / public proxy | Confidence | Why it matters | Diligence ask |
|---|---|---|---|---|
| ARR / revenue run rate | Low | Core scale indicator for any late-stage software business | Provide current ARR, trailing-12-month revenue, and growth by module | |
| Average contract value | Low | Needed to interpret sales efficiency and segment fit | Share ACV / median deal size by SMB, mid-market, and enterprise | |
| Gross margin | Low | Critical to judge whether automation and GPU features behave like software or services | Provide GAAP and non-GAAP gross margin plus module-level mix | |
| CAC payback | Low | Determines capital efficiency of go-to-market spend | Provide sales and marketing spend, new ARR, and payback calculation | |
| Net revenue retention | Low | Tests whether savings products expand naturally inside customers | Provide NRR, gross retention, and upsell drivers | |
| Time to initial customer value | project44: 50% savings in one month; NielsenIQ: large savings quickly evidenced | Medium | Fast time-to-value can improve close rates and payback | Quantify median time from pilot to savings realization across recent cohorts |
| Pricing leverage vs customer savings | Savings framed as 50-80% or several million dollars annually | Medium | ROI framing can support strong pricing power even without public list pricing | Show realized price as a share of verified customer savings |
Nulls reflect genuine public-data gaps, not author omission; the only usable public proxies are customer savings outcomes and initial entry-price signals from review marketplaces.
[CI012, CI013, CI014, CI023, CI024, CI026]The public unit-economics story is inferred from customer savings outcomes rather than disclosed company metrics.
The bridge is conceptual because Cast AI does not disclose its realized pricing take rate, support burden, or gross margin.
[CI012, CI013, CI014, CI027, CI032, CI033]4.3 Capital Adequacy and Cost Structure
Capital adequacy appears comfortably positive in the near term, but with major blind spots. Cast closed an oversubscribed $108 million Series C in April 2025, and Reuters-linked coverage said total funding exceeded $180 million after that event. In January 2026 the company announced a strategic investment from Pacific Alliance Ventures and said valuation had crossed $1 billion, which indicates a fresh balance-sheet step-up and continued investor confidence. TechCrunch said the Series C proceeds were earmarked for more research and development plus geographic expansion, while investor commentary linked the round to rapid revenue growth and surging demand. What remains unknown is equally important. The public record does not reveal cash on hand, monthly burn, runway, debt obligations, or the exact size of the 2026 investment. It also leaves open whether OMNI Compute or broader GPU access changes Cast's cost structure relative to a pure control-plane SaaS model. Compared with public comparables such as IBM, Datadog, and NetApp—each of which files full 10-K statements with the SEC—Cast offers almost no direct margin or cash-flow visibility. That opacity is the main reason the financial chapter cannot move from directional confidence to full underwriting confidence.[CI017, CI018, CI019, CI020, CI021, CI022]
| Capital item | Public value / status | Confidence | Why it matters | Diligence ask |
|---|---|---|---|---|
| 2025 Series C | $108M oversubscribed round | High | Meaningful fresh capital for R&D and go-to-market | Confirm post-close cash balance and investor ownership |
| Total funding after Series C | Over $180M | High | Establishes scale of cumulative equity support before the 2026 strategic round | Reconcile total capital raised after the 2026 event |
| 2026 strategic investment | Announced; valuation >$1B; amount undisclosed | High | Improves financing flexibility but leaves dilution and runway opaque | Disclose amount invested, security type, and cash proceeds to the balance sheet |
| Planned use of funds | R&D plus expansion in core markets | Medium | Indicates management priorities and expected spending areas | Provide detailed budget allocation by product, GTM, and geography |
| Debt / credit facilities | Low | Debt obligations can alter runway and downside risk | Confirm whether any venture debt, cloud commitments, or financing obligations exist | |
| Monthly burn | Low | Needed to convert financing into runway | Provide current net burn and expected burn after GPU / OMNI investments | |
| Runway months | Low | Core capital-adequacy measure for late-stage private companies | Provide runway under base and downside plan assumptions |
Capital adequacy is directionally favorable because disclosed equity funding is large, but cash, burn, and exact 2026 proceeds are absent from the public record.
[CI017, CI018, CI019, CI020, CI021, CI022]Public company-value signals stepped sharply upward from late 2023 through the January 2026 unicorn milestone.
The 2025 midpoint is an analytical bridge between Reuters-linked ~$850M reporting and TechCrunch’s near-$900M framing; the 2026 item is a floor because public materials say only that valuation exceeded $1B.
[CI018, CI019, CI021, CI022, CI034]Equity funding appears to finance product expansion, GTM scaling, and AI / GPU adjacency, but cash conversion remains opaque.
[CI019, CI020, CI021, CI022, CI025, CI034]4.4 Financial Verdict and Diligence Blockers
The financial verdict is positive but incomplete. Cast AI looks like a growth-stage infrastructure software company with genuine proof of customer value, a credible free-to-paid funnel, meaningful enterprise traction, and enough capital to keep investing through the current cycle. Those are real strengths. But nearly every metric required for disciplined underwriting is still missing from the public file: ARR, GAAP revenue growth, gross margin, burn, runway, NRR, churn, customer concentration, and the economics of GPU or marketplace-style products. The result is that Cast can be judged as commercially promising yet still financially under-disclosed. From an investor or acquirer perspective, the next diligence step is not more headline funding data; it is contract-level economics. Specifically, one needs to see whether pricing is subscription, percentage-of-savings, or hybrid; whether gross margin compresses as GPU or external-capacity services expand; and whether rapid customer-count growth is converting into healthy retention and efficient sales payback. Until those questions are answered, the company merits a medium-confidence financial assessment: solid capital support and attractive customer ROI, but insufficient transparency on the underlying engine that converts those outcomes into durable software economics.[CI023, CI024, CI025, CI026, CI031, CI032]
| Missing metric | Impact on underwriting | Exact diligence path |
|---|---|---|
| ARR and revenue growth | Prevents direct valuation-multiple or Rule-of-40 underwriting | Request board deck or monthly KPI pack with current ARR, revenue, and growth bridge |
| Gross margin by product line | Blocks judgment on whether GPU / marketplace features dilute software economics | Request P&L by core automation, AI/GPU, support, and any marketplace components |
| Burn and runway | Makes capital-adequacy assessment incomplete despite large financing rounds | Request cash balance, burn trend, and 12-24 month operating plan |
| NRR and churn | Obscures whether savings products naturally expand or are vulnerable to replacement by native tools | Request cohort retention table and downgrade reasons |
| Customer concentration | Important for enterprise software resilience and GTM efficiency | Request top-10 customer revenue share and vertical concentration |
| Contract structure | Without subscription vs savings-share mix, revenue quality cannot be judged cleanly | Sample recent contracts and summarize pricing mechanics |
| GPU / OMNI monetization mix | New products could drive upside or margin complexity, but public disclosures are silent | Break out pipeline, bookings, and revenue contribution for GPU-related modules |
These are the minimum missing fields needed to move from medium-confidence directional analysis to an investable financial model.
[CI023, CI024, CI025, CI026, CI031, CI033]4.5 Exhibits
05Product & Technology
5.1 Platform Definition and Module Map
Cast AI's own materials consistently describe more than a dashboard or recommendation engine. The product starts from Kubernetes cost monitoring and optimization suggestions, but the real center of gravity is execution: autoscaling, node selection, rightsizing, Spot orchestration, bin packing, and policy controls that alter how clusters actually run. The module map now extends beyond that original core. Cluster hibernation lets non-production environments scale to zero while keeping the control plane intact. OMNI adds a multicloud or cross-region compute control layer for scarce GPU and compute capacity. GPU Optimization for AI Infrastructure adds workload partitioning, Dynamic Resource Allocation, and throughput optimization for expensive accelerators. Security broadens the surface again through Kvisor, which Cast documents as an open-source security agent for runtime monitoring, image scanning, and network observability. The result is a stack with at least five meaningful product layers: monitoring, autoscaling, cost optimization, AI/GPU orchestration, and security or compliance tooling. That breadth is exactly what differentiates Cast AI from point tools but also raises complexity, implementation scope, and documentation demands.[CE001, CE002, CE005, CE006, CE008, CE009]
| Module / asset | User problem | Technical mechanism | Evidence of maturity | Current status |
|---|---|---|---|---|
| Core autoscaler / optimization | Overprovisioned clusters and inefficient node mix | Autoscaling, rightsizing, spot automation, bin packing, policy controls | Docs, Terraform resource, AWS Marketplace, multiple customer cases | Mature core product |
| Cluster hibernation | Non-production clusters incur unnecessary 24/7 compute spend | Scale-to-zero while preserving control plane and resuming critical components first | Dedicated documentation with manual, scheduled, API, and Terraform workflows | Shipped / documented |
| OMNI Compute | GPU scarcity and multicloud / cross-region capacity fragmentation | Extend clusters to other regions and clouds; autoscaler compares price and availability | Docs, launch press, and external news coverage | Early access |
| GPU optimization / AI infrastructure | Low GPU utilization and expensive AI workloads | GPU sharing, partitioning, Dynamic Resource Allocation, bin packing | GPU product page, benchmark report, ALLEN Digital case study | Active growth area |
| Kvisor security | Need runtime security, vulnerability scanning, and compliance | Open-source agent plus dashboard, scans, and network observability | Security docs, CIS certification press, GitHub repo | Live but undergoing changes |
Rows reflect the major product surfaces visible in reviewed public materials as of the 2026 run date; status labels capture the maturity signals those materials explicitly provide.
[CE001, CE002, CE005, CE006, CE008, CE010]Cast AI sits as an execution layer between workload demand, cluster state, cloud-provider capacity, and security visibility.
[CE001, CE006, CE008, CE012, CE026, CE034]5.2 Architecture and Operating Model
The technical operating model is best read as a continuous control loop around existing Kubernetes clusters. In the core product, Cast AI agents and policies monitor demand, compare pricing and capacity, and then drive node-level and workload-level actions such as rightsizing, autoscaling, or smart eviction. The Mercedes-Benz.io engineering write-up gives unusually concrete outside evidence of how this works in production: the team moved from static node autoscaling to dynamic workload-aware autoscaling, runtime bin packing, and smart eviction under zero-downtime constraints. Hibernation shows another part of the operating model, where essential components are deliberately brought back first via resume nodes before normal workloads return. OMNI extends the architecture beyond one region or provider by letting the autoscaler evaluate external locations and scarce GPU capacity across clouds. GPU Optimization then layers on partitioning, sharing, and placement logic for accelerators. This makes the platform technically ambitious and operationally valuable, but it also means Cast is deeply dependent on provider APIs, permissions, scheduler behavior, capacity signals, and correct orchestration of its own critical components.[CE002, CE003, CE004, CE006, CE007, CE008]
| Use case | Primary user | Workflow trigger | Cast AI action | Outcome |
|---|---|---|---|---|
| Production Kubernetes cost optimization | Platform engineering / SRE | Persistent overprovisioning or volatile traffic | Autoscaler, rightsizing, node selection, smart eviction | Lower cost with maintained reliability |
| Development / staging shutdown | Platform engineering | Known idle windows outside business hours | Cluster hibernation to zero nodes | Compute spend reduced to control-plane floor |
| Cross-cloud GPU acquisition | AI infrastructure team | Primary region has no affordable GPU capacity | OMNI extends cluster to new region or provider | AI jobs keep running without refactoring |
| Security hardening and compliance review | Security / platform team | Need posture and vulnerability visibility | Kvisor scans, dashboard insights, CIS-aligned controls | Higher security visibility and audit readiness |
| IaC-driven cluster policy management | Platform engineer / DevOps | Need repeatable autoscaler settings across clusters | Terraform resource and Helm configuration | Policy changes move into standard platform workflows |
The workflows focus on customer-facing operating motions described in docs, marketplace pages, and engineering case studies instead of generic feature labels.
[CE002, CE003, CE006, CE012, CE015, CE017]| Architecture layer | Role | Key dependency | Observed risk | Why it matters |
|---|---|---|---|---|
| Agents and controllers | Collect cluster state and apply automation logic | Correct cluster permissions and agent scheduling | Permission misconfiguration or failed critical-component resume | Without agents, Cast cannot execute savings actions |
| Autoscaler policy engine | Turns demand and price signals into node actions | Pricing data, workload metadata, scheduler constraints | Bad policy tuning can hurt reliability | This is the core control loop |
| Smart eviction / bin packing | Rebalances workloads onto fewer or better nodes | Pod disruption behavior and runtime safety | Operational risk during rebalance | Key to delivering higher utilization |
| OMNI multicloud extension | Finds external regions and providers for scarce capacity | Cloud-provider capacity and cross-cloud connectivity | Early-access change risk and GPU availability volatility | Critical for AI / GPU differentiation |
| Security layer (Kvisor) | Adds scans, runtime monitoring, and compliance views | Helm / console deployment and dashboard integration | Docs warn the feature set is still changing | Important for enterprise trust and platform breadth |
The architecture table captures the main operating layers visible in the technical docs and practitioner write-ups, emphasizing dependencies and failure modes rather than claiming secret internal architecture.
[CE003, CE004, CE005, CE006, CE015, CE019]The operator workflow runs from onboarding and policy setup into continuous optimization and optional AI / GPU expansion.
[CE002, CE003, CE011, CE014, CE017, CE018]The product depends on correct permissions, cloud-provider signals, cluster scheduling behavior, and scarce GPU capacity.
[CE003, CE004, CE006, CE015, CE019, CE035]5.3 Deployment, Integrations, and Developer Signal
The developer signal around Cast AI is stronger than for many private infrastructure startups because the product is exposed through infrastructure-as-code and public operator examples, not only through marketing copy. Cast publishes a Terraform provider, GitHub documentation for the autoscaler resource, Helm-based security configuration, and multiple product docs that assume platform teams will automate policy and cluster settings. The Terraform resource itself exposes cluster limits, node-downscaler settings, evictor behavior, and timing controls, which is a strong sign that the product is intended to be tuned as part of a larger platform engineering workflow. External developer-style surfaces reinforce that point. A Dev.to step-by-step EKS integration guide shows the product is concrete enough to be implemented by practitioners outside the company. The AWS Marketplace listing adds review-like operator feedback on usability, monthly cloud savings, and cluster policy control. Case studies from Akamai, project44, Branch, and ALLEN Digital then show that the platform is being used on real systems with demanding performance, AI, or SLA requirements. This gives Cast more technical credibility than a startup with only slideware, even if some sources are vendor-authored.[CE011, CE012, CE014, CE015, CE016, CE017]
| Control area | Public evidence | Mechanism | Confidence | Gap or caveat |
|---|---|---|---|---|
| Runtime security | Kvisor overview | Open-source agent scanning images and runtime behavior | Medium | Security docs say the feature set is changing |
| Configuration and scan control | Kvisor configuration docs | Helm-based settings for intervals, scans, and features | High | Operational burden still sits with the platform team |
| Security posture dashboard | Security dashboard docs | Centralized posture and CIS-compliance visibility | Medium | Docs describe capability, not customer-specific effectiveness |
| CIS benchmark trust signal | CIS certification press release | Security Report certified against CIS Kubernetes benchmarks | Medium | Certification is useful, but not a substitute for live customer audit evidence |
| Open-source transparency | GitHub Kvisor repository | Public repo and Apache 2.0 license | Medium | Open source alone does not guarantee maturity or support quality |
| Operational reliability | StatusGator and IsDown | Public incident reporting and outage aggregation | Medium | Aggregator snapshots do not replace detailed root-cause reporting |
This table records the trust controls publicly evidenced for the product surface and also preserves the explicit caveat that some security features are mid-transition in the docs.
[CE025, CE026, CE027, CE028, CE029, CE030]Capability maturity is uneven: core autoscaling is mature, OMNI is early access, and security is live but in transition.
[CE005, CE009, CE010, CE021, CE026, CE030]5.4 Trust, Security, and Technology Risks
Trust and quality controls are real but unevenly mature in the public file. On the positive side, Cast documents Kvisor as an open-source security agent that can scan images, monitor runtime behavior, observe network activity, and expose centralized compliance views in the security dashboard. Cast also publicized CIS Benchmark certification for its Security Report across major managed-Kubernetes environments, which is a meaningful trust signal for regulated or enterprise buyers. At the same time, the documentation repeatedly warns that the Kubernetes security feature set is undergoing significant changes and that some features are being deprecated or moved in the console. That warning is valuable because it reveals product evolution honestly, but it also highlights transition risk. Reliability signals are similar. StatusGator and IsDown show that the platform has public incident history, and several 2026 signals point to short cloud-provider-related degradations rather than perfect invisibility. Together, these sources suggest the technology is production-worthy and improving, but not frictionless. Cast's moat is technical depth; its risk is that the same depth requires careful permissions, documentation, and incident handling to keep customer trust high.[CE020, CE025, CE026, CE027, CE028, CE029]
| Capability | Latest public release signal | Stage | Strategic implication | Diligence ask |
|---|---|---|---|---|
| Core autoscaler and cluster optimization | Continuing case-study and marketplace evidence in 2026 | Production / mature | Main commercial engine appears battle-tested | Request uptime, rollback, and adoption statistics by cloud |
| OMNI Compute | 2026 launch plus docs marked early access | Early access | Potentially strongest new moat in AI / GPU era | Clarify GA availability timeline and production design partners |
| GPU optimization | Dedicated 2026 product page and GPU utilization benchmark report | Expansion stage | Meaningful adjacency to AI infrastructure budget | Disclose attach rate and production scale |
| AI Enabler / LLM tooling | 2025 launch press around model-selection automation | Expansion stage | Pushes Cast beyond pure infrastructure tuning | Show customer references and model-governance boundaries |
| Kubernetes security / Kvisor | Docs explicitly say significant changes are underway | Transitional | Potential trust differentiator, but feature churn adds implementation risk | Explain roadmap, deprecations, and support guarantees during transition |
Stage labels come from explicit public cues such as launch timing, docs language, and case-study depth rather than from internal product-roadmap disclosures.
[CE005, CE010, CE011, CE018, CE020, CE021]5.5 Exhibits
06Customers
6.1 Customer Segmentation and Fit
The visible customer base suggests Cast AI is not primarily an SMB tool. The named references point toward mid-market and enterprise buyers that run meaningful Kubernetes, cloud, or AI workloads and care about reliability as much as cost. Public materials and customer pages place Cast across a diverse set of sectors: BMW Group and Mercedes-Benz.io in automotive and digital platforms; Cisco and Akamai in networking and cloud infrastructure; FICO in analytics and financial decisioning; Swisscom in telecom; NielsenIQ in data analytics; project44 in logistics software; Branch in mobile attribution; Hugging Face in AI infrastructure; and ALLEN Digital in education AI. This breadth matters because it implies the product is not confined to one niche workload pattern. At the same time, the jobs-to-be-done are fairly consistent: large-scale public-cloud operations, Kubernetes management, GPU or CPU-intensive workloads, and the need to automate savings without sacrificing performance. The public file therefore points to a customer profile defined more by cloud complexity and spend intensity than by vertical alone.[CU001, CU002, CU003, CU004, CU005, CU006]
| Customer / cohort | Sector | Public proof depth | Why it fits Cast AI | Implication |
|---|---|---|---|---|
| BMW Group / Mercedes-Benz.io | Automotive / digital platforms | Mercedes has deep case evidence; BMW is logo-level reference | Large digital platforms with Kubernetes complexity and cost sensitivity | Automotive accounts suggest global enterprise credibility |
| Cisco / Akamai | Networking / cloud infrastructure | Akamai has deep case evidence; Cisco is logo-level reference | Infrastructure-heavy environments where reliability and scale matter | Good fit with Cast’s performance-plus-savings narrative |
| FICO / Swisscom | Analytics / telecom | Logo-level reference | Regulated or mission-critical environments with cost-control needs | Supports enterprise and regulated-market relevance |
| NielsenIQ / project44 / Branch | Data, logistics, mobile software | Deep quantified case studies | Cloud-native software operators with large Kubernetes footprints | Best public ROI proof set |
| Hugging Face / ALLEN Digital | AI / education AI | Partnership plus AI and GPU case evidence | CPU / GPU-intensive workloads where automation helps unlock AI economics | Supports expansion into AI infrastructure budgets |
The table groups logos by sector and proof depth so that deep deployment evidence is not conflated with simple logo mentions.
[CU004, CU005, CU006, CU007, CU008, CU009]The customer path usually starts with cloud-spend pain, moves through proof of savings, and then expands into broader automation or AI workloads.
[CU001, CU016, CU017, CU021, CU034, CU037]6.2 Named Customer Proof and Deployment Depth
The strongest part of Cast AI's customer evidence is not the logo slide; it is the fact that several references contain specific before-and-after operating outcomes. NielsenIQ's case study says Cast cut cloud costs by up to 80 percent. project44 reports 50 percent GKE savings in one month. Branch describes several million dollars of annual AWS savings. ALLEN Digital frames Kimchi Inference as a 71 percent LLM-cost reduction and better GPU utilization. Hugging Face partnership materials describe automatic cluster optimization for AI workloads on AWS and Google. The Mercedes-Benz.io engineering write-up adds third-party depth by explaining how dynamic autoscaling, smart eviction, and runtime bin packing were applied in a large internal platform environment. Akamai's case study is especially valuable because it ties Cast to a demanding cloud-infrastructure operator with strict SLAs, not only a cost-sensitive startup. Collectively, those examples make a stronger deployment-depth case than generic enterprise name-dropping. They show that Cast AI is being used in production on customer systems where uptime, performance, and budget matter simultaneously.[CU016, CU017, CU018, CU019, CU020, CU021]
| Period / signal | Public metric | Source | Interpretation | Caveat |
|---|---|---|---|---|
| 2023 to 2024 | Customer base doubled | Unicorns Lithuania / Cast-linked reporting | Strong adoption acceleration before Series C | No exact denominator or paid-customer split |
| April 2025 | 2,100+ organizations trusted | Series C reporting | Large installed base for a specialized infra product | Organization count is not the same as paying enterprise accounts |
| Spring 2026 | 20 badges across 36 G2 reports | Cast AI G2 leader press release | Review and market-presence signal indicates broad product usage | Company-authored summary of a marketplace signal |
| 2026 archived G2 page | Large review base visible | G2 page snapshot | Useful repeat-usage and satisfaction proxy | Does not reveal retention or contract value |
| 2026 public file | Named enterprise logos across multiple sectors | Case-study hub and press materials | Supports enterprise credibility and vertical breadth | Depth differs widely by customer |
This table isolates adoption proxies because public evidence is stronger on customer count and proof surfaces than on revenue or cohort economics.
[CU002, CU003, CU025, CU026, CU027, CU036]| Customer | Public source type | Proof depth | Quoted / reported outcome | What it demonstrates |
|---|---|---|---|---|
| NielsenIQ | Case study | High | Up to 80% cloud-cost reduction | Strong savings proof in data-intensive analytics environment |
| project44 | Case study | High | 50% savings on GKE in one month | Fast time-to-value and cloud-native deployment depth |
| Branch | Case study | High | Several million dollars annually in AWS savings | Meaningful dollar-denominated ROI for a software customer |
| ALLEN Digital | Case study | High | 71% lower LLM costs via Kimchi Inference | GPU / AI workload relevance and non-core expansion potential |
| Hugging Face | Partnership press release | Medium | Reduced cost of deploying LLMs and real-time cluster optimization | AI workload credibility and CPU/GPU optimization fit |
| Akamai | Case study | High | Complex SLA-bound infrastructure optimized with bin packing and Spot automation | Strong enterprise reference quality |
| Mercedes-Benz.io | Case study + customer engineering blog | High | Lowered operational overhead and costs using dynamic autoscaling | Third-party technical corroboration of deployment depth |
| BMW / Cisco / FICO / Swisscom | Logo references | Low-Medium | Named as current customers | Enterprise logo quality without deep public deployment detail |
Proof depth distinguishes quantitative case studies and third-party engineering write-ups from lighter-touch logo mentions.
[CU004, CU016, CU017, CU018, CU019, CU020]Public evidence narrows from broad organization count down to a smaller set of deeply documented reference customers.
The funnel contrasts public proof layers, not internal conversion data; named-logo and quantified-case counts reflect only the reviewed source set.
[CU002, CU003, CU024, CU035, CU036]Customer proof quality varies from logo-only references to quantified ROI and third-party technical corroboration.
[CU024, CU029, CU035, CU038]6.3 Adoption, Satisfaction, and Repeat-Usage Proxies
Cast AI does not publish classic SaaS retention or cohort metrics, so customer quality has to be inferred from adoption proxies. The best ones are meaningful. The company says it doubled its customer base between 2023 and 2024 and serves more than 2,100 organizations. G2-derived marketing materials say the product won 20 badges across 36 Spring 2026 reports, and the archived G2 page shows a large review base for a relatively specialized infrastructure product. Those signals matter because infrastructure tools rarely accumulate broad review footprints unless the deployment base is both real and reasonably satisfied. At the same time, the adverse evidence should not be ignored. Cybernews highlights onboarding friction, IAM complexity, and documentation clarity issues, which are plausible headwinds for expansion and customer-success load. The result is a balanced customer-quality read: Cast likely has strong fit in cloud-native, platform-engineering-led accounts, but the public file still does not reveal whether those wins translate into repeatable expansion, multi-product adoption, or exceptional retention over time.[CU002, CU003, CU025, CU026, CU027, CU028]
| Signal | Public evidence | Confidence | Why it matters | Gap |
|---|---|---|---|---|
| Review volume | Large G2 review base visible | Medium | Suggests non-trivial adoption and product usage over time | No mapping to retained ARR or logo retention |
| Marketplace recognition | G2 Spring 2026 leader and badge count | Medium | Useful proxy for customer satisfaction and mindshare | Company summarizes the signal rather than publishing raw cohort data |
| Deep case-study repetition | Multiple detailed case studies across sectors | Medium | Shows repeatable reference generation rather than a single lighthouse logo | Still mostly vendor-authored evidence |
| Onboarding friction | Cybernews flags setup and docs clarity concerns | Medium | Potential drag on onboarding-to-expansion motion | No measured churn or implementation-failure rate |
| Expansion potential | AI / GPU modules extend product relevance inside existing accounts | Low-Medium | Could increase wallet share in mature customers | No public attach-rate or expansion-revenue data |
Retention is not publicly disclosed, so this table preserves only the observable repeat-usage and satisfaction proxies along with their caveats.
[CU025, CU026, CU027, CU028, CU029, CU037]Public evidence does not provide true cohort retention, so this figure instead visualizes proof-depth retention across evidence types over the customer-lifecycle narrative.
The cohort is not a revenue or logo-retention cohort; it is an evidence-depth cohort scored as percentage presence across lifecycle stages in the reviewed public file.
[CU025, CU026, CU027, CU028, CU029, CU038]6.4 Concentration and Reference-Quality Risks
The main customer risk is not lack of logos; it is lack of financial context around those logos. Public evidence does not show how revenue is distributed across the account base, whether any single customer is above a material threshold, how many logos are paying at scale versus piloting, or what retention looks like beyond individual success stories. SEC disclosure guidance is relevant conceptually here because it illustrates that public issuers would typically need to call out material customer concentrations, while Cast—as a private company—does not have to. That leaves a real diligence gap. There is also a reference-quality issue. Many of the strongest public proofs are vendor-authored case studies or press releases, and some named customers such as BMW, Cisco, FICO, and Swisscom appear more often as referenced logos than as deeply documented deployments. This does not invalidate the customer story, but it does mean the evidence is uneven. A prudent reader should separate high-depth proof accounts like NielsenIQ, project44, Branch, ALLEN Digital, Hugging Face, Akamai, and Mercedes-Benz.io from logo-only references whose deployment scope is not publicly described.[CU004, CU024, CU029, CU030, CU031, CU035]
| Risk area | Public status | Impact | Best public evidence | Diligence ask |
|---|---|---|---|---|
| Customer concentration | Unknown | Could materially affect revenue durability if one or two large logos dominate | No public concentration disclosures | Request top-10 customer revenue share and any >10% customers |
| Retention / NRR | Unknown | Without it, logo quality cannot be translated into durable revenue quality | No cohort or renewal metrics disclosed | Request NRR, gross retention, and logo churn by segment |
| Logo depth inconsistency | Known | Some logos are deeply evidenced while others are only mentioned | Case-study depth differs sharply across named customers | Map each logo to actual deployment scope and contract size |
| Vendor-authored proof bias | Known | Could overstate benefits if independent references are sparse | Many proof points come from Cast-owned case studies and press releases | Provide reference calls and customer-authored ROI decks |
| AI / GPU expansion inside base | Plausible but unquantified | Could improve expansion economics if attach rates are real | Hugging Face and ALLEN Digital show AI fit, but not attach-rate breadth | Break out AI / GPU customer count and expansion ARR |
The key customer risk is not absence of logos but absence of revenue-distribution and retention context around those logos.
[CU024, CU029, CU030, CU031, CU037, CU038]6.5 Exhibits
07Risks
7.1 Legal, Privacy, and Compliance Risk
The legal and privacy framework around Cast AI is substantive, which is good, but it also reveals where the company is exposed. The Terms of Service are a binding order-form model effective February 2025, meaning disputes, suspension rights, onboarding responsibilities, and service access are contractually centralized. The privacy policy and DPA go further by splitting controller responsibilities between the U.S. entity and the Lithuanian entity and by explicitly placing Cast in the processor role for customer-uploaded cloud-service data. That helps enterprise buyers frame GDPR and U.S. privacy obligations, but it also means Cast's compliance burden is real and cross-border. The information security policy, SOC 2 Type II announcement, CIS certification press release, and CIS partner page provide meaningful trust signals, especially for regulated or security-sensitive buyers. Still, those are control-layer mitigants, not proofs that every enterprise deployment is low risk. The product operates inside customer Kubernetes environments, so any mismatch between contractual processor obligations, actual permissions granted, and real-world incident handling could create legal exposure, audit pain, or slowed procurement in regulated segments. A further wrinkle is that outside commentary shows AI-governance requirements are tightening in 2026, which may increase diligence burden as Cast expands AI-oriented tooling around model and GPU workflows.[CR001, CR002, CR003, CR004, CR005, CR006]
| Risk | Why it exists | Evidence | Severity | Mitigation / diligence ask |
|---|---|---|---|---|
| Contractual suspension / access risk | Terms govern service access and onboarding acceptance through binding order forms | Terms of Service effective Feb. 6, 2025 | Medium | Review suspension, termination, and limitation clauses against enterprise risk appetite |
| Cross-border privacy governance | Controller roles split across U.S. and Lithuanian entities while customer data is processed globally | Privacy Policy + DPA | Medium | Map customer jurisdiction to controller / processor responsibilities |
| Data-processing compliance | Processor obligations attach when customer personal data flows through Cast services | DPA references GDPR, CCPA, and other privacy laws | High | Validate SCCs, subprocessors, and breach-notification mechanics |
| Procurement / audit burden | Security certifications help but procurement teams will still test deployment-specific controls | SOC 2, ISO 27001, CIS materials | Medium | Request latest reports, bridge letters, and customer-control matrix |
| Regulated-customer expansion risk | Finance, telecom, and global enterprises can impose more demanding compliance requirements | Named customers and CIS alignment | Medium | Assess whether product evidence meets target vertical requirements |
The legal register focuses on risks created by the company’s own contracts, privacy roles, and security/compliance obligations rather than broad macro regulation alone.
[CR001, CR002, CR003, CR004, CR005, CR006]Likelihood and impact are highest where Cast combines deep infrastructure access with evolving product surfaces or upstream dependency.
[CR022, CR024, CR025, CR026, CR027, CR032]7.2 Operational, Quality, and Security Risk
Operationally, Cast AI is exposed because it is not sitting outside the stack; it is making or influencing changes inside clusters. The platform-permissions docs show that the product depends on explicit permissions, data collection, port openings, and access to cluster/cloud metadata. The database-optimizer security docs reinforce that data minimization matters: query SQL is anonymized, parameters are hashed, and retention is intentionally limited. Those are good controls, but they also underscore that Cast is touching meaningful telemetry and infrastructure pathways. Security-specific risk is similarly double-sided. Kvisor is documented as an open-source security agent with image scanning, runtime monitoring, and network observability, yet several security docs warn that the feature set is undergoing significant changes and that some functionality is being deprecated or moved. That creates change-management risk for customers who care about stable control frameworks. Finally, external outage aggregators make clear that reliability is not invisible: StatusGator and IsDown both track incidents, and IsDown reports a non-trivial history of outages since January 2025. For a product operating in mission-critical Kubernetes environments, even short disruptions or configuration errors can have outsized reputational consequences.[CR008, CR009, CR010, CR011, CR012, CR013]
| Risk | Trigger / mechanism | Evidence | Severity | Mitigation / diligence ask |
|---|---|---|---|---|
| Permissions misconfiguration | Platform depends on explicit cluster/cloud permissions and network openings | Platform permissions docs | High | Review least-privilege model and failed-onboarding examples |
| Feature transition risk | Security docs say functionality is moving and some features are deprecated | Kvisor + security docs | Medium-High | Get current roadmap and support guarantees during migration |
| Security blind spots | Open-source and dashboard tooling help, but customer-specific posture still depends on proper enablement | Kvisor overview + dashboard | Medium | Confirm default coverage and required customer action |
| Incident / outage visibility | External services track repeated incidents and average resolution windows | StatusGator + IsDown | Medium | Request MTTR, incident severities, and root-cause examples |
| Data handling inside telemetry-rich products | Even anonymized query processing and cluster telemetry create trust obligations | DB Optimizer security docs | Medium | Trace data flows and retention periods by product |
This table isolates risks created by how the product operates, changes, and handles telemetry inside customer infrastructure.
[CR008, CR009, CR010, CR011, CR012, CR013]Permissions, documentation churn, or upstream outages can cascade into customer trust, compliance, and revenue risk.
[CR008, CR016, CR017, CR022, CR025, CR031]7.3 Partner, Dependency, and Execution Risk
The product architecture makes Cast highly dependent on outside systems and on its own ability to keep specialist talent aligned with a fast-moving roadmap. Dependence on cloud providers is obvious from the docs: permissions, ports, price feeds, status-page components, and region or GPU capacity all sit upstream of Cast's automation engine. That matters because a customer may blame Cast even when the root cause is a cloud-provider issue or a capacity shortfall. Dependency risk also increased with the push into OMNI Compute and GPU fungibility, where scarce supply and multicloud orchestration become part of the value proposition. People risk is subtler but still important. The careers page emphasizes speed, ownership, customer obsession, and hiring the best; the SOC 2 blog emphasizes the security pedigree of the founding team and the CTO's background in security products. Those are good signs, yet they also imply a company that must keep hiring scarce platform and security talent while shipping aggressively. Combined with docs that explicitly admit feature movement and transition, the execution question is not whether Cast understands the category. It is whether it can sustain high-quality delivery, support, and documentation across a widening product surface without overextending itself.[CR018, CR019, CR020, CR021, CR025, CR026]
| Dependency | Why it matters | Public evidence | Severity | Diligence ask |
|---|---|---|---|---|
| Cloud-provider APIs and permissions | Core automation depends on accurate permissions, metadata, and service availability | Permissions docs and status aggregators | High | Validate fallback behavior during provider incidents |
| GPU availability and multicloud capacity | OMNI / GPU value proposition depends on external scarce supply | OMNI launch and benchmark materials | High | Review actual provider mix, fallback logic, and supply concentration |
| Strategic partner / investor expectations | Large strategic backers can influence go-to-market or expansion assumptions | PAV / Shinsegae investment disclosures | Medium | Clarify commercial rights, if any, attached to strategic capital |
| Compliance ecosystem | CIS and SOC signals help procurement but can become expected table stakes | CIS partner page, SOC 2 blog | Medium | Check renewal and audit burden in regulated accounts |
| Public reputation surfaces | External status and review sites amplify operational failures quickly | Status pages and Cybernews | Medium | Review comms playbook and customer notification SLAs |
Dependency risk is high because Cast’s value depends on upstream systems and ecosystem trust signals it does not fully control.
[CR014, CR016, CR017, CR020, CR021, CR025]| Execution risk | Public clue | Why it matters | Severity | Mitigation / diligence ask |
|---|---|---|---|---|
| Maintaining security expertise | SOC 2 post highlights founder / CTO security background | Security depth is a core differentiator and requirement | Medium | Assess bench strength beyond founders and named leaders |
| Scaling product breadth | Careers page emphasizes speed, ownership, and broad hiring needs | More modules mean more support, docs, and QA load | Medium-High | Review org chart and product-support staffing by module |
| Documentation debt | Security docs explicitly warn that some surfaces are changing | Docs drift can slow onboarding and create misconfiguration risk | Medium-High | Inspect documentation update cadence and ownership |
| Support load from enterprise deployments | Mission-critical customers may require deep onboarding and incident support | Review signals plus enterprise logos | Medium | Quantify customer-success ratios and onboarding timelines |
| Execution under rapid growth | 2100+ company claim plus unicorn expansion implies organizational scaling pressure | Fast growth can strain process quality and retention | Medium | Request voluntary attrition, hiring velocity, and team-distribution data |
People and execution risks are inferred from growth posture, product breadth, and explicit documentation-change signals rather than from public HR metrics.
[CR007, CR010, CR018, CR019, CR021, CR027]Cast AI depends on legal framing, provider infrastructure, telemetry permissions, and specialized security / platform execution to deliver safely.
[CR003, CR005, CR006, CR018, CR025, CR026]7.4 Mitigations and Kill Criteria
Cast AI does have a credible mitigation stack. On paper it combines legal controls (terms, privacy policy, DPA), security governance (ISO 27001, SOC 2 Type II), compliance tooling (CIS-certified reporting, security dashboard, Kvisor), and public technical documentation about permissions and data handling. Those controls reduce but do not eliminate risk. The key diligence question is therefore not whether controls exist; it is whether they are specific enough for the buyer's deployment context and whether the organization can support them reliably while the platform evolves. Practical kill criteria follow directly from the public record's weak spots. If management cannot clearly map required permissions and data flows, explain the current and future state of the security transition, provide incident-management evidence beyond aggregator snapshots, or demonstrate that GPU and multicloud dependencies are operationally resilient, risk should rise materially. Likewise, if customer concentration, retention, or support burden turn out to be worse than the public file suggests, the legal and operational mitigants will matter less because the business model itself will be more fragile. In short, the mitigants are real, but they need customer-specific validation before being treated as sufficient.[CR022, CR023, CR024, CR028, CR029, CR033]
| Area | Existing mitigation | Residual concern | Kill criterion / escalation trigger |
|---|---|---|---|
| Privacy / legal | Terms, privacy policy, DPA, processor framing | Customer-specific data flows still need validation | Cannot clearly explain controller / processor boundaries or breach obligations |
| Security governance | ISO 27001, SOC 2 Type II, CIS-certified reporting, Kvisor | Security features are evolving and need configuration | Cannot show current roadmap, audit evidence, or migration support |
| Operational reliability | Public docs, status surfaces, and resume / permissions guidance | External incidents and misconfiguration still possible | No credible MTTR history or incident-management narrative |
| Dependency risk | Multicloud value proposition and provider integrations | Cloud outages and GPU scarcity remain upstream constraints | No fallback plan for provider or capacity disruption |
| Execution risk | Security pedigree and hiring culture | Broadening platform may outrun docs and support | Org cannot demonstrate enough product, support, and security capacity to sustain expansion |
The table converts public mitigations into explicit diligence thresholds so that control signals are not mistaken for fully closed risk.
[CR022, CR023, CR024, CR028, CR029, CR033]7.5 Exhibits
08Valuation
8.1 Current Valuation Facts and Why Opacity Matters
The cleanest valuation facts in the public record are the step-ups, not the economics underneath them. Cast AI's April 2025 Series C was reported by TechCrunch as a near-unicorn round close to $900 million post-money, while Reuters-linked coverage carried by Yahoo Finance and MarketScreener pointed to roughly $850 million and total funding above $180 million. In January 2026 Cast AI and Business Wire said a strategic investment from Pacific Alliance Ventures had pushed valuation above $1 billion. Tech.eu then framed the milestone as Lithuania's fifth unicorn. Those facts make the $1 billion headline credible as a market event. What they do not provide is a way to evaluate whether the valuation was cheap, expensive, or simply strategic. The amount of the 2026 round remains undisclosed, so outsiders cannot observe dilution, preference structure, or whether the new money meaningfully changed implied valuation quality. Third-party profile sites such as Premier Alternatives add more noise than clarity because they summarize the headline valuation while admitting incomplete funding-history imports. For valuation work, that means the entry mark is real but the capital-stack context remains materially opaque.[CV001, CV002, CV003, CV004, CV005, CV006]
| Dimension | Current view | Why | Confidence |
|---|---|---|---|
| Recommendation | Track | Valuation is credible but not underwritten enough for aggressive conviction | Medium |
| Valuation stance | Fair | Premium can be justified only if AI-native revenue and retention are already strong | Medium |
| Primary strength | Strong customer proof | Named logos and measured savings support market confidence | High |
| Primary weakness | Opaque economics | ARR, gross margin, NRR, and round amount remain undisclosed | High |
| Key swing factor | AI / GPU attach rate | If AI modules are material, premium multiple support improves | Low |
This table distills the conclusion of the valuation chapter; it is not a substitute for a full priced round model or cap-table analysis.
[CV001, CV005, CV015, CV024, CV026, CV027]| Lens | Bull thesis | Anti-thesis | What would decide it |
|---|---|---|---|
| Category | Cast is AI-native infrastructure automation, not plain cost tooling | Market could still price it as cloud FinOps software with native-tool pressure | Module-level ARR and customer adoption |
| Customer proof | Deep enterprise references support pricing power | Most proof is vendor-authored and concentration unknown | Reference calls and revenue concentration data |
| Product moat | GPU and OMNI expansion justify scarcity premium | AI narrative may outrun monetization reality | Attach-rate and gross-margin evidence |
| Round quality | Unicorn milestone validates market demand | Undisclosed 2026 amount clouds term quality and dilution | 2026 round mechanics and security terms |
| Comparable set | Premium public AI-infra comps can anchor upside | Legacy or mixed infra comps can compress fair value quickly | Growth, NRR, and margin profile versus comps |
The anti-thesis is not about a business collapse; it is about how quickly the market could compress valuation if Cast fails to prove premium AI-native economics.
[CV005, CV008, CV015, CV022, CV023, CV024]The recommendation moves from a credible unicorn fact pattern through comp-based uncertainty to a fair / track conclusion.
[CV015, CV023, CV024, CV026, CV027, CV038]8.2 Comparable Framework and Multiple Context
Because Cast AI does not disclose ARR, gross margin, or NRR publicly, the valuation chapter has to start with frameworks rather than a precision model. The key question is whether Cast should be benchmarked closer to premium AI-native infrastructure software or to ordinary cloud-software / SaaS businesses. Analyst-market-data sources suggest a wide spread. SaasRise cites a median 21.2x EV/revenue in VC rounds and 11.5x in M&A for AI-native software, while Windsor Drake places AI-native application software near an 11x public benchmark and notes that investors are still paying 15x to 30x for foundation-model labs. Multiples.vc adds that public software multiples in 2026 are increasingly driven by AI relevance, technical complexity, and market position rather than generic TAM claims. PublicComps matters less as a number source than as proof of how investors benchmark software: EV/NTM revenue, retention, ACV, analyst estimates, and historicals. To translate those frameworks into a Cast lens, it is helpful to note that public comp candidates with audited filings span both high-quality observability / infrastructure names like Datadog, Cloudflare, Dynatrace, and Snowflake and more mixed infrastructure or hybrid names like IBM, NetApp, DigitalOcean, and MongoDB. That dispersion is exactly why Cast's missing private metrics matter so much.[CV007, CV008, CV009, CV010, CV011, CV012]
| Scenario | Implied narrative | Indicative multiple | ARR needed for $1B EV (USD M) | Interpretation |
|---|---|---|---|---|
| Bull | Premium AI-native infrastructure winner with real GPU monetization | 21.2x | 47.2 | Only modest ARR is needed if the market grants full AI-native VC-style premium |
| Base+ | High-quality AI-native public software benchmark | 11.0x | 90.9 | Still requires real scale and retention, but plausible for a growth-stage leader |
| Base | Solid infrastructure software with partial AI premium | 8.0x | 125 | Requires more mature economics and customer depth than the public file proves today |
| Bear | Legacy-style cloud software or commoditized optimization | 5.5x | 181.8 | Would require substantially more revenue than public evidence implies |
| Downside M&A | Lower-premium strategic exit environment | 3.8x | 263.2 | The $1B mark would look stretched unless revenue is much higher than visible proxies suggest |
ARR requirements are simple reverse-engineered math from $1.0B enterprise value divided by each multiple; they are illustrative because actual EV may differ and Cast does not disclose ARR.
[CV010, CV011, CV016, CV017, CV018, CV019]| Comparable set | Why it fits | Public evidence | Limitation | Filing status |
|---|---|---|---|---|
| Datadog / Dynatrace / Cloudflare | Cloud observability and developer-infrastructure platforms with premium software profiles | Current SEC 10-Ks available; used in software comp frameworks | Not pure cloud-cost optimization vendors | Filed |
| Snowflake / MongoDB / DigitalOcean | Infrastructure / platform software with meaningful developer and cloud economics | Current SEC 10-Ks available | Different business models and data / infra exposure | Filed |
| IBM / NetApp | Broader infra and optimization-adjacent vendors with hybrid or enterprise software exposure | Current SEC 10-Ks available | Mixed hardware / services or wider legacy exposure dilute comp purity | Filed |
| AI-native software benchmarks | Best external multiple context for premium narrative | SaasRise and Windsor Drake 2026 reports | Sector baskets are not company-specific | Analyst / market data |
| Public software multiple aggregators | Useful for EV/NTM revenue and historical software benchmarks | PublicComps and Multiples.vc | Methodologies and peer sets vary | Analyst / market data |
The valuation table intentionally mixes direct software comp frameworks with adjacent audited public companies because no perfect public Cast AI analog exists.
[CV011, CV012, CV013, CV014, CV025, CV034]A $1B enterprise value implies very different ARR requirements depending on which multiple band the market ultimately assigns.
Values are reverse-engineered using EV ÷ revenue multiple; they are sensitivity points, not disclosed ARR ranges for Cast AI.
[CV010, CV011, CV016, CV017, CV018, CV019]For a few illustrative ARR bands, the fair-value range swings materially depending on whether Cast is treated as AI-native or closer to legacy software.
Low uses a 3.8x legacy-software M&A multiple, mid uses an 11.0x AI-native public benchmark, and high uses a 21.2x AI-native VC median from external analyst-market-data sources.
[CV010, CV011, CV016, CV017, CV018, CV019]8.3 Scenario View and Recommendation
The easiest way to pressure-test Cast's unicorn pricing is to invert the multiple math. If the company deserves an approximately 11x AI-native software multiple, then a $1 billion value implies around $91 million of ARR. If it deserves the more aggressive 21.2x AI-native VC median, the same valuation implies only about $47 million of ARR. But if the business is ultimately judged closer to legacy or lower-premium software at 5.5x or 3.8x, the implied ARR requirement jumps to roughly $182 million to $263 million. That spread is huge, and it captures the entire debate. The bull case is that Cast's customer proof, multicloud automation, and GPU adjacency justify premium AI-infrastructure treatment. The bear case is that native-cloud commoditization, opaque round terms, and absent retention / margin disclosure eventually force investors to underwrite it like a less differentiated cloud-software company. The base case therefore lands in the middle: the $1 billion mark is believable and not obviously reckless, but it is also not conservative enough to make the company look clearly mispriced in an investor's favor. The right practical stance is fair / track rather than aggressive buy-in.[CV016, CV017, CV018, CV019, CV022, CV023]
| Trigger | Why it matters | Public warning sign | Diligence test |
|---|---|---|---|
| ARR materially below premium-multiple thresholds | Would make the $1B mark too rich | No public ARR disclosure | Request current ARR and forward growth bridge |
| Gross margin below premium software expectations | Would weaken AI-native infra multiple case | No public gross margin disclosure | Request GAAP gross margin and product mix |
| Weak NRR / expansion despite strong logos | Would imply customer proof is less monetizable than it appears | No public retention metrics | Request NRR, cohort retention, and AI-module expansion rates |
| GPU / AI modules not materially monetized | Would reduce the AI-infrastructure premium narrative | Public evidence emphasizes products, not attach rate | Request module-level ARR and active customer count |
| Undisclosed round terms prove investor protection or weak quality | Could mean the unicorn headline overstates economic quality | 2026 amount undisclosed | Review term sheet, liquidation preferences, and secondary mix |
| Native-cloud competition compresses willingness to pay | Would push Cast toward lower comp buckets | CloudZero comparison and broad substitute set | Review win-loss data and pricing pressure by cloud |
These are the few variables that can most quickly move Cast from a fair premium story to an overvalued one.
[CV005, CV015, CV028, CV029, CV031, CV039]Compact KPI view of the metrics that matter most for the investment case and its weakest disclosure points.
[CV001, CV004, CV007, CV015, CV033, CV042]8.4 Diligence Asks and Thesis Breaks
The valuation thesis breaks in only a few ways, but they are fundamental. First, if ARR turns out to be materially below the rough thresholds implied by premium AI-native multiples, the current price is too full. Second, if gross margins or retention resemble lower-quality infrastructure software rather than sticky, high-value automation, the premium multiple case weakens. Third, if AI and GPU expansion turns out to be more narrative than monetized reality, the company should probably be valued on the narrower Kubernetes-optimization story rather than on a broader AI-infrastructure narrative. Finally, if native cloud and open-source substitutes continue to compress willingness-to-pay, Cast may struggle to hold a premium even if growth remains respectable. The diligence asks are therefore straightforward and non-negotiable: current ARR, gross margin, NRR, customer concentration, AI / GPU module attach rates, contract structure, and the exact economic terms of the January 2026 round. Until those are disclosed, valuation work remains an exercise in scenario framing and comparables rather than a true intrinsic model. That is enough to say the company is worth tracking, but not enough to say the $1 billion mark is definitively cheap.[CV015, CV020, CV028, CV029, CV030, CV038]
| Ask | Why it is required | Public status |
|---|---|---|
| Current ARR and growth by module | Needed to map the company onto a real comp band | Not public |
| Gross margin and cloud / GPU COGS | Needed to know whether the product deserves premium software multiples | Not public |
| NRR, churn, and customer concentration | Needed to test whether logos translate into durable value | Not public |
| 2026 round amount and full term sheet | Needed to assess actual valuation quality and dilution | Not public |
| AI / GPU attach rates and monetization | Needed to validate the premium AI-native narrative | Not public |
| Win-loss data versus native-cloud and FinOps substitutes | Needed to know if premium pricing is durable | Not public |
Until these asks are answered, valuation work should remain scenario-based and medium-confidence rather than conviction underwriting.
[CV015, CV020, CV028, CV029, CV030, CV038]8.5 Exhibits
Disclaimer
This report is a public-evidence diligence snapshot, not investment advice. Important financial, legal, technical, and contractual facts remain non-public and should be verified directly with management and primary documents before any investment decision.
Evidence index
| ID | Statement | Confidence | Sources |
|---|---|---|---|
| CO001 | Cast AI was founded in 2019 by Yuri Frayman, Laurent Gil, and Leon Kuperman. | High | SO018, SO020, SO028 |
| CO002 | The founders created Cast AI after confronting rapidly rising cloud costs at Zenedge before Oracle acquired that company in 2018. | High | SO002, SO018, SO020 |
| CO003 | Cast AI positions itself as an Application Performance Automation platform rather than only a cost-visibility tool. | High | SO002, SO005 |
| CO004 | Cast AI’s homepage markets the company around Kubernetes optimization for performance and cost efficiency. | High | SO001, SO002 |
| CO005 | January 2026 launch materials introduced OMNI Compute as a unified compute control plane and GPU marketplace for cross-cloud capacity. | High | SO004, SO014, SO015 |
| CO006 | Cast AI says OMNI Compute lets enterprises provision and operate GPUs across clouds and regions without code changes. | High | SO004, SO014 |
| CO007 | Cast AI’s current public materials say the company is trusted by 2,100+ companies globally. | High | SO002, SO018, SO025 |
| CO008 | Current public materials name Akamai, BMW, Cisco, FICO, HuggingFace, NielsenIQ, Swisscom, and TGS among Cast AI customers. | High | SO004, SO005, SO017 |
| CO009 | Independent 2025 reporting described Cast AI as Miami-based with most development located in Lithuania, Poland, Romania, and Bulgaria. | Medium | SO018 |
| CO010 | AIN described Cast AI as a Miami and Vilnius-based business, reinforcing the company’s cross-border operating identity. | Medium | SO017 |
| CO011 | Tech Funding News said Vilnius remains one of Cast AI’s most important centers despite its globally distributed structure. | Medium | SO016 |
| CO012 | Cast AI officially lists Yuri Frayman as CEO and co-founder. | Medium | SO002 |
| CO013 | Cast AI officially lists Leon Kuperman as CTO and co-founder. | Medium | SO002 |
| CO014 | Cast AI officially lists Laurent Gil as president and co-founder. | Medium | SO002 |
| CO015 | Cast AI’s leadership page also names Ferréol Hoppenot, Pierre Liduena, Gabija Marganavičė, and Moti Gabay in senior executive roles. | Medium | SO002 |
| CO016 | Cast AI closed an oversubscribed $108M Series C in April 2025. | High | SO005, SO018, SO023, SO024, SO025 |
| CO017 | G2 Venture Partners and SoftBank Vision Fund 2 co-led the Series C. | High | SO005, SO018, SO023 |
| CO018 | Aglaé Ventures joined the Series C alongside existing investors Hedosophia, Cota Capital, Vintage Investment Partners, Creandum, and Uncorrelated Ventures. | High | SO005, SO018 |
| CO019 | Reuters-syndicated coverage said the April 2025 round valued Cast AI at about $850M post-money. | High | SO023, SO024 |
| CO020 | Reuters-syndicated coverage said Cast AI’s total funding exceeded $180M after the April 2025 Series C. | High | SO023, SO024 |
| CO021 | Cast AI announced in January 2026 that a strategic investment from Pacific Alliance Ventures had pushed its valuation above $1B. | High | SO004, SO014, SO015, SO017 |
| CO022 | Reviewed January 2026 sources did not disclose the amount of Pacific Alliance Ventures’ investment. | Medium | SO004, SO014, SO017 |
| CO023 | Pacific Alliance Ventures is the U.S. corporate venture arm of Shinsegae Group, which public sources describe as a $50B-plus conglomerate. | High | SO004, SO014, SO017 |
| CO024 | Multiple regional tech outlets described Cast AI as Lithuania’s fifth unicorn after the January 2026 step-up. | Medium | SO016, SO017, SO028 |
| CO025 | January 2026 launch materials said Cast AI had opened offices in Bangalore, London, New York, and Tel Aviv and subsidiaries in Canada, France, India, Korea, Lithuania, Singapore, and the UK. | Medium | SO004 |
| CO026 | Tech Funding News reported that Cast AI employed more than 300 people across 34 countries in early 2026. | Medium | SO016 |
| CO027 | BalticVC described Cast AI as roughly 200 employees globally, creating a public headcount discrepancy versus TFN’s higher figure. | Medium | SO028 |
| CO028 | Cast AI’s 2025 materials said the company doubled its customer base between 2023 and 2024. | High | SO005, SO026 |
| CO029 | Series C and investor materials said Cast AI opened India and Singapore offices as part of its post-2024 expansion. | High | SO005, SO026, SO027 |
| CO030 | Cast AI’s NielsenIQ case study reported 60-80% savings on non-production clusters, 40-50% savings on production clusters, and payback within two months. | Medium | SO010 |
| CO031 | Cast AI’s project44 case study reported 50% compute-cost savings on the initial rollout cluster within one month. | Medium | SO011 |
| CO032 | Cast AI’s Branch case study said the platform helped cut over 25% of EC2 compute costs while replacing multi-million-dollar upfront savings-plan spending. | Medium | SO012 |
| CO033 | Cast AI’s about page currently displays cumulative counters of 6,458,974,835 CPUs provisioned and 372,359,137 nodes provisioned. | Medium | SO002 |
| CO034 | Official OMNI Compute launch materials said Oracle is one of the cloud providers making GPU capacity available through the new product. | High | SO004, SO014 |
| CO035 | Cast AI published a partnership announcement saying Hugging Face works with the company to optimize AI workloads on AWS and Google Cloud. | Medium | SO006 |
| CO036 | Cast AI launched AI Enabler to optimize LLM deployment and automate model selection before the January 2026 OMNI Compute release. | Medium | SO007 |
| CO037 | Recognition cited by Cast AI, including Futuriom 50, IDC Innovator, and G2 leadership references, supports visibility but does not substitute for audited operating metrics. | Medium | SO005, SO008, SO026 |
| CO038 | Cybernews’ 2026 review praised Cast AI’s automation and multi-cloud support but flagged advanced setup complexity, limited cost-reporting granularity, and higher pricing for smaller teams. | Medium | SO022 |
| CO039 | StatusGator reported a partial outage involving intermittent Azure AKS node provisioning failures in select regions at the time of review. | Medium | SO021 |
| CO040 | Laurent Gil told Tech Funding News that Cast AI increasingly views itself as an SLO-first or performance-led platform rather than only a cost-optimization tool. | Medium | SO016 |
| CO041 | January 2026 launch materials included public endorsements from Samsung and Uniphore for Cast AI’s production infrastructure capabilities. | High | SO004, SO014 |
| CO042 | Cast AI says OMNI Compute reduces vendor lock-in by allowing teams to control where workloads run for compliance, resilience, and performance reasons. | High | SO004, SO014 |
| CO043 | Reuters reported that Cast AI saw major demand acceleration for Kubernetes automation as AI adoption surged in the six months leading into the Series C. | Medium | SO024 |
| CM001 | IDC estimated worldwide intelligent CloudOps software revenue at $23.4B in 2024 and $45.0B in 2029, implying 14.0% CAGR. | Medium | SM006 |
| CM002 | MarketsandMarkets projected the cloud FinOps market to grow from $14.88B in 2025 to $26.91B in 2030 at 12.6% CAGR. | Medium | SM009 |
| CM003 | The Business Research Company sized Kubernetes cost management at $1.75B in 2025, $2.23B in 2026, and $5.78B in 2030. | Medium | SM008 |
| CM004 | Verified Market Reports sized the broader cloud cost management and optimization market at $9.2B in 2026 and $35.4B in 2034. | Medium | SM010 |
| CM005 | Business Research Insights published a different broader cloud cost optimization estimate of $11.01B in 2026 rising to $38.4B in 2035. | Low | SM018 |
| CM006 | Public market research shows Cast AI participates in a fast-growing but definition-sensitive market rather than one universally sized category. | Medium | SM006, SM008, SM009, SM010, SM018 |
| CM007 | The Business Research Company defines Kubernetes cost management around monitoring, analyzing, and optimizing costs for Kubernetes workloads, spanning software and services, cloud and on-prem, and functions such as resource optimization, cost allocation, budgeting, and governance. | Medium | SM008 |
| CM008 | MarketsandMarkets said cost management and optimization is the largest application or capability inside cloud FinOps by 2030. | Medium | SM009 |
| CM009 | MarketsandMarkets said multi-cloud will hold the largest deployment-environment share and hybrid cloud will be the fastest-growing deployment mode in cloud FinOps. | Medium | SM009 |
| CM010 | CNCF’s Kubernetes FinOps microsurvey found that Kubernetes had driven cloud spend up for nearly half of respondents. | Medium | SM005 |
| CM011 | In the CNCF microsurvey, 50% of respondents said Kubernetes consumed up to 25% of cloud spend and 28% said it consumed up to 50%. | Medium | SM005 |
| CM012 | In the same CNCF microsurvey, 26% of respondents spent less than $50k per month on cloud while 22% spent more than $1M per month. | Medium | SM005 |
| CM013 | CNCF’s microsurvey reported that 49% of respondents operated up to 50 Kubernetes nodes, 17% ran 101-250 nodes, and 19% ran 251 or more nodes. | Medium | SM005 |
| CM014 | CNCF’s microsurvey identified overprovisioning as the leading cause of overspend at 70%, followed by lack of responsibility at 45% and technical debt or sprawl at 43% each. | Medium | SM005 |
| CM015 | Cast AI’s 2026 optimization report measured average Kubernetes CPU utilization at 8%, memory utilization at 20%, and GPU utilization at 5% across tens of thousands of production clusters. | High | SM001, SM002 |
| CM016 | Cast AI’s report said CPU overprovisioning had reached 69% year over year and memory overprovisioning stood at 79%. | Medium | SM001 |
| CM017 | Cast AI’s report said AWS raised H200 Capacity Block prices by 15% in January 2026. | Medium | SM001 |
| CM018 | Cast AI’s report said fewer than 2% of GPUs ran on Spot through most of 2025. | Medium | SM001 |
| CM019 | The FinOps Foundation defines FinOps as a cross-functional practice where engineering, finance, product, and related teams work together to optimize technology value rather than only cut spend. | Medium | SM004 |
| CM020 | The FinOps Foundation lists executives, engineers, FinOps practitioners, operations, finance, and procurement among the core personas involved in FinOps. | High | SM004, SM022 |
| CM021 | The FinOps Foundation’s Usage Optimization capability says engineering ultimately becomes the primary owner of workload optimization, using rightsizing, scaling, scheduling, geographic shifting, and AI optimization methods. | Medium | SM021 |
| CM022 | Google’s cost-optimization guidance identifies CTOs, CIOs, CFOs, architects, developers, administrators, and operators as relevant stakeholders in cloud cost decisions. | Medium | SM023 |
| CM023 | Microsoft’s FinOps documentation similarly frames the practice as a bridge between financial management and cloud engineering and operations. | Medium | SM025 |
| CM024 | Google says cloud cost models differ fundamentally from on-premises CapEx models because most cloud consumption is treated as OpEx. | Medium | SM023 |
| CM025 | GKE Autopilot is a native substitute that manages nodes, scaling, scheduling, and resource defaults while simplifying billing forecasts. | Medium | SM013 |
| CM026 | AKS cost guidance covers autoscaling, VPA, KEDA, Karpenter-based node autoprovisioning, GPU sharing, and multitenancy trade-offs. | Medium | SM014 |
| CM027 | Karpenter is an open-source node-provisioning project that represents an internal-build or open-source substitute for parts of Cast AI’s value proposition. | Medium | SM012, SM014 |
| CM028 | Red Hat OpenShift cost management provides cluster, project, and application visibility with showback and multicloud cost models. | Medium | SM024 |
| CM029 | IBM says organizations waste about 32% of cloud spend and that both cloud-provider tools and independent multicloud tools are used to optimize it. | Medium | SM026 |
| CM030 | Deloitte said inference costs fell 280-fold over the prior two years while aggregate enterprise AI spending still accelerated. | Medium | SM011 |
| CM031 | Deloitte said on-premises deployment can become more economical when cloud costs exceed 60% to 70% of equivalent hardware costs for predictable high-volume AI workloads. | Medium | SM011 |
| CM032 | Deloitte said leading enterprises are moving toward a three-tier hybrid model of cloud for elasticity, on-premises for consistency, and edge for immediacy. | Medium | SM011 |
| CM033 | Deloitte and Google both highlight complexity as a major constraint when organizations manage heterogeneous or multicloud platforms. | High | SM011, SM023 |
| CM034 | Deploybase estimated B200 allocation timelines at 6-8 weeks and H200 lead times at 2-4 weeks in 2026, illustrating the operational friction around GPU availability. | Low | SM017 |
| CM035 | CoreWeave markets a Kubernetes-native GPU environment and cross-cloud AI experience, showing that specialized GPU clouds are now part of the same buyer conversation. | Medium | SM016 |
| CM036 | Global Growth Insights sized the broader Kubernetes solutions market at $2.51B in 2025, $3.11B in 2026, and $21.11B by 2035, driven by cloud-native, DevOps, multi-cloud, and AI orchestration trends. | Low | SM020 |
| CM037 | Global Growth Insights also cited toolchain complexity, skills gaps, vendor lock-in, and monitoring limitations as important Kubernetes-market restraints. | Low | SM020 |
| CM038 | Cast AI’s effective addressable market is the overlap of CloudOps software, Kubernetes cost management, and AI/GPU workload optimization rather than any one headline category alone. | Medium | SM006, SM008, SM009, SM011, SM015 |
| CM039 | The primary buyer segments for Cast AI-type tooling are platform engineering or SRE teams, central FinOps or cloud-economics teams, and AI infrastructure platform teams. | Medium | SM004, SM021, SM022, SM023, SM025 |
| CM040 | Budget ownership usually spans CTO/CIO/CFO-level executives, engineering cost-center owners, and finance or procurement partners rather than a single persona. | Medium | SM004, SM022, SM023, SM025 |
| CM041 | Adoption triggers include budget overruns, weak cost visibility, multi-cloud sprawl, AI GPU scarcity, and pressure to implement showback or chargeback. | Medium | SM005, SM011, SM021, SM024 |
| CM042 | Native cloud controls and internal platform teams reduce the cleanly addressable third-party market because some organizations can solve meaningful portions of the problem without buying Cast AI. | Medium | SM012, SM013, SM014, SM024, SM026 |
| CM043 | A constrained 2026 Cast-relevant SAM of roughly $2-4B is supportable by anchoring on the $2.23B Kubernetes cost-management market and adding adjacent AI/GPU and hybrid optimization budgets. | Medium | SM008, SM009, SM011 |
| CM044 | A near-term third-party SOM of roughly $0.3-0.8B is plausible only if vendors win a modest slice of high-spend Kubernetes and AI infrastructure buyers, so public data supports a range rather than a precise target. | Low | SM005, SM008, SM009, SM011 |
| CP001 | Cast AI competes across four main alternative classes: automation-first optimization suites, visibility-first FinOps tools, rightsizing specialists, and native-cloud or open-source substitutes. | Medium | SP006, SP008, SP011, SP019, SP023 |
| CP002 | Cast AI describes itself as an all-in-one Kubernetes automation, optimization, security, and cost-management platform spanning AWS, Azure, GCP, OCI, and Cast AI Anywhere. | High | SP001, SP002 |
| CP003 | Public Cast AI pricing evidence points to a free monitoring entry point and sales-led, usage-based contracting for deeper automation rather than a simple self-serve list price. | Medium | SP003, SP026, SP027 |
| CP004 | IBM Kubecost emphasizes real-time visibility, cost allocation, governance, and optimization guidance for Kubernetes spend. | Medium | SP006 |
| CP005 | IBM said its 2024 acquisition of Kubecost broadened IBM’s hybrid-cloud cost management and FinOps portfolio. | High | SP006, SP007 |
| CP006 | Flexera Ocean markets Kubernetes infrastructure scaling, cost savings, and container-cost insight through AI- and ML-driven automation. | Medium | SP008 |
| CP007 | Spot’s container-optimization assets are now owned by Flexera after the 2025 acquisition of NetApp’s Spot portfolio. | High | SP008, SP009 |
| CP008 | StormForge Optimize Live is presented as autonomous Kubernetes rightsizing that works with the Kubernetes HPA and can be onboarded quickly. | High | SP011, SP012 |
| CP009 | StormForge’s reviewed product materials concentrate on workload rightsizing and guardrails rather than on replacing a whole multicloud control plane. | Medium | SP010, SP011, SP012 |
| CP010 | Kubex positions itself around AI-driven pod, node, pre-warming, and GPU-aware optimization, including explicit Karpenter-node optimization. | High | SP013, SP014 |
| CP011 | IBM Turbonomic is broader than Cast AI because it markets application resource management across compute, storage, network, VMs, and Kubernetes in hybrid and multicloud environments. | Medium | SP015 |
| CP012 | Karpenter is an open-source project for just-in-time Kubernetes node provisioning based on unscheduled-pod demand. | High | SP016, SP017 |
| CP013 | AWS documents Karpenter as a way to provision right-sized nodes through NodePools and pod scheduling constraints. | Medium | SP017 |
| CP014 | GKE Autopilot is a managed mode in which Google manages node configuration, scaling, security, and other infrastructure settings. | Medium | SP019 |
| CP015 | Google says GKE pricing includes automated infrastructure cost optimization and free monthly credits equivalent to one Autopilot or zonal Standard cluster. | Medium | SP020 |
| CP016 | AKS node auto-provisioning automatically provisions and manages optimal VM configurations and is based on open-source Karpenter. | High | SP021, SP022 |
| CP017 | The classic AKS cluster autoscaler remains a lighter native option that scales node pools up and down based on unschedulable pods and underused nodes. | Medium | SP022 |
| CP018 | OpenCost is a vendor-neutral open-source project for measuring and allocating Kubernetes and cloud infrastructure costs in real time. | High | SP023, SP024 |
| CP019 | Reviewed OpenCost materials emphasize cost allocation, chargeback, and exports rather than autonomous bin packing or spot orchestration. | Medium | SP023, SP024 |
| CP020 | The FinOps Foundation frames usage optimization as a broad operating capability across cloud, SaaS, and on-prem environments, which supports internal build and process-driven alternatives to buying Cast AI. | Medium | SP025 |
| CP021 | Cybernews rated Cast AI 4.5 out of 5 while citing IAM-heavy setup complexity, a steep learning curve, and weaker reporting depth for some users. | Medium | SP026 |
| CP022 | The archived G2 product page shows Cast AI with 189 reviews and a free Kubernetes cost monitoring tier. | Medium | SP027 |
| CP023 | CloudZero says the competitive landscape changed after IBM acquired Kubecost and after Cast AI expanded into GPU, LLM, and database optimization. | Medium | SP028 |
| CP024 | CloudZero’s comparison presents Kubecost as stronger on granular namespace-, pod-, and label-level cost attribution while Cast AI offers overlapping monitoring plus broader optimization. | Medium | SP006, SP028 |
| CP025 | nOps argues that Cast AI is concentrated on Kubernetes autoscaling and leaves commitment management, SaaS spend, AI budgets, and non-Kubernetes compute to other tools. | Medium | SP029 |
| CP026 | nOps argues that usage-based pricing can become expensive as Kubernetes usage scales, even if savings do not rise proportionally. | Medium | SP029 |
| CP027 | Flexera Ocean is the closest automation-first commercial alternative to Cast AI because both center on autonomous Kubernetes infrastructure optimization rather than on dashboards alone. | Medium | SP002, SP008, SP009 |
| CP028 | IBM Kubecost and OpenCost compete more on visibility, allocation, and governance than on hands-free infrastructure control. | Medium | SP006, SP023, SP028 |
| CP029 | StormForge and Kubex are best understood as rightsizing specialists that overlap with Cast AI on efficiency but are narrower than a full cross-cloud control layer. | Medium | SP011, SP014 |
| CP030 | Native cloud tools now solve meaningful parts of Cast AI’s job because AWS exposes Karpenter, Google manages nodes and scaling in Autopilot, and Azure offers Karpenter-based node auto-provisioning. | High | SP017, SP019, SP021 |
| CP031 | Native-cloud substitutes are most dangerous in single-provider estates because Google and Azure publicly present important cost and provisioning features as included or preconfigured. | High | SP020, SP021, SP022 |
| CP032 | Switching costs are moderate rather than prohibitive because buyers can assemble modular substitutes from OpenCost, Karpenter, and native cloud autoscaling services. | Medium | SP016, SP021, SP023, SP025 |
| CP033 | Multi-homing is plausible because visibility tools like Kubecost or OpenCost can be paired with native cloud provisioning or specialist rightsizing products. | Medium | SP006, SP011, SP023 |
| CP034 | IBM, Flexera, and IBM Turbonomic each enter deals with broader enterprise distribution and adjacent FinOps or infrastructure portfolios than Cast AI. | Medium | SP007, SP009, SP015 |
| CP035 | Cast AI’s clearest differentiation is packaging visibility, optimization, autoscaling, and multicloud execution into one product rather than selling a single point function. | Medium | SP001, SP002, SP028 |
| CP036 | Reviewer evidence suggests Cast AI’s weakest areas remain onboarding clarity, IAM setup friction, reporting depth, and affordability for smaller teams. | Medium | SP026, SP029 |
| CP037 | The core commoditization threat comes from native-cloud bundling and the low-cost floor established by OpenCost and internal build patterns. | Medium | SP020, SP021, SP023, SP025 |
| CP038 | Cast AI’s moat is most defensible where buyers need multicloud automation or GPU-aware efficiency beyond what native providers expose today. | Medium | SP002, SP004, SP028 |
| CP039 | Flexera says Spot strengthens a comprehensive FinOps offering that now covers cloud commitments, workload-cost reduction, and container optimization. | High | SP008, SP009 |
| CP040 | Cast AI entered 2026 with more capital than many point competitors after a 2025 $108 million Series C and a January 2026 valuation above $1 billion. | High | SP004, SP005 |
| CI001 | Cast AI documentation positions the product as a platform for cost monitoring, optimization suggestions, autoscaling, spot automation, and bin packing. | Medium | SI002 |
| CI002 | The archived G2 product page shows a free Kubernetes cost monitoring tier. | Medium | SI015 |
| CI003 | Software Advice lists Cast AI pricing as starting at $200 per month. | Medium | SI016 |
| CI004 | Cast AI’s own pricing page does not publish a transparent enterprise rate card and instead points toward a sales-led process. | Medium | SI001 |
| CI005 | The public monetization evidence is most consistent with a free-to-paid software motion in which monitoring leads into paid automation. | Medium | SI001, SI015, SI016 |
| CI006 | Customer savings case studies imply that Cast AI sells on measurable ROI rather than only on generic infrastructure tooling. | Medium | SI004, SI005, SI006 |
| CI007 | Cast AI says the business began with Kubernetes automation and expanded into broader cloud and AI workload efficiency. | High | SI007, SI008 |
| CI008 | Cast AI publicly tied Hugging Face partnership activity to AI workload optimization on AWS and Google, signaling revenue adjacency beyond classic Kubernetes cost management. | Medium | SI027 |
| CI009 | The 2026 OMNI Compute launch introduced a control plane for provisioning GPUs and external compute capacity across clouds and regions. | High | SI008, SI009 |
| CI010 | CloudZero’s 2026 comparison says Cast AI expanded into GPU optimization, LLM cost management, and database optimization beyond its original scope. | Medium | SI026 |
| CI011 | Cast AI’s public case-study set spans multiple industries and workload types, including analytics, logistics, mobile attribution, education AI, content delivery, and automotive software. | Medium | SI003, SI004, SI005, SI006, SI028, SI029, SI030 |
| CI012 | The NielsenIQ case study says Cast AI helped cut cloud costs by up to 80 percent. | Medium | SI004 |
| CI013 | The project44 case study says Cast AI drove 50 percent savings on GKE in one month. | Medium | SI005 |
| CI014 | The Branch case study says Cast AI helped save several million dollars annually in AWS cloud spend. | Medium | SI006 |
| CI015 | Unicorns Lithuania reported that Cast AI doubled its customer base between 2023 and 2024 and was trusted by over 2,100 organizations by the Series C announcement. | Medium | SI012 |
| CI016 | Cast AI said G2’s Spring 2026 reports awarded it 20 badges across 36 reports, indicating meaningful customer-review scale and market presence. | Medium | SI022 |
| CI017 | Reuters-linked coverage said Cast AI saw major acceleration in demand for Kubernetes automation as AI adoption surged. | High | SI017, SI018 |
| CI018 | Reuters-linked reporting said Cast AI’s total funding exceeded $180 million after the April 2025 Series C. | High | SI017, SI018 |
| CI019 | TechCrunch said the 2025 Series C proceeds would fund more R&D and expansion in core markets such as the U.S. and elsewhere. | Medium | SI010 |
| CI020 | Cota Capital described the 2025 round as a reflection of rapid revenue growth and surging demand for Cast AI’s platform. | Medium | SI013 |
| CI021 | Cast AI’s 2025 Series C was oversubscribed and led by G2 Venture Partners and SoftBank Vision Fund 2. | High | SI007, SI010 |
| CI022 | The January 2026 Pacific Alliance Ventures event pushed Cast AI’s valuation above $1 billion, but public materials did not disclose the amount invested. | High | SI008, SI009 |
| CI023 | No reviewed public source disclosed Cast AI’s ARR or annual revenue run rate. | Medium | SI007, SI008, SI010, SI017 |
| CI024 | No reviewed public source disclosed Cast AI’s gross margin or cost-of-service profile. | Medium | SI007, SI008, SI010, SI017 |
| CI025 | No reviewed public source disclosed Cast AI’s cash balance, burn rate, runway, or debt obligations. | Medium | SI007, SI008, SI009, SI017 |
| CI026 | No reviewed public source disclosed Cast AI’s net revenue retention, churn, or CAC payback. | Medium | SI007, SI010, SI017 |
| CI027 | The combination of free monitoring, paid automation language, and savings-based case studies suggests Cast AI monetizes value capture rather than simple seat counts alone. | Medium | SI001, SI004, SI005, SI006, SI015, SI016 |
| CI028 | The free tier plus a low visible starting price create a low-friction entry motion that could support efficient product-led proof before heavier enterprise expansion. | Medium | SI015, SI016 |
| CI029 | The move into OMNI Compute and GPU orchestration may make Cast AI’s cost structure less purely SaaS-like, but public sources do not disclose the economics of that shift. | Medium | SI008, SI009, SI026 |
| CI030 | IBM, Datadog, and NetApp each had current Form 10-K filings publicly available from the SEC during the run. | High | SI019, SI020, SI021 |
| CI031 | Compared with public infrastructure software comparables that file audited financial statements, Cast AI’s public disclosure is materially thinner and harder to underwrite. | Medium | SI019, SI020, SI021, SI023, SI017 |
| CI032 | Revenue quality looks directionally promising because Cast AI’s public proof points are tied to measurable customer savings and operational outcomes. | Medium | SI004, SI005, SI006, SI017 |
| CI033 | Underwriting remains incomplete because the public record does not reveal whether Cast AI’s realized contracts are subscription, percentage-of-savings, or hybrid. | Medium | SI001, SI015, SI016 |
| CI034 | Capital adequacy appears strong in the near term because Cast AI added a $108 million Series C in 2025 and crossed a $1 billion valuation in 2026. | Medium | SI007, SI009, SI017 |
| CI035 | The exact size of the 2026 strategic investment and resulting cash balance remain undisclosed, which prevents a clean runway estimate. | Medium | SI008, SI009 |
| CI036 | Cast AI’s financial public file relies on proxy indicators such as customer count, review volume, funding milestones, and savings case studies rather than audited operating metrics. | Medium | SI012, SI015, SI017, SI022 |
| CI037 | Sales efficiency is probably helped by free monitoring and fast public time-to-value stories, but it cannot be quantified without CAC and cohort data. | Medium | SI005, SI015, SI016 |
| CI038 | Cast AI should be treated financially as a growth software business with strong customer ROI and meaningful upside, but with unresolved retention and margin questions. | Low | SI017, SI022, SI021 |
| CI039 | The most important diligence blockers are ARR, gross margin, burn and runway, NRR, customer concentration, and GPU monetization mix. | Medium | SI023, SI024, SI021 |
| CE001 | Cast AI documentation describes the platform as an all-in-one Kubernetes automation, optimization, security, and cost-management product. | High | SE001, SE002 |
| CE002 | Cluster hibernation scales a Kubernetes cluster to zero nodes while preserving the control plane and cluster state. | Medium | SE003 |
| CE003 | Cast AI says hibernation can be managed through the console, API, and Terraform with both manual and scheduled automation. | Medium | SE003 |
| CE004 | When a hibernated cluster resumes, Cast AI relies on resume nodes and system-cluster-critical priority so essential components are scheduled first. | Medium | SE003 |
| CE005 | OMNI is explicitly documented as an early-access feature that should be tested in non-production environments first. | Medium | SE004 |
| CE006 | OMNI extends Kubernetes clusters to additional regions and cloud providers so Cast AI’s autoscaler can choose the most cost-effective location based on pricing and availability. | Medium | SE004 |
| CE007 | Cast AI says the primary use case for OMNI is unlocking GPU capacity when the main cluster region lacks supply. | Medium | SE004 |
| CE008 | The GPU optimization product page says AI teams can operate scarce GPU and compute capacity across clouds and regions without refactoring applications. | Medium | SE005 |
| CE009 | Cast AI’s GPU product materials emphasize sharing and partitioning GPUs, bin packing, and Dynamic Resource Allocation to improve throughput. | Medium | SE005 |
| CE010 | Cast AI’s 2026 optimization report said average GPU utilization was 5 percent, average CPU utilization 8 percent, and average memory utilization 20 percent in non-optimized clusters. | Medium | SE009 |
| CE011 | Cast AI publicly launched AI Enabler as a product surface for optimizing LLM deployment and automating model selection. | Medium | SE006 |
| CE012 | The Hugging Face partnership shows Cast AI positioning its platform as a way to optimize AI workloads on AWS and Google. | Medium | SE007 |
| CE013 | Business Wire and SDxCentral described the 2026 OMNI launch as a multicloud GPU marketplace or platform that makes GPUs fungible across clouds. | High | SE020, SE021 |
| CE014 | Cast AI maintains a public Terraform provider for the platform on GitHub. | High | SE011, SE012 |
| CE015 | The public autoscaler resource documentation exposes cluster limits, node-downscaler settings, and evictor behavior as code. | Medium | SE011 |
| CE016 | DeepWiki mirrors and summarizes the Cast AI Terraform provider docs, which is a sign of external developer reuse of the IaC surface. | Medium | SE023 |
| CE017 | The AWS Marketplace listing positions Cast AI as fully automated cost optimization and monitoring for EKS and includes recent review text about cost reduction and GPU-aware optimization. | Medium | SE013 |
| CE018 | A Dev.to step-by-step EKS integration guide shows Cast AI can be implemented by practitioners outside the company’s official docs. | Medium | SE022 |
| CE019 | Mercedes-Benz.io’s engineering blog describes using Cast AI for dynamic workload-aware autoscaling, smart eviction, and runtime bin packing under zero-downtime constraints. | Medium | SE014 |
| CE020 | The Mercedes-Benz.io case study says Cast AI reduced Kubernetes operational overhead and costs using automation. | Medium | SE015 |
| CE021 | The ALLEN Digital case study says Kimchi Inference increased GPU utilization and saved 71 percent on LLM costs. | Medium | SE016 |
| CE022 | The Akamai case study highlights bin packing, cost-efficient instance selection, Spot automation, and deep Kubernetes cost analytics as part of the product workflow. | Medium | SE017 |
| CE023 | The project44 case study says Cast AI delivered 50 percent savings on GKE in one month. | Medium | SE018 |
| CE024 | The Branch case study says Cast AI saved several million dollars annually in AWS cloud spend. | Medium | SE019 |
| CE025 | StatusGator and IsDown provide public incident or outage surfaces for Cast AI, showing the platform is observable to operators rather than invisible when degraded. | Medium | SE010, SE024 |
| CE026 | Kvisor is documented as an open-source security agent that runs as both a controller and an agent inside Kubernetes clusters. | High | SE027, SE031 |
| CE027 | Kvisor provides image scanning, runtime security monitoring, and network observability according to the Cast docs. | Medium | SE027 |
| CE028 | Kvisor behavior is configurable through Helm, including scan frequencies and specialized feature toggles. | Medium | SE028 |
| CE029 | The security dashboard docs describe centralized Kubernetes security posture and CIS-compliance visibility across clusters. | Medium | SE029 |
| CE030 | Cast AI announced CIS Benchmark certification for its Security Report across AWS, Azure, and GCP managed Kubernetes environments. | Medium | SE030 |
| CE031 | The public Kvisor repository is available on GitHub and states an Apache 2.0 license. | Medium | SE031 |
| CE032 | Multiple Kvisor and security docs warn that the Kubernetes Security feature set is undergoing significant changes and that some features are being deprecated or moved. | High | SE027, SE028, SE029 |
| CE033 | Core autoscaling appears more mature than OMNI and Kvisor because the former is supported by extensive IaC surfaces and customer cases, while OMNI is early access and security docs are in transition. | Medium | SE003, SE004, SE011, SE014, SE027 |
| CE034 | Cast AI’s technical moat centers on combining cross-cloud autoscaling, runtime bin packing, GPU orchestration, and infrastructure-as-code controls in one platform. | Medium | SE004, SE005, SE011, SE014, SE017 |
| CE035 | The product depends on correct permissions, provider price and availability data, Kubernetes scheduling behavior, and scarce GPU supply. | Medium | SE003, SE004, SE014, SE021 |
| CE036 | Cast AI exposes console, API, Terraform, and Helm control surfaces, which fits standard platform-engineering workflows. | Medium | SE003, SE011, SE022, SE028 |
| CE037 | Cast AI says GPU capacity can be added across clouds and regions without code changes to the workload. | High | SE004, SE005 |
| CE038 | Technical risk remains because dynamic autoscaling, smart eviction, and resume sequencing can create reliability challenges if policies or dependencies are wrong. | Medium | SE003, SE014 |
| CE039 | The product workflow begins with onboarding and policy definition, then moves into continuous optimization, observability, and optional AI/GPU expansion. | Medium | SE003, SE004, SE011, SE017 |
| CE040 | The reviewed source set supports treating Cast AI as an execution layer or control loop rather than a passive reporting layer. | Medium | SE002, SE003, SE011, SE014, SE017 |
| CU001 | Cast AI’s case-study hub presents the product as something companies use to cut cloud costs, improve performance, and boost DevOps productivity. | Medium | SU001 |
| CU002 | Unicorns Lithuania reported that Cast AI doubled its customer base between 2023 and 2024. | Medium | SU024 |
| CU003 | The same April 2025 reporting said Cast AI was trusted by over 2,100 organizations. | Medium | SU024 |
| CU004 | Current public materials name BMW, Cisco, FICO, Hugging Face, Swisscom, and Akamai among Cast AI customers or reference logos. | Medium | SU008, SU009, SU023 |
| CU005 | BMW Group is a global automotive manufacturer and enterprise-scale digital operator. | Medium | SU011 |
| CU006 | Cisco positions itself around AI infrastructure, secure networking, and software solutions. | Medium | SU012 |
| CU007 | FICO positions itself as an applied-intelligence company focused on customer connections and decisioning. | Medium | SU013 |
| CU008 | Swisscom’s public about page frames it as a telecom and communications incumbent. | Medium | SU014 |
| CU009 | Akamai describes itself as a cloud-computing, security, and content-delivery company. | Medium | SU015 |
| CU010 | NielsenIQ is a global analytics and audience-intelligence company. | Medium | SU016 |
| CU011 | project44 positions itself as a decision-intelligence platform for logistics and supply chains. | Medium | SU017 |
| CU012 | Branch positions itself as a mobile measurement and deep-linking platform. | Medium | SU018 |
| CU013 | Mercedes-Benz.io builds software and digital platforms for Mercedes-Benz. | Medium | SU019 |
| CU014 | Hugging Face publicly presents itself as an AI community and platform, making it a technically credible AI-infrastructure reference. | Medium | SU020 |
| CU015 | ALLEN Digital’s case study describes an AI-powered education platform using GPU-heavy machine learning models. | Medium | SU006 |
| CU016 | The NielsenIQ case study says Cast AI helped cut cloud costs by up to 80 percent. | Medium | SU002 |
| CU017 | The project44 case study says Cast AI delivered 50 percent GKE savings in one month. | Medium | SU003 |
| CU018 | The Branch case study says Cast AI generated several million dollars of annual AWS savings. | Medium | SU004 |
| CU019 | Akamai’s case study shows Cast AI deployed on a large, complex infrastructure environment with strict SLAs and feature use cases such as bin packing and Spot automation. | Medium | SU005 |
| CU020 | The Mercedes-Benz.io case study says Cast AI reduced Kubernetes operational overhead and costs using automation. | Medium | SU007 |
| CU021 | The ALLEN Digital case study says Cast AI’s Kimchi Inference cut LLM costs by 71 percent. | Medium | SU006 |
| CU022 | The Hugging Face partnership says Cast AI optimized customer LLMs on automatically optimized Kubernetes clusters across AWS and Google. | Medium | SU008 |
| CU023 | The public customer mix spans automotive, networking, analytics, telecom, AI, logistics, mobile marketing, and education AI. | Medium | SU011, SU012, SU013, SU014, SU015, SU016, SU017, SU018, SU019, SU020 |
| CU024 | Public customer proof is uneven: some logos have deep quantified case studies while others are referenced mainly as logos or named customers. | Medium | SU001, SU005, SU007, SU008, SU009, SU023 |
| CU025 | Cast AI’s public materials consistently present the company as trusted by more than 2,100 organizations globally. | Medium | SU009, SU024 |
| CU026 | Cast AI’s G2 Spring 2026 release reported 20 badges across 36 reports, suggesting broad market presence and customer-review activity. | Medium | SU010 |
| CU027 | The archived G2 product page shows a large review base for Cast AI. | Medium | SU022 |
| CU028 | Cybernews says onboarding can be confusing because of IAM permissions and documentation clarity. | Medium | SU021 |
| CU029 | No reviewed public source disclosed retention, NRR, or churn for Cast AI’s customer base. | Medium | SU009, SU010, SU021, SU022 |
| CU030 | No reviewed public source disclosed customer concentration or identified any customer contributing a material share of revenue. | Medium | SU009, SU023, SU024 |
| CU031 | SEC disclosure guidance illustrates why material customer concentration would matter for a public issuer, even though Cast AI as a private company does not have to disclose it publicly. | Medium | SU025 |
| CU032 | The visible customer base suggests Cast AI fits mid-market and enterprise accounts with meaningful Kubernetes or AI spend rather than small casual users. | Medium | SU002, SU003, SU005, SU008, SU011, SU012, SU023 |
| CU033 | Customer evidence is strongest in cloud-native, platform-engineering-led, or AI-heavy environments where cost and reliability must be managed together. | Medium | SU003, SU005, SU006, SU008, SU026 |
| CU034 | Across public case studies, Cast AI’s customer outcomes cluster around cost reduction, improved performance, and lower engineering toil. | Medium | SU001, SU002, SU003, SU004, SU005, SU007 |
| CU035 | The highest-depth public reference set consists of NielsenIQ, project44, Branch, ALLEN Digital, Hugging Face, Akamai, and Mercedes-Benz.io. | Medium | SU002, SU003, SU004, SU005, SU006, SU007, SU008 |
| CU036 | Having multiple quantified case studies across sectors makes Cast AI’s public customer proof stronger than a simple logo wall. | Medium | SU001, SU002, SU003, SU004, SU006, SU007 |
| CU037 | AI and GPU-oriented customer references such as Hugging Face and ALLEN Digital suggest real expansion potential beyond classic Kubernetes cost optimization. | Medium | SU006, SU008 |
| CU038 | Most of the strongest customer narratives are still vendor-authored rather than independent customer-authored ROI disclosures. | Medium | SU001, SU005, SU007, SU008, SU021, SU022 |
| CU039 | The most important unresolved customer question is whether one or two large logos account for a disproportionate share of revenue or reference value. | Low | |
| CU040 | The public case-study set points to customers that are heavily exposed to public-cloud, Kubernetes, or AI workload complexity rather than generic IT users. | Medium | SU002, SU003, SU005, SU006, SU008 |
| CR001 | Cast AI’s Terms of Service are effective February 6, 2025. | Medium | SR001 |
| CR002 | The Terms of Service make customer onboarding and order forms part of a legally binding agreement governing service use. | Medium | SR001 |
| CR003 | Cast AI’s privacy policy splits controller responsibility between the U.S. entity and Cast AI Baltic UAB in Lithuania depending on customer geography. | Medium | SR002 |
| CR004 | The privacy policy says Cast acts as a processor for information customers upload to the Cast AI cloud services. | Medium | SR002 |
| CR005 | Cast AI’s DPA supplements the Terms of Service and references GDPR, CCPA, and other data-protection laws. | Medium | SR003 |
| CR006 | Cast AI’s information security policy says the company has achieved ISO 27001 certification. | Medium | SR004 |
| CR007 | The information security policy names compliance, risk appetite, and incident detection and resolution as core information-security goals. | Medium | SR004 |
| CR008 | Cast AI’s SOC 2 blog says the company passed an independent SOC 2 Type II examination. | Medium | SR005 |
| CR009 | The platform-permissions docs say Cast AI components require explicit permissions, port openings, and data collection access. | Medium | SR006 |
| CR010 | The database-optimizer security docs say Cast anonymizes query SQL at ingestion and hashes query parameters so no PII is stored. | Medium | SR007 |
| CR011 | Kvisor is documented as an open-source security agent that runs as both a Kubernetes controller and an agent. | High | SR008, SR025 |
| CR012 | Kvisor provides image scanning, runtime security monitoring, and network observability according to the docs. | Medium | SR008 |
| CR013 | Kvisor supports Helm-based configuration of scan intervals and feature toggles. | Medium | SR009 |
| CR014 | The Security dashboard docs describe centralized CIS-compliance and security-posture visibility across clusters. | Medium | SR010 |
| CR015 | Cast AI announced that its Security Report was awarded CIS Benchmark certification, and the CIS partner page independently lists multiple certified products. | High | SR011, SR012 |
| CR016 | StatusGator and IsDown provide public outage tracking for Cast AI. | Medium | SR013, SR014 |
| CR017 | Cybernews flags setup complexity, IAM friction, and reporting limitations as real user-facing drawbacks. | Medium | SR015 |
| CR018 | Cast AI’s about page says the platform is trusted by 2,100-plus companies globally. | Medium | SR017 |
| CR019 | Cast AI’s careers page emphasizes learning fast, ownership, and hiring the best, indicating continued hiring intensity and execution pressure. | Medium | SR018 |
| CR020 | The January 2026 strategic investment came from Pacific Alliance Ventures, the U.S. venture arm of Shinsegae Group. | High | SR019, SR020 |
| CR021 | Cast AI’s 2026 optimization report says GPU, CPU, and memory utilization remain low in non-optimized clusters, showing that the company operates around high-value but underutilized infrastructure. | Medium | SR022 |
| CR022 | Legal risk is elevated because access, onboarding, and service use are governed through a binding terms-and-order-form structure. | Medium | SR001 |
| CR023 | Privacy and compliance risk is elevated because Cast must honor processor obligations while customers may fall under GDPR or U.S. privacy regimes. | Medium | SR002, SR003 |
| CR024 | ISO 27001, SOC 2 Type II, CIS certification, and documented security controls materially mitigate but do not eliminate trust risk. | Medium | SR004, SR005, SR011, SR012 |
| CR025 | Operational risk is high because Cast depends on correct permissions, network access, data flows, and stable feature behavior inside customer infrastructure. | Medium | SR006, SR008, SR009, SR021 |
| CR026 | Dependency risk is high because Cast’s value proposition depends on upstream cloud-provider behavior, available capacity, and external status conditions it does not control. | Medium | SR013, SR014, SR019, SR020, SR022 |
| CR027 | People risk exists because a fast-growing platform company must continue hiring and retaining scarce security and platform talent while shipping quickly. | Medium | SR018, SR021 |
| CR028 | Cast AI’s mitigation stack includes contractual privacy controls, information-security policy, SOC 2, CIS-certified reporting, security dashboarding, and open-source security tooling. | Medium | SR001, SR003, SR004, SR005, SR010, SR011, SR012, SR025 |
| CR029 | A key kill criterion is whether Cast can clearly map required permissions, data flows, and incident responsibilities for a buyer’s exact deployment shape. | Medium | SR006, SR007, SR009 |
| CR030 | Regulatory and legal risk is likely highest in enterprise accounts subject to strict privacy, audit, or CIS-style security controls. | Medium | SR003, SR010, SR011, SR012 |
| CR031 | Operational risk is especially important because external outage services show Cast incidents are visible to engineering teams and can influence trust. | Medium | SR013, SR014 |
| CR032 | Cloud and GPU dependencies are likely the most material partner-like risk because OMNI and optimization outcomes are constrained by upstream provider behavior and capacity. | Medium | SR019, SR020, SR022 |
| CR033 | The security docs themselves reveal change-management risk because they explicitly say some features are being deprecated or moved. | High | SR008, SR009, SR010, SR021 |
| CR034 | Public policies and docs reduce opacity but do not substitute for customer-specific audit evidence or incident data. | Medium | SR001, SR004, SR005, SR011 |
| CR035 | Founder and executive security pedigree, highlighted in the SOC 2 announcement, provides some mitigation against pure execution risk. | Medium | SR005 |
| CR036 | Fast-growth culture and broad hiring needs can create documentation and support strain even when the underlying technology is strong. | Medium | SR017, SR018 |
| CR037 | The database-optimizer and permissions docs show Cast is attempting data minimization and explicit configuration guidance as mitigants to telemetry and access risk. | Medium | SR006, SR007 |
| CR038 | Dependency risk is amplified because a product that automates customer infrastructure may still be blamed for upstream provider failures or scarce GPU supply. | Medium | SR013, SR014, SR019, SR020 |
| CR039 | Because external services track outages and customer-review sites flag onboarding issues, operational failures can quickly become reputational problems. | Medium | SR013, SR014, SR015 |
| CR040 | Overall risk appears medium: Cast has credible mitigants, but execution, dependency, and trust risks remain material because the product operates so deeply inside customer environments. | Medium | SR004, SR005, SR011, SR014, SR015, SR018 |
| CR041 | External legal commentary shows that state and cross-jurisdiction AI rules are proliferating in 2026, increasing documentation and governance expectations for AI-adjacent products. | Medium | SR026, SR028, SR029, SR030 |
| CR042 | External compliance commentary says buyers increasingly map SOC 2 controls to CIS-style controls, which raises the bar for concrete control evidence beyond headline certifications. | Medium | SR027 |
| CR043 | As Cast AI expands AI and GPU-oriented tooling, tightening AI-governance expectations could indirectly increase its compliance and diligence burden even if the core platform is infrastructure software. | Medium | SR019, SR026, SR028, SR029, SR030 |
| CV001 | Cast AI said in January 2026 that its valuation exceeded $1 billion. | High | SV001, SV002 |
| CV002 | Business Wire said the January 2026 valuation milestone coincided with a strategic investment from Pacific Alliance Ventures, the U.S. venture arm of Shinsegae Group. | Medium | SV002 |
| CV003 | TechCrunch described the April 2025 Series C as a near-unicorn round close to $900 million post-money. | Medium | SV003 |
| CV004 | Reuters-linked reporting carried by Yahoo Finance and MarketScreener said the April 2025 round valued Cast AI at roughly $850 million. | High | SV004, SV005 |
| CV005 | Neither Cast AI nor Business Wire publicly disclosed the amount of the January 2026 strategic investment. | Medium | SV001, SV002 |
| CV006 | Tech.eu described Cast AI as Lithuania’s fifth unicorn after the January 2026 milestone. | Medium | SV022 |
| CV007 | Cast AI’s 2026 optimization report said average GPU utilization was 5 percent, CPU utilization 8 percent, and memory utilization 20 percent in non-optimized clusters. | High | SV014, SV024 |
| CV008 | CloudZero’s 2026 comparison said Cast AI had expanded beyond its original Kubernetes scope into GPU optimization, LLM cost management, and database optimization. | Medium | SV015 |
| CV009 | The FinOps Foundation frames usage optimization as a formal capability, which supports the category relevance of cloud-optimization software. | Medium | SV016 |
| CV010 | SaasRise reported that AI-native software commanded a median 21.2x EV/revenue in VC rounds and 11.5x in M&A buyouts in Q1 2026. | Medium | SV006 |
| CV011 | Windsor Drake’s Q2 2026 research placed AI-native application software near an 11x EV/revenue benchmark and foundation-model labs at 15x to 30x. | Medium | SV009 |
| CV012 | Multiples.vc said public software investors in 2026 increasingly reward AI application, technical complexity, market position, and specialization depth. | Medium | SV010 |
| CV013 | PublicComps highlights EV/NTM revenue, retention, ACV, analyst estimates, and historicals as core software-benchmarking inputs. | Medium | SV007 |
| CV014 | IBM, Datadog, NetApp, Cloudflare, Dynatrace, DigitalOcean, MongoDB, and Snowflake all had current SEC 10-K filings available during the run. | High | SV011, SV012, SV013, SV017, SV018, SV019, SV020, SV021 |
| CV015 | No reviewed public source disclosed Cast AI’s ARR, gross margin, or NRR, making any multiple-based valuation inherently approximate. | Medium | SV001, SV003, SV004, SV015 |
| CV016 | At an 11.0x EV/revenue multiple, a $1.0B enterprise value implies roughly $90.9M of ARR. | Medium | SV009 |
| CV017 | At a 21.2x EV/revenue multiple, a $1.0B enterprise value implies roughly $47.2M of ARR. | Medium | SV006 |
| CV018 | At a 5.5x EV/revenue multiple, a $1.0B enterprise value implies roughly $181.8M of ARR. | Medium | SV006 |
| CV019 | At a 3.8x EV/revenue multiple, a $1.0B enterprise value implies roughly $263.2M of ARR. | Medium | SV006 |
| CV020 | Because the January 2026 round amount is undisclosed, the exact post-money valuation quality and dilution cannot be assessed from public sources. | Medium | SV001, SV002, SV026 |
| CV021 | Premier Alternatives shows a $1.0B valuation and $180.8M total funding but also says complete funding history has not been imported, underscoring third-party data uncertainty. | Low | SV026 |
| CV022 | The bull thesis is that Cast AI should be treated as AI-native infrastructure automation because of multicloud GPU orchestration, customer proof, and strong enterprise demand. | Medium | SV001, SV014, SV015, SV025 |
| CV023 | The anti-thesis is that Cast may ultimately be priced more like cloud software or optimization tooling if native-cloud substitutes compress willingness to pay. | Medium | SV015, SV016 |
| CV024 | A fair-valuation stance is more defensible than a cheap-valuation stance because the current public file is strong on traction but weak on core underwriting metrics. | Medium | SV001, SV003, SV004, SV015, SV025 |
| CV025 | The public comp set should include both premium observability / developer-infrastructure vendors and more mixed infrastructure software names because no perfect public analog exists. | Medium | SV011, SV012, SV013, SV017, SV018, SV019, SV020, SV021 |
| CV026 | The recommended action is to track rather than aggressively underwrite the current valuation. | Medium | SV015, SV025, SV026 |
| CV027 | The most defensible valuation stance today is fair rather than clearly cheap or obviously inflated. | Medium | SV006, SV009, SV015 |
| CV028 | Return potential depends heavily on whether AI and GPU features produce meaningful incremental revenue without undermining software-like economics. | Medium | SV001, SV014, SV015 |
| CV029 | A core thesis-break is if customer retention, gross margin, or module attach-rate data fails to support premium AI-native software comparables. | Medium | SV006, SV009, SV015 |
| CV030 | The final diligence asks should center on ARR, gross margin, NRR, customer concentration, contract structure, and the exact terms of the 2026 round. | Medium | SV001, SV002, SV004, SV015 |
| CV031 | If Cast AI’s ARR were only around $50M, the $1B mark would require a very rich AI-native multiple. | Medium | SV006, SV009 |
| CV032 | If Cast AI’s ARR were around $100M, the unicorn valuation would look more reasonable against premium public AI-software benchmarks. | Medium | SV006, SV009 |
| CV033 | Customer scale above 2,100 organizations and a strong optimization gap support a premium narrative, but they do not by themselves determine fair value. | Medium | SV014, SV025 |
| CV034 | Valuation dispersion is wide because public comparables span both high-quality software profiles and more mixed infrastructure businesses. | Medium | SV010, SV011, SV012, SV013, SV017, SV018, SV019, SV020, SV021 |
| CV035 | The strongest skeptical point is not operational failure but opacity: undisclosed round amount, missing ARR, and incomplete third-party funding data. | Medium | SV002, SV015, SV026 |
| CV036 | TechCrunch and Reuters-linked coverage show strong market confidence and demand momentum, but not audited economics. | Medium | SV003, SV004, SV005 |
| CV037 | 2026 multiple context shows premium AI valuations are increasingly conditional on demonstrated revenue rather than narrative alone. | Medium | SV009, SV010 |
| CV038 | Any public valuation model for Cast AI is illustrative rather than underwritten because revenue and profitability are not disclosed. | Medium | SV006, SV009, SV015 |
| CV039 | A key thesis-break would be evidence that native-cloud competition or module economics push Cast into lower comp buckets closer to legacy software. | Medium | SV015, SV016 |
| CV040 | Another thesis-break would be weak retention or concentration data that turns strong logo proof into fragile economic quality. | Medium | SV015, SV025 |
| CV041 | Third-party private-company data services disagree or remain incomplete enough that they should not be treated as authoritative on Cast AI’s full funding history. | Low | SV026 |
| CV042 | The overall investment judgment is track with medium confidence and a fair valuation stance. | Medium | SV006, SV009, SV015, SV025, SV026 |
| ID | Publisher | Title | Quote |
|---|---|---|---|
| SO001 | Cast AI | Kubernetes Optimization Platform for Performance - Cast AI | Kubernetes Optimization Platform for Performance |
| SO002 | Cast AI | About Cast AI - Company, Team & Leadership | Trusted by 2100+ companies globally |
| SO003 | Cast AI | Cast AI News & Press Releases | Cast AI News & Press Releases |
| SO004 | Cast AI | Cast AI Valued at Over $1 Billion With the Launch of its GPU Marketplace | With this round of funding, Cast AI’s valuation exceeds $1 billion. |
| SO005 | Cast AI | Cast AI Raises $108M to Lead Application Performance Automation | we closed an oversubscribed $108 million Series C round led by G2 Venture Partners and SoftBank Vision Fund 2 |
| SO006 | Cast AI | Hugging Face partners with CAST AI to Optimize AI Workloads | CAST AI’s workload optimization for intensive CPU and GPU workloads reduces the cost of running AI |
| SO007 | Cast AI | CAST AI Launches AI Enabler to Optimize LLM Deployment and Automate Model Selection | CAST AI, the leading Kubernetes automation platform, today announced the launch of AI Enabler |
| SO008 | Cast AI | Cast AI Included in the Futuriom 50 List of Top Cloud and AI Infrastructure Companies | Cast AI, the leading automation platform, today announced it has been named to the Futuriom 50 list |
| SO009 | Cast AI | Cast AI Case Studies: Real Kubernetes Cost Savings & Automation | Cast AI Case Studies: Real Kubernetes Cost Savings & Automation |
| SO010 | Cast AI | NielsenIQ case study | By implementing Cast, NielsenIQ generated 60–80% cost savings on their non-production deployments and 40–50% savings for production clusters. |
| SO011 | Cast AI | project44 case study | By implementing Cast, project44 saw 50% of cost reduction on compute costs within the initial rollout cluster during the first month. |
| SO012 | Cast AI | Branch case study | The result for Branch was to eliminate the upfront spend of several million dollars per year on Savings Plans ... while saving millions of dollars of Cloud OpEx spend. |
| SO013 | Cast AI | Cast AI companies spend three times more than they should on cloud costs | companies overspend by 60 percent due to overprovisioning containerized applications |
| SO014 | Business Wire | Cast AI Valued at Over $1 Billion With the Launch of Its GPU Marketplace | The company also announced a strategic investment from Pacific Alliance Ventures (PAV) ... With this round of funding, Cast AI’s valuation exceeds $1 billion. |
| SO015 | SiliconANGLE | Cast AI raises funds from Pacific Alliance Ventures at $1B valuation to launch unified GPU marketplace | Cast AI Group Inc. ... raised new funding from Pacific Alliance Ventures, surpassing a $1 billion valuation, to launch a unified cloud graphics processing unit marketplace. |
| SO016 | Tech Funding News | Inside Lithuania’s fifth unicorn: How Cast AI redefined global AI infrastructure | Cast AI has grown into a distributed organisation of more than 300 employees across 34 countries. |
| SO017 | AIN | Cast AI becomes Lithuania’s 5th unicorn | Cast AI is a Miami and Vilnius-based Application Performance Automation platform |
| SO018 | TechCrunch | Cast AI raises $108M to get the most out of AI, Kubernetes, and other workloads | The company has raised a $108 million Series C ... the round has the company at “near unicorn” valuation, post-money. |
| SO019 | SiliconANGLE | Cloud optimization startup Cast AI raises $108 million to achieve almost unicorn valuation | Cloud optimization startup Cast AI raises $108 million to achieve almost unicorn valuation |
| SO020 | Cota Capital | Founders Spotlight: Laurent Gil, Leon Kuperman, Yuri Frayman | Laurent Gil, Leon Kuperman, and Yuri Frayman co-founded the company in 2019 after experiencing firsthand the challenges of managing cloud costs at scale during their previous venture, Zenedge |
| SO021 | StatusGator | CAST AI Status. Check if CAST AI is down or having an outage. | StatusGator reports that CAST AI is currently experiencing a partial outage. Intermittent Azure AKS Node Provisioning Failures in Select Regions |
| SO022 | Cybernews | Cast AI Review 2026: Can This AI Really Cut Cloud Costs by 50%? | Advanced setup and policies can be difficult for teams new to Kubernetes |
| SO023 | Marketscreener / Reuters | Cast AI secures $108 million funding to expand cloud automation | The oversubscribed round ... valued the company at $850 million, a person familiar with the matter said. |
| SO024 | Yahoo Finance / Reuters | Cast AI secures $108 million funding to expand cloud automation | This brings Cast AI's total funding to over $180 million |
| SO025 | Unicorns.lt | 2,100 Customers in 3 Years: Cast AI Closes a $108 Million Series C Round | Today, Cast is trusted by over 2,100 organizations, including Akamai, BMW, FICO, HuggingFace, NielsenIQ, and Swisscom. |
| SO026 | Cota Capital | A New Era of Cloud Automation: The Cast AI Growth Story | Over the last year, Cast has doubled its customer base. Today, more than 2,100 leading organizations across industries rely on its technology |
| SO027 | Tech Funding News | The next Lithuanian Unicorn? Cast AI grabs $108M at $850M valuation. | While officially headquartered in Miami, Cast AI’s core operations are based in Vilnius |
| SO028 | BalticVC | Lithuanian Cast AI gets unicorn status | The company was founded in Lithuania in 2019 by Yuri Frayman, Laurent Gil, and Leon Kuperman. |
| SO029 | Cast AI | Cast AI News & Press Releases | Cast AI News & Press Releases |
| SM001 | Cast AI | 2026 State of Kubernetes Resource Optimization: CPU at 8%, Memory at 20%, and Getting Worse | CPU utilization fell to 8% ... Memory dropped from 23% to 20% ... GPU utilization ... stood at just 5%. |
| SM002 | Cast AI | 2026 State of Kubernetes Optimization Report | 2026 State of Kubernetes Optimization Report |
| SM003 | CNCF | FinOps for Kubernetes: engineering cost optimization | cost model often isn’t sufficient for anything but an informed starting point |
| SM004 | FinOps Foundation | FinOps Foundation - What is FinOps? | Cross-functional teams in Engineering, Finance, Product, etc. work together to enable faster product delivery, while at the same time gaining more financial control and predictability |
| SM005 | CNCF | Cloud Native and Kubernetes FinOps Microsurvey | Half ... said they are spending up to a quarter of their budget on Kubernetes |
| SM006 | IDC / Flexera | Going for the Gold with FinOps Forward and AI | Worldwide Intelligent CloudOps Software Revenue ... Total: 23.4 ... Total: 45.0 |
| SM007 | Flexera | Intelligent Kubernetes container and infrastructure optimization | Intelligent Kubernetes container and infrastructure optimization |
| SM008 | The Business Research Company | Global Kubernetes Cost Management Market Report 2026 | Kubernetes Cost Management market size has reached to $1.75 billion in 2025 |
| SM009 | MarketsandMarkets | Cloud FinOps Market Report 2025-2030 | The cloud FinOps market is projected to reach USD 26.91 billion by 2030 from USD 14.88 billion in 2025 |
| SM010 | Verified Market Reports | Cloud Cost Management and Optimization Market 2026-2034 | Market Size (2026) USD 9.2 billion ... Forecast Year (2034) USD 35.4 billion |
| SM011 | Deloitte | The AI infrastructure reckoning: Optimizing compute strategy in the age of inference economics | While inference costs have plummeted, dropping 280-fold over the last two years, enterprises are experiencing explosive growth in overall AI spending. |
| SM012 | Karpenter | Karpenter | Karpenter |
| SM013 | Google Cloud | GKE Autopilot overview | Google manages your infrastructure configuration, including your nodes, scaling, security, and other preconfigured settings. |
| SM014 | Microsoft Azure | Optimize Azure Kubernetes Service (AKS) usage and costs | This article provides guidance on ... automatic scaling, cluster right-sizing, GPU optimizations, multitenancy |
| SM015 | CNCF | Reports | Reports |
| SM016 | CoreWeave | The Essential Cloud for AI | Get the GPU compute you need for your AI workloads though a Kubernetes-native environment |
| SM017 | Deploybase | GPU Shortage 2026 - Availability, Allocation Timelines and Price Impact Analysis | Allocation timelines for B200 clusters approach 6-8 weeks for standard configurations. |
| SM018 | Business Research Insights | Cloud Cost Management and Optimization Market Report | Forecast [2026-2035] | The global cloud cost management and optimization market is valued at approximately USD 11.01 Billion in 2026 and is projected to reach USD 38.4 Billion by 2035. |
| SM019 | DataStackHub | Cloud Cost Statistics For 2025–2026 – Spending, Optimization & FinOps Trends | The average organization wastes 30% of its cloud budget on unused or misconfigured resources. |
| SM020 | Global Growth Insights | Kubernetes Solutions Market Size, Trends 2026-2035 | The Global Kubernetes Solutions Market was valued at USD 2,514.9 million in 2025 |
| SM021 | FinOps Foundation | Usage Optimization FinOps Framework Capability | Moving beyond traditional IaaS workloads ... resources ... are properly selected, correctly sized, only run when needed |
| SM022 | FinOps Foundation | FinOps Personas | Core Personas provide all of the organizational disciplines to successfully use cloud effectively. |
| SM023 | Google Cloud | Well-Architected Framework: Cost optimization pillar | The intended audience includes CTOs, CIOs, CFOs ... Architects, developers, administrators, and operators |
| SM024 | Red Hat | Cost management for Kubernetes on Red Hat OpenShift | Cost management should provide cost visibility across hybrid and multicloud environments. |
| SM025 | Microsoft | FinOps documentation - Cloud Computing | FinOps combines financial management principles with cloud engineering and operations |
| SM026 | IBM | What is cloud cost optimization? | Organizations waste about 32% of their spending on cloud services |
| SP001 | Cast AI | Kubernetes Optimization Platform for Performance - Cast AI | |
| SP002 | Cast AI | Cast AI Documentation | Getting Started | Cast AI Docs | The platform includes cost monitoring for real-time and longer-period cost reports at the cluster, namespace, and workload levels. It also offers cost optimization suggestions and automatic optimization using autoscaling, Spot Instance automation, bin packing, and other features. |
| SP003 | Cast AI | Cast AI Pricing for Automated Kubernetes Management – Cast AI | |
| SP004 | Cast AI | Cast AI Valued at Over $1 Billion With the Launch of its GPU Marketplace | |
| SP005 | TechCrunch | Cast AI raises $108M to get the most out of AI, Kubernetes, and other workloads | The company has raised a $108 million Series C. |
| SP006 | Apptio / IBM | IBM Kubecost - K8s Cost Monitoring - Apptio | IBM Kubecost helps teams continuously reduce the cost of operating Kubernetes with real-time visibility, allocation, optimization, and governance. |
| SP007 | IBM Newsroom | IBM Acquires Kubecost to Broaden Hybrid Cloud Cost Management Capabilities | Today, IBM is announcing the acquisition of Kubecost, a leading Kubernetes cost monitoring and optimization software company. |
| SP008 | Flexera | Kubernetes Container Optimization (FinOps) | Flexera | Ocean, Flexera’s container optimization solution, offers optimal Kubernetes infrastructure scaling while solving Day 2 challenges with enterprise-grade serverless container automation. |
| SP009 | Thoma Bravo | Flexera Completes Acquisition of NetApp's Spot Portfolio | Thoma Bravo | Flexera, the global leader in technology spend and risk management, today announced it has completed the acquisition of Spot from NetApp. |
| SP010 | StormForge | Automated Kubernetes Resource Management | |
| SP011 | StormForge | Optimize Live | Autonomous Kubernetes Rightsizing | Put estate-wide rightsizing on autopilot with hands-free lifecycle automation. |
| SP012 | CloudBolt | Kubernetes Rightsizing | StormForge by CloudBolt | CloudBolt | |
| SP013 | Kubex | Kubernetes Resource Optimization | |
| SP014 | Kubex | Kubernetes Resource Optimization | Optimizes Karpenter node autoscaling. |
| SP015 | IBM | Turbonomic | Application Resource Management (ARM) - IBM | Built for hybrid and multicloud complexity, IBM Turbonomic automates application resource management at scale. |
| SP016 | Karpenter | Karpenter | Karpenter automatically launches just the right compute resources to handle your cluster’s applications. |
| SP017 | Amazon Web Services | Karpenter - Amazon EKS | Karpenter automates provisioning and deprovisioning of nodes based on the specific scheduling needs of pods, allowing efficient scaling and cost optimization. |
| SP018 | Amazon Web Services | Workload Rightsizing - AWS Compute Optimizer - AWS | |
| SP019 | Google Cloud Documentation | GKE Autopilot overview | Google Kubernetes Engine (GKE) | GKE Autopilot is a mode of operation in GKE in which Google manages your infrastructure configuration, including your nodes, scaling, security, and other preconfigured settings. |
| SP020 | Google Cloud | Google Kubernetes Engine pricing | GKE includes fully automated cluster lifecycle management, pod and cluster autoscaling, cost visibility, automated infrastructure cost optimization, and multi-cluster management features at no extra cost. |
| SP021 | Microsoft Learn | Overview of Node Auto-Provisioning (NAP) in Azure Kubernetes Service (AKS) | NAP automatically deploys, configures, and manages Karpenter on your AKS clusters and is based on the open-source Karpenter and AKS Karpenter provider projects. |
| SP022 | Microsoft Learn | Use the Cluster Autoscaler in Azure Kubernetes Service (AKS) | The cluster autoscaler component watches for pods in your cluster that can’t be scheduled because of resource constraints and scales up the number of nodes in the node pool to meet application demands. |
| SP023 | OpenCost | OpenCost — open source cost monitoring for cloud native environments | OpenCost is a vendor-neutral open source project for measuring and allocating cloud infrastructure and container costs in real time. |
| SP024 | Cloud Native Computing Foundation | OpenCost | |
| SP025 | FinOps Foundation | Usage Optimization FinOps Framework Capability | Analyze and optimize resources across FinOps Scopes to match actual usage patterns, while ensuring that workloads operate efficiently, sustainably, and generate sufficient business value relative to their cost. |
| SP026 | Cybernews | Cast AI Review 2026: Can This AI Really Cut Cloud Costs by 50%? | The initial setup can be confusing for teams new to Kubernetes, especially regarding precise IAM permissions. |
| SP027 | G2 | The G2 on Cast AI | Filter 189 reviews by the users’ company size, role or industry to find out how Cast AI works for a business like yours. |
| SP028 | CloudZero | CAST AI vs Kubecost: Kubernetes Cost Tools Compared (2026) | Kubecost was acquired by IBM and is now part of the Apptio product family, while Cast AI has expanded into GPU optimization, LLM cost management, and database optimization beyond its original Kubernetes focus. |
| SP029 | nOps | Cast AI Alternatives: 11 Best Kubernetes Cost Optimization Tools | Because Cast AI focuses narrowly on Kubernetes autoscaling, significant savings in commitments, SaaS, AI workloads, and non-Kubernetes compute often go untouched. |
| SI001 | Cast AI | Cast AI Pricing for Automated Kubernetes Management – Cast AI | |
| SI002 | Cast AI | Cast AI Documentation | Getting Started | Cast AI Docs | The platform includes cost monitoring for real-time and longer-period cost reports at the cluster, namespace, and workload levels. |
| SI003 | Cast AI | Cast AI Case Studies: Real Kubernetes Cost Savings & Automation | |
| SI004 | Cast AI | How NielsenIQ Saved Up to 80% on Cloud Costs – Cast AI | Learn how NielsenIQ cut Kubernetes cloud costs by 80% and reduced operational overhead with Cast AI. |
| SI005 | Cast AI | How project44 Saved 50% on GKE in One Month – Cast AI | See how project44 achieved 50% cloud savings on Google Kubernetes Engine through automation. |
| SI006 | Cast AI | How Branch Saved Millions in AWS Cloud Spend – Cast AI | Learn how Branch saves several million dollars annually on AWS compute using Cast AI. |
| SI007 | Cast AI | Leading the Charge in Application Performance Automation: Our $108 Million Series C | We closed an oversubscribed $108 million Series C round led by G2 Venture Partners and SoftBank Vision Fund 2. |
| SI008 | Cast AI | Cast AI Valued at Over $1 Billion With the Launch of its GPU Marketplace | With this round of funding, Cast AI’s valuation exceeds $1 billion. |
| SI009 | Business Wire | Cast AI Valued at Over $1 Billion With the Launch of Its GPU Marketplace | The company also announced a strategic investment from Pacific Alliance Ventures ... With this round of funding, Cast AI’s valuation exceeds $1 billion. |
| SI010 | TechCrunch | Cast AI raises $108M to get the most out of AI, Kubernetes, and other workloads | The company has raised a $108 million Series C that it will be using for more R&D and to expand its business in core markets like the U.S. and elsewhere. |
| SI011 | SiliconANGLE | Cloud optimization startup Cast AI raises $108 million to achieve 'almost unicorn' valuation | |
| SI012 | Unicorns Lithuania | 2,100 Customers in 3 Years: Cast AI Closes a $108 Million Series C Round to Propel the Future of Application Performance Automation | Today, Cast is trusted by over 2,100 organizations ... with the company doubling its customer base between 2023 and 2024. |
| SI013 | Cota Capital | A New Era of Cloud Automation: The Cast AI Growth Story | This funding is a clear reflection of Cast’s rapid revenue growth and the surging demand for its platform. |
| SI014 | Cybernews | Cast AI Review 2026: Can This AI Really Cut Cloud Costs by 50%? | The initial setup can be confusing for teams new to Kubernetes, especially regarding precise IAM permissions. |
| SI015 | G2 | The G2 on Cast AI | Pricing provided by Cast AI. Kubernetes cost monitoring. Free. |
| SI016 | Software Advice | CAST AI Software Reviews, Demo & Pricing | Starting at $200.00 per month. |
| SI017 | Yahoo Finance / Reuters | Cast AI secures $108 million funding to expand cloud automation | This brings Cast AI’s total funding to over $180 million. |
| SI018 | MarketScreener / Reuters | Cast AI secures $108 million funding to expand cloud automation | The oversubscribed round ... valued the company at $850 million. |
| SI019 | Securities and Exchange Commission | IBM 2025 Form 10-K | |
| SI020 | Securities and Exchange Commission | Datadog 2025 Form 10-K | |
| SI021 | Securities and Exchange Commission | NetApp 2025 Form 10-K | |
| SI022 | Cast AI | Cast AI Named a Leader in G2 Spring 2026 Reports for Cloud Cost Management and Auto Scaling | Cast AI ... has been recognized as a Leader in G2’s Spring 2026 Grid Reports for Cloud Cost Management and Auto Scaling, earning top rankings and 20 badges across 36 reports. |
| SI023 | Cast AI | 2026 State of Kubernetes Optimization Report - Cast AI | This report is based on our analysis of tens of thousands of Kubernetes clusters across AWS, GCP, and Azure. |
| SI024 | FinOps Foundation | Usage Optimization FinOps Framework Capability | |
| SI025 | Apptio / IBM | IBM Kubecost - K8s Cost Monitoring - Apptio | |
| SI026 | CloudZero | CAST AI vs Kubecost: Kubernetes Cost Tools Compared (2026) | Cast AI has expanded into GPU optimization, LLM cost management, and database optimization beyond its original Kubernetes focus. |
| SI027 | Cast AI | Hugging Face partners with CAST AI to Optimize AI Workloads on AWS and Google | |
| SI028 | Cast AI | ALLEN Digital | Discover how ALLEN Digital dramatically increased GPU utilization and saved 71% on LLM costs with Kimchi Inference. |
| SI029 | Cast AI | Akamai Kubernetes Optimization Case Study - Cast AI | See how Akamai used Kubernetes optimization automation to stay reliable under change while reducing toil and waste as a byproduct. |
| SI030 | Cast AI | Mercedes-Benz.io | Discover how Mercedes-Benz.io reduces Kubernetes operational overhead and costs using automation from Cast AI. |
| SE001 | Cast AI | Kubernetes Optimization Platform for Performance - Cast AI | |
| SE002 | Cast AI | Cast AI Documentation | Getting Started | Cast AI Docs | Cast AI is an all-in-one Kubernetes automation, optimization, security, and cost management platform. |
| SE003 | Cast AI | Cluster Hibernation | Node Autoscaling | Cast AI Docs | Cluster hibernation allows you to optimize costs by temporarily scaling your cluster to zero nodes while preserving the control plane and cluster state. |
| SE004 | Cast AI | Cast AI Omni | Multi-Cloud Compute Overview | Cast AI Docs | OMNI extends your Kubernetes cluster to additional regions and cloud providers. |
| SE005 | Cast AI | GPU Optimization for AI Infrastructure - Cast AI | Deploy more AI workloads on fewer GPUs anywhere. |
| SE006 | Cast AI | CAST AI Launches AI Enabler to Optimize LLM Deployment and Automate Model Selection | |
| SE007 | Cast AI | Hugging Face partners with CAST AI to Optimize AI Workloads on AWS and Google | |
| SE008 | Cast AI | Cast AI Valued at Over $1 Billion With the Launch of its GPU Marketplace | |
| SE009 | Cast AI | Cast AI's 2026 State of Kubernetes Optimization Report Reveals GPU Utilization at 5% | GPU utilization averaged just 5% across the clusters analyzed. |
| SE010 | StatusGator | CAST AI Status. Check if CAST AI is down or having an outage. | |
| SE011 | GitHub | terraform-provider-castai/docs/resources/autoscaler.md at master · castai/terraform-provider-castai | CAST AI autoscaler resource to manage autoscaler settings. |
| SE012 | GitHub | GitHub - castai/terraform-provider-castai: Terraform provider for CAST AI platform | |
| SE013 | AWS Marketplace | Cast AI - EKS fully automated cost optimization and monitoring | Get EKS monitoring and automated cost optimization in one easy-to-use platform. |
| SE014 | Mercedes-Benz.io | Node Scaling Optimization at Scale: Cut your Kubernetes cluster costs while assuring zero downtime | We solved it by partnering with Cast.ai to bring dynamic, workload-aware autoscaling to life. |
| SE015 | Cast AI | Mercedes-Benz.io | |
| SE016 | Cast AI | ALLEN Digital | Discover how ALLEN Digital dramatically increased GPU utilization and saved 71% on LLM costs with Kimchi Inference. |
| SE017 | Cast AI | Akamai Kubernetes Optimization Case Study - Cast AI | Cast AI offered a robust set of features that perfectly matched Akamai’s use cases and requirements: maximized resource utilization with bin packing, automatic selection of the most cost-efficient compute instances, Spot instance automation throughout the entire instance lifecycle. |
| SE018 | Cast AI | How project44 Saved 50% on GKE in One Month – Cast AI | |
| SE019 | Cast AI | How Branch Saved Millions in AWS Cloud Spend – Cast AI | |
| SE020 | Business Wire | Cast AI Valued at Over $1 Billion With the Launch of Its GPU Marketplace | |
| SE021 | SDxCentral | Cast AI hits unicorn status & launches multicloud platform to make GPUs fungible | |
| SE022 | DEV Community | CAST AI Integration with Amazon EKS — Step-by-Step Guide | |
| SE023 | DeepWiki | castai/terraform-provider-castai | DeepWiki | |
| SE024 | IsDown | Is CAST Down? Check current status and user reports | |
| SE025 | FinOps Foundation | Usage Optimization FinOps Framework Capability | |
| SE026 | CloudZero | CAST AI vs Kubecost: Kubernetes Cost Tools Compared (2026) | |
| SE027 | Cast AI Docs | Kvisor Overview | Kubernetes Security | Cast AI Docs | Kvisor is an open-source security agent designed to enhance the security posture of your Kubernetes clusters. |
| SE028 | Cast AI Docs | Configuring Kvisor Features | Kubernetes Security | Cast AI Docs | Kvisor supports multiple configuration options that can be set via Helm during installation or upgrade. |
| SE029 | Cast AI Docs | Security Dashboard | Kubernetes Security | Cast AI Docs | The Security dashboard provides a comprehensive overview of your Kubernetes clusters’ security posture. |
| SE030 | Cast AI | CAST AI & Security Report awarded CIS Benchmark™ Certification | CAST AI’s Security Report has been certified by the Center for Internet Security. |
| SE031 | GitHub | GitHub - castai/kvisor: Real time Kubernetes issues and vulnerabilities scanning | Real time Kubernetes issues detection and vulnerabilities scanning and runtime. |
| SU001 | Cast AI | Cast AI Case Studies: Real Kubernetes Cost Savings & Automation | Learn how companies are using Cast AI to cut cloud costs, improve performance, and boost DevOps productivity. |
| SU002 | Cast AI | How NielsenIQ Saved Up to 80% on Cloud Costs – Cast AI | Learn how NielsenIQ cut Kubernetes cloud costs by 80% and reduced operational overhead with Cast AI. |
| SU003 | Cast AI | How project44 Saved 50% on GKE in One Month – Cast AI | See how project44 achieved 50% cloud savings on Google Kubernetes Engine through automation. |
| SU004 | Cast AI | How Branch Saved Millions in AWS Cloud Spend – Cast AI | Learn how Branch saves several million dollars annually on AWS compute using Cast AI. |
| SU005 | Cast AI | Akamai Kubernetes Optimization Case Study - Cast AI | Cast AI offered a robust set of features that perfectly matched Akamai’s use cases and requirements. |
| SU006 | Cast AI | ALLEN Digital | Discover how ALLEN Digital dramatically increased GPU utilization and saved 71% on LLM costs with Kimchi Inference. |
| SU007 | Cast AI | Mercedes-Benz.io | Discover how Mercedes-Benz.io reduces Kubernetes operational overhead and costs using automation from Cast AI. |
| SU008 | Cast AI | Hugging Face partners with CAST AI to Optimize AI Workloads on AWS and Google | CAST AI and Hugging Face announced a partnership designed to dramatically reduce the cost of deploying LLMs in the cloud. |
| SU009 | Cast AI | Cast AI Valued at Over $1 Billion With the Launch of its GPU Marketplace | |
| SU010 | Cast AI | Cast AI Named a Leader in G2 Spring 2026 Reports for Cloud Cost Management and Auto Scaling | Cast AI ... has been recognized as a Leader in G2’s Spring 2026 Grid Reports ... earning top rankings and 20 badges across 36 reports. |
| SU011 | BMW Group | BMW Group | |
| SU012 | Cisco | AI Infrastructure, Secure Networking, and Software Solutions | |
| SU013 | FICO | Applied Intelligence – Powering Your Customer Connections. | |
| SU014 | Swisscom | Swisscom home page: About us | |
| SU015 | Akamai | Cloud Computing, Security, Content Delivery (CDN) | Akamai | |
| SU016 | NielsenIQ | Global English Homepage | |
| SU017 | project44 | Decision Intelligence Platform | project44 | |
| SU018 | Branch | Mobile Measurement & Deep Linking Platform | Branch | |
| SU019 | Mercedes-Benz.io | Mercedes-Benz.io | |
| SU020 | Hugging Face | Hugging Face – The AI community building the future. | |
| SU021 | Cybernews | Cast AI Review 2026: Can This AI Really Cut Cloud Costs by 50%? | The initial setup can be confusing for teams new to Kubernetes, especially regarding precise IAM permissions. |
| SU022 | G2 | The G2 on Cast AI | |
| SU023 | Business Wire | Cast AI Valued at Over $1 Billion With the Launch of Its GPU Marketplace | |
| SU024 | Unicorns Lithuania | 2,100 Customers in 3 Years: Cast AI Closes a $108 Million Series C Round to Propel the Future of Application Performance Automation | Cast’s relentless innovation has fueled significant growth ... with the company doubling its customer base between 2023 and 2024. Today, Cast is trusted by over 2,100 organizations. |
| SU025 | Securities and Exchange Commission | SEC.gov | Disclosure Guidance | |
| SU026 | FinOps Foundation | Usage Optimization FinOps Framework Capability | |
| SR001 | Cast AI | Terms of service | These Terms of Service are a legally binding agreement between the applicable Cast AI contracting party and customer. |
| SR002 | Cast AI | Privacy policy | Where applicable ... the controller of your personal data is Cast AI Baltic UAB. |
| SR003 | Cast AI | Customer data processing | This Customer Data Processing Addendum supplements and forms part of the Cast AI Terms of Service. |
| SR004 | Cast AI | Information security policy | Cast AI has achieved ISO 27001 certification. |
| SR005 | Cast AI | Cast AI is Officially SOC 2 Type II Compliant | Cast AI has passed the independent SOC 2 Type II examination. |
| SR006 | Cast AI Docs | Platform Permissions and Data Privacy | Cast AI Docs | This section provides an overview of the permissions used by Cast AI components, required port openings, and the data collected by the components. |
| SR007 | Cast AI Docs | Security and Compliance | Database Optimizer | Cast AI Docs | All query SQL is anonymized at ingestion time ... so that no personally identifiable information is stored. |
| SR008 | Cast AI Docs | Kvisor Overview | Kubernetes Security | Cast AI Docs | Kvisor is an open-source security agent designed to enhance the security posture of your Kubernetes clusters. |
| SR009 | Cast AI Docs | Configuring Kvisor Features | Kubernetes Security | Cast AI Docs | Kvisor supports multiple configuration options that can be set via Helm during installation or upgrade. |
| SR010 | Cast AI Docs | Security Dashboard | Kubernetes Security | Cast AI Docs | The Security dashboard provides a comprehensive overview of your Kubernetes clusters security posture. |
| SR011 | Cast AI | CAST AI & Security Report awarded CIS Benchmark™ Certification | Cast AI’s Security Report has been certified by the Center for Internet Security. |
| SR012 | Center for Internet Security | Cast AI | Cast AI products have been awarded CIS Security Software Certification for CIS Benchmark(s). |
| SR013 | StatusGator | CAST AI Status. Check if CAST AI is down or having an outage. | |
| SR014 | IsDown | Is CAST Down? Check current status and user reports | IsDown has monitored CAST continuously since January 2025 ... documenting 42 outages and incidents. |
| SR015 | Cybernews | Cast AI Review 2026: Can This AI Really Cut Cloud Costs by 50%? | The initial setup can be confusing for teams new to Kubernetes, especially regarding precise IAM permissions. |
| SR016 | Securities and Exchange Commission | SEC.gov | Disclosure Guidance | |
| SR017 | Cast AI | About Cast AI: The Application Performance Automation Platform | Trusted by 2100+ companies globally. |
| SR018 | Cast AI | Cast AI Careers: Join Our Team – Cast AI | Working at Cast AI means pushing your limits, learning fast, and seeing your impact. |
| SR019 | Cast AI | Cast AI Valued at Over $1 Billion With the Launch of its GPU Marketplace | |
| SR020 | Business Wire | Cast AI Valued at Over $1 Billion With the Launch of Its GPU Marketplace | |
| SR021 | Cast AI Docs | Getting Started with Kubernetes Security | Kubernetes Security | Cast AI Docs | The Cast AI Kubernetes Security feature set is undergoing significant changes. |
| SR022 | Cast AI | Cast AI's 2026 State of Kubernetes Optimization Report Reveals GPU Utilization at 5% | |
| SR023 | FinOps Foundation | Usage Optimization FinOps Framework Capability | |
| SR024 | Cast AI | Kubernetes Optimization Platform for Performance - Cast AI | |
| SR025 | GitHub | GitHub - castai/kvisor: Real time Kubernetes issues and vulnerabilities scanning | Real time Kubernetes issues detection and vulnerabilities scanning and runtime. |
| SR026 | Cooley | State AI Laws – Where Are They Now? | |
| SR027 | Konfirmity | SOC 2 Controls Mapped To CIS: Best Practices and Key Steps for 2026 | |
| SR028 | Kiteworks | AI Regulation in 2026: The Complete Survival Guide for Businesses | |
| SR029 | Baker Donelson | 2026 AI Legal Forecast: From Innovation to Compliance | |
| SR030 | Godfrey & Kahn | 2026 AI Laws Update: Key Regulations and Practical Guidance | |
| SV001 | Cast AI | Cast AI Valued at Over $1 Billion With the Launch of its GPU Marketplace | With this round of funding, Cast AI’s valuation exceeds $1 billion. |
| SV002 | Business Wire | Cast AI Valued at Over $1 Billion With the Launch of Its GPU Marketplace | |
| SV003 | TechCrunch | Cast AI raises $108M to get the most out of AI, Kubernetes, and other workloads | The round has the company at near unicorn valuation, post-money — close to $900 million. |
| SV004 | Yahoo Finance / Reuters | Cast AI secures $108 million funding to expand cloud automation | The oversubscribed round ... valued the company at $850 million. |
| SV005 | MarketScreener / Reuters | Cast AI secures $108 million funding to expand cloud automation | This brings Cast AI’s total funding to over $180 million. |
| SV006 | SaasRise | The AI Software Valuation Report 2026 | AI-native companies command a median 21.2x EV/Revenue in VC rounds and 11.5x in M&A buyouts. |
| SV007 | Public Comps | Public Comps | Need EV/NTM Revenue or what is best in class payback periods? Get benchmarks and comps instantly. |
| SV008 | Opslyft | Cloud Unit Economics & Cloud COGS Playbook for FinOps (2026) | |
| SV009 | Windsor Drake | AI Valuations: Q2 2026 | Windsor Drake’s Q2 2026 EV/Revenue benchmark for AI-native application software sits near 11x. |
| SV010 | Multiples.vc | Public Software Valuation Multiples — May 2026 | Public investors seem to currently value software companies based on AI application, technical complexity, market position, and specialization depth. |
| SV011 | Securities and Exchange Commission | IBM 2025 Form 10-K | |
| SV012 | Securities and Exchange Commission | Datadog 2025 Form 10-K | |
| SV013 | Securities and Exchange Commission | NetApp 2025 Form 10-K | |
| SV014 | Cast AI | Cast AI's 2026 State of Kubernetes Optimization Report Reveals GPU Utilization at 5% | GPU utilization averaged just 5% across the clusters analyzed. |
| SV015 | CloudZero | CAST AI vs Kubecost: Kubernetes Cost Tools Compared (2026) | Cast AI has expanded into GPU optimization, LLM cost management, and database optimization beyond its original Kubernetes focus. |
| SV016 | FinOps Foundation | Usage Optimization FinOps Framework Capability | |
| SV017 | Securities and Exchange Commission | Cloudflare 2025 Form 10-K | |
| SV018 | Securities and Exchange Commission | Dynatrace 2026 Form 10-K | |
| SV019 | Securities and Exchange Commission | DigitalOcean 2026 Form 10-K | |
| SV020 | Securities and Exchange Commission | MongoDB 2026 Form 10-K | |
| SV021 | Securities and Exchange Commission | Snowflake 2026 Form 10-K | |
| SV022 | Tech.eu | Cast AI becomes Lithuania’s 5th Unicorn | |
| SV023 | Cast AI | Kubernetes Optimization Platform for Performance - Cast AI | |
| SV024 | Cast AI | 2026 State of Kubernetes Optimization Report - Cast AI | |
| SV025 | Unicorns Lithuania | 2,100 Customers in 3 Years: Cast AI Closes a $108 Million Series C Round to Propel the Future of Application Performance Automation | The company doubled its customer base between 2023 and 2024. Today, Cast is trusted by over 2,100 organizations. |
| SV026 | Premier Alternatives | Cast AI Valuation: $1.0B (2026) | Funding history data has not been imported for this company yet. |
| SV027 | Cooley | State AI Laws – Where Are They Now? | |
| SV028 | Konfirmity | SOC 2 Controls Mapped To CIS: Best Practices and Key Steps for 2026 | |
| SV029 | Kiteworks | AI Regulation in 2026: The Complete Survival Guide for Businesses | |
| SV030 | Baker Donelson | 2026 AI Legal Forecast: From Innovation to Compliance |