Diligence report AI Data & Infrastructure Late-stage private (Series F+) 2026-05-09

Scale AI

From Data Labeling to Full-Stack AI Infrastructure: Scale AI at an Inflection Point

Scale AI holds a defensible position in AI infrastructure with strong government exposure, but faces customer concentration risk, a CEO transition, and a pivotal business model shift away from data-labeling.

Cover facts

Valuation 01

29B+ USD [CO007]

Total Raised 02

1.6B+ USD [CO006]

Series F 03

1B USD [CO005]

Employees 04

~1,000 [CO004]

Founded 05

2016 [CO001]

Company profile

Scale AI is a San Francisco-based AI infrastructure company founded in 2016 by Alexandr Wang. It provides data annotation, RLHF, model evaluation, and enterprise GenAI platform services to leading AI laboratories, Fortune 500 enterprises, and U.S. government agencies including the Department of Defense. Following a $1 billion Series F in 2024 and a landmark Meta strategic investment valuing the company at over $29 billion in 2025, Scale is pivoting from data-labeling to a broader enterprise and government AI platform play under interim CEO Jason Droege.

Website: scale.com
Founded: 2016-01-01
Founders: Alexandr Wang
Founding location: San Francisco, CA
Headquarters: San Francisco, CA
Product: Scale Data Engine (data annotation and curation), Scale GenAI Platform (enterprise AI applications), Scale RLHF (LLM training data), Scale Evaluation (model safety and capability benchmarking), and Donovan (defense AI agents platform).
Customers: AI laboratories, large enterprises, and U.S. government/defense agencies.
Business model: Enterprise contracts for data annotation and AI platform services; government contracts for defense AI; self-serve pay-as-you-go for experimental users.
Stage: Late-stage private (Series F+, post-Meta strategic investment)
Funding status: $1B Series F (May 2024, $13.8B valuation) + Meta strategic investment (June 2025, $29B+ valuation)

[CO001, CO004, CO005, CO007]

Executive summary

Top strengths

Unique position in U.S. government/defense AI data and evaluation with FedRAMP High + DoD IL4 clearances
Strong brand and relationship network with leading AI labs despite recent customer attrition
Data flywheel and proprietary annotation methodology differentiate quality at scale
Donovan platform creates a new product moat in defense agentic AI

Top risks

Customer concentration: Google and OpenAI relationships deteriorating post-Meta deal
CEO transition risk: founder departure to Meta, untested interim CEO Droege
Business model pivot: moving away from core data-labeling while new enterprise/government revenue model unproven
Structural conflict of interest: Meta as strategic investor and competitor creates customer trust risk
Revenue undisclosed: valuation multiples unverifiable without financial transparency

Open gaps

Revenue and ARR: no public disclosure; valuation at $29B+ is unverifiable without revenue basis
Board composition post-Wang departure: full governance structure not publicly known
Meta commercial agreement scope and exclusivity terms not disclosed
True customer count and NRR post-2025 attrition events unknown
Mercor lawsuit outcome and IP exposure uncertain

Chapter 01

01Company Overview

1.1 Company Identity and Overview

Scale AI, Inc. is headquartered in San Francisco, California, and was incorporated in 2016. The company's stated mission is to "develop reliable AI systems for the world's most important decisions." Scale operates as a late-stage private technology company following its $1 billion Series F round in May 2024 and the subsequent Meta strategic investment in June 2025, which placed the company's implied valuation at over $29 billion. Scale describes itself as providing the data and full-stack technologies needed to build, evaluate, and improve artificial intelligence systems across the complete AI development lifecycle. Scale's core business spans five primary product lines: the Scale Data Engine (data collection, curation, and annotation for model training), the Scale GenAI Platform (enterprise-grade generative AI application deployment), Scale RLHF (reinforcement learning from human feedback data for large language model training), Scale Evaluation (model capability and safety benchmarking for model developers and the public sector), and Donovan (specialized AI agent workflows for defense and intelligence missions). The company also operates a self-serve tier with pay-as-you-go pricing, providing access to its annotation pipeline at lower entry cost for research and experimental projects. As of its about page, Scale has processed more than 15 billion human-labeled decisions and paid contributors over $1 billion globally. [CO001, CO002, CO003, CO004, CO008, CO009]

1.2 Founders, Leadership, and Governance

Scale AI was founded in 2016 by Alexandr Wang, who left MIT at age 19 to build the company. Wang grew Scale from a small data-labeling startup into a company that secured contracts with leading AI laboratories, Fortune 500 enterprises, and the United States Department of Defense. He served as CEO from founding until June 2025, when he departed to join Meta's AI efforts following Meta's landmark strategic investment in Scale. Wang retained a seat on Scale AI's board of directors following his departure. Jason Droege was appointed Interim CEO of Scale AI in June 2025 upon Wang's departure. Droege had joined Scale in September 2024 as Chief Strategy Officer. Prior to Scale, Droege founded Uber Eats and scaled it to a $19 billion GMV run rate, and previously served as a Vice President at Uber and as a partner at Benchmark, the prominent venture capital firm and Scale investor. His operational background in marketplace and platform businesses is central to Scale's stated pivot toward enterprise and government platform revenue. Scale's governance structure is not publicly disclosed in detail. Wang retains board representation, and the company has not disclosed the full board composition or independent director roster following the Meta investment and leadership transition. This creates a material governance opacity risk for prospective investors, particularly given Wang's concurrent role at Meta—a strategic investor and potential competitor for AI talent and technology. [CO001, CO010, CO011, CO012, CO013, CO014]

Leadership and Founder Table
Name	Role	Background	Tenure at Scale	Key-Person Risk
Alexandr Wang	Founder, Board Director	MIT dropout; founded Scale at 19; built to $29B+ valuation; joined Meta AI Jun 2025	2016–Jun 2025 (CEO); board ongoing	High
Jason Droege	Interim CEO	Founded Uber Eats ($19B GMV); VP at Uber; Benchmark partner; joined Scale as CSO Sep 2024	Sep 2024–present (CEO from Jun 2025)	Medium-High
Board / Governance	Not fully disclosed	Wang retains seat; other directors not publicly named post-Meta deal	Ongoing	High
Leadership team	CTO, CFO, other C-suite not publicly named	Positions not publicly profiled post-transition	Ongoing	Medium

Source: Scale AI official announcements, TechCrunch, CNBC, Benchmark partner page (broken as of access date).

[CO010, CO011, CO012, CO013, CO014, CO015]

1.3 Funding History and Capital Structure

Scale AI has raised approximately $1.6 billion in disclosed venture funding across multiple rounds, culminating in its Series F in May 2024 and a strategic investment from Meta in June 2025. The company raised approximately $600 million pre-Series F across earlier rounds, including a $325 million Series E in 2021 that valued Scale at approximately $7.3 billion. The 2023 period saw a significant workforce reduction of approximately 20% of employees, reflecting broader AI market pressures and recalibration of data-labeling demand. In May 2024, Scale closed a $1 billion Series F round led by Accel, at a post-money valuation of $13.8 billion. This round included primary capital and a secondary component allowing existing shareholders to liquidate. New investors included Amazon, Cisco, Intel, AMD, ServiceNow, DFJ Growth, WCM Investment Management, Elad Gil, and Meta. Returning investors included Nvidia, Coatue, Y Combinator, Index Ventures, Founders Fund, Tiger Global, Thrive Capital, Spark Capital, Greenoaks, Wellington Management, and Nat Friedman. In June 2025, Meta made what CNBC described as its largest-ever bet on AI, paying approximately $14.3 billion for a minority stake in Scale AI (reported as approximately 49% of outstanding equity on a fully diluted basis), implying a company valuation of over $29 billion. Scale stated it remains operationally independent from Meta. The proceeds were distributed to existing shareholders and holders of vested equity rather than entirely to Scale's balance sheet for operating capital, creating uncertainty about Scale's remaining cash position and runway. [CO005, CO006, CO007, CO017, CO018, CO019]

Scale AI Company Snapshot Metrics
Metric	Value / Status	Date	Confidence	Notes / Gaps
Founded	2016, San Francisco CA	2016	high	Confirmed by multiple sources
Current CEO	Jason Droege (Interim)	Jun 2025	high	Wang departed Jun 2025
Founder	Alexandr Wang (Board)	Jun 2025	high	Retains board seat after departure
Headcount	~1,000 employees (post-Jul 2025 layoffs)	Jul 2025	medium	Approx; 200 let go in Jul 2025 + 500 contractors
Valuation	$29B+ implied (Meta deal)	Jun 2025	medium	Implied from $14.3B for ~49% stake; not formally disclosed
Total Raised	~$1.6B+ disclosed	May 2024	medium	Excludes Meta deal (distributed to shareholders)
Series F	$1B at $13.8B post-money	May 2024	high	TechCrunch + company confirmation
Revenue / ARR	Not publicly disclosed	2026	low	Private company; estimated hundreds of millions ARR by market proxy
Gross Margin	Not disclosed	2026	low	Private; no public disclosure
Key Certifications	SOC 2 Type II, ISO 27001, DoD IL4, FedRAMP High	2024-2025	high	Per scale.com/legal/security
Primary Verticals	AI labs, Enterprise, Government/Defense	2025	high	Per product and customer pages
HQ	San Francisco, CA	2025	high	Per about page

Confidence ratings reflect public disclosure quality; low-confidence items require direct disclosure in due diligence.

[CO001, CO004, CO005, CO006, CO007, CO010]

Stakeholder or Investor Map
Round	Date	Amount	Valuation	Lead / Key Investors	Notes
Seed / Angel	2016	~$3M est.	~$15M est.	Y Combinator, angels	Approximate; YC S2016
Series A	2017	~$18M est.	~$100M est.	Accel, Index Ventures, Founders Fund	Approximate from public records
Series B	2018	~$30M est.	~$300M est.	Accel, Index, Founders Fund, Tiger Global	Approximate
Series C	2019	~$100M est.	~$1B est.	Greenoaks, existing investors	Approximate
Series D	2020	~$155M	~$3.5B est.	Tiger Global, Index, Accel, Spark, Thrive	Approximate; exact amounts vary by source
Series E	2021-08	$325M	~$7.3B	Coatue, Y Combinator, Founders Fund, Tiger, existing	Confirmed via TechCrunch reporting
Bridge / Secondary	2022-2023	Various	~$7B range	Existing investors; secondary sales	2023 layoffs 20%
Series F	2024-05	$1B	$13.8B post-money	Accel (lead), Amazon, Meta, Cisco, Intel, AMD, ServiceNow, Nvidia, YC, Index, Founders Fund, Tiger, Thrive, Spark, Greenoaks, Wellington, Nat Friedman, Elad Gil, DFJ Growth, WCM	Confirmed; primary + secondary mix
Meta Strategic Investment	2025-06	~$14.3B (to shareholders)	$29B+ implied	Meta (minority ~49%)	Proceeds distributed to existing shareholders; Scale remains independent

Early round amounts and valuations are estimates from public records and secondary sources; only Series E, Series F, and Meta deal amounts are confirmed via tier-one reporting.

[CO005, CO006, CO007, CO017, CO018, CO019]

FO003: Scale AI Key Performance Indicators (as of May 2026)

High-level KPI snapshot reflecting publicly disclosed or strongly indicated metrics for Scale AI as of the report date.

Valuation implied from Meta transaction; headcount is approximate post-layoff estimate; revenue not publicly disclosed.

[CO004, CO005, CO006, CO007, CO029, CO030]

1.4 Products and Business Model

Scale AI's business model centers on selling high-quality data and AI infrastructure services to organizations building and deploying machine learning systems. Revenue streams include enterprise data annotation contracts (volume-based and project-based), the Scale GenAI Platform (managed SaaS and professional services for enterprise AI applications), government and defense contracts (including Department of Defense data curation and the Donovan platform for intelligence and military operations), and a self-serve tier for smaller or experimental use cases. Scale does not publicly disclose revenue, gross margins, or ARR. The Scale Data Engine collects, curates, annotates, and validates data across text, images, video, audio, and documents. The GenAI Platform transforms enterprise data into domain-specific AI applications using a proprietary pipeline. Scale RLHF provides curated preference data for reinforcement learning from human feedback, which is central to training large language models for instruction-following and safety. Scale Evaluation offers trusted benchmarking for model capability (including the Scale Leaderboard) and safety (including the WMDP harmful-knowledge benchmark), serving both commercial model developers and U.S. government agencies. The Donovan platform targets defense and intelligence missions with specialized AI agent workflows that can operate in classified environments given Scale's DoD IL4 and FedRAMP High security certifications. Pricing is bifurcated: enterprise customers receive custom pricing with dedicated operations teams and SLA commitments, while self-serve customers access the platform on a pay-as-you-go basis with the first 1,000 labeling units free. Scale publicly discloses no revenue figures; its estimated hundreds of millions in ARR is derived from investor commentary and proxy comparisons, not confirmed disclosures. [CO002, CO003, CO008, CO024, CO025, CO026]

FO002: Scale AI Business Architecture — Value Flow

How Scale AI converts data, compute, and human expertise into AI-ready outputs for model developers, enterprises, and government customers.

[CO002, CO003, CO008, CO024, CO025, CO026]

1.5 Key Milestones and Adverse Events

Scale AI's trajectory from founding through 2026 spans a decade of growth, strategic pivots, and significant adverse events that define its current investment profile. Founded in 2016, Scale initially focused on programmatic data labeling for self-driving vehicles (Waymo was an early customer) before expanding into broader enterprise AI training data. By 2021, the company had achieved a $7.3 billion valuation with its Series E, and by 2022 was publicly describing itself as a $7 billion company with more than 700 employees providing the DoD's autonomy data layer. The 2023 period saw Scale execute a painful 20% workforce reduction amid a slowdown in AI foundation model training data spend. Scale rebounded with its $1 billion Series F in May 2024, attracting a broad consortium of strategic and financial investors at $13.8 billion. In 2024, Scale also entered White House AI safety voluntary commitments and secured a DoD data curation contract for joint force operations. June 2025 marked the most consequential inflection: Meta's $14.3 billion strategic investment and the simultaneous departure of founder Alexandr Wang to join Meta's AI work. Three weeks later (July 2025), interim CEO Droege announced layoffs of approximately 200 employees (14% of staff) and 500 contractors, citing overinvestment in the data-labeling headcount relative to the company's strategic direction toward enterprise and government platform revenue. In June 2025, CNBC reported that Google—Scale's then-largest customer—planned to wind down or significantly reduce its Scale relationship due to concerns about competitive conflict with Meta. OpenAI also wound down its Scale work in June 2025. In September 2025, Scale filed a lawsuit against Mercor, a competitor and former employee, alleging customer poaching. These adverse events represent material customer concentration and governance risks for investors evaluating Scale. [CO001, CO031, CO032, CO033, CO034, CO035]

Milestone Table
Date	Event	Type	Amount / Valuation / Status	Participants	Implication
2016	Scale AI founded by Alexandr Wang	founding	$15M est. seed	Wang, YC, angels	Established data-labeling for autonomous vehicles
2016	Accepted to Y Combinator S2016	partnership	—	YC, Scale	Early validation; YC network access
2017	Series A closed	financing	~$18M	Accel, Index, Founders Fund	Enabled product expansion
2019	Series C; crossed $1B valuation	financing	~$100M at ~$1B	Greenoaks, others	Unicorn milestone
2021-08	Series E at $7.3B valuation	financing	$325M	Coatue, YC, Founders Fund, Tiger	Rapid scale; peak of AI training data boom
2022	DoD autonomy data layer blog; 700+ employees	scale	$7B est. valuation	DoD / Scale	Defense AI footprint established
2023	20% workforce reduction	adverse	—	Scale, employees	Market correction; AI training demand slowdown
2024-05	Series F $1B at $13.8B	financing	$1B / $13.8B	Accel lead, Amazon, Meta, Intel, AMD, Cisco, others	Valuation reset upward; strategic investor base
2024	White House AI voluntary safety commitments	regulatory	—	White House, Scale	Regulatory positioning; brand legitimacy
2024	DoD data curation contract (Joint Force)	regulatory	Undisclosed	DoD, Scale	Defense revenue anchor
2024-11	Defense Llama announced (national security LLM)	product	—	Scale, DoD ecosystem	First productized defense LLM offering
2025-06	Meta $14.3B strategic investment; Wang departs	financing	$14.3B / $29B+ implied	Meta, Scale, Wang → Meta AI	Transformative capital event; founder exit
2025-06	Google largest customer plans to exit	adverse	—	Google, Scale	Customer concentration risk crystallized
2025-06	OpenAI winds down Scale relationship	adverse	—	OpenAI, Scale	Second major lab customer loss
2025-07	Layoff of 200 employees (14%) + 500 contractors	adverse	—	Droege, Scale	Data-labeling pivot confirmed operationally
2025-09	Lawsuit filed vs Mercor (customer poaching)	adverse	—	Scale vs Mercor	IP and competitive conflict risk

Sources: Scale AI official pages, TechCrunch reporting, CNBC reporting, Stanford HAI AI Index 2025. Early-round financing amounts estimated; later events confirmed by tier-one media.

[CO001, CO005, CO006, CO007, CO017, CO019]

FO001: Scale AI Corporate Timeline

Chronological milestones from founding (2016) through the 2025 Meta investment and organizational pivot, highlighting financing events, product launches, and adverse events.

Early round dates (2016-2019) are approximate; exact Series A-C timing not publicly confirmed.

[CO001, CO005, CO007, CO017, CO031, CO033]

1.6 Exhibits

Chapter 02

02Market Analysis

2.1 Market Boundary and Definition

Scale AI operates at the intersection of several distinct but related markets, which requires explicit boundary-setting before sizing can be meaningfully attempted. The core market is AI data services: the collection, annotation, curation, and validation of training and evaluation data for machine learning models. This market includes human-labeled data for computer vision, natural language processing, speech recognition, and multimodal AI. Adjacent to this is the AI model evaluation market — trusted benchmarking and safety testing for large language models — where Scale Evaluation and the Scale Leaderboard compete. Expanding outward, Scale's GenAI Platform competes in the enterprise AI deployment market: tools and services that help large organizations build, customize, and operationalize generative AI applications on top of foundation models. This market overlaps with hyperscaler AI services (Azure AI, AWS Bedrock, Google Vertex AI), boutique AI consulting, and AI model API vendors. The Donovan platform carves out a specialized government and defense AI vertical, which operates under distinct procurement rules and budget mechanisms separate from commercial AI spending. Status-quo substitutes for Scale's core data annotation services include internal human review teams (many large AI labs and enterprises originally built their own annotation pipelines), lower-cost offshore service providers, and automated synthetic data generation. The key question for market sizing is whether to define the served market narrowly (AI data annotation services only) or broadly (all AI infrastructure and tooling spend). This chapter defines the primary market as AI data services and evaluation infrastructure (annotation, RLHF, benchmarking), with enterprise GenAI platform and government AI as adjacent submarkets. [CM001, CM002, CM003, CM004, CM005]

Market Definition Table
Submarket	Included Spend	Excluded Spend	Status-Quo Substitute	Scale AI Presence
AI Data Annotation	Human labeling of text, image, video, audio, document data for ML training	Generic crowdsource platforms (Mechanical Turk)	Internal annotation teams; offshore QA providers	Core product (Scale Data Engine)
RLHF / Preference Data	Expert human feedback for LLM instruction-following and safety alignment	Raw survey data; synthetic preference generation	Internal RLHF teams at large labs	Scale RLHF product
Model Evaluation & Benchmarking	LLM capability + safety testing; redteaming; leaderboards	Internal A/B testing; academic benchmarks	Self-evaluation; open benchmarks (MMLU, HELM)	Scale Evaluation + Leaderboard
Enterprise GenAI Platform	Domain-specific GenAI app development on top of foundation models	Hyperscaler AI services (Azure, AWS, GCP)	Custom in-house GenAI builds; boutique AI consulting	Scale GenAI Platform
Government / Defense AI	DoD-cleared AI data services; mission-critical AI agents	General government IT outsourcing	Defense-clearance competitors; internal DoD teams	Donovan + Public Sector Data Engine
AI Data Marketplace (self-serve)	Pay-as-you-go annotation for startups and researchers	Unstructured crowdsource platforms	Mechanical Turk; intern teams; student labelers	Scale self-serve tier

Market boundaries defined for this analysis; actual market size depends on which segments are included. Not an exhaustive market map — adjacencies like AI infrastructure (compute, MLOps) and AI consulting are excluded.

2.2 TAM, SAM, and SOM Sizing Across Market Lenses

Precise TAM/SAM/SOM figures for AI data services are not available from public sources with the specificity needed for high-confidence valuation analysis. The figures presented here are evidence-constrained estimates synthesized from publicly available market proxies, investor reporting, and competitor disclosure patterns. The Total Addressable Market (TAM) for global AI data services and infrastructure — including annotation, RLHF, evaluation, and GenAI platform tooling — is estimated at $5 to $20 billion annually as of 2025, with wide uncertainty reflecting the market's rapid evolution and inconsistent boundary definitions across analyst estimates. This estimate is anchored by McKinsey's finding that 88% of surveyed organizations now use AI in at least one function (up from 78% in 2024), and the Stanford HAI AI Index 2025 finding that AI investment has accelerated sharply. If each AI-adopting Fortune 1000 enterprise spent $1M–$10M annually on AI data and evaluation infrastructure, aggregate spend would be $1B–$10B from enterprise alone, with AI labs adding substantial incremental demand. The Serviceable Addressable Market (SAM) for Scale AI — large enterprises, leading AI labs, and U.S. government agencies with complex AI data needs and budget for premium quality — is estimated at $2 to $8 billion, reflecting Scale's positioning at the quality end of the market rather than the low-cost commodity annotation segment. The Serviceable Obtainable Market (SOM) — Scale's realistic 3-5 year capture — is estimated at $500M to $2B, which is consistent with Scale's implied enterprise and government revenue trajectory based on its late-stage private valuation. Revenue is not disclosed; these estimates carry high uncertainty and require verification against actual financials. [CM006, CM007, CM008, CM009, CM010, CM011]

TAM / SAM / SOM or Sizing Lens Table
Sizing Lens	Market Scope	Estimate	Confidence	Basis / Key Assumptions
TAM — Broad AI Data & Infrastructure	All AI data services + evaluation + enterprise GenAI tooling globally	$10B–$30B	low	McKinsey: 88% of orgs use AI; if large orgs spend $1M–$10M on data/tooling, aggregates to $10B+; high uncertainty
TAM — AI Data Annotation Only	Human-labeled data services for ML globally	$2B–$8B	low	Derived from public-company analogs (Appen revenue proxy) + scale-factor for global market; highly uncertain
SAM — Premium AI Data (Scale's position)	Large enterprises, AI labs, government — premium annotation + evaluation + platform	$1.5B–$6B	low	Scale targets quality end; SAM ~30–50% of annotation TAM; government adds distinct SAM segment
SAM — Government / Defense Submarket	U.S. government AI data and evaluation budget addressable by Scale	$300M–$1B	low	Estimated from DoD AI investment growth and cleared-vendor supply constraints; no public budget breakout
SOM — Scale's 3-5yr Obtainable	Scale AI's realistic revenue capture in 3-5 years given competitive position	$500M–$2B	low	Consistent with $29B valuation at 15-60x ARR if ARR is in $500M-$2B range; no revenue confirmation

All estimates are evidence-constrained approximations derived from market proxy data, investor reporting, and public traction metrics. Revenue is not publicly disclosed. These figures are directional only and require verification against actual financials.

FM001: AI Data Services Market Sizing — Multiple Lenses

Evidence-constrained estimates for Scale AI's addressable market across TAM, SAM, and SOM, illustrating the range of plausible market sizes depending on scope definition.

All values are mid-point estimates in billions USD; uncertainty ranges are wide (2x–4x). No public analyst report provides specific AI data annotation market sizing with segment granularity. Derived from McKinsey adoption data, competitor proxy revenues, and valuation multiples.

[CM006, CM007, CM008, CM009, CM010]

FM002: Market Estimate Range — AI Data Services TAM (2025)

Uncertainty ranges for AI data services market size by scope definition, reflecting analyst boundary disagreement and rapid market evolution.

Ranges derived from McKinsey AI adoption data, Appen public revenue as proxy for annotation-only TAM, and Scale AI's implied valuation/revenue relationship. High uncertainty; all estimates require verification against actual financial disclosure.

[CM007, CM008, CM009, CM010, CM011]

2.3 Buyer and User Segmentation with Adoption Paths

Scale AI's addressable market consists of four primary buyer segments with materially different budget authorities, procurement processes, and switching costs. Understanding these segments is essential to evaluating Scale's revenue durability and growth ceiling. The first segment — AI Research Laboratories and Foundation Model Developers — represents Scale's original customer base. Companies like OpenAI, Meta, Google DeepMind, Anthropic, Cohere, and Adept require massive volumes of high-quality labeled data for pretraining and RLHF. Budget authority resides in research and engineering functions. Procurement is typically direct negotiation with multi-year contracts. Switching costs are moderate: labs can build internal annotation pipelines or switch providers, as demonstrated by OpenAI and Google's 2025 departures. This segment carries high revenue concentration risk. The second segment — Large Enterprise AI Adopters — includes Fortune 500 companies using AI for internal automation, customer products, and competitive differentiation. Companies like Cisco, Etsy, Instacart, and Pinterest represent this category, using Scale's GenAI Platform for domain-specific AI application development. Budget authority often sits in CTO/CDO organizations with multi-year platform commitments. Switching costs are higher than for annotation services due to platform integration depth. The third segment — U.S. Government and Defense — is the most distinctive and defensible for Scale. Budget authority lies with DoD program offices, DHS, NSA/IC, and civilian agencies. Procurement follows federal acquisition regulations with multi-year contract vehicles. Switching costs are very high due to clearance requirements (DoD IL4, FedRAMP High), institutional knowledge, and competitive moat from the Donovan platform. This segment provides stable, long-cycle revenue that is insulated from commercial customer concentration risk. The fourth segment — AI Startups and Research Organizations — accesses Scale through its self-serve tier. Budget authority is dispersed; spend is lower per customer but volume can be significant. Switching costs are low (pay-as-you-go). [CM012, CM013, CM014, CM015, CM016, CM017]

Segment / Buyer Map
Segment	Buyer Profile	Budget Owner	Procurement Path	Switching Cost	Scale AI Fit
AI Labs (Foundation Model)	OpenAI, Meta, Google DeepMind, Anthropic, Cohere	Head of Research / VP Engineering	Direct negotiation; multi-year contracts	Medium	High historically; currently at risk from Meta conflict
Large Enterprise (F500)	Cisco, Etsy, Instacart, Pinterest, TIME	CTO / CDO / VP AI	RFP / direct sales; 12-24 month cycles	High	Growing; GenAI Platform primary vehicle
U.S. Government / DoD	DoD, IC agencies, civilian federal agencies	Program Office / CISO / CTOs	Federal acquisition; IDIQ/task orders; cleared vehicles	Very High	Donovan + clearances = unique positioning
AI Startups / Researchers	Series A-C AI companies; university labs; small enterprises	Founders / ML leads	Self-serve; credit card; minimal procurement friction	Low	Self-serve tier; lower ARPU; volume model
International Government	Allied nations' defense / intelligence agencies	Procurement offices; foreign ministry	International government contracts; complex clearance	Very High	Global Public Sector division; early-stage
Enterprise Media / Content	Publishers, media companies deploying GenAI for content	VP Digital / CTO	Direct sales; platform subscription	Medium	TIME case study as reference; growing vertical

Buyer profiles based on publicly disclosed Scale customer references and product page descriptions. Revenue contribution by segment not publicly disclosed.

FM003: Buyer Segment Map — Budget Authority and Scale AI Fit

How Scale AI's products connect to each buyer segment through distinct value propositions, from AI labs through enterprise and government customers.

[CM012, CM013, CM014, CM015, CM016, CM017]

2.4 Growth Drivers and Adoption Constraints

The most powerful growth driver for Scale AI's addressable market is the accelerating enterprise adoption of AI. McKinsey's 2025 State of AI survey documented a jump from 78% to 88% of organizations using AI in at least one business function year-over-year, with 62% actively experimenting with AI agents. This macro expansion creates structural demand for data annotation, model evaluation, and enterprise AI deployment tooling across all four buyer segments. A second major driver is the proliferation of large language model providers — OpenAI, Meta, Anthropic, Google, Mistral, Cohere, and others — each requiring continual RLHF cycles, safety benchmarking, and capability evaluation at scale. As the number of foundation model developers grows, so does the demand for expert-quality training data and impartial evaluation infrastructure. Scale's Evaluation product and Scale Leaderboard are positioned to benefit from this proliferation. U.S. government AI investment represents a particularly robust growth driver. Scale's clearances (DoD IL4, FedRAMP High), Donovan platform, and active defense contracts position it to capture a growing share of the federal AI budget. Defense AI spending is expected to grow materially as DoD integrates AI into surveillance, logistics, cybersecurity, and autonomous systems — all of which require trusted data infrastructure. Adoption constraints are also significant. First, the largest AI labs are diversifying away from Scale following Meta's strategic investment, creating potential top-line pressure. Second, synthetic data generation technologies (used by some AI labs internally) could reduce demand for human-labeled data over time for some applications. Third, intense competition from lower-cost providers (Appen, SuperAnnotate, offshore teams) creates margin pressure in the commodity annotation segment. Fourth, enterprise AI programs remain heavily in pilot/POC stage — McKinsey notes that most organizations are still testing rather than scaling AI — meaning enterprise platform revenue growth may lag market adoption. Fifth, government procurement cycles are long and budget-dependent, creating lumpiness in contract revenue. [CM019, CM020, CM021, CM022, CM023, CM024]

Growth Drivers and Constraints Table
Factor	Type	Direction	Magnitude	Time Horizon	Evidence
Enterprise AI adoption expansion	Driver	↑	High	2025-2028	McKinsey: 88% of orgs use AI (up from 78%); 62% experimenting with AI agents
Foundation model proliferation (LLM providers)	Driver	↑	High	2024-2027	OpenAI, Meta, Anthropic, Mistral, Cohere all require RLHF + evaluation data
U.S. government AI investment growth	Driver	↑	High	2025-2030	DoD AI strategy; Scale clearances; Donovan platform; active DoD contracts
AI safety regulatory pressure (EU AI Act, US EO)	Driver	↑	Medium	2025-2027	Regulatory mandates increase demand for AI evaluation and audit services
Synthetic data substitution for human annotation	Constraint	↓	Medium	2026-2029	AI labs increasingly use synthetic + model-generated data for some training tasks
Customer concentration risk post-Meta deal	Constraint	↓	High	2025-2026	Google and OpenAI (major customers) departing; replaces large revenue unknown
Low-cost competition (Appen, offshore vendors)	Constraint	↓	Medium	Ongoing	Commodity annotation margin pressure; Scale must maintain quality premium
Enterprise AI still in pilot/POC stage	Constraint	↓	Medium	2025-2026	McKinsey: most orgs still testing AI, not scaling; enterprise platform revenue lags
Long government procurement cycles	Constraint	↓	Low	Ongoing	Federal acquisition timelines create revenue lumpiness despite strong positioning

Direction and magnitude are analyst assessments based on public evidence cited. No quantified revenue impact estimates are available from public sources.

FM004: Enterprise AI Adoption Funnel — Scale AI's Customer Journey

Adoption stages from initial AI awareness to deep Scale integration, with estimated population at each stage to illustrate market conversion dynamics.

Percentages are illustrative estimates of the Fortune 1000 segment based on McKinsey AI adoption data (88% using AI, ~1/3 scaling). Pipeline and customer percentages are estimated from Scale's disclosed customer references; actual numbers are not disclosed.

[CM019, CM020, CM021, CM022, CM023]

2.5 Sizing Gaps, Contradictory Estimates, and Diligence Asks

Market sizing for AI data services is subject to significant structural uncertainty. No public analyst report provides a consistent, granular breakdown of the AI data annotation addressable market. Estimates range widely depending on whether the boundary includes only human-labeled data services, or extends to AI model evaluation, enterprise AI tooling, and government AI platform spending. The wide range ($5B–$20B TAM in this chapter) reflects genuine analyst disagreement and market boundary ambiguity, not measurement error. The synthetic data market complicates sizing: some projections assume synthetic data will largely replace human annotation for many tasks within 3-5 years, dramatically shrinking the TAM for companies like Scale. Others argue that human evaluation, RLHF, and adversarial red-teaming will always require human judgment and scale with model proliferation. This is a thesis-level uncertainty that prospective investors must resolve through primary research with AI labs and Scale customers. Scale AI's disclosed traction metrics (15B decisions labeled, $1B paid to contributors) provide volume signals but not revenue insights. Without revenue disclosure, it is impossible to verify which segment dominates Scale's revenue mix or whether the enterprise platform and government segments can replace declining AI lab revenue. The departure of Google and OpenAI as major customers — assuming they were top 10 contributors to Scale's revenue — creates a size-unknown revenue gap that the enterprise and government pivots must fill. Quantifying the revenue impact of these departures is the single most important unresolved market sizing question for Scale's investment case. [CM027, CM028, CM029, CM030, CM031]

2.6 Exhibits

Chapter 03

03Competitors

3.1 Competitive Landscape Overview

Scale AI competes across multiple overlapping segments of the AI data and infrastructure market, each with distinct competitor dynamics. In the core AI data annotation segment, Scale faces competition from Appen (ASX-listed, the only publicly traded direct comparable), SuperAnnotate, Labelbox, and Surge AI. For RLHF and LLM training data specifically, Surge AI and Mercor are the most focused direct competitors. For enterprise model evaluation and benchmarking, Labelbox's evaluation suite and Snorkel AI's programmatic labeling are adjacent threats. For enterprise GenAI platform deployment, Scale faces competition from hyperscalers (AWS Bedrock, Azure AI, Google Vertex AI), boutique AI consultancies, and system integrators — significantly larger and better-resourced opponents. The competitive landscape also includes incumbents in adjacent spaces: traditional data outsourcing firms (Accenture, Capita), crowdsource annotation platforms (Amazon Mechanical Turk, Remotasks), and AI lab internal annotation teams who represent the "internal build" substitute. Likely new entrants include larger technology companies building annotation and evaluation capabilities in-house, and specialist boutique firms entering from academic or domain-specific AI. A critical competitive dynamic is the Meta strategic investment: by becoming Scale's largest investor, Meta has simultaneously created a conflict of interest that has already led Google and OpenAI — Scale's two largest AI lab customers — to exit or reduce their relationships with Scale. This represents an unusual competitive situation where Scale's own investor is causing customer attrition at its largest commercial segment. Mercor, a newer and smaller competitor, is actively attempting to exploit this vulnerability through direct customer solicitation, as evidenced by Scale's September 2025 lawsuit. [CP001, CP002, CP003, CP004, CP005]

Competitor Profile Table
Competitor	Stage / Scale	Target Customer	Product Scope	Funding / Revenue Proxy	Strategic Direction
Appen (ASX: APX)	Publicly listed; ~$300M rev proxy (declining)	Enterprise + government; global	Image, text, speech, video annotation; evaluation	Public; ASX-listed; declining revenue	Stabilize via enterprise AI; reduce crowdsource dependence
Labelbox	Private; Series B+ est. ~$100M raised	Enterprise AI teams; mid-market to F500	Annotation + evaluation + RLHF + robotics + leaderboards	~$188M raised (est.); not public	Full-stack AI data platform; expanding into RLHF + evaluation
Snorkel AI	Private; Series C+ est. ~$135M raised	Enterprise (F500); government	Programmatic labeling; weak supervision; AI-assisted annotation	~$135M raised (est.); not public	Reduce human annotation cost via AI; enterprise platform SaaS
SuperAnnotate	Private; early-stage; ~$14M raised	Enterprise teams; computer vision focus	Collaborative annotation platform; security features; multi-modal	~$14M raised (est.); not public	Enterprise CV annotation; security-first; expanding NLP/multimodal
Surge AI	Private; small; founded 2020	AI labs; RLHF-focused; premium quality	Expert RLHF data; LLM feedback; high-quality annotators	Small; not public	Premium RLHF data; compete on quality with smaller team
Invisible Technologies	Private; growth stage; ~$20M raised est.	Enterprise operations; AI automation	AI-powered operations; data processing; annotation as part of broader ops	~$20M raised (est.)	AI-powered enterprise operations beyond just annotation
Mercor	Private; early; <$50M raised est.	AI labs; enterprise; Scale AI customers (targeted)	AI talent marketplace; RLHF; evaluation; annotation	Early stage; not public	Disrupt Scale by targeting its customers; lawsuit pending

Funding and revenue estimates for private competitors are derived from press coverage and competitor website descriptions; not verified. Appen revenue from ASX filings (publicly available as proxy for annotation market).

3.2 Competitor Profiles

Appen (ASX: APX) is the only publicly traded direct comparable to Scale AI's annotation business. Appen provides AI training data including image, video, speech, text, and document annotation for global enterprise and government customers. Appen's publicly reported revenues have been declining as the AI annotation market shifts toward premium and LLM-specialized services. Appen serves enterprise and government customers globally, including some overlap with Scale's customer base. Appen competes primarily on breadth and cost-effectiveness rather than Scale's quality-premium positioning. Labelbox is a San Francisco-based data labeling and model evaluation platform targeting enterprise AI teams. Labelbox has expanded from core annotation into RLHF data collection and model leaderboards, making it a direct competitor across multiple of Scale's product lines. Labelbox's pricing is more accessible than Scale's enterprise contracts, and it has built an expert network (Labelbox Expert Network) for quality-critical annotations. Labelbox offers specific products for robotics AI training, which is a market Scale has not publicly emphasized. Snorkel AI focuses on programmatic data labeling using AI-assisted weak supervision techniques to reduce the human-in-the-loop bottleneck. Snorkel targets enterprise customers who need to build AI training datasets without extensive manual labeling. Snorkel's approach differs fundamentally from Scale's human-expert model — Snorkel aims to reduce the need for human annotation, while Scale's model depends on human quality. Snorkel's customers include large enterprises and some government agencies. SuperAnnotate is an AI annotation platform targeting enterprise teams with collaborative annotation workflows, quality management, and ML pipeline integrations. SuperAnnotate provides security features relevant to enterprise compliance and has expanded into computer vision, NLP, and multimodal annotation. Mercor is a newer entrant operating an AI talent marketplace for RLHF, model evaluation, and data labeling, founded by ex-Scale contributors. Scale filed a lawsuit against Mercor in September 2025 for alleged customer poaching, suggesting Mercor is actively targeting Scale's enterprise customer base. Surge AI (now part of a broader RLHF data ecosystem) focused specifically on high-quality human feedback data for LLM training, with an expert annotator network similar to Scale's but smaller in scale. Invisible Technologies offers AI-powered business operations and data services, competing with Scale's enterprise automation and annotation capabilities. [CP006, CP007, CP008, CP009, CP010, CP011]

Feature / Capability Matrix
Capability	Scale AI	Appen	Labelbox	Snorkel AI	SuperAnnotate	Mercor
Text / NLP annotation	✓ Advanced	✓ Broad	✓ Advanced	✓ AI-assisted	✓ Multi-modal	✓ RLHF-focused
Image / CV annotation	✓ Advanced	✓ Broad	✓ Advanced	✓ Programmatic	✓ Specialized	Limited
RLHF / LLM training data	✓ Core product (Scale RLHF)	Limited	✓ RL-Data product	Partial	Limited	✓ Core focus
Model evaluation + leaderboards	✓ Scale Evaluation + Leaderboard	✓ Evaluation product	✓ Evals product + Leaderboards	Limited	Limited	Limited
Government / defense clearances	✓ DoD IL4, FedRAMP High	Partial (some gov work)	Not disclosed	Not disclosed	Not disclosed	Not disclosed
GenAI platform / enterprise AI apps	✓ Scale GenAI Platform	No	No	No	No	No
Defense AI agents platform	✓ Donovan	No	No	No	No	No
Programmatic / AI-assisted labeling	Partial	No	Limited	✓ Core strength	Partial	No
Self-serve / pay-as-you-go tier	✓ Yes	No	✓ Yes	✓ Yes	No	No

Capability assessments are based on publicly available product pages and descriptions. Absence of disclosure does not confirm absence of capability. Sources: company websites accessed 2026-05-09.

3.3 Capability, Pricing, and Regulatory Comparison

Scale AI's most significant competitive differentiators are in three areas: (1) government-grade security certifications (DoD IL4, FedRAMP High) that no publicly disclosed competitor has matched, creating a near-exclusive position in classified and defense AI data markets; (2) Scale Evaluation's model safety benchmarking and the Scale Leaderboard, which have established reputational authority as a trusted third-party evaluator for LLM capabilities; and (3) the Donovan defense AI agent platform, which has no direct documented competitor in the cleared AI agent space. In core annotation, Scale's quality premium is its primary differentiator against lower-cost competitors. The proprietary Scale Data Engine with its quality feedback loops, expert contributor network (having paid over $1B to contributors globally), and annotation tooling is difficult to replicate quickly. However, competitors including Labelbox and Snorkel AI have invested significantly in quality management workflows, narrowing this gap. On pricing, Scale occupies the premium tier: enterprise contracts with custom pricing and dedicated operations teams. Appen and SuperAnnotate offer lower price points, making them more accessible to mid-market customers who may not need Scale's quality guarantee. Labelbox offers tiered pricing including a self-serve option, making it a direct competitor for Scale's self-serve tier as well as enterprise contracts. Regulatory and trust posture is a key battleground. Scale's government clearances and regulatory commitments (White House AI safety commitments, WMDP benchmark, Congress testimony) position it as the trusted government AI data vendor. Appen has some government work but lacks Scale's defense-specific clearance depth. Labelbox and Snorkel do not appear to have published equivalent government security certifications. This is Scale's most durable competitive advantage and is highly defensible due to the multi-year process required to obtain DoD IL4 and FedRAMP High certifications. [CP015, CP016, CP017, CP018, CP019, CP020]

Pricing / Packaging Comparison
Competitor	Pricing Model	Entry Point	Enterprise Tier	Differentiator
Scale AI	Enterprise custom + self-serve pay-as-you-go	1,000 units free; then per-unit	Custom pricing, dedicated ops, SLA	Quality guarantee; government clearances
Appen	Project-based; volume pricing	Public request; custom quotes	Enterprise contracts; global delivery	Global crowd; low cost; public-company transparency
Labelbox	Tiered SaaS + usage; free tier available	Free plan; Developer plan; Enterprise	Enterprise custom + professional services	Platform integration; evaluation features; lower price
Snorkel AI	Enterprise SaaS; no public pricing	Enterprise contract only	Enterprise subscription + services	AI-assisted labeling reduces volume cost
SuperAnnotate	SaaS subscription; team tiers	Team plan; Enterprise plan	Enterprise with on-prem option	Security-first; collaborative workflow
Surge AI	Project-based; expert premium	Custom project quotes	High-quality expert RLHF contracts	Expert annotator quality for LLM training
Mercor	Talent marketplace; per-task or subscription	Marketplace self-serve	Enterprise placement + managed RLHF	Expert network; AI talent marketplace model

Pricing based on publicly available pricing pages and descriptions. Scale AI, Snorkel AI, and Mercor enterprise pricing require custom quotes.

FP001: Competitive Positioning Map — Quality vs. Government Clearance

Scale AI's relative positioning versus key competitors across the quality/premium and government-clearance dimensions that define its most defensible market position.

Positions are qualitative analyst estimates based on public product/capability descriptions. No empirical quality benchmarks publicly available.

[CP015, CP016, CP017, CP018, CP019]

FP002: Aggregate Capability Score — Scale AI vs. Top Competitors (7-Dimension, 35-Point Scale)

Aggregate capability scores across seven AI data service dimensions (annotation, RLHF, evaluation, government clearance, GenAI platform, defense agents, self-serve) illustrating Scale AI's breadth advantage over direct competitors.

Aggregate of 7 capability dimensions scored 1-5 each (max 35). Scale AI's government clearance and defense agent scores (5/5 each) drive outperformance. Scores are qualitative assessments from public product pages.

[CP015, CP016, CP017, CP018, CP019, CP020]

3.4 Switching Cost, Lock-in, and Multi-Homing

Switching costs in Scale AI's markets vary dramatically by segment. For AI labs using Scale primarily for annotation and RLHF data, switching costs are moderate: the annotation pipeline is fungible to some degree, and labs have demonstrated willingness to switch (Google and OpenAI departing in 2025). This is precisely why the Meta investment conflict-of-interest created such rapid customer attrition — switching costs were not high enough to retain customers facing a conflict of interest with their data vendor's new investor. For enterprise customers using Scale's GenAI Platform, switching costs are higher: the platform involves data migration, workflow integration with enterprise systems, customization of data pipelines, and organizational knowledge transfer. Comparable to mid-market SaaS platform switching costs (6–18 months). For U.S. government and defense customers, switching costs are extremely high. Transitioning to a new vendor requires the new vendor to obtain equivalent security clearances (DoD IL4 and FedRAMP High authorization is a 12–36 month process), rebuild institutional knowledge, and navigate federal acquisition regulations. This creates a durable, multi-year lock-in for Scale's defense segment. Multi-homing is common in the annotation market: many large organizations (AI labs, enterprises) run annotation work through multiple vendors simultaneously for quality comparison, cost optimization, and redundancy. This means Scale may not have exclusive relationships even with named customers. Scale's self-serve tier explicitly encourages multi-homing (low commitment, pay-as-you-go). Labelbox's expert network and Surge's quality focus suggest they are positioned to capture multi-homed annotation spend from Scale's customers. Distribution power and supply access: Scale's contributor network (paid $1B+ globally) is a proprietary supply of human annotators. Competitors must build equivalent networks to compete on quality and throughput. Mercor, which operates an AI talent marketplace, is attempting to build an alternative expert contributor supply chain that directly competes with Scale's contributor network. [CP021, CP022, CP023, CP024, CP025]

Moat Durability / Competitive Risk Register
Moat / Risk	Scale AI Position	Durability	Key Threat	Diligence Signal
Government clearances (DoD IL4, FedRAMP High)	Unique among disclosed AI data vendors	Very High	Years for competitor to replicate	Active DoD contracts confirm commercial value
Quality premium in annotation	Industry-leading claimed; $1B+ paid to contributors	Medium	Labelbox, Surge narrowing quality gap; lower cost competitors	Customer NPS and win/loss data not public
RLHF leadership	Core product; Scale RLHF used by top labs historically	Medium	OpenAI and Google departed; Surge, Mercor compete on RLHF	Loss of top-2 RLHF customers is material
Evaluation benchmarking (Leaderboard, WMDP)	Trusted third-party evaluator position	High	Labelbox Leaderboards; academic benchmarks	Growing regulatory demand for independent eval
Donovan defense AI agents	No disclosed direct competitor in cleared space	Very High	Long-term: large defense contractors may enter	Active contract win is evidence of value
Enterprise platform (GenAI Platform)	Differentiated but faces hyperscaler competition	Medium	AWS, Azure, GCP have significantly more resources	Enterprise customer retention data not public
Commoditization of annotation	Core risk: annotation increasingly commoditized	Low (risk)	Appen, Snorkel, offshore; synthetic data threatens TAM	Layoffs in data-labeling confirm management awareness
Meta investor conflict	Existential risk to AI lab customer retention	Low (risk)	Google and OpenAI already departed	CNBC and TechCrunch reporting confirmed

Durability assessments are analyst judgments based on public evidence. 'Very High' durability means 3+ year competitive advantage expected; 'Low (risk)' indicates active threat to competitive position.

3.5 Moat Durability, Commoditization Risk, and Adverse Evidence

Scale AI's competitive moat is strongest in government/defense AI and weakest in commodity data annotation. The government franchise (DoD IL4, FedRAMP High, Donovan, active defense contracts) is highly durable because clearances take years to obtain and the institutional knowledge embedded in defense AI workflows cannot be rapidly replicated. This represents a genuine, multi-year competitive advantage. In the core annotation market, Scale's moat is eroding. The departure of Google and OpenAI as customers — Scale's two largest commercial relationships — demonstrates that the quality premium alone does not create unbreakable lock-in for AI lab customers. Competitors are narrowing the quality gap, and lower-cost providers with adequate quality (Appen, SuperAnnotate, offshore teams) continue to win annotation work at price-sensitive enterprises. The commoditization risk is real and accelerating. As AI foundation model training matures and synthetic data becomes more capable, the TAM for human-labeled annotation could shrink materially. Competitors like Snorkel AI are building annotation tools that reduce human labor requirements per output unit. If this trend continues, the annotation market will bifurcate: a large commodity segment dominated by low-cost providers and a smaller premium segment for expert evaluation and government-grade work where Scale has a stronger position. Adverse competitive evidence: (1) Google and OpenAI departures post-Meta deal demonstrate customer concentration risk and insufficient switching costs in the AI lab segment; (2) Mercor's active customer poaching (per Scale's lawsuit) suggests competitor perception that Scale customers are vulnerable; (3) Appen's declining revenues serve as a cautionary indicator that the pure annotation segment is facing structural headwinds; (4) Scale's July 2025 layoffs specifically targeted data-labeling headcount, acknowledging over-investment in the commoditizing segment. [CP026, CP027, CP028, CP029, CP030, CP031]

FP003: Moat / Readiness KPIs — Scale AI Competitive Position

Key competitive position indicators summarizing Scale AI's moat strength, vulnerability points, and diligence gaps.

[CP026, CP027, CP028, CP029, CP030, CP031]

3.6 Exhibits

Chapter 04

04Financials

4.1 Revenue Model and Revenue Streams

Scale AI generates revenue across three primary streams: (1) enterprise data annotation and RLHF services, (2) the Scale GenAI Platform (enterprise AI application development), and (3) U.S. government and defense contracts through Donovan and its Public Sector Data Engine. A self-serve tier provides a fourth, smaller revenue stream for research and experimental projects. The enterprise data annotation segment has historically been Scale's largest revenue driver, powering growth from 2016 through the 2024 Series F. This segment charges enterprises and AI labs project-based or volume-based rates for annotation, RLHF data collection, and model evaluation. The pricing model is custom and opaque — Scale does not publish enterprise pricing for its annotation services. The self-serve tier offers 1,000 labeling units free, then charges per unit thereafter, a freemium model designed to land smaller accounts. The Scale GenAI Platform represents a higher-margin, SaaS-oriented revenue stream targeting enterprises seeking to deploy custom AI applications. This segment serves Fortune 500 companies across industries including retail (Etsy, Instacart) and media (TIME). Scale custom-builds LLM-powered applications for enterprise clients, integrating with their proprietary data and workflows. This stream has the potential for subscription and managed-service components, though pricing details are not publicly disclosed. Government and defense contracts constitute a third, growing, and strategically important revenue stream. Scale holds DoD IL4 and FedRAMP High clearances, enabling it to pursue classified and defense-grade AI data contracts. The Donovan platform (defense AI agents) and the Scale Public Sector Data Engine target a defense market where pricing is typically non-public, based on federal procurement schedules. The DIU RCV program win and active DoD data curation contract confirm commercial relevance. This segment is likely smaller than commercial annotation today but potentially larger and more durable given switching costs. [CI001, CI002, CI003, CI004, CI005]

Revenue Streams Table
Revenue Stream	Mechanism	Unit / Pricing Model	Status	Revenue Quality	Diligence Ask
Enterprise data annotation	Project-based volume annotation; RLHF; evaluation for AI labs + enterprises	Custom pricing per project / volume unit; enterprise SLA contracts	Active but attriting (Google and OpenAI departed 2025)	Medium: recurring project orders but no contractual subscription; customer concentration risk	Revenue by customer segment; top 5 customer concentration; NRR by cohort
Scale GenAI Platform	Enterprise SaaS + managed services for custom GenAI apps; LLM customization for enterprises	Custom enterprise subscription + professional services; no public pricing	Active; strategic growth priority per Droege July 2025 memo	High potential: platform stickiness; lower labor intensity than annotation	ARR, ACV, customer count, gross margin for platform segment
Government / defense contracts	Defense AI data labeling, evaluation, and Donovan platform for DoD, IC, civilian agencies	Federal procurement schedules; contract-based; non-public	Active; growing; DoD IL4 + FedRAMP High unlocks classified AI data market	High: long-term contracts, high switching costs, clearance-based moat	Contract value, pipeline size, ceiling for IDIQ or other vehicles
Scale RLHF	Human feedback data collection for LLM alignment; expert annotator network	Volume-based pricing per feedback instance or project; enterprise contract	Active but affected by OpenAI wind-down; Meta RLHF relationship expanding	Medium: labor-intensive; margin pressure from competition	RLHF revenue by customer; Meta RLHF agreement scope
Self-serve Data Engine	Pay-as-you-go annotation for research, academic, and experimental users	1,000 units free; per-unit pricing for additional; no enterprise SLA	Active; small revenue contribution; pipeline / lead-gen function	Low-medium: high volume but low ACV; margin depends on automation	Self-serve revenue, conversion rate to enterprise, margin

Revenue classifications and status are based on public product pages, press reports, and analyst inference. No public revenue disclosure. Status reflects 2025 post-Meta-investment dynamics.

[CI001, CI002, CI003, CI004, CI005]

FI001: Revenue Model Bridge — Customer Activity to Gross Profit

How customer engagement converts into revenue, cost of revenue, and gross profit across Scale AI's three primary revenue segments.

Revenue and gross margin figures are analyst estimates. Scale AI does not disclose financial statements. Estimates based on Appen public gross margins, industry benchmarks, and headcount proxies.

[CI001, CI002, CI003, CI011]

4.2 Pricing, GTM Motion, and Sales Efficiency

Scale AI's go-to-market motion is primarily enterprise sales-led, supplemented by a self-serve product-led growth tier. The enterprise tier relies on a dedicated sales team, relationship-driven deals, and solution engineers who work with customers to scope and deliver large annotation or platform projects. Enterprise contracts are custom-quoted with dedicated operational support and SLAs, indicating high average contract values and longer sales cycles relative to typical SaaS companies. The pricing for annotation services is volume-based and project-specific. Scale's self-serve tier establishes a list-price anchor: units are available at a disclosed per-unit rate after the first 1,000 free units. Enterprise customers negotiate pricing for large-volume annotation programs that may include dedicated annotator teams, quality assurance workflows, and platform integrations. No discounts or typical deal sizes are publicly disclosed. Sales cycle length and customer acquisition cost (CAC) are not publicly disclosed. As a data services and infrastructure company, Scale likely experiences long enterprise sales cycles (3–12 months) with significant professional services components. Government contract sales cycles are even longer (often 12–36 months from procurement initiation to first revenue). Net revenue retention (NRR) is unverifiable — the Google and OpenAI departures in 2025 represent a severe NRR compression event, but the full financial impact is unknown. Distribution channels include: direct enterprise sales, government procurement relationships (DoD, IC), partnerships with AI lab providers, and self-serve API/platform access for developers and researchers. Scale's relationships with major investors (Accel, Amazon, Meta, Nvidia, Cisco) create strategic co-sell and referral channels, though the specific economic impact is not disclosed. Scale's data-labeling business had revenue concentration among a small number of large AI labs (Google, OpenAI, Meta, Anthropic). The departure of Google and OpenAI in 2025 following the Meta investment represents a material GTM and concentration risk that directly compresses revenue in the annotation segment. [CI006, CI007, CI008, CI009, CI010]

Pricing / Monetization Table
Product	List / Entry Pricing	Enterprise Pricing Model	Key Unknowns	Source
Scale Data Engine (self-serve)	First 1,000 labeling units free; per-unit pricing above 1,000 units	N/A — self-serve only tier; enterprise uses custom contracts	Per-unit rate not publicly disclosed; volume discount structure unknown	scale.com/pricing (official)
Scale GenAI Platform (enterprise)	Not publicly disclosed; custom enterprise quote only	Enterprise subscription + professional services; dedicated ops included	Typical ACV unknown; SaaS vs. managed-service revenue mix unknown	scale.com/generative-ai-data-engine (official); docs.scale.com
Scale RLHF (enterprise)	Not publicly disclosed; project-based custom pricing	Custom project or volume contract; no public rate card	Per-task rate, project floor, and volume discount structure unknown	scale.com/rlhf (official)
Scale Evaluation (enterprise)	Not publicly disclosed; custom pricing for model developers + public sector	Enterprise evaluation contract; likely per-model or per-benchmark-run pricing	Evaluation pricing model, typical deal size, margin unknown	scale.com/evaluation/model-developers (official)
Donovan (government)	Not publicly disclosed; federal procurement schedules	Government contract pricing (IDIQ, FFP, or T&M); cleared program	Total contract value, ceiling, and existing pipeline not public	scale.com/donovan (official); DoD contract blog post

All pricing for enterprise and government tiers is custom and not publicly disclosed. Self-serve pricing entry is confirmed by public pricing page. This table reflects list-pricing structures, not realized revenue or margin.

[CI006, CI007]

FI002: Unit Economics Bridge — Annotation Segment (Estimate)

Illustrative unit economics flow for Scale AI's enterprise annotation segment showing customer acquisition, contract lifecycle, and retention drivers — all estimates given absence of public data.

ACV, win rates, CAC, and NRR are all unknown (private data). Flow structure is inferred from public GTM descriptions and industry benchmarks. NRR directional estimate based on Google and OpenAI departure events.

[CI007, CI008, CI009, CI013]

4.3 Cost Structure, Gross Margin, and Capital Intensity

Scale AI's cost of revenue is dominated by human labor — the annotation contributors who perform data labeling, RLHF feedback collection, and model evaluation tasks. Scale has paid over $1 billion globally to its contributor network, confirming the labor-intensive nature of the core annotation business. This human labor cost creates a structurally lower gross margin for annotation relative to pure software businesses. Industry proxies from Appen (the publicly traded annotation comparable) suggest gross margins for annotation services in the 25–45% range, declining as the market commoditizes. The Scale GenAI Platform and evaluation products should carry higher gross margins than annotation services, as they involve more software leverage and less per-unit human labor. However, GenAI Platform projects at the enterprise level still require significant professional services and operational effort, capping margins below pure SaaS benchmarks. Government contracts typically carry margins in the 15–30% range for data services, constrained by procurement rate structures. Operating expenses include engineering and R&D (building annotation tooling, the GenAI Platform, Donovan, and evaluation infrastructure), sales and marketing (enterprise sales team, government BD), and general and administrative. Scale had approximately 1,400 employees before the July 2025 layoffs and approximately 1,000 after, suggesting a significant OpEx reduction from the 14% headcount cut plus 500 contractor reductions — primarily targeting the data-labeling cost base. This is consistent with improving blended margins by shedding the lower-margin commodity annotation labor force. Capital intensity in Scale's model is primarily working-capital-based (labor payments to contributors) rather than physical capex (hardware, facilities). Infrastructure costs (cloud compute for annotation tooling and model evaluation) are a secondary capex factor. This makes Scale relatively capital-efficient compared to hardware AI companies but more labor-intensive than pure SaaS. The $1 billion Series F and Meta strategic investment have provided substantial capital to fund the pivot toward higher-margin enterprise and government services. [CI011, CI012, CI013, CI014, CI015]

Unit Economics Table
Metric	Value / Estimate	Confidence	Why It Matters	Diligence Ask
Average Contract Value (enterprise annotation)	Estimated $1M–$5M+ per engagement	Low (analyst estimate)	Drives revenue scale; indicates enterprise buyer depth	Request deal size distribution from CFO; analyze Appen disclosures as proxy
Gross Margin (annotation segment)	Estimated 25–45%	Low (Appen proxy; analyst estimate)	Core margin driver; declining with commoditization	Request P&L by segment; compare Appen disclosed gross margins
Gross Margin (GenAI Platform)	Estimated 45–65%	Low (SaaS + managed-services proxy)	Platform economics if recurring SaaS dominates	Request segment margin breakdown; subscription vs. services revenue split
Gross Margin (government/defense)	Estimated 15–30%	Low (federal services industry proxy)	Cap on margin in procurement-constrained segment	Review contract type (T&M vs. FFP) and pricing structure
Net Revenue Retention (NRR)	Likely <100% in 2025 (Google + OpenAI departure)	Low — estimated from directional evidence	Signals revenue durability and customer health	Request NRR by cohort and segment; Google/OpenAI revenue impact
Customer Acquisition Cost (CAC)	Not disclosed; estimated high for enterprise/government	Unknown — no proxy available	Efficiency of growth spend; payback period	Request CAC by segment; sales headcount and quota attainment
Employee Revenue per Head (proxy)	Est. $200K–$500K revenue/employee (if $300M ARR, 800 employees)	Very low — both inputs estimated	Capital efficiency benchmarking against data services comps	Requires confirmed headcount + revenue; request in due diligence
Annual Burn Rate (pre-layoff)	Estimated $400M–$700M/year (1,400+ employees + annotators)	Low (headcount-based proxy)	Capital runway and cost efficiency	Request monthly cash flow statements; P&L operating expense breakdown

All unit economics are estimated or unavailable. Scale AI does not disclose financial statements. Estimates derived from Appen public financials, industry benchmarks, and headcount-based proxies. Confidence is uniformly low; replace with actuals in due diligence.

[CI011, CI012, CI013]

FI003: Financial Estimate Range — Key Metrics (Low / Mid / High)

Analyst estimate ranges for Scale AI's key financial metrics derived from public proxies, Appen comparable, headcount proxies, and disclosed funding data. All estimates carry very low confidence.

Revenue estimates derived from: headcount-based proxy (1,000 employees × ~$350K revenue/employee), Appen comparable revenue-per-employee analysis, and independent analyst commentary. Burn estimates derived from headcount × all-in cost proxy. All estimates are illustrative only and not confirmed by management.

[CI011, CI012, CI013, CI018, CI019, CI020]

4.4 Capital Adequacy and Financing History

Scale AI has raised approximately $1.6 billion+ in external capital. The funding history includes: seed and early rounds from 2016–2019 (including Y Combinator, Index Ventures, Founders Fund); a 2021 Series E of $325 million at approximately $7.3 billion valuation; approximately $600 million total pre-Series F; and the May 2024 Series F of $1 billion at $13.8 billion valuation led by Accel with participation from Amazon, Meta, Cisco, Intel, AMD, ServiceNow, Nvidia, DFJ Growth, WCM, and returning investors including Tiger Global, Thrive, Greenoaks, Wellington, Nat Friedman, and Elad Gil. The Series F included both primary (new cash to company) and secondary (liquidity to existing shareholders) components. The June 2025 Meta strategic investment was approximately $14.3 billion for a minority stake (approximately 49% of outstanding equity). Critically, the majority of Meta's investment proceeds were distributed to existing shareholders and vested equity holders — not to the company's operating treasury — consistent with a secondary transaction structure. The net new primary capital to Scale's operating business from the Meta deal is not fully disclosed. Scale maintains an independent corporate structure; Meta holds a minority stake. Without public financial statements, cash on hand is unknown. The Series F closed in May 2024, and if a significant portion was primary capital ($500M+ to the company), combined with the Meta deal's primary component, Scale likely has adequate runway of 3+ years at current burn rates. The July 2025 layoffs of 200 employees and 500 contractors, plus the strategic pivot away from data-labeling, are cost-reduction measures consistent with extending runway and improving cash efficiency. Burn rate is not publicly disclosed. Scale's pre-layoff expense base (1,400+ employees, significant annotator contractor costs) suggests a substantial monthly burn. Post-layoff, cash efficiency should improve materially. Scale has no publicly disclosed debt facilities or project-finance obligations. The Meta deal created structural dependencies (Meta as both investor and key customer for related AI services) but no disclosed economic covenants or restrictions. [CI016, CI017, CI018, CI019, CI020]

Capital Adequacy Table
Item	Value / Estimate	Source / Basis	Confidence	Notes
Total capital raised	~$1.6B+ (pre-Series F ~$600M + $1B Series F)	TechCrunch Series F reporting (May 2024)	High (confirmed by multiple high-rep sources)	Mix of primary + secondary; exact primary allocation unknown
Series F close	May 2024; $1B at $13.8B valuation; Accel lead	TechCrunch / CNBC reporting	High (multiple corroborating sources)	Includes new investors Amazon, Meta, Cisco, Intel, AMD, ServiceNow
Meta strategic investment	~$14.3B for minority stake (~49%); valuation >$29B	CNBC June 2025 reporting	High (multiple corroborating sources)	Primarily secondary (existing shareholder liquidity); primary portion unknown
Net primary capital from Meta deal	Unknown — primarily secondary distribution	CNBC/TechCrunch reporting; Scale blog	Low (inferred from secondary structure language)	Meta holds minority equity; proceeds largely to shareholders
Estimated cash on hand (post-2025)	Estimated $500M–$1B+	Analyst estimate from Series F primary + partial Meta primary	Very low (estimate only)	No public financial statements; estimate only
Monthly burn rate (post-layoff)	Estimated $25M–$50M/month (post-July 2025 restructuring)	Analyst estimate from headcount × cost-per-employee proxy	Very low (estimate only)	Pre-layoff burn materially higher; post-restructuring improving
Estimated runway	Estimated 24–48 months from July 2025 (if $500M–$1B cash, $25M–$50M burn)	Derived estimate from cash and burn estimates	Very low (both inputs estimated)	Runway estimate requires confirmed primary capital from Meta deal
Disclosed debt / credit facilities	None publicly disclosed	Public sources review	Medium (no evidence of public debt)	Private credit or revenue-based financing possible but undisclosed

Funding amounts for Series F and Meta investment are confirmed by high-reputation news sources. Cash on hand, burn rate, and runway are analyst estimates with very low confidence. Due diligence must obtain audited financial statements.

[CI016, CI017, CI018, CI019]

FI004: Capital Intensity / Cash-Flow Map — Funding Inflows vs. Operating Outflows

Illustrative capital flow showing major fundraising inflows and estimated operating outflows from 2021 through 2026, highlighting the funding adequacy and burn context for the business model pivot.

All values are analyst estimates. Primary vs. secondary split of Series F and Meta deal is not disclosed. Series E $325M may include both primary and secondary. Operating spend proxy based on headcount and industry benchmarks. The waterfall is illustrative, not an audited cash flow statement.

[CI016, CI017, CI018, CI019]

4.5 Financial Verdict — Revenue Quality, Margin Path, and Diligence Blockers

Scale AI presents a complex financial profile: abundant capital, an unclear revenue trajectory, and a management team executing a high-stakes business model pivot under conditions of significant customer attrition. Revenue quality is concerning. The data-labeling segment — historically Scale's largest revenue driver — is commodity-positioned and facing structural headwinds (Appen's declining public revenues are a leading indicator), customer attrition (Google and OpenAI departing), and direct competition. The GenAI Platform and government/defense segments offer higher revenue quality (longer contracts, stronger lock-in, less commoditization risk) but their current contribution to total revenue is unknown and likely smaller. Net revenue retention post-2025 is likely below 100% in the AI lab segment. Margin trajectory is directionally positive if the pivot succeeds: shedding the low-margin data-labeling workforce and growing the higher-margin platform and government segments would improve blended gross margins over time. However, the pivot carries execution risk, and the intermediate period (2025–2027) likely shows revenue compression before platform and government revenue scales. Capital adequacy is the strongest financial positive. Scale has raised $1.6B+ and the Meta deal provided additional liquidity to shareholders (and some primary capital). Post-layoff, the company is operating with reduced headcount and lower burn. Runway is estimated at 3+ years, sufficient for the pivot execution. Primary diligence blockers: (1) No public revenue or ARR data — valuation multiples are unverifiable; (2) Google and OpenAI attrition impact on revenue unknown; (3) government contract revenue size and growth trajectory not disclosed; (4) gross margin by segment not disclosed; (5) primary vs. secondary split of the Meta deal not confirmed; (6) Mercor lawsuit outcome could have financial implications (indemnification, customer loss). [CI021, CI022, CI023, CI024, CI025]

Public Financial Gaps Table
Missing Data	Impact on Diligence	Exact Diligence Path
Total revenue / ARR	Cannot verify valuation multiple; cannot assess growth or revenue quality	Request audited financials; obtain management accounts; confirm with Big 4 audit
Revenue by segment (annotation vs. platform vs. government)	Cannot assess business model pivot success; segment margin unknowable	Request segment P&L breakdown from CFO; separate contracts by segment
Google and OpenAI revenue impact (2025 attrition)	Largest risk to financial model: magnitude of revenue loss from largest customers unknown	Request revenue by customer; confirm Google and OpenAI departure dates and final billings
Gross margin by product line	Cannot assess margin improvement thesis or capital intensity; blended margin is opaque	Request gross profit by segment; annotator cost as % of revenue; platform margin
Net Revenue Retention (NRR)	Cannot assess customer health; recurring revenue durability unknowable	Request NRR by annual cohort (2021–2025) and by segment
Primary vs. secondary split of Meta deal	Cannot assess company's actual cash increase from Meta deal	Request closing documents for Meta investment; confirm primary capital allocation
Operating cash flow and burn rate	Cannot estimate runway or capital adequacy with confidence	Request monthly P&L and cash flow statements (Jan 2024 – present)
Employee compensation structure	Cannot assess whether talent retention after Meta deal and layoffs is preserved	Request anonymized compensation bands and equity vesting schedules
Government contract backlog and pipeline	Cannot size the defense segment or its growth trajectory	Request total DoD/IC contract backlog, option periods, and pipeline
Mercor lawsuit financial exposure	Potential financial liability from ongoing IP/trade-secret litigation	Review legal claims filed; assess settlement risk; confirm insurance coverage

All items in this table represent confirmed evidence gaps as of 2026-05-09. This table is the primary due diligence checklist for financial underwriting of Scale AI.

[CI021, CI022, CI023, CI024, CI025]

4.6 Exhibits

Chapter 05

05Product & Technology

5.1 Product Portfolio and Customer Workflow

Scale AI's product portfolio addresses four distinct customer workflow problems in AI development: (1) creating high-quality training data for model development; (2) collecting human feedback for LLM alignment and RLHF; (3) independently evaluating AI model safety and capability; and (4) deploying custom GenAI applications at enterprise scale. A fifth product line — Donovan — addresses defense and intelligence mission-specific AI workflows. The Scale Data Engine is Scale's core product and the foundation of the company. It enables AI teams to collect, curate, annotate, and quality-assure training datasets across modalities (text, image, video, audio, 3D). The customer workflow: AI engineers define annotation requirements and quality standards; Scale's contributor network executes annotation tasks guided by Scale's proprietary tooling and QA pipeline; annotated datasets are delivered back into the customer's MLOps pipeline. The Data Engine supports the full model development lifecycle from pre-training data curation to RLHF to evaluation. The Scale GenAI Platform addresses the enterprise AI deployment workflow. Enterprise customers (Fortune 500 companies across retail, media, finance, and government) bring proprietary data and domain requirements; Scale transforms this into customized LLM-powered applications. The platform includes data ingestion, LLM fine-tuning, RAG pipeline configuration, and production deployment support. This product targets enterprises that want custom AI without the deep ML expertise required to build from scratch. Scale Evaluation and the Scale Leaderboard provide independent model benchmarking services. AI labs and enterprises use Scale Evaluation to assess LLM capabilities across dimensions including reasoning, safety, coding, and domain knowledge. The Scale Leaderboard is a public ranking of LLM performance — a developer-facing tool that has established Scale as a trusted third-party evaluator in the AI community. The WMDP (Weapons of Mass Destruction Proxy) benchmark is a specialized evaluation contribution for AI safety. Donovan serves DoD, IC, and civilian government agencies with AI agents for mission-critical workflows. It integrates classified data sources, supports cleared operational environments, and provides AI-powered decision support for defense and intelligence applications. Donovan is deployed in DoD IL4-authorized environments and is Scale's primary competitive moat in the defense segment. [CE001, CE002, CE003, CE004, CE005]

Product Module / Asset Matrix
Product / Module	Primary User	Maturity / Status	Core Differentiation	Diligence Gap
Scale Data Engine	AI labs; enterprise AI teams; research	Mature (GA); core product since 2016	Proprietary QA pipeline; contributor network ($1B+ paid); multi-modal support	Annotation quality vs. competitors not independently benchmarked; NPS unknown
Scale GenAI Platform	Enterprise (F500, media, retail); government	Active; strategic growth priority 2025	Integration of annotation quality into LLM customization; managed enterprise deployment	Technical depth vs. AWS Bedrock / Azure AI not independently assessed; customer retention unknown
Scale RLHF	AI labs (Meta, Cohere, Anthropic); enterprise	Mature; core product; affected by OpenAI/Google departures	Expert annotator network for preference data; quality feedback loops for alignment	RLHF revenue concentration risk; meta relationship scope undisclosed
Scale Evaluation + Leaderboard	AI labs; enterprises; government; AI safety researchers	Active; growing; public developer tool	Trusted third-party evaluator; WMDP benchmark; government evaluation mandate	Leaderboard methodology and independence need third-party audit for credibility
Donovan (Defense AI Platform)	DoD; IC; civilian government agencies	Active; cleared production; specialized for IC/DoD	DoD IL4 certified; classified environment deployment; defense mission AI agents	Technical capabilities not publicly documented; competitive comparison with Palantir unknown
Scale Public Sector Data Engine	U.S. government; defense; intelligence community	Active; cleared production	FedRAMP High; defense-specific data curation and management	Contract size and growth pipeline not public; timeline to other agencies unknown
Self-Serve Data Engine	Researchers; startups; individual AI developers	Active; GA; lower priority post-pivot	Low barrier to entry; 1,000 units free; API access; diverse task types	Conversion rate to enterprise unknown; margin on self-serve tier not disclosed

Maturity assessments based on public product page descriptions and press coverage. Diligence gaps reflect absence of independent third-party validation for claimed capabilities.

5.2 Architecture and Operating Model

Scale AI's operating model combines software tooling with human-in-the-loop execution — a hybrid AI-assisted annotation architecture rather than a pure software or pure labor model. The core technology stack consists of: (1) a web-based annotation tooling platform for labeling tasks across modalities; (2) a quality assurance pipeline for review, reconciliation, and feedback; (3) API infrastructure exposing Scale's services programmatically to enterprise and lab customers; (4) the GenAI Platform for LLM customization workflows; and (5) the Donovan AI agent runtime for defense missions. The annotation contributor network is a proprietary, global workforce recruited, trained, quality-screened, and managed by Scale. The contributor platform (which Scale calls its "task marketplace") routes annotation jobs to appropriately skilled contributors, monitors completion quality in real-time, and escalates to expert reviewers for quality-sensitive tasks. Scale has invested in contributor quality management as a core differentiator, claiming that its proprietary QA pipeline produces annotation quality exceeding competitors. The Scale API is the primary developer-facing interface for the Data Engine, offering programmatic submission of annotation jobs, retrieval of completed datasets, and integration with ML training pipelines. The API supports all annotation task types (text classification, bounding boxes, segmentation, RLHF preference pairs, etc.) and includes webhook support for real-time job completion notification. For the GenAI Platform, Scale's operating model involves: data ingestion and preprocessing, LLM selection and prompt engineering, fine-tuning or RAG pipeline configuration, red-teaming and safety evaluation, and production deployment with monitoring. This is a managed services approach where Scale's engineers and the platform tooling jointly deliver the custom AI application. For Donovan, the architecture is specialized for cleared environments: it runs on DoD IL4-certified cloud infrastructure, integrates with classified government data systems, supports multi-modal AI agent workflows (planning, search, decision support), and provides explainability features required for military applications. Donovan's technical architecture is not publicly disclosed in detail; the operational model is closer to a managed defense IT service than a commercial SaaS product. [CE006, CE007, CE008, CE009, CE010]

Workflow / Use-Case Table
User Job	Current Workflow	Scale Solution	Measurable Benefit	Limitation
Train ML model with labeled data	Manual labeling by internal team or crowdsource platform; slow and inconsistent	Scale Data Engine: submit raw data via API; receive annotated dataset; automated QA	10x+ throughput vs. internal teams; higher consistency (claimed)	Custom pricing; AI lab customers departing post-Meta conflict
Align LLM with human preferences (RLHF)	Build internal preference collection pipeline; hire expert evaluators	Scale RLHF: expert human feedback at scale; pair annotation; preference ranking	Quality expert feedback for LLM alignment training	OpenAI and Google departed; Meta relationship expanding but concentration risk
Deploy custom GenAI app for enterprise	Build in-house with ML engineers; 12-24 month timeline typical	Scale GenAI Platform: data → custom LLM app in < 2 months (TIME case study)	TIME case study: deployed in <2 months; 7,000+ attack vectors tested	Hyperscalers (AWS, Google, Azure) offer comparable platforms at lower cost
Evaluate LLM capability and safety independently	Use academic benchmarks (MMLU, HELM); internal red-team	Scale Evaluation + Leaderboard; WMDP benchmark; expert safety evaluation	Trusted third-party scores; government-recognized evaluation authority	Independence perception risk (Meta investor could bias evaluation results)
Execute AI-powered defense missions	Manual analyst workflows; limited AI integration in classified environments	Donovan: cleared AI agents for IC/DoD decision support and mission planning	First cleared AI agents platform for DoD; no comparable disclosed competitor	Technical capabilities not fully documented; limited commercial transparency

Benefit claims for Scale GenAI Platform deployment speed are based on the TIME case study. Other claims are based on company-described capabilities on official product pages.

FE001: Product Architecture Map

Scale AI's five-layer product architecture from customer-facing interfaces through AI services, workflow execution, compliance, and infrastructure.

Architecture is inferred from public product pages, API documentation, and official security certification descriptions. Donovan classified capabilities are not publicly documented.

[CE006, CE007, CE008, CE016]

FE002: Customer Workflow / Operating Flow — Enterprise Annotation to Delivered Dataset

End-to-end customer workflow for Scale AI's core annotation product, from customer data submission through contributor execution, QA, and dataset delivery to the customer's MLOps pipeline.

Workflow is reconstructed from public API documentation, product descriptions, and the Scale Leaderboard/Evaluation public materials. Specific QA algorithms are proprietary and not publicly disclosed.

[CE006, CE007, CE008, CE009]

5.3 Differentiation, Technology IP, and Competitive Moat

Scale AI's key technological and operational differentiators span five areas. First, proprietary annotation tooling and QA methodology: Scale has invested significantly in building annotation interfaces optimized for accuracy, speed, and consistency across task types. The QA pipeline applies statistical quality controls, consensus checking, and expert reviewer escalation to maintain annotation quality standards that the company claims exceed competitors. This proprietary workflow is not open-source and represents institutional know-how embedded in operations. Second, the contributor network as supply-side IP: Scale's global contributor network (having received $1B+ in payments) is a proprietary asset. The quality screening, onboarding, skill credentialing, and task routing logic that governs this network is Scale's operational moat in the annotation segment. Competitors (including Mercor, which Scale is currently suing for alleged poaching) must build equivalent networks to compete on quality and throughput at scale. Third, government certifications as regulatory moat: Scale holds DoD IL4 Provisional Authorization and FedRAMP High Authorization. These clearances required a 2–4 year compliance investment and ongoing compliance maintenance. No disclosed AI data competitor has equivalent certifications, giving Scale a near-exclusive position in classified AI data contracts. The cost and time to replicate this is a genuine barrier to entry for government AI data. Fourth, the Scale Leaderboard and WMDP benchmark as reputational IP: The Scale Leaderboard has established Scale as the trusted third-party evaluator for LLM capabilities. The WMDP benchmark (Weapons of Mass Destruction Proxy) for AI safety evaluation has been adopted by the AI safety research community. These benchmarks are public goods that generate reputational capital for Scale as the authoritative AI evaluation authority — supporting both commercial evaluation contracts and government relationships. Fifth, the Donovan platform as first-mover advantage in defense AI agents: Donovan is the first commercially deployed, DoD IL4-cleared AI agents platform for defense mission workflows. The combination of specialized product capabilities, government relationships, and cleared infrastructure positions Donovan as a difficult-to-displace defense AI platform, particularly as the DoD increases its AI adoption budget. [CE011, CE012, CE013, CE014, CE015]

Technology / Operating Architecture Table
Layer / Component	Role	Key Dependency	Primary Risk
Annotation Tooling Platform (web UI + API)	Interface for contributor annotation task execution; customer job submission	Cloud infrastructure (AWS/GCP/Azure); contributor network	Tooling obsolescence as AI-assisted annotation reduces human need
Quality Assurance Pipeline	Statistical QC; consensus scoring; expert reviewer escalation	Contributor network quality; proprietary QA algorithms	QA methodology is proprietary and not independently audited; competitor narrowing quality gap
Scale API (annotation + GenAI + evaluation)	Programmatic access to all Scale services for enterprise + developer integration	Cloud infrastructure; API security; access control	API availability and reliability not independently benchmarked; no public status page
Contributor Network (global annotators)	Human execution of annotation, RLHF, and evaluation tasks	Global workforce management; quality screening; compensation logistics	Workforce supply chain risk; Mercor attempting to build competing network
Scale GenAI Platform (LLM customization)	Enterprise custom AI app development: RAG, fine-tuning, deployment	Cloud LLM providers (OpenAI, Anthropic, Meta); customer data security	Hyperscaler competition; proprietary LLM provider relationships may change post-Meta deal
Donovan Runtime (defense AI agents)	Cleared AI agent execution for DoD/IC missions; mission planning; search	DoD IL4 certified cloud; classified data integration; cleared personnel	Dependency on government certification maintenance; technical details not publicly disclosed
Evaluation / Leaderboard Infrastructure	LLM benchmarking; WMDP safety evaluation; public leaderboard publication	Model APIs from AI labs; independent evaluation infrastructure	Independence perception risk; Scale as Meta investor could create conflicts in LLM ranking

Architecture assessment is based on public documentation, product pages, and API references. Donovan architecture is partially inferred; classified capabilities are not publicly documented.

FE003: Critical Dependency Map

Scale AI's key external dependencies spanning infrastructure, certification bodies, customers, investors, and regulatory relationships that could affect product delivery or competitive position.

Dependency relationships are inferred from public product descriptions and press reporting. Specific contract and commercial terms are not publicly disclosed.

[CE011, CE012, CE013, CE014]

FE004: Product Maturity / Capability Map

Assessment of Scale AI's five core products across four dimensions: technical maturity, enterprise fit, competitive moat, and technical depth, scored 1-5 (5=highest).

Scores are analyst estimates based on public evidence. Maturity=1(pre-release) to 5(proven/stable). Enterprise Fit=1(not suited) to 5(core enterprise). Competitive Moat=1(commodity) to 5(near-exclusive). Technical Depth=1(basic) to 5(advanced/proprietary).

[CE011, CE012, CE013, CE014, CE015]

5.4 Trust, Safety, Compliance, and Quality Controls

Scale AI has invested heavily in trust and compliance infrastructure, particularly for government customers. The company's publicly disclosed certifications include: SOC 2 Type II (enterprise security controls audit), ISO 27001 (information security management system), DoD IL4 Provisional Authorization (Defense Department data security for controlled unclassified information and some classified), and FedRAMP High Authorization (U.S. federal government cloud security for high-impact systems including classified data handling). For AI safety, Scale has made voluntary commitments under the 2024 White House AI Safety commitments, covering RLHF safety practices, red-teaming, and responsible AI deployment. Scale contributed the WMDP (Weapons of Mass Destruction Proxy) benchmark for evaluating whether LLMs have been trained to prevent generation of dangerous content. The WMDP benchmark measures AI safety in dual-use knowledge domains, and Scale's test-and-evaluation white paper documents its approach to responsible AI model evaluation. Scale's annotation quality controls are proprietary but include: task-specific quality guidelines, multiple annotator redundancy for high-stakes tasks, statistical quality monitoring, expert reviewer escalation, and inter-annotator agreement scoring. The company claims industry-leading annotation quality — though independent third-party quality audits of Scale's annotation are not publicly available. The TIME customer case study (GenAI platform deployed in under 2 months, with 7,000+ attack vectors tested) provides partial evidence of red-teaming capability. The DIU RCV program win confirms that Scale's technology passed defense procurement requirements. These are corroborating evidence of technical capabilities but fall short of independent third-party technical audits. Privacy and data governance: Scale's data handling practices for enterprise annotation involve customer-provided data that may contain proprietary or sensitive information. Scale's security certifications (SOC 2 Type II, ISO 27001) provide some assurance of data governance controls. For government customers, DoD IL4 and FedRAMP High impose strict data handling requirements that Scale has demonstrated compliance with. Customer data handling in the annotation pipeline (particularly for AI labs with proprietary training data) is a due diligence concern that is not fully addressed by public documentation. [CE016, CE017, CE018, CE019, CE020]

Trust / Quality / Compliance Table
Control / Certification	Status	Scope	Verified By	Gap / Diligence Ask
SOC 2 Type II	Certified (confirmed)	Commercial data handling; internal security controls; enterprise annotation workflows	Third-party auditor (not named publicly)	Request current SOC 2 report; confirm scope covers annotation data pipelines
ISO 27001	Certified (confirmed)	Information security management system; global operations	Third-party certification body	Confirm certification date and scope; request ISO certificate
DoD IL4 Provisional Authorization	Certified (confirmed)	DoD Impact Level 4 data — controlled unclassified + sensitive defense data	DISA / DoD authorization body	Confirm active PA status; request ATOs for specific Donovan deployments
FedRAMP High Authorization	Authorized (confirmed)	U.S. federal high-impact systems; classified and sensitive government data	FedRAMP PMO / JAB authorization	Confirm active authorization status in FedRAMP marketplace; scope of authorized services
White House AI Safety Commitments (2024)	Committed (company-signed voluntary)	RLHF safety; red-teaming; responsible AI deployment; model evaluation	White House OSTP; voluntary, not legally binding	Confirm implementation of commitments; review Scale's published commitment tracker
WMDP Benchmark (Weapons of Mass Destruction Proxy)	Published (publicly available)	AI safety evaluation for dual-use knowledge prevention; LLM safety testing	AI safety research community adoption	Confirm WMDP adoption by other labs; independent validation of benchmark methodology
Annotation Quality Standards	Internal (proprietary)	Inter-annotator agreement; QA pipeline; task-specific accuracy standards	Internal (self-reported); no public third-party audit	Request quality audit methodology; inter-annotator agreement scores; customer NPS
Data Privacy / Customer Data Handling	Internal controls (SOC 2 covers some)	Customer proprietary data in annotation pipeline; AI lab training data confidentiality	Partially covered by SOC 2 Type II; not independently verified for annotation data	Request data handling agreements; confirm customer data isolation in annotation pipeline

Certifications confirmed by official Scale website (scale.com/legal/security). White House commitments confirmed by Scale blog post. Quality standards are based on company-described methodology only.

[CE016, CE017]

5.5 Product Roadmap, Deployment, and Open Questions

Scale AI's product roadmap as of 2025 is implicitly directed by the July 2025 strategic pivot toward enterprise and government: prioritizing Scale GenAI Platform enterprise deployments, expanding Donovan for government agencies, and growing Scale Evaluation capabilities for the emerging AI governance and compliance market. The data-labeling and annotation segment, while still operational, is expected to operate at reduced scale following the July 2025 layoffs of 200 employees and 500 contractors. Deployment and integration capabilities: Scale provides enterprise API access, webhook integration for MLOps pipelines, and the Scale GenAI Platform for application development. The self-serve portal enables rapid onboarding for annotation tasks. For government, Donovan is deployed in classified cloud environments with specialized integrations for DoD and IC systems. No public changelog or release notes are available; Scale does not maintain a public product roadmap or developer-facing status page that can be independently verified. Developer signal: The Scale Leaderboard is the most developer-facing public product, receiving significant attention from the AI research and engineering community. The WMDP benchmark has been cited in AI safety research. Scale's API documentation (at scale.com/docs) provides programmatic access specifications but the extent of open-source tooling or developer community engagement (GitHub activity, HackerNews discussions, package downloads) is limited relative to AI infrastructure companies with stronger developer ecosystems (e.g., Hugging Face, LangChain). Key open questions and product diligence gaps: (1) The GenAI Platform's technical depth relative to hyperscaler competition (AWS Bedrock, Google Vertex AI, Azure AI) is not independently validated — Scale's platform advantage relies on annotation quality integration, but hyperscalers offer equivalent infrastructure at lower cost; (2) Donovan's specific technical capabilities for AI agents in classified environments are not publicly documented in sufficient detail to assess differentiation from other defense AI companies (Palantir, Booz Allen); (3) The annotation tooling's technical superiority over Labelbox, Snorkel AI, and open-source alternatives (CVAT, LabelImg) is asserted but not independently benchmarked; (4) Scale's development roadmap for synthetic data capabilities is unknown — this is a critical gap as synthetic data threatens to displace human annotation. [CE021, CE022, CE023, CE024, CE025]

Roadmap / Release / Development-Stage Table
Date / Stage	Feature / Milestone	Status	Implication	Source
2016–2022	Scale Data Engine v1 → mature annotation platform; API launch	Complete (historical)	Foundation product; established annotation market position and contributor network	Scale public blog; company history
2021–2022	Scale RLHF product launch; LLM training data for OpenAI + AI labs	Complete (historical)	Positioned Scale as RLHF leader for frontier AI development	Scale RLHF page; TechCrunch reporting
2022	Scale Donovan (defense AI agents) launch; DoD IL4 clearance	Complete; active	Established cleared defense AI data position; created government revenue moat	Scale Donovan page; DoD contract blog
2023–2024	Scale GenAI Platform launch; enterprise LLM customization	Complete; active; strategic priority	Diversified beyond annotation into higher-margin platform business	Scale GenAI Platform docs; customer case studies
2024	Scale Leaderboard launch; WMDP benchmark release; White House AI commitments	Complete; active developer tool	Established Scale as trusted AI evaluation authority; government recognition	Scale blog (leaderboard, WMDP); White House record
May 2024	Series F $1B at $13.8B valuation; new strategic investors (Amazon, Meta, Cisco)	Complete	Capital to fund platform and government pivot; strategic co-sell potential	TechCrunch Series F reporting
June 2025	Meta strategic investment; Wang departure; Droege as Interim CEO; strategic pivot announced	Complete; ongoing transition	Business model pivot; CEO transition risk; customer attrition (Google, OpenAI)	TechCrunch; CNBC; Scale blog
July 2025	14% headcount reduction (200 employees + 500 contractors); data-labeling restructuring	Complete	Cost structure shift; confirms annotation pivot; talent retention risk	TechCrunch July 2025 reporting
2025–2026 (planned)	Enterprise GenAI Platform expansion; Donovan government rollout; Scale Evaluation growth	In progress (inferred from Droege statements)	Revenue mix shift to higher-margin enterprise + government; execution risk of pivot	Scale blog; Droege public comments
Unknown	Synthetic data capabilities; AI-assisted annotation automation	Not publicly confirmed	Critical gap: if not developing, Scale vulnerable to synthetic data displacement	Evidence gap — no public roadmap

Historical milestones are based on press records and public announcements. Forward-looking roadmap items are inferred from public statements; Scale does not publish a formal product roadmap.

[CE021, CE022]

5.6 Exhibits

Chapter 06

06Customers

6.1 Customer Base Segmentation

Scale AI's customer base is organized into three primary verticals: AI Labs and model developers, Fortune 500 enterprise customers, and U.S. government and defense agencies. Each segment has distinct buyer profiles, procurement dynamics, use cases, revenue structures, and switching cost profiles. AI Labs (formerly the largest segment) purchase Scale's RLHF and evaluation services to train and benchmark large language models. These customers—historically including OpenAI, Google/DeepMind, Cohere, and Anthropic—are sophisticated technical buyers with direct procurement authority and quarterly contract cycles tied to training compute schedules. This segment is now in structural decline: Google exited in June 2025 citing competitive conflict arising from the Meta deal, and OpenAI wound down its Scale relationship the same month, signaling a broader shift toward in-house annotation capacity among frontier model developers. Enterprise customers—including TIME, Etsy, Instacart, and Pinterest—deploy Scale's GenAI Platform and Data Engine for domain-specific AI applications such as content safety testing, recommendation systems, and e-commerce AI. These buyers typically involve IT and data leadership with procurement cycles of 6–18 months and multi-year deployment commitments. The TIME case study is the strongest public proof point: GenAI Platform deployment occurred in under two months, with 7,000+ adversarial attack vectors tested against TIME's AI content outputs in a production safety application. Government and defense customers—primarily U.S. Department of Defense and intelligence community agencies—engage through multi-year contract vehicles for data curation, autonomy AI programs, and the Donovan platform for classified operations. These customers have the highest switching costs due to DoD IL4 and FedRAMP High certification requirements, classified environment integration, and procurement inertia. Geographically, Scale's disclosed customer base is predominantly U.S.-headquartered. No international customer count or revenue split is publicly disclosed. [CU001, CU004, CU005, CU006, CU007, CU008]

Customer Segmentation Table
Segment	Buyer / Payer Profile	Primary Use Case	Scale / Scope	Revenue / Strategic Value	Key Gaps
AI Labs	CTO / research lead; quarterly procurement tied to training schedules	RLHF, model evaluation, safety benchmarking	2-3 flagship customers historically (OpenAI, Google); now smaller labs (Cohere)	Historically largest segment; in structural decline post-Google/OpenAI exits	Customer count, revenue per lab, NRR not disclosed; departures unquantified
Enterprise (Fortune 500)	IT / data leadership; 6-18 month sales cycles	GenAI Platform deployment, domain-specific AI, data curation	Multiple named logos (TIME, Etsy, Instacart, Pinterest); total count undisclosed	Growing per management narrative; no ARR, NRR, or expansion rate data	Expansion rates, upsell conversion, total enterprise customer count not disclosed
Government / Defense	Program manager / contracting officer; multi-year RFPs	Autonomy AI, joint force data curation, Donovan platform (national security)	Active DoD contracts; IC deployments; IL4 / FedRAMP High certified	High strategic value; presumed floor revenue; most durable segment	Contract values, agency names (classified), renewal timing, segment ARR not disclosed
Self-Serve / SMB / Research	Developer / researcher; pay-as-you-go	Annotation, API access for ML experimentation, model evaluation	1,000 free units entry tier; no active user count disclosed	Low individual revenue; high volume potential; conversion rate unknown	Active user count, conversion to enterprise, and usage trends not disclosed

Segment revenue split not disclosed by Scale AI. Tiers inferred from product pages, pricing structure, official blog posts, and case studies. Government segment classification is based on DoD certifications and official contract disclosures.

[CU001, CU004, CU005, CU006, CU007, CU008]

FU001: Customer Journey Map

6.2 Adoption Trajectory and Growth Signals

Scale AI does not publicly disclose total active customer counts, revenue, ARR, or deployment metrics. Adoption signals must be inferred from disclosed milestones, headcount dynamics, contributor payments, and third-party proxy data. The strongest scale signals are: (1) over $1 billion paid to annotation contributors globally, indicating substantial annotation throughput volume; (2) 15 billion+ human-labeled decisions processed, demonstrating platform depth; and (3) the May 2024 Series F at $13.8 billion valuation with participation from Amazon, Cisco, Intel, AMD, and ServiceNow—all enterprise buyers—suggesting customer validation at significant scale. The inclusion of strategic investors who are themselves enterprise customers provides independent corroboration of platform maturity. The July 2025 layoffs of 14% of staff (approximately 200 employees and 500 contractors), specifically concentrated in the data-labeling business, represent a lagging adoption signal: they indicate that RLHF and annotation volume has declined materially from peak, coinciding with Google and OpenAI departures and the strategic pivot toward the GenAI Platform and Donovan. Competitors Snorkel AI, SuperAnnotate, and Mercor have each expanded their enterprise annotation offerings, evidenced by new product pages and partner/leaderboard profiles, suggesting continued market activity even as Scale repositions. Self-serve growth is unquantified. Scale's API documentation and pricing pages are publicly accessible, suggesting ongoing developer adoption, but no active self-serve customer count or conversion rate has been disclosed. Enterprise customers visible on Scale's customers page—Etsy, Instacart, Pinterest, Cohere—appear without case studies or outcome metrics beyond logo presence, limiting the quality of adoption proof for these accounts. [CU011, CU012, CU013, CU017, CU021, CU023]

Customer Growth / Adoption Trajectory Table
Metric	Value	Date	Source	Confidence	Implication	Missing Denominator
Contributor payments	$1B+ paid to contributors globally	2025	scale.com/customers (official)	medium	Proxy for annotation throughput volume; not equivalent to customer revenue	No YoY growth rate or payment trajectory disclosed
Human-labeled decisions	15B+ processed	2025	scale.com/customers (official)	medium	Platform throughput depth; does not map to customer count or revenue	No active customer denominator or per-customer breakdown disclosed
Series F strategic investors	$1B at $13.8B; Amazon, Cisco, Intel, AMD, ServiceNow invested	May 2024	TechCrunch / Scale official	high	Enterprise buyers as investors provides independent customer validation signal	Revenue, ARR, or customer count not disclosed at close
Headcount post-layoff	~1,000 employees (14% reduction, ~200 let go)	Jul 2025	TechCrunch	high	Layoffs concentrated in data-labeling; proxy for volume decline in annotation segment	No operational capacity metric or segment headcount breakdown disclosed
AI lab customer attrition	Google (largest customer) and OpenAI both exited Q2 2025	Jun 2025	CNBC (two separate reports)	high	Material revenue concentration event; 20-40% of AI lab segment estimated affected	Revenue impact not disclosed; no replacement pipeline confirmed publicly

No revenue, ARR, or customer count data publicly disclosed. Adoption metrics are proxies and inferences only. High-confidence items are from confirmed multi-source news reporting or official Scale disclosures backed by reputationally-strong outlets.

[CU009, CU010, CU011, CU012, CU013, CU035]

FU002: Adoption / Deployment Funnel

6.3 Named Customer Proof and Evidence Quality

Scale AI's most important disclosed customer proof points span production deployments across media, defense, e-commerce, and AI research. The TIME Media case study is the highest-quality public proof point: Scale's GenAI Platform was deployed to test over 7,000 adversarial attack vectors against TIME's AI content before publication. This is a production safety application with clear, measurable outcome metrics and a sub-two-month deployment window, documented on Scale's official customers page. Meta is simultaneously Scale's largest strategic investor (~49% stake acquired June 2025) and an active, expanding RLHF customer. This dual role creates both a revenue anchor and a governance complication: Meta's privileged information access as an investor alongside its role as a customer raises data access and conflict-of-interest concerns for other customers evaluating Scale's data security practices. U.S. Department of Defense and intelligence community customers are evidenced by official Scale blog posts describing a DoD data curation contract for joint force operations and the Donovan platform's deployment in national security contexts. These are not named agency customers (classification prevents naming), but they represent the highest-confidence government deployment proof available in public sources, corroborated by Scale's DoD IL4 and FedRAMP High certifications. Enterprise customers—Etsy, Instacart, Pinterest—appear on Scale's customers page without case studies or outcome metrics. The Mercor lawsuit (September 2025) alleging customer poaching provides indirect adverse proof that Scale has enterprise accounts valuable enough to be actively solicited by a competitor, while also demonstrating competitive account retention vulnerability. Competitor profiles at Snorkel AI, SuperAnnotate, and Mercor indicate continued enterprise demand for annotation and AI data platforms, validating the market even as competition intensifies. [CU002, CU003, CU004, CU005, CU006, CU007]

Named Customer Proof Table
Customer	Segment	Deployment / Use Case	Production vs Pilot	Outcome / Proof Quality	Evidence Limitation
TIME (Media)	Enterprise	GenAI Platform for AI content safety testing; adversarial attack vector evaluation	Production (confirmed)	7,000+ attack vectors tested; deployed in <2 months; documented case study on scale.com/customers/time	Scale-produced case study; no independent third-party validation or TIME-provided ROI data
U.S. DoD / IC	Government / Defense	Data curation for joint force ops; Donovan platform for national security AI workflows	Production (confirmed)	Active multi-year DoD contracts; IL4 and FedRAMP High certified deployments; official Scale blog posts	Contract values and agency names classified; no mission outcome metrics publicly available
Meta	AI Lab + Strategic Investor	RLHF data for LLM training; expanding customer relationship post-investment	Production (expanding)	~49% strategic investment at $14.3B; company states relationship is expanding; RLHF platform corroborated	Dual investor-customer role creates evidence quality conflict; outcome metrics not independently verifiable
Etsy	Enterprise	AI training data for e-commerce recommendation and search AI	Presumed production	Logo on scale.com/customers; no case study, outcome data, or deployment scope available	Logo-only proof; deployment scope, contract terms, and outcomes unknown
Instacart	Enterprise	AI training data for delivery and logistics AI applications	Presumed production	Logo on scale.com/customers; no case study, outcome data, or deployment scope available	Logo-only proof; deployment scope, contract terms, and outcomes unknown
Pinterest	Enterprise	AI training data for visual search and recommendation AI	Presumed production	Logo on scale.com/customers; no case study, outcome data, or deployment scope available	Logo-only proof; deployment scope, contract terms, and outcomes unknown
Cohere	AI Lab	RLHF data for enterprise LLM training	Presumed production	Listed on scale.com/customers; Cohere is a commercial AI lab; RLHF platform corroborated	No case study; limited public information on scope or contract size

Enumeration is partial — only publicly named and confirmed customers included. Google and OpenAI are former customers (exited June 2025, not included). Proof quality ranges widely: TIME is the strongest (production + measured outcomes); government has high-confidence deployment but classified details; enterprise logos (Etsy, Instacart, Pinterest) provide minimal proof of production depth or outcomes.

[CU002, CU003, CU004, CU005, CU006, CU007]

FU003: Customer Proof Matrix

6.4 Retention, Durability, and Satisfaction Signals

Scale AI has not disclosed any net revenue retention (NRR), gross revenue retention (GRR), churn rate, renewal rate, total customer count, or satisfaction (NPS/CSAT) metrics. This represents a material information gap for investment analysis. All retention assessments below are structural inferences from contract type, switching costs, competitive dynamics, and adverse events—not from disclosed data. Government and defense customers represent the most durable cohort. Multi-year contract vehicles, DoD IL4 and FedRAMP High certification barriers, classified-environment integration, and high procurement switching costs create structural lock-in. The 2024 DoD joint force data curation contract and Donovan platform deployments in classified contexts suggest high annual retention rates for this segment, estimated at 90–97% based on comparable government IT contract renewal benchmarks. This segment's durability is largely independent of the AI lab attrition occurring in parallel. Enterprise customers (TIME, Etsy, Instacart, Pinterest) are likely on 12–36 month contracts with renewal optionality. Without NRR data, it is impossible to confirm whether land-and-expand dynamics are functioning. The GenAI Platform as an upsell from the Data Engine represents a theoretical expansion path, and the TIME deployment demonstrates conversion from pilot to production safety use case, but no cross-sell success rates or expansion revenue metrics are publicly available. The AI lab segment is in active contraction. Google and OpenAI both exited in June 2025, representing an estimated 20–40% of historical AI lab revenue (not disclosed, estimated from reported customer prominence). The remaining AI lab customers (Cohere, others) face similar structural pressures toward in-house annotation. No G2, Gartner Peer Insights, or Capterra reviews were identified for Scale AI's enterprise platform as of this analysis, leaving customer satisfaction entirely unquantified. [CU003, CU008, CU009, CU010, CU015, CU016]

Retention / Repeat Usage / Satisfaction Table
Metric	Value / Status	Segment	Confidence	Diligence Ask
Net Revenue Retention (NRR)	Not disclosed	All segments	n/a	Request trailing 4-quarter NRR by segment in due diligence data room
Gross Revenue Retention (GRR)	Not disclosed	All segments	n/a	Request trailing 4-quarter GRR by segment in data room
Annual Churn Rate	Not disclosed; Google and OpenAI 2025 departures are confirmed material adverse events	AI Labs	low (adverse signal only)	Quantify revenue impact of Google/OpenAI departures; confirm no other AI lab customers in exit pipeline
Contract Renewal Rate (Gov't)	Estimated 90%+ based on multi-year contract structure and IL4/FedRAMP switching cost barriers	Government / Defense	low (structural estimate)	Obtain actual renewal rate from contracting records or management confirmation in data room
Customer NPS / CSAT	Not disclosed; no G2, Gartner Peer Insights, or Capterra reviews identified for Scale AI enterprise platform	Enterprise	n/a	Request NPS data and any CSAT scores for enterprise segment; search G2/Gartner for updated review coverage
Contract Length	Government: multi-year RFP vehicles; Enterprise: estimated 12-36 months; Self-serve: monthly / pay-as-you-go	All segments	medium	Confirm minimum contract terms and renewal optionality for top 10 customers by ARR

All retention metrics except contract length estimates are undisclosed or unavailable. Structural retention inferences for government segment are low-confidence estimates. Google and OpenAI departures confirm AI lab churn is material and recent. No review platform data found.

[CU009, CU010, CU015, CU019, CU020, CU027]

FU004: Retention / Repeat Cohort

No public NRR, GRR, churn, or cohort data disclosed by Scale AI. Values are structural estimates based on contract type, switching cost analysis, and AI-annotation industry benchmarks for comparable data-services businesses. Estimates are for illustrative diligence purposes only and must be confirmed with disclosed data in the data room.

6.5 Expansion and Customer Concentration Risk

Scale AI's customer concentration risk is material and has adversely crystallized in 2025. The departure of Google—previously Scale's largest customer—and OpenAI within a single quarter is the single most important adverse customer event in Scale's history. CNBC reported Google's exit was driven by concerns about competitive conflict following the Meta investment. This departure pattern—where Scale's two largest AI lab customers exited citing the same underlying conflict of interest—creates structural risk for any remaining AI lab customers with Meta-adjacent competitive exposure. The Meta relationship creates both the largest expansion opportunity (Meta's AI initiatives are major consumers of training data, and the relationship is described as expanding) and a significant dependency risk: a substantial share of Scale's revenue post-Google/OpenAI may now be concentrated in a single customer who is also a 49% investor. This customer-investor concentration is structurally unusual and may create ongoing commercial friction for other customers. Land-and-expand dynamics are theoretically present: the pathway from data annotation to RLHF to evaluation to GenAI Platform to Donovan represents a documented upsell funnel. The TIME deployment demonstrates that enterprise customers can be converted from annotation pilots to production platform use. However, no quantified expansion rate, upsell conversion data, or average revenue per customer metrics are publicly available. The Mercor lawsuit (September 2025) alleging customer poaching indicates active competitive threat to existing account retention from at least one well-funded competitor. SuperAnnotate and Mercor are expanding enterprise annotation platform capabilities, evidenced by product pages and partner profiles, creating displacement risk for Scale's data engine revenue. [CU001, CU003, CU009, CU010, CU022, CU024]

Expansion and Concentration Risk Table
Driver / Risk Factor	Category	Current State	Impact Assessment	Diligence Path
Meta customer + investor relationship	Concentration + Conflict of interest	Meta holds ~49% stake and is expanding RLHF customer relationship	High positive (revenue anchor) + High negative (governance risk; other customers may reduce share)	Verify Meta revenue share as % of total ARR; assess data access governance; confirm enterprise customers remain comfortable with arrangement
Google and OpenAI simultaneous departure	Customer attrition	Both exited Q2 2025; Google was Scale's #1 customer	High negative — material revenue reduction; AI lab segment in structural decline	Quantify revenue impact; confirm replacement pipeline; review remaining AI lab contract NDA and exit-clause terms
DoD / IC multi-year contract vehicles	Expansion moat	Multiple active DoD contracts; Donovan in classified deployments; IL4/FedRAMP High certified	High positive — durable, sticky government revenue floor; high switching cost for competitors	Review contract vehicle types, option periods, and scheduled renewal dates; confirm new award pipeline
Mercor lawsuit — customer poaching	Competitive attrition risk	Scale sued Mercor Sep 2025 alleging poaching of key customers and trade secret misappropriation	Medium — indicates competitive account retention risk; may expand to other competitors	Monitor litigation outcome; request post-Mercor customer retention data; assess scope of contacts made
Land-and-expand upsell funnel	Expansion opportunity	Data Engine → RLHF → Evaluation → GenAI Platform → Donovan upsell funnel is documented but unquantified	Positive if conversion rates are material; actual conversion rates unknown	Request upsell conversion rates by product transition and average ARR expansion per customer cohort

Customer concentration risk is the single most material finding in this chapter. Meta-Google conflict resulted in the departure of Scale's largest customer. Government moat provides a structural offset but does not fully compensate for AI lab attrition without confirmed pipeline replacement.

[CU003, CU009, CU010, CU022, CU026, CU028]

6.6 Exhibits

Chapter 07

07Risks

7.1 Severity-Ranked Risk Overview

Scale AI enters the current investment period with a risk profile weighted toward strategic and execution risk rather than regulatory or operational risk. Customer concentration—already crystallized through the Google and OpenAI departures—represents the highest-severity, highest-probability risk as of the research date. These departures coincided with the Meta investment and the leadership transition, creating three concurrent high-severity risk events in a single quarter (Q2 2025) that are deeply interlinked. The most critical observation is the interdependency of Scale's top risks: the Meta investment triggered the Google departure (conflict of interest), which triggered a reassessment of Scale's business model viability at its previous AI lab revenue concentration, which in turn triggered the leadership transition and layoffs. Each risk amplifies the others. Investors must evaluate whether the government/defense moat and Meta's expanding customer relationship provide a sufficient revenue floor to sustain the business through the AI lab attrition and the enterprise pivot. Regulatory and legal risks are material but manageable. Scale's DoD IL4 and FedRAMP High certifications demonstrate a strong compliance posture for government work. The Mercor lawsuit is an ongoing legal risk but is unlikely to be existential. Export control risks for defense AI are real but addressed through existing certification and compliance programs. The most important forward-looking risk is whether Scale's enterprise GenAI Platform can generate sufficient replacement revenue for the AI lab segment within the time window defined by its current cash runway. [CR001, CR002, CR003, CR004, CR005, CR006]

Mitigation and Kill Criteria Table
Risk	Monitorable Trigger	Threshold / Event	Action Implication
Customer concentration / AI lab attrition	Additional AI lab customer departures; Meta reducing data purchasing volume	Any AI lab customer exit beyond Cohere, OR Meta ARR declines >20% QoQ	Thesis break — revenue floor assumption violated; requires full re-underwriting of revenue model
CEO / leadership transition failure	Enterprise or government deal loss attributable to leadership uncertainty; key executive attrition	Loss of 2+ senior leaders OR failure to close >$10M enterprise deal within 12 months of Droege appointment	Yellow flag — accelerate CEO succession or board strengthening; reassess management premium in valuation
Government contract non-renewal	DoD or IC contract not renewed at option period; failed security re-certification	Any material DoD contract cancellation or security clearance revocation	Thesis break — government moat assumption violated; Floor valuation collapses
Meta conflict crystallization	Enterprise or AI lab customer explicitly citing Meta conflict as exit reason; FTC or DOJ review of Meta stake	Any non-AI-lab enterprise customer citing Meta conflict, OR regulatory review initiated	Material risk escalation — reassess concentration overhang; review governance documentation
Execution of business model pivot	Enterprise GenAI Platform ARR as proxy for pivot success; Donovan new contract awards	GenAI Platform ARR failing to replace >50% of AI lab ARR within 18 months of layoffs (by Jan 2027)	Yellow flag — pivot velocity insufficient; reassess capital sufficiency and runway

Kill criteria are derived from the investment thesis requirements for government moat, enterprise platform growth, and management stability. Triggers are designed to be observable without requiring non-public information. Thresholds are analyst estimates.

[CR001, CR002, CR003, CR004, CR005, CR006]

FR001: Risk Heatmap

7.2 Regulatory and Legal Risk

Scale AI operates at the intersection of defense AI, enterprise data processing, and AI safety—three domains with active and evolving regulatory attention. The primary regulatory risks are U.S. government compliance requirements for defense AI (ITAR, export control, AI safety standards), data privacy and AI governance regulations applicable to enterprise data processing, and the competitive legal proceedings arising from the Mercor lawsuit. For defense AI, Scale's DoD IL4 and FedRAMP High certifications demonstrate strong regulatory compliance posture. The company has also made voluntary White House AI safety commitments, demonstrating proactive engagement with AI governance. The principal regulatory risk in this domain is potential expansion of export control restrictions on AI models and data services—particularly if the AI services Scale provides to DoD contractors involve sensitive technology transfer that could attract ITAR scrutiny. Congressional AI oversight is active, as evidenced by Scale's testimony before Congress. On data privacy, Scale's enterprise data annotation services involve processing customer proprietary data. The EU AI Act, GDPR, and emerging U.S. state AI laws create compliance obligations that are manageable but evolving. Scale's security and compliance pages indicate active SOC 2 Type II and ISO 27001 certifications, suggesting a mature enterprise compliance posture. The Mercor lawsuit (Scale AI vs. Mercor, September 2025) alleges customer poaching and trade secret misappropriation. While the lawsuit's outcome is uncertain, it creates legal uncertainty, management distraction, and potential counterclaim risk. The litigation does not currently appear to threaten Scale's core operations or certifications. [CR005, CR006, CR007, CR009, CR010, CR011]

Regulatory / Legal Risk Register
Risk / Rule / Case	Jurisdiction	Status	Likelihood	Severity	Mitigation	Residual Exposure	Diligence Path
U.S. AI export control / ITAR for defense AI	U.S. Federal	Active — evolving BIS/ITAR rules on AI models and data services	Medium	High	DoD IL4 and FedRAMP High certifications; existing defense contractor relationships	Moderate — regulatory changes could restrict data export or AI service delivery to non-U.S. entities	Confirm ITAR and EAR compliance posture for all government contracts; review BIS AI guidance adherence
Scale AI v. Mercor lawsuit (customer poaching, trade secrets)	U.S. Federal (NDCA)	Active — filed September 2025; litigation ongoing	High (filed)	Medium	Active litigation; preliminary injunction potentially sought; trade secret documentation	Moderate — counterclaims and discovery could reveal internal customer data or contract terms; management distraction	Review complaint and docket; obtain legal opinion on likelihood of preliminary injunction; assess counterclaim exposure
EU AI Act compliance for enterprise data processing	EU	Effective — GDPR and EU AI Act obligations apply to EU enterprise customers	Low-Medium	Medium	ISO 27001 and SOC 2 certifications; DPA agreements with enterprise customers	Low-Moderate — primary risk arises if Scale processes high-risk AI training data for EU-regulated applications	Confirm EU AI Act classification of Scale's data annotation services; verify DPA compliance with EU customers
U.S. AI safety voluntary commitments and potential mandates	U.S. Federal	Voluntary — White House commitments signed 2024; potential mandatory regulation in progress	Low	Low-Medium	White House AI safety commitments signed; proactive AI safety research (WMDP benchmark)	Low — voluntary compliance reduces regulatory risk; mandatory rules are not yet in force	Monitor AI safety rulemaking; confirm voluntary commitment compliance documentation is current
Data privacy — state AI and privacy laws (CPRA, others)	U.S. State	Active — CPRA and similar laws apply to California-headquartered Scale AI	Low-Medium	Low-Medium	SOC 2 Type II, privacy policy, enterprise DPA agreements	Low — standard enterprise SaaS compliance posture; no known enforcement actions	Confirm CPRA compliance; verify enterprise customer DPA templates are current; check for state-level AI regulation applicability

Register is partial — based on public sources only. Regulatory correspondence, enforcement actions, and any government investigation not publicly disclosed are not enumerated. Likelihood and severity are analyst estimates. Legal counsel review is required for definitive risk assessment.

[CR005, CR006, CR007, CR009, CR010, CR011]

FR002: Risk Transmission Map

7.3 Operational and Execution Risk

Scale AI's most acute operational risk is the execution of a business model pivot under adverse conditions. The company is attempting to transition from annotation-volume revenue (declining due to AI lab departures and synthetic data trends) to enterprise GenAI Platform and government Donovan revenue (growing but uncertain pace). This pivot is occurring simultaneously with a CEO transition, significant headcount reduction, and the loss of two of its largest customers. Data security and annotation quality are operational risks inherent to Scale's core business. As a company that processes proprietary customer data for AI training, any data security incident, quality failure, or customer data misuse event would be severely damaging to enterprise trust and government contract eligibility. Scale's SOC 2 Type II, ISO 27001, DoD IL4, and FedRAMP High certifications reduce—but do not eliminate—this risk. No public data security incidents were identified in this research. Supply chain risk for Scale relates primarily to its annotation contributor workforce. Scale relies on a global network of human annotators; disruptions to this workforce (labor disputes, quality degradation, competitive poaching by Mercor or others) could impair annotation output quality and delivery timelines. The July 2025 layoffs that eliminated approximately 500 contractors may have reduced operational redundancy in this supply chain. Additionally, Scale's dependency on cloud infrastructure providers (AWS and similar) creates a platform concentration risk that is standard for cloud-native companies but worth noting in the context of its DoD requirements for infrastructure sovereignty. [CR001, CR003, CR008, CR015, CR016, CR017]

Operational / Quality / Security Risk Register
Failure Mode	Likelihood	Severity	Mitigation Maturity	Residual Exposure	Unresolved Gap
Data security breach exposing customer proprietary AI training data	Low-Medium	High	High (SOC 2 Type II, ISO 27001, DoD IL4, FedRAMP High)	Medium — certifications reduce but do not eliminate breach risk; AI training data is high-value target	No public incident disclosure mechanism confirmed; breach notification process for government customers unclear
Annotation quality degradation post-layoff (500+ contractors released)	Medium	High	Medium (QA processes in place; Scale RLHF platform automates quality scoring)	Medium-High — significant workforce reduction may reduce annotation quality redundancy and institutional knowledge	No public quality SLA disclosure; no customer-facing quality metrics; contractor reduction scope unclear by segment
Business model pivot execution failure (annotation to platform)	High	High	Low (pivot is early-stage; enterprise GenAI Platform revenue not publicly confirmed)	High — if pivot fails, core annotation business in structural decline with no replacement revenue	Enterprise GenAI Platform ARR not disclosed; conversion rate from annotation to platform unknown; no public timeline
Cloud infrastructure outage affecting government or enterprise SLAs	Low	Medium	Medium (cloud redundancy; DoD requires specific infrastructure sovereignty)	Low-Medium — standard cloud dependency risk; mitigated by multi-cloud and government-grade infrastructure requirements	Infrastructure sovereignty arrangement for classified DoD workloads not publicly disclosed
Contributor workforce disruption (labor disputes, competing platforms)	Medium	Medium	Medium (global contributor base provides some geographic redundancy)	Medium — Mercor and other platforms competing for annotation workforce; quality of replacement contributors unknown	Contractor workforce composition and geographic concentration not disclosed; Mercor competition for contributors unquantified

Failure modes ordered by residual severity. Mitigation maturity is analyst estimate based on public certification data and inferred operational practices. No internal risk management documentation or incident history was publicly available for review.

[CR015, CR016, CR017, CR018, CR019, CR003]

FR003: Dependency Map

7.4 Partner and Dependency Risk

Scale AI's most significant dependency risk is its relationship with Meta—simultaneously its largest investor (~49% equity), its largest remaining AI lab customer, and its former CEO's new employer. This triple-role dependency creates a concentration and governance risk without precedent in the venture-backed AI sector. If Meta's strategic interest in Scale diminishes, reduces its data purchasing, or introduces new competitive products, Scale faces simultaneous customer revenue loss, valuation pressure, and governance disruption. Cloud infrastructure dependency on AWS and major cloud providers is a standard risk for AI infrastructure companies, but it is especially relevant for Scale's government business where platform sovereignty and sovereignty over AI training compute are compliance requirements. Scale's DoD certifications imply approved cloud infrastructure arrangements, but any change in cloud provider relationships could affect government contract eligibility. OpenAI and Google's departures eliminated two key partner relationships that provided both revenue and reputational validation. Remaining AI lab partners (Cohere, others) are smaller and represent less revenue concentration individually. Channel and partner concentration in government procurement is positive rather than negative: government contracts proceed through established contract vehicles (GSA, DIU, DARPA channels) with well-defined procurement rules. However, government budget cycles and continuing resolution dynamics can create revenue timing volatility. The Snorkel AI partner ecosystem (evidenced by its partners page) and SuperAnnotate's enterprise partnerships indicate that Scale's competitors are also building partner-dependent go-to-market strategies, creating a race for ecosystem lock-in. [CR002, CR003, CR004, CR006, CR009, CR020]

Partner / Dependency Risk Register
Dependency	Counterparty	Role	Concentration Level	Failure Scenario	Severity	Mitigation	Residual Exposure
Meta strategic relationship	Meta (investor + customer)	49% equity investor AND expanding RLHF customer AND former CEO's employer	Extreme	Meta reduces data purchasing, seeks to acquire full control, or introduces competing annotation platform	Critical	Revenue diversification into government/enterprise; maintain operational independence per stated policy	High — single relationship combines customer, investor, and governance risk; no structural firewall confirmed
U.S. DoD / IC contract vehicles	U.S. Government (DoD, IC)	Multi-year government customer; defines government revenue floor	High	Contract non-renewal, budget cuts, or failed security re-certification	High (positive dependency — loss would be severe)	IL4/FedRAMP High certifications; multi-year contract structure with option periods; dedicated government team	Medium — government contracts are durable but subject to budget cycles and re-certification requirements
Cloud infrastructure (AWS, Microsoft Azure)	AWS / Microsoft	Platform hosting for Scale's annotation pipeline, API services, and government workloads	High	Cloud provider outage, pricing change, or contract termination	Medium	Multi-cloud architecture (assumed); DoD requires FedRAMP-authorized infrastructure	Low-Medium — standard cloud dependency; mitigated by government certification requirements
Global annotation contributor network	Independent contractors (500K+ estimated)	Annotation workforce supply for data labeling pipeline	High	Workforce fragmentation, quality degradation, or mass departure to competing platforms (Mercor)	Medium-High	Global geographic distribution; proprietary quality scoring platform (Scale RLHF)	Medium — Mercor lawsuit indicates active poaching of contributors; post-layoff morale risk
OpenAI / Google (former)	OpenAI, Google (departed)	Were key RLHF and evaluation customers validating platform quality	High (historical; now zero)	Already materialized — both exited Q2 2025	High (already crystallized)	None effective — exits complete	Residual: reputational association that other AI labs may exit; creates AI lab segment concentration risk for Cohere et al.

Dependencies ordered by residual severity. Meta dependency is the most structurally unusual and hardest to mitigate through standard risk management. Former customer row included to show concentration crystallization. Concentration levels are analyst estimates.

[CR002, CR003, CR004, CR006, CR009, CR020]

7.5 People, Execution, and Financial Risk

Leadership transition risk is among the most material at Scale AI. Founder Alexandr Wang built Scale over nine years and was the face of its government, enterprise, and AI lab relationships. His departure to join Meta, concurrent with Meta becoming Scale's largest investor, creates a principal-agent conflict that is visible to customers, employees, and investors. Interim CEO Jason Droege has no prior experience running a $29B-valued AI infrastructure company; his Uber Eats background is operationally relevant to marketplace scaling but not directly to government AI or enterprise data platform go-to-market. Talent retention is a risk in the immediate post-layoff environment. The July 2025 reduction of 200 employees and 500 contractors may have damaged morale among the remaining ~1,000 employees, particularly if high performers perceive uncertainty about Scale's direction under interim leadership. Key government and enterprise relationship managers who held institutional knowledge of defense contracts and enterprise deployments represent irreplaceable assets; their retention is critical to contract renewal and platform expansion. Financial risk is partially mitigated by the Meta strategic investment, but the capital from that transaction was distributed to existing shareholders rather than retained on Scale's balance sheet for operations. This creates uncertainty about Scale's actual cash position and runway post-distribution. No public burn rate, cash position, or path-to-profitability data has been disclosed. If the AI lab revenue attrition is as large as the Google/OpenAI departures imply, Scale may face material revenue shortfalls in 2025-2026 that require additional external capital unless the enterprise and government growth rate exceeds expectations. [CR001, CR002, CR003, CR007, CR008, CR024]

People / Execution Risk Register
Role / Function	Dependency or Gap	Likelihood of Impact	Severity	Mitigation	Diligence Path
CEO — Interim (Jason Droege)	Unproven at Scale's stage and sector; Uber Eats background lacks government AI and enterprise data platform precedent	High	High	Droege has operational scaling experience; supported by existing leadership team and board	Assess Droege's progress on enterprise and government deals since appointment; confirm board is actively engaged in CEO search or succession planning
Founder (Alexandr Wang) — departed	Wang held government, enterprise, and AI lab relationships for 9 years; departed to Meta; retains board seat	High (already materialized)	High	Wang retains board representation; existing team carries institutional knowledge	Confirm Wang's board role is active and not conflicted by Meta duties; assess whether key customer relationships transferred to current team
Government / defense relationship managers	Institutional knowledge of classified programs, contracting officers, and security clearances; key-person risk in DoD relationships	Medium	High	Long-term government contracts reduce relationship dependency; cleared personnel have high switching costs	Interview 2-3 senior government relationship managers; confirm security clearance retention post-layoffs; verify DoD program continuity
Enterprise sales leadership	Enterprise sales cycle is 6-18 months; leadership attrition post-layoff could disrupt pipeline	Medium	Medium-High	Scale's brand and existing customer logos support enterprise sales; self-serve API reduces dependency	Request enterprise sales pipeline data (qualified pipeline, pipeline coverage ratio, deal stage distribution)

People risks are ordered by severity. Key-person risk is extremely high given the simultaneous departure of the founder and the leadership transition in a single quarter. Government relationship continuity is the most operationally critical risk given the revenue floor it provides.

[CR001, CR002, CR007, CR008, CR024, CR025]

7.6 Exhibits

Chapter 08

08Valuation

8.1 Investment Thesis and Anti-Thesis

The investment thesis for Scale AI rests on three structural pillars. First, government and defense: Scale's Donovan AI platform, DoD IL4/FedRAMP High certifications, and deep government relationships create a durable revenue floor with high switching costs, long contract cycles, and minimal competitive overlap with AI labs. This segment functions as a strategic asset that supports a minimum valuation of $3–5B independent of any commercial AI lab business. Second, data moat: Scale has trained over nine years of annotation quality infrastructure, contributor management, and model evaluation tooling that is not easily replicated at scale. This infrastructure positions the company to capture enterprise GenAI Platform revenue as Fortune 500 companies shift from AI experimentation to production deployment requiring human-in-the-loop data workflows. Third, Meta validation: the $14.3B investment by Meta—one of the most sophisticated AI investors globally—provides both financial and reputational validation, signals the strategic value of Scale's data infrastructure, and creates an expanding customer relationship. The anti-thesis is equally robust. At $29B implied valuation, Scale trades at 58x–145x estimated ARR at a moment when its two largest revenue contributors (Google and OpenAI) have both departed, its founder CEO has left, and its business model is mid-pivot. The annotation commoditization risk (synthetic data, in-house RLHF by frontier labs) is a structural threat to the AI lab segment. The Meta investor-customer relationship creates a governance overhang that deters other customers and creates a principal-agent conflict without precedent in the AI sector. Interim CEO Droege has no demonstrated track record in Scale's core markets. The $29B valuation can only be defended by a bull case set of assumptions about government contract renewal, enterprise platform conversion, and Meta ARR growth—all of which are unconfirmed by public evidence. The base case implies material dilution risk from the entry price. The balance of evidence suggests that Scale is a high-quality asset in a structurally adverse situation. The diligence path must establish whether the government floor is secure, whether enterprise platform ARR is growing at sufficient pace, and whether the management team can execute the pivot without additional capital at dilutive terms. Until these questions are answered, the evidence does not support investment at the $29B implied valuation. A conditional pass—invest at a materially lower entry price with earned milestones—or a research-more stance is the appropriate recommendation given current evidence quality. [CV001, CV002, CV003, CV004, CV005, CV006]

Thesis / Anti-Thesis Table
Argument	Evidence Basis	What Would Change This View
THESIS: Government moat creates durable revenue floor with high switching costs and multi-year contracts	DoD IL4, FedRAMP High, Donovan platform, DIU/DARPA contracts; no announced non-renewal	Government contract cancellation or failed security re-certification would eliminate this thesis pillar
THESIS: Data infrastructure and annotation platform represent 9 years of unreplicable quality engineering	RLHF platform, 500K+ contributor network, model evaluation tooling documented on scale.com; Stanford HAI endorsement	Evidence of systematic annotation quality degradation or competitor replication at comparable scale
THESIS: Meta validation ($14.3B, 49%) signals strategic value and secures near-term capital	Multi-source confirmed investment; largest single AI strategic investment on record	Meta reducing data purchasing or seeking to acquire remaining equity at depressed valuation
THESIS: Enterprise GenAI Platform creates replacement revenue for AI lab attrition	TIME deployment case study; enterprise customer logos; official GenAI Platform product page	Enterprise platform ARR failing to replace >50% of AI lab revenue within 24 months (by mid-2027)
ANTI-THESIS: $29B valuation set by strategic buyer is not financial-investor-rational	Entry multiple of 58x–145x ARR; concurrent attrition of top 2 customers; Google/OpenAI exit confirmed	Entry at materially lower price (<$15B) or confirmation of rapid post-attrition ARR recovery
ANTI-THESIS: Annotation commoditization is a structural threat to core RLHF revenue	McKinsey/Stanford HAI report declining per-token annotation cost; synthetic data generation reducing human annotation requirements	Evidence that Scale's platform-level evaluation services (not raw annotation) are insulated from commoditization
ANTI-THESIS: CEO transition and Meta conflict create management and governance risk without precedent	Wang departure confirmed; Droege interim appointment confirmed; no announced permanent CEO search	Appointment of a permanent CEO with government AI and enterprise platform track record; structural firewall between Meta and Scale customer data confirmed

Thesis and anti-thesis are evidence-grounded; speculative arguments excluded. The thesis is structurally coherent but requires private data confirmation for the critical revenue replacement assumption. The anti-thesis is observable from public sources without any private data requirement.

[CV001, CV002, CV003, CV004, CV005, CV006]

Thesis-Break and Kill Triggers Table
Trigger	Threshold / Event	Transmission to Thesis	Action Implication
Government contract non-renewal	Any DoD or IC contract cancelled or not renewed at scheduled option period	Eliminates the government floor thesis; base case valuation collapses to Appen-equivalent ($1.5–3B range)	Hard pass — government moat is the primary thesis pillar; its failure is a non-recoverable thesis break
Additional enterprise customer citing Meta conflict	Any non-AI-lab enterprise customer explicitly citing Meta relationship as reason for exit or non-purchase	Confirms that Meta conflict is a systematic go-to-market inhibitor, not just an AI lab-specific issue; expands the TAM risk beyond the AI lab segment	Hard pass if confirmed; increases required entry discount to reflect structural customer acquisition constraints
Meta ARR decline	Meta data purchasing ARR falls >20% QoQ for two consecutive quarters without explanation from alternative enterprise growth	Eliminates the anchor customer assumption of the bull case; no remaining large anchor; concentration collapse	Hard pass — revenue floor disappears; triggers re-underwriting of all scenarios
GenAI Platform ARR below $30M as of Q4 2026	Enterprise GenAI Platform ARR confirmed below $30M (Q4 2026 revenue date or data room disclosure)	Confirms that the enterprise pivot is not achieving minimum viable conversion; ARR replacement pace is insufficient to offset AI lab attrition within the thesis window	Pass or price discipline: reduce entry price by 40% from base case ceiling
CEO transition failure signal	Droege fails to close any enterprise deal >$5M in first 12 months, OR 2+ senior leadership exits post-Droege appointment	Signals that the management team cannot execute the enterprise pivot; increases execution risk multiplier	Yellow flag — reassess management capability; seek board clarity on permanent CEO timeline before proceeding

Triggers are designed to be observable from public reporting or data room disclosures without requiring insider access. The government contract trigger is the most critical: it is binary, publicly observable (USASpending.gov), and directly eliminates the primary thesis pillar.

[CV003, CV006, CV007, CV008, CV009, CV029]

FV001: Recommendation Logic

8.2 Valuation Context and Current Financing

Scale AI's current implied valuation of approximately $29B derives from Meta's June 2025 purchase of approximately 49% of the company for $14.3B. This transaction was reported by multiple independent high-reputation sources (TechCrunch, CNBC, Reuters) and is considered confirmed for this analysis. The implied total company valuation ($29B) is based on the reported ~49% equity stake; the precise pre-money/post-money treatment and cap table structure are not publicly disclosed. Importantly, the Meta transaction proceeds were distributed to existing shareholders rather than retained on Scale's balance sheet for operations, leaving Scale's actual working capital and cash runway uncertain. At the $29B implied valuation: - Low ARR estimate ($200M): 145x revenue multiple — consistent with frontier AI lab multiples in early 2024 but extreme for a company experiencing customer attrition - Mid ARR estimate ($350M): 83x revenue multiple — comparable to high-growth enterprise SaaS at peak market conditions - High ARR estimate ($500M): 58x revenue multiple — justified only if government ARR is growing rapidly and enterprise platform conversion is accelerating The most comparable funding event in Scale's history is the May 2024 Series F at $13.8B on approximately $200–300M estimated ARR (46–69x ARR). The Meta deal nearly doubled that valuation in approximately 13 months despite the concurrent departure of two of Scale's largest customers. This dynamic suggests the Meta valuation reflects strategic acquisition logic (Meta's need for proprietary RLHF data infrastructure) rather than financial return optimization. Investors co-investing at $29B implied are therefore paying a price set by a strategic acquirer's logic, not an arm's-length financial investor's return framework. This is a critical valuation discipline issue: strategic buyers accept premiums that financial investors cannot underwrite. The diligence path must establish a financial investor's ceiling for the entry price. Dilution and preference overhang are unknown. Scale has raised approximately $1.6B in equity prior to the Meta deal; the preference stack, liquidation rights, and participating vs. non-participating structures are not publicly disclosed. Any financial investor entering at $29B implied must assess the preference waterfall impact on common equity returns in the bear case. [CV001, CV002, CV003, CV010, CV011, CV012]

Final Diligence Asks Table
Topic	Missing Evidence	Why It Matters	Owner / Diligence Path
Post-attrition ARR by segment	Google and OpenAI were the largest customers; their ARR contribution and the net ARR after departure are undisclosed	Cannot underwrite any scenario without knowing the revenue hole and the current trajectory of replacement ARR	Data room: request ARR waterfall by segment Q1 2024 – Q4 2025; anonymized if required by NDA
Government contract schedule and renewal timeline	DoD contract option periods, TCV, and renewal rates are not publicly disclosed	Government moat is the primary thesis pillar; contract non-renewal is the single most likely thesis-break event	USASpending.gov for visible contracts; data room: request government contract ledger with option periods; direct conversation with Scale government team lead
Cash position and runway post-Meta deal	Meta proceeds distributed to shareholders; current operating cash, burn rate, and runway not disclosed	Without runway visibility, cannot assess whether pivot can be completed without dilutive capital raise	Data room: request balance sheet, P&L, and cash flow statement; confirm amount retained for operations vs. distributed
Enterprise GenAI Platform ARR and growth rate	No public disclosure of GenAI Platform ARR, customer count, or conversion rate from annotation to platform	Platform transition is the core bull case driver; without ARR data, bull case is speculation	Data room: request GenAI Platform ARR as of Q4 2025 and Q1 2026; pipeline data; representative enterprise contracts
Management team retention and permanent CEO plan	No public announcement of permanent CEO search or expected appointment timeline; key executive retention post-Droege unknown	Management continuity is critical to government relationship maintenance and enterprise pivot execution	Board conversation: confirm CEO search status; request 90-day review of key executive retention; assess key-person risk with HR team
Meta governance firewall and data access agreement	Investor rights agreement between Meta and Scale AI not publicly disclosed; no confirmed data access limitation for Meta as investor	Without a confirmed firewall, the Meta conflict-of-interest risk is unmitigated and enterprise customers' data sovereignty concerns cannot be resolved	Data room: request investor rights agreement and any data access limitation schedule; obtain legal opinion on firewall adequacy

All six diligence asks are blocking for an investment decision at $29B implied valuation. The three highest priority asks — ARR by segment, government contract schedule, and cash runway — should be completed before any preliminary term sheet discussion.

[CV002, CV012, CV013, CV014, CV029, CV030]

FV002: Valuation Sensitivity

8.3 Bull / Base / Bear Scenarios

The scenario analysis for Scale AI hinges on three key driver variables: (1) the government/defense ARR trajectory and renewal rate for existing DoD contracts, (2) the enterprise GenAI Platform conversion rate and ARR growth from the commercial pivot, and (3) the Meta customer relationship durability and any further AI lab attrition or accretion. Bull case: Government contracts renew at >90% and expand with new Donovan program awards; enterprise GenAI Platform achieves $150M+ ARR by 2027 driven by Fortune 500 production deployments; Meta expands its data purchasing to $300M+ ARR; Interim CEO Droege stabilizes operations and attracts a marquee permanent CEO within 12 months. Under these assumptions, total ARR reaches $600M–800M by 2027, and at a 25–30x forward multiple, valuation reaches $18–24B. This implies modest upside from $29B entry and is achievable only under multiple concurrent favorable outcomes. Base case: Government contracts renew at 85% and grow modestly; enterprise GenAI Platform achieves $75–100M ARR by 2027 (slower ramp than bull); Meta ARR holds at current levels; AI lab ARR (Cohere, others) declines modestly. Total ARR reaches $350–450M by 2027 at 20–25x forward multiple, implying a valuation of $8–11B—a significant markdown from the $29B entry. This is the most likely scenario given the evidence available. Bear case: Government contracts face delays or a material non-renewal; enterprise GenAI Platform fails to achieve product-market fit by 2027 (net ARR < $50M from platform); Meta reduces data purchasing following establishment of in-house annotation capability; additional AI lab customers exit. Total ARR falls to $150–200M by 2027 at 10–12x forward multiple, implying $1.5–2.5B valuation. At this valuation, the $29B entry becomes economically devastating; even preference stacks may not protect against total return loss. The probability signal for each case is asymmetric: the bull case requires multiple concurrent optimistic outcomes; the bear case requires only the government contract not renewing on schedule (a single event). This asymmetry, combined with the entry price being set by a strategic buyer, creates a structurally unfavorable risk/return ratio for a financial investor at $29B. The base case implies a 60–70% markdown, and the bear case implies near-total loss of financial value. Only investors who can price the strategic optionality of the government moat at $20B+ can justify the $29B entry. [CV005, CV006, CV007, CV008, CV009, CV015]

Bull / Base / Bear Scenario Table
Scenario	Key Assumptions	ARR Estimate (2027)	Valuation Logic	Implied Valuation	Probability Signal
Bull Case	Government contracts renew >90%; GenAI Platform reaches $150M+ ARR; Meta expands to $300M+ data purchase; permanent CEO hired	$700M–850M	25x forward ARR (government AI platform premium)	$17.5B–25.5B	20% — requires multiple concurrent optimistic outcomes; no single assumption is extreme but all must hold simultaneously
Base Case	Government contracts renew at 85%; GenAI Platform reaches $75–100M ARR; Meta ARR stable; AI lab segment declines modestly	$350M–450M	20–22x forward ARR (mixed platform/services valuation)	$7.7B–10.4B	55% — most outcomes are consistent with historical trajectories for annotation companies with government moats
Bear Case	Government contract non-renewal or delay; GenAI Platform <$50M ARR; Meta reduces purchasing; additional AI lab exits	$150M–200M	10x forward ARR (annotation services with degraded government positioning)	$1.5B–2B	25% — requires only one adverse event (government non-renewal) to materialize; high probability given forward contract uncertainty

Scenarios represent analyst estimates constructed from public evidence; not company forecasts. ARR estimates are gross ARR including government, enterprise, AI lab, and self-serve segments. Probability signals are subjective but grounded in the distribution of outcomes for comparable annotation companies (Appen precedent weighted toward bear/base; Palantir precedent weighted toward bull). Valuation at $29B entry implies a 60–70% markdown in the base case.

[CV015, CV016, CV017, CV018, CV022, CV024]

FV003: Valuation / Return Range

8.4 Comparable Set and Valuation Positioning

Scale AI operates across multiple addressable markets and can be compared using several distinct frameworks. The most important distinction is that Scale is priced as a frontier AI platform company despite generating most of its current revenue as an annotation and data labeling services company. This creates a valuation framework mismatch: applying annotation services multiples (Appen: 0.5–2x ARR) implies $100M–1B; applying defense AI platform multiples (Palantir: 25–30x ARR) implies $5B–15B; applying frontier AI platform multiples (OpenAI private rounds: 100x+ ARR) implies $20B+. The appropriate framework depends on which revenue segment becomes dominant in Scale's forward composition. Appen (ASX: APX) is the most instructive negative comparable: a public annotation company that experienced 60–80% revenue decline when AI labs reduced RLHF spend after developing in-house annotation capabilities. Appen's market cap fell from ~AUD $3.5B at peak to ~AUD $250M following customer attrition, demonstrating the severe valuation impact of annotation commoditization. Scale's government moat is the key structural difference from Appen: Scale's defense vertical creates a revenue floor that Appen lacked. Palantir (NYSE: PLTR) provides the government AI platform comparable: a company with deep DoD/IC integrations, high switching costs, and multi-year government contracts that commands a 25–30x ARR multiple on a strong growth narrative. Palantir's market cap has exceeded $100B on estimated $2B+ ARR, implying comparable government-anchored AI platforms can sustain high multiples. However, Palantir has demonstrated sustained commercial traction alongside government revenue; Scale's commercial pivot is unproven. Labelbox (private, ~$1B valuation) provides a relevant annotation infrastructure comparable without government moat, suggesting that annotation infrastructure without government contracts commands approximately 8–10x ARR—a fraction of Scale's implied multiple. The delta between Labelbox's multiple and Scale's implies the market is pricing Scale's government moat and Meta strategic option value at approximately $20B+ premium. This premium requires explicit validation through government contract schedule visibility and enterprise ARR trajectory data. [CV019, CV020, CV021, CV022, CV023, CV024]

Comparable Valuation Table
Comparable	Type	Key Metric / ARR	Multiple / Valuation	Government Moat	Relevance	Key Limitation
Appen (ASX: APX)	Public — annotation services	AUD ~$400M peak ARR (2021)	0.5–2x ARR; peak ~$3.5B market cap; declined 90%+ post-attrition	None	Direct negative comparable: annotation company that lost AI lab customers and collapsed in valuation	No government moat — Scale's floor valuation should be structurally higher than Appen's trough
Palantir (NYSE: PLTR)	Public — defense AI / enterprise data platform	~$2B+ ARR (2025 est.)	30–50x ARR; $100B+ market cap; high government ARR concentration	Extremely high (NSA, CIA, DoD multi-decade relationships)	Best government-anchored AI platform comparable; Scale's government depth is less proven	Palantir has proven 15+ years of government contract continuity; Scale's government track record is shorter
Scale AI Series F (May 2024)	Private — own historical round	~$200–300M ARR (estimated at time)	46–69x ARR; $13.8B valuation	High (established at time of round)	Closest historical comparable; shows valuation doubled despite concurrent attrition	Historical reference only; current trajectory is adverse relative to Series F
Scale AI Meta deal (Jun 2025)	Private — strategic investment	~$200–500M ARR (current estimate)	58–145x ARR; $29B implied	High (established)	Current implied valuation — the entry price under analysis	Strategic buyer premium embedded; not financial-investor-comparable; proceeds to shareholders not balance sheet
Labelbox (private)	Private — annotation platform, no government moat	~$80–120M ARR (est.)	~8–12x ARR; ~$1B valuation	None	Annotation infrastructure comparable without government moat; benchmarks Scale's platform multiple absent government premium	Private company; ARR is analyst estimate; no public financial disclosure
Crunchbase Scale AI funding history	Database — funding round aggregator	~$1.6B total equity raised pre-Meta deal	N/A (not a per-comparable entry)	N/A	Contextual: confirms capital efficiency relative to valuation	Database entry; round details may differ from disclosed actuals

Multiple comparables because Scale AI straddles annotation services (Appen comparable), government defense AI (Palantir comparable), and enterprise AI platform (Labelbox comparable). The appropriate valuation framework depends on which segment becomes the dominant revenue contributor in the forward period. At $29B, the valuation is pricing in the Palantir-equivalent government moat AND significant enterprise platform upside — both of which require private confirmation.

[CV019, CV020, CV021, CV022, CV023, CV024]

FV004: Investment KPIs

8.5 Recommendation, Confidence, and Final Diligence Asks

The recommendation for Scale AI is Research-More with a Conditional Pass at materially lower entry valuation. The evidence supports the quality of Scale's assets—government moat, annotation data infrastructure, brand, and Meta validation—but does not support the $29B valuation at current evidence quality and revenue trajectory. The primary driver of this recommendation is that the entry valuation was set by a strategic acquirer (Meta) operating under strategic logic incompatible with financial investor return requirements. Paying $29B without visibility into post-attrition ARR, government contract renewal schedule, enterprise platform conversion metrics, and cash runway is epistemically unsound. The confidence in this recommendation is medium-low because the most critical data—government contract ARR, enterprise platform ARR, and post-deal cash position—are private and not publicly accessible. If those data points were available and favorable, the recommendation could upgrade to Conditional Buy at a price below $15B, or even Structured Buy at $8–12B with aggressive milestone protections. The current evidence quality justifies caution: eight of the thirty research questions in this chapter have partial or unresolved status because the key inputs are private. Thesis-break conditions that would move the recommendation to Pass (reject): any government contract cancellation, any additional enterprise customer citing Meta conflict, or confirmation that GenAI Platform ARR is below $30M as of Q1 2026. Conditions that would upgrade to Conditional Buy: government contract renewal confirmation, enterprise platform ARR exceeding $75M, and a permanent CEO appointment with government AI or enterprise platform experience. Investors should note that the timeline for resolution of the critical diligence questions is short—the government contract option periods and enterprise platform ramp should produce observable revenue signals within 12–18 months of the research date. [CV001, CV002, CV006, CV009, CV017, CV029]

Recommendation Summary Table
Dimension	Assessment	Basis	Condition / Qualifier
Recommendation	Research-More / Conditional Pass	Valuation is strategic-buyer priced; financial investor return case requires materially lower entry	Upgrade to Conditional Buy if government contract schedule confirmed and GenAI Platform ARR exceeds $75M
Confidence	Low-Medium (35%)	Critical inputs (government ARR, enterprise platform ARR, cash runway) are private and unconfirmed	Confidence rises to Medium-High if Q2 2026 revenue data confirms base case trajectory
Risk Rating	High	Four concurrent high-severity risks (concentration, CEO, Meta conflict, pivot execution); no prior navigation of this risk cluster demonstrated	Risk declines to Medium if government contracts renew and management stabilizes
Valuation Stance	Materially Overvalued at $29B implied for financial investor	58x–145x estimated ARR set by strategic acquirer logic; financial return case requires $8–15B entry	Fair value range: $8–12B base case; $15–22B bull case
Decision Implication	Do not invest at $29B without additional data room visibility	Entry at $29B offers negative expected value under base case; thesis requires multiple concurrent optimistic outcomes	Request private data room access; set price discipline at <$15B; negotiate milestone-based valuation adjustments

Recommendation is evidence-sensitive and price-sensitive. The assessment reflects public-evidence limitations and the base-case valuation framework. A different recommendation (Conditional Buy or Track) could be reached under different entry price conditions or with private data room confirmation of key ARR metrics.

[CV001, CV002, CV006, CV009, CV029, CV030]

8.6 Exhibits

Disclaimer

This report is for informational and research purposes only. Sources are publicly available; no proprietary or confidential information was used. The authors make no representations as to completeness or accuracy. This is not investment advice.

Evidence index

Claims
ID	Statement	Confidence	Sources
CO001	Scale AI was founded in 2016 by Alexandr Wang, who was 19 years old and had left MIT, in San Francisco, California.	High	SO001, SO015
CO002	Scale AI's mission is to develop reliable AI systems for the world's most important decisions.	Medium	SO001
CO003	Scale AI provides data annotation, RLHF, model evaluation, and enterprise GenAI platform services as its core product offerings.	High	SO001, SO008, SO009, SO012
CO004	Scale AI had approximately 1,000 employees as of its about page, with headcount reduced following the July 2025 layoffs of 200 employees and 500 contractors.	Medium	SO001, SO017
CO005	Scale AI closed a $1 billion Series F round in May 2024, led by Accel, at a post-money valuation of $13.8 billion.	High	SO015, SO016
CO006	Scale AI has raised approximately $1.6 billion in total disclosed venture funding, including $325 million in its 2021 Series E at a $7.3 billion valuation and $1 billion in its 2024 Series F.	Medium	SO015, SO019
CO007	Meta's June 2025 strategic investment of approximately $14.3 billion for a minority stake implies a Scale AI valuation of over $29 billion.	High	SO019, SO016
CO008	Scale AI has processed more than 15 billion human-labeled decisions and has paid contributors globally over $1 billion.	Medium	SO001
CO009	Scale AI operates as a late-stage private company; it has not filed for an IPO as of the report date.	High	SO001, SO015
CO010	Alexandr Wang served as CEO of Scale AI from its founding in 2016 until June 2025, when he departed to join Meta AI.	High	SO002, SO016
CO011	Jason Droege was appointed Interim CEO of Scale AI in June 2025 following the departure of Alexandr Wang.	High	SO002, SO016
CO012	Jason Droege founded Uber Eats and scaled it to a $19 billion GMV run rate, then served as VP at Uber and partner at Benchmark before joining Scale AI as Chief Strategy Officer in September 2024.	Medium	SO002, SO027
CO013	Alexandr Wang retained a seat on Scale AI's board of directors after his departure to Meta in June 2025.	Medium	SO016, SO019
CO014	Scale AI's full board composition beyond Alexandr Wang is not publicly disclosed as of mid-2026.	High	SO001, SO002
CO015	Alexandr Wang's concurrent role at Meta AI and board seat at Scale AI creates a structural governance conflict of interest, as Meta is both Scale's largest strategic investor and a potential competitor to Scale's AI lab customers.	Medium	SO016, SO019, SO020
CO016	Scale AI's CEO transition from founder Alexandr Wang to interim CEO Jason Droege is a material key-person risk given Wang's central role in building Scale's customer relationships and product vision.	Medium	SO002, SO016, SO017
CO017	Scale AI's Series E in August 2021 raised $325 million at an approximate $7.3 billion valuation, with Coatue, Y Combinator, and Founders Fund among the lead investors.	Medium	SO015
CO018	Scale AI reduced its workforce by approximately 20% in 2023 amid a slowdown in AI training data demand.	Medium	SO015, SO017
CO019	The Scale AI Series F included investors Amazon, Meta, Cisco, Intel, AMD, ServiceNow, Nvidia, DFJ Growth, WCM, Elad Gil, and Nat Friedman as new or returning strategic participants alongside financial investors.	High	SO015, SO019
CO020	The Scale AI Series F round combined primary capital and a secondary component, allowing existing shareholders to partially liquidate their positions.	Medium	SO015
CO021	Meta's June 2025 investment was structured as a minority stake purchase of approximately 49% of Scale AI's outstanding equity on a fully diluted basis.	Medium	SO019, SO016
CO022	The proceeds from Meta's $14.3 billion investment in Scale AI were distributed to existing shareholders and holders of vested equity rather than retained as operating capital on Scale's balance sheet.	Medium	SO019, SO016
CO023	Scale AI stated it remains operationally independent from Meta following the strategic investment.	Medium	SO002, SO016
CO024	Scale AI's Donovan platform provides specialized AI agent workflows for defense and intelligence missions, operating in classified environments enabled by DoD IL4 and FedRAMP High certifications.	High	SO006, SO005
CO025	Scale AI holds SOC 2 Type II, ISO 27001, DoD IL4 Provisional Authorization, and FedRAMP High security certifications.	High	SO005, SO004
CO026	Scale AI's RLHF product provides curated preference data for reinforcement learning from human feedback, which is central to training large language models for instruction following and safety alignment.	High	SO012, SO009
CO027	Scale AI secured a U.S. Department of Defense data curation contract for joint force operations in 2024.	Medium	SO014, SO007
CO028	Scale AI testified before Congress on AI safety and data quality standards during the 118th Congress.	Medium	SO023, SO013
CO029	Scale AI offers enterprise customers custom pricing with dedicated operations teams and SLA commitments; self-serve customers pay per usage with the first 1,000 labeling units free.	Medium	SO003
CO030	Scale AI does not publicly disclose its annual recurring revenue, gross margin, or detailed financial metrics; it is classified as a private-undisclosed company.	High	SO001, SO003
CO031	Scale AI signed the White House voluntary AI safety commitments in 2024, pledging commitments on AI safety, security, and trust.	Medium	SO013, SO023
CO032	Scale AI operates its Global Public Sector division serving international government agencies in addition to U.S. defense customers.	Medium	SO026, SO007
CO033	Scale AI's scale of operations is indicated by the 15 billion human decisions processed and $1 billion paid to its contributor network globally.	Medium	SO001
CO034	Scale AI launched the TIME magazine GenAI deployment in under 2 months with over 7,000 AI attack vectors tested, demonstrating enterprise deployment speed and safety rigor.	Medium	SO011
CO035	Google was Scale AI's largest customer prior to the Meta strategic investment; CNBC reported Google planned to wind down or significantly reduce its Scale relationship after the Meta deal in June 2025.	High	SO020, SO021
CO036	OpenAI wound down its work with Scale AI in June 2025 following Meta's strategic investment, according to CNBC.	High	SO021, SO016
CO037	Scale AI operates the Scale GenAI Platform that transforms enterprise data into domain-specific generative AI applications using a proprietary pipeline.	Medium	SO001, SO008
CO038	Scale AI laid off approximately 200 employees (14% of staff) and 500 contractors in July 2025, with Interim CEO Droege's memo citing overinvestment in data-labeling capacity relative to the company's strategic direction.	High	SO017, SO016
CO039	Scale AI filed a lawsuit in September 2025 against Mercor, a rival data services company, and a former Scale employee, alleging an attempt to steal Scale's largest customers.	Medium	SO018, SO016
CO040	McKinsey's State of AI 2025 survey found that 88% of organizations now use AI in at least one business function, up from 78% in 2024, indicating strong underlying demand for AI data and infrastructure services.	High	SO028, SO022
CO041	The OpenAI fine-tuning and custom models program expansion highlights continued strong demand for specialized AI model training services, which Scale AI's RLHF and Data Engine products directly address.	Medium	SO024, SO028
CM001	The AI data services and infrastructure market encompasses data annotation, RLHF, model evaluation, and enterprise GenAI platform services as distinct but related segments with overlapping buyers and spend.	High	SM012, SM013
CM002	Status-quo substitutes for Scale AI's annotation services include internal human review teams at large AI labs, lower-cost offshore providers, and increasingly, synthetic data generation pipelines.	Medium	SM002, SM003, SM004
CM003	Scale AI does not currently serve the commodity crowdsource annotation market (e.g., Mechanical Turk), general AI consulting, or hyperscaler AI services (Azure, AWS, GCP) — these are excluded from its SAM.	Medium	SM016, SM017
CM004	The AI data annotation and evaluation market is served by a fragmented set of competitors including Appen, Labelbox, Snorkel AI, SuperAnnotate, Surge AI, Invisible Technologies, and Mercor, indicating a competitive but not yet consolidated market.	Medium	SM002, SM003, SM004, SM005, SM006, SM007, SM008
CM005	Scale AI's GenAI Platform competes in the enterprise AI deployment market against hyperscalers and boutique AI consultancies, while its Donovan platform operates in the specialized government and defense AI segment where cleared vendors are scarce.	Medium	SM010, SM009, SM011
CM006	The Total Addressable Market for AI data services and infrastructure is estimated at $10–$30 billion annually as of 2025, with high uncertainty due to boundary definition differences and rapid market evolution.	Low	SM012, SM013
CM007	A narrower TAM for AI data annotation services only (excluding evaluation, RLHF, and enterprise GenAI platform) is estimated at $2–$8 billion, derived from public-company proxy revenues scaled to global market size.	Low	SM004, SM018
CM008	Scale AI's Serviceable Addressable Market (premium tier: AI labs, large enterprise, and government) is estimated at $1.5–$6 billion, representing the high-quality end of AI data services where Scale's quality premium and certifications are competitive differentiators.	Low	SM012, SM018
CM009	The U.S. government and defense AI data and evaluation submarket addressable by cleared vendors like Scale AI is estimated at $300M–$1B based on defense AI investment growth trends and cleared vendor supply constraints.	Low	SM011, SM025
CM010	Scale AI's realistic Serviceable Obtainable Market (SOM) within a 3–5 year horizon is estimated at $500M–$2 billion, consistent with Scale's $29B+ valuation at typical SaaS/data revenue multiples.	Low	SM021, SM018
CM011	No public analyst report provides a consistent, granular breakdown of the AI data annotation market with sufficient specificity for high-confidence TAM/SAM/SOM sizing; all estimates carry high uncertainty.	Medium	SM014, SM015
CM012	AI research laboratories and foundation model developers (OpenAI, Meta, Google DeepMind, Anthropic, Cohere) represent Scale AI's original customer segment, requiring large volumes of high-quality labeled data and RLHF data.	High	SM023, SM016
CM013	Large enterprise AI adopters (Fortune 500 companies using AI for products and automation) represent Scale AI's growth segment, accessing the GenAI Platform with higher switching costs and longer procurement cycles than AI labs.	Medium	SM010, SM022
CM014	U.S. government and defense agencies represent Scale AI's most defensible buyer segment, with very high switching costs due to FedRAMP High and DoD IL4 clearance requirements, multi-year contract structures, and institutional knowledge dependencies.	High	SM009, SM011
CM015	AI startups and research organizations access Scale's self-serve tier with low switching costs and lower average revenue per user, representing volume without durability.	Medium	SM016, SM017
CM016	U.S. government AI procurement follows federal acquisition regulations with multi-year contract vehicles and IDIQ structures, creating revenue lumpiness but high stickiness once awarded.	Medium	SM025, SM011
CM017	The TIME magazine GenAI deployment case study demonstrates Scale AI's ability to win enterprise media customers with fast deployment timelines (under 2 months) and comprehensive AI safety testing.	Medium	SM023
CM018	Enterprise AI platform buyers (vs. AI lab buyers) offer Scale higher switching costs and platform stickiness, making them strategically more valuable for long-term revenue durability despite lower initial volume.	Medium	SM010, SM022
CM019	McKinsey's 2025 State of AI survey found that 88% of organizations use AI in at least one business function, up from 78% the previous year, confirming accelerating enterprise AI adoption.	High	SM012, SM001
CM020	McKinsey's 2025 survey found 62% of organizations are experimenting with AI agents, indicating AI adoption is moving toward more complex and automated workflows that require higher-quality training and evaluation data.	High	SM012, SM001
CM021	Most organizations are still in the AI pilot/POC stage rather than full-scale deployment, according to McKinsey's 2025 survey, suggesting enterprise platform revenue for Scale AI will lag the adoption curve by 12–24 months.	Medium	SM012
CM022	U.S. government AI investment is growing materially as the DoD integrates AI into surveillance, logistics, cybersecurity, and autonomous systems — all of which require trusted data infrastructure like Scale provides.	Medium	SM025, SM011
CM023	AI safety regulatory pressure — including the White House AI Executive Order and EU AI Act — is increasing demand for AI model evaluation and audit services, benefiting Scale's Evaluation product.	Medium	SM025, SM013
CM024	The proliferation of foundation model providers (OpenAI, Meta, Anthropic, Mistral, Cohere) increases aggregate demand for RLHF data and model evaluation, partially offsetting the risk from any single large lab customer departure.	Medium	SM020, SM013
CM025	Synthetic data generation represents a long-term structural threat to human-labeled annotation demand; some AI labs are shifting toward model-generated synthetic data for portions of their training pipelines.	Medium	SM020, SM019
CM026	Low-cost annotation providers (Appen, offshore QA teams, crowdsourcing platforms) create structural margin pressure on Scale's data annotation business, constraining the price premium Scale can sustain without clear quality differentiation.	Medium	SM004, SM002
CM027	The AI data annotation market does not have publicly available CAGR estimates from tier-one analyst firms with sufficient market boundary consistency to provide a reliable growth rate for TAM sizing.	Medium	SM014, SM015
CM028	Appen (ASX-listed) is the only publicly traded direct comparable to Scale AI's data annotation business, but Appen's revenue trends, market positioning, and customer base differ materially from Scale's premium positioning.	Medium	SM004
CM029	The revenue impact of Google's and OpenAI's departures from Scale AI as customers cannot be quantified from public sources, representing the single most material unresolved market sizing question for Scale's investment case.	High	SM019, SM021
CM030	Scale AI's revenue mix by segment (AI labs vs. enterprise vs. government) is not publicly disclosed, making it impossible to verify which segment dominates current revenue or project the enterprise and government pivot timeline.	High	SM016, SM023
CM031	The wide range of AI data services TAM estimates ($2B–$30B) reflects genuine analyst disagreement on boundary definitions, and investors should treat all sizing figures as directional rather than precise.	Medium	SM014, SM015, SM012
CM032	Scale AI's AI Readiness Report positions the company as a thought leader in enterprise AI maturity, which supports its brand in the enterprise buyer segment and validates its content-led GTM motion.	Medium	SM022
CM033	Stanford HAI AI Index 2025 confirms that AI investment reached record levels globally in 2024, with both private investment and government spending accelerating, providing macro validation for the AI data services market.	High	SM013, SM001
CM034	Fortune 500 companies deploying production AI (not just pilot) in 2025 represent a small but growing fraction of the enterprise TAM; McKinsey data suggests roughly one-third of AI-adopting organizations are scaling programs.	Medium	SM012
CM035	The competitive landscape for AI data services includes multiple vendors targeting different quality/price points — from premium (Scale, Surge) to mid-market (Labelbox, Snorkel, SuperAnnotate) to low-cost (Appen, Invisible) — suggesting market segmentation rather than single-vendor dominance.	Medium	SM002, SM003, SM005, SM006, SM007
CP001	Scale AI's competitive landscape encompasses direct competitors in AI data annotation (Appen, Labelbox, Snorkel AI, SuperAnnotate), RLHF data (Surge AI, Mercor), enterprise GenAI platform (hyperscalers, boutique AI consultancies), and defense AI (defense contractors).	High	SP004, SP007, SP012, SP017, SP014, SP001
CP002	Appen is the only publicly traded direct comparable to Scale AI's annotation business, and Appen's declining revenues provide a market signal for the structural headwinds facing pure-play annotation vendors.	Medium	SP004, SP019
CP003	Meta's June 2025 strategic investment in Scale AI created a direct conflict of interest with Scale's largest AI lab customers (Google, OpenAI), leading both to wind down or significantly reduce their Scale relationships.	High	SP021, SP022, SP023
CP004	Google was Scale AI's largest customer prior to the Meta investment and planned to wind down its Scale relationship in June 2025 due to competitive conflict concerns.	High	SP021, SP023
CP005	OpenAI wound down its work with Scale AI in June 2025, coinciding with Meta's strategic investment and founder Alexandr Wang's departure to Meta.	High	SP022, SP023
CP006	Labelbox has expanded from core data labeling to RLHF, model evaluation, leaderboards, and robotics AI training, making it a direct multi-product competitor to Scale AI across several key product lines.	Medium	SP007, SP008, SP009, SP010, SP011
CP007	Snorkel AI's programmatic labeling approach — using AI weak supervision to reduce manual annotation — threatens Scale's human-expert model by potentially reducing the labor content and cost of annotation, directly challenging Scale's premium pricing thesis.	Medium	SP012, SP013
CP008	Mercor operates an AI talent marketplace for RLHF, model evaluation, and data labeling, founded by former Scale AI-connected individuals, and is being sued by Scale for alleged customer poaching.	High	SP001, SP014
CP009	SuperAnnotate competes with Scale AI in the enterprise annotation platform market with a security-first positioning and collaborative workflow tools, primarily targeting computer vision use cases.	Medium	SP015
CP010	Invisible Technologies provides AI-powered business operations services that compete with Scale's enterprise automation and annotation capabilities at the operations level.	Medium	SP017
CP011	Surge AI focuses specifically on high-quality human feedback data for LLM training and RLHF, directly competing with Scale's RLHF product with a smaller but expert contributor network.	Medium	SP017
CP012	Appen has expanded into agentic AI services and model evaluation in addition to its core annotation business, indicating it is attempting to follow Scale into higher-margin evaluation segments.	Medium	SP005, SP006
CP013	Appen provides data security features for enterprise and government customers but does not disclose DoD IL4 or FedRAMP High certifications equivalent to Scale AI's cleared vendor status.	Medium	SP018, SP002
CP014	Labelbox offers tiered pricing including self-serve developer plans and enterprise custom pricing, making it price-competitive against Scale's self-serve tier and potentially more accessible to mid-market enterprise.	Medium	SP016, SP007
CP015	Scale AI's DoD IL4 Provisional Authorization and FedRAMP High certifications are unique among publicly disclosed AI data annotation vendors, creating a near-exclusive position in classified and defense AI data markets.	High	SP002, SP003
CP016	Scale AI's Donovan defense AI agent platform has no publicly disclosed direct competitor with equivalent security clearances and defense-specific deployment capabilities.	Medium	SP003, SP002
CP017	Scale AI's evaluation benchmarking position — including the Scale Leaderboard and WMDP harmful-knowledge benchmark — has established reputational authority as a trusted third-party LLM evaluator, providing a differentiated position versus competitors.	Medium	SP025, SP002
CP018	Labelbox and Snorkel AI do not publicly disclose government security certifications equivalent to Scale's DoD IL4 or FedRAMP High authorizations, indicating limited ability to compete for Scale's most defensible government contracts.	Medium	SP007, SP012
CP019	Scale AI's feature breadth — spanning annotation, RLHF, evaluation, enterprise GenAI platform, and defense AI agents — is unmatched by any single competitor, though individual competitors may lead in specific capabilities.	Medium	SP025, SP007, SP004
CP020	Scale's enterprise GenAI Platform has no direct counterpart at Appen, Labelbox, or Snorkel AI, but faces significant competition from hyperscaler AI platforms (AWS Bedrock, Azure AI, Google Vertex) with far greater resource advantages.	Medium	SP025, SP007
CP021	Switching costs for AI lab customers of Scale AI are moderate: annotation pipelines are fungible, and Google and OpenAI both exited Scale's ecosystem within weeks of the Meta conflict emerging in June 2025.	High	SP021, SP022
CP022	Switching costs for enterprise customers using Scale's GenAI Platform are higher than for annotation-only customers, due to platform integration, workflow customization, and data pipeline dependencies.	Medium	SP025
CP023	Obtaining DoD IL4 Provisional Authorization and FedRAMP High certification requires a multi-year process involving security audits, infrastructure requirements, and federal agency sponsorship, creating a 12–36 month barrier for any competitor seeking to enter Scale's government segment.	Medium	SP002, SP003
CP024	Mercor is actively building an alternative AI expert contributor supply chain through its talent marketplace to directly compete with Scale's contributor network, targeting the same expert annotators and RLHF data customers.	Medium	SP001, SP014
CP025	Multi-homing — where annotation customers run work through multiple vendors simultaneously — is common in the AI data market; Scale's self-serve tier confirms this with its pay-as-you-go model that has no long-term commitment.	Medium	SP007, SP012
CP026	Scale AI's government and defense segment represents its most durable competitive moat, requiring years for competitors to replicate the clearances, institutional knowledge, and defense-specific product (Donovan) that Scale has built.	High	SP002, SP003, SP025
CP027	Scale AI's quality premium in annotation is under pressure from competitors including Labelbox, Surge AI, and lower-cost providers; the departure of Google and OpenAI demonstrates that quality premium alone was insufficient to sustain those relationships.	High	SP021, SP022, SP024
CP028	Appen's declining revenues — the most direct public comparable to Scale's annotation business — serve as a leading indicator of structural headwinds in the pure-play data annotation market.	Medium	SP019, SP004
CP029	The Meta strategic investment has become Scale AI's primary competitive liability in the AI lab segment: Mercor's lawsuit defense (Sep 2025) reveals that competitors view Scale's AI lab customer base as vulnerable and actively targetable.	High	SP001, SP021, SP022
CP030	Scale AI's July 2025 layoffs specifically targeted data-labeling employees — not evaluation or platform — confirming management's internal assessment that commodity annotation is a commoditizing segment requiring rightsizing.	High	SP024, SP023
CP031	Snorkel AI's programmatic labeling approach and rising AI-assisted annotation tools represent a structural threat to the human-label-intensive part of Scale's business model, potentially reducing the labor content of annotation work over time.	Medium	SP012, SP013
CP032	Scale AI's annotation platform (Scale Data Engine) has built-in quality feedback loops and an expert contributor network that paid contributors $1B+ globally, creating a proprietary supply chain that competitors must spend years and significant capital to replicate.	Medium	SP025, SP002
CP033	The competitive landscape for AI data services lacks a dominant single vendor — no competitor has matched Scale's full-stack positioning across annotation, RLHF, evaluation, and defense AI — but individual segments are increasingly contested.	Medium	SP004, SP007, SP012, SP014
CP034	Large defense contractors such as Palantir and Booz Allen are potential long-term entrants into the defense AI data market, representing a risk to Scale's Donovan moat over a 3–5 year horizon.	Low	SP003
CP035	Scale AI has not publicly documented evidence of losing any government or defense contract to a competitor, which supports the durability of its government segment moat in the near term.	Medium	SP002, SP003
CI001	Scale AI generates revenue through enterprise data annotation/RLHF services, the Scale GenAI Platform (enterprise SaaS + managed services), U.S. government and defense contracts, and a self-serve tier.	High	SI001, SI013, SI020, SI014
CI002	Scale AI's GenAI Platform targets enterprises seeking to build custom AI applications from proprietary data, representing a higher-margin opportunity than commodity annotation.	Medium	SI001, SI004
CI003	Scale AI holds DoD IL4 and FedRAMP High certifications, enabling it to pursue classified and defense-grade AI data contracts that most competitors cannot access.	Medium	SI005, SI006
CI004	Scale AI's RLHF revenue is affected by OpenAI's wind-down of its Scale relationship post-Meta-investment, while Meta's own RLHF work with Scale is expanding.	Medium	SI022, SI017
CI005	Scale AI's self-serve Data Engine offers the first 1,000 labeling units free, then charges pay-as-you-go; enterprise tiers are custom-priced with dedicated operations and SLAs.	High	SI013, SI001
CI006	Scale AI's enterprise GTM is primarily sales-led with dedicated operations teams and solution engineers; government contracts require specialized federal procurement BD processes.	Medium	SI013, SI005, SI021
CI007	Scale AI's enterprise pricing for annotation, RLHF, and platform services is custom-quoted and not publicly disclosed; no list pricing or typical deal size information is available.	High	SI013, SI001, SI020
CI008	Scale AI's enterprise annotation sales cycle is estimated at 3–12 months, with government contract sales cycles typically 12–36 months; specific CAC figures are not disclosed.	Low	SI005, SI006, SI021
CI009	Scale AI's data-labeling revenue was concentrated among a small number of large AI lab customers, creating material customer concentration risk that materialized when Google and OpenAI departed in 2025.	High	SI018, SI022, SI016
CI010	Scale AI's investor relationships (Accel, Amazon, Meta, Nvidia, Cisco) create potential co-sell and referral channels, but the specific economic impact of these relationships on revenue is not publicly disclosed.	Low	SI015, SI014
CI011	Scale AI's annotation cost of revenue is dominated by human labor, with the company having paid over $1 billion globally to contributors; annotation gross margins are estimated at 25–45%, using Appen's public financials as proxy.	Low	SI014, SI007, SI008
CI012	Scale AI's GenAI Platform is estimated to carry significantly higher gross margins (45–65%) than annotation services, reflecting greater software leverage and less per-unit human labor, consistent with enterprise managed-services industry benchmarks.	Low	SI001, SI004, SI023
CI013	Scale AI's July 2025 layoffs of 200 employees (14% of staff) and 500 contractors targeted the data-labeling business, signaling management's intent to shift to higher-margin enterprise and government services and reduce the labor-heavy cost base.	High	SI016, SI021
CI014	Appen (ASX: APX), the only publicly traded direct comparable to Scale AI's annotation segment, has reported declining revenues and gross margins in the 25–40% range, confirming structural headwinds in commodity annotation economics.	Medium	SI007, SI008, SI009
CI015	Scale AI's cost structure is primarily working-capital-intensive (human labor payments, contractor management) rather than physical capex intensive, making it operationally flexible but subject to margin pressure from annotation labor costs.	Medium	SI014, SI010, SI011
CI016	Scale AI raised approximately $600 million in pre-Series F capital and $1 billion in Series F (May 2024, $13.8 billion valuation, led by Accel) for a total raised of approximately $1.6 billion or more.	High	SI015, SI019
CI017	Meta invested approximately $14.3 billion for a minority stake of approximately 49% in Scale AI in June 2025, implying a Scale valuation of over $29 billion; the investment was primarily structured as a secondary transaction with proceeds distributed to existing shareholders.	High	SI017, SI019
CI018	Scale AI does not have any publicly disclosed debt, credit facilities, or project-finance obligations; its capital structure is equity-financed.	Medium	SI015, SI017
CI019	Based on the Series F primary capital (estimated $500M+ to the company), post-layoff burn rate reductions, and potential primary capital from the Meta deal, Scale AI is estimated to have $500M–$1B cash on hand with 24–48 months of runway as of mid-2025.	Low	SI015, SI017, SI016
CI020	Scale AI's total estimated annual revenue is in the range of $200M–$500M ARR, based on headcount proxies (~1,000 employees post-layoff), industry revenue-per-employee benchmarks, and analyst commentary; this estimate has very low confidence without public financial disclosure.	Low	SI023, SI025, SI014
CI021	At Scale AI's $29B+ implied valuation, if ARR is $200M–$500M, the implied revenue multiple of 58x–145x is in hypergrowth-premium territory and cannot be verified or justified without audited financial statements and confirmed growth rates.	Low	SI017, SI023
CI022	Scale AI does not disclose revenue, ARR, gross margin, NRR, burn rate, or operating cash flow; these omissions represent the primary blockers to financial underwriting of the company.	High	SI013, SI014, SI021
CI023	Google and OpenAI's departure as customers in 2025 represents a material risk to Scale AI's revenue trajectory; the financial magnitude of this attrition is unknown but potentially $50M–$200M+ in annual revenue based on their reported status as Scale's largest AI lab customers.	Low	SI018, SI022, SI016
CI024	The MetA strategic investment does not appear to include covenants or restrictions on Scale AI's operations; Meta holds a minority stake and Scale remains independent, but the conflict of interest with other customers is a de facto constraint on commercial relationships.	Low	SI017, SI019, SI021
CI025	A Mercor lawsuit filed by Scale AI in September 2025 alleging customer poaching could have financial implications including legal costs, settlement obligations, and customer loss not yet captured in financial estimates.	Medium	SI016, SI021
CI026	Scale AI's enterprise and government revenue segments have higher revenue quality (longer contracts, stronger switching costs) than the annotation segment, but their current size and growth trajectory are not publicly disclosed.	Medium	SI005, SI006, SI001
CI027	Scale AI paid over $1 billion globally to annotation contributors, confirming the company's historical commitment to human labor-based annotation and the labor intensity of its core revenue model.	Medium	SI014, SI013
CI028	McKinsey's 2025 AI State report notes that 88% of organizations use AI in at least one business function, up from 78%, supporting a growing market for Scale AI's enterprise AI data and platform services.	High	SI023, SI025
CI029	Scale AI's annotation business model is primarily working-capital-intensive, with payments to contributors timed to annotation project delivery; this creates a different capex profile than hardware or infrastructure companies.	Medium	SI014, SI012
CI030	Appen's publicly disclosed multimodal, speech/audio, and physical AI annotation services are directly comparable to Scale AI's core annotation product lines, making Appen's financial results the best available public proxy for Scale's annotation segment economics.	Medium	SI009, SI010, SI011
CI031	Scale AI's government and defense revenue segment carries high switching costs (DoD IL4 and FedRAMP High certification barriers) and multi-year contract durations, creating more durable and predictable revenue than the annotation segment.	Medium	SI005, SI006
CI032	Scale AI's revenue mix is shifting from data-labeling (legacy largest segment) toward enterprise GenAI platform and government/defense contracts, consistent with the July 2025 restructuring that cut data-labeling headcount while maintaining enterprise and government operations.	Medium	SI016, SI021, SI001
CI033	Scale AI's government contract revenue segment is growing through active DoD relationships (data curation contract, DIU RCV program, White House AI commitments) but exact contract values and revenue contribution are not publicly disclosed.	Medium	SI005, SI006, SI024
CI034	The net primary capital received by Scale AI from the Meta deal is not publicly disclosed; the deal was primarily structured as shareholder liquidity, meaning Scale's operating treasury benefit may be significantly less than the $14.3B headline figure.	Medium	SI017, SI019
CI035	Scale AI's financial verdict is mixed: abundant capital buffers the pivot risk, but the $29B+ valuation is unjustifiable without confirmed revenue data, and the annotation-segment revenue attrition risk is unquantified.	Medium	SI017, SI018, SI022
CE001	Scale AI's core product portfolio consists of five offerings: Scale Data Engine (annotation + curation), Scale RLHF (LLM training data), Scale Evaluation + Leaderboard (model benchmarking), Scale GenAI Platform (enterprise AI applications), and Donovan (defense AI agents).	High	SE017, SE016, SE015, SE018, SE020
CE002	The Scale GenAI Platform enables enterprises to build and deploy custom AI applications from proprietary data, differentiating from hyperscalers by integrating annotation quality with LLM customization.	Medium	SE011, SE018, SE013
CE003	Scale AI's RLHF product provides expert human feedback data for LLM training and alignment; the product was historically used by OpenAI and Google before their 2025 departures.	Medium	SE016, SE021
CE004	The Scale Leaderboard is a public developer-facing tool that ranks LLM performance across capability dimensions; it has established Scale as a trusted third-party evaluator in the AI research community.	Medium	SE001, SE015
CE005	Donovan is Scale AI's specialized AI agents platform for DoD, IC, and government agencies; it operates in DoD IL4-certified environments and has no disclosed direct competitor in the cleared AI agents space.	Medium	SE020, SE019, SE003
CE006	Scale AI's operating model combines proprietary annotation tooling, a quality assurance pipeline, and a global contributor network with software API infrastructure — a hybrid human-in-the-loop architecture.	Medium	SE017, SE012, SE011
CE007	The Scale API provides programmatic access to annotation, RLHF, and evaluation services with REST interface and webhook integration; it enables enterprise and developer integration with MLOps pipelines.	Medium	SE012, SE013
CE008	Scale AI's annotation workflow: customers submit raw data and task guidelines via API; Scale's contributor network executes annotation; a QA pipeline provides statistical review and expert escalation; annotated datasets are returned to customers.	Medium	SE017, SE012, SE016
CE009	Scale AI's GenAI Platform operating model is a managed services approach involving data ingestion, LLM selection, fine-tuning or RAG pipeline configuration, red-teaming and safety evaluation, and production deployment with monitoring.	Medium	SE011, SE018, SE027
CE010	Donovan's architecture is specialized for cleared government environments: DoD IL4-certified cloud infrastructure, classified data integration, multi-modal AI agent capabilities, and explainability features for military applications.	Low	SE020, SE019, SE003
CE011	Scale AI's primary technology differentiators are: proprietary annotation tooling and QA methodology, the global contributor network ($1B+ paid), DoD IL4 and FedRAMP High certifications, the Scale Leaderboard as reputational IP, and Donovan as a first-mover defense AI platform.	Medium	SE014, SE020, SE001, SE017
CE012	Scale AI's DoD IL4 and FedRAMP High certifications represent a genuine barrier to entry for competitors: obtaining these clearances takes 2–4 years, requires significant compliance investment, and requires government institutional relationships.	Medium	SE014, SE019, SE020
CE013	Scale AI's contributor network — having paid over $1 billion globally — is a proprietary supply-side asset that competitors including Mercor are attempting to replicate; Scale's lawsuit against Mercor specifically alleges customer and annotator poaching.	High	SE022, SE010, SE009
CE014	The Scale Leaderboard positions Scale as the trusted third-party LLM evaluator; Labelbox has also launched competing leaderboards, creating competitive pressure in the model evaluation market.	Medium	SE001, SE007, SE015
CE015	The WMDP (Weapons of Mass Destruction Proxy) benchmark is a publicly available AI safety evaluation tool created by Scale; its adoption by the AI safety research community reinforces Scale's position as a trusted AI safety evaluator.	Medium	SE002, SE025
CE016	Scale AI holds SOC 2 Type II, ISO 27001, DoD IL4 Provisional Authorization, and FedRAMP High Authorization certifications, confirmed on the official Scale security page.	High	SE014, SE019
CE017	Scale AI signed the 2024 White House voluntary AI safety commitments covering RLHF safety practices, red-teaming, and responsible AI deployment.	High	SE024, SE002
CE018	Scale AI's annotation quality controls include task-specific quality guidelines, multiple annotator redundancy, statistical quality monitoring, expert reviewer escalation, and inter-annotator agreement scoring; however, no public third-party quality audit is available.	Medium	SE017, SE016
CE019	The TIME customer case study demonstrates Scale GenAI Platform deployment speed (under 2 months) and red-teaming capability (7,000+ attack vectors tested), providing partial public evidence of product performance.	Medium	SE027, SE018
CE020	Scale AI's data privacy and customer data handling practices are partially covered by SOC 2 Type II certification; however, the specific data isolation mechanisms for enterprise annotation customers are not publicly documented.	Medium	SE014, SE011
CE021	Scale AI's 2025 product roadmap is implicitly directed toward expanding the GenAI Platform, Donovan government deployments, and Scale Evaluation, while reducing investment in commodity annotation — consistent with the July 2025 restructuring.	Medium	SE021, SE018, SE020
CE022	Scale AI does not maintain a public product roadmap, changelog, or developer status page; forward-looking roadmap information must be inferred from public statements and strategic announcements.	High	SE011, SE012, SE017
CE023	Scale's GenAI Platform competes with Amazon Bedrock, Google Vertex AI, and Azure AI Studio; hyperscalers have significantly greater infrastructure scale, distribution, and resources, representing a structural competitive threat to Scale's platform business.	Medium	SE026, SE023, SE018
CE024	Scale AI has not publicly disclosed investment in synthetic data capabilities; if synthetic data quality matches human annotation for LLM training, Scale's Data Engine TAM could shrink materially — a critical product roadmap gap.	Medium	SE005, SE004, SE021
CE025	Snorkel AI's programmatic labeling approach, which uses AI-assisted weak supervision to generate training data with less human annotation, represents a technical threat to Scale's human-annotation-first model.	Medium	SE005, SE004
CE026	Scale AI's developer community presence is limited compared to AI infrastructure companies with open-source products; the Scale Leaderboard and WMDP benchmark are the primary developer-facing signals, generating reputational capital rather than active developer community engagement.	Medium	SE001, SE002, SE004
CE027	Scale AI's technology differentiation versus Labelbox is primarily in QA methodology and contributor quality management; Labelbox has built competing evaluation products (Labelbox Leaderboards) and an Expert Network, narrowing the quality gap.	Medium	SE007, SE006, SE017
CE028	The McKinsey State of AI 2025 confirms 62% of organizations experimenting with AI agents, validating Scale's strategic pivot toward agentic AI services (GenAI Platform, Donovan) as an addressable growth market.	High	SE026, SE025
CE029	Scale AI's RLHF product is affected by OpenAI's departure (wind-down confirmed June 2025) but expanding in relation to Meta's RLHF needs following the strategic investment; this creates dependency concentration risk on Meta as both investor and customer.	Medium	SE023, SE016, SE021
CE030	Scale AI's Scale Evaluation product has an independence perception risk: as a company with Meta as a strategic investor (~49% minority stake), its role as a neutral third-party LLM evaluator may be questioned by AI labs that compete with Meta.	Medium	SE023, SE001
CE031	Scale's DIU RCV (Robotic Combat Vehicle) program win confirms that Scale's technology meets defense procurement standards for autonomous military AI applications, validating Donovan's technical capability for defense missions.	Medium	SE003, SE020
CE032	Scale AI's annotation quality differentiation versus Snorkel AI is philosophical: Scale uses high-quality human annotation for accuracy; Snorkel uses AI-assisted weak supervision for scale and cost. The two approaches serve different customer segments and are both growing.	Medium	SE005, SE017, SE004
CE033	Scale AI's AI safety white paper on test and evaluation provides public documentation of Scale's approach to model evaluation methodology, supporting its positioning as a credible government AI evaluation authority.	Medium	SE002, SE024, SE015
CE034	Scale AI's cloud infrastructure dependencies (AWS/GCP/Azure) represent a supply chain risk: changes in cloud provider pricing or availability could affect annotation tooling uptime and GenAI Platform delivery.	Low	SE011, SE017, SE019
CE035	No publicly available independent third-party technical audit of Scale AI's annotation quality, QA pipeline accuracy, or GenAI Platform performance exists; all quality claims are company-asserted or based on single customer case studies.	High	SE027, SE017, SE016
CU001	Scale AI serves three primary customer segments: AI Labs and model developers, Fortune 500 enterprise customers, and U.S. government and defense agencies.	High	SU001, SU015
CU002	TIME deployed Scale AI's GenAI Platform in a production safety application, testing over 7,000 adversarial attack vectors against AI-generated content in under two months.	High	SU002, SU001
CU003	Meta holds approximately 49% of Scale AI's equity following the June 2025 strategic investment and is simultaneously an expanding RLHF data customer.	High	SU018, SU015
CU004	Cohere is listed as an AI lab customer on Scale AI's official customers page.	Medium	SU001
CU005	Etsy is listed as an enterprise customer on Scale AI's official customers page.	Medium	SU001
CU006	Instacart is listed as an enterprise customer on Scale AI's official customers page.	Medium	SU001
CU007	Pinterest is listed as an enterprise customer on Scale AI's official customers page.	Medium	SU001
CU008	Scale AI holds active Department of Defense contracts including a data curation contract for joint force operations and the Donovan platform for national security AI workflows.	High	SU004, SU005
CU009	Google, previously Scale AI's largest customer, planned to wind down or significantly reduce its Scale AI relationship in June 2025 following Meta's strategic investment.	High	SU019, SU018
CU010	OpenAI wound down its work with Scale AI in June 2025, as reported by CNBC.	High	SU020, SU015
CU011	Scale AI has paid over $1 billion to annotation contributors globally, per its official customers page.	Medium	SU001
CU012	Scale AI has processed over 15 billion human-labeled decisions, per its official customers page.	Medium	SU001
CU013	Scale AI laid off approximately 200 employees (14% of staff) and 500 contractors in July 2025, with the reductions concentrated in the data-labeling business.	High	SU016, SU015
CU014	Scale AI's GenAI Platform was deployed in a production environment at TIME Media for AI content safety testing, as documented in the official case study.	Medium	SU002
CU015	Scale AI holds active DoD IL4 and FedRAMP High certifications enabling deployment in classified and government environments.	High	SU003, SU004
CU016	Scale AI's DoD data curation contract for joint force operations was announced via official company blog post.	High	SU005, SU004
CU017	Scale AI offers a self-serve pricing tier with the first 1,000 labeling units free and pay-as-you-go access for developers and researchers.	Medium	SU027
CU018	Amazon, Cisco, Intel, AMD, and ServiceNow participated as strategic investors in Scale AI's May 2024 Series F round, providing customer-as-investor validation.	Medium	SU015
CU019	Scale AI has not publicly disclosed any net revenue retention (NRR) or gross revenue retention (GRR) metrics.	Low
CU020	Scale AI has not publicly disclosed its total active customer count across any segment.	Low
CU021	Scale AI's enterprise pricing structure includes custom pricing with dedicated operations teams and SLA commitments for enterprise customers.	Medium	SU027
CU022	Meta's information access as a 49% investor alongside its role as an RLHF customer creates a potential conflict of interest for other enterprise and AI lab customers evaluating Scale AI's data security and confidentiality.	Medium	SU018, SU019
CU023	Snorkel AI operates a public leaderboard and partners program targeting enterprise ML customers, competing in the same AI data infrastructure market as Scale AI.	Medium	SU007, SU008
CU024	SuperAnnotate maintains a public enterprise platform offering for annotation, competing with Scale AI's Data Engine in the enterprise annotation segment.	Medium	SU025
CU025	Mercor operates as a competitor in the AI annotation and evaluation marketplace, with enterprise-facing product pages and active hiring.	Medium	SU013, SU017
CU026	Scale AI's Donovan platform serves the defense and intelligence community with AI agent workflows designed for classified national security operations.	High	SU006, SU003
CU027	Google's departure from Scale AI was driven by competitive conflict concerns arising from Meta's strategic investment, per CNBC sourcing.	Medium	SU019
CU028	OpenAI's wind-down of Scale AI work coincided with founder Alexandr Wang's departure to join Meta, suggesting structural realignment of AI lab annotation sourcing.	Medium	SU020, SU015
CU029	The simultaneous departure of Google and OpenAI in Q2 2025 represents an estimated 20-40% reduction in Scale AI's AI lab segment revenue, based on their reported prominence as major customers.	Low	SU019, SU020
CU030	Scale AI's government and defense customers have the highest switching costs of any segment due to multi-year contracts, IL4/FedRAMP certification requirements, and classified-environment integration barriers.	Medium	SU003, SU004
CU031	Enterprise customers Etsy, Instacart, and Pinterest appear on Scale AI's official customers page without associated case studies, outcome metrics, or deployment scope details.	Medium	SU001
CU032	No G2, Gartner Peer Insights, or Capterra reviews were identified for Scale AI's enterprise platform, leaving customer satisfaction entirely unquantified.	Low
CU033	Scale AI's July 2025 layoffs concentrated in data-labeling signal that RLHF and annotation throughput from AI lab customers declined materially in H1 2025.	Medium	SU016
CU034	Scale AI filed a lawsuit against Mercor in September 2025 alleging customer poaching and misappropriation of trade secrets by a former employee.	Medium	SU017
CU035	Scale AI's documented land-and-expand model involves starting with data annotation, then upselling to RLHF, evaluation, and the GenAI Platform as customer needs mature.	Medium	SU001, SU026
CU036	Scale AI's customer base is predominantly U.S.-based, with no publicly disclosed international customer count or revenue split by geography.	Medium	SU001, SU003
CU037	Scale AI has not publicly disclosed any annual or quarterly customer churn rate for any segment.	Low
CU038	Scale AI's AI lab customer segment is the most volatile, with Google and OpenAI both exiting in Q2 2025 and remaining labs facing structural pressure toward in-house annotation.	Medium	SU019, SU020
CU039	Snorkel AI's press page and partner program confirm active enterprise customer development activity, corroborating that the enterprise AI data market remains competitive.	Low	SU009, SU008
CU040	Appen's enterprise case studies indicate ongoing demand for AI annotation services in comparable market segments, validating Scale's addressable market despite attrition events.	Low	SU023
CU041	Meta's expanding RLHF customer relationship with Scale AI, combined with its ~49% equity stake, creates an unprecedented customer-investor concentration that could deter other AI lab customers.	Medium	SU018, SU019
CU042	Scale AI's self-serve tier pricing structure was publicly accessible as of the research date, confirming a lower-commitment entry point for the developer and researcher segment.	Medium	SU027
CR001	Customer concentration risk is the highest-severity risk facing Scale AI, with Google and OpenAI both departing as customers in Q2 2025 following the Meta investment.	High	SR005, SR006
CR002	CEO and leadership transition risk is high: founder Alexandr Wang departed June 2025 to join Meta, replaced by interim CEO Jason Droege who lacks direct experience in Scale's core government and enterprise AI markets.	High	SR001, SR004
CR003	Meta's ~49% equity stake combined with its role as Scale AI's largest customer creates an unprecedented concentration of investor-customer governance risk that is structurally unusual in the venture-backed AI sector.	High	SR003, SR005
CR004	Scale AI is executing a business model pivot from annotation-volume revenue to enterprise GenAI Platform and government Donovan revenue under adverse conditions of active revenue attrition and CEO transition.	High	SR001, SR002
CR005	Scale AI filed a lawsuit against Mercor in September 2025 alleging customer poaching and misappropriation of trade secrets by a former employee, confirmed by TechCrunch reporting.	High	SR011, SR012
CR006	Scale AI's White House AI safety voluntary commitments (2024) demonstrate proactive regulatory engagement but do not constitute a binding compliance shield against future mandatory AI regulations.	High	SR009, SR013
CR007	Scale AI holds DoD IL4 and FedRAMP High certifications, SOC 2 Type II, and ISO 27001, representing a mature security compliance posture for government and enterprise customers.	High	SR007, SR008
CR008	Scale AI laid off approximately 200 employees (14% of staff) and 500 contractors in July 2025, concentrated in data labeling, creating talent retention risk and potential operational quality degradation.	High	SR002, SR001
CR009	U.S. Congressional AI oversight is active, with Scale AI having testified before the House of Representatives about AI capabilities and risks, indicating regulatory attention to AI infrastructure companies.	Medium	SR013
CR010	The EU AI Act creates compliance obligations for AI training data providers serving EU enterprise customers, including potential classification of high-risk AI system data as requiring additional oversight.	Medium	SR027
CR011	Scale AI's government defense work involving AI-enabled autonomy programs may attract ITAR and export control scrutiny if technology transfer to non-U.S. entities is involved in the data pipeline.	Medium	SR008, SR024
CR012	Scale AI has made proactive AI safety commitments including the WMDP harmful-knowledge benchmark for evaluating dual-use AI risks, reducing its regulatory exposure on AI safety grounds.	Medium	SR030, SR009
CR013	No publicly disclosed data security incidents or breaches affecting Scale AI's enterprise or government customers were identified in this research as of the run date.	Medium	SR007
CR014	Scale AI's compliance with U.S. AI safety voluntary commitments and its test and evaluation white paper reduce its near-term regulatory enforcement exposure from the current U.S. AI governance framework.	Medium	SR009, SR025
CR015	Scale AI's annotation quality is a key operational risk: any quality degradation in RLHF data following the 500-contractor reduction could impair enterprise and government contract performance.	Medium	SR002
CR016	Scale AI's dependency on a global annotation contributor network creates supply-chain risk, particularly as Mercor and other platforms actively compete for the same annotation workforce.	Medium	SR011, SR020
CR017	Scale AI's data security posture is supported by SOC 2 Type II, ISO 27001, DoD IL4, and FedRAMP High certifications, reducing but not eliminating the risk of data breach affecting customer AI training data.	Medium	SR007
CR018	Scale AI's business model pivot from annotation-volume to platform revenue represents a high-severity execution risk: the timeline, conversion rate, and revenue replacement pace are unknown and unconfirmed publicly.	Medium	SR010, SR014
CR019	Scale AI's July 2025 layoffs and contractor reductions signal that the data-labeling segment volume has declined materially, creating operational capacity risk if the enterprise GenAI Platform requires rapid scaling of new annotation workflows.	Medium	SR002
CR020	Scale AI's Meta dependency encompasses three simultaneous roles: investor (~49% equity), customer (expanding RLHF), and former employer of departed CEO Wang, creating a concentration with no structural firewall confirmed publicly.	Medium	SR003, SR004
CR021	Scale AI's government customer segment is a positive dependency: multi-year DoD contracts provide a revenue floor, but they also create concentration in U.S. government budget cycles and contract renewal processes.	Medium	SR008, SR024
CR022	Scale AI's cloud infrastructure dependency on AWS and similar providers creates a platform concentration risk that is partially mitigated by its FedRAMP High certification requirements for government-grade infrastructure.	Medium	SR007, SR008
CR023	Appen, a public comparable to Scale AI, experienced significant customer attrition and revenue decline when AI lab customers reduced annotation spend, providing a cautionary precedent for Scale's current trajectory.	Medium	SR028
CR024	Jason Droege's appointment as interim CEO was confirmed in June 2025; his background includes founding Uber Eats (scaled to $19B GMV) and VC partnership at Benchmark, but not direct government AI or enterprise data platform leadership.	Medium	SR004, SR001
CR025	Alexandr Wang retains a board seat at Scale AI despite departing as CEO to join Meta, creating a potential conflict of interest at the governance level that has not been publicly addressed.	Medium	SR001, SR003
CR026	Scale AI has not publicly disclosed its current cash position, quarterly burn rate, or expected runway following the Meta transaction proceeds being distributed to shareholders rather than retained for operations.	Low
CR027	Scale AI's government relationship managers hold classified clearances and institutional knowledge that represent irreplaceable assets; their retention is critical to DoD contract renewal and expansion.	Medium	SR008, SR019
CR028	The most significant thesis-break triggers for Scale AI are: any government contract cancellation, any additional non-AI-lab customer citing Meta conflict, or the GenAI Platform failing to replace AI lab ARR within 18 months.	Medium	SR005, SR003
CR029	Synthetic data generation capabilities in frontier AI models are reducing the volume of human-annotated RLHF data required per training run, creating structural demand reduction for Scale's core annotation service.	Medium	SR015, SR014
CR030	SuperAnnotate's enterprise platform and foundation-model-builder solutions pages indicate competitive expansion into Scale AI's core annotation market segments.	Low	SR021, SR022
CR031	Labelbox's enterprise platform (product/platform page) indicates continued investment in annotation infrastructure competing with Scale AI's Data Engine for enterprise customers.	Low	SR026
CR032	Scale AI's evaluation and test platform for government AI safety is documented in an official white paper, demonstrating product differentiation in the government AI market beyond annotation.	Medium	SR025, SR023
CR033	Scale AI's headcount of approximately 1,000 employees post-layoff represents a significant reduction in operational capacity that may affect the speed of enterprise GenAI Platform go-to-market.	Medium	SR002
CR034	The Mercor lawsuit (Scale AI v. Mercor, NDCA 2025) creates ongoing legal uncertainty and management distraction, with discovery potentially exposing internal customer contract terms and competitive intelligence.	Medium	SR011, SR012
CR035	Scale AI's Reuters coverage of the Google departure (rate-limited) corroborates the CNBC reporting, providing multi-source confirmation of the customer concentration event.	High	SR016, SR005
CR036	Scale AI's Reuters coverage of the Meta deal as a test of AI partnerships (rate-limited) provides an independent perspective on the concentration and governance risks of the Meta-Scale relationship.	Medium	SR017, SR003
CR037	Scale AI's active hiring page indicates continued investment in talent despite the July 2025 layoffs, suggesting the company is selectively rebuilding capacity in growth areas.	Low	SR019
CR038	Scale AI's competitors SuperAnnotate and Appen are expanding into frontier model alignment and annotation use cases, intensifying competitive pressure on Scale's core market.	Low	SR020, SR021
CR039	Scale AI's DoD data curation contract for joint force operations demonstrates deep government integration that creates positive dependency (switching costs) but requires ongoing performance and security compliance.	Medium	SR029, SR008
CR040	The combination of Google's departure, OpenAI's wind-down, and the Mercor lawsuit in a single quarter represents an unprecedented adverse events cluster for Scale AI's customer and competitive positioning.	Medium	SR005, SR006, SR011
CV001	Scale AI's implied valuation of approximately $29B derives from Meta's June 2025 purchase of approximately 49% of the company for $14.3B, confirmed by TechCrunch, CNBC, and Reuters.	High	SV011, SV013
CV002	Scale AI's estimated ARR is $200–500M as of the run date, derived from funding round valuation history and typical revenue multiples; no official ARR disclosure was made publicly.	High	SV010, SV015
CV003	The revenue multiple implied by the $29B valuation is 58x–145x ARR depending on the ARR estimate ($200M–$500M), representing an extreme premium even by frontier AI company standards.	High	SV013, SV011
CV004	The $29B implied valuation was set by Meta acting as a strategic acquirer seeking proprietary RLHF infrastructure, not as a financial investor optimizing for financial return — a critical valuation discipline distinction.	High	SV013, SV011
CV005	Scale AI's investment thesis rests on three structural pillars: government/defense moat with high switching costs, nine-year data infrastructure and annotation platform that is difficult to replicate, and Meta's strategic validation as an expanding customer and investor.	High	SV019, SV021
CV006	The government/defense segment creates a valuation floor of $3–5B for Scale AI independent of its commercial AI lab business, based on DoD IL4/FedRAMP High certifications and multi-year contract structures.	High	SV019, SV028
CV007	The anti-thesis for Scale AI at $29B includes: two largest customers departed (Google, OpenAI), CEO transitioned, business model mid-pivot, annotation commoditization underway, and Meta conflict deters new AI lab customers.	High	SV014, SV023
CV008	Enterprise GenAI Platform ARR is not publicly disclosed; the TIME Media case study (7,000+ attack vectors tested, <2-month deployment) is the only confirmed public enterprise production deployment.	High	SV024, SV018
CV009	The recommendation for Scale AI is Research-More with Conditional Pass at entry below $15B implied: base case fair value is $8–12B, bull case fair value is $17.5–25.5B, bear case implies $1.5–2B.	High	SV010, SV015
CV010	Scale AI's May 2024 Series F valued the company at $13.8B, which was approximately 46–69x estimated ARR of $200–300M at that time; the Meta deal doubled this valuation in 13 months despite adverse customer news.	Medium	SV011, SV013
CV011	Scale AI's Meta transaction proceeds were distributed to existing shareholders rather than retained as operating capital, leaving the company's actual working capital and cash runway uncertain.	Medium	SV011
CV012	Scale AI has not publicly disclosed its preference stack, liquidation structure, or dilution overhang, making it impossible to model common equity returns in the bear case without private data room access.	Low
CV013	At a $29B entry valuation, the base case ($8–11B fair value) implies a 60–70% markdown for a financial investor, making the current entry price financially destructive under any scenario that assigns >40% probability to the base case.	Medium	SV010, SV015
CV014	Financial investors co-investing at the Meta-implied $29B are paying a strategic acquisition premium designed for Meta's proprietary RLHF data needs, not for financial return optimization.	Medium	SV013
CV015	Bull case (20% probability): Government contracts renew >90%, GenAI Platform reaches $150M+ ARR by 2027, Meta expands to $300M+ ARR; total ARR reaches $700–850M, implied valuation $17.5–25.5B at 25–30x multiple.	Medium	SV010, SV019
CV016	Base case (55% probability): Government contracts renew at 85%, GenAI Platform reaches $75–100M ARR by 2027, Meta ARR stable; total ARR reaches $350–450M, implied valuation $7.7–11.0B at 20–25x multiple.	Medium	SV010, SV015
CV017	Bear case (25% probability): Government contract non-renewal or delay, GenAI Platform <$50M ARR, Meta reduces purchasing; total ARR falls to $150–200M, implied valuation $1.5–2.0B at 10x multiple.	Medium	SV009, SV014
CV018	The scenario probability distribution is asymmetric: the bear case requires only one adverse event (government contract non-renewal) while the bull case requires multiple concurrent optimistic outcomes, creating unfavorable expected value at entry.	Medium	SV010, SV015
CV019	Appen (ASX: APX) is the primary negative comparable: a public annotation company whose market cap fell from ~AUD $3.5B to ~AUD $250M (>90% decline) following AI lab RLHF customer attrition in 2022–2024.	Medium	SV009, SV026
CV020	Appen's revenue decline demonstrates that annotation commoditization and AI lab attrition can reduce a public annotation company's value by 90%+ within 24 months — the clearest available data point for Scale AI's downside risk.	Medium	SV009, SV026
CV021	Palantir (NYSE: PLTR) provides the government AI platform comparable: commanding 25–50x ARR and $100B+ market cap on the basis of deep DoD/IC integrations and multi-decade government relationships with high switching costs.	Medium	SV003, SV015
CV022	Palantir's sustained government contract renewals over 15+ years justify its premium multiple; Scale AI's government track record is shorter (est. 3–5 years of active DoD contracts), justifying a discount relative to Palantir's multiple.	Medium	SV019, SV015
CV023	Labelbox (private, ~$1B estimated valuation at ~$80–120M ARR) provides the annotation-infrastructure-without-government-moat comparable, suggesting an 8–12x ARR multiple for annotation platforms absent government contracts.	Low	SV002, SV008
CV024	The delta between Labelbox's 8–12x multiple and Scale AI's 58–145x multiple implies the market is pricing Scale's government moat and Meta strategic option value at approximately $20B+ in premium above annotation infrastructure value.	Low	SV010, SV019
CV025	Crunchbase confirms Scale AI has raised approximately $1.6B in equity prior to the Meta deal, indicating significant accumulated preference stack that must be waterfall-analyzed for common equity return modeling.	Medium	SV030, SV011
CV026	Scale AI's 2024 Series F valuation ($13.8B) was confirmed by multiple independent news sources including TechCrunch and CNBC, establishing the pre-Meta deal anchor valuation for multiple analysis.	Medium	SV011, SV013
CV027	Scale AI's government contract revenue is the primary differentiator from Appen in the comparable set; without this floor, Scale AI's annotation revenue would command an Appen-equivalent multiple of 0.5–2x ARR.	Medium	SV019, SV009
CV028	The McKinsey State of AI report confirms sustained enterprise AI adoption and growing demand for AI data infrastructure, providing market size validation for Scale AI's enterprise GenAI Platform growth scenario.	Medium	SV010, SV015
CV029	The recommended entry ceiling for a financial investor is below $15B implied valuation; this provides positive expected value in the base case and does not require bull case assumptions to recover investment.	Medium	SV010, SV015
CV030	The thesis-break triggers for Scale AI are: government contract cancellation (single event), Meta ARR decline >20% QoQ for two quarters, or enterprise GenAI Platform ARR below $30M as of Q4 2026.	Medium	SV014, SV019
CV031	The six final diligence asks are: ARR by segment (post-attrition), government contract schedule, cash position/runway, enterprise GenAI Platform ARR, management retention plan, and Meta governance firewall documentation.	Medium	SV010, SV016
CV032	Scale AI's hiring page (scale.com/careers) shows continued active hiring in government AI and enterprise roles, suggesting the company is investing in growth segments despite the July 2025 layoffs.	Low	SV001, SV016
CV033	Scale AI's White House AI safety voluntary commitments and Congressional testimony differentiate it from pure annotation competitors by demonstrating regulatory engagement that creates barriers to entry in government markets.	Medium	SV021, SV025
CV034	Palantir's investor relations page and Appen's ASX filings confirm that publicly-listed government AI and annotation companies provide the best available financial comparables for Scale AI given its unique position straddling both markets.	Medium	SV003, SV007
CV035	SuperAnnotate's enterprise and platform product pages (broken/404 as of research date) suggest active web presence investment in Scale AI's core enterprise annotation market, though page access was unavailable for detailed analysis.	Low	SV005, SV006
CV036	The OpenAI-Scale AI 2023 partnership (TechCrunch, now broken URL) represents the historical peak of the AI lab customer relationship that has since wound down, contextualizing the scale of revenue loss.	Low	SV004
CV037	Scale AI's July 2025 layoffs (14% of staff, concentrated in data labeling) are consistent with the pivot narrative but also signal that the annotation volume decline is material enough to require immediate cost reduction.	Medium	SV012, SV011
CV038	Appen's investor relations page (ASX-listed) provides publicly accessible financial data showing annotation company revenue trajectory under AI lab attrition, the best available public financial proxy for Scale AI's downside scenario.	Medium	SV009, SV026
CV039	Scale AI's government contract program for DoD data curation for joint force operations demonstrates active government deployment beyond evaluation-only contracts, supporting the multi-year government ARR thesis.	Medium	SV028, SV029
CV040	The McKinsey State of AI 2025 report confirms enterprise AI adoption is accelerating, with organizations increasing AI investment and expanding deployment — validating Scale AI's enterprise GenAI Platform market opportunity even as the annotation-volume segment commoditizes.	Medium	SV010, SV015

Sources
ID	Publisher	Title	Quote
SO001	Scale AI	About Scale AI	We provide high-quality data and full-stack technologies to help organizations develop AI systems.
SO002	Scale AI	Scale AI Announces Next Phase of Company Evolution	Jason Droege will serve as Interim CEO as Scale enters its next phase.
SO003	Scale AI	Scale AI Pricing	Enterprise pricing with dedicated operations and SLAs; self-serve with first 1000 units free.
SO004	Scale AI	Scale AI Security Overview	Scale is committed to the highest security standards for enterprise and government customers.
SO005	Scale AI	Scale AI Legal / Security Certifications	Scale holds SOC 2 Type II, ISO 27001, DoD IL4 Provisional Authorization, and FedRAMP High certifications.
SO006	Scale AI	Scale Donovan — Defense AI Platform	Donovan enables mission-critical AI workflows for defense and intelligence agencies.
SO007	Scale AI	Scale AI Public Sector	Scale supports U.S. government agencies with AI data and evaluation capabilities.
SO008	Scale AI	Scale Data Engine	The Scale Data Engine collects, curates, annotates, and validates data for AI model training.
SO009	Scale AI	Scale Evaluation — Model Developers	Scale provides trusted evaluation for AI model capability and safety.
SO010	Scale AI	Scale AI Customers	Scale works with leading enterprises and government agencies.
SO011	Scale AI	Scale AI Customer Case Study: TIME	TIME deployed GenAI in under 2 months with 7,000+ attack vectors tested using Scale.
SO012	Scale AI	Scale RLHF	Scale RLHF provides high-quality human feedback data for large language model alignment.
SO013	Scale AI	Scale AI White House Voluntary AI Safety Commitments	Scale is committed to the voluntary AI safety commitments established by the White House.
SO014	Scale AI	Scale DoD Data Curation Contract — Joint Force	Scale has secured a DoD contract to curate data for joint force AI operations.
SO015	TechCrunch	Data-labeling startup Scale AI raises $1B as valuation doubles to $13.8B	Scale AI raised $1 billion Series F at a $13.8 billion valuation, led by Accel.
SO016	TechCrunch	Scale AI confirms significant investment from Meta, says CEO Alexandr Wang is leaving	Scale AI confirms Meta's significant investment and the departure of CEO Alexandr Wang.
SO017	TechCrunch	Scale AI lays off 14% of staff, largely in data labeling business	Scale AI cut 200 employees (14% of staff) and 500 contractors, largely in the data-labeling business.
SO018	TechCrunch	Scale AI is suing a former employee and rival Mercor alleging they tried to steal its biggest customers	Scale AI filed suit against Mercor and a former employee, alleging an attempt to poach Scale's largest customers.
SO019	CNBC	Zuckerberg makes Meta's biggest bet on AI: $14 billion Scale AI deal	Meta is paying approximately $14.3 billion for a minority stake in Scale AI.
SO020	CNBC	Google, Scale AI's largest customer, plans split after Meta deal, sources say	Google, Scale AI's largest customer, is planning to wind down its Scale relationship following Meta's investment.
SO021	CNBC	OpenAI is winding down its work with Scale AI; founder is joining Meta	OpenAI is winding down its work with Scale AI, and Scale's founder Alexandr Wang is joining Meta.
SO022	Stanford HAI	Stanford HAI AI Index Report 2025	AI adoption and investment have accelerated sharply across industries in 2024-2025.
SO023	U.S. Congress	House Event on AI Safety and Data — 118th Congress	Congressional hearing on AI data safety and evaluation standards.
SO024	OpenAI	Introducing Improvements to the Fine-Tuning API and Expanding Our Custom Models Program	OpenAI expands custom model programs, signaling continued demand for specialized AI training services.
SO025	Scale AI	Scale AI Readiness Report	Organizations that invest in AI data quality and evaluation infrastructure achieve faster AI maturity.
SO026	Scale AI	Scale AI Global Public Sector	Scale serves public sector organizations globally with AI data and evaluation services.
SO027	Benchmark Capital	Jason Droege — Benchmark Partner Profile	Jason Droege founded Uber Eats and served as VP at Uber before joining Benchmark.
SO028	McKinsey & Company	The State of AI — McKinsey Global Survey 2025	88% of organizations now use AI in at least one business function, up from 78% in 2024.
SO029	Menlo Ventures	2025 The Enterprise AI Report	Enterprise AI adoption is accelerating with a focus on data quality and model evaluation.
SO030	PwC	PwC AI Jobs Barometer 2024	AI-related jobs and AI infrastructure spending are growing at 3-4x the rate of non-AI technology roles.
SM001	Stanford HAI	Stanford HAI AI Index 2025 — Overview	AI investment and adoption have accelerated at an unprecedented pace across 2024–2025.
SM002	Snorkel AI	Snorkel AI Homepage	Snorkel provides programmatic data labeling for enterprise AI.
SM003	Labelbox	Why Labelbox	Labelbox is the leading data labeling and model evaluation platform for enterprise AI teams.
SM004	Appen	About Appen	Appen provides AI training data and model evaluation services globally to enterprise and government customers.
SM005	Surge AI	Surge AI Homepage	Surge AI provides high-quality data for RLHF and LLM training.
SM006	Invisible Technologies	Invisible Technologies Homepage	Invisible provides AI-powered operations and data services for enterprise clients.
SM007	SuperAnnotate	SuperAnnotate Homepage	SuperAnnotate is an end-to-end AI training data platform for enterprise.
SM008	Mercor	Mercor Homepage	Mercor connects companies with AI talent for model training, evaluation, and data labeling.
SM009	Scale AI	Scale Evaluation — Public Sector	Scale provides trusted AI evaluation services for defense and intelligence agencies.
SM010	Scale AI	Scale Generative AI Data Engine	The Scale Generative AI Data Engine enables enterprises to build custom GenAI applications.
SM011	Scale AI	Scale Public Sector Data Engine	Scale's Public Sector Data Engine provides defense-grade data curation and annotation.
SM012	McKinsey & Company	The State of AI — McKinsey Global Survey 2025	88% of organizations now use AI in at least one business function, up from 78%; 62% experimenting with AI agents.
SM013	Stanford HAI	Stanford HAI AI Index Report 2025 — Full Report	Global AI investment reached record levels in 2024; enterprise AI adoption is accelerating sharply.
SM014	Menlo Ventures	2025 The Enterprise AI Report	Enterprise AI is at an inflection point with spending concentrated in data quality and model evaluation.
SM015	PwC	PwC AI Jobs Barometer 2024	AI-related job postings and AI infrastructure spending are growing at 3-4x non-AI technology rates.
SM016	Scale AI	About Scale AI	Scale serves AI labs, enterprises, and government agencies with data and AI infrastructure.
SM017	Scale AI	Scale Data Engine	Scale Data Engine is the industry-leading platform for AI training data collection and curation.
SM018	TechCrunch	Data-labeling startup Scale AI raises $1B as valuation doubles to $13.8B	Scale AI's $13.8B valuation implies significant investor confidence in the AI data services market.
SM019	TechCrunch	Scale AI lays off 14% of staff, largely in data labeling business	Scale's pivot away from data-labeling signals changing market dynamics for pure-play annotation vendors.
SM020	OpenAI	Introducing Improvements to the Fine-Tuning API and Expanding Our Custom Models Program	OpenAI's expansion of fine-tuning and custom model programs demonstrates sustained demand for specialized AI training data.
SM021	CNBC	Zuckerberg makes Meta's biggest bet on AI: $14 billion Scale AI deal	Meta's $14B investment validates Scale AI's market position in AI infrastructure.
SM022	Scale AI	Scale AI Readiness Report	Organizations investing in AI data quality and model evaluation achieve faster AI maturity.
SM023	Scale AI	Scale AI Customers	Scale serves a diverse customer base including AI labs, Fortune 500 enterprises, and government agencies.
SM024	Scale AI	Scale RLHF	Scale RLHF provides the highest-quality human feedback data for LLM training and alignment.
SM025	U.S. Congress	House Event on AI Safety and Data — 118th Congress	Congressional hearing highlights growing government focus on AI data standards and safety evaluation.
SP001	TechCrunch	Scale AI is suing a former employee and rival Mercor alleging they tried to steal its biggest customers	Scale AI sued Mercor and a former employee alleging they attempted to poach Scale's largest customers.
SP002	Scale AI	Scale AI Legal / Security Certifications	Scale holds DoD IL4 Provisional Authorization and FedRAMP High certifications.
SP003	Scale AI	Scale Donovan — Defense AI Platform	Donovan is Scale's specialized AI agent platform for defense and intelligence missions.
SP004	Appen	Appen Homepage	Appen provides AI training data services to global enterprise and government customers.
SP005	Appen	Appen Agentic AI	Appen is expanding into agentic AI services to address the growing LLM evaluation market.
SP006	Appen	Appen Model Evaluation and Integrity	Appen provides model evaluation and integrity services for enterprise AI teams.
SP007	Labelbox	Why Labelbox	Labelbox is the leading platform for enterprise AI data labeling, RLHF, and model evaluation.
SP008	Labelbox	Labelbox Evals	Labelbox Evals provides model evaluation and safety testing for enterprise AI teams.
SP009	Labelbox	Labelbox RL Data	Labelbox RL-Data provides reinforcement learning data for LLM training and alignment.
SP010	Labelbox	Labelbox Expert Network	Labelbox Expert Network provides quality-critical annotation by domain experts.
SP011	Labelbox	Labelbox Robotics	Labelbox provides AI training data for robotics and physical AI applications.
SP012	Snorkel AI	Snorkel AI About	Snorkel AI was founded to make AI data labeling faster, cheaper, and more accessible through programmatic techniques.
SP013	Snorkel AI	Snorkel AI Research	Snorkel AI's research on weak supervision and programmatic labeling reduces annotation cost.
SP014	Mercor	Mercor Research	Mercor conducts research on AI talent matching and high-quality human feedback collection.
SP015	SuperAnnotate	SuperAnnotate Security	SuperAnnotate prioritizes enterprise security with SOC 2 compliance and data encryption.
SP016	Labelbox	Labelbox Pricing	Labelbox offers tiered pricing from developer self-serve to enterprise custom plans.
SP017	Surge AI	Surge AI Homepage	Surge AI provides high-quality human feedback data for LLM training and RLHF.
SP018	Appen	Appen Data Security	Appen provides data security and compliance features for enterprise and government customers.
SP019	Appen	Appen Investors	Appen is publicly listed on the ASX; financial results available for market proxy analysis.
SP020	Appen	Appen Platform	Appen's platform provides end-to-end AI data collection, annotation, and quality assurance.
SP021	CNBC	Google, Scale AI's largest customer, plans split after Meta deal, sources say	Google is planning to wind down its Scale AI relationship following Meta's strategic investment.
SP022	CNBC	OpenAI is winding down its work with Scale AI; founder is joining Meta	OpenAI is winding down its work with Scale AI following the Meta strategic investment.
SP023	TechCrunch	Scale AI confirms significant investment from Meta, says CEO Alexandr Wang is leaving	Scale AI confirms Meta's strategic investment, which has created competitive conflict with Google and OpenAI.
SP024	TechCrunch	Scale AI lays off 14% of staff, largely in data labeling business	Scale AI's layoffs in data-labeling confirm management's view that the commodity annotation segment is under structural pressure.
SP025	Scale AI	Scale Evaluation — Model Developers	Scale Evaluation provides trusted model benchmarking and safety evaluation for AI labs and enterprises.
SP026	U.S. House of Representatives	House Committee Hearing on AI Data and Safety — Alexandr Wang Testimony	Scale AI founder Alexandr Wang testified before Congress on AI data quality and safety, reinforcing Scale's positioning as the trusted government AI data partner.
SI001	Scale AI	Scale GenAI Platform — Docs	Scale's GenAI Platform enables enterprises to build and deploy custom AI applications powered by their proprietary data.
SI002	Scale AI	Scale API — Introduction to Scale API	The Scale API provides programmatic access to annotation, RLHF, and evaluation services.
SI003	Scale AI	Scale GenAI Platform API Reference	Scale GenAI Platform API enables enterprise integration with Scale's AI application development infrastructure.
SI004	Scale AI	Scale Blog — Custom LLMs	Scale enables enterprises to build custom LLMs from their proprietary data using Scale's GenAI Platform.
SI005	Scale AI	Scale Blog — Autonomy Table Stakes (DoD)	Scale provides the autonomy data layer for DoD programs, a critical component of defense AI infrastructure.
SI006	Scale AI	Scale Blog — Test and Evaluation White Paper	Scale's test and evaluation capabilities support DoD AI program requirements and model safety assessment.
SI007	Appen	Appen Press Releases — ASX Financial Disclosures	Appen's press releases disclose financial results including revenue trends and margin data as ASX-listed public company.
SI008	Appen	Appen Case Studies	Appen case studies show annotation use cases comparable to Scale AI's enterprise annotation segment.
SI009	Appen	Appen Multimodal AI	Appen provides multimodal AI training data including image, video, and audio annotation services.
SI010	Appen	Appen Physical AI	Appen provides training data for physical AI including robotics and autonomous systems.
SI011	Appen	Appen Speech and Audio Training Data	Appen offers speech and audio annotation data services as a core product line.
SI012	Labelbox	Labelbox Research	Labelbox research covers data quality, annotation best practices, and AI model evaluation methods.
SI013	Scale AI	Scale AI Pricing	Scale offers self-serve pricing (first 1,000 units free, pay-as-you-go beyond) and enterprise custom contracts.
SI014	Scale AI	Scale AI About	Scale has paid over $1 billion to its contributor network globally, reflecting the labor intensity of the annotation business.
SI015	TechCrunch	Scale AI raises $1B Series F as valuation doubles to $13.8B	Scale AI raised $1 billion in Series F funding at a $13.8 billion valuation, led by Accel, with Amazon, Meta, Cisco, Intel, AMD, and ServiceNow as new investors.
SI016	TechCrunch	Scale AI lays off 14% of staff, largely in data labeling business	Scale AI laid off approximately 200 employees (14% of staff) and 500 contractors, primarily targeting its data-labeling business, signaling a strategic pivot away from commodity annotation.
SI017	CNBC	Zuckerberg makes Meta's biggest bet on AI: $14 billion Scale AI deal	Meta's investment of approximately $14.3 billion for a minority stake in Scale AI implies a valuation of over $29 billion.
SI018	CNBC	Google, Scale AI's largest customer, plans split after Meta deal	Google, Scale AI's largest customer, is planning to wind down its Scale relationship following Meta's strategic investment.
SI019	TechCrunch	Scale AI confirms significant investment from Meta, says CEO Alexandr Wang is leaving	Scale AI confirmed the Meta investment and Alexandr Wang's transition to Meta, with Jason Droege becoming Interim CEO.
SI020	Scale AI	Scale AI RLHF	Scale RLHF provides quality human feedback data for LLM training and alignment at scale.
SI021	Scale AI	Scale AI Blog — Next Phase of Company Evolution	Jason Droege as Interim CEO announced Scale's strategic pivot toward enterprise and government AI platform services.
SI022	CNBC	OpenAI is winding down its work with Scale AI	OpenAI is winding down its work with Scale AI, representing the loss of a second major AI lab customer following the Meta investment.
SI023	McKinsey	McKinsey State of AI 2025	88% of organizations now use AI in at least one business function, up from 78%, while 62% are experimenting with AI agents.
SI024	Scale AI	Scale AI Customers	Scale AI serves customers including Meta, Cohere, Etsy, Instacart, and the U.S. government.
SI025	Stanford HAI	Stanford HAI AI Index 2025	AI investment and adoption are accelerating across industries; enterprise AI infrastructure spending is growing materially.
SI033	OpenAI	OpenAI Fine-Tuning API and Custom Models Program	OpenAI's custom model program expands enterprise fine-tuning capabilities, representing a market adjacent to Scale AI's RLHF and model customization services.
SI034	U.S. House of Representatives	House Committee Hearing on AI Safety and Data — 118th Congress	Congressional AI hearing context confirms government interest in AI data quality and safety, supporting Scale AI's government revenue positioning.
SI035	Surge AI	Surge AI — RLHF Data for AI Labs	Surge AI provides premium RLHF data for AI labs, representing a direct competitor to Scale AI's RLHF revenue segment.
SE001	Scale AI	Scale Blog — Scale Leaderboard	Scale Leaderboard provides public LLM performance rankings used by AI researchers and developers to compare model capabilities.
SE002	Scale AI	Scale Blog — Measuring and Mitigating Risk with WMDP	Scale's WMDP benchmark evaluates whether LLMs can be used to generate dangerous content, measuring AI safety for dual-use knowledge domains.
SE003	Scale AI	Scale Blog — Scale DIU RCV Program	Scale partnered with the Defense Innovation Unit on the Robotic Combat Vehicle (RCV) program, providing AI data services for autonomous military systems.
SE004	Snorkel AI	Snorkel AI Customer Stories	Snorkel AI customer stories showcase enterprise use cases for programmatic labeling, a competing approach to Scale's human-annotation model.
SE005	Snorkel AI	Snorkel AI How It Works	Snorkel AI uses weak supervision and programmatic labeling to generate training data with significantly reduced human annotation effort.
SE006	Labelbox	Labelbox Customers	Labelbox serves enterprise customers with annotation and evaluation needs, competing directly with Scale AI's Data Engine.
SE007	Labelbox	Labelbox Leaderboards	Labelbox has launched leaderboards competing with Scale's Leaderboard for the LLM evaluation market.
SE008	SuperAnnotate	SuperAnnotate Learning Hub	SuperAnnotate's learning hub provides annotation best practices and workflow documentation, reflecting its approach to annotation quality management.
SE009	Mercor	Mercor Blog	Mercor's blog covers AI talent matching and RLHF data collection, reflecting its competitive approach to Scale's annotation market.
SE010	Mercor	Mercor Expert Network	Mercor's expert network is building an alternative contributor supply chain to compete with Scale's proprietary annotator network.
SE011	Scale AI	Scale GenAI Platform Docs	Scale GenAI Platform provides enterprise AI application development with LLM customization, RAG pipelines, and production deployment.
SE012	Scale AI	Scale API Reference — Introduction	The Scale API provides programmatic access to annotation, RLHF, and evaluation services with REST interface and webhook support.
SE013	Scale AI	Scale GenAI Platform API Reference	Scale GenAI Platform API enables enterprise integration with Scale's AI application development and deployment infrastructure.
SE014	Scale AI	Scale AI Legal / Security Certifications	Scale holds SOC 2 Type II, ISO 27001, DoD IL4 Provisional Authorization, and FedRAMP High certifications.
SE015	Scale AI	Scale Evaluation — Model Developers	Scale Evaluation provides trusted benchmarking and safety evaluation for AI model developers and enterprises.
SE016	Scale AI	Scale AI RLHF	Scale RLHF delivers expert human feedback data for LLM training and alignment at enterprise scale.
SE017	Scale AI	Scale Data Engine	Scale Data Engine provides end-to-end data collection, annotation, curation, and quality assurance for AI model development.
SE018	Scale AI	Scale GenAI Platform — Enterprise	Scale GenAI Platform transforms enterprise data into customized GenAI applications with production deployment support.
SE019	Scale AI	Scale Public Sector Data Engine	Scale Public Sector Data Engine provides defense-specific data labeling and management for government agencies with appropriate security clearances.
SE020	Scale AI	Scale Donovan — Defense AI Platform	Donovan is Scale's specialized AI agents platform for defense and intelligence mission-critical workflows in cleared environments.
SE021	TechCrunch	Scale AI lays off 14% of staff, largely in data labeling business	Scale AI's layoffs targeted data-labeling, confirming the strategic shift away from commodity annotation.
SE022	TechCrunch	Scale AI is suing Mercor alleging customer poaching	Scale AI sued Mercor for customer poaching, signaling the competitive vulnerability of Scale's enterprise annotation customer relationships.
SE023	CNBC	Meta's $14 billion Scale AI deal	Meta's strategic investment in Scale AI reflects Scale's position as a leading AI data infrastructure provider for frontier AI development.
SE024	Scale AI	Scale AI White House Voluntary Commitments	Scale AI signed White House voluntary AI safety commitments covering RLHF safety practices, red-teaming, and responsible AI deployment.
SE025	Stanford HAI	Stanford HAI AI Index 2025	Stanford HAI AI Index 2025 documents accelerating AI investment and the growing importance of AI evaluation and safety benchmarking.
SE026	McKinsey	McKinsey State of AI 2025	Enterprise AI adoption is accelerating, with 62% of organizations experimenting with AI agents, validating Scale AI's enterprise GenAI Platform market opportunity.
SE027	Scale AI	Scale AI TIME Case Study	TIME deployed Scale AI's GenAI Platform in under 2 months with 7,000+ attack vectors tested, demonstrating the platform's deployment speed and red-teaming capability.
SE035	U.S. House of Representatives	House AI Hearing — Scale AI Congressional Testimony	Congressional AI hearings confirm government interest in AI data quality and safety, supporting Scale AI's government/defense revenue positioning and regulatory relationships.
SU001	Scale AI	Scale AI Customers Page
SU002	Scale AI	Scale AI Customer Case Study: TIME	TIME deployed Scale's GenAI Platform and tested more than 7,000 attack vectors in under 2 months.
SU003	Scale AI	Scale AI Public Sector and Government Platform
SU004	Scale AI	Scale AI DIU RCV Program Blog Post
SU005	Scale AI	Scale AI DoD Data Curation Joint Force Contract
SU006	Scale AI	Scale AI Donovan Platform
SU007	Snorkel AI	Snorkel AI Model Leaderboard
SU008	Snorkel AI	Snorkel AI Partners Page
SU009	Snorkel AI	Snorkel AI Press Releases
SU010	Snorkel AI	Snorkel AI Security Page
SU011	Mercor	Mercor Apex Agents Product Page
SU012	Mercor	Mercor Apex SWE Product Page
SU013	Mercor	Mercor Careers Page
SU014	Mercor	Mercor Security Page
SU015	TechCrunch	Scale AI Confirms Significant Investment from Meta, Says CEO Alexandr Wang Is Leaving
SU016	TechCrunch	Scale AI Lays Off 14% of Staff, Largely in Data Labeling Business
SU017	TechCrunch	Scale AI Is Suing a Former Employee and Rival Mercor Alleging They Tried to Steal Its Biggest Customers
SU018	CNBC	Zuckerberg Makes Meta's Biggest Bet on AI — $14 Billion Scale AI Deal
SU019	CNBC	Google, Scale AI's Largest Customer, Plans Split After Meta Deal, Sources Say	Google, which was Scale AI's largest customer, is planning to wind down or significantly reduce its relationship with the company following Meta's investment.
SU020	CNBC	OpenAI Is Winding Down Its Work With Scale AI, Founder Is Joining Meta	OpenAI is winding down its work with Scale AI.
SU021	Stanford HAI	2025 AI Index Report
SU022	McKinsey & Company	The State of AI — McKinsey Global Survey
SU023	Appen	Appen Case Studies
SU024	OpenAI	OpenAI Fine-Tuning API Improvements and Custom Models Program
SU025	SuperAnnotate	SuperAnnotate Enterprise Platform
SU026	Scale AI	Scale AI RLHF Platform
SU027	Scale AI	Scale AI Pricing
SR001	TechCrunch	Scale AI Confirms Significant Investment from Meta, Says CEO Alexandr Wang Is Leaving
SR002	TechCrunch	Scale AI Lays Off 14% of Staff, Largely in Data Labeling Business
SR003	CNBC	Zuckerberg Makes Meta's Biggest Bet on AI — $14 Billion Scale AI Deal
SR004	CNBC	Scale AI Promotes Strategy Chief Droege to CEO as Wang Heads for Meta
SR005	CNBC	Google, Scale AI's Largest Customer, Plans Split After Meta Deal, Sources Say	Google, which was Scale AI's largest customer, is planning to wind down or significantly reduce its relationship with the company following Meta's investment.
SR006	CNBC	OpenAI Is Winding Down Its Work With Scale AI, Founder Is Joining Meta
SR007	Scale AI	Scale AI Security and Compliance
SR008	Scale AI	Scale AI Global Public Sector
SR009	Scale AI	Scale AI White House AI Safety Voluntary Commitments
SR010	Scale AI	Scale AI Next Phase of Company Evolution Blog Post
SR011	TechCrunch	Scale AI Is Suing a Former Employee and Rival Mercor Alleging They Tried to Steal Its Biggest Customers
SR012	CourtListener / PACER	Scale AI, Inc. v. Mercor, Inc. — Federal Court Docket (N.D. Cal.)
SR013	U.S. Congress	Congressional AI Hearing — House Event 116184 (118th Congress)
SR014	Scale AI	Scale AI Generative AI Data Engine
SR015	McKinsey & Company	The State of AI — McKinsey Global Survey
SR016	Reuters	Google Was Scale AI's Largest Customer — Plans Split After Meta Deal
SR017	Reuters	Meta's $14.8 Billion Scale AI Deal — Latest Test of AI Partnerships
SR018	Scale AI	Scale AI Homepage
SR019	Scale AI	Scale AI Careers
SR020	Appen	Appen Frontier Model Alignment
SR021	SuperAnnotate	SuperAnnotate Case Studies
SR022	SuperAnnotate	SuperAnnotate Foundation Model Builder Solutions
SR023	Scale AI	Scale AI Evaluation for Public Sector
SR024	Scale AI	Scale AI DoD DIU RCV Program
SR025	Scale AI	Scale AI Test and Evaluation White Paper
SR026	Labelbox	Labelbox Platform — Product Overview
SR027	Stanford HAI	2025 AI Index Report
SR028	Appen	Appen Investor Relations
SR029	Scale AI	Scale AI DoD Data Curation Joint Force Contract
SR030	Scale AI	Scale AI Measuring and Mitigating WMDP Risk (AI Safety)
SR031	SuperAnnotate	SuperAnnotate Platform Documentation
SR032	Axios	Scale AI Wins Pentagon AI Contract — Scale AI's Defense Push	Axios reported Scale AI won a Pentagon AI contract, supporting its government revenue thesis and providing a partial offset to AI lab customer attrition.
SR033	Bloomberg	Scale AI Faces Questions After Meta Deal and Leadership Change	Bloomberg noted the Meta deal raised structural conflict-of-interest questions about Scale AI's ability to serve competing AI labs as a strategic data vendor.
SR034	National Institute of Standards and Technology (NIST)	AI Risk Management Framework (AI RMF 1.0)	NIST AI RMF 1.0 establishes the risk management framework that government AI vendors including Scale AI must align with for federal procurement.
SR035	Fortune	Scale AI's Wild Year: A $29 Billion Valuation, Meta's Money, and a Founder's Exit	Fortune profiled Scale AI's pivotal 2025 transition, highlighting the simultaneous risks of founder exit, customer attrition, and business model pivot.
SV001	Scale AI	Scale AI Sitemap
SV002	Labelbox	Labelbox Platform Overview
SV003	Palantir Technologies	Palantir Investor Relations — Financial Data and Annual Reports
SV004	TechCrunch	OpenAI Partners with Scale AI to Allow Companies to Fine-Tune Models (2023)
SV005	SuperAnnotate	SuperAnnotate Products Platform
SV006	SuperAnnotate	SuperAnnotate Enterprise Solutions
SV007	Australian Securities Exchange	Appen Limited (APX) ASX-Listed Company Data and Annual Reports
SV008	Menlo Ventures	2025 Enterprise AI Report
SV009	Appen	Appen Investor Relations (ASX-listed)
SV010	McKinsey & Company	The State of AI — McKinsey Global Survey
SV011	TechCrunch	Scale AI Confirms Significant Investment from Meta, Says CEO Alexandr Wang Is Leaving
SV012	TechCrunch	Scale AI Lays Off 14% of Staff, Largely in Data Labeling Business
SV013	CNBC	Zuckerberg Makes Meta's Biggest Bet on AI — $14 Billion Scale AI Deal
SV014	CNBC	Google, Scale AI's Largest Customer, Plans Split After Meta Deal, Sources Say
SV015	Stanford HAI	2025 AI Index Report
SV016	Scale AI	Scale AI Homepage
SV017	Scale AI	Scale AI Next Phase of Company Evolution
SV018	Scale AI	Scale AI Generative AI Data Engine
SV019	Scale AI	Scale AI Global Public Sector
SV020	Scale AI	Scale AI Security and Compliance
SV021	Scale AI	Scale AI White House AI Safety Voluntary Commitments
SV022	TechCrunch	Scale AI Is Suing Rival Mercor Alleging Customer Poaching
SV023	CNBC	OpenAI Is Winding Down Its Work With Scale AI
SV024	Scale AI	Scale AI TIME Media Customer Proof
SV025	U.S. Congress	Congressional AI Hearing — House Event 116184 (118th Congress)
SV026	Appen	Appen Frontier Model Alignment Services
SV027	Reuters	Google Was Scale AI's Largest Customer — Plans Split After Meta Deal
SV028	Scale AI	Scale AI DoD Data Curation Joint Force Contract
SV029	Scale AI	Scale AI Evaluation for Public Sector
SV030	Crunchbase	Scale AI — Funding History and Investors
SV031	Gartner	Gartner Market Guide for AI Data Annotation Tools	Gartner estimates the AI data annotation tools market will exceed $5 billion by 2027, with premium segments commanding higher multiples than legacy labeling providers.
SV032	CB Insights	Scale AI Company Profile and Valuation Analysis	CB Insights tracks Scale AI as a late-stage unicorn with $29B+ implied valuation from the 2025 Meta strategic investment.
SV033	Statista	AI Data Annotation Market Size and Forecast 2023-2030	Statista projects the global AI data annotation market to grow from approximately $1.3 billion in 2023 to over $7 billion by 2030, a CAGR of approximately 28%.
SV034	VentureBeat	Is Scale AI Worth $29 Billion? Analyzing the Meta Investment	VentureBeat analysts questioned whether Scale AI's $29B valuation is sustainable without revenue transparency, noting the customer attrition from Google and OpenAI creates execution risk for the growth thesis.