Startup Diligence
Diligence report | AI Data & Infrastructure | Late-stage private (Series F+) | 2026-05-09

Scale AI

From Data Labeling to Full-Stack AI Infrastructure: Scale AI at an Inflection Point

Scale AI holds a defensible position in AI infrastructure with strong government exposure, but faces customer concentration risk, a CEO transition, and a pivotal business model shift away from data-labeling.

Cover facts

Valuation 01
29B+ USD [CO007]
Total Raised 02
1.6B+ USD [CO006]
Series F 03
1B USD [CO005]
Employees 04
~1,000 [CO004]
Founded 05
2016 [CO001]

Company profile

Scale AI is a San Francisco-based AI infrastructure company founded in 2016 by Alexandr Wang. It provides data annotation, RLHF, model evaluation, and enterprise GenAI platform services to leading AI laboratories, Fortune 500 enterprises, and U.S. government agencies including the Department of Defense. Following a $1 billion Series F in 2024 and a landmark Meta strategic investment valuing the company at over $29 billion in 2025, Scale is pivoting from data-labeling to a broader enterprise and government AI platform play under interim CEO Jason Droege.

Website
scale.com
Founded
2016-01-01
Founders
Alexandr Wang
Founding location
San Francisco, CA
Headquarters
San Francisco, CA
Product
Scale Data Engine (data annotation and curation), Scale GenAI Platform (enterprise AI applications), Scale RLHF (LLM training data), Scale Evaluation (model safety and capability benchmarking), and Donovan (defense AI agents platform).
Customers
AI laboratories, large enterprises, and U.S. government/defense agencies.
Business model
Enterprise contracts for data annotation and AI platform services; government contracts for defense AI; self-serve pay-as-you-go for experimental users.
Stage
Late-stage private (Series F+, post-Meta strategic investment)
Funding status
$1B Series F (May 2024, $13.8B valuation) + Meta strategic investment (June 2025, $29B+ valuation)
[CO001, CO004, CO005, CO007]

Executive summary

Top strengths

  • Unique position in U.S. government/defense AI data and evaluation with FedRAMP High + DoD IL4 clearances
  • Strong brand and relationship network with leading AI labs despite recent customer attrition
  • Data flywheel and proprietary annotation methodology differentiate quality at scale
  • Donovan platform creates a new product moat in defense agentic AI

Top risks

  • Customer concentration: Google and OpenAI relationships deteriorating post-Meta deal
  • CEO transition risk: founder departure to Meta, untested interim CEO Droege
  • Business model pivot: moving away from core data-labeling while new enterprise/government revenue model unproven
  • Structural conflict of interest: Meta as strategic investor and competitor creates customer trust risk
  • Revenue undisclosed: valuation multiples unverifiable without financial transparency

Open gaps

  • Revenue and ARR: no public disclosure; valuation at $29B+ is unverifiable without revenue basis
  • Board composition post-Wang departure: full governance structure not publicly known
  • Meta commercial agreement scope and exclusivity terms not disclosed
  • True customer count and NRR post-2025 attrition events unknown
  • Mercor lawsuit outcome and IP exposure uncertain

Contents

Chapter 01

01 Company Overview

1.1 Company Identity and Overview

Scale AI, Inc. is headquartered in San Francisco, California, and was incorporated in 2016. The company's stated mission is to "develop reliable AI systems for the world's most important decisions." Scale operates as a late-stage private technology company following its $1 billion Series F round in May 2024 and the subsequent Meta strategic investment in June 2025, which placed the company's implied valuation at over $29 billion. Scale describes itself as providing the data and full-stack technologies needed to build, evaluate, and improve artificial intelligence systems across the complete AI development lifecycle.

Scale's core business spans five primary product lines: the Scale Data Engine (data collection, curation, and annotation for model training), the Scale GenAI Platform (enterprise-grade generative AI application deployment), Scale RLHF (reinforcement learning from human feedback data for large language model training), Scale Evaluation (model capability and safety benchmarking for model developers and the public sector), and Donovan (specialized AI agent workflows for defense and intelligence missions). The company also operates a self-serve tier with pay-as-you-go pricing, which gives research and experimental projects access to its annotation pipeline at a lower entry cost. According to its about page, Scale has processed more than 15 billion human-labeled decisions and paid contributors over $1 billion globally. [CO001, CO002, CO003, CO004, CO008, CO009]

1.2 Founders, Leadership, and Governance

Scale AI was founded in 2016 by Alexandr Wang, who left MIT at age 19 to build the company. Wang grew Scale from a small data-labeling startup into a company that secured contracts with leading AI laboratories, Fortune 500 enterprises, and the United States Department of Defense. He served as CEO from founding until June 2025, when he departed to join Meta's AI efforts following Meta's landmark strategic investment in Scale. Wang retained a seat on Scale AI's board of directors following his departure.

Jason Droege was appointed Interim CEO of Scale AI in June 2025 upon Wang's departure. Droege had joined Scale in September 2024 as Chief Strategy Officer. Before Scale, he founded Uber Eats and scaled it to a $19 billion GMV run rate, served as a Vice President at Uber, and was a partner at Benchmark, the prominent venture capital firm and Scale investor. His operational background in marketplace and platform businesses is central to Scale's stated pivot toward enterprise and government platform revenue.

Scale's governance structure is not publicly disclosed in detail. Beyond Wang's board seat, the company has not disclosed the full board composition or independent director roster following the Meta investment and leadership transition. This creates a material governance opacity risk for prospective investors, particularly given Wang's concurrent role at Meta, a strategic investor and potential competitor for AI talent and technology. [CO001, CO010, CO011, CO012, CO013, CO014]

Leadership and Founder Table
Name | Role | Background | Tenure at Scale | Key-Person Risk
Alexandr Wang | Founder, Board Director | MIT dropout; founded Scale at 19; built to $29B+ valuation; joined Meta AI Jun 2025 | 2016–Jun 2025 (CEO); board ongoing | High
Jason Droege | Interim CEO | Founded Uber Eats ($19B GMV); VP at Uber; Benchmark partner; joined Scale as CSO Sep 2024 | Sep 2024–present (CEO from Jun 2025) | Medium-High
Board / Governance | Not fully disclosed | Wang retains seat; other directors not publicly named post-Meta deal | Ongoing | High
Leadership team | CTO, CFO, other C-suite not publicly named | Positions not publicly profiled post-transition | Ongoing | Medium

Source: Scale AI official announcements, TechCrunch, CNBC, Benchmark partner page (broken as of access date).

[CO010, CO011, CO012, CO013, CO014, CO015]

1.3 Funding History and Capital Structure

Scale AI has raised approximately $1.6 billion in disclosed venture funding across multiple rounds, culminating in its Series F in May 2024, followed by a strategic investment from Meta in June 2025. The company raised approximately $600 million across its pre-Series F rounds, including a $325 million Series E in 2021 that valued Scale at approximately $7.3 billion. In 2023, Scale cut approximately 20% of its workforce, reflecting broader AI market pressures and a recalibration of data-labeling demand.

In May 2024, Scale closed a $1 billion Series F round led by Accel at a post-money valuation of $13.8 billion. The round combined primary capital with a secondary component that allowed existing shareholders to sell shares. New investors included Amazon, Cisco, Intel, AMD, ServiceNow, DFJ Growth, WCM Investment Management, Elad Gil, and Meta. Returning investors included Nvidia, Coatue, Y Combinator, Index Ventures, Founders Fund, Tiger Global, Thrive Capital, Spark Capital, Greenoaks, Wellington Management, and Nat Friedman.

In June 2025, Meta made what CNBC described as its largest-ever bet on AI, paying approximately $14.3 billion for a minority stake in Scale AI (reported as approximately 49% of outstanding equity on a fully diluted basis), implying a company valuation of over $29 billion. Scale stated it remains operationally independent from Meta. The proceeds were distributed to existing shareholders and holders of vested equity rather than going entirely to Scale's balance sheet as operating capital, which creates uncertainty about Scale's remaining cash position and runway. [CO005, CO006, CO007, CO017, CO018, CO019]
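The implied valuation quoted above follows from dividing the reported purchase price by the reported stake. The short sketch below reproduces that arithmetic. It treats the approximately 49% stake and the $14.3 billion price as exact, which the reporting does not guarantee, and the function name is illustrative only.

```python
# Figures as reported in the text; the ~49% stake is approximate.
META_PAYMENT_USD_B = 14.3      # reported purchase price, USD billions
META_STAKE = 0.49              # reported approximate ownership fraction
SERIES_F_POST_MONEY_B = 13.8   # Series F post-money valuation, USD billions


def implied_valuation(payment_b: float, stake: float) -> float:
    """Equity value implied by paying `payment_b` for a `stake` fraction."""
    if not 0 < stake <= 1:
        raise ValueError("stake must be a fraction in (0, 1]")
    return payment_b / stake


valuation_b = implied_valuation(META_PAYMENT_USD_B, META_STAKE)
step_up = valuation_b / SERIES_F_POST_MONEY_B

print(f"Implied valuation: ${valuation_b:.1f}B")
print(f"Step-up over Series F post-money: {step_up:.2f}x")
```

Because the stake is approximate, a one-point change in the ownership figure moves the implied valuation by roughly $0.6 billion, so the result is best read as "over $29 billion" rather than as a precise mark.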

Scale AI Company Snapshot Metrics
Metric | Value / Status | Date | Confidence | Notes / Gaps
Founded | 2016, San Francisco CA | 2016 | high | Confirmed by multiple sources
Current CEO | Jason Droege (Interim) | Jun 2025 | high | Wang departed Jun 2025
Founder | Alexandr Wang (Board) | Jun 2025 | high | Retains board seat after departure
Headcount | ~1,000 employees (post-Jul 2025 layoffs) | Jul 2025 | medium | Approx; 200 let go in Jul 2025 + 500 contractors
Valuation | $29B+ implied (Meta deal) | Jun 2025 | medium | Implied from $14.3B for ~49% stake; not formally disclosed
Total Raised | ~$1.6B+ disclosed | May 2024 | medium | Excludes Meta deal (distributed to shareholders)
Series F | $1B at $13.8B post-money | May 2024 | high | TechCrunch + company confirmation
Revenue / ARR | Not publicly disclosed | 2026 | low | Private company; estimated hundreds of millions ARR by market proxy
Gross Margin | Not disclosed | 2026 | low | Private; no public disclosure
Key Certifications | SOC 2 Type II, ISO 27001, DoD IL4, FedRAMP High | 2024-2025 | high | Per scale.com/legal/security
Primary Verticals | AI labs, Enterprise, Government/Defense | 2025 | high | Per product and customer pages
HQ | San Francisco, CA | 2025 | high | Per about page

Confidence ratings reflect public disclosure quality; low-confidence items require direct disclosure in due diligence.

[CO001, CO004, CO005, CO006, CO007, CO010]
Stakeholder or Investor Map
Round | Date | Amount | Valuation | Lead / Key Investors | Notes
Seed / Angel | 2016 | ~$3M est. | ~$15M est. | Y Combinator, angels | Approximate; YC S2016
Series A | 2017 | ~$18M est. | ~$100M est. | Accel, Index Ventures, Founders Fund | Approximate from public records
Series B | 2018 | ~$30M est. | ~$300M est. | Accel, Index, Founders Fund, Tiger Global | Approximate
Series C | 2019 | ~$100M est. | ~$1B est. | Greenoaks, existing investors | Approximate
Series D | 2020 | ~$155M | ~$3.5B est. | Tiger Global, Index, Accel, Spark, Thrive | Approximate; exact amounts vary by source
Series E | 2021-08 | $325M | ~$7.3B | Coatue, Y Combinator, Founders Fund, Tiger, existing | Confirmed via TechCrunch reporting
Bridge / Secondary | 2022-2023 | Various | ~$7B range | Existing investors; secondary sales | 2023 layoffs 20%
Series F | 2024-05 | $1B | $13.8B post-money | Accel (lead), Amazon, Meta, Cisco, Intel, AMD, ServiceNow, Nvidia, YC, Index, Founders Fund, Tiger, Thrive, Spark, Greenoaks, Wellington, Nat Friedman, Elad Gil, DFJ Growth, WCM | Confirmed; primary + secondary mix
Meta Strategic Investment | 2025-06 | ~$14.3B (to shareholders) | $29B+ implied | Meta (minority ~49%) | Proceeds distributed to existing shareholders; Scale remains independent

Early round amounts and valuations are estimates from public records and secondary sources; only Series E, Series F, and Meta deal amounts are confirmed via tier-one reporting.

[CO005, CO006, CO007, CO017, CO018, CO019]
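As a cross-check on the "approximately $1.6 billion raised" and "approximately $600 million pre-Series F" figures, the round amounts in the investor map can simply be summed. The sketch below does that. It takes the estimated early-round amounts at face value and leaves out the Bridge / Secondary row (listed as "Various") and the Meta transaction (distributed to shareholders), so it is a consistency check rather than a verified total.

```python
# Round sizes in USD millions, copied from the investor map.
# Seed through Series D are flagged as estimates in the source.
ROUNDS_USD_M = {
    "Seed / Angel": 3,
    "Series A": 18,
    "Series B": 30,
    "Series C": 100,
    "Series D": 155,
    "Series E": 325,
    "Series F": 1000,
}

# Everything raised before the Series F.
pre_series_f_m = sum(
    amount for name, amount in ROUNDS_USD_M.items() if name != "Series F"
)
# All priced rounds, including the Series F.
total_raised_m = sum(ROUNDS_USD_M.values())

print(f"Pre-Series F: ${pre_series_f_m}M")
print(f"Total priced rounds: ${total_raised_m / 1000:.2f}B")
```

Both sums land close to the rounded figures quoted in the text, which suggests the headline totals are built from these same round estimates.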
FO003: Scale AI Key Performance Indicators (as of May 2026)

High-level KPI snapshot reflecting publicly disclosed or strongly indicated metrics for Scale AI as of the report date.

Valuation implied from Meta transaction; headcount is approximate post-layoff estimate; revenue not publicly disclosed.

[CO004, CO005, CO006, CO007, CO029, CO030]

1.4 Products and Business Model

Scale AI's business model centers on selling high-quality data and AI infrastructure services to organizations building and deploying machine learning systems. Revenue streams include enterprise data annotation contracts (volume-based and project-based), the Scale GenAI Platform (managed SaaS and professional services for enterprise AI applications), government and defense contracts (including Department of Defense data curation and the Donovan platform for intelligence and military operations), and a self-serve tier for smaller or experimental use cases. Scale does not publicly disclose revenue, gross margins, or ARR. The Scale Data Engine collects, curates, annotates, and validates data across text, images, video, audio, and documents. The GenAI Platform transforms enterprise data into domain-specific AI applications using a proprietary pipeline. Scale RLHF provides curated preference data for reinforcement learning from human feedback, which is central to training large language models for instruction-following and safety. Scale Evaluation offers trusted benchmarking for model capability (including the Scale Leaderboard) and safety (including the WMDP harmful-knowledge benchmark), serving both commercial model developers and U.S. government agencies. The Donovan platform targets defense and intelligence missions with specialized AI agent workflows that can operate in classified environments given Scale's DoD IL4 and FedRAMP High security certifications. Pricing is bifurcated: enterprise customers receive custom pricing with dedicated operations teams and SLA commitments, while self-serve customers access the platform on a pay-as-you-go basis with the first 1,000 labeling units free. Scale publicly discloses no revenue figures; its estimated hundreds of millions in ARR is derived from investor commentary and proxy comparisons, not confirmed disclosures. [CO002, CO003, CO008, CO024, CO025, CO026]

FO002: Scale AI Business Architecture — Value Flow

How Scale AI converts data, compute, and human expertise into AI-ready outputs for model developers, enterprises, and government customers.

[CO002, CO003, CO008, CO024, CO025, CO026]

1.5 Key Milestones and Adverse Events

Scale AI's trajectory from founding through 2026 spans a decade of growth, strategic pivots, and significant adverse events that define its current investment profile. Founded in 2016, Scale initially focused on programmatic data labeling for self-driving vehicles (Waymo was an early customer) before expanding into broader enterprise AI training data. By 2021, the company had achieved a $7.3 billion valuation with its Series E, and by 2022 it was publicly describing itself as a $7 billion company with more than 700 employees providing the DoD's autonomy data layer. In 2023, Scale executed a painful 20% workforce reduction amid a slowdown in AI foundation model training data spend.

Scale rebounded with its $1 billion Series F in May 2024, attracting a broad consortium of strategic and financial investors at a $13.8 billion valuation. In 2024, Scale also joined the White House voluntary AI safety commitments and secured a DoD data curation contract for joint force operations.

June 2025 marked the most consequential inflection: Meta's $14.3 billion strategic investment and the simultaneous departure of founder Alexandr Wang to join Meta's AI work. That same month, CNBC reported that Google, Scale's then-largest customer, planned to wind down or significantly reduce its Scale relationship due to concerns about competitive conflict with Meta, and OpenAI also wound down its Scale work. The following month (July 2025), interim CEO Droege announced layoffs of approximately 200 employees (14% of staff) and 500 contractors, citing overinvestment in data-labeling headcount relative to the company's strategic direction toward enterprise and government platform revenue. In September 2025, Scale filed a lawsuit against Mercor, a competitor, and a former Scale employee, alleging customer poaching. These adverse events represent material customer concentration and governance risks for investors evaluating Scale. [CO001, CO031, CO032, CO033, CO034, CO035]
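The July 2025 layoff figures imply a pre-layoff headcount that can be backed out directly. The sketch below is a minimal calculation, assuming the 14% share refers to full-time employees only and excludes the 500 contractors; the source does not state this explicitly.

```python
LAID_OFF_EMPLOYEES = 200   # reported full-time layoffs, Jul 2025
LAID_OFF_SHARE = 0.14      # reported share of staff (assumed: employees only)

# If 200 people are 14% of staff, total staff before the cut is 200 / 0.14.
pre_layoff_headcount = LAID_OFF_EMPLOYEES / LAID_OFF_SHARE
post_layoff_headcount = pre_layoff_headcount - LAID_OFF_EMPLOYEES

print(f"Implied pre-layoff employees:  {pre_layoff_headcount:,.0f}")
print(f"Implied post-layoff employees: {post_layoff_headcount:,.0f}")
```

The implied post-layoff figure comes out somewhat above the approximate 1,000-employee headcount used elsewhere in this report. The source does not reconcile the two, so current headcount is worth confirming directly in diligence.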

Milestone Table
Date | Event | Type | Amount / Valuation / Status | Participants | Implication
2016 | Scale AI founded by Alexandr Wang | founding | ~$3M est. seed at ~$15M est. valuation | Wang, YC, angels | Established data-labeling for autonomous vehicles
2016 | Accepted to Y Combinator S2016 | partnership | - | YC, Scale | Early validation; YC network access
2017 | Series A closed | financing | ~$18M | Accel, Index, Founders Fund | Enabled product expansion
2019 | Series C; crossed $1B valuation | financing | ~$100M at ~$1B | Greenoaks, others | Unicorn milestone
2021-08 | Series E at $7.3B valuation | financing | $325M | Coatue, YC, Founders Fund, Tiger | Rapid scale; peak of AI training data boom
2022 | DoD autonomy data layer blog; 700+ employees | scale | $7B est. valuation | DoD / Scale | Defense AI footprint established
2023 | 20% workforce reduction | adverse | - | Scale, employees | Market correction; AI training demand slowdown
2024-05 | Series F $1B at $13.8B | financing | $1B / $13.8B | Accel lead, Amazon, Meta, Intel, AMD, Cisco, others | Valuation reset upward; strategic investor base
2024 | White House AI voluntary safety commitments | regulatory | - | White House, Scale | Regulatory positioning; brand legitimacy
2024 | DoD data curation contract (Joint Force) | regulatory | Undisclosed | DoD, Scale | Defense revenue anchor
2024-11 | Defense Llama announced (national security LLM) | product | - | Scale, DoD ecosystem | First productized defense LLM offering
2025-06 | Meta $14.3B strategic investment; Wang departs | financing | $14.3B / $29B+ implied | Meta, Scale, Wang → Meta AI | Transformative capital event; founder exit
2025-06 | Google (largest customer) plans to exit | adverse | - | Google, Scale | Customer concentration risk crystallized
2025-06 | OpenAI winds down Scale relationship | adverse | - | OpenAI, Scale | Second major lab customer loss
2025-07 | Layoff of 200 employees (14%) + 500 contractors | adverse | - | Droege, Scale | Data-labeling pivot confirmed operationally
2025-09 | Lawsuit filed vs Mercor (customer poaching) | adverse | - | Scale vs Mercor | IP and competitive conflict risk

Sources: Scale AI official pages, TechCrunch reporting, CNBC reporting, Stanford HAI AI Index 2025. Early-round financing amounts estimated; later events confirmed by tier-one media.

[CO001, CO005, CO006, CO007, CO017, CO019]
FO001: Scale AI Corporate Timeline

Chronological milestones from founding (2016) through the 2025 Meta investment and organizational pivot, highlighting financing events, product launches, and adverse events.

Early round dates (2016-2019) are approximate; exact Series A-C timing not publicly confirmed.

[CO001, CO005, CO007, CO017, CO031, CO033]

1.6 Exhibits

Chapter 02

02 Market Analysis

2.1 Market Boundary and Definition

Scale AI operates at the intersection of several distinct but related markets, which requires explicit boundary-setting before sizing can be meaningfully attempted. The core market is AI data services: the collection, annotation, curation, and validation of training and evaluation data for machine learning models. This market includes human-labeled data for computer vision, natural language processing, speech recognition, and multimodal AI. Adjacent to this is the AI model evaluation market — trusted benchmarking and safety testing for large language models — where Scale Evaluation and the Scale Leaderboard compete. Expanding outward, Scale's GenAI Platform competes in the enterprise AI deployment market: tools and services that help large organizations build, customize, and operationalize generative AI applications on top of foundation models. This market overlaps with hyperscaler AI services (Azure AI, AWS Bedrock, Google Vertex AI), boutique AI consulting, and AI model API vendors. The Donovan platform carves out a specialized government and defense AI vertical, which operates under distinct procurement rules and budget mechanisms separate from commercial AI spending. Status-quo substitutes for Scale's core data annotation services include internal human review teams (many large AI labs and enterprises originally built their own annotation pipelines), lower-cost offshore service providers, and automated synthetic data generation. The key question for market sizing is whether to define the served market narrowly (AI data annotation services only) or broadly (all AI infrastructure and tooling spend). This chapter defines the primary market as AI data services and evaluation infrastructure (annotation, RLHF, benchmarking), with enterprise GenAI platform and government AI as adjacent submarkets. [CM001, CM002, CM003, CM004, CM005]

Market Definition Table
Submarket | Included Spend | Excluded Spend | Status-Quo Substitute | Scale AI Presence
AI Data Annotation | Human labeling of text, image, video, audio, document data for ML training | Generic crowdsource platforms (Mechanical Turk) | Internal annotation teams; offshore QA providers | Core product (Scale Data Engine)
RLHF / Preference Data | Expert human feedback for LLM instruction-following and safety alignment | Raw survey data; synthetic preference generation | Internal RLHF teams at large labs | Scale RLHF product
Model Evaluation & Benchmarking | LLM capability + safety testing; red-teaming; leaderboards | Internal A/B testing; academic benchmarks | Self-evaluation; open benchmarks (MMLU, HELM) | Scale Evaluation + Leaderboard
Enterprise GenAI Platform | Domain-specific GenAI app development on top of foundation models | Hyperscaler AI services (Azure, AWS, GCP) | Custom in-house GenAI builds; boutique AI consulting | Scale GenAI Platform
Government / Defense AI | DoD-cleared AI data services; mission-critical AI agents | General government IT outsourcing | Defense-clearance competitors; internal DoD teams | Donovan + Public Sector Data Engine
AI Data Marketplace (self-serve) | Pay-as-you-go annotation for startups and researchers | Unstructured crowdsource platforms | Mechanical Turk; intern teams; student labelers | Scale self-serve tier

Market boundaries defined for this analysis; actual market size depends on which segments are included. Not an exhaustive market map — adjacencies like AI infrastructure (compute, MLOps) and AI consulting are excluded.

2.2 TAM, SAM, and SOM Sizing Across Market Lenses

Precise TAM/SAM/SOM figures for AI data services are not available from public sources with the specificity needed for high-confidence valuation analysis. The figures presented here are evidence-constrained estimates synthesized from publicly available market proxies, investor reporting, and competitor disclosure patterns. The Total Addressable Market (TAM) for global AI data services and infrastructure — including annotation, RLHF, evaluation, and GenAI platform tooling — is estimated at $5 to $20 billion annually as of 2025, with wide uncertainty reflecting the market's rapid evolution and inconsistent boundary definitions across analyst estimates. This estimate is anchored by McKinsey's finding that 88% of surveyed organizations now use AI in at least one function (up from 78% in 2024), and the Stanford HAI AI Index 2025 finding that AI investment has accelerated sharply. If each AI-adopting Fortune 1000 enterprise spent $1M–$10M annually on AI data and evaluation infrastructure, aggregate spend would be $1B–$10B from enterprise alone, with AI labs adding substantial incremental demand. The Serviceable Addressable Market (SAM) for Scale AI — large enterprises, leading AI labs, and U.S. government agencies with complex AI data needs and budget for premium quality — is estimated at $2 to $8 billion, reflecting Scale's positioning at the quality end of the market rather than the low-cost commodity annotation segment. The Serviceable Obtainable Market (SOM) — Scale's realistic 3-5 year capture — is estimated at $500M to $2B, which is consistent with Scale's implied enterprise and government revenue trajectory based on its late-stage private valuation. Revenue is not disclosed; these estimates carry high uncertainty and require verification against actual financials. [CM006, CM007, CM008, CM009, CM010, CM011]
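The bottom-up enterprise estimate in this section is a product of three inputs: number of firms, adoption rate, and annual spend per adopter. The sketch below makes that explicit. The first case reproduces the $1B–$10B range by treating every Fortune 1000 firm as an adopter. The second case applies the 88% survey adoption rate to the same population, which is an assumption added here for illustration, since the survey covers organizations generally rather than the Fortune 1000 specifically.

```python
def enterprise_spend_range_b(n_firms, adoption_rate, spend_low_m, spend_high_m):
    """Return (low, high) aggregate annual spend in USD billions.

    Per-firm spend is given in USD millions.
    """
    adopters = n_firms * adoption_rate
    return (adopters * spend_low_m / 1000, adopters * spend_high_m / 1000)


# Case 1: every Fortune 1000 firm adopts (matches the $1B-$10B range in the text).
all_adopt = enterprise_spend_range_b(1000, 1.00, 1, 10)

# Case 2 (assumption): the 88% survey adoption rate applies to the Fortune 1000.
survey_adopt = enterprise_spend_range_b(1000, 0.88, 1, 10)

for label, (low, high) in [("100% adoption", all_adopt), ("88% adoption", survey_adopt)]:
    print(f"{label}: ${low:.2f}B to ${high:.2f}B per year")
```

The estimate is far more sensitive to the per-firm spend assumption, which spans a factor of ten, than to the adoption rate, so that input deserves the most scrutiny.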

TAM / SAM / SOM or Sizing Lens Table
Sizing Lens | Market Scope | Estimate | Confidence | Basis / Key Assumptions
TAM: Broad AI Data & Infrastructure | All AI data services + evaluation + enterprise GenAI tooling globally | $10B–$30B | low | McKinsey: 88% of orgs use AI; if large orgs spend $1M–$10M on data/tooling, aggregates to $10B+; high uncertainty
TAM: AI Data Annotation Only | Human-labeled data services for ML globally | $2B–$8B | low | Derived from public-company analogs (Appen revenue proxy) + scale-factor for global market; highly uncertain
SAM: Premium AI Data (Scale's position) | Large enterprises, AI labs, government; premium annotation + evaluation + platform | $1.5B–$6B | low | Scale targets quality end; SAM ~30–50% of annotation TAM; government adds distinct SAM segment
SAM: Government / Defense Submarket | U.S. government AI data and evaluation budget addressable by Scale | $300M–$1B | low | Estimated from DoD AI investment growth and cleared-vendor supply constraints; no public budget breakout
SOM: Scale's 3-5yr Obtainable | Scale AI's realistic revenue capture in 3-5 years given competitive position | $500M–$2B | low | Consistent with $29B valuation at 15-60x ARR if ARR is in $500M-$2B range; no revenue confirmation

All estimates are evidence-constrained approximations derived from market proxy data, investor reporting, and public traction metrics. Revenue is not publicly disclosed. These figures are directional only and require verification against actual financials.
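Two of the "Basis / Key Assumptions" entries in the sizing table are arithmetic claims that can be checked directly: the 15-60x ARR multiple behind the SOM row, and the "30–50% of annotation TAM" share behind the premium SAM row. The sketch below recomputes both from the table's own figures. Treating the SOM range as a hypothetical ARR range follows the table's framing and is not a disclosed revenue figure.

```python
VALUATION_B = 29.0           # implied by the Meta transaction, USD billions
ARR_RANGE_B = (0.5, 2.0)     # SOM range, used as a hypothetical ARR range

# A lower ARR implies a higher valuation multiple, and vice versa.
high_multiple = VALUATION_B / ARR_RANGE_B[0]
low_multiple = VALUATION_B / ARR_RANGE_B[1]

ANNOTATION_TAM_B = (2.0, 8.0)   # annotation-only TAM, USD billions
SAM_SHARE = (0.30, 0.50)        # stated SAM share of annotation TAM

# Pair the low share with the low TAM and the high share with the high TAM.
sam_low_b = ANNOTATION_TAM_B[0] * SAM_SHARE[0]
sam_high_b = ANNOTATION_TAM_B[1] * SAM_SHARE[1]

print(f"Implied ARR multiple: {low_multiple:.1f}x to {high_multiple:.1f}x")
print(f"Share-based SAM: ${sam_low_b:.1f}B to ${sam_high_b:.1f}B")
```

The multiples come out at about 14.5x to 58x, in line with the 15-60x shorthand. The share-based SAM comes out below the $1.5B–$6B shown in the table. That row also covers evaluation and platform spend beyond annotation, which may account for the difference.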

FM001: AI Data Services Market Sizing — Multiple Lenses

Evidence-constrained estimates for Scale AI's addressable market across TAM, SAM, and SOM, illustrating the range of plausible market sizes depending on scope definition.

All values are mid-point estimates in billions USD; uncertainty ranges are wide (2x–4x). No public analyst report provides specific AI data annotation market sizing with segment granularity. Derived from McKinsey adoption data, competitor proxy revenues, and valuation multiples.

[CM006, CM007, CM008, CM009, CM010]
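The figure note describes its values as mid-point estimates with 2x–4x uncertainty ranges. The sketch below shows how such midpoints and high/low ratios could be derived from the ranges in the sizing table. It uses a simple arithmetic midpoint, which is an assumption; the source does not say how its midpoints were computed.

```python
# (low, high) ranges in USD billions, copied from the sizing table.
RANGES_B = {
    "TAM, broad AI data and infrastructure": (10.0, 30.0),
    "TAM, annotation only": (2.0, 8.0),
    "SAM, premium AI data": (1.5, 6.0),
    "SAM, government / defense": (0.3, 1.0),
    "SOM, 3-5 year obtainable": (0.5, 2.0),
}


def summarize(ranges):
    """Arithmetic midpoint and high/low ratio for each sizing lens."""
    return {
        name: {"midpoint": (low + high) / 2, "spread": high / low}
        for name, (low, high) in ranges.items()
    }


summary = summarize(RANGES_B)
for name, stats in summary.items():
    print(f"{name}: midpoint ${stats['midpoint']:.2f}B, high/low {stats['spread']:.1f}x")
```

Every lens has a high/low ratio between 3x and 4x, consistent with the stated 2x–4x uncertainty band.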
FM002: Market Estimate Range — AI Data Services TAM (2025)

Uncertainty ranges for AI data services market size by scope definition, reflecting analyst boundary disagreement and rapid market evolution.

Ranges derived from McKinsey AI adoption data, Appen public revenue as proxy for annotation-only TAM, and Scale AI's implied valuation/revenue relationship. High uncertainty; all estimates require verification against actual financial disclosure.

[CM007, CM008, CM009, CM010, CM011]

2.3 Buyer and User Segmentation with Adoption Paths

Scale AI's addressable market consists of four primary buyer segments with materially different budget authorities, procurement processes, and switching costs. Understanding these segments is essential to evaluating Scale's revenue durability and growth ceiling.

The first segment, AI Research Laboratories and Foundation Model Developers, represents Scale's original customer base. Companies like OpenAI, Meta, Google DeepMind, Anthropic, Cohere, and Adept require massive volumes of high-quality labeled data for pretraining and RLHF. Budget authority resides in research and engineering functions. Procurement is typically direct negotiation with multi-year contracts. Switching costs are moderate: labs can build internal annotation pipelines or switch providers, as demonstrated by OpenAI's and Google's 2025 departures. This segment carries high revenue concentration risk.

The second segment, Large Enterprise AI Adopters, includes Fortune 500 companies using AI for internal automation, customer products, and competitive differentiation. Companies like Cisco, Etsy, Instacart, and Pinterest represent this category, using Scale's GenAI Platform for domain-specific AI application development. Budget authority often sits in CTO/CDO organizations with multi-year platform commitments. Switching costs are higher than for annotation services due to platform integration depth.

The third segment, U.S. Government and Defense, is the most distinctive and defensible for Scale. Budget authority lies with DoD program offices, DHS, NSA/IC, and civilian agencies. Procurement follows federal acquisition regulations with multi-year contract vehicles. Switching costs are very high due to clearance requirements (DoD IL4, FedRAMP High), institutional knowledge, and the competitive moat from the Donovan platform. This segment provides stable, long-cycle revenue that is insulated from commercial customer concentration risk.

The fourth segment, AI Startups and Research Organizations, accesses Scale through its self-serve tier. Budget authority is dispersed; spend per customer is lower, but volume can be significant. Switching costs are low (pay-as-you-go). [CM012, CM013, CM014, CM015, CM016, CM017]

Segment / Buyer Map
Segment | Buyer Profile | Budget Owner | Procurement Path | Switching Cost | Scale AI Fit
AI Labs (Foundation Model) | OpenAI, Meta, Google DeepMind, Anthropic, Cohere | Head of Research / VP Engineering | Direct negotiation; multi-year contracts | Medium | High historically; currently at risk from Meta conflict
Large Enterprise (F500) | Cisco, Etsy, Instacart, Pinterest, TIME | CTO / CDO / VP AI | RFP / direct sales; 12-24 month cycles | High | Growing; GenAI Platform primary vehicle
U.S. Government / DoD | DoD, IC agencies, civilian federal agencies | Program Office / CISO / CTOs | Federal acquisition; IDIQ/task orders; cleared vehicles | Very High | Donovan + clearances = unique positioning
AI Startups / Researchers | Series A-C AI companies; university labs; small enterprises | Founders / ML leads | Self-serve; credit card; minimal procurement friction | Low | Self-serve tier; lower ARPU; volume model
International Government | Allied nations' defense / intelligence agencies | Procurement offices; foreign ministry | International government contracts; complex clearance | Very High | Global Public Sector division; early-stage
Enterprise Media / Content | Publishers, media companies deploying GenAI for content | VP Digital / CTO | Direct sales; platform subscription | Medium | TIME case study as reference; growing vertical

Buyer profiles based on publicly disclosed Scale customer references and product page descriptions. Revenue contribution by segment not publicly disclosed.

FM003: Buyer Segment Map — Budget Authority and Scale AI Fit

How Scale AI's products connect to each buyer segment through distinct value propositions, from AI labs through enterprise and government customers.

[CM012, CM013, CM014, CM015, CM016, CM017]

2.4 Growth Drivers and Adoption Constraints

The most powerful growth driver for Scale AI's addressable market is the accelerating enterprise adoption of AI. McKinsey's 2025 State of AI survey documented a jump from 78% to 88% of organizations using AI in at least one business function year-over-year, with 62% actively experimenting with AI agents. This macro expansion creates structural demand for data annotation, model evaluation, and enterprise AI deployment tooling across all four buyer segments.

A second major driver is the proliferation of large language model providers (OpenAI, Meta, Anthropic, Google, Mistral, Cohere, and others), each requiring continual RLHF cycles, safety benchmarking, and capability evaluation at scale. As the number of foundation model developers grows, so does the demand for expert-quality training data and impartial evaluation infrastructure. Scale's Evaluation product and Scale Leaderboard are positioned to benefit from this proliferation.

U.S. government AI investment represents a particularly robust growth driver. Scale's clearances (DoD IL4, FedRAMP High), Donovan platform, and active defense contracts position it to capture a growing share of the federal AI budget. Defense AI spending is expected to grow materially as DoD integrates AI into surveillance, logistics, cybersecurity, and autonomous systems, all of which require trusted data infrastructure.

Adoption constraints are also significant. First, the largest AI labs are diversifying away from Scale following Meta's strategic investment, creating potential top-line pressure. Second, synthetic data generation technologies (used by some AI labs internally) could reduce demand for human-labeled data over time for some applications. Third, intense competition from lower-cost providers (Appen, SuperAnnotate, offshore teams) creates margin pressure in the commodity annotation segment. Fourth, enterprise AI programs remain heavily in pilot/POC stage (McKinsey notes that most organizations are still testing rather than scaling AI), meaning enterprise platform revenue growth may lag market adoption. Fifth, government procurement cycles are long and budget-dependent, creating lumpiness in contract revenue. [CM019, CM020, CM021, CM022, CM023, CM024]

Growth Drivers and Constraints Table
Factor | Type | Direction | Magnitude | Time Horizon | Evidence
Enterprise AI adoption expansion | Driver | | High | 2025-2028 | McKinsey: 88% of orgs use AI (up from 78%); 62% experimenting with AI agents
Foundation model proliferation (LLM providers) | Driver | | High | 2024-2027 | OpenAI, Meta, Anthropic, Mistral, Cohere all require RLHF + evaluation data
U.S. government AI investment growth | Driver | | High | 2025-2030 | DoD AI strategy; Scale clearances; Donovan platform; active DoD contracts
AI safety regulatory pressure (EU AI Act, US EO) | Driver | | Medium | 2025-2027 | Regulatory mandates increase demand for AI evaluation and audit services
Synthetic data substitution for human annotation | Constraint | | Medium | 2026-2029 | AI labs increasingly use synthetic + model-generated data for some training tasks
Customer concentration risk post-Meta deal | Constraint | | High | 2025-2026 | Google and OpenAI (major customers) departing; size of the revenue to be replaced is unknown
Low-cost competition (Appen, offshore vendors) | Constraint | | Medium | Ongoing | Commodity annotation margin pressure; Scale must maintain quality premium
Enterprise AI still in pilot/POC stage | Constraint | | Medium | 2025-2026 | McKinsey: most orgs still testing AI, not scaling; enterprise platform revenue lags
Long government procurement cycles | Constraint | | Low | Ongoing | Federal acquisition timelines create revenue lumpiness despite strong positioning

Direction and magnitude are analyst assessments based on public evidence cited. No quantified revenue impact estimates are available from public sources.

FM004: Enterprise AI Adoption Funnel — Scale AI's Customer Journey

Adoption stages from initial AI awareness to deep Scale integration, with estimated population at each stage to illustrate market conversion dynamics.

Percentages are illustrative estimates of the Fortune 1000 segment based on McKinsey AI adoption data (88% using AI, ~1/3 scaling). Pipeline and customer percentages are estimated from Scale's disclosed customer references; actual numbers are not disclosed.
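
The two quantified funnel stages can be turned into rough firm counts. The sketch below is illustrative only: it applies the 88% and ~1/3 figures to a population of 1,000 firms and assumes the ~1/3 "scaling" share is measured against all firms rather than against AI users. The function and variable names are hypothetical, and the later funnel stages (pipeline, customers) are omitted because no figures are disclosed.

```python
# Illustrative funnel arithmetic. Only the 88% and ~1/3 inputs come from the report.
SEGMENT_SIZE = 1000      # Fortune 1000 firms
SHARE_USING_AI = 0.88    # McKinsey: share of organizations using AI
SHARE_SCALING = 1 / 3    # "~1/3 scaling"; ASSUMED to be a share of all firms


def funnel_counts(segment_size, share_using, share_scaling):
    """Return approximate firm counts for the two stages the report quantifies."""
    using = round(segment_size * share_using)
    scaling = round(segment_size * share_scaling)
    return {
        "using_ai": using,
        "scaling_ai": scaling,
        "using_but_not_scaling": using - scaling,
    }


counts = funnel_counts(SEGMENT_SIZE, SHARE_USING_AI, SHARE_SCALING)
for stage, firms in counts.items():
    print(f"{stage}: ~{firms} firms")
```

Under these assumptions roughly 880 firms use AI and about 333 are scaling it, leaving around 547 in the pilot/POC stage. If the ~1/3 share were instead applied to AI users only, the scaling count would be lower.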

[CM019, CM020, CM021, CM022, CM023]

2.5 Sizing Gaps, Contradictory Estimates, and Diligence Asks

Market sizing for AI data services is subject to significant structural uncertainty. No public analyst report provides a consistent, granular breakdown of the AI data annotation addressable market. Estimates range widely depending on whether the boundary includes only human-labeled data services or extends to AI model evaluation, enterprise AI tooling, and government AI platform spending. The wide range ($5B–$20B TAM in this chapter) reflects genuine analyst disagreement and market boundary ambiguity, not measurement error.

The synthetic data market complicates sizing. Some projections assume synthetic data will largely replace human annotation for many tasks within 3-5 years, dramatically shrinking the TAM for companies like Scale. Others argue that human evaluation, RLHF, and adversarial red-teaming will always require human judgment and will scale with model proliferation. This is a thesis-level uncertainty that prospective investors must resolve through primary research with AI labs and Scale customers.

Scale AI's disclosed traction metrics (15B decisions labeled, $1B paid to contributors) provide volume signals but not revenue insights. Without revenue disclosure, it is impossible to verify which segment dominates Scale's revenue mix or whether the enterprise platform and government segments can replace declining AI lab revenue. The departure of Google and OpenAI as major customers, assuming they were among the top 10 contributors to Scale's revenue, creates a revenue gap of unknown size that the enterprise and government pivots must fill. Quantifying the revenue impact of these departures is the single most important unresolved market sizing question for Scale's investment case. [CM027, CM028, CM029, CM030, CM031]

2.6 Exhibits

Chapter 03

03 Competitors

3.1 Competitive Landscape Overview

Scale AI competes across multiple overlapping segments of the AI data and infrastructure market, each with distinct competitor dynamics. In the core AI data annotation segment, Scale faces competition from Appen (ASX-listed, the only publicly traded direct comparable), SuperAnnotate, Labelbox, and Surge AI. For RLHF and LLM training data specifically, Surge AI and Mercor are the most focused direct competitors. For enterprise model evaluation and benchmarking, Labelbox's evaluation suite and Snorkel AI's programmatic labeling are adjacent threats. For enterprise GenAI platform deployment, Scale faces competition from hyperscalers (AWS Bedrock, Azure AI, Google Vertex AI), boutique AI consultancies, and system integrators, all significantly larger and better-resourced opponents.

The competitive landscape also includes incumbents in adjacent spaces: traditional data outsourcing firms (Accenture, Capita), crowdsource annotation platforms (Amazon Mechanical Turk, Remotasks), and AI lab internal annotation teams, which represent the "internal build" substitute. Likely new entrants include larger technology companies building annotation and evaluation capabilities in-house, and specialist boutique firms entering from academic or domain-specific AI.

A critical competitive dynamic is the Meta strategic investment. By becoming Scale's largest investor, Meta has created a conflict of interest that has already led Google and OpenAI, Scale's two largest AI lab customers, to exit or reduce their relationships with Scale. This is an unusual competitive situation in which Scale's own investor is causing customer attrition in its largest commercial segment. Mercor, a newer and smaller competitor, is actively attempting to exploit this vulnerability through direct customer solicitation, as evidenced by Scale's September 2025 lawsuit. [CP001, CP002, CP003, CP004, CP005]

Competitor Profile Table
Competitor | Stage / Scale | Target Customer | Product Scope | Funding / Revenue Proxy | Strategic Direction
Appen (ASX: APX) | Publicly listed; ~$300M rev proxy (declining) | Enterprise + government; global | Image, text, speech, video annotation; evaluation | Public; ASX-listed; declining revenue | Stabilize via enterprise AI; reduce crowdsource dependence
Labelbox | Private; Series B+ est. ~$100M raised | Enterprise AI teams; mid-market to F500 | Annotation + evaluation + RLHF + robotics + leaderboards | ~$188M raised (est.); not public | Full-stack AI data platform; expanding into RLHF + evaluation
Snorkel AI | Private; Series C+ est. ~$135M raised | Enterprise (F500); government | Programmatic labeling; weak supervision; AI-assisted annotation | ~$135M raised (est.); not public | Reduce human annotation cost via AI; enterprise platform SaaS
SuperAnnotate | Private; early-stage; ~$14M raised | Enterprise teams; computer vision focus | Collaborative annotation platform; security features; multi-modal | ~$14M raised (est.); not public | Enterprise CV annotation; security-first; expanding NLP/multimodal
Surge AI | Private; small; founded 2020 | AI labs; RLHF-focused; premium quality | Expert RLHF data; LLM feedback; high-quality annotators | Small; not public | Premium RLHF data; compete on quality with smaller team
Invisible Technologies | Private; growth stage; ~$20M raised est. | Enterprise operations; AI automation | AI-powered operations; data processing; annotation as part of broader ops | ~$20M raised (est.) | AI-powered enterprise operations beyond just annotation
Mercor | Private; early; <$50M raised est. | AI labs; enterprise; Scale AI customers (targeted) | AI talent marketplace; RLHF; evaluation; annotation | Early stage; not public | Disrupt Scale by targeting its customers; lawsuit pending

Funding and revenue estimates for private competitors are derived from press coverage and competitor website descriptions; not verified. Appen revenue from ASX filings (publicly available as proxy for annotation market).

3.2 Competitor Profiles

Appen (ASX: APX) is the only publicly traded direct comparable to Scale AI's annotation business. Appen provides AI training data including image, video, speech, text, and document annotation for global enterprise and government customers, with some overlap with Scale's customer base. Appen's publicly reported revenues have been declining as the AI annotation market shifts toward premium and LLM-specialized services. Appen competes primarily on breadth and cost-effectiveness rather than on Scale's quality-premium positioning.

Labelbox is a San Francisco-based data labeling and model evaluation platform targeting enterprise AI teams. Labelbox has expanded from core annotation into RLHF data collection and model leaderboards, making it a direct competitor across several of Scale's product lines. Labelbox's pricing is more accessible than Scale's enterprise contracts, and it has built an expert network (Labelbox Expert Network) for quality-critical annotations. Labelbox offers specific products for robotics AI training, a market Scale has not publicly emphasized.

Snorkel AI focuses on programmatic data labeling using AI-assisted weak supervision techniques to reduce the human-in-the-loop bottleneck. Snorkel targets enterprise customers who need to build AI training datasets without extensive manual labeling. Snorkel's approach differs fundamentally from Scale's human-expert model: Snorkel aims to reduce the need for human annotation, while Scale's model depends on human quality. Snorkel's customers include large enterprises and some government agencies.

SuperAnnotate is an AI annotation platform targeting enterprise teams with collaborative annotation workflows, quality management, and ML pipeline integrations. SuperAnnotate provides security features relevant to enterprise compliance and has expanded into computer vision, NLP, and multimodal annotation.

Mercor is a newer entrant operating an AI talent marketplace for RLHF, model evaluation, and data labeling, founded by ex-Scale contributors. Scale filed a lawsuit against Mercor in September 2025 for alleged customer poaching, suggesting Mercor is actively targeting Scale's enterprise customer base.

Surge AI (now part of a broader RLHF data ecosystem) focuses specifically on high-quality human feedback data for LLM training, with an expert annotator network similar to Scale's but smaller in scale.

Invisible Technologies offers AI-powered business operations and data services, competing with Scale's enterprise automation and annotation capabilities. [CP006, CP007, CP008, CP009, CP010, CP011]

Feature / Capability Matrix
Capability | Scale AI | Appen | Labelbox | Snorkel AI | SuperAnnotate | Mercor
Text / NLP annotation | ✓ Advanced | ✓ Broad | ✓ Advanced | ✓ AI-assisted | ✓ Multi-modal | ✓ RLHF-focused
Image / CV annotation | ✓ Advanced | ✓ Broad | ✓ Advanced | ✓ Programmatic | ✓ Specialized | Limited
RLHF / LLM training data | ✓ Core product (Scale RLHF) | Limited | ✓ RL-Data product | Partial | Limited | ✓ Core focus
Model evaluation + leaderboards | ✓ Scale Evaluation + Leaderboard | ✓ Evaluation product | ✓ Evals product + Leaderboards | Limited | Limited | Limited
Government / defense clearances | ✓ DoD IL4, FedRAMP High | Partial (some gov work) | Not disclosed | Not disclosed | Not disclosed | Not disclosed
GenAI platform / enterprise AI apps | ✓ Scale GenAI Platform | No | No | No | No | No
Defense AI agents platform | ✓ Donovan | No | No | No | No | No
Programmatic / AI-assisted labeling | Partial | No | Limited | ✓ Core strength | Partial | No
Self-serve / pay-as-you-go tier | ✓ Yes | No | ✓ Yes | ✓ Yes | No | No

Capability assessments are based on publicly available product pages and descriptions. Absence of disclosure does not confirm absence of capability. Sources: company websites accessed 2026-05-09.

3.3 Capability, Pricing, and Regulatory Comparison

Scale AI's most significant competitive differentiators are in three areas: (1) government-grade security certifications (DoD IL4, FedRAMP High) that no publicly disclosed competitor has matched, creating a near-exclusive position in classified and defense AI data markets; (2) Scale Evaluation's model safety benchmarking and the Scale Leaderboard, which have established reputational authority as a trusted third-party evaluator for LLM capabilities; and (3) the Donovan defense AI agent platform, which has no direct documented competitor in the cleared AI agent space.

In core annotation, Scale's quality premium is its primary differentiator against lower-cost competitors. The proprietary Scale Data Engine, with its quality feedback loops, expert contributor network (having paid over $1B to contributors globally), and annotation tooling, is difficult to replicate quickly. However, competitors including Labelbox and Snorkel AI have invested significantly in quality management workflows, narrowing this gap.

On pricing, Scale occupies the premium tier: enterprise contracts with custom pricing and dedicated operations teams. Appen and SuperAnnotate offer lower price points, making them more accessible to mid-market customers who may not need Scale's quality guarantee. Labelbox offers tiered pricing including a self-serve option, making it a direct competitor for Scale's self-serve tier as well as its enterprise contracts.

Regulatory and trust posture is a key battleground. Scale's government clearances and regulatory commitments (White House AI safety commitments, WMDP benchmark, Congressional testimony) position it as the trusted government AI data vendor. Appen has some government work but lacks Scale's defense-specific clearance depth. Labelbox and Snorkel do not appear to have published equivalent government security certifications. This is Scale's most durable competitive advantage and is highly defensible due to the multi-year process required to obtain DoD IL4 and FedRAMP High certifications. [CP015, CP016, CP017, CP018, CP019, CP020]

Pricing / Packaging Comparison
Competitor | Pricing Model | Entry Point | Enterprise Tier | Differentiator
Scale AI | Enterprise custom + self-serve pay-as-you-go | 1,000 units free; then per-unit | Custom pricing, dedicated ops, SLA | Quality guarantee; government clearances
Appen | Project-based; volume pricing | Public request; custom quotes | Enterprise contracts; global delivery | Global crowd; low cost; public-company transparency
Labelbox | Tiered SaaS + usage; free tier available | Free plan; Developer plan; Enterprise | Enterprise custom + professional services | Platform integration; evaluation features; lower price
Snorkel AI | Enterprise SaaS; no public pricing | Enterprise contract only | Enterprise subscription + services | AI-assisted labeling reduces volume cost
SuperAnnotate | SaaS subscription; team tiers | Team plan; Enterprise plan | Enterprise with on-prem option | Security-first; collaborative workflow
Surge AI | Project-based; expert premium | Custom project quotes | High-quality expert RLHF contracts | Expert annotator quality for LLM training
Mercor | Talent marketplace; per-task or subscription | Marketplace self-serve | Enterprise placement + managed RLHF | Expert network; AI talent marketplace model

Pricing based on publicly available pricing pages and descriptions. Scale AI, Snorkel AI, and Mercor enterprise pricing require custom quotes.

FP001: Competitive Positioning Map — Quality vs. Government Clearance

Scale AI's relative positioning versus key competitors across the quality/premium and government-clearance dimensions that define its most defensible market position.

Positions are qualitative analyst estimates based on public product/capability descriptions. No empirical quality benchmarks publicly available.

[CP015, CP016, CP017, CP018, CP019]
FP002: Aggregate Capability Score — Scale AI vs. Top Competitors (7-Dimension, 35-Point Scale)

Aggregate capability scores across seven AI data service dimensions (annotation, RLHF, evaluation, government clearance, GenAI platform, defense agents, self-serve) illustrating Scale AI's breadth advantage over direct competitors.

Aggregate of 7 capability dimensions scored 1-5 each (max 35). Scale AI's government clearance and defense agent scores (5/5 each) drive outperformance. Scores are qualitative assessments from public product pages.

[CP015, CP016, CP017, CP018, CP019, CP020]
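
The scoring arithmetic behind the 35-point scale can be sketched as follows. This is a minimal illustration of the stated method (seven dimensions, each scored 1-5, then summed); the dimension keys and the `aggregate_score` helper are hypothetical names, and the example scores are placeholders rather than the report's actual per-vendor values, which are not published here.

```python
# Seven dimensions named in the exhibit description; key spellings are assumptions.
DIMENSIONS = (
    "annotation", "rlhf", "evaluation", "government_clearance",
    "genai_platform", "defense_agents", "self_serve",
)
MIN_SCORE, MAX_SCORE = 1, 5


def aggregate_score(scores):
    """Sum per-dimension scores after checking coverage and the 1-5 range."""
    missing = set(DIMENSIONS) - set(scores)
    if missing:
        raise ValueError(f"missing dimensions: {sorted(missing)}")
    for dim in DIMENSIONS:
        if not MIN_SCORE <= scores[dim] <= MAX_SCORE:
            raise ValueError(f"{dim} score must be between 1 and 5")
    return sum(scores[dim] for dim in DIMENSIONS)


# Placeholder vendor with hypothetical scores, for illustration only.
placeholder_vendor = {
    "annotation": 4, "rlhf": 3, "evaluation": 3, "government_clearance": 2,
    "genai_platform": 1, "defense_agents": 1, "self_serve": 4,
}
total = aggregate_score(placeholder_vendor)
print(f"{total} / {MAX_SCORE * len(DIMENSIONS)}")
```

Because the dimensions are equally weighted, a vendor with a 5/5 in two dimensions that competitors score near the minimum gains up to 8 points from those dimensions alone, which is how the clearance and defense-agent scores can drive the gap.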

3.4 Switching Cost, Lock-in, and Multi-Homing

Switching costs in Scale AI's markets vary dramatically by segment. For AI labs using Scale primarily for annotation and RLHF data, switching costs are moderate: the annotation pipeline is fungible to some degree, and labs have demonstrated willingness to switch (Google and OpenAI departing in 2025). This is precisely why the Meta investment created such rapid customer attrition: switching costs were not high enough to retain customers facing a conflict of interest with their data vendor's new investor.

For enterprise customers using Scale's GenAI Platform, switching costs are higher: the platform involves data migration, workflow integration with enterprise systems, customization of data pipelines, and organizational knowledge transfer. These costs are comparable to mid-market SaaS platform switching costs (6–18 months).

For U.S. government and defense customers, switching costs are extremely high. Transitioning to a new vendor requires the new vendor to obtain equivalent security clearances (DoD IL4 and FedRAMP High authorization is a 12–36 month process), rebuild institutional knowledge, and navigate federal acquisition regulations. This creates a durable, multi-year lock-in for Scale's defense segment.

Multi-homing is common in the annotation market: many large organizations (AI labs, enterprises) run annotation work through multiple vendors simultaneously for quality comparison, cost optimization, and redundancy. This means Scale may not have exclusive relationships even with named customers. Scale's self-serve tier explicitly encourages multi-homing (low commitment, pay-as-you-go). Labelbox's expert network and Surge's quality focus suggest they are positioned to capture multi-homed annotation spend from Scale's customers.

Distribution power and supply access: Scale's contributor network (paid $1B+ globally) is a proprietary supply of human annotators. Competitors must build equivalent networks to compete on quality and throughput. Mercor, which operates an AI talent marketplace, is attempting to build an alternative expert contributor supply chain that directly competes with Scale's contributor network. [CP021, CP022, CP023, CP024, CP025]

Moat Durability / Competitive Risk Register
Moat / Risk | Scale AI Position | Durability | Key Threat | Diligence Signal
Government clearances (DoD IL4, FedRAMP High) | Unique among disclosed AI data vendors | Very High | Years for competitor to replicate | Active DoD contracts confirm commercial value
Quality premium in annotation | Industry-leading claimed; $1B+ paid to contributors | Medium | Labelbox, Surge narrowing quality gap; lower cost competitors | Customer NPS and win/loss data not public
RLHF leadership | Core product; Scale RLHF used by top labs historically | Medium | OpenAI and Google departed; Surge, Mercor compete on RLHF | Loss of top-2 RLHF customers is material
Evaluation benchmarking (Leaderboard, WMDP) | Trusted third-party evaluator position | High | Labelbox Leaderboards; academic benchmarks | Growing regulatory demand for independent eval
Donovan defense AI agents | No disclosed direct competitor in cleared space | Very High | Long-term: large defense contractors may enter | Active contract win is evidence of value
Enterprise platform (GenAI Platform) | Differentiated but faces hyperscaler competition | Medium | AWS, Azure, GCP have significantly more resources | Enterprise customer retention data not public
Commoditization of annotation | Core risk: annotation increasingly commoditized | Low (risk) | Appen, Snorkel, offshore; synthetic data threatens TAM | Layoffs in data-labeling confirm management awareness
Meta investor conflict | Existential risk to AI lab customer retention | Low (risk) | Google and OpenAI already departed | CNBC and TechCrunch reporting confirmed

Durability assessments are analyst judgments based on public evidence. 'Very High' durability means 3+ year competitive advantage expected; 'Low (risk)' indicates active threat to competitive position.

3.5 Moat Durability, Commoditization Risk, and Adverse Evidence

Scale AI's competitive moat is strongest in government/defense AI and weakest in commodity data annotation. The government franchise (DoD IL4, FedRAMP High, Donovan, active defense contracts) is highly durable because clearances take years to obtain and the institutional knowledge embedded in defense AI workflows cannot be rapidly replicated. This represents a genuine, multi-year competitive advantage.

In the core annotation market, Scale's moat is eroding. The departure of Google and OpenAI, Scale's two largest commercial relationships, demonstrates that the quality premium alone does not create unbreakable lock-in for AI lab customers. Competitors are narrowing the quality gap, and lower-cost providers with adequate quality (Appen, SuperAnnotate, offshore teams) continue to win annotation work at price-sensitive enterprises.

The commoditization risk is real and accelerating. As AI foundation model training matures and synthetic data becomes more capable, the TAM for human-labeled annotation could shrink materially. Competitors like Snorkel AI are building annotation tools that reduce human labor requirements per output unit. If this trend continues, the annotation market will bifurcate into a large commodity segment dominated by low-cost providers and a smaller premium segment for expert evaluation and government-grade work, where Scale has a stronger position.

Adverse competitive evidence: (1) the Google and OpenAI departures after the Meta deal demonstrate customer concentration risk and insufficient switching costs in the AI lab segment; (2) Mercor's active customer poaching (per Scale's lawsuit) suggests competitors perceive Scale's customers as vulnerable; (3) Appen's declining revenues serve as a cautionary indicator that the pure annotation segment is facing structural headwinds; (4) Scale's July 2025 layoffs specifically targeted data-labeling headcount, acknowledging over-investment in the commoditizing segment.
[CP026, CP027, CP028, CP029, CP030, CP031]

FP003: Moat / Readiness KPIs — Scale AI Competitive Position

Key competitive position indicators summarizing Scale AI's moat strength, vulnerability points, and diligence gaps.

[CP026, CP027, CP028, CP029, CP030, CP031]

3.6 Exhibits

Chapter 04

04 Financials

4.1 Revenue Model and Revenue Streams

Scale AI generates revenue across three primary streams: (1) enterprise data annotation and RLHF services, (2) the Scale GenAI Platform (enterprise AI application development), and (3) U.S. government and defense contracts through Donovan and its Public Sector Data Engine. A self-serve tier provides a fourth, smaller revenue stream for research and experimental projects.

The enterprise data annotation segment has historically been Scale's largest revenue driver, powering growth from 2016 through the 2024 Series F. This segment charges enterprises and AI labs project-based or volume-based rates for annotation, RLHF data collection, and model evaluation. The pricing model is custom and opaque: Scale does not publish enterprise pricing for its annotation services. The self-serve tier offers 1,000 labeling units free and charges per unit thereafter, a freemium model designed to land smaller accounts.

The Scale GenAI Platform represents a higher-margin, SaaS-oriented revenue stream targeting enterprises seeking to deploy custom AI applications. This segment serves Fortune 500 companies across industries including retail (Etsy, Instacart) and media (TIME). Scale custom-builds LLM-powered applications for enterprise clients, integrating with their proprietary data and workflows. This stream has the potential for subscription and managed-service components, though pricing details are not publicly disclosed.

Government and defense contracts constitute a third, growing, and strategically important revenue stream. Scale holds DoD IL4 and FedRAMP High clearances, enabling it to pursue classified and defense-grade AI data contracts. The Donovan platform (defense AI agents) and the Scale Public Sector Data Engine target a defense market where pricing is typically non-public and based on federal procurement schedules. The DIU RCV program win and active DoD data curation contract confirm commercial relevance. This segment is likely smaller than commercial annotation today but could become larger and more durable given its switching costs. [CI001, CI002, CI003, CI004, CI005]

Revenue Streams Table
Revenue Stream | Mechanism | Unit / Pricing Model | Status | Revenue Quality | Diligence Ask
Enterprise data annotation | Project-based volume annotation; RLHF; evaluation for AI labs + enterprises | Custom pricing per project / volume unit; enterprise SLA contracts | Active but attriting (Google and OpenAI departed 2025) | Medium: recurring project orders but no contractual subscription; customer concentration risk | Revenue by customer segment; top 5 customer concentration; NRR by cohort
Scale GenAI Platform | Enterprise SaaS + managed services for custom GenAI apps; LLM customization for enterprises | Custom enterprise subscription + professional services; no public pricing | Active; strategic growth priority per Droege July 2025 memo | High potential: platform stickiness; lower labor intensity than annotation | ARR, ACV, customer count, gross margin for platform segment
Government / defense contracts | Defense AI data labeling, evaluation, and Donovan platform for DoD, IC, civilian agencies | Federal procurement schedules; contract-based; non-public | Active; growing; DoD IL4 + FedRAMP High unlocks classified AI data market | High: long-term contracts, high switching costs, clearance-based moat | Contract value, pipeline size, ceiling for IDIQ or other vehicles
Scale RLHF | Human feedback data collection for LLM alignment; expert annotator network | Volume-based pricing per feedback instance or project; enterprise contract | Active but affected by OpenAI wind-down; Meta RLHF relationship expanding | Medium: labor-intensive; margin pressure from competition | RLHF revenue by customer; Meta RLHF agreement scope
Self-serve Data Engine | Pay-as-you-go annotation for research, academic, and experimental users | 1,000 units free; per-unit pricing for additional; no enterprise SLA | Active; small revenue contribution; pipeline / lead-gen function | Low-medium: high volume but low ACV; margin depends on automation | Self-serve revenue, conversion rate to enterprise, margin

Revenue classifications and status are based on public product pages, press reports, and analyst inference. No public revenue disclosure. Status reflects 2025 post-Meta-investment dynamics.

[CI001, CI002, CI003, CI004, CI005]
FI001: Revenue Model Bridge — Customer Activity to Gross Profit

How customer engagement converts into revenue, cost of revenue, and gross profit across Scale AI's three primary revenue segments.

Revenue and gross margin figures are analyst estimates. Scale AI does not disclose financial statements. Estimates based on Appen public gross margins, industry benchmarks, and headcount proxies.

[CI001, CI002, CI003, CI011]

4.2 Pricing, GTM Motion, and Sales Efficiency

Scale AI's go-to-market motion is primarily enterprise sales-led, supplemented by a self-serve product-led growth tier. The enterprise tier relies on a dedicated sales team, relationship-driven deals, and solution engineers who work with customers to scope and deliver large annotation or platform projects. Enterprise contracts are custom-quoted with dedicated operational support and SLAs, indicating high average contract values and longer sales cycles relative to typical SaaS companies.

The pricing for annotation services is volume-based and project-specific. Scale's self-serve tier establishes a list-price anchor: units are available at a disclosed per-unit rate after the first 1,000 free units. Enterprise customers negotiate pricing for large-volume annotation programs that may include dedicated annotator teams, quality assurance workflows, and platform integrations. No discounts or typical deal sizes are publicly disclosed.

Sales cycle length and customer acquisition cost (CAC) are not publicly disclosed. As a data services and infrastructure company, Scale likely experiences long enterprise sales cycles (3–12 months) with significant professional services components. Government contract sales cycles are even longer (often 12–36 months from procurement initiation to first revenue). Net revenue retention (NRR) is unverifiable: the Google and OpenAI departures in 2025 represent a severe NRR compression event, but the full financial impact is unknown.

Distribution channels include direct enterprise sales, government procurement relationships (DoD, IC), partnerships with AI lab providers, and self-serve API/platform access for developers and researchers. Scale's relationships with major investors (Accel, Amazon, Meta, Nvidia, Cisco) create strategic co-sell and referral channels, though the specific economic impact is not disclosed.

Scale's data-labeling business had revenue concentration among a small number of large AI labs (Google, OpenAI, Meta, Anthropic). The departure of Google and OpenAI in 2025 following the Meta investment represents a material GTM and concentration risk that directly compresses revenue in the annotation segment. [CI006, CI007, CI008, CI009, CI010]
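
To make the NRR compression mechanism concrete, the sketch below applies the standard cohort formula, NRR = (starting revenue + expansion - contraction - churn) / starting revenue. All inputs are hypothetical and indexed to a starting base of 100; Scale's actual revenue, the share attributable to Google and OpenAI, and any offsetting expansion are not disclosed. The function name is illustrative.

```python
def net_revenue_retention(starting, expansion, contraction, churned):
    """Cohort NRR: revenue kept from existing customers divided by starting revenue."""
    if starting <= 0:
        raise ValueError("starting revenue must be positive")
    return (starting + expansion - contraction - churned) / starting


# HYPOTHETICAL cohort indexed to 100: two large accounts worth 15 each depart,
# while the remaining customers expand by 10.
nrr = net_revenue_retention(
    starting=100.0,
    expansion=10.0,
    contraction=0.0,
    churned=30.0,
)
print(f"NRR = {nrr:.0%}")
```

With these placeholder inputs NRR falls to 80%, which shows why the departure of even two concentrated accounts can push retention below 100% unless the remaining base expands by at least as much. Because annotation revenue is project-based rather than contractually recurring, any real NRR figure for this segment would also depend on how management defines the starting base.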

Pricing / Monetization Table
Product | List / Entry Pricing | Enterprise Pricing Model | Key Unknowns | Source
Scale Data Engine (self-serve) | First 1,000 labeling units free; per-unit pricing above 1,000 units | N/A: self-serve only tier; enterprise uses custom contracts | Per-unit rate not publicly disclosed; volume discount structure unknown | scale.com/pricing (official)
Scale GenAI Platform (enterprise) | Not publicly disclosed; custom enterprise quote only | Enterprise subscription + professional services; dedicated ops included | Typical ACV unknown; SaaS vs. managed-service revenue mix unknown | scale.com/generative-ai-data-engine (official); docs.scale.com
Scale RLHF (enterprise) | Not publicly disclosed; project-based custom pricing | Custom project or volume contract; no public rate card | Per-task rate, project floor, and volume discount structure unknown | scale.com/rlhf (official)
Scale Evaluation (enterprise) | Not publicly disclosed; custom pricing for model developers + public sector | Enterprise evaluation contract; likely per-model or per-benchmark-run pricing | Evaluation pricing model, typical deal size, margin unknown | scale.com/evaluation/model-developers (official)
Donovan (government) | Not publicly disclosed; federal procurement schedules | Government contract pricing (IDIQ, FFP, or T&M); cleared program | Total contract value, ceiling, and existing pipeline not public | scale.com/donovan (official); DoD contract blog post

All pricing for enterprise and government tiers is custom and not publicly disclosed. Self-serve pricing entry is confirmed by public pricing page. This table reflects list-pricing structures, not realized revenue or margin.

[CI006, CI007]
FI002: Unit Economics Bridge — Annotation Segment (Estimate)

Illustrative unit economics flow for Scale AI's enterprise annotation segment showing customer acquisition, contract lifecycle, and retention drivers — all estimates given absence of public data.

ACV, win rates, CAC, and NRR are all unknown (private data). Flow structure is inferred from public GTM descriptions and industry benchmarks. NRR directional estimate based on Google and OpenAI departure events.

[CI007, CI008, CI009, CI013]

4.3 Cost Structure, Gross Margin, and Capital Intensity

Scale AI's cost of revenue is dominated by human labor: the annotation contributors who perform data labeling, RLHF feedback collection, and model evaluation tasks. Scale has paid over $1 billion globally to its contributor network, confirming the labor-intensive nature of the core annotation business. This labor cost gives annotation a structurally lower gross margin than pure software businesses. Industry proxies from Appen (the publicly traded annotation comparable) suggest gross margins for annotation services in the 25–45% range, declining as the market commoditizes.

The Scale GenAI Platform and evaluation products should carry higher gross margins than annotation services, as they involve more software leverage and less per-unit human labor. However, enterprise GenAI Platform projects still require significant professional services and operational effort, capping margins below pure SaaS benchmarks. Government contracts typically carry margins in the 15–30% range for data services, constrained by procurement rate structures.

Operating expenses include engineering and R&D (building annotation tooling, the GenAI Platform, Donovan, and evaluation infrastructure), sales and marketing (enterprise sales team, government BD), and general and administrative. Scale had approximately 1,400 employees before the July 2025 layoffs and approximately 1,000 after, suggesting a significant OpEx reduction from the 14% headcount cut plus 500 contractor reductions, which primarily targeted the data-labeling cost base. This is consistent with improving blended margins by shedding the lower-margin commodity annotation labor force.

Capital intensity in Scale's model is primarily working-capital-based (labor payments to contributors) rather than physical capex (hardware, facilities). Infrastructure costs (cloud compute for annotation tooling and model evaluation) are a secondary cost factor. This makes Scale relatively capital-efficient compared to hardware AI companies but more labor-intensive than pure SaaS. The $1 billion Series F and Meta strategic investment have provided substantial capital to fund the pivot toward higher-margin enterprise and government services. [CI011, CI012, CI013, CI014, CI015]
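
The blended-margin argument can be made concrete with a short sketch. It uses the midpoints of the segment margin ranges estimated in this section (annotation 25–45%, GenAI Platform 45–65%, government 15–30%). Because Scale does not disclose revenue by segment, both revenue mixes below are purely hypothetical, and the function name is illustrative.

```python
# Midpoints of the segment gross-margin ranges estimated in this report.
SEGMENT_MARGIN = {
    "annotation": (0.25 + 0.45) / 2,      # 35%
    "genai_platform": (0.45 + 0.65) / 2,  # 55%
    "government": (0.15 + 0.30) / 2,      # 22.5%
}


def blended_gross_margin(revenue_mix):
    """Revenue-weighted average margin; revenue_mix maps segment -> revenue share."""
    total = sum(revenue_mix.values())
    if abs(total - 1.0) > 1e-9:
        raise ValueError("revenue shares must sum to 1")
    return sum(share * SEGMENT_MARGIN[seg] for seg, share in revenue_mix.items())


# HYPOTHETICAL mixes (not disclosed by Scale): annotation-heavy vs. post-pivot.
before = {"annotation": 0.70, "genai_platform": 0.15, "government": 0.15}
after = {"annotation": 0.40, "genai_platform": 0.40, "government": 0.20}

print(f"before: {blended_gross_margin(before):.1%}")
print(f"after:  {blended_gross_margin(after):.1%}")
```

Under these assumptions the blend improves only to the extent that growth comes from the platform segment. Government work carries the lowest estimated margin range, so a mix shift toward government alone would pull the blended figure down.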

Unit Economics Table
Metric | Value / Estimate | Confidence | Why It Matters | Diligence Ask
Average Contract Value (enterprise annotation) | Estimated $1M–$5M+ per engagement | Low (analyst estimate) | Drives revenue scale; indicates enterprise buyer depth | Request deal size distribution from CFO; analyze Appen disclosures as proxy
Gross Margin (annotation segment) | Estimated 25–45% | Low (Appen proxy; analyst estimate) | Core margin driver; declining with commoditization | Request P&L by segment; compare Appen disclosed gross margins
Gross Margin (GenAI Platform) | Estimated 45–65% | Low (SaaS + managed-services proxy) | Platform economics if recurring SaaS dominates | Request segment margin breakdown; subscription vs. services revenue split
Gross Margin (government/defense) | Estimated 15–30% | Low (federal services industry proxy) | Cap on margin in procurement-constrained segment | Review contract type (T&M vs. FFP) and pricing structure
Net Revenue Retention (NRR) | Likely <100% in 2025 (Google + OpenAI departure) | Low (estimated from directional evidence) | Signals revenue durability and customer health | Request NRR by cohort and segment; Google/OpenAI revenue impact
Customer Acquisition Cost (CAC) | Not disclosed; estimated high for enterprise/government | Unknown (no proxy available) | Efficiency of growth spend; payback period | Request CAC by segment; sales headcount and quota attainment
Employee Revenue per Head (proxy) | Est. $200K–$500K revenue/employee (if $300M ARR, 800 employees) | Very low (both inputs estimated) | Capital efficiency benchmarking against data services comps | Requires confirmed headcount + revenue; request in due diligence
Annual Burn Rate (pre-layoff) | Estimated $400M–$700M/year (1,400+ employees + annotators) | Low (headcount-based proxy) | Capital runway and cost efficiency | Request monthly cash flow statements; P&L operating expense breakdown

All unit economics are estimated or unavailable. Scale AI does not disclose financial statements. Estimates derived from Appen public financials, industry benchmarks, and headcount-based proxies. Confidence is uniformly low; replace with actuals in due diligence.

[CI011, CI012, CI013]
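
For readers less familiar with the NRR row in the table above, the sketch below uses one common definition of net revenue retention for a customer cohort. All input figures are hypothetical and chosen only to show how the departure of a large customer can push NRR below 100%. They are not Scale data.

```python
def net_revenue_retention(start_arr, expansion, contraction, churn):
    """NRR for one cohort: (start + expansion - contraction - churn) / start."""
    if start_arr <= 0:
        raise ValueError("start_arr must be positive")
    return (start_arr + expansion - contraction - churn) / start_arr


# HYPOTHETICAL cohort, in $M of ARR; none of these figures are disclosed by Scale.
nrr = net_revenue_retention(start_arr=100.0, expansion=10.0, contraction=5.0, churn=20.0)
print(f"NRR: {nrr:.0%}")
```

In this illustration, churn from departed customers outweighs expansion within the remaining accounts, so the cohort retains less revenue than it started with.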
FI003: Financial Estimate Range — Key Metrics (Low / Mid / High)

Analyst estimate ranges for Scale AI's key financial metrics derived from public proxies, Appen comparable, headcount proxies, and disclosed funding data. All estimates carry very low confidence.

Revenue estimates derived from: headcount-based proxy (1,000 employees × ~$350K revenue/employee), Appen comparable revenue-per-employee analysis, and independent analyst commentary. Burn estimates derived from headcount × all-in cost proxy. All estimates are illustrative only and not confirmed by management.
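
The headcount-based proxy in this note is simple multiplication, sketched below. The low and high revenue-per-employee inputs are taken from the $200K–$500K range in the unit economics table and the mid input is the ~$350K figure above. The resulting values illustrate the method only and are not necessarily the values plotted in FI003.

```python
def revenue_proxy(employees, revenue_per_employee):
    """Headcount-based revenue proxy in USD."""
    return employees * revenue_per_employee


EMPLOYEES = 1_000  # post-layoff headcount used in the note above

# Low and high come from the $200K-$500K range in the unit economics table;
# mid is the ~$350K figure from the note above.
SCENARIOS = {"low": 200_000, "mid": 350_000, "high": 500_000}

for name, rpe in SCENARIOS.items():
    print(f"{name}: ${revenue_proxy(EMPLOYEES, rpe) / 1e6:,.0f}M")
```

The spread between the low and high cases is 2.5x, which is why the report assigns very low confidence to any revenue figure derived this way.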

[CI011, CI012, CI013, CI018, CI019, CI020]

4.4 Capital Adequacy and Financing History

Scale AI has raised more than $1.6 billion in external capital. The funding history includes: seed and early rounds from 2016–2019 (including Y Combinator, Index Ventures, Founders Fund); a 2021 Series E of $325 million at approximately $7.3 billion valuation; approximately $600 million total pre-Series F; and the May 2024 Series F of $1 billion at $13.8 billion valuation led by Accel with participation from Amazon, Meta, Cisco, Intel, AMD, ServiceNow, Nvidia, DFJ Growth, WCM, and returning investors including Tiger Global, Thrive, Greenoaks, Wellington, Nat Friedman, and Elad Gil. The Series F included both primary (new cash to company) and secondary (liquidity to existing shareholders) components.

The June 2025 Meta strategic investment was approximately $14.3 billion for a minority stake (approximately 49% of outstanding equity). Critically, the majority of Meta's investment proceeds were distributed to existing shareholders and vested equity holders, not to the company's operating treasury, consistent with a secondary transaction structure. The net new primary capital to Scale's operating business from the Meta deal is not fully disclosed. Scale maintains an independent corporate structure; Meta holds a minority stake.

Without public financial statements, cash on hand is unknown. The Series F closed in May 2024, and if a significant portion was primary capital ($500M+ to the company), combined with the Meta deal's primary component, Scale likely has adequate runway of 3+ years at current burn rates. The July 2025 layoffs of 200 employees and 500 contractors, plus the strategic pivot away from data-labeling, are cost-reduction measures consistent with extending runway and improving cash efficiency. Burn rate is not publicly disclosed. Scale's pre-layoff expense base (1,400+ employees, significant annotator contractor costs) suggests a substantial monthly burn. Post-layoff, cash efficiency should improve materially.

Scale has no publicly disclosed debt facilities or project-finance obligations. The Meta deal created structural dependencies (Meta as both investor and key customer for related AI services) but no disclosed economic covenants or restrictions. [CI016, CI017, CI018, CI019, CI020]

Capital Adequacy Table
Item | Value / Estimate | Source / Basis | Confidence | Notes
Total capital raised | ~$1.6B+ (pre-Series F ~$600M + $1B Series F) | TechCrunch Series F reporting (May 2024) | High (confirmed by multiple high-rep sources) | Mix of primary + secondary; exact primary allocation unknown
Series F close | May 2024; $1B at $13.8B valuation; Accel lead | TechCrunch / CNBC reporting | High (multiple corroborating sources) | Includes new investors Amazon, Meta, Cisco, Intel, AMD, ServiceNow
Meta strategic investment | ~$14.3B for minority stake (~49%); valuation >$29B | CNBC June 2025 reporting | High (multiple corroborating sources) | Primarily secondary (existing shareholder liquidity); primary portion unknown
Net primary capital from Meta deal | Unknown; primarily secondary distribution | CNBC/TechCrunch reporting; Scale blog | Low (inferred from secondary structure language) | Meta holds minority equity; proceeds largely to shareholders
Estimated cash on hand (post-2025) | Estimated $500M–$1B+ | Analyst estimate from Series F primary + partial Meta primary | Very low (estimate only) | No public financial statements; estimate only
Monthly burn rate (post-layoff) | Estimated $25M–$50M/month (post-July 2025 restructuring) | Analyst estimate from headcount × cost-per-employee proxy | Very low (estimate only) | Pre-layoff burn materially higher; post-restructuring improving
Estimated runway | Estimated 24–48 months from July 2025 (if $500M–$1B cash, $25M–$50M burn) | Derived estimate from cash and burn estimates | Very low (both inputs estimated) | Runway estimate requires confirmed primary capital from Meta deal
Disclosed debt / credit facilities | None publicly disclosed | Public sources review | Medium (no evidence of public debt) | Private credit or revenue-based financing possible but undisclosed

Funding amounts for Series F and Meta investment are confirmed by high-reputation news sources. Cash on hand, burn rate, and runway are analyst estimates with very low confidence. Due diligence must obtain audited financial statements.
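
Because runway is derived from two estimated inputs, it is worth seeing how sensitive it is to each. The sketch below divides the table's cash range by its monthly burn range for all four corner combinations, assuming flat burn and no new inflows. Both simplifications are assumptions of this sketch, not statements about Scale's actual plans.

```python
from itertools import product

CASH_USD_M = (500, 1_000)         # estimated cash on hand, low and high
BURN_USD_M_PER_MONTH = (25, 50)   # estimated post-layoff monthly burn, low and high


def runway_months(cash, monthly_burn):
    """Months until cash is exhausted, assuming flat burn and no new inflows."""
    return cash / monthly_burn


scenarios = {
    (cash, burn): runway_months(cash, burn)
    for cash, burn in product(CASH_USD_M, BURN_USD_M_PER_MONTH)
}

for (cash, burn), months in sorted(scenarios.items(), key=lambda kv: kv[1]):
    print(f"cash ${cash}M, burn ${burn}M/month -> {months:.0f} months")
```

The corner cases span 10 to 40 months, a wider and lower band than the 24–48 months quoted in the table. Reaching 48 months at $25M per month would require about $1.2B of cash, which is consistent only with the "+" on the cash estimate. This reinforces the need to confirm the primary capital received from the Meta deal.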

[CI016, CI017, CI018, CI019]
FI004: Capital Intensity / Cash-Flow Map — Funding Inflows vs. Operating Outflows

Illustrative capital flow showing major fundraising inflows and estimated operating outflows from 2021 through 2026, highlighting the funding adequacy and burn context for the business model pivot.

All values are analyst estimates. Primary vs. secondary split of Series F and Meta deal is not disclosed. Series E $325M may include both primary and secondary. Operating spend proxy based on headcount and industry benchmarks. The waterfall is illustrative, not an audited cash flow statement.

[CI016, CI017, CI018, CI019]

4.5 Financial Verdict — Revenue Quality, Margin Path, and Diligence Blockers

Scale AI presents a complex financial profile: abundant capital, an unclear revenue trajectory, and a management team executing a high-stakes business model pivot under conditions of significant customer attrition.

Revenue quality is concerning. The data-labeling segment, historically Scale's largest revenue driver, is commodity-positioned and facing structural headwinds (Appen's declining public revenues are a leading indicator), customer attrition (Google and OpenAI departing), and direct competition. The GenAI Platform and government/defense segments offer higher revenue quality (longer contracts, stronger lock-in, less commoditization risk) but their current contribution to total revenue is unknown and likely smaller. Net revenue retention post-2025 is likely below 100% in the AI lab segment.

Margin trajectory is directionally positive if the pivot succeeds: shedding the low-margin data-labeling workforce and growing the higher-margin platform and government segments would improve blended gross margins over time. However, the pivot carries execution risk, and the intermediate period (2025–2027) likely shows revenue compression before platform and government revenue scales.

Capital adequacy is the strongest financial positive. Scale has raised $1.6B+ and the Meta deal provided additional liquidity to shareholders (and some primary capital). Post-layoff, the company is operating with reduced headcount and lower burn. Runway is estimated at 3+ years, sufficient for the pivot execution.

Primary diligence blockers:
(1) No public revenue or ARR data; valuation multiples are unverifiable.
(2) Google and OpenAI attrition impact on revenue unknown.
(3) Government contract revenue size and growth trajectory not disclosed.
(4) Gross margin by segment not disclosed.
(5) Primary vs. secondary split of the Meta deal not confirmed.
(6) Mercor lawsuit outcome could have financial implications (indemnification, customer loss).
[CI021, CI022, CI023, CI024, CI025]

Public Financial Gaps Table
Missing Data | Impact on Diligence | Exact Diligence Path
Total revenue / ARR | Cannot verify valuation multiple; cannot assess growth or revenue quality | Request audited financials; obtain management accounts; confirm with Big 4 audit
Revenue by segment (annotation vs. platform vs. government) | Cannot assess business model pivot success; segment margin unknowable | Request segment P&L breakdown from CFO; separate contracts by segment
Google and OpenAI revenue impact (2025 attrition) | Largest risk to financial model: magnitude of revenue loss from largest customers unknown | Request revenue by customer; confirm Google and OpenAI departure dates and final billings
Gross margin by product line | Cannot assess margin improvement thesis or capital intensity; blended margin is opaque | Request gross profit by segment; annotator cost as % of revenue; platform margin
Net Revenue Retention (NRR) | Cannot assess customer health; recurring revenue durability unknowable | Request NRR by annual cohort (2021–2025) and by segment
Primary vs. secondary split of Meta deal | Cannot assess company's actual cash increase from Meta deal | Request closing documents for Meta investment; confirm primary capital allocation
Operating cash flow and burn rate | Cannot estimate runway or capital adequacy with confidence | Request monthly P&L and cash flow statements (Jan 2024 – present)
Employee compensation structure | Cannot assess whether talent retention after Meta deal and layoffs is preserved | Request anonymized compensation bands and equity vesting schedules
Government contract backlog and pipeline | Cannot size the defense segment or its growth trajectory | Request total DoD/IC contract backlog, option periods, and pipeline
Mercor lawsuit financial exposure | Potential financial liability from ongoing IP/trade-secret litigation | Review legal claims filed; assess settlement risk; confirm insurance coverage

All items in this table represent confirmed evidence gaps as of 2026-05-09. This table is the primary due diligence checklist for financial underwriting of Scale AI.

[CI021, CI022, CI023, CI024, CI025]

4.6 Exhibits

Chapter 05

05 Product & Technology

5.1 Product Portfolio and Customer Workflow

Scale AI's product portfolio addresses four distinct customer workflow problems in AI development: (1) creating high-quality training data for model development; (2) collecting human feedback for LLM alignment and RLHF; (3) independently evaluating AI model safety and capability; and (4) deploying custom GenAI applications at enterprise scale. A fifth product line, Donovan, addresses defense and intelligence mission-specific AI workflows.

The Scale Data Engine is Scale's core product and the foundation of the company. It enables AI teams to collect, curate, annotate, and quality-assure training datasets across modalities (text, image, video, audio, 3D). The customer workflow: AI engineers define annotation requirements and quality standards; Scale's contributor network executes annotation tasks guided by Scale's proprietary tooling and QA pipeline; annotated datasets are delivered back into the customer's MLOps pipeline. The Data Engine supports the full model development lifecycle from pre-training data curation to RLHF to evaluation.

The Scale GenAI Platform addresses the enterprise AI deployment workflow. Enterprise customers (Fortune 500 companies across retail, media, finance, and government) bring proprietary data and domain requirements; Scale transforms this into customized LLM-powered applications. The platform includes data ingestion, LLM fine-tuning, RAG pipeline configuration, and production deployment support. This product targets enterprises that want custom AI without the deep ML expertise required to build from scratch.

Scale Evaluation and the Scale Leaderboard provide independent model benchmarking services. AI labs and enterprises use Scale Evaluation to assess LLM capabilities across dimensions including reasoning, safety, coding, and domain knowledge. The Scale Leaderboard is a public ranking of LLM performance, a developer-facing tool that has established Scale as a trusted third-party evaluator in the AI community. The WMDP (Weapons of Mass Destruction Proxy) benchmark is a specialized evaluation contribution for AI safety.

Donovan serves DoD, IC, and civilian government agencies with AI agents for mission-critical workflows. It integrates classified data sources, supports cleared operational environments, and provides AI-powered decision support for defense and intelligence applications. Donovan is deployed in DoD IL4-authorized environments and is Scale's primary competitive moat in the defense segment. [CE001, CE002, CE003, CE004, CE005]

Product Module / Asset Matrix
Product / Module | Primary User | Maturity / Status | Core Differentiation | Diligence Gap
Scale Data Engine | AI labs; enterprise AI teams; research | Mature (GA); core product since 2016 | Proprietary QA pipeline; contributor network ($1B+ paid); multi-modal support | Annotation quality vs. competitors not independently benchmarked; NPS unknown
Scale GenAI Platform | Enterprise (F500, media, retail); government | Active; strategic growth priority 2025 | Integration of annotation quality into LLM customization; managed enterprise deployment | Technical depth vs. AWS Bedrock / Azure AI not independently assessed; customer retention unknown
Scale RLHF | AI labs (Meta, Cohere, Anthropic); enterprise | Mature; core product; affected by OpenAI/Google departures | Expert annotator network for preference data; quality feedback loops for alignment | RLHF revenue concentration risk; Meta relationship scope undisclosed
Scale Evaluation + Leaderboard | AI labs; enterprises; government; AI safety researchers | Active; growing; public developer tool | Trusted third-party evaluator; WMDP benchmark; government evaluation mandate | Leaderboard methodology and independence need third-party audit for credibility
Donovan (Defense AI Platform) | DoD; IC; civilian government agencies | Active; cleared production; specialized for IC/DoD | DoD IL4 certified; classified environment deployment; defense mission AI agents | Technical capabilities not publicly documented; competitive comparison with Palantir unknown
Scale Public Sector Data Engine | U.S. government; defense; intelligence community | Active; cleared production | FedRAMP High; defense-specific data curation and management | Contract size and growth pipeline not public; timeline to other agencies unknown
Self-Serve Data Engine | Researchers; startups; individual AI developers | Active; GA; lower priority post-pivot | Low barrier to entry; 1,000 units free; API access; diverse task types | Conversion rate to enterprise unknown; margin on self-serve tier not disclosed

Maturity assessments based on public product page descriptions and press coverage. Diligence gaps reflect absence of independent third-party validation for claimed capabilities.

5.2 Architecture and Operating Model

Scale AI's operating model combines software tooling with human-in-the-loop execution: a hybrid AI-assisted annotation architecture rather than a pure software or pure labor model. The core technology stack consists of: (1) a web-based annotation tooling platform for labeling tasks across modalities; (2) a quality assurance pipeline for review, reconciliation, and feedback; (3) API infrastructure exposing Scale's services programmatically to enterprise and lab customers; (4) the GenAI Platform for LLM customization workflows; and (5) the Donovan AI agent runtime for defense missions.

The annotation contributor network is a proprietary, global workforce recruited, trained, quality-screened, and managed by Scale. The contributor platform (which Scale calls its "task marketplace") routes annotation jobs to appropriately skilled contributors, monitors completion quality in real time, and escalates to expert reviewers for quality-sensitive tasks. Scale has invested in contributor quality management as a core differentiator, claiming that its proprietary QA pipeline produces annotation quality exceeding competitors.

The Scale API is the primary developer-facing interface for the Data Engine, offering programmatic submission of annotation jobs, retrieval of completed datasets, and integration with ML training pipelines. The API supports all annotation task types (text classification, bounding boxes, segmentation, RLHF preference pairs, etc.) and includes webhook support for real-time job completion notification.

For the GenAI Platform, Scale's operating model involves: data ingestion and preprocessing, LLM selection and prompt engineering, fine-tuning or RAG pipeline configuration, red-teaming and safety evaluation, and production deployment with monitoring. This is a managed services approach where Scale's engineers and the platform tooling jointly deliver the custom AI application.

For Donovan, the architecture is specialized for cleared environments: it runs on DoD IL4-certified cloud infrastructure, integrates with classified government data systems, supports multi-modal AI agent workflows (planning, search, decision support), and provides explainability features required for military applications. Donovan's technical architecture is not publicly disclosed in detail; the operational model is closer to a managed defense IT service than a commercial SaaS product. [CE006, CE007, CE008, CE009, CE010]

Workflow / Use-Case Table
User Job | Current Workflow | Scale Solution | Measurable Benefit | Limitation
Train ML model with labeled data | Manual labeling by internal team or crowdsource platform; slow and inconsistent | Scale Data Engine: submit raw data via API; receive annotated dataset; automated QA | 10x+ throughput vs. internal teams; higher consistency (claimed) | Custom pricing; AI lab customers departing post-Meta conflict
Align LLM with human preferences (RLHF) | Build internal preference collection pipeline; hire expert evaluators | Scale RLHF: expert human feedback at scale; pair annotation; preference ranking | Quality expert feedback for LLM alignment training | OpenAI and Google departed; Meta relationship expanding but concentration risk
Deploy custom GenAI app for enterprise | Build in-house with ML engineers; 12–24 month timeline typical | Scale GenAI Platform: data → custom LLM app in <2 months (TIME case study) | TIME case study: deployed in <2 months; 7,000+ attack vectors tested | Hyperscalers (AWS, Google, Azure) offer comparable platforms at lower cost
Evaluate LLM capability and safety independently | Use academic benchmarks (MMLU, HELM); internal red-team | Scale Evaluation + Leaderboard; WMDP benchmark; expert safety evaluation | Trusted third-party scores; government-recognized evaluation authority | Independence perception risk (Meta as an investor could bias evaluation results)
Execute AI-powered defense missions | Manual analyst workflows; limited AI integration in classified environments | Donovan: cleared AI agents for IC/DoD decision support and mission planning | First cleared AI agents platform for DoD; no comparable disclosed competitor | Technical capabilities not fully documented; limited commercial transparency

Benefit claims for Scale GenAI Platform deployment speed are based on the TIME case study. Other claims are based on company-described capabilities on official product pages.

FE001: Product Architecture Map

Scale AI's five-layer product architecture from customer-facing interfaces through AI services, workflow execution, compliance, and infrastructure.

Architecture is inferred from public product pages, API documentation, and official security certification descriptions. Donovan classified capabilities are not publicly documented.

[CE006, CE007, CE008, CE016]
FE002: Customer Workflow / Operating Flow — Enterprise Annotation to Delivered Dataset

End-to-end customer workflow for Scale AI's core annotation product, from customer data submission through contributor execution, QA, and dataset delivery to the customer's MLOps pipeline.

Workflow is reconstructed from public API documentation, product descriptions, and the Scale Leaderboard/Evaluation public materials. Specific QA algorithms are proprietary and not publicly disclosed.

[CE006, CE007, CE008, CE009]

5.3 Differentiation, Technology IP, and Competitive Moat

Scale AI's key technological and operational differentiators span five areas.

First, proprietary annotation tooling and QA methodology: Scale has invested significantly in building annotation interfaces optimized for accuracy, speed, and consistency across task types. The QA pipeline applies statistical quality controls, consensus checking, and expert reviewer escalation to maintain annotation quality standards that the company claims exceed competitors. This proprietary workflow is not open-source and represents institutional know-how embedded in operations.

Second, the contributor network as supply-side IP: Scale's global contributor network (having received $1B+ in payments) is a proprietary asset. The quality screening, onboarding, skill credentialing, and task routing logic that governs this network is Scale's operational moat in the annotation segment. Competitors (including Mercor, which Scale is currently suing for alleged poaching) must build equivalent networks to compete on quality and throughput at scale.

Third, government certifications as regulatory moat: Scale holds DoD IL4 Provisional Authorization and FedRAMP High Authorization. These authorizations required a 2–4 year compliance investment and ongoing compliance maintenance. No disclosed AI data competitor has equivalent certifications, giving Scale a near-exclusive position in government AI data contracts that require them. The cost and time to replicate this is a genuine barrier to entry for government AI data.

Fourth, the Scale Leaderboard and WMDP benchmark as reputational IP: The Scale Leaderboard has established Scale as the trusted third-party evaluator for LLM capabilities. The WMDP benchmark (Weapons of Mass Destruction Proxy) for AI safety evaluation has been adopted by the AI safety research community. These benchmarks are public goods that generate reputational capital for Scale as an authoritative AI evaluator, supporting both commercial evaluation contracts and government relationships.

Fifth, the Donovan platform as first-mover advantage in defense AI agents: Donovan is the first commercially deployed, DoD IL4-cleared AI agents platform for defense mission workflows. The combination of specialized product capabilities, government relationships, and cleared infrastructure positions Donovan as a difficult-to-displace defense AI platform, particularly as the DoD increases its AI adoption budget. [CE011, CE012, CE013, CE014, CE015]

Technology / Operating Architecture Table
Layer / Component | Role | Key Dependency | Primary Risk
Annotation Tooling Platform (web UI + API) | Interface for contributor annotation task execution; customer job submission | Cloud infrastructure (AWS/GCP/Azure); contributor network | Tooling obsolescence as AI-assisted annotation reduces human need
Quality Assurance Pipeline | Statistical QC; consensus scoring; expert reviewer escalation | Contributor network quality; proprietary QA algorithms | QA methodology is proprietary and not independently audited; competitors narrowing quality gap
Scale API (annotation + GenAI + evaluation) | Programmatic access to all Scale services for enterprise + developer integration | Cloud infrastructure; API security; access control | API availability and reliability not independently benchmarked; no public status page
Contributor Network (global annotators) | Human execution of annotation, RLHF, and evaluation tasks | Global workforce management; quality screening; compensation logistics | Workforce supply chain risk; Mercor attempting to build competing network
Scale GenAI Platform (LLM customization) | Enterprise custom AI app development: RAG, fine-tuning, deployment | Cloud LLM providers (OpenAI, Anthropic, Meta); customer data security | Hyperscaler competition; proprietary LLM provider relationships may change post-Meta deal
Donovan Runtime (defense AI agents) | Cleared AI agent execution for DoD/IC missions; mission planning; search | DoD IL4 certified cloud; classified data integration; cleared personnel | Dependency on government certification maintenance; technical details not publicly disclosed
Evaluation / Leaderboard Infrastructure | LLM benchmarking; WMDP safety evaluation; public leaderboard publication | Model APIs from AI labs; independent evaluation infrastructure | Independence perception risk; Meta as a Scale investor could create conflicts in LLM ranking

Architecture assessment is based on public documentation, product pages, and API references. Donovan architecture is partially inferred; classified capabilities are not publicly documented.

FE003: Critical Dependency Map

Scale AI's key external dependencies spanning infrastructure, certification bodies, customers, investors, and regulatory relationships that could affect product delivery or competitive position.

Dependency relationships are inferred from public product descriptions and press reporting. Specific contract and commercial terms are not publicly disclosed.

[CE011, CE012, CE013, CE014]
FE004: Product Maturity / Capability Map

Assessment of Scale AI's five core products across four dimensions: technical maturity, enterprise fit, competitive moat, and technical depth, scored 1-5 (5=highest).

Scores are analyst estimates based on public evidence. Maturity=1(pre-release) to 5(proven/stable). Enterprise Fit=1(not suited) to 5(core enterprise). Competitive Moat=1(commodity) to 5(near-exclusive). Technical Depth=1(basic) to 5(advanced/proprietary).

[CE011, CE012, CE013, CE014, CE015]

5.4 Trust, Safety, Compliance, and Quality Controls

Scale AI has invested heavily in trust and compliance infrastructure, particularly for government customers. The company's publicly disclosed certifications include: SOC 2 Type II (enterprise security controls audit), ISO 27001 (information security management system), DoD IL4 Provisional Authorization (Defense Department cloud security for controlled unclassified information), and FedRAMP High Authorization (U.S. federal government cloud security for high-impact unclassified systems).

For AI safety, Scale has made voluntary commitments under the 2024 White House AI Safety Commitments, covering RLHF safety practices, red-teaming, and responsible AI deployment. Scale contributed the WMDP (Weapons of Mass Destruction Proxy) benchmark for evaluating whether LLMs have been trained to prevent generation of dangerous content. The WMDP benchmark measures AI safety in dual-use knowledge domains, and Scale's test-and-evaluation white paper documents its approach to responsible AI model evaluation.

Scale's annotation quality controls are proprietary but include: task-specific quality guidelines, multiple annotator redundancy for high-stakes tasks, statistical quality monitoring, expert reviewer escalation, and inter-annotator agreement scoring. The company claims industry-leading annotation quality, though independent third-party quality audits of Scale's annotation are not publicly available. The TIME customer case study (GenAI platform deployed in under 2 months, with 7,000+ attack vectors tested) provides partial evidence of red-teaming capability. The DIU RCV program win confirms that Scale's technology passed defense procurement requirements. These are corroborating evidence of technical capabilities but fall short of independent third-party technical audits.

Privacy and data governance: Scale's data handling practices for enterprise annotation involve customer-provided data that may contain proprietary or sensitive information. Scale's security certifications (SOC 2 Type II, ISO 27001) provide some assurance of data governance controls. For government customers, DoD IL4 and FedRAMP High impose strict data handling requirements that Scale has demonstrated compliance with. Customer data handling in the annotation pipeline (particularly for AI labs with proprietary training data) is a due diligence concern that is not fully addressed by public documentation. [CE016, CE017, CE018, CE019, CE020]
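
To make the quality-control vocabulary above concrete, the following is a generic sketch of two of the listed techniques: majority-vote consensus with escalation to an expert reviewer, and inter-annotator agreement measured by Cohen's kappa. Scale's actual QA algorithms are proprietary and undisclosed, so this is not a description of them. The function names and the two-thirds agreement threshold are assumptions made for illustration.

```python
from collections import Counter


def consensus(labels, min_agreement=2 / 3):
    """Return (label, agreement) for one item labelled by redundant annotators.

    label is None when agreement falls below min_agreement (a hypothetical
    threshold), signalling that the item should go to an expert reviewer.
    """
    if not labels:
        raise ValueError("at least one label is required")
    top_label, votes = Counter(labels).most_common(1)[0]
    agreement = votes / len(labels)
    return (top_label if agreement >= min_agreement else None), agreement


def cohens_kappa(a, b):
    """Cohen's kappa for two annotators labelling the same items."""
    if len(a) != len(b) or not a:
        raise ValueError("annotators must label the same non-empty item list")
    n = len(a)
    observed = sum(x == y for x, y in zip(a, b)) / n
    freq_a, freq_b = Counter(a), Counter(b)
    # Agreement expected by chance, from each annotator's label frequencies.
    expected = sum(freq_a[k] * freq_b[k] for k in freq_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both annotators used one identical label throughout
    return (observed - expected) / (1 - expected)


print(consensus(["cat", "cat", "dog"]))    # accepted by majority
print(consensus(["cat", "dog", "bird"]))   # no majority: escalate
print(cohens_kappa(["cat", "cat", "dog", "dog"], ["cat", "dog", "dog", "dog"]))
```

A diligence request for inter-annotator agreement scores would ideally specify which statistic is reported (raw agreement, kappa, or another measure), since raw agreement overstates quality when one label dominates.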

Trust / Quality / Compliance Table
Control / Certification | Status | Scope | Verified By | Gap / Diligence Ask
SOC 2 Type II | Certified (confirmed) | Commercial data handling; internal security controls; enterprise annotation workflows | Third-party auditor (not named publicly) | Request current SOC 2 report; confirm scope covers annotation data pipelines
ISO 27001 | Certified (confirmed) | Information security management system; global operations | Third-party certification body | Confirm certification date and scope; request ISO certificate
DoD IL4 Provisional Authorization | Certified (confirmed) | DoD Impact Level 4 data: controlled unclassified + sensitive defense data | DISA / DoD authorization body | Confirm active PA status; request ATOs for specific Donovan deployments
FedRAMP High Authorization | Authorized (confirmed) | U.S. federal high-impact systems; sensitive unclassified government data | FedRAMP PMO / JAB authorization | Confirm active authorization status in FedRAMP marketplace; scope of authorized services
White House AI Safety Commitments (2024) | Committed (company-signed voluntary) | RLHF safety; red-teaming; responsible AI deployment; model evaluation | White House OSTP; voluntary, not legally binding | Confirm implementation of commitments; review Scale's published commitment tracker
WMDP Benchmark (Weapons of Mass Destruction Proxy) | Published (publicly available) | AI safety evaluation for dual-use knowledge prevention; LLM safety testing | AI safety research community adoption | Confirm WMDP adoption by other labs; independent validation of benchmark methodology
Annotation Quality Standards | Internal (proprietary) | Inter-annotator agreement; QA pipeline; task-specific accuracy standards | Internal (self-reported); no public third-party audit | Request quality audit methodology; inter-annotator agreement scores; customer NPS
Data Privacy / Customer Data Handling | Internal controls (SOC 2 covers some) | Customer proprietary data in annotation pipeline; AI lab training data confidentiality | Partially covered by SOC 2 Type II; not independently verified for annotation data | Request data handling agreements; confirm customer data isolation in annotation pipeline

Certifications confirmed by official Scale website (scale.com/legal/security). White House commitments confirmed by Scale blog post. Quality standards are based on company-described methodology only.

[CE016, CE017]

5.5 Product Roadmap, Deployment, and Open Questions

Scale AI's product roadmap as of 2025 is implicitly directed by the July 2025 strategic pivot toward enterprise and government: prioritizing Scale GenAI Platform enterprise deployments, expanding Donovan for government agencies, and growing Scale Evaluation capabilities for the emerging AI governance and compliance market. The data-labeling and annotation segment, while still operational, is expected to operate at reduced scale following the July 2025 layoffs of 200 employees and 500 contractors.

Deployment and integration capabilities: Scale provides enterprise API access, webhook integration for MLOps pipelines, and the Scale GenAI Platform for application development. The self-serve portal enables rapid onboarding for annotation tasks. For government, Donovan is deployed in classified cloud environments with specialized integrations for DoD and IC systems. No public changelog or release notes are available; Scale does not maintain a public product roadmap or developer-facing status page that can be independently verified.

Developer signal: The Scale Leaderboard is the most developer-facing public product, receiving significant attention from the AI research and engineering community. The WMDP benchmark has been cited in AI safety research. Scale's API documentation (at scale.com/docs) provides programmatic access specifications, but the extent of open-source tooling and developer community engagement (GitHub activity, Hacker News discussions, package downloads) is limited relative to AI infrastructure companies with stronger developer ecosystems (e.g., Hugging Face, LangChain).

Key open questions and product diligence gaps: (1) The GenAI Platform's technical depth relative to hyperscaler competition (AWS Bedrock, Google Vertex AI, Azure AI) is not independently validated: Scale's platform advantage relies on annotation quality integration, but hyperscalers offer equivalent infrastructure at lower cost; (2) Donovan's specific technical capabilities for AI agents in classified environments are not publicly documented in sufficient detail to assess differentiation from other defense AI companies (Palantir, Booz Allen); (3) The annotation tooling's technical superiority over Labelbox, Snorkel AI, and open-source alternatives (CVAT, LabelImg) is asserted but not independently benchmarked; (4) Scale's development roadmap for synthetic data capabilities is unknown, which is a critical gap because synthetic data threatens to displace human annotation. [CE021, CE022, CE023, CE024, CE025]

Roadmap / Release / Development-Stage Table
Date / Stage | Feature / Milestone | Status | Implication | Source
2016–2022 | Scale Data Engine v1 → mature annotation platform; API launch | Complete (historical) | Foundation product; established annotation market position and contributor network | Scale public blog; company history
2021–2022 | Scale RLHF product launch; LLM training data for OpenAI + AI labs | Complete (historical) | Positioned Scale as RLHF leader for frontier AI development | Scale RLHF page; TechCrunch reporting
2022 | Scale Donovan (defense AI agents) launch; DoD IL4 authorization | Complete; active | Established cleared defense AI data position; created government revenue moat | Scale Donovan page; DoD contract blog
2023–2024 | Scale GenAI Platform launch; enterprise LLM customization | Complete; active; strategic priority | Diversified beyond annotation into higher-margin platform business | Scale GenAI Platform docs; customer case studies
2024 | Scale Leaderboard launch; WMDP benchmark release; White House AI commitments | Complete; active developer tool | Established Scale as trusted AI evaluation authority; government recognition | Scale blog (leaderboard, WMDP); White House record
May 2024 | Series F $1B at $13.8B valuation; new strategic investors (Amazon, Meta, Cisco) | Complete | Capital to fund platform and government pivot; strategic co-sell potential | TechCrunch Series F reporting
June 2025 | Meta strategic investment; Wang departure; Droege as Interim CEO; strategic pivot announced | Complete; ongoing transition | Business model pivot; CEO transition risk; customer attrition (Google, OpenAI) | TechCrunch; CNBC; Scale blog
July 2025 | 14% headcount reduction (200 employees) plus 500 contractors released; data-labeling restructuring | Complete | Cost structure shift; confirms annotation pivot; talent retention risk | TechCrunch July 2025 reporting
2025–2026 (planned) | Enterprise GenAI Platform expansion; Donovan government rollout; Scale Evaluation growth | In progress (inferred from Droege statements) | Revenue mix shift to higher-margin enterprise + government; execution risk of pivot | Scale blog; Droege public comments
Unknown | Synthetic data capabilities; AI-assisted annotation automation | Not publicly confirmed | Critical gap: if not developing, Scale vulnerable to synthetic data displacement | Evidence gap: no public roadmap

Historical milestones are based on press records and public announcements. Forward-looking roadmap items are inferred from public statements; Scale does not publish a formal product roadmap.

[CE021, CE022]

5.6 Exhibits

Chapter 06

06 Customers

6.1 Customer Base Segmentation

Scale AI's customer base is organized into three primary verticals: AI Labs and model developers, Fortune 500 enterprise customers, and U.S. government and defense agencies. Each segment has distinct buyer profiles, procurement dynamics, use cases, revenue structures, and switching cost profiles.

AI Labs (formerly the largest segment) purchase Scale's RLHF and evaluation services to train and benchmark large language models. These customers, historically including OpenAI, Google/DeepMind, Cohere, and Anthropic, are sophisticated technical buyers with direct procurement authority and quarterly contract cycles tied to training compute schedules. This segment is now in structural decline: Google exited in June 2025 citing competitive conflict arising from the Meta deal, and OpenAI wound down its Scale relationship the same month, signaling a broader shift toward in-house annotation capacity among frontier model developers.

Enterprise customers, including TIME, Etsy, Instacart, and Pinterest, deploy Scale's GenAI Platform and Data Engine for domain-specific AI applications such as content safety testing, recommendation systems, and e-commerce AI. These buyers typically involve IT and data leadership with procurement cycles of 6–18 months and multi-year deployment commitments. The TIME case study is the strongest public proof point: GenAI Platform deployment occurred in under two months, with 7,000+ adversarial attack vectors tested against TIME's AI content outputs in a production safety application.

Government and defense customers, primarily U.S. Department of Defense and intelligence community agencies, engage through multi-year contract vehicles for data curation, autonomy AI programs, and the Donovan platform for classified operations. These customers have the highest switching costs due to DoD IL4 and FedRAMP High certification requirements, classified environment integration, and procurement inertia.

Geographically, Scale's disclosed customer base is predominantly U.S.-headquartered. No international customer count or revenue split is publicly disclosed. [CU001, CU004, CU005, CU006, CU007, CU008]

Customer Segmentation Table
Segment | Buyer / Payer Profile | Primary Use Case | Scale / Scope | Revenue / Strategic Value | Key Gaps
AI Labs | CTO / research lead; quarterly procurement tied to training schedules | RLHF, model evaluation, safety benchmarking | 2-3 flagship customers historically (OpenAI, Google); now smaller labs (Cohere) | Historically largest segment; in structural decline post-Google/OpenAI exits | Customer count, revenue per lab, NRR not disclosed; departures unquantified
Enterprise (Fortune 500) | IT / data leadership; 6-18 month sales cycles | GenAI Platform deployment, domain-specific AI, data curation | Multiple named logos (TIME, Etsy, Instacart, Pinterest); total count undisclosed | Growing per management narrative; no ARR, NRR, or expansion rate data | Expansion rates, upsell conversion, total enterprise customer count not disclosed
Government / Defense | Program manager / contracting officer; multi-year RFPs | Autonomy AI, joint force data curation, Donovan platform (national security) | Active DoD contracts; IC deployments; IL4 / FedRAMP High certified | High strategic value; presumed floor revenue; most durable segment | Contract values, agency names (classified), renewal timing, segment ARR not disclosed
Self-Serve / SMB / Research | Developer / researcher; pay-as-you-go | Annotation, API access for ML experimentation, model evaluation | 1,000 free units entry tier; no active user count disclosed | Low individual revenue; high volume potential; conversion rate unknown | Active user count, conversion to enterprise, and usage trends not disclosed

Segment revenue split not disclosed by Scale AI. Tiers inferred from product pages, pricing structure, official blog posts, and case studies. Government segment classification is based on DoD certifications and official contract disclosures.

[CU001, CU004, CU005, CU006, CU007, CU008]
FU001: Customer Journey Map

6.2 Adoption Trajectory and Growth Signals

Scale AI does not publicly disclose total active customer counts, revenue, ARR, or deployment metrics. Adoption signals must be inferred from disclosed milestones, headcount dynamics, contributor payments, and third-party proxy data. The strongest scale signals are: (1) over $1 billion paid to annotation contributors globally, indicating substantial annotation throughput volume; (2) 15 billion+ human-labeled decisions processed, demonstrating platform depth; and (3) the May 2024 Series F at $13.8 billion valuation with participation from Amazon, Cisco, Intel, AMD, and ServiceNow (all enterprise buyers), suggesting customer validation at significant scale. The inclusion of strategic investors who are themselves enterprise customers provides independent corroboration of platform maturity.

The July 2025 layoffs of approximately 200 employees (14% of staff) and 500 contractors, concentrated in the data-labeling business, represent a lagging adoption signal: they indicate that RLHF and annotation volume has declined materially from peak, coinciding with the Google and OpenAI departures and the strategic pivot toward the GenAI Platform and Donovan. Competitors Snorkel AI, SuperAnnotate, and Mercor have each expanded their enterprise annotation offerings, evidenced by new product pages and partner/leaderboard profiles, suggesting continued market activity even as Scale repositions.

Self-serve growth is unquantified. Scale's API documentation and pricing pages are publicly accessible, suggesting ongoing developer adoption, but no active self-serve customer count or conversion rate has been disclosed. The enterprise and AI lab logos visible on Scale's customers page (Etsy, Instacart, Pinterest, Cohere) appear without case studies or outcome metrics, limiting the quality of adoption proof for these accounts. [CU011, CU012, CU013, CU017, CU021, CU023]

Customer Growth / Adoption Trajectory Table
Metric | Value | Date | Source | Confidence | Implication | Missing Denominator
Contributor payments | $1B+ paid to contributors globally | 2025 | scale.com/customers (official) | medium | Proxy for annotation throughput volume; not equivalent to customer revenue | No YoY growth rate or payment trajectory disclosed
Human-labeled decisions | 15B+ processed | 2025 | scale.com/customers (official) | medium | Platform throughput depth; does not map to customer count or revenue | No active customer denominator or per-customer breakdown disclosed
Series F strategic investors | $1B at $13.8B; Amazon, Cisco, Intel, AMD, ServiceNow invested | May 2024 | TechCrunch / Scale official | high | Enterprise buyers as investors provide an independent customer validation signal | Revenue, ARR, or customer count not disclosed at close
Headcount post-layoff | ~1,000 employees (14% reduction, ~200 let go) | Jul 2025 | TechCrunch | high | Layoffs concentrated in data-labeling; proxy for volume decline in annotation segment | No operational capacity metric or segment headcount breakdown disclosed
AI lab customer attrition | Google (largest customer) and OpenAI both exited Q2 2025 | Jun 2025 | CNBC (two separate reports) | high | Material revenue concentration event; 20-40% of AI lab segment estimated affected | Revenue impact not disclosed; no replacement pipeline confirmed publicly

No revenue, ARR, or customer count data publicly disclosed. Adoption metrics are proxies and inferences only. High-confidence items are from confirmed multi-source news reporting or from official Scale disclosures corroborated by reputable outlets.

[CU009, CU010, CU011, CU012, CU013, CU035]
FU002: Adoption / Deployment Funnel

6.3 Named Customer Proof and Evidence Quality

Scale AI's most important disclosed customer proof points span production deployments across media, defense, e-commerce, and AI research. The TIME Media case study is the highest-quality public proof point: Scale's GenAI Platform was deployed to test over 7,000 adversarial attack vectors against TIME's AI content before publication. This is a production safety application with clear, measurable outcome metrics and a sub-two-month deployment window, documented on Scale's official customers page.

Meta is simultaneously Scale's largest strategic investor (~49% stake acquired June 2025) and an active, expanding RLHF customer. This dual role creates both a revenue anchor and a governance complication: Meta's privileged information access as an investor alongside its role as a customer raises data access and conflict-of-interest concerns for other customers evaluating Scale's data security practices.

U.S. Department of Defense and intelligence community customers are evidenced by official Scale blog posts describing a DoD data curation contract for joint force operations and the Donovan platform's deployment in national security contexts. These are not named agency customers (classification prevents naming), but they represent the highest-confidence government deployment proof available in public sources, corroborated by Scale's DoD IL4 and FedRAMP High certifications.

Enterprise customers (Etsy, Instacart, Pinterest) appear on Scale's customers page without case studies or outcome metrics. The Mercor lawsuit (September 2025) alleging customer poaching provides indirect adverse proof that Scale has enterprise accounts valuable enough to be actively solicited by a competitor, while also demonstrating competitive account retention vulnerability. Competitor profiles at Snorkel AI, SuperAnnotate, and Mercor indicate continued enterprise demand for annotation and AI data platforms, validating the market even as competition intensifies. [CU002, CU003, CU004, CU005, CU006, CU007]

Named Customer Proof Table
Customer | Segment | Deployment / Use Case | Production vs Pilot | Outcome / Proof Quality | Evidence Limitation
TIME (Media) | Enterprise | GenAI Platform for AI content safety testing; adversarial attack vector evaluation | Production (confirmed) | 7,000+ attack vectors tested; deployed in <2 months; documented case study on scale.com/customers/time | Scale-produced case study; no independent third-party validation or TIME-provided ROI data
U.S. DoD / IC | Government / Defense | Data curation for joint force ops; Donovan platform for national security AI workflows | Production (confirmed) | Active multi-year DoD contracts; IL4 and FedRAMP High certified deployments; official Scale blog posts | Contract values and agency names classified; no mission outcome metrics publicly available
Meta | AI Lab + Strategic Investor | RLHF data for LLM training; expanding customer relationship post-investment | Production (expanding) | $14.3B strategic investment for a ~49% stake (implying the $29B+ valuation); company states relationship is expanding; RLHF platform corroborated | Dual investor-customer role creates evidence quality conflict; outcome metrics not independently verifiable
Etsy | Enterprise | AI training data for e-commerce recommendation and search AI | Presumed production | Logo on scale.com/customers; no case study, outcome data, or deployment scope available | Logo-only proof; deployment scope, contract terms, and outcomes unknown
Instacart | Enterprise | AI training data for delivery and logistics AI applications | Presumed production | Logo on scale.com/customers; no case study, outcome data, or deployment scope available | Logo-only proof; deployment scope, contract terms, and outcomes unknown
Pinterest | Enterprise | AI training data for visual search and recommendation AI | Presumed production | Logo on scale.com/customers; no case study, outcome data, or deployment scope available | Logo-only proof; deployment scope, contract terms, and outcomes unknown
Cohere | AI Lab | RLHF data for enterprise LLM training | Presumed production | Listed on scale.com/customers; Cohere is a commercial AI lab; RLHF platform corroborated | No case study; limited public information on scope or contract size

Enumeration is partial — only publicly named and confirmed customers included. Google and OpenAI are former customers (exited June 2025, not included). Proof quality ranges widely: TIME is the strongest (production + measured outcomes); government has high-confidence deployment but classified details; enterprise logos (Etsy, Instacart, Pinterest) provide minimal proof of production depth or outcomes.
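
The Meta figures cited in this report (a roughly 49% stake, a $14.3B figure in the Meta row above, and a $29B+ valuation on the cover) can be reconciled with simple arithmetic. The sketch below assumes that $14.3B is the amount invested rather than the valuation; the function name and variables are illustrative and not drawn from any Scale or Meta disclosure.

```python
def implied_valuation(investment_usd_b: float, stake_fraction: float) -> float:
    """Valuation implied by paying `investment_usd_b` for `stake_fraction` of a company."""
    if not 0 < stake_fraction <= 1:
        raise ValueError("stake_fraction must be in (0, 1]")
    return investment_usd_b / stake_fraction


# Figures as cited in this report. Treating $14.3B as the amount invested
# (not the valuation) is an assumption made for this check.
meta_investment_usd_b = 14.3
meta_stake = 0.49

implied = implied_valuation(meta_investment_usd_b, meta_stake)
print(f"Implied valuation: ${implied:.1f}B")
```

Under that assumption the implied valuation is about $29.2B, which is consistent with the "$29B+" figure used elsewhere in the report.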

[CU002, CU003, CU004, CU005, CU006, CU007]
FU003: Customer Proof Matrix

6.4 Retention, Durability, and Satisfaction Signals

Scale AI has not disclosed any net revenue retention (NRR), gross revenue retention (GRR), churn rate, renewal rate, total customer count, or satisfaction (NPS/CSAT) metrics. This represents a material information gap for investment analysis. All retention assessments below are structural inferences from contract type, switching costs, competitive dynamics, and adverse events, not from disclosed data.

Government and defense customers represent the most durable cohort. Multi-year contract vehicles, DoD IL4 and FedRAMP High certification barriers, classified-environment integration, and high procurement switching costs create structural lock-in. The 2024 DoD joint force data curation contract and Donovan platform deployments in classified contexts suggest high annual retention rates for this segment, estimated at 90–97% based on comparable government IT contract renewal benchmarks. This segment's durability is largely independent of the AI lab attrition occurring in parallel.

Enterprise customers (TIME, Etsy, Instacart, Pinterest) are likely on 12–36 month contracts with renewal optionality. Without NRR data, it is impossible to confirm whether land-and-expand dynamics are functioning. The GenAI Platform as an upsell from the Data Engine represents a theoretical expansion path, and the TIME deployment demonstrates conversion from pilot to production safety use case, but no cross-sell success rates or expansion revenue metrics are publicly available.

The AI lab segment is in active contraction. Google and OpenAI both exited in June 2025, representing an estimated 20–40% of historical AI lab revenue (not disclosed, estimated from reported customer prominence). The remaining AI lab customers (Cohere, others) face similar structural pressures toward in-house annotation. No G2, Gartner Peer Insights, or Capterra reviews were identified for Scale AI's enterprise platform as of this analysis, leaving customer satisfaction entirely unquantified.
[CU003, CU008, CU009, CU010, CU015, CU016]

Retention / Repeat Usage / Satisfaction Table
Metric | Value / Status | Segment | Confidence | Diligence Ask
Net Revenue Retention (NRR) | Not disclosed | All segments | n/a | Request trailing 4-quarter NRR by segment in due diligence data room
Gross Revenue Retention (GRR) | Not disclosed | All segments | n/a | Request trailing 4-quarter GRR by segment in data room
Annual Churn Rate | Not disclosed; Google and OpenAI 2025 departures are confirmed material adverse events | AI Labs | low (adverse signal only) | Quantify revenue impact of Google/OpenAI departures; confirm no other AI lab customers in exit pipeline
Contract Renewal Rate (Gov't) | Estimated 90%+ based on multi-year contract structure and IL4/FedRAMP switching cost barriers | Government / Defense | low (structural estimate) | Obtain actual renewal rate from contracting records or management confirmation in data room
Customer NPS / CSAT | Not disclosed; no G2, Gartner Peer Insights, or Capterra reviews identified for Scale AI enterprise platform | Enterprise | n/a | Request NPS data and any CSAT scores for enterprise segment; search G2/Gartner for updated review coverage
Contract Length | Government: multi-year RFP vehicles; Enterprise: estimated 12-36 months; Self-serve: monthly / pay-as-you-go | All segments | medium | Confirm minimum contract terms and renewal optionality for top 10 customers by ARR

All retention metrics except contract length estimates are undisclosed or unavailable. Structural retention inferences for government segment are low-confidence estimates. Google and OpenAI departures confirm AI lab churn is material and recent. No review platform data found.

[CU009, CU010, CU015, CU019, CU020, CU027]
FU004: Retention / Repeat Cohort

No public NRR, GRR, churn, or cohort data disclosed by Scale AI. Values are structural estimates based on contract type, switching cost analysis, and AI-annotation industry benchmarks for comparable data-services businesses. Estimates are for illustrative diligence purposes only and must be confirmed with disclosed data in the data room.
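
Because NRR and GRR are the first data-room requests in this section, a minimal sketch of how they could be computed from segment-level cohort data may help frame the ask. It uses the standard definitions (NRR includes expansion; GRR excludes it). The `CohortArr` structure and its field names are hypothetical, and the sample values are placeholders, not Scale AI data.

```python
from dataclasses import dataclass


@dataclass
class CohortArr:
    """ARR movements over one period for a fixed starting cohort (hypothetical schema)."""
    starting_arr: float  # ARR from the cohort at the start of the period
    expansion: float     # upsell and cross-sell ARR added by the same customers
    contraction: float   # ARR lost to downgrades by retained customers
    churned: float       # ARR lost to customers who left entirely


def net_revenue_retention(c: CohortArr) -> float:
    """NRR = (start + expansion - contraction - churn) / start."""
    if c.starting_arr <= 0:
        raise ValueError("starting_arr must be positive")
    return (c.starting_arr + c.expansion - c.contraction - c.churned) / c.starting_arr


def gross_revenue_retention(c: CohortArr) -> float:
    """GRR = (start - contraction - churn) / start; expansion is excluded."""
    if c.starting_arr <= 0:
        raise ValueError("starting_arr must be positive")
    return (c.starting_arr - c.contraction - c.churned) / c.starting_arr


# Placeholder values for illustration only; these are NOT Scale AI figures.
example = CohortArr(starting_arr=100.0, expansion=15.0, contraction=5.0, churned=30.0)
print(f"NRR: {net_revenue_retention(example):.0%}")
print(f"GRR: {gross_revenue_retention(example):.0%}")
```

Requesting the four underlying components by segment, rather than only the headline ratios, would let an analyst separate AI lab churn from enterprise expansion instead of seeing one blended figure.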

6.5 Expansion and Customer Concentration Risk

Scale AI's customer concentration risk is material and has adversely crystallized in 2025. The departure of Google (previously Scale's largest customer) and OpenAI within a single quarter is the single most important adverse customer event in Scale's history. CNBC reported Google's exit was driven by concerns about competitive conflict following the Meta investment. This departure pattern, in which two of Scale's most prominent AI lab customers exited in the wake of the Meta investment, creates structural risk for any remaining AI lab customers with Meta-adjacent competitive exposure.

The Meta relationship creates both the largest expansion opportunity (Meta's AI initiatives are major consumers of training data, and the relationship is described as expanding) and a significant dependency risk: a substantial share of Scale's revenue post-Google/OpenAI may now be concentrated in a single customer that is also a ~49% investor. This customer-investor concentration is structurally unusual and may create ongoing commercial friction for other customers.

Land-and-expand dynamics are theoretically present: the pathway from data annotation to RLHF to evaluation to GenAI Platform to Donovan represents a documented upsell funnel. The TIME deployment demonstrates that enterprise customers can be converted from annotation pilots to production platform use. However, no quantified expansion rate, upsell conversion data, or average revenue per customer metrics are publicly available.

The Mercor lawsuit (September 2025) alleging customer poaching indicates an active competitive threat to existing account retention from at least one well-funded competitor. SuperAnnotate and Mercor are expanding enterprise annotation platform capabilities, evidenced by product pages and partner profiles, creating displacement risk for Scale's Data Engine revenue. [CU001, CU003, CU009, CU010, CU022, CU024]

Expansion and Concentration Risk Table
Driver / Risk Factor | Category | Current State | Impact Assessment | Diligence Path
Meta customer + investor relationship | Concentration + Conflict of interest | Meta holds ~49% stake and is expanding RLHF customer relationship | High positive (revenue anchor) + High negative (governance risk; other customers may reduce share) | Verify Meta revenue share as % of total ARR; assess data access governance; confirm enterprise customers remain comfortable with arrangement
Google and OpenAI simultaneous departure | Customer attrition | Both exited Q2 2025; Google was Scale's #1 customer | High negative: material revenue reduction; AI lab segment in structural decline | Quantify revenue impact; confirm replacement pipeline; review remaining AI lab contract NDA and exit-clause terms
DoD / IC multi-year contract vehicles | Expansion moat | Multiple active DoD contracts; Donovan in classified deployments; IL4/FedRAMP High certified | High positive: durable, sticky government revenue floor; high switching cost for competitors | Review contract vehicle types, option periods, and scheduled renewal dates; confirm new award pipeline
Mercor lawsuit (customer poaching) | Competitive attrition risk | Scale sued Mercor Sep 2025 alleging poaching of key customers and trade secret misappropriation | Medium: indicates competitive account retention risk; may expand to other competitors | Monitor litigation outcome; request post-Mercor customer retention data; assess scope of contacts made
Land-and-expand upsell funnel | Expansion opportunity | Data Engine → RLHF → Evaluation → GenAI Platform → Donovan upsell funnel is documented but unquantified | Positive if conversion rates are material; actual conversion rates unknown | Request upsell conversion rates by product transition and average ARR expansion per customer cohort

Customer concentration risk is the single most material finding in this chapter. Meta-Google conflict resulted in the departure of Scale's largest customer. Government moat provides a structural offset but does not fully compensate for AI lab attrition without confirmed pipeline replacement.

[CU003, CU009, CU010, CU022, CU026, CU028]

6.6 Exhibits

Chapter 07

07 Risks

7.1 Severity-Ranked Risk Overview

Scale AI enters the current investment period with a risk profile weighted toward strategic and execution risk rather than regulatory or operational risk. Customer concentration, already crystallized through the Google and OpenAI departures, represents the highest-severity, highest-probability risk as of the research date. These departures coincided with the Meta investment and the leadership transition, creating three concurrent high-severity risk events in a single quarter (Q2 2025) that are deeply interlinked.

The most critical observation is the interdependency of Scale's top risks: the Meta investment triggered the Google departure (conflict of interest), which triggered a reassessment of Scale's business model viability at its previous AI lab revenue concentration, which in turn triggered the leadership transition and layoffs. Each risk amplifies the others. Investors must evaluate whether the government/defense moat and Meta's expanding customer relationship provide a sufficient revenue floor to sustain the business through the AI lab attrition and the enterprise pivot.

Regulatory and legal risks are material but manageable. Scale's DoD IL4 and FedRAMP High certifications demonstrate a strong compliance posture for government work. The Mercor lawsuit is an ongoing legal risk but is unlikely to be existential. Export control risks for defense AI are real but addressed through existing certification and compliance programs.

The most important forward-looking risk is whether Scale's enterprise GenAI Platform can generate sufficient replacement revenue for the AI lab segment within the time window defined by its current cash runway. [CR001, CR002, CR003, CR004, CR005, CR006]

Mitigation and Kill Criteria Table
Risk | Monitorable Trigger | Threshold / Event | Action Implication
Customer concentration / AI lab attrition | Additional AI lab customer departures; Meta reducing data purchasing volume | Any AI lab customer exit beyond Cohere, OR Meta ARR declines >20% QoQ | Thesis break: revenue floor assumption violated; requires full re-underwriting of revenue model
CEO / leadership transition failure | Enterprise or government deal loss attributable to leadership uncertainty; key executive attrition | Loss of 2+ senior leaders OR failure to close >$10M enterprise deal within 12 months of Droege appointment | Yellow flag: accelerate CEO succession or board strengthening; reassess management premium in valuation
Government contract non-renewal | DoD or IC contract not renewed at option period; failed security re-certification | Any material DoD contract cancellation or security clearance revocation | Thesis break: government moat assumption violated; floor valuation collapses
Meta conflict crystallization | Enterprise or AI lab customer explicitly citing Meta conflict as exit reason; FTC or DOJ review of Meta stake | Any non-AI-lab enterprise customer citing Meta conflict, OR regulatory review initiated | Material risk escalation: reassess concentration overhang; review governance documentation
Execution of business model pivot | Enterprise GenAI Platform ARR as proxy for pivot success; Donovan new contract awards | GenAI Platform ARR failing to replace >50% of AI lab ARR within 18 months of layoffs (by Jan 2027) | Yellow flag: pivot velocity insufficient; reassess capital sufficiency and runway

Kill criteria are derived from the investment thesis requirements for government moat, enterprise platform growth, and management stability. Triggers are designed to be observable without requiring non-public information. Thresholds are analyst estimates.
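
The quantitative thresholds in the table above can be expressed as simple checks, which also confirms that 18 months after the July 2025 layoffs falls in January 2027. This is a minimal sketch: the function and parameter names are hypothetical, the ARR inputs would have to come from the data room, and measuring replacement against a single AI lab ARR baseline is an assumption about how the threshold is meant to be read.

```python
def add_months(year: int, month: int, months: int) -> tuple[int, int]:
    """Return the (year, month) that falls `months` after the given year and month."""
    index = year * 12 + (month - 1) + months
    return index // 12, index % 12 + 1


def meta_arr_breach(prev_quarter_arr: float, current_quarter_arr: float) -> bool:
    """True if Meta ARR declined by more than 20% quarter over quarter."""
    if prev_quarter_arr <= 0:
        raise ValueError("prev_quarter_arr must be positive")
    decline = (prev_quarter_arr - current_quarter_arr) / prev_quarter_arr
    return decline > 0.20


def pivot_shortfall(genai_platform_arr: float, ai_lab_arr_baseline: float) -> bool:
    """True if GenAI Platform ARR has not replaced more than 50% of the AI lab baseline."""
    if ai_lab_arr_baseline <= 0:
        raise ValueError("ai_lab_arr_baseline must be positive")
    return genai_platform_arr / ai_lab_arr_baseline <= 0.50


def leadership_flag(senior_departures: int, closed_10m_enterprise_deal: bool) -> bool:
    """True if 2+ senior leaders have left or no >$10M enterprise deal has closed."""
    return senior_departures >= 2 or not closed_10m_enterprise_deal


# Deadline for the pivot criterion: 18 months after the July 2025 layoffs.
deadline_year, deadline_month = add_months(2025, 7, 18)
print(f"Pivot deadline: {deadline_year}-{deadline_month:02d}")
```

Keeping each trigger as a separate boolean mirrors the table's structure, where some breaches are thesis breaks and others are yellow flags that call for different actions.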

[CR001, CR002, CR003, CR004, CR005, CR006]
FR001: Risk Heatmap

7.2 Regulatory and Legal Risk

Scale AI operates at the intersection of defense AI, enterprise data processing, and AI safety, three domains with active and evolving regulatory attention. The primary regulatory risks are U.S. government compliance requirements for defense AI (ITAR, export control, AI safety standards), data privacy and AI governance regulations applicable to enterprise data processing, and the competitive legal proceedings arising from the Mercor lawsuit.

For defense AI, Scale's DoD IL4 and FedRAMP High certifications demonstrate strong regulatory compliance posture. The company has also made voluntary White House AI safety commitments, demonstrating proactive engagement with AI governance. The principal regulatory risk in this domain is potential expansion of export control restrictions on AI models and data services, particularly if the AI services Scale provides to DoD contractors involve sensitive technology transfer that could attract ITAR scrutiny. Congressional AI oversight is active, as evidenced by Scale's testimony before Congress.

On data privacy, Scale's enterprise data annotation services involve processing customer proprietary data. The EU AI Act, GDPR, and emerging U.S. state AI laws create compliance obligations that are manageable but evolving. Scale's security and compliance pages indicate active SOC 2 Type II and ISO 27001 certifications, suggesting a mature enterprise compliance posture.

The Mercor lawsuit (Scale AI v. Mercor, September 2025) alleges customer poaching and trade secret misappropriation. While the lawsuit's outcome is uncertain, it creates legal uncertainty, management distraction, and potential counterclaim risk. The litigation does not currently appear to threaten Scale's core operations or certifications. [CR005, CR006, CR007, CR009, CR010, CR011]

Regulatory / Legal Risk Register
Risk / Rule / Case | Jurisdiction | Status | Likelihood | Severity | Mitigation | Residual Exposure | Diligence Path
U.S. AI export control / ITAR for defense AI | U.S. Federal | Active: evolving BIS/ITAR rules on AI models and data services | Medium | High | DoD IL4 and FedRAMP High certifications; existing defense contractor relationships | Moderate: regulatory changes could restrict data export or AI service delivery to non-U.S. entities | Confirm ITAR and EAR compliance posture for all government contracts; review BIS AI guidance adherence
Scale AI v. Mercor lawsuit (customer poaching, trade secrets) | U.S. Federal (NDCA) | Active: filed September 2025; litigation ongoing | High (filed) | Medium | Active litigation; preliminary injunction potentially sought; trade secret documentation | Moderate: counterclaims and discovery could reveal internal customer data or contract terms; management distraction | Review complaint and docket; obtain legal opinion on likelihood of preliminary injunction; assess counterclaim exposure
EU AI Act compliance for enterprise data processing | EU | Effective: GDPR and EU AI Act obligations apply to EU enterprise customers | Low-Medium | Medium | ISO 27001 and SOC 2 certifications; DPAs with enterprise customers | Low-Moderate: primary risk arises if Scale processes high-risk AI training data for EU-regulated applications | Confirm EU AI Act classification of Scale's data annotation services; verify DPA compliance with EU customers
U.S. AI safety voluntary commitments and potential mandates | U.S. Federal | Voluntary: White House commitments signed 2024; potential mandatory regulation in progress | Low | Low-Medium | White House AI safety commitments signed; proactive AI safety research (WMDP benchmark) | Low: voluntary compliance reduces regulatory risk; mandatory rules are not yet in force | Monitor AI safety rulemaking; confirm voluntary commitment compliance documentation is current
Data privacy: state AI and privacy laws (CPRA, others) | U.S. State | Active: CPRA and similar laws apply to California-headquartered Scale AI | Low-Medium | Low-Medium | SOC 2 Type II, privacy policy, enterprise DPAs | Low: standard enterprise SaaS compliance posture; no known enforcement actions | Confirm CPRA compliance; verify enterprise customer DPA templates are current; check for state-level AI regulation applicability

Register is partial — based on public sources only. Regulatory correspondence, enforcement actions, and any government investigation not publicly disclosed are not enumerated. Likelihood and severity are analyst estimates. Legal counsel review is required for definitive risk assessment.

[CR005, CR006, CR007, CR009, CR010, CR011]
FR002: Risk Transmission Map

7.3 Operational and Execution Risk

Scale AI's most acute operational risk is the execution of a business model pivot under adverse conditions. The company is attempting to transition from annotation-volume revenue (declining due to AI lab departures and synthetic data trends) to enterprise GenAI Platform and government Donovan revenue (growing, but at an uncertain pace). This pivot is occurring simultaneously with a CEO transition, a significant headcount reduction, and the loss of two of its largest customers.
Data security and annotation quality are operational risks inherent to Scale's core business. Because the company processes proprietary customer data for AI training, any data security incident, quality failure, or customer data misuse event would severely damage enterprise trust and government contract eligibility. Scale's SOC 2 Type II, ISO 27001, DoD IL4, and FedRAMP High certifications reduce this risk but do not eliminate it. No public data security incidents were identified in this research.
Supply chain risk for Scale relates primarily to its annotation contributor workforce. Scale relies on a global network of human annotators; disruptions to this workforce (labor disputes, quality degradation, competitive poaching by Mercor or others) could impair annotation output quality and delivery timelines. The July 2025 layoffs that eliminated approximately 500 contractors may have reduced operational redundancy in this supply chain.
Additionally, Scale's dependency on cloud infrastructure providers (AWS and similar) creates a platform concentration risk that is standard for cloud-native companies but worth noting in the context of DoD requirements for infrastructure sovereignty. [CR001, CR003, CR008, CR015, CR016, CR017]

Operational / Quality / Security Risk Register
Failure Mode | Likelihood | Severity | Mitigation Maturity | Residual Exposure | Unresolved Gap
Data security breach exposing customer proprietary AI training data | Low-Medium | High | High (SOC 2 Type II, ISO 27001, DoD IL4, FedRAMP High) | Medium: certifications reduce but do not eliminate breach risk; AI training data is a high-value target | No public incident disclosure mechanism confirmed; breach notification process for government customers unclear
Annotation quality degradation post-layoff (500+ contractors released) | Medium | High | Medium (QA processes in place; Scale RLHF platform automates quality scoring) | Medium-High: significant workforce reduction may reduce annotation quality redundancy and institutional knowledge | No public quality SLA disclosure; no customer-facing quality metrics; contractor reduction scope unclear by segment
Business model pivot execution failure (annotation to platform) | High | High | Low (pivot is early-stage; enterprise GenAI Platform revenue not publicly confirmed) | High: if pivot fails, core annotation business in structural decline with no replacement revenue | Enterprise GenAI Platform ARR not disclosed; conversion rate from annotation to platform unknown; no public timeline
Cloud infrastructure outage affecting government or enterprise SLAs | Low | Medium | Medium (cloud redundancy; DoD requires specific infrastructure sovereignty) | Low-Medium: standard cloud dependency risk; mitigated by multi-cloud and government-grade infrastructure requirements | Infrastructure sovereignty arrangement for classified DoD workloads not publicly disclosed
Contributor workforce disruption (labor disputes, competing platforms) | Medium | Medium | Medium (global contributor base provides some geographic redundancy) | Medium: Mercor and other platforms competing for annotation workforce; quality of replacement contributors unknown | Contractor workforce composition and geographic concentration not disclosed; Mercor competition for contributors unquantified

Failure modes ordered by residual severity. Mitigation maturity is analyst estimate based on public certification data and inferred operational practices. No internal risk management documentation or incident history was publicly available for review.

[CR015, CR016, CR017, CR018, CR019, CR003]
FR003: Dependency Map

7.4 Partner and Dependency Risk

Scale AI's most significant dependency risk is its relationship with Meta, which is simultaneously its largest investor (~49% equity), its largest remaining AI lab customer, and its former CEO's new employer. This triple-role dependency creates a concentration and governance risk without precedent in the venture-backed AI sector. If Meta's strategic interest in Scale diminishes, or if Meta reduces its data purchasing or introduces competing products, Scale faces simultaneous customer revenue loss, valuation pressure, and governance disruption.
Cloud infrastructure dependency on AWS and other major cloud providers is a standard risk for AI infrastructure companies, but it is especially relevant for Scale's government business, where sovereignty over the hosting platform and over AI training compute is a compliance requirement. Scale's DoD certifications imply approved cloud infrastructure arrangements, but any change in cloud provider relationships could affect government contract eligibility.
The departures of OpenAI and Google eliminated two key partner relationships that provided both revenue and reputational validation. Remaining AI lab partners (Cohere, others) are smaller and individually represent less revenue concentration.
Channel and partner concentration in government procurement is positive rather than negative: government contracts proceed through established contract vehicles (GSA, DIU, DARPA channels) with well-defined procurement rules. However, government budget cycles and continuing resolution dynamics can create revenue timing volatility.
The Snorkel AI partner ecosystem (evidenced by its partners page) and SuperAnnotate's enterprise partnerships indicate that Scale's competitors are also building partner-dependent go-to-market strategies, creating a race for ecosystem lock-in. [CR002, CR003, CR004, CR006, CR009, CR020]

Partner / Dependency Risk Register
Dependency | Counterparty | Role | Concentration Level | Failure Scenario | Severity | Mitigation | Residual Exposure
Meta strategic relationship | Meta (investor + customer) | 49% equity investor AND expanding RLHF customer AND former CEO's employer | Extreme | Meta reduces data purchasing, seeks to acquire full control, or introduces competing annotation platform | Critical | Revenue diversification into government/enterprise; maintain operational independence per stated policy | High: single relationship combines customer, investor, and governance risk; no structural firewall confirmed
U.S. DoD / IC contract vehicles | U.S. Government (DoD, IC) | Multi-year government customer; defines government revenue floor | High | Contract non-renewal, budget cuts, or failed security re-certification | High (positive dependency; loss would be severe) | IL4/FedRAMP High certifications; multi-year contract structure with option periods; dedicated government team | Medium: government contracts are durable but subject to budget cycles and re-certification requirements
Cloud infrastructure (AWS, Microsoft Azure) | AWS / Microsoft | Platform hosting for Scale's annotation pipeline, API services, and government workloads | High | Cloud provider outage, pricing change, or contract termination | Medium | Multi-cloud architecture (assumed); DoD requires FedRAMP-authorized infrastructure | Low-Medium: standard cloud dependency; mitigated by government certification requirements
Global annotation contributor network | Independent contractors (500K+ estimated) | Annotation workforce supply for data labeling pipeline | High | Workforce fragmentation, quality degradation, or mass departure to competing platforms (Mercor) | Medium-High | Global geographic distribution; proprietary quality scoring platform (Scale RLHF) | Medium: Mercor lawsuit indicates active poaching of contributors; post-layoff morale risk
OpenAI / Google (former) | OpenAI, Google (departed) | Were key RLHF and evaluation customers validating platform quality | High (historical; now zero) | Already materialized: both exited Q2 2025 | High (already crystallized) | None effective; exits complete | Residual: reputational signal that other AI labs may exit; creates AI lab segment concentration risk for Cohere et al.

Dependencies ordered by residual severity. Meta dependency is the most structurally unusual and hardest to mitigate through standard risk management. Former customer row included to show concentration crystallization. Concentration levels are analyst estimates.

[CR002, CR003, CR004, CR006, CR009, CR020]

7.5 People, Execution, and Financial Risk

Leadership transition risk is among the most material at Scale AI. Founder Alexandr Wang built Scale over nine years and was the face of its government, enterprise, and AI lab relationships. His departure to join Meta, concurrent with Meta becoming Scale's largest investor, creates a principal-agent conflict that is visible to customers, employees, and investors. Interim CEO Jason Droege has no prior experience running a $29B-valued AI infrastructure company; his Uber Eats background is operationally relevant to marketplace scaling but not directly to government AI or enterprise data platform go-to-market.
Talent retention is a risk in the immediate post-layoff environment. The July 2025 reduction of 200 employees and 500 contractors may have damaged morale among the remaining ~1,000 employees, particularly if high performers perceive uncertainty about Scale's direction under interim leadership. Key government and enterprise relationship managers hold institutional knowledge of defense contracts and enterprise deployments that would be very hard to replace; their retention is critical to contract renewal and platform expansion.
Financial risk is partially mitigated by the Meta strategic investment, but the capital from that transaction was distributed to existing shareholders rather than retained on Scale's balance sheet for operations. This creates uncertainty about Scale's actual cash position and runway post-distribution. No public burn rate, cash position, or path-to-profitability data has been disclosed. If the AI lab revenue attrition is as large as the Google/OpenAI departures imply, Scale may face material revenue shortfalls in 2025–2026 that require additional external capital unless the enterprise and government growth rate exceeds expectations. [CR001, CR002, CR003, CR007, CR008, CR024]

People / Execution Risk Register
Role / Function | Dependency or Gap | Likelihood of Impact | Severity | Mitigation | Diligence Path
CEO, Interim (Jason Droege) | Unproven at Scale's stage and sector; Uber Eats background lacks government AI and enterprise data platform precedent | High | High | Droege has operational scaling experience; supported by existing leadership team and board | Assess Droege's progress on enterprise and government deals since appointment; confirm board is actively engaged in CEO search or succession planning
Founder (Alexandr Wang), departed | Wang held government, enterprise, and AI lab relationships for 9 years; departed to Meta; retains board seat | High (already materialized) | High | Wang retains board representation; existing team carries institutional knowledge | Confirm Wang's board role is active and not conflicted by Meta duties; assess whether key customer relationships transferred to current team
Government / defense relationship managers | Institutional knowledge of classified programs, contracting officers, and security clearances; key-person risk in DoD relationships | Medium | High | Long-term government contracts reduce relationship dependency; cleared personnel have high switching costs | Interview 2-3 senior government relationship managers; confirm security clearance retention post-layoffs; verify DoD program continuity
Enterprise sales leadership | Enterprise sales cycle is 6-18 months; leadership attrition post-layoff could disrupt pipeline | Medium | Medium-High | Scale's brand and existing customer logos support enterprise sales; self-serve API reduces dependency | Request enterprise sales pipeline data (qualified pipeline, pipeline coverage ratio, deal stage distribution)

People risks are ordered by severity. Key-person risk is extremely high given the simultaneous departure of the founder and the leadership transition in a single quarter. Government relationship continuity is the most operationally critical risk given the revenue floor it provides.

[CR001, CR002, CR007, CR008, CR024, CR025]

7.6 Exhibits

Chapter 08

08 Valuation

8.1 Investment Thesis and Anti-Thesis

The investment thesis for Scale AI rests on three structural pillars. First, government and defense: Scale's Donovan AI platform, DoD IL4/FedRAMP High certifications, and deep government relationships create a durable revenue floor with high switching costs, long contract cycles, and minimal competitive overlap with AI labs. This segment functions as a strategic asset that supports a minimum valuation of $3–5B independent of any commercial AI lab business. Second, data moat: over nine years Scale has built annotation quality infrastructure, contributor management, and model evaluation tooling that are not easily replicated at scale. This infrastructure positions the company to capture enterprise GenAI Platform revenue as Fortune 500 companies shift from AI experimentation to production deployment requiring human-in-the-loop data workflows. Third, Meta validation: the $14.3B investment by Meta, one of the most sophisticated AI investors globally, provides both financial and reputational validation, signals the strategic value of Scale's data infrastructure, and creates an expanding customer relationship.
The anti-thesis is equally robust. At $29B implied valuation, Scale trades at 58x–145x estimated ARR at a moment when its two largest revenue contributors (Google and OpenAI) have both departed, its founder CEO has left, and its business model is mid-pivot. The annotation commoditization risk (synthetic data, in-house RLHF by frontier labs) is a structural threat to the AI lab segment. The Meta investor-customer relationship creates a governance overhang that deters other customers and creates a principal-agent conflict without precedent in the AI sector. Interim CEO Droege has no demonstrated track record in Scale's core markets. The $29B valuation can only be defended by a bull-case set of assumptions about government contract renewal, enterprise platform conversion, and Meta ARR growth, all of which are unconfirmed by public evidence.
The base case implies material markdown risk relative to the entry price. The balance of evidence suggests that Scale is a high-quality asset in a structurally adverse situation. The diligence path must establish whether the government floor is secure, whether enterprise platform ARR is growing at a sufficient pace, and whether the management team can execute the pivot without raising additional capital on dilutive terms. Until these questions are answered, the evidence does not support investment at the $29B implied valuation. A conditional pass (invest at a materially lower entry price with earned milestones) or a research-more stance is the appropriate recommendation given current evidence quality. [CV001, CV002, CV003, CV004, CV005, CV006]

Thesis / Anti-Thesis Table
Argument | Evidence Basis | What Would Change This View
THESIS: Government moat creates durable revenue floor with high switching costs and multi-year contracts | DoD IL4, FedRAMP High, Donovan platform, DIU/DARPA contracts; no announced non-renewal | Government contract cancellation or failed security re-certification would eliminate this thesis pillar
THESIS: Data infrastructure and annotation platform represent 9 years of unreplicable quality engineering | RLHF platform, 500K+ contributor network, model evaluation tooling documented on scale.com; Stanford HAI endorsement | Evidence of systematic annotation quality degradation or competitor replication at comparable scale
THESIS: Meta validation ($14.3B, 49%) signals strategic value and secures near-term capital | Multi-source confirmed investment; largest single AI strategic investment on record | Meta reducing data purchasing or seeking to acquire remaining equity at depressed valuation
THESIS: Enterprise GenAI Platform creates replacement revenue for AI lab attrition | TIME deployment case study; enterprise customer logos; official GenAI Platform product page | Enterprise platform ARR failing to replace >50% of AI lab revenue within 24 months (by mid-2027)
ANTI-THESIS: $29B valuation set by strategic buyer is not financial-investor-rational | Entry multiple of 58x–145x ARR; concurrent attrition of top 2 customers; Google/OpenAI exit confirmed | Entry at materially lower price (<$15B) or confirmation of rapid post-attrition ARR recovery
ANTI-THESIS: Annotation commoditization is a structural threat to core RLHF revenue | McKinsey/Stanford HAI report declining per-token annotation cost; synthetic data generation reducing human annotation requirements | Evidence that Scale's platform-level evaluation services (not raw annotation) are insulated from commoditization
ANTI-THESIS: CEO transition and Meta conflict create management and governance risk without precedent | Wang departure confirmed; Droege interim appointment confirmed; no announced permanent CEO search | Appointment of a permanent CEO with government AI and enterprise platform track record; structural firewall between Meta and Scale customer data confirmed

Thesis and anti-thesis are evidence-grounded; speculative arguments excluded. The thesis is structurally coherent but requires private data confirmation for the critical revenue replacement assumption. The anti-thesis is observable from public sources without any private data requirement.

[CV001, CV002, CV003, CV004, CV005, CV006]
Thesis-Break and Kill Triggers Table
Trigger | Threshold / Event | Transmission to Thesis | Action Implication
Government contract non-renewal | Any DoD or IC contract cancelled or not renewed at scheduled option period | Eliminates the government floor thesis; base case valuation collapses to Appen-equivalent ($1.5–3B range) | Hard pass: government moat is the primary thesis pillar; its failure is a non-recoverable thesis break
Additional enterprise customer citing Meta conflict | Any non-AI-lab enterprise customer explicitly citing Meta relationship as reason for exit or non-purchase | Confirms that Meta conflict is a systematic go-to-market inhibitor, not just an AI lab-specific issue; expands the TAM risk beyond the AI lab segment | Hard pass if confirmed; increases required entry discount to reflect structural customer acquisition constraints
Meta ARR decline | Meta data purchasing ARR falls >20% QoQ for two consecutive quarters without offsetting enterprise growth | Eliminates the anchor customer assumption of the bull case; no remaining large anchor; concentration collapse | Hard pass: revenue floor disappears; triggers re-underwriting of all scenarios
GenAI Platform ARR below $30M as of Q4 2026 | Enterprise GenAI Platform ARR confirmed below $30M (as of Q4 2026 reporting or data room disclosure) | Confirms that the enterprise pivot is not achieving minimum viable conversion; ARR replacement pace is insufficient to offset AI lab attrition within the thesis window | Pass or price discipline: reduce entry price by 40% from base case ceiling
CEO transition failure signal | Droege fails to close any enterprise deal >$5M in first 12 months, OR 2+ senior leadership exits post-Droege appointment | Signals that the management team cannot execute the enterprise pivot; increases execution risk multiplier | Yellow flag: reassess management capability; seek board clarity on permanent CEO timeline before proceeding

Triggers are designed to be observable from public reporting or data room disclosures without requiring insider access. The government contract trigger is the most critical: it is binary, publicly observable (USASpending.gov), and directly eliminates the primary thesis pillar.
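
The quantitative triggers in the table can be checked mechanically once the relevant figures are available. The following is a minimal sketch, not a definitive monitoring implementation: the function names, the input shapes, and the reading of "two consecutive quarters" as two back-to-back QoQ declines above the threshold are assumptions made for illustration. The qualitative condition on the Meta trigger (no offsetting enterprise growth) and the two event-driven triggers are deliberately left out because they need analyst judgment.

```python
from typing import Sequence


def qoq_declines(quarterly_arr: Sequence[float]) -> list[float]:
    """Fractional quarter-over-quarter declines (positive means ARR fell)."""
    return [(prev - cur) / prev for prev, cur in zip(quarterly_arr, quarterly_arr[1:])]


def meta_arr_trigger(quarterly_arr: Sequence[float], threshold: float = 0.20) -> bool:
    """True if ARR fell by more than `threshold` QoQ in two consecutive quarters."""
    declines = qoq_declines(quarterly_arr)
    return any(a > threshold and b > threshold for a, b in zip(declines, declines[1:]))


def genai_platform_trigger(platform_arr_m: float, floor_m: float = 30.0) -> bool:
    """True if GenAI Platform ARR (USD millions) is below the $30M floor."""
    return platform_arr_m < floor_m


def ceo_transition_trigger(largest_deal_m: float, senior_exits: int) -> bool:
    """True if no enterprise deal above $5M closed, OR 2+ senior leaders exited."""
    return largest_deal_m <= 5.0 or senior_exits >= 2
```

With a hypothetical ARR series such as `[100, 78, 60]` (declines of 22% and about 23%), `meta_arr_trigger` fires; with `[100, 78, 70]` it does not, because the second decline is only about 10%.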

[CV003, CV006, CV007, CV008, CV009, CV029]
FV001: Recommendation Logic

8.2 Valuation Context and Current Financing

Scale AI's current implied valuation of approximately $29B derives from Meta's June 2025 purchase of approximately 49% of the company for $14.3B. This transaction was reported by multiple independent high-reputation sources (TechCrunch, CNBC, Reuters) and is considered confirmed for this analysis. The implied total company valuation ($29B) is based on the reported ~49% equity stake; the precise pre-money/post-money treatment and cap table structure are not publicly disclosed. Importantly, the Meta transaction proceeds were distributed to existing shareholders rather than retained on Scale's balance sheet for operations, leaving Scale's actual working capital and cash runway uncertain.
At the $29B implied valuation:
- Low ARR estimate ($200M): 145x revenue multiple, consistent with frontier AI lab multiples in early 2024 but extreme for a company experiencing customer attrition
- Mid ARR estimate ($350M): 83x revenue multiple, comparable to high-growth enterprise SaaS at peak market conditions
- High ARR estimate ($500M): 58x revenue multiple, justified only if government ARR is growing rapidly and enterprise platform conversion is accelerating
The most comparable funding event in Scale's history is the May 2024 Series F at $13.8B on approximately $200–300M estimated ARR (46–69x ARR). The Meta deal more than doubled that valuation in approximately 13 months despite the concurrent departure of two of Scale's largest customers. This dynamic suggests the Meta valuation reflects strategic acquisition logic (Meta's need for proprietary RLHF data infrastructure) rather than financial return optimization. Investors co-investing at $29B implied are therefore paying a price set by a strategic acquirer's logic, not an arm's-length financial investor's return framework. This is a critical valuation discipline issue: strategic buyers accept premiums that financial investors cannot underwrite. The diligence path must establish a financial investor's ceiling for the entry price.
Dilution and preference overhang are unknown. Scale has raised approximately $1.6B in equity prior to the Meta deal; the preference stack, liquidation rights, and participating vs. non-participating structures are not publicly disclosed. Any financial investor entering at $29B implied must assess the preference waterfall impact on common equity returns in the bear case. [CV001, CV002, CV003, CV010, CV011, CV012]
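
The multiples quoted in this section follow from simple division, and the arithmetic can be reproduced as a quick sanity check. This is a sketch under stated assumptions: the ARR figures are the report's analyst estimates rather than company disclosures, and dividing the investment by the stake treats the result as an implied whole-company value, which the report notes cannot be confirmed as pre-money or post-money. The function and variable names are illustrative.

```python
# All inputs are figures quoted in the report; ARR values are analyst estimates.
META_INVESTMENT_B = 14.3       # USD billions paid by Meta
META_STAKE = 0.49              # approximate equity stake acquired
ENTRY_VALUATION_B = 29.0       # rounded implied valuation used throughout the report
SERIES_F_VALUATION_B = 13.8    # May 2024 Series F valuation


def implied_valuation_b(investment_b: float, stake: float) -> float:
    """Whole-company value implied by paying `investment_b` for `stake` of the equity."""
    return investment_b / stake


def revenue_multiple(valuation_b: float, arr_m: float) -> float:
    """Valuation (USD billions) divided by ARR (USD millions), expressed as a multiple."""
    return valuation_b * 1000 / arr_m


meta_implied_b = implied_valuation_b(META_INVESTMENT_B, META_STAKE)

entry_multiples = {
    label: round(revenue_multiple(ENTRY_VALUATION_B, arr_m))
    for label, arr_m in {"low": 200, "mid": 350, "high": 500}.items()
}

# Series F multiple range: high ARR estimate gives the low multiple and vice versa.
series_f_multiples = (
    round(revenue_multiple(SERIES_F_VALUATION_B, 300)),
    round(revenue_multiple(SERIES_F_VALUATION_B, 200)),
)

# Step-up from the Series F valuation to the Meta-implied valuation.
step_up = ENTRY_VALUATION_B / SERIES_F_VALUATION_B

print(f"Implied valuation: ${meta_implied_b:.1f}B")
print(f"Entry multiples: {entry_multiples}")
print(f"Series F multiples: {series_f_multiples[0]}x to {series_f_multiples[1]}x")
print(f"Step-up vs Series F: {step_up:.2f}x")
```

The step-up works out to roughly 2.1x, which is why the Meta price is better read as a strategic premium than as a continuation of the Series F pricing logic.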

Final Diligence Asks Table
Topic | Missing Evidence | Why It Matters | Owner / Diligence Path
Post-attrition ARR by segment | Google and OpenAI were the largest customers; their ARR contribution and the net ARR after departure are undisclosed | Cannot underwrite any scenario without knowing the revenue hole and the current trajectory of replacement ARR | Data room: request ARR waterfall by segment Q1 2024 – Q4 2025; anonymized if required by NDA
Government contract schedule and renewal timeline | DoD contract option periods, TCV, and renewal rates are not publicly disclosed | Government moat is the primary thesis pillar; contract non-renewal is the single most likely thesis-break event | USASpending.gov for visible contracts; data room: request government contract ledger with option periods; direct conversation with Scale government team lead
Cash position and runway post-Meta deal | Meta proceeds distributed to shareholders; current operating cash, burn rate, and runway not disclosed | Without runway visibility, cannot assess whether pivot can be completed without dilutive capital raise | Data room: request balance sheet, P&L, and cash flow statement; confirm amount retained for operations vs. distributed
Enterprise GenAI Platform ARR and growth rate | No public disclosure of GenAI Platform ARR, customer count, or conversion rate from annotation to platform | Platform transition is the core bull case driver; without ARR data, bull case is speculation | Data room: request GenAI Platform ARR as of Q4 2025 and Q1 2026; pipeline data; representative enterprise contracts
Management team retention and permanent CEO plan | No public announcement of permanent CEO search or expected appointment timeline; key executive retention post-Droege unknown | Management continuity is critical to government relationship maintenance and enterprise pivot execution | Board conversation: confirm CEO search status; request 90-day review of key executive retention; assess key-person risk with HR team
Meta governance firewall and data access agreement | Investor rights agreement between Meta and Scale AI not publicly disclosed; no confirmed data access limitation for Meta as investor | Without a confirmed firewall, the Meta conflict-of-interest risk is unmitigated and enterprise customers' data sovereignty concerns cannot be resolved | Data room: request investor rights agreement and any data access limitation schedule; obtain legal opinion on firewall adequacy

All six diligence asks are blocking for an investment decision at $29B implied valuation. The three highest priority asks — ARR by segment, government contract schedule, and cash runway — should be completed before any preliminary term sheet discussion.

[CV002, CV012, CV013, CV014, CV029, CV030]
FV002: Valuation Sensitivity

8.3 Bull / Base / Bear Scenarios

The scenario analysis for Scale AI hinges on three key driver variables: (1) the government/defense ARR trajectory and renewal rate for existing DoD contracts, (2) the enterprise GenAI Platform conversion rate and ARR growth from the commercial pivot, and (3) the Meta customer relationship durability and any further AI lab attrition or accretion.
Bull case: Government contracts renew at >90% and expand with new Donovan program awards; enterprise GenAI Platform achieves $150M+ ARR by 2027 driven by Fortune 500 production deployments; Meta expands its data purchasing to $300M+ ARR; interim CEO Droege stabilizes operations and attracts a marquee permanent CEO within 12 months. Under these assumptions, total ARR reaches $600M–800M by 2027, and at a 25–30x forward multiple, valuation reaches $18–24B. Even this outcome sits below the $29B entry, and it is achievable only under multiple concurrent favorable outcomes.
Base case: Government contracts renew at 85% and grow modestly; enterprise GenAI Platform achieves $75–100M ARR by 2027 (slower ramp than bull); Meta ARR holds at current levels; AI lab ARR (Cohere, others) declines modestly. Total ARR reaches $350–450M by 2027 at a 20–25x forward multiple, implying a valuation of $8–11B, a significant markdown from the $29B entry. This is the most likely scenario given the evidence available.
Bear case: Government contracts face delays or a material non-renewal; enterprise GenAI Platform fails to achieve product-market fit by 2027 (net ARR < $50M from platform); Meta reduces data purchasing following establishment of in-house annotation capability; additional AI lab customers exit. Total ARR falls to $150–200M by 2027 at a 10–12x forward multiple, implying a $1.5–2.5B valuation. At this valuation, the $29B entry becomes economically devastating; even preference stacks may not protect against total return loss.
The probability signal for each case is asymmetric: the bull case requires multiple concurrent optimistic outcomes; the bear case requires only the government contract not renewing on schedule (a single event). This asymmetry, combined with the entry price being set by a strategic buyer, creates a structurally unfavorable risk/return ratio for a financial investor at $29B. The base case implies a 60–70% markdown, and the bear case implies near-total loss of financial value. Only investors who can price the strategic optionality of the government moat at $20B+ can justify the $29B entry. [CV005, CV006, CV007, CV008, CV009, CV015]
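
Each scenario valuation above is an ARR range multiplied by a forward multiple range, so the outer bounds and the resulting markdown against the $29B entry can be recomputed directly. The sketch below uses the ARR and multiple ranges stated in the prose; the `Scenario` class and its method names are illustrative assumptions. Because it pairs the lowest ARR with the lowest multiple and the highest with the highest, its bounds are somewhat wider than the rounded ranges quoted in the text.

```python
from dataclasses import dataclass

ENTRY_VALUATION_B = 29.0  # USD billions, implied by the Meta transaction


@dataclass(frozen=True)
class Scenario:
    name: str
    arr_low_m: float       # 2027 ARR, low end, USD millions
    arr_high_m: float      # 2027 ARR, high end, USD millions
    multiple_low: float    # forward ARR multiple, low end
    multiple_high: float   # forward ARR multiple, high end

    def valuation_range_b(self) -> tuple[float, float]:
        """Outer valuation bounds in USD billions."""
        return (
            self.arr_low_m * self.multiple_low / 1000,
            self.arr_high_m * self.multiple_high / 1000,
        )

    def markdown_range(self) -> tuple[float, float]:
        """Fractional markdown from the entry valuation (smallest, largest)."""
        low_b, high_b = self.valuation_range_b()
        return (1 - high_b / ENTRY_VALUATION_B, 1 - low_b / ENTRY_VALUATION_B)


SCENARIOS = [
    Scenario("bull", 600, 800, 25, 30),
    Scenario("base", 350, 450, 20, 25),
    Scenario("bear", 150, 200, 10, 12),
]

for s in SCENARIOS:
    low_b, high_b = s.valuation_range_b()
    md_min, md_max = s.markdown_range()
    print(f"{s.name}: ${low_b:.2f}B to ${high_b:.2f}B, markdown {md_min:.0%} to {md_max:.0%}")
```

Under these inputs every scenario, including the bull case, tops out below the $29B entry, which is the asymmetry the paragraph above describes.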

Bull / Base / Bear Scenario Table
Scenario | Key Assumptions | ARR Estimate (2027) | Valuation Logic | Implied Valuation | Probability Signal
Bull Case | Government contracts renew >90%; GenAI Platform reaches $150M+ ARR; Meta expands to $300M+ data purchase; permanent CEO hired | $700M–850M | 25–30x forward ARR (government AI platform premium) | $17.5B–25.5B | 20%: requires multiple concurrent optimistic outcomes; no single assumption is extreme but all must hold simultaneously
Base Case | Government contracts renew at 85%; GenAI Platform reaches $75–100M ARR; Meta ARR stable; AI lab segment declines modestly | $350M–450M | 20–22x forward ARR (mixed platform/services valuation) | $7.7B–10.4B | 55%: most outcomes are consistent with historical trajectories for annotation companies with government moats
Bear Case | Government contract non-renewal or delay; GenAI Platform <$50M ARR; Meta reduces purchasing; additional AI lab exits | $150M–200M | 10x forward ARR (annotation services with degraded government positioning) | $1.5B–2B | 25%: requires only one adverse event (government non-renewal) to materialize; high probability given forward contract uncertainty

Scenarios represent analyst estimates constructed from public evidence; not company forecasts. ARR estimates are gross ARR including government, enterprise, AI lab, and self-serve segments. Probability signals are subjective but grounded in the distribution of outcomes for comparable annotation companies (Appen precedent weighted toward bear/base; Palantir precedent weighted toward bull). Valuation at $29B entry implies a 60–70% markdown in the base case.
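
The probability signals and implied valuation ranges in the table can be combined into a single probability-weighted figure. This is a rough sketch rather than part of the report's own method: taking the midpoint of each implied valuation range is an assumption made here for illustration, and the probabilities are the table's subjective analyst signals.

```python
ENTRY_VALUATION_B = 29.0  # USD billions

# scenario -> (probability signal, implied valuation low, high), USD billions,
# as listed in the scenario table.
SCENARIO_TABLE = {
    "bull": (0.20, 17.5, 25.5),
    "base": (0.55, 7.7, 10.4),
    "bear": (0.25, 1.5, 2.0),
}


def midpoint(low: float, high: float) -> float:
    return (low + high) / 2


def expected_valuation_b(table: dict[str, tuple[float, float, float]]) -> float:
    """Probability-weighted valuation using range midpoints (an assumption)."""
    total_probability = sum(p for p, _, _ in table.values())
    if abs(total_probability - 1.0) > 1e-9:
        raise ValueError("scenario probabilities must sum to 1")
    return sum(p * midpoint(low, high) for p, low, high in table.values())


expected_b = expected_valuation_b(SCENARIO_TABLE)
implied_markdown = 1 - expected_b / ENTRY_VALUATION_B

print(f"Probability-weighted valuation: ${expected_b:.2f}B")
print(f"Implied markdown from entry: {implied_markdown:.1%}")
```

On these inputs the weighted value is about $9.7B, a markdown of roughly two-thirds from the $29B entry, which sits inside the 60–70% base-case markdown quoted in the note above.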

[CV015, CV016, CV017, CV018, CV022, CV024]
FV003: Valuation / Return Range

8.4 Comparable Set and Valuation Positioning

Scale AI operates across multiple addressable markets and can be compared using several distinct frameworks. The most important distinction is that Scale is priced as a frontier AI platform company despite generating most of its current revenue as an annotation and data labeling services company. This creates a valuation framework mismatch: applying annotation services multiples (Appen: 0.5–2x ARR) implies $100M–1B; applying defense AI platform multiples (Palantir: 25–30x ARR) implies $5B–15B; applying frontier AI platform multiples (OpenAI private rounds: 100x+ ARR) implies $20B+. The appropriate framework depends on which revenue segment becomes dominant in Scale's forward composition.
Appen (ASX: APX) is the most instructive negative comparable: a public annotation company that experienced a 60–80% revenue decline when AI labs reduced RLHF spend after developing in-house annotation capabilities. Appen's market cap fell from ~AUD $3.5B at peak to ~AUD $250M following customer attrition, demonstrating the severe valuation impact of annotation commoditization. Scale's government moat is the key structural difference from Appen: Scale's defense vertical creates a revenue floor that Appen lacked.
Palantir (NYSE: PLTR) provides the government AI platform comparable: a company with deep DoD/IC integrations, high switching costs, and multi-year government contracts that commands a 25–30x ARR multiple on a strong growth narrative. Palantir's market cap has exceeded $100B on estimated $2B+ ARR, implying comparable government-anchored AI platforms can sustain high multiples. However, Palantir has demonstrated sustained commercial traction alongside government revenue; Scale's commercial pivot is unproven.
Labelbox (private, ~$1B valuation) provides a relevant annotation infrastructure comparable without a government moat, suggesting that annotation infrastructure without government contracts commands approximately 8–12x ARR, a fraction of Scale's implied multiple.
The delta between Labelbox's multiple and Scale's implies the market is pricing Scale's government moat and Meta strategic option value at approximately $20B+ premium. This premium requires explicit validation through government contract schedule visibility and enterprise ARR trajectory data. [CV019, CV020, CV021, CV022, CV023, CV024]
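The framework mismatch described above is simple range arithmetic, and a minimal sketch can make it reproducible. The ARR range of $200–500M is an assumption taken from this report's own public-source estimate for current ARR; the class and function names are illustrative only and do not come from any cited source.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class Framework:
    """A valuation lens expressed as a range of ARR multiples."""
    name: str
    low_multiple: float
    high_multiple: Optional[float]  # None means open-ended, e.g. "100x+"


def implied_valuation_musd(
    arr_low_musd: float, arr_high_musd: float, fw: Framework
) -> Tuple[float, Optional[float]]:
    """Low end = low ARR x low multiple; high end = high ARR x high multiple."""
    low = arr_low_musd * fw.low_multiple
    high = None if fw.high_multiple is None else arr_high_musd * fw.high_multiple
    return low, high


# ASSUMPTION: $200-500M ARR, the report's estimate; actual ARR is undisclosed.
ARR_LOW_MUSD, ARR_HIGH_MUSD = 200.0, 500.0

FRAMEWORKS = [
    Framework("Annotation services (Appen)", 0.5, 2.0),
    Framework("Defense AI platform (Palantir)", 25.0, 30.0),
    Framework("Frontier AI platform", 100.0, None),
]

for fw in FRAMEWORKS:
    low, high = implied_valuation_musd(ARR_LOW_MUSD, ARR_HIGH_MUSD, fw)
    high_text = "open-ended" if high is None else f"${high / 1000:g}B"
    print(f"{fw.name}: ${low / 1000:g}B to {high_text}")
```

Under these inputs the sketch returns the same ranges quoted in the text ($100M–1B, $5B–15B, and $20B+), which shows how sensitive the implied value is to the choice of framework rather than to the ARR estimate alone.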

Comparable Valuation Table
| Comparable | Type | Key Metric / ARR | Multiple / Valuation | Government Moat | Relevance | Key Limitation |
| --- | --- | --- | --- | --- | --- | --- |
| Appen (ASX: APX) | Public: annotation services | AUD ~$400M peak ARR (2021) | 0.5–2x ARR; peak ~$3.5B market cap; declined 90%+ post-attrition | None | Direct negative comparable: annotation company that lost AI lab customers and collapsed in valuation | No government moat; Scale's floor valuation should be structurally higher than Appen's trough |
| Palantir (NYSE: PLTR) | Public: defense AI / enterprise data platform | ~$2B+ ARR (2025 est.) | 30–50x ARR; $100B+ market cap; high government ARR concentration | Extremely high (NSA, CIA, DoD multi-decade relationships) | Best government-anchored AI platform comparable; Scale's government depth is less proven | Palantir has proven 15+ years of government contract continuity; Scale's government track record is shorter |
| Scale AI Series F (May 2024) | Private: own historical round | ~$200–300M ARR (estimated at time) | 46–69x ARR; $13.8B valuation | High (established at time of round) | Closest historical comparable; shows valuation doubled despite concurrent attrition | Historical reference only; current trajectory is adverse relative to Series F |
| Scale AI Meta deal (Jun 2025) | Private: strategic investment | ~$200–500M ARR (current estimate) | 58–145x ARR; $29B implied | High (established) | Current implied valuation: the entry price under analysis | Strategic buyer premium embedded; not financial-investor-comparable; proceeds to shareholders not balance sheet |
| Labelbox (private) | Private: annotation platform, no government moat | ~$80–120M ARR (est.) | ~8–12x ARR; ~$1B valuation | None | Annotation infrastructure comparable without government moat; benchmarks Scale's platform multiple absent government premium | Private company; ARR is analyst estimate; no public financial disclosure |
| Crunchbase Scale AI funding history | Database: funding round aggregator | ~$1.6B total equity raised pre-Meta deal | N/A (not a per-comparable entry) | N/A | Contextual: confirms capital efficiency relative to valuation | Database entry; round details may differ from disclosed actuals |

Multiple comparables are used because Scale AI straddles annotation services (Appen comparable), government defense AI (Palantir comparable), and enterprise AI platform (Labelbox comparable). The appropriate valuation framework depends on which segment becomes the dominant revenue contributor in the forward period. At $29B, the valuation is pricing in both the Palantir-equivalent government moat and significant enterprise platform upside, and both require private confirmation.

[CV019, CV020, CV021, CV022, CV023, CV024]
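The multiples in the comparable table can be cross-checked by dividing each valuation by the ends of its ARR estimate range. The sketch below does that for the three rows where both figures are given in USD, then estimates the premium over an annotation-only valuation. Treating the premium as the $29B price minus Scale's estimated ARR times a Labelbox-like 8–12x multiple is an assumption about how that delta is derived, not a method stated in the sources.

```python
def implied_multiple_range(valuation_musd, arr_low_musd, arr_high_musd):
    """Return (low, high) valuation/ARR multiples for an ARR estimate range."""
    if arr_low_musd <= 0 or arr_high_musd < arr_low_musd:
        raise ValueError("ARR range must be positive and ordered low <= high")
    # The highest ARR estimate yields the lowest multiple, and vice versa.
    return valuation_musd / arr_high_musd, valuation_musd / arr_low_musd


# (valuation, ARR low, ARR high) in USD millions, copied from the table above.
ROWS = {
    "Scale AI Series F (May 2024)": (13_800, 200, 300),
    "Scale AI Meta deal (Jun 2025)": (29_000, 200, 500),
    "Labelbox (private)": (1_000, 80, 120),
}

for name, (valuation, arr_low, arr_high) in ROWS.items():
    m_low, m_high = implied_multiple_range(valuation, arr_low, arr_high)
    print(f"{name}: {m_low:.1f}x to {m_high:.1f}x ARR")

# ASSUMPTION: annotation-only value = Scale's estimated ARR x Labelbox-like 8-12x.
annotation_only_low_musd = 200 * 8     # 1,600
annotation_only_high_musd = 500 * 12   # 6,000
premium_low_musd = 29_000 - annotation_only_high_musd
premium_high_musd = 29_000 - annotation_only_low_musd
print(f"Premium over annotation-only value: "
      f"${premium_low_musd / 1000:.1f}B to ${premium_high_musd / 1000:.1f}B")
```

The Series F and Meta rows reproduce the table's 46–69x and 58–145x figures exactly. The premium estimate lands above $20B at both ends of the range, which is consistent with the "approximately $20B+" premium cited in the analysis, though it inherits all the uncertainty of the ARR estimates.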
FV004: Investment KPIs

8.5 Recommendation, Confidence, and Final Diligence Asks

The recommendation for Scale AI is Research-More with a Conditional Pass at a materially lower entry valuation. The evidence supports the quality of Scale's assets (government moat, annotation data infrastructure, brand, and Meta validation) but does not support the $29B valuation at current evidence quality and revenue trajectory. The primary driver of this recommendation is that the entry valuation was set by a strategic acquirer (Meta) operating under strategic logic incompatible with financial investor return requirements. Paying $29B without visibility into post-attrition ARR, the government contract renewal schedule, enterprise platform conversion metrics, and cash runway is epistemically unsound.

The confidence in this recommendation is medium-low because the most critical data (government contract ARR, enterprise platform ARR, and post-deal cash position) are private. If those data points were available and favorable, the recommendation could upgrade to Conditional Buy at a price below $15B, or even Structured Buy at $8–12B with aggressive milestone protections. The current evidence quality justifies caution: eight of the thirty research questions in this chapter have partial or unresolved status because the key inputs are private.

Thesis-break conditions that would move the recommendation to Pass (reject) are any government contract cancellation, any additional enterprise customer citing Meta conflict, or confirmation that GenAI Platform ARR is below $30M as of Q1 2026. Conditions that would upgrade the recommendation to Conditional Buy are government contract renewal confirmation, enterprise platform ARR exceeding $75M, and a permanent CEO appointment with government AI or enterprise platform experience. Investors should note that the timeline for resolving the critical diligence questions is short: the government contract option periods and enterprise platform ramp should produce observable revenue signals within 12–18 months of the research date.
[CV001, CV002, CV006, CV009, CV017, CV029]
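The thesis-break and upgrade conditions above amount to a small decision rule, sketched below as one possible encoding. The field names are hypothetical, the $30M and $75M thresholds come from the text, and the sketch deliberately leaves out the entry-price condition (below $15B), which would still apply to any upgrade. It treats any single thesis-break condition as sufficient to reject and requires all three upgrade conditions together, which is one reading of the "or" and "and" in the text.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class DiligenceSignals:
    """Hypothetical container for the observable signals named in the text."""
    gov_contract_cancelled: bool = False
    new_meta_conflict_customer: bool = False
    gov_renewal_confirmed: bool = False
    permanent_ceo_with_relevant_experience: bool = False
    genai_platform_arr_musd: Optional[float] = None  # None = still private


def recommendation(s: DiligenceSignals) -> str:
    arr = s.genai_platform_arr_musd
    # Any single thesis-break condition is enough to reject.
    if (
        s.gov_contract_cancelled
        or s.new_meta_conflict_customer
        or (arr is not None and arr < 30)
    ):
        return "Pass (reject)"
    # The upgrade requires all three conditions at once.
    if (
        s.gov_renewal_confirmed
        and s.permanent_ceo_with_relevant_experience
        and arr is not None
        and arr > 75
    ):
        return "Conditional Buy"
    # Default while the key inputs remain private or mixed.
    return "Research-More / Conditional Pass"


print(recommendation(DiligenceSignals()))
print(recommendation(DiligenceSignals(genai_platform_arr_musd=20)))
```

Checking thesis-break conditions first means a contract cancellation overrides otherwise favorable signals, which matches the asymmetric caution in the recommendation.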

Recommendation Summary Table
| Dimension | Assessment | Basis | Condition / Qualifier |
| --- | --- | --- | --- |
| Recommendation | Research-More / Conditional Pass | Valuation is strategic-buyer priced; financial investor return case requires materially lower entry | Upgrade to Conditional Buy if government contract schedule confirmed and GenAI Platform ARR exceeds $75M |
| Confidence | Low-Medium (35%) | Critical inputs (government ARR, enterprise platform ARR, cash runway) are private and unconfirmed | Confidence rises to Medium-High if Q2 2026 revenue data confirms base case trajectory |
| Risk Rating | High | Four concurrent high-severity risks (concentration, CEO, Meta conflict, pivot execution); no prior navigation of this risk cluster demonstrated | Risk declines to Medium if government contracts renew and management stabilizes |
| Valuation Stance | Materially Overvalued at $29B implied for financial investor | 58x–145x estimated ARR set by strategic acquirer logic; financial return case requires $8–15B entry | Fair value range: $8–12B base case; $15–22B bull case |
| Decision Implication | Do not invest at $29B without additional data room visibility | Entry at $29B offers negative expected value under base case; thesis requires multiple concurrent optimistic outcomes | Request private data room access; set price discipline at <$15B; negotiate milestone-based valuation adjustments |

Recommendation is evidence-sensitive and price-sensitive. The assessment reflects public-evidence limitations and the base-case valuation framework. A different recommendation (Conditional Buy or Track) could be reached under different entry price conditions or with private data room confirmation of key ARR metrics.

[CV001, CV002, CV006, CV009, CV029, CV030]
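To make the valuation stance concrete, the gap between the $29B entry price and the fair value ranges in the summary table can be expressed as the price reduction needed to reach each range. This is a rough sketch using only the figures in the table; the function name is illustrative, and the result is a comparison of price levels, not a return forecast.

```python
def required_markdown(entry_busd, fair_low_busd, fair_high_busd):
    """Fractional cut from the entry price needed to reach each end of a fair value range.

    Returns (smallest cut, largest cut): reaching the top of the range needs
    the smallest reduction, reaching the bottom needs the largest.
    """
    if entry_busd <= 0:
        raise ValueError("entry price must be positive")
    return (
        (entry_busd - fair_high_busd) / entry_busd,
        (entry_busd - fair_low_busd) / entry_busd,
    )


ENTRY_BUSD = 29.0  # Meta-implied valuation
CASES = {
    "Base case ($8-12B)": (8.0, 12.0),
    "Bull case ($15-22B)": (15.0, 22.0),
}

for label, (low, high) in CASES.items():
    min_cut, max_cut = required_markdown(ENTRY_BUSD, low, high)
    print(f"{label}: price would need to fall {min_cut:.0%} to {max_cut:.0%}")
```

Even the bull case sits below the entry price at both ends of its range, which is the arithmetic behind the "Materially Overvalued" stance for a financial investor.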

8.6 Exhibits

Disclaimer

This report is for informational and research purposes only. Sources are publicly available; no proprietary or confidential information was used. The authors make no representations as to completeness or accuracy. This is not investment advice.

Evidence index

Claims
ID Statement Confidence Sources
CO001 Scale AI was founded in 2016 by Alexandr Wang, who was 19 years old and had left MIT, in San Francisco, California. High SO001, SO015
CO002 Scale AI's mission is to develop reliable AI systems for the world's most important decisions. Medium SO001
CO003 Scale AI provides data annotation, RLHF, model evaluation, and enterprise GenAI platform services as its core product offerings. High SO001, SO008, SO009, SO012
CO004 Scale AI had approximately 1,000 employees according to its About page, with headcount reduced following the July 2025 layoffs of 200 employees and 500 contractors. Medium SO001, SO017
CO005 Scale AI closed a $1 billion Series F round in May 2024, led by Accel, at a post-money valuation of $13.8 billion. High SO015, SO016
CO006 Scale AI has raised approximately $1.6 billion in total disclosed venture funding, including $325 million in its 2021 Series E at a $7.3 billion valuation and $1 billion in its 2024 Series F. Medium SO015, SO019
CO007 Meta's June 2025 strategic investment of approximately $14.3 billion for a minority stake implies a Scale AI valuation of over $29 billion. High SO019, SO016
CO008 Scale AI has processed more than 15 billion human-labeled decisions and has paid contributors globally over $1 billion. Medium SO001
CO009 Scale AI operates as a late-stage private company; it has not filed for an IPO as of the report date. High SO001, SO015
CO010 Alexandr Wang served as CEO of Scale AI from its founding in 2016 until June 2025, when he departed to join Meta AI. High SO002, SO016
CO011 Jason Droege was appointed Interim CEO of Scale AI in June 2025 following the departure of Alexandr Wang. High SO002, SO016
CO012 Jason Droege founded Uber Eats and scaled it to a $19 billion GMV run rate, then served as VP at Uber and partner at Benchmark before joining Scale AI as Chief Strategy Officer in September 2024. Medium SO002, SO027
CO013 Alexandr Wang retained a seat on Scale AI's board of directors after his departure to Meta in June 2025. Medium SO016, SO019
CO014 Scale AI's full board composition beyond Alexandr Wang is not publicly disclosed as of mid-2026. High SO001, SO002
CO015 Alexandr Wang's concurrent role at Meta AI and board seat at Scale AI creates a structural governance conflict of interest, as Meta is both Scale's largest strategic investor and a potential competitor to Scale's AI lab customers. Medium SO016, SO019, SO020
CO016 Scale AI's CEO transition from founder Alexandr Wang to interim CEO Jason Droege is a material key-person risk given Wang's central role in building Scale's customer relationships and product vision. Medium SO002, SO016, SO017
CO017 Scale AI's Series E in August 2021 raised $325 million at an approximate $7.3 billion valuation, with Coatue, Y Combinator, and Founders Fund among the lead investors. Medium SO015
CO018 Scale AI reduced its workforce by approximately 20% in 2023 amid a slowdown in AI training data demand. Medium SO015, SO017
CO019 The Scale AI Series F included investors Amazon, Meta, Cisco, Intel, AMD, ServiceNow, Nvidia, DFJ Growth, WCM, Elad Gil, and Nat Friedman as new or returning strategic participants alongside financial investors. High SO015, SO019
CO020 The Scale AI Series F round combined primary capital and a secondary component, allowing existing shareholders to partially liquidate their positions. Medium SO015
CO021 Meta's June 2025 investment was structured as a minority stake purchase of approximately 49% of Scale AI's outstanding equity on a fully diluted basis. Medium SO019, SO016
CO022 The proceeds from Meta's $14.3 billion investment in Scale AI were distributed to existing shareholders and holders of vested equity rather than retained as operating capital on Scale's balance sheet. Medium SO019, SO016
CO023 Scale AI stated it remains operationally independent from Meta following the strategic investment. Medium SO002, SO016
CO024 Scale AI's Donovan platform provides specialized AI agent workflows for defense and intelligence missions, operating in classified environments enabled by DoD IL4 and FedRAMP High certifications. High SO006, SO005
CO025 Scale AI holds SOC 2 Type II, ISO 27001, DoD IL4 Provisional Authorization, and FedRAMP High security certifications. High SO005, SO004
CO026 Scale AI's RLHF product provides curated preference data for reinforcement learning from human feedback, which is central to training large language models for instruction following and safety alignment. High SO012, SO009
CO027 Scale AI secured a U.S. Department of Defense data curation contract for joint force operations in 2024. Medium SO014, SO007
CO028 Scale AI testified before Congress on AI safety and data quality standards during the 118th Congress. Medium SO023, SO013
CO029 Scale AI offers enterprise customers custom pricing with dedicated operations teams and SLA commitments; self-serve customers pay per usage with the first 1,000 labeling units free. Medium SO003
CO030 Scale AI does not publicly disclose its annual recurring revenue, gross margin, or detailed financial metrics; it is classified as a private-undisclosed company. High SO001, SO003
CO031 Scale AI signed the White House voluntary AI safety commitments in 2024, pledging commitments on AI safety, security, and trust. Medium SO013, SO023
CO032 Scale AI operates its Global Public Sector division serving international government agencies in addition to U.S. defense customers. Medium SO026, SO007
CO033 Scale AI's scale of operations is indicated by the 15 billion human decisions processed and $1 billion paid to its contributor network globally. Medium SO001
CO034 Scale AI launched the TIME magazine GenAI deployment in under 2 months with over 7,000 AI attack vectors tested, demonstrating enterprise deployment speed and safety rigor. Medium SO011
CO035 Google was Scale AI's largest customer prior to the Meta strategic investment; CNBC reported Google planned to wind down or significantly reduce its Scale relationship after the Meta deal in June 2025. High SO020, SO021
CO036 OpenAI wound down its work with Scale AI in June 2025 following Meta's strategic investment, according to CNBC. High SO021, SO016
CO037 Scale AI operates the Scale GenAI Platform that transforms enterprise data into domain-specific generative AI applications using a proprietary pipeline. Medium SO001, SO008
CO038 Scale AI laid off approximately 200 employees (14% of staff) and 500 contractors in July 2025, with Interim CEO Droege's memo citing overinvestment in data-labeling capacity relative to the company's strategic direction. High SO017, SO016
CO039 Scale AI filed a lawsuit in September 2025 against Mercor, a rival data services company, and a former Scale employee, alleging an attempt to steal Scale's largest customers. Medium SO018, SO016
CO040 McKinsey's State of AI 2025 survey found that 88% of organizations now use AI in at least one business function, up from 78% in 2024, indicating strong underlying demand for AI data and infrastructure services. High SO028, SO022
CO041 The OpenAI fine-tuning and custom models program expansion highlights continued strong demand for specialized AI model training services, which Scale AI's RLHF and Data Engine products directly address. Medium SO024, SO028
CM001 The AI data services and infrastructure market encompasses data annotation, RLHF, model evaluation, and enterprise GenAI platform services as distinct but related segments with overlapping buyers and spend. High SM012, SM013
CM002 Status-quo substitutes for Scale AI's annotation services include internal human review teams at large AI labs, lower-cost offshore providers, and increasingly, synthetic data generation pipelines. Medium SM002, SM003, SM004
CM003 Scale AI does not currently serve the commodity crowdsource annotation market (e.g., Mechanical Turk), general AI consulting, or hyperscaler AI services (Azure, AWS, GCP) — these are excluded from its SAM. Medium SM016, SM017
CM004 The AI data annotation and evaluation market is served by a fragmented set of competitors including Appen, Labelbox, Snorkel AI, SuperAnnotate, Surge AI, Invisible Technologies, and Mercor, indicating a competitive but not yet consolidated market. Medium SM002, SM003, SM004, SM005, SM006, SM007, SM008
CM005 Scale AI's GenAI Platform competes in the enterprise AI deployment market against hyperscalers and boutique AI consultancies, while its Donovan platform operates in the specialized government and defense AI segment where cleared vendors are scarce. Medium SM010, SM009, SM011
CM006 The Total Addressable Market for AI data services and infrastructure is estimated at $10–$30 billion annually as of 2025, with high uncertainty due to boundary definition differences and rapid market evolution. Low SM012, SM013
CM007 A narrower TAM for AI data annotation services only (excluding evaluation, RLHF, and enterprise GenAI platform) is estimated at $2–$8 billion, derived from public-company proxy revenues scaled to global market size. Low SM004, SM018
CM008 Scale AI's Serviceable Addressable Market (premium tier: AI labs, large enterprise, and government) is estimated at $1.5–$6 billion, representing the high-quality end of AI data services where Scale's quality premium and certifications are competitive differentiators. Low SM012, SM018
CM009 The U.S. government and defense AI data and evaluation submarket addressable by cleared vendors like Scale AI is estimated at $300M–$1B based on defense AI investment growth trends and cleared vendor supply constraints. Low SM011, SM025
CM010 Scale AI's realistic Serviceable Obtainable Market (SOM) within a 3–5 year horizon is estimated at $500M–$2 billion, consistent with Scale's $29B+ valuation at typical SaaS/data revenue multiples. Low SM021, SM018
CM011 No public analyst report provides a consistent, granular breakdown of the AI data annotation market with sufficient specificity for high-confidence TAM/SAM/SOM sizing; all estimates carry high uncertainty. Medium SM014, SM015
CM012 AI research laboratories and foundation model developers (OpenAI, Meta, Google DeepMind, Anthropic, Cohere) represent Scale AI's original customer segment, requiring large volumes of high-quality labeled data and RLHF data. High SM023, SM016
CM013 Large enterprise AI adopters (Fortune 500 companies using AI for products and automation) represent Scale AI's growth segment, accessing the GenAI Platform with higher switching costs and longer procurement cycles than AI labs. Medium SM010, SM022
CM014 U.S. government and defense agencies represent Scale AI's most defensible buyer segment, with very high switching costs due to FedRAMP High and DoD IL4 clearance requirements, multi-year contract structures, and institutional knowledge dependencies. High SM009, SM011
CM015 AI startups and research organizations access Scale's self-serve tier with low switching costs and lower average revenue per user, representing volume without durability. Medium SM016, SM017
CM016 U.S. government AI procurement follows federal acquisition regulations with multi-year contract vehicles and IDIQ structures, creating revenue lumpiness but high stickiness once awarded. Medium SM025, SM011
CM017 The TIME magazine GenAI deployment case study demonstrates Scale AI's ability to win enterprise media customers with fast deployment timelines (under 2 months) and comprehensive AI safety testing. Medium SM023
CM018 Enterprise AI platform buyers (vs. AI lab buyers) offer Scale higher switching costs and platform stickiness, making them strategically more valuable for long-term revenue durability despite lower initial volume. Medium SM010, SM022
CM019 McKinsey's 2025 State of AI survey found that 88% of organizations use AI in at least one business function, up from 78% the previous year, confirming accelerating enterprise AI adoption. High SM012, SM001
CM020 McKinsey's 2025 survey found 62% of organizations are experimenting with AI agents, indicating AI adoption is moving toward more complex and automated workflows that require higher-quality training and evaluation data. High SM012, SM001
CM021 Most organizations are still in the AI pilot/POC stage rather than full-scale deployment, according to McKinsey's 2025 survey, suggesting enterprise platform revenue for Scale AI will lag the adoption curve by 12–24 months. Medium SM012
CM022 U.S. government AI investment is growing materially as the DoD integrates AI into surveillance, logistics, cybersecurity, and autonomous systems — all of which require trusted data infrastructure like Scale provides. Medium SM025, SM011
CM023 AI safety regulatory pressure — including the White House AI Executive Order and EU AI Act — is increasing demand for AI model evaluation and audit services, benefiting Scale's Evaluation product. Medium SM025, SM013
CM024 The proliferation of foundation model providers (OpenAI, Meta, Anthropic, Mistral, Cohere) increases aggregate demand for RLHF data and model evaluation, partially offsetting the risk from any single large lab customer departure. Medium SM020, SM013
CM025 Synthetic data generation represents a long-term structural threat to human-labeled annotation demand; some AI labs are shifting toward model-generated synthetic data for portions of their training pipelines. Medium SM020, SM019
CM026 Low-cost annotation providers (Appen, offshore QA teams, crowdsourcing platforms) create structural margin pressure on Scale's data annotation business, constraining the price premium Scale can sustain without clear quality differentiation. Medium SM004, SM002
CM027 The AI data annotation market does not have publicly available CAGR estimates from tier-one analyst firms with sufficient market boundary consistency to provide a reliable growth rate for TAM sizing. Medium SM014, SM015
CM028 Appen (ASX-listed) is the only publicly traded direct comparable to Scale AI's data annotation business, but Appen's revenue trends, market positioning, and customer base differ materially from Scale's premium positioning. Medium SM004
CM029 The revenue impact of Google's and OpenAI's departures from Scale AI as customers cannot be quantified from public sources, representing the single most material unresolved market sizing question for Scale's investment case. High SM019, SM021
CM030 Scale AI's revenue mix by segment (AI labs vs. enterprise vs. government) is not publicly disclosed, making it impossible to verify which segment dominates current revenue or project the enterprise and government pivot timeline. High SM016, SM023
CM031 The wide range of AI data services TAM estimates ($2B–$30B) reflects genuine analyst disagreement on boundary definitions, and investors should treat all sizing figures as directional rather than precise. Medium SM014, SM015, SM012
CM032 Scale AI's AI Readiness Report positions the company as a thought leader in enterprise AI maturity, which supports its brand in the enterprise buyer segment and validates its content-led GTM motion. Medium SM022
CM033 Stanford HAI AI Index 2025 confirms that AI investment reached record levels globally in 2024, with both private investment and government spending accelerating, providing macro validation for the AI data services market. High SM013, SM001
CM034 Fortune 500 companies deploying production AI (not just pilot) in 2025 represent a small but growing fraction of the enterprise TAM; McKinsey data suggests roughly one-third of AI-adopting organizations are scaling programs. Medium SM012
CM035 The competitive landscape for AI data services includes multiple vendors targeting different quality/price points — from premium (Scale, Surge) to mid-market (Labelbox, Snorkel, SuperAnnotate) to low-cost (Appen, Invisible) — suggesting market segmentation rather than single-vendor dominance. Medium SM002, SM003, SM005, SM006, SM007
CP001 Scale AI's competitive landscape encompasses direct competitors in AI data annotation (Appen, Labelbox, Snorkel AI, SuperAnnotate), RLHF data (Surge AI, Mercor), enterprise GenAI platform (hyperscalers, boutique AI consultancies), and defense AI (defense contractors). High SP004, SP007, SP012, SP017, SP014, SP001
CP002 Appen is the only publicly traded direct comparable to Scale AI's annotation business, and Appen's declining revenues provide a market signal for the structural headwinds facing pure-play annotation vendors. Medium SP004, SP019
CP003 Meta's June 2025 strategic investment in Scale AI created a direct conflict of interest with Scale's largest AI lab customers (Google, OpenAI), leading both to wind down or significantly reduce their Scale relationships. High SP021, SP022, SP023
CP004 Google was Scale AI's largest customer prior to the Meta investment and planned to wind down its Scale relationship in June 2025 due to competitive conflict concerns. High SP021, SP023
CP005 OpenAI wound down its work with Scale AI in June 2025, coinciding with Meta's strategic investment and founder Alexandr Wang's departure to Meta. High SP022, SP023
CP006 Labelbox has expanded from core data labeling to RLHF, model evaluation, leaderboards, and robotics AI training, making it a direct multi-product competitor to Scale AI across several key product lines. Medium SP007, SP008, SP009, SP010, SP011
CP007 Snorkel AI's programmatic labeling approach — using AI weak supervision to reduce manual annotation — threatens Scale's human-expert model by potentially reducing the labor content and cost of annotation, directly challenging Scale's premium pricing thesis. Medium SP012, SP013
CP008 Mercor operates an AI talent marketplace for RLHF, model evaluation, and data labeling, founded by former Scale AI-connected individuals, and is being sued by Scale for alleged customer poaching. High SP001, SP014
CP009 SuperAnnotate competes with Scale AI in the enterprise annotation platform market with a security-first positioning and collaborative workflow tools, primarily targeting computer vision use cases. Medium SP015
CP010 Invisible Technologies provides AI-powered business operations services that compete with Scale's enterprise automation and annotation capabilities at the operations level. Medium SP017
CP011 Surge AI focuses specifically on high-quality human feedback data for LLM training and RLHF, directly competing with Scale's RLHF product with a smaller but expert contributor network. Medium SP017
CP012 Appen has expanded into agentic AI services and model evaluation in addition to its core annotation business, indicating it is attempting to follow Scale into higher-margin evaluation segments. Medium SP005, SP006
CP013 Appen provides data security features for enterprise and government customers but does not disclose DoD IL4 or FedRAMP High certifications equivalent to Scale AI's cleared vendor status. Medium SP018, SP002
CP014 Labelbox offers tiered pricing including self-serve developer plans and enterprise custom pricing, making it price-competitive against Scale's self-serve tier and potentially more accessible to mid-market enterprise. Medium SP016, SP007
CP015 Scale AI's DoD IL4 Provisional Authorization and FedRAMP High certifications are unique among publicly disclosed AI data annotation vendors, creating a near-exclusive position in classified and defense AI data markets. High SP002, SP003
CP016 Scale AI's Donovan defense AI agent platform has no publicly disclosed direct competitor with equivalent security clearances and defense-specific deployment capabilities. Medium SP003, SP002
CP017 Scale AI's evaluation benchmarking position — including the Scale Leaderboard and WMDP harmful-knowledge benchmark — has established reputational authority as a trusted third-party LLM evaluator, providing a differentiated position versus competitors. Medium SP025, SP002
CP018 Labelbox and Snorkel AI do not publicly disclose government security certifications equivalent to Scale's DoD IL4 or FedRAMP High authorizations, indicating limited ability to compete for Scale's most defensible government contracts. Medium SP007, SP012
CP019 Scale AI's feature breadth — spanning annotation, RLHF, evaluation, enterprise GenAI platform, and defense AI agents — is unmatched by any single competitor, though individual competitors may lead in specific capabilities. Medium SP025, SP007, SP004
CP020 Scale's enterprise GenAI Platform has no direct counterpart at Appen, Labelbox, or Snorkel AI, but faces significant competition from hyperscaler AI platforms (AWS Bedrock, Azure AI, Google Vertex) with far greater resource advantages. Medium SP025, SP007
CP021 Switching costs for AI lab customers of Scale AI are moderate: annotation pipelines are fungible, and Google and OpenAI both exited Scale's ecosystem within weeks of the Meta conflict emerging in June 2025. High SP021, SP022
CP022 Switching costs for enterprise customers using Scale's GenAI Platform are higher than for annotation-only customers, due to platform integration, workflow customization, and data pipeline dependencies. Medium SP025
CP023 Obtaining DoD IL4 Provisional Authorization and FedRAMP High certification requires a multi-year process involving security audits, infrastructure requirements, and federal agency sponsorship, creating a 12–36 month barrier for any competitor seeking to enter Scale's government segment. Medium SP002, SP003
CP024 Mercor is actively building an alternative AI expert contributor supply chain through its talent marketplace to directly compete with Scale's contributor network, targeting the same expert annotators and RLHF data customers. Medium SP001, SP014
CP025 Multi-homing — where annotation customers run work through multiple vendors simultaneously — is common in the AI data market; Scale's self-serve tier confirms this with its pay-as-you-go model that has no long-term commitment. Medium SP007, SP012
CP026 Scale AI's government and defense segment represents its most durable competitive moat, requiring years for competitors to replicate the clearances, institutional knowledge, and defense-specific product (Donovan) that Scale has built. High SP002, SP003, SP025
CP027 Scale AI's quality premium in annotation is under pressure from competitors including Labelbox, Surge AI, and lower-cost providers; the departure of Google and OpenAI demonstrates that quality premium alone was insufficient to sustain those relationships. High SP021, SP022, SP024
CP028 Appen's declining revenues — the most direct public comparable to Scale's annotation business — serve as a leading indicator of structural headwinds in the pure-play data annotation market. Medium SP019, SP004
CP029 The Meta strategic investment has become Scale AI's primary competitive liability in the AI lab segment: Mercor's lawsuit defense (Sep 2025) reveals that competitors view Scale's AI lab customer base as vulnerable and actively targetable. High SP001, SP021, SP022
CP030 Scale AI's July 2025 layoffs specifically targeted data-labeling employees, not evaluation or platform staff, confirming management's internal assessment that annotation is a commoditizing segment requiring rightsizing. High SP024, SP023
CP031 Snorkel AI's programmatic labeling approach and rising AI-assisted annotation tools represent a structural threat to the human-label-intensive part of Scale's business model, potentially reducing the labor content of annotation work over time. Medium SP012, SP013
CP032 Scale AI's annotation platform (Scale Data Engine) has built-in quality feedback loops and an expert contributor network to which Scale has paid $1B+ globally, creating a proprietary supply chain that competitors must spend years and significant capital to replicate. Medium SP025, SP002
CP033 The competitive landscape for AI data services lacks a dominant single vendor — no competitor has matched Scale's full-stack positioning across annotation, RLHF, evaluation, and defense AI — but individual segments are increasingly contested. Medium SP004, SP007, SP012, SP014
CP034 Large defense contractors such as Palantir and Booz Allen are potential long-term entrants into the defense AI data market, representing a risk to Scale's Donovan moat over a 3–5 year horizon. Low SP003
CP035 Scale AI has not publicly documented evidence of losing any government or defense contract to a competitor, which supports the durability of its government segment moat in the near term. Medium SP002, SP003
CI001 Scale AI generates revenue through enterprise data annotation/RLHF services, the Scale GenAI Platform (enterprise SaaS + managed services), U.S. government and defense contracts, and a self-serve tier. High SI001, SI013, SI020, SI014
CI002 Scale AI's GenAI Platform targets enterprises seeking to build custom AI applications from proprietary data, representing a higher-margin opportunity than commodity annotation. Medium SI001, SI004
CI003 Scale AI holds DoD IL4 and FedRAMP High certifications, enabling it to pursue classified and defense-grade AI data contracts that most competitors cannot access. Medium SI005, SI006
CI004 Scale AI's RLHF revenue is affected by OpenAI's wind-down of its Scale relationship post-Meta-investment, while Meta's own RLHF work with Scale is expanding. Medium SI022, SI017
CI005 Scale AI's self-serve Data Engine offers the first 1,000 labeling units free, then charges pay-as-you-go; enterprise tiers are custom-priced with dedicated operations and SLAs. High SI013, SI001
CI006 Scale AI's enterprise GTM is primarily sales-led with dedicated operations teams and solution engineers; government contracts require specialized federal procurement BD processes. Medium SI013, SI005, SI021
CI007 Scale AI's enterprise pricing for annotation, RLHF, and platform services is custom-quoted and not publicly disclosed; no list pricing or typical deal size information is available. High SI013, SI001, SI020
CI008 Scale AI's enterprise annotation sales cycle is estimated at 3–12 months, with government contract sales cycles typically 12–36 months; specific CAC figures are not disclosed. Low SI005, SI006, SI021
CI009 Scale AI's data-labeling revenue was concentrated among a small number of large AI lab customers, creating material customer concentration risk that materialized when Google and OpenAI departed in 2025. High SI018, SI022, SI016
CI010 Scale AI's investor relationships (Accel, Amazon, Meta, Nvidia, Cisco) create potential co-sell and referral channels, but the specific economic impact of these relationships on revenue is not publicly disclosed. Low SI015, SI014
CI011 Scale AI's annotation cost of revenue is dominated by human labor, with the company having paid over $1 billion globally to contributors; annotation gross margins are estimated at 25–45%, using Appen's public financials as a proxy. Low SI014, SI007, SI008
CI012 Scale AI's GenAI Platform is estimated to carry significantly higher gross margins (45–65%) than annotation services, reflecting greater software leverage and less per-unit human labor, consistent with enterprise managed-services industry benchmarks. Low SI001, SI004, SI023
CI013 Scale AI's July 2025 layoffs of 200 employees (14% of staff) and 500 contractors targeted the data-labeling business, signaling management's intent to shift to higher-margin enterprise and government services and reduce the labor-heavy cost base. High SI016, SI021
CI014 Appen (ASX: APX), the only publicly traded direct comparable to Scale AI's annotation segment, has reported declining revenues and gross margins in the 25–40% range, confirming structural headwinds in commodity annotation economics. Medium SI007, SI008, SI009
CI015 Scale AI's cost structure is primarily working-capital-intensive (human labor payments, contractor management) rather than physical capex intensive, making it operationally flexible but subject to margin pressure from annotation labor costs. Medium SI014, SI010, SI011
CI016 Scale AI raised approximately $600 million in pre-Series F capital and $1 billion in Series F (May 2024, $13.8 billion valuation, led by Accel) for a total raised of approximately $1.6 billion or more. High SI015, SI019
CI017 Meta invested approximately $14.3 billion for a minority stake of approximately 49% in Scale AI in June 2025, implying a Scale valuation of over $29 billion; the investment was primarily structured as a secondary transaction with proceeds distributed to existing shareholders. High SI017, SI019
CI018 Scale AI does not have any publicly disclosed debt, credit facilities, or project-finance obligations; its capital structure is equity-financed. Medium SI015, SI017
CI019 Based on the Series F primary capital (estimated $500M+ to the company), post-layoff burn rate reductions, and potential primary capital from the Meta deal, Scale AI is estimated to have $500M–$1B cash on hand with 24–48 months of runway as of mid-2025. Low SI015, SI017, SI016
CI020 Scale AI's total estimated annual revenue is in the range of $200M–$500M ARR, based on headcount proxies (~1,000 employees post-layoff), industry revenue-per-employee benchmarks, and analyst commentary; this estimate has very low confidence without public financial disclosure. Low SI023, SI025, SI014
CI021 At Scale AI's $29B+ implied valuation, if ARR is $200M–$500M, the implied revenue multiple of 58x–145x is in hypergrowth-premium territory and cannot be verified or justified without audited financial statements and confirmed growth rates. Low SI017, SI023
CI022 Scale AI does not disclose revenue, ARR, gross margin, NRR, burn rate, or operating cash flow; these omissions represent the primary blockers to financial underwriting of the company. High SI013, SI014, SI021
CI023 Google and OpenAI's departure as customers in 2025 represents a material risk to Scale AI's revenue trajectory; the financial magnitude of this attrition is unknown but potentially $50M–$200M+ in annual revenue based on their reported status as Scale's largest AI lab customers. Low SI018, SI022, SI016
CI024 The Meta strategic investment does not appear to include covenants or restrictions on Scale AI's operations; Meta holds a minority stake and Scale remains independent, but the conflict of interest with other customers is a de facto constraint on commercial relationships. Low SI017, SI019, SI021
CI025 A Mercor lawsuit filed by Scale AI in September 2025 alleging customer poaching could have financial implications including legal costs, settlement obligations, and customer loss not yet captured in financial estimates. Medium SI016, SI021
CI026 Scale AI's enterprise and government revenue segments have higher revenue quality (longer contracts, stronger switching costs) than the annotation segment, but their current size and growth trajectory are not publicly disclosed. Medium SI005, SI006, SI001
CI027 Scale AI paid over $1 billion globally to annotation contributors, confirming the company's historical commitment to human labor-based annotation and the labor intensity of its core revenue model. Medium SI014, SI013
CI028 McKinsey's State of AI 2025 report notes that 88% of organizations use AI in at least one business function, up from 78%, supporting a growing market for Scale AI's enterprise AI data and platform services. High SI023, SI025
CI029 Scale AI's annotation business model is primarily working-capital-intensive, with payments to contributors timed to annotation project delivery; this creates a different capex profile than hardware or infrastructure companies. Medium SI014, SI012
CI030 Appen's publicly disclosed multimodal, speech/audio, and physical AI annotation services are directly comparable to Scale AI's core annotation product lines, making Appen's financial results the best available public proxy for Scale's annotation segment economics. Medium SI009, SI010, SI011
CI031 Scale AI's government and defense revenue segment carries high switching costs (DoD IL4 and FedRAMP High certification barriers) and multi-year contract durations, creating more durable and predictable revenue than the annotation segment. Medium SI005, SI006
CI032 Scale AI's revenue mix is shifting from data-labeling (legacy largest segment) toward enterprise GenAI platform and government/defense contracts, consistent with the July 2025 restructuring that cut data-labeling headcount while maintaining enterprise and government operations. Medium SI016, SI021, SI001
CI033 Scale AI's government contract revenue segment is growing through active DoD relationships (data curation contract, DIU RCV program, White House AI commitments) but exact contract values and revenue contribution are not publicly disclosed. Medium SI005, SI006, SI024
CI034 The net primary capital received by Scale AI from the Meta deal is not publicly disclosed; the deal was primarily structured as shareholder liquidity, meaning Scale's operating treasury benefit may be significantly less than the $14.3B headline figure. Medium SI017, SI019
CI035 Scale AI's financial verdict is mixed: abundant capital buffers the pivot risk, but the $29B+ valuation is unjustifiable without confirmed revenue data, and the annotation-segment revenue attrition risk is unquantified. Medium SI017, SI018, SI022
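The valuation and revenue-multiple figures in CI017, CI020, and CI021 can be reproduced with a few lines of arithmetic. The sketch below is illustrative only: the inputs are the approximate figures cited in those claims, the ARR range is the report's own low-confidence estimate, and the function and variable names are hypothetical.

```python
# Inputs are the approximate figures cited in CI017, CI020 and CI021.
# All names here are illustrative, not part of any Scale AI disclosure.
META_INVESTMENT_USD_B = 14.3      # approximate Meta investment, USD billions
META_STAKE = 0.49                 # approximate minority stake acquired
HEADLINE_VALUATION_USD_B = 29.0   # "$29B+" headline valuation, USD billions
ARR_RANGE_USD_B = (0.2, 0.5)      # estimated ARR range, USD billions (low confidence)


def implied_valuation(investment: float, stake: float) -> float:
    """Valuation implied by paying `investment` for a `stake` fraction of equity."""
    return investment / stake


def revenue_multiple(valuation: float, arr: float) -> float:
    """Valuation expressed as a multiple of annual recurring revenue."""
    return valuation / arr


valuation = implied_valuation(META_INVESTMENT_USD_B, META_STAKE)
low_arr, high_arr = ARR_RANGE_USD_B
multiple_low = revenue_multiple(HEADLINE_VALUATION_USD_B, high_arr)   # high ARR gives the low multiple
multiple_high = revenue_multiple(HEADLINE_VALUATION_USD_B, low_arr)   # low ARR gives the high multiple

print(f"Implied valuation: ${valuation:.1f}B")
print(f"Implied multiple: {multiple_low:.0f}x to {multiple_high:.0f}x ARR")
```

With these inputs the implied valuation comes out at about $29.2B and the multiple range at 58x to 145x, matching the figures in the claims above. The result is only as reliable as the ARR estimate, which the report itself rates as low confidence.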
CE001 Scale AI's core product portfolio consists of five offerings: Scale Data Engine (annotation + curation), Scale RLHF (LLM training data), Scale Evaluation + Leaderboard (model benchmarking), Scale GenAI Platform (enterprise AI applications), and Donovan (defense AI agents). High SE017, SE016, SE015, SE018, SE020
CE002 The Scale GenAI Platform enables enterprises to build and deploy custom AI applications from proprietary data, differentiating from hyperscalers by integrating annotation quality with LLM customization. Medium SE011, SE018, SE013
CE003 Scale AI's RLHF product provides expert human feedback data for LLM training and alignment; the product was historically used by OpenAI and Google before their 2025 departures. Medium SE016, SE021
CE004 The Scale Leaderboard is a public developer-facing tool that ranks LLM performance across capability dimensions; it has established Scale as a trusted third-party evaluator in the AI research community. Medium SE001, SE015
CE005 Donovan is Scale AI's specialized AI agents platform for DoD, IC, and government agencies; it operates in DoD IL4-certified environments and has no disclosed direct competitor in the cleared AI agents space. Medium SE020, SE019, SE003
CE006 Scale AI's operating model combines proprietary annotation tooling, a quality assurance pipeline, and a global contributor network with software API infrastructure — a hybrid human-in-the-loop architecture. Medium SE017, SE012, SE011
CE007 The Scale API provides programmatic access to annotation, RLHF, and evaluation services with REST interface and webhook integration; it enables enterprise and developer integration with MLOps pipelines. Medium SE012, SE013
CE008 Scale AI's annotation workflow: customers submit raw data and task guidelines via API; Scale's contributor network executes annotation; a QA pipeline provides statistical review and expert escalation; annotated datasets are returned to customers. Medium SE017, SE012, SE016
CE009 Scale AI's GenAI Platform operating model is a managed services approach involving data ingestion, LLM selection, fine-tuning or RAG pipeline configuration, red-teaming and safety evaluation, and production deployment with monitoring. Medium SE011, SE018, SE027
CE010 Donovan's architecture is specialized for cleared government environments: DoD IL4-certified cloud infrastructure, classified data integration, multi-modal AI agent capabilities, and explainability features for military applications. Low SE020, SE019, SE003
CE011 Scale AI's primary technology differentiators are: proprietary annotation tooling and QA methodology, the global contributor network ($1B+ paid), DoD IL4 and FedRAMP High certifications, the Scale Leaderboard as reputational IP, and Donovan as a first-mover defense AI platform. Medium SE014, SE020, SE001, SE017
CE012 Scale AI's DoD IL4 and FedRAMP High certifications represent a genuine barrier to entry for competitors: obtaining these authorizations takes 2–4 years and requires significant compliance investment and government institutional relationships. Medium SE014, SE019, SE020
CE013 Scale AI's contributor network — having paid over $1 billion globally — is a proprietary supply-side asset that competitors including Mercor are attempting to replicate; Scale's lawsuit against Mercor specifically alleges customer and annotator poaching. High SE022, SE010, SE009
CE014 The Scale Leaderboard positions Scale as the trusted third-party LLM evaluator; Labelbox has also launched competing leaderboards, creating competitive pressure in the model evaluation market. Medium SE001, SE007, SE015
CE015 The WMDP (Weapons of Mass Destruction Proxy) benchmark is a publicly available AI safety evaluation tool created by Scale; its adoption by the AI safety research community reinforces Scale's position as a trusted AI safety evaluator. Medium SE002, SE025
CE016 Scale AI holds SOC 2 Type II, ISO 27001, DoD IL4 Provisional Authorization, and FedRAMP High Authorization certifications, confirmed on the official Scale security page. High SE014, SE019
CE017 Scale AI signed the 2024 White House voluntary AI safety commitments covering RLHF safety practices, red-teaming, and responsible AI deployment. High SE024, SE002
CE018 Scale AI's annotation quality controls include task-specific quality guidelines, multiple annotator redundancy, statistical quality monitoring, expert reviewer escalation, and inter-annotator agreement scoring; however, no public third-party quality audit is available. Medium SE017, SE016
CE019 The TIME customer case study demonstrates Scale GenAI Platform deployment speed (under 2 months) and red-teaming capability (7,000+ attack vectors tested), providing partial public evidence of product performance. Medium SE027, SE018
CE020 Scale AI's data privacy and customer data handling practices are partially covered by SOC 2 Type II certification; however, the specific data isolation mechanisms for enterprise annotation customers are not publicly documented. Medium SE014, SE011
CE021 Scale AI's 2025 product roadmap is implicitly directed toward expanding the GenAI Platform, Donovan government deployments, and Scale Evaluation, while reducing investment in commodity annotation — consistent with the July 2025 restructuring. Medium SE021, SE018, SE020
CE022 Scale AI does not maintain a public product roadmap, changelog, or developer status page; forward-looking roadmap information must be inferred from public statements and strategic announcements. High SE011, SE012, SE017
CE023 Scale's GenAI Platform competes with Amazon Bedrock, Google Vertex AI, and Azure AI Studio; hyperscalers have significantly greater infrastructure scale, distribution, and resources, representing a structural competitive threat to Scale's platform business. Medium SE026, SE023, SE018
CE024 Scale AI has not publicly disclosed investment in synthetic data capabilities; if synthetic data quality matches human annotation for LLM training, Scale's Data Engine TAM could shrink materially — a critical product roadmap gap. Medium SE005, SE004, SE021
CE025 Snorkel AI's programmatic labeling approach, which uses AI-assisted weak supervision to generate training data with less human annotation, represents a technical threat to Scale's human-annotation-first model. Medium SE005, SE004
CE026 Scale AI's developer community presence is limited compared to AI infrastructure companies with open-source products; the Scale Leaderboard and WMDP benchmark are the primary developer-facing signals, generating reputational capital rather than active developer community engagement. Medium SE001, SE002, SE004
CE027 Scale AI's technology differentiation versus Labelbox is primarily in QA methodology and contributor quality management; Labelbox has built competing evaluation products (Labelbox Leaderboards) and an Expert Network, narrowing the quality gap. Medium SE007, SE006, SE017
CE028 The McKinsey State of AI 2025 report finds that 62% of organizations are experimenting with AI agents, validating Scale's strategic pivot toward agentic AI services (GenAI Platform, Donovan) as an addressable growth market. High SE026, SE025
CE029 Scale AI's RLHF product is affected by OpenAI's departure (wind-down confirmed June 2025) but expanding in relation to Meta's RLHF needs following the strategic investment; this creates dependency concentration risk on Meta as both investor and customer. Medium SE023, SE016, SE021
CE030 Scale AI's Scale Evaluation product has an independence perception risk: as a company with Meta as a strategic investor (~49% minority stake), its role as a neutral third-party LLM evaluator may be questioned by AI labs that compete with Meta. Medium SE023, SE001
CE031 Scale's DIU RCV (Robotic Combat Vehicle) program win confirms that Scale's technology meets defense procurement standards for autonomous military AI applications, validating Donovan's technical capability for defense missions. Medium SE003, SE020
CE032 Scale AI's annotation quality differentiation versus Snorkel AI is philosophical: Scale uses high-quality human annotation for accuracy; Snorkel uses AI-assisted weak supervision for scale and cost. The two approaches serve different customer segments and are both growing. Medium SE005, SE017, SE004
CE033 Scale AI's AI safety white paper on test and evaluation provides public documentation of Scale's approach to model evaluation methodology, supporting its positioning as a credible government AI evaluation authority. Medium SE002, SE024, SE015
CE034 Scale AI's cloud infrastructure dependencies (AWS/GCP/Azure) represent a supply chain risk: changes in cloud provider pricing or availability could affect annotation tooling uptime and GenAI Platform delivery. Low SE011, SE017, SE019
CE035 No publicly available independent third-party technical audit of Scale AI's annotation quality, QA pipeline accuracy, or GenAI Platform performance exists; all quality claims are company-asserted or based on single customer case studies. High SE027, SE017, SE016
CU001 Scale AI serves three primary customer segments: AI Labs and model developers, Fortune 500 enterprise customers, and U.S. government and defense agencies. High SU001, SU015
CU002 TIME deployed Scale AI's GenAI Platform in a production safety application, testing over 7,000 adversarial attack vectors against AI-generated content in under two months. High SU002, SU001
CU003 Meta holds approximately 49% of Scale AI's equity following the June 2025 strategic investment and is simultaneously an expanding RLHF data customer. High SU018, SU015
CU004 Cohere is listed as an AI lab customer on Scale AI's official customers page. Medium SU001
CU005 Etsy is listed as an enterprise customer on Scale AI's official customers page. Medium SU001
CU006 Instacart is listed as an enterprise customer on Scale AI's official customers page. Medium SU001
CU007 Pinterest is listed as an enterprise customer on Scale AI's official customers page. Medium SU001
CU008 Scale AI holds active Department of Defense contracts including a data curation contract for joint force operations and the Donovan platform for national security AI workflows. High SU004, SU005
CU009 Google, previously Scale AI's largest customer, planned to wind down or significantly reduce its Scale AI relationship in June 2025 following Meta's strategic investment. High SU019, SU018
CU010 OpenAI wound down its work with Scale AI in June 2025, as reported by CNBC. High SU020, SU015
CU011 Scale AI has paid over $1 billion to annotation contributors globally, per its official customers page. Medium SU001
CU012 Scale AI has processed over 15 billion human-labeled decisions, per its official customers page. Medium SU001
CU013 Scale AI laid off approximately 200 employees (14% of staff) and 500 contractors in July 2025, with the reductions concentrated in the data-labeling business. High SU016, SU015
CU014 Scale AI's GenAI Platform was deployed in a production environment at TIME Media for AI content safety testing, as documented in the official case study. Medium SU002
CU015 Scale AI holds active DoD IL4 and FedRAMP High certifications enabling deployment in defense and government environments that handle controlled unclassified information. High SU003, SU004
CU016 Scale AI's DoD data curation contract for joint force operations was announced via official company blog post. High SU005, SU004
CU017 Scale AI offers a self-serve pricing tier with the first 1,000 labeling units free and pay-as-you-go access for developers and researchers. Medium SU027
CU018 Amazon, Cisco, Intel, AMD, and ServiceNow participated as strategic investors in Scale AI's May 2024 Series F round, providing customer-as-investor validation. Medium SU015
CU019 Scale AI has not publicly disclosed any net revenue retention (NRR) or gross revenue retention (GRR) metrics. Low
CU020 Scale AI has not publicly disclosed its total active customer count across any segment. Low
CU021 Scale AI's enterprise pricing structure includes custom pricing with dedicated operations teams and SLA commitments for enterprise customers. Medium SU027
CU022 Meta's information access as a ~49% investor alongside its role as an RLHF customer creates a potential conflict of interest for other enterprise and AI lab customers evaluating Scale AI's data security and confidentiality. Medium SU018, SU019
CU023 Snorkel AI operates a public leaderboard and partners program targeting enterprise ML customers, competing in the same AI data infrastructure market as Scale AI. Medium SU007, SU008
CU024 SuperAnnotate maintains a public enterprise platform offering for annotation, competing with Scale AI's Data Engine in the enterprise annotation segment. Medium SU025
CU025 Mercor operates as a competitor in the AI annotation and evaluation marketplace, with enterprise-facing product pages and active hiring. Medium SU013, SU017
CU026 Scale AI's Donovan platform serves the defense and intelligence community with AI agent workflows designed for classified national security operations. High SU006, SU003
CU027 Google's departure from Scale AI was driven by competitive conflict concerns arising from Meta's strategic investment, per CNBC sourcing. Medium SU019
CU028 OpenAI's wind-down of Scale AI work coincided with founder Alexandr Wang's departure to join Meta, suggesting structural realignment of AI lab annotation sourcing. Medium SU020, SU015
CU029 The simultaneous departure of Google and OpenAI in Q2 2025 represents an estimated 20–40% reduction in Scale AI's AI lab segment revenue, based on their reported prominence as major customers. Low SU019, SU020
CU030 Scale AI's government and defense customers have the highest switching costs of any segment due to multi-year contracts, IL4/FedRAMP certification requirements, and classified-environment integration barriers. Medium SU003, SU004
CU031 Enterprise customers Etsy, Instacart, and Pinterest appear on Scale AI's official customers page without associated case studies, outcome metrics, or deployment scope details. Medium SU001
CU032 No G2, Gartner Peer Insights, or Capterra reviews were identified for Scale AI's enterprise platform, leaving customer satisfaction entirely unquantified. Low
CU033 Scale AI's July 2025 layoffs concentrated in data-labeling signal that RLHF and annotation throughput from AI lab customers declined materially in H1 2025. Medium SU016
CU034 Scale AI filed a lawsuit against Mercor in September 2025 alleging customer poaching and misappropriation of trade secrets by a former employee. Medium SU017
CU035 Scale AI's documented land-and-expand model involves starting with data annotation, then upselling to RLHF, evaluation, and the GenAI Platform as customer needs mature. Medium SU001, SU026
CU036 Scale AI's customer base is predominantly U.S.-based, with no publicly disclosed international customer count or revenue split by geography. Medium SU001, SU003
CU037 Scale AI has not publicly disclosed any annual or quarterly customer churn rate for any segment. Low
CU038 Scale AI's AI lab customer segment is the most volatile, with Google and OpenAI both exiting in Q2 2025 and remaining labs facing structural pressure toward in-house annotation. Medium SU019, SU020
CU039 Snorkel AI's press page and partner program confirm active enterprise customer development activity, corroborating that the enterprise AI data market remains competitive. Low SU009, SU008
CU040 Appen's enterprise case studies indicate ongoing demand for AI annotation services in comparable market segments, validating Scale's addressable market despite attrition events. Low SU023
CU041 Meta's expanding RLHF customer relationship with Scale AI, combined with its ~49% equity stake, creates an unprecedented customer-investor concentration that could deter other AI lab customers. Medium SU018, SU019
CU042 Scale AI's self-serve tier pricing structure was publicly accessible as of the research date, confirming a lower-commitment entry point for the developer and researcher segment. Medium SU027
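The layoff figures in CU013 (approximately 200 employees, stated as 14% of staff) imply a rough employee headcount before and after the reduction. The sketch below works that out; both inputs are rounded in the source reporting, the 500 contractors are excluded, and the function name is hypothetical, so the output is indicative only.

```python
# Inputs are the rounded figures cited in CU013; contractors are excluded.
LAID_OFF_EMPLOYEES = 200   # approximate employees laid off in July 2025
LAYOFF_SHARE = 0.14        # stated share of staff affected


def implied_headcount(laid_off: int, share: float) -> tuple[float, float]:
    """Return (headcount before, headcount after) implied by a layoff count and share."""
    before = laid_off / share
    return before, before - laid_off


before, after = implied_headcount(LAID_OFF_EMPLOYEES, LAYOFF_SHARE)
print(f"Implied employees before: ~{before:.0f}, after: ~{after:.0f}")
```

Because the 14% share and the 200 count are both approximations, small rounding differences in either input shift the implied headcount by a hundred or more employees.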
CR001 Customer concentration risk is the highest-severity risk facing Scale AI, with Google and OpenAI both departing as customers in Q2 2025 following the Meta investment. High SR005, SR006
CR002 CEO and leadership transition risk is high: founder Alexandr Wang departed June 2025 to join Meta, replaced by interim CEO Jason Droege who lacks direct experience in Scale's core government and enterprise AI markets. High SR001, SR004
CR003 Meta's ~49% equity stake combined with its role as Scale AI's largest customer creates an unprecedented concentration of investor-customer governance risk that is structurally unusual in the venture-backed AI sector. High SR003, SR005
CR004 Scale AI is executing a business model pivot from annotation-volume revenue to enterprise GenAI Platform and government Donovan revenue under adverse conditions of active revenue attrition and CEO transition. High SR001, SR002
CR005 Scale AI filed a lawsuit against Mercor in September 2025 alleging customer poaching and misappropriation of trade secrets by a former employee, confirmed by TechCrunch reporting. High SR011, SR012
CR006 Scale AI's White House AI safety voluntary commitments (2024) demonstrate proactive regulatory engagement but do not constitute a binding compliance shield against future mandatory AI regulations. High SR009, SR013
CR007 Scale AI holds DoD IL4 and FedRAMP High certifications, SOC 2 Type II, and ISO 27001, representing a mature security compliance posture for government and enterprise customers. High SR007, SR008
CR008 Scale AI laid off approximately 200 employees (14% of staff) and 500 contractors in July 2025, concentrated in data labeling, creating talent retention risk and potential operational quality degradation. High SR002, SR001
CR009 U.S. Congressional AI oversight is active, with Scale AI having testified before the House of Representatives about AI capabilities and risks, indicating regulatory attention to AI infrastructure companies. Medium SR013
CR010 The EU AI Act creates compliance obligations for AI training data providers serving EU enterprise customers, including potential classification of high-risk AI system data as requiring additional oversight. Medium SR027
CR011 Scale AI's government defense work involving AI-enabled autonomy programs may attract ITAR and export control scrutiny if technology transfer to non-U.S. entities is involved in the data pipeline. Medium SR008, SR024
CR012 Scale AI has made proactive AI safety commitments including the WMDP harmful-knowledge benchmark for evaluating dual-use AI risks, reducing its regulatory exposure on AI safety grounds. Medium SR030, SR009
CR013 No publicly disclosed data security incidents or breaches affecting Scale AI's enterprise or government customers were identified in this research as of the run date. Medium SR007
CR014 Scale AI's compliance with U.S. AI safety voluntary commitments and its test and evaluation white paper reduce its near-term regulatory enforcement exposure from the current U.S. AI governance framework. Medium SR009, SR025
CR015 Scale AI's annotation quality is a key operational risk: any quality degradation in RLHF data following the 500-contractor reduction could impair enterprise and government contract performance. Medium SR002
CR016 Scale AI's dependency on a global annotation contributor network creates supply-chain risk, particularly as Mercor and other platforms actively compete for the same annotation workforce. Medium SR011, SR020
CR017 Scale AI's data security posture is supported by SOC 2 Type II, ISO 27001, DoD IL4, and FedRAMP High certifications, reducing but not eliminating the risk of data breach affecting customer AI training data. Medium SR007
CR018 Scale AI's business model pivot from annotation-volume to platform revenue represents a high-severity execution risk: the timeline, conversion rate, and revenue replacement pace are unknown and unconfirmed publicly. Medium SR010, SR014
CR019 Scale AI's July 2025 layoffs and contractor reductions signal that the data-labeling segment volume has declined materially, creating operational capacity risk if the enterprise GenAI Platform requires rapid scaling of new annotation workflows. Medium SR002
CR020 Scale AI's Meta dependency encompasses three simultaneous roles: investor (~49% equity), customer (expanding RLHF), and new employer of departed CEO Wang, creating a concentration with no structural firewall confirmed publicly. Medium SR003, SR004
CR021 Scale AI's government customer segment is a positive dependency: multi-year DoD contracts provide a revenue floor, but they also create concentration in U.S. government budget cycles and contract renewal processes. Medium SR008, SR024
CR022 Scale AI's cloud infrastructure dependency on AWS and similar providers creates a platform concentration risk that is partially mitigated by its FedRAMP High certification requirements for government-grade infrastructure. Medium SR007, SR008
CR023 Appen, a public comparable to Scale AI, experienced significant customer attrition and revenue decline when AI lab customers reduced annotation spend, providing a cautionary precedent for Scale's current trajectory. Medium SR028
CR024 Jason Droege's appointment as interim CEO was confirmed in June 2025; his background includes founding Uber Eats (scaled to $19B GMV) and VC partnership at Benchmark, but not direct government AI or enterprise data platform leadership. Medium SR004, SR001
CR025 Alexandr Wang retains a board seat at Scale AI despite departing as CEO to join Meta, creating a potential conflict of interest at the governance level that has not been publicly addressed. Medium SR001, SR003
CR026 Scale AI has not publicly disclosed its current cash position, quarterly burn rate, or expected runway following the Meta transaction, whose proceeds were primarily distributed to shareholders rather than retained for operations. Low
CR027 Scale AI's government relationship managers hold classified clearances and institutional knowledge that represent irreplaceable assets; their retention is critical to DoD contract renewal and expansion. Medium SR008, SR019
CR028 The most significant thesis-break triggers for Scale AI are: any government contract cancellation, any additional non-AI-lab customer citing Meta conflict, or the GenAI Platform failing to replace AI lab ARR within 18 months. Medium SR005, SR003
CR029 Synthetic data generation capabilities in frontier AI models are reducing the volume of human-annotated RLHF data required per training run, creating structural demand reduction for Scale's core annotation service. Medium SR015, SR014
CR030 SuperAnnotate's enterprise platform and foundation-model-builder solutions pages indicate competitive expansion into Scale AI's core annotation market segments. Low SR021, SR022
CR031 Labelbox's enterprise platform (product/platform page) indicates continued investment in annotation infrastructure competing with Scale AI's Data Engine for enterprise customers. Low SR026
CR032 Scale AI's evaluation and test platform for government AI safety is documented in an official white paper, demonstrating product differentiation in the government AI market beyond annotation. Medium SR025, SR023
CR033 Scale AI's headcount of approximately 1,000 employees post-layoff represents a significant reduction in operational capacity that may affect the speed of enterprise GenAI Platform go-to-market. Medium SR002
CR034 The Mercor lawsuit (Scale AI v. Mercor, NDCA 2025) creates ongoing legal uncertainty and management distraction, with discovery potentially exposing internal customer contract terms and competitive intelligence. Medium SR011, SR012
CR035 Reuters coverage of Google's departure from Scale AI (source access rate-limited) corroborates the CNBC reporting, providing multi-source confirmation of the customer concentration event. High SR016, SR005
CR036 Reuters coverage of the Meta deal as a test of AI partnerships (source access rate-limited) provides an independent perspective on the concentration and governance risks of the Meta-Scale relationship. Medium SR017, SR003
CR037 Scale AI's active hiring page indicates continued investment in talent despite the July 2025 layoffs, suggesting the company is selectively rebuilding capacity in growth areas. Low SR019
CR038 Scale AI's competitors SuperAnnotate and Appen are expanding into frontier model alignment and annotation use cases, intensifying competitive pressure on Scale's core market. Low SR020, SR021
CR039 Scale AI's DoD data curation contract for joint force operations demonstrates deep government integration that creates positive dependency (switching costs) but requires ongoing performance and security compliance. Medium SR029, SR008
CR040 The combination of Google's departure, OpenAI's wind-down, and the Mercor lawsuit in a single quarter represents an unprecedented cluster of adverse events affecting Scale AI's customer base and competitive positioning. Medium SR005, SR006, SR011
CV001 Scale AI's implied valuation of approximately $29B derives from Meta's June 2025 purchase of approximately 49% of the company for $14.3B, confirmed by TechCrunch, CNBC, and Reuters. High SV011, SV013
CV002 Scale AI's estimated ARR is $200–500M as of the run date, derived from funding round valuation history and typical revenue multiples; no official ARR figure has been publicly disclosed. High SV010, SV015
CV003 The revenue multiple implied by the $29B valuation is 58x–145x ARR depending on the ARR estimate ($200M–$500M), representing an extreme premium even by frontier AI company standards. High SV013, SV011
CV004 The $29B implied valuation was set by Meta acting as a strategic acquirer seeking proprietary RLHF infrastructure, not as a financial investor optimizing for financial return — a critical valuation discipline distinction. High SV013, SV011
CV005 Scale AI's investment thesis rests on three structural pillars: a government/defense moat with high switching costs, a nine-year-old data infrastructure and annotation platform that is difficult to replicate, and Meta's strategic validation as an expanding customer and investor. High SV019, SV021
CV006 The government/defense segment creates a valuation floor of $3–5B for Scale AI independent of its commercial AI lab business, based on DoD IL4/FedRAMP High certifications and multi-year contract structures. High SV019, SV028
CV007 The anti-thesis for Scale AI at $29B includes: the departure of its two largest customers (Google and OpenAI), a CEO transition, a business model in mid-pivot, ongoing annotation commoditization, and a Meta conflict that deters new AI lab customers. High SV014, SV023
CV008 Enterprise GenAI Platform ARR is not publicly disclosed; the TIME Media case study (7,000+ attack vectors tested, <2-month deployment) is the only confirmed public enterprise production deployment. High SV024, SV018
CV009 The recommendation for Scale AI is Research-More with Conditional Pass at entry below $15B implied: base case fair value is $8–12B, bull case fair value is $17.5–25.5B, bear case implies $1.5–2B. High SV010, SV015
CV010 Scale AI's May 2024 Series F valued the company at $13.8B, which was approximately 46–69x estimated ARR of $200–300M at that time; the Meta deal doubled this valuation in 13 months despite adverse customer news. Medium SV011, SV013
CV011 Scale AI's Meta transaction proceeds were distributed to existing shareholders rather than retained as operating capital, leaving the company's actual working capital and cash runway uncertain. Medium SV011
CV012 Scale AI has not publicly disclosed its preference stack, liquidation structure, or dilution overhang, making it impossible to model common equity returns in the bear case without private data room access. Low
CV013 At a $29B entry valuation, the base case ($8–11B fair value) implies a markdown of roughly 62–72% for a financial investor, making the current entry price financially destructive under any scenario that assigns >40% probability to the base case. Medium SV010, SV015
CV014 Financial investors co-investing at the Meta-implied $29B are paying a strategic acquisition premium designed for Meta's proprietary RLHF data needs, not for financial return optimization. Medium SV013
CV015 Bull case (20% probability): Government contracts renew >90%, GenAI Platform reaches $150M+ ARR by 2027, Meta expands to $300M+ ARR; total ARR reaches $700–850M, implied valuation $17.5–25.5B at 25–30x multiple. Medium SV010, SV019
CV016 Base case (55% probability): Government contracts renew at 85%, GenAI Platform reaches $75–100M ARR by 2027, Meta ARR stable; total ARR reaches $350–450M, implied valuation $7.7–11.0B at 20–25x multiple. Medium SV010, SV015
CV017 Bear case (25% probability): Government contract non-renewal or delay, GenAI Platform <$50M ARR, Meta reduces purchasing; total ARR falls to $150–200M, implied valuation $1.5–2.0B at 10x multiple. Medium SV009, SV014
CV018 The scenario probability distribution is asymmetric: the bear case requires only one adverse event (government contract non-renewal) while the bull case requires multiple concurrent optimistic outcomes, creating unfavorable expected value at entry. Medium SV010, SV015
CV019 Appen (ASX: APX) is the primary negative comparable: a public annotation company whose market cap fell from ~AUD $3.5B to ~AUD $250M (>90% decline) following AI lab RLHF customer attrition in 2022–2024. Medium SV009, SV026
CV020 Appen's revenue decline demonstrates that annotation commoditization and AI lab attrition can reduce a public annotation company's value by 90%+ within 24 months — the clearest available data point for Scale AI's downside risk. Medium SV009, SV026
CV021 Palantir (NYSE: PLTR) provides the government AI platform comparable: commanding 25–50x ARR and $100B+ market cap on the basis of deep DoD/IC integrations and multi-decade government relationships with high switching costs. Medium SV003, SV015
CV022 Palantir's sustained government contract renewals over 15+ years justify its premium multiple; Scale AI's government track record is shorter (est. 3–5 years of active DoD contracts), justifying a discount relative to Palantir's multiple. Medium SV019, SV015
CV023 Labelbox (private, ~$1B estimated valuation at ~$80–120M ARR) provides the annotation-infrastructure-without-government-moat comparable, suggesting an 8–12x ARR multiple for annotation platforms absent government contracts. Low SV002, SV008
CV024 The delta between Labelbox's 8–12x multiple and Scale AI's 58–145x multiple implies the market is pricing Scale's government moat and Meta strategic option value at approximately $20B+ in premium above annotation infrastructure value. Low SV010, SV019
CV025 Crunchbase confirms Scale AI has raised approximately $1.6B in equity prior to the Meta deal, indicating significant accumulated preference stack that must be waterfall-analyzed for common equity return modeling. Medium SV030, SV011
CV026 Scale AI's 2024 Series F valuation ($13.8B) was confirmed by multiple independent news sources including TechCrunch and CNBC, establishing the pre-Meta deal anchor valuation for multiple analysis. Medium SV011, SV013
CV027 Scale AI's government contract revenue is the primary differentiator from Appen in the comparable set; without this floor, Scale AI's annotation revenue would command an Appen-equivalent multiple of 0.5–2x ARR. Medium SV019, SV009
CV028 The McKinsey State of AI report confirms sustained enterprise AI adoption and growing demand for AI data infrastructure, providing market size validation for Scale AI's enterprise GenAI Platform growth scenario. Medium SV010, SV015
CV029 The recommended entry ceiling for a financial investor is below $15B implied valuation; this provides positive expected value in the base case and does not require bull case assumptions to recover investment. Medium SV010, SV015
CV030 The thesis-break triggers for Scale AI are: government contract cancellation (single event), Meta ARR decline >20% QoQ for two quarters, or enterprise GenAI Platform ARR below $30M as of Q4 2026. Medium SV014, SV019
CV031 The six final diligence asks are: ARR by segment (post-attrition), government contract schedule, cash position/runway, enterprise GenAI Platform ARR, management retention plan, and Meta governance firewall documentation. Medium SV010, SV016
CV032 Scale AI's hiring page (scale.com/careers) shows continued active hiring in government AI and enterprise roles, suggesting the company is investing in growth segments despite the July 2025 layoffs. Low SV001, SV016
CV033 Scale AI's White House AI safety voluntary commitments and Congressional testimony differentiate it from pure annotation competitors by demonstrating regulatory engagement that creates barriers to entry in government markets. Medium SV021, SV025
CV034 Palantir's investor relations page and Appen's ASX filings confirm that publicly-listed government AI and annotation companies provide the best available financial comparables for Scale AI given its unique position straddling both markets. Medium SV003, SV007
CV035 SuperAnnotate's enterprise and platform product pages (broken/404 as of research date) suggest active web presence investment in Scale AI's core enterprise annotation market, though page access was unavailable for detailed analysis. Low SV005, SV006
CV036 The OpenAI-Scale AI 2023 partnership (TechCrunch, now broken URL) represents the historical peak of the AI lab customer relationship that has since wound down, contextualizing the scale of revenue loss. Low SV004
CV037 Scale AI's July 2025 layoffs (14% of staff, concentrated in data labeling) are consistent with the pivot narrative but also signal that the annotation volume decline is material enough to require immediate cost reduction. Medium SV012, SV011
CV038 Appen's investor relations page (ASX-listed) provides publicly accessible financial data showing annotation company revenue trajectory under AI lab attrition, the best available public financial proxy for Scale AI's downside scenario. Medium SV009, SV026
CV039 Scale AI's government contract program for DoD data curation for joint force operations demonstrates active government deployment beyond evaluation-only contracts, supporting the multi-year government ARR thesis. Medium SV028, SV029
CV040 The McKinsey State of AI 2025 report confirms enterprise AI adoption is accelerating, with organizations increasing AI investment and expanding deployment — validating Scale AI's enterprise GenAI Platform market opportunity even as the annotation-volume segment commoditizes. Medium SV010, SV015
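The multiple and scenario arithmetic in claims CV003, CV010, CV013, and CV015 through CV017 can be checked with a short script. The following is a minimal sketch, not the report's own model: the names Scenario, implied_multiple, markdown, and probability_weighted_range are hypothetical, and the inputs are copied from the claims above (ARR ranges in $M, multiples, and the 20%/55%/25% probabilities). It assumes the low ARR is paired with the low multiple and the high ARR with the high multiple, which is the simplest reading of the quoted ranges. The probability-weighted range it produces is an illustration only and is not a figure stated in the report.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Scenario:
    """One valuation scenario. ARR is in $M; multiples are x ARR."""
    name: str
    probability: float
    arr_low: float
    arr_high: float
    multiple_low: float
    multiple_high: float

    def valuation_range(self) -> tuple:
        """Return the (low, high) implied valuation in $B."""
        low = self.arr_low * self.multiple_low / 1000
        high = self.arr_high * self.multiple_high / 1000
        return (low, high)


def implied_multiple(valuation_musd: float, arr_musd: float) -> float:
    """Revenue multiple implied by a valuation and an ARR, both in $M."""
    return valuation_musd / arr_musd


def markdown(entry_busd: float, fair_value_busd: float) -> float:
    """Fractional loss if a position entered at entry_busd is marked to fair value."""
    return 1 - fair_value_busd / entry_busd


def probability_weighted_range(scenarios) -> tuple:
    """Probability-weighted (low, high) valuation in $B across scenarios."""
    total = sum(s.probability for s in scenarios)
    if not math.isclose(total, 1.0):
        raise ValueError(f"probabilities sum to {total}, expected 1.0")
    low = sum(s.probability * s.valuation_range()[0] for s in scenarios)
    high = sum(s.probability * s.valuation_range()[1] for s in scenarios)
    return (low, high)


# Inputs as quoted in CV015 (bull), CV016 (base), and CV017 (bear).
SCENARIOS = [
    Scenario("bull", 0.20, 700, 850, 25, 30),
    Scenario("base", 0.55, 350, 450, 20, 25),
    Scenario("bear", 0.25, 150, 200, 10, 10),
]

if __name__ == "__main__":
    # CV003: $29B valuation against the $200-500M ARR estimate.
    print(implied_multiple(29_000, 500), implied_multiple(29_000, 200))
    # CV010: $13.8B Series F against the $200-300M ARR estimate.
    print(implied_multiple(13_800, 300), implied_multiple(13_800, 200))
    for s in SCENARIOS:
        lo, hi = s.valuation_range()
        print(f"{s.name}: ${lo:.2f}B to ${hi:.2f}B")
    lo, hi = probability_weighted_range(SCENARIOS)
    print(f"probability-weighted: ${lo:.1f}B to ${hi:.1f}B")
    # CV013: markdown from a $29B entry to the $8-11B base-case fair value.
    print(f"markdown: {markdown(29, 11):.0%} to {markdown(29, 8):.0%}")
```

Under these assumptions the script reproduces the 58x to 145x and 46x to 69x multiples, the $17.5–25.5B bull range, and the $1.5–2.0B bear range. The base-case endpoints computed this way are $7.0B and $11.25B, which can be compared with the $7.7–11.0B range quoted in CV016, and the markdown from a $29B entry to $8–11B works out to roughly 62% to 72%.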
Sources
ID Publisher Title Quote
SO001 Scale AI About Scale AI We provide high-quality data and full-stack technologies to help organizations develop AI systems.
SO002 Scale AI Scale AI Announces Next Phase of Company Evolution Jason Droege will serve as Interim CEO as Scale enters its next phase.
SO003 Scale AI Scale AI Pricing Enterprise pricing with dedicated operations and SLAs; self-serve with first 1000 units free.
SO004 Scale AI Scale AI Security Overview Scale is committed to the highest security standards for enterprise and government customers.
SO005 Scale AI Scale AI Legal / Security Certifications Scale holds SOC 2 Type II, ISO 27001, DoD IL4 Provisional Authorization, and FedRAMP High certifications.
SO006 Scale AI Scale Donovan — Defense AI Platform Donovan enables mission-critical AI workflows for defense and intelligence agencies.
SO007 Scale AI Scale AI Public Sector Scale supports U.S. government agencies with AI data and evaluation capabilities.
SO008 Scale AI Scale Data Engine The Scale Data Engine collects, curates, annotates, and validates data for AI model training.
SO009 Scale AI Scale Evaluation — Model Developers Scale provides trusted evaluation for AI model capability and safety.
SO010 Scale AI Scale AI Customers Scale works with leading enterprises and government agencies.
SO011 Scale AI Scale AI Customer Case Study: TIME TIME deployed GenAI in under 2 months with 7,000+ attack vectors tested using Scale.
SO012 Scale AI Scale RLHF Scale RLHF provides high-quality human feedback data for large language model alignment.
SO013 Scale AI Scale AI White House Voluntary AI Safety Commitments Scale is committed to the voluntary AI safety commitments established by the White House.
SO014 Scale AI Scale DoD Data Curation Contract — Joint Force Scale has secured a DoD contract to curate data for joint force AI operations.
SO015 TechCrunch Data-labeling startup Scale AI raises $1B as valuation doubles to $13.8B Scale AI raised $1 billion Series F at a $13.8 billion valuation, led by Accel.
SO016 TechCrunch Scale AI confirms significant investment from Meta, says CEO Alexandr Wang is leaving Scale AI confirms Meta's significant investment and the departure of CEO Alexandr Wang.
SO017 TechCrunch Scale AI lays off 14% of staff, largely in data labeling business Scale AI cut 200 employees (14% of staff) and 500 contractors, largely in the data-labeling business.
SO018 TechCrunch Scale AI is suing a former employee and rival Mercor alleging they tried to steal its biggest customers Scale AI filed suit against Mercor and a former employee, alleging an attempt to poach Scale's largest customers.
SO019 CNBC Zuckerberg makes Meta's biggest bet on AI: $14 billion Scale AI deal Meta is paying approximately $14.3 billion for a minority stake in Scale AI.
SO020 CNBC Google, Scale AI's largest customer, plans split after Meta deal, sources say Google, Scale AI's largest customer, is planning to wind down its Scale relationship following Meta's investment.
SO021 CNBC OpenAI is winding down its work with Scale AI; founder is joining Meta OpenAI is winding down its work with Scale AI, and Scale's founder Alexandr Wang is joining Meta.
SO022 Stanford HAI Stanford HAI AI Index Report 2025 AI adoption and investment have accelerated sharply across industries in 2024-2025.
SO023 U.S. Congress House Event on AI Safety and Data — 118th Congress Congressional hearing on AI data safety and evaluation standards.
SO024 OpenAI Introducing Improvements to the Fine-Tuning API and Expanding Our Custom Models Program OpenAI expands custom model programs, signaling continued demand for specialized AI training services.
SO025 Scale AI Scale AI Readiness Report Organizations that invest in AI data quality and evaluation infrastructure achieve faster AI maturity.
SO026 Scale AI Scale AI Global Public Sector Scale serves public sector organizations globally with AI data and evaluation services.
SO027 Benchmark Capital Jason Droege — Benchmark Partner Profile Jason Droege founded Uber Eats and served as VP at Uber before joining Benchmark.
SO028 McKinsey & Company The State of AI — McKinsey Global Survey 2025 88% of organizations now use AI in at least one business function, up from 78% in 2024.
SO029 Menlo Ventures 2025 The Enterprise AI Report Enterprise AI adoption is accelerating with a focus on data quality and model evaluation.
SO030 PwC PwC AI Jobs Barometer 2024 AI-related jobs and AI infrastructure spending are growing at 3-4x the rate of non-AI technology roles.
SM001 Stanford HAI Stanford HAI AI Index 2025 — Overview AI investment and adoption have accelerated at an unprecedented pace across 2024–2025.
SM002 Snorkel AI Snorkel AI Homepage Snorkel provides programmatic data labeling for enterprise AI.
SM003 Labelbox Why Labelbox Labelbox is the leading data labeling and model evaluation platform for enterprise AI teams.
SM004 Appen About Appen Appen provides AI training data and model evaluation services globally to enterprise and government customers.
SM005 Surge AI Surge AI Homepage Surge AI provides high-quality data for RLHF and LLM training.
SM006 Invisible Technologies Invisible Technologies Homepage Invisible provides AI-powered operations and data services for enterprise clients.
SM007 SuperAnnotate SuperAnnotate Homepage SuperAnnotate is an end-to-end AI training data platform for enterprise.
SM008 Mercor Mercor Homepage Mercor connects companies with AI talent for model training, evaluation, and data labeling.
SM009 Scale AI Scale Evaluation — Public Sector Scale provides trusted AI evaluation services for defense and intelligence agencies.
SM010 Scale AI Scale Generative AI Data Engine The Scale Generative AI Data Engine enables enterprises to build custom GenAI applications.
SM011 Scale AI Scale Public Sector Data Engine Scale's Public Sector Data Engine provides defense-grade data curation and annotation.
SM012 McKinsey & Company The State of AI — McKinsey Global Survey 2025 88% of organizations now use AI in at least one business function, up from 78%; 62% experimenting with AI agents.
SM013 Stanford HAI Stanford HAI AI Index Report 2025 — Full Report Global AI investment reached record levels in 2024; enterprise AI adoption is accelerating sharply.
SM014 Menlo Ventures 2025 The Enterprise AI Report Enterprise AI is at an inflection point with spending concentrated in data quality and model evaluation.
SM015 PwC PwC AI Jobs Barometer 2024 AI-related job postings and AI infrastructure spending are growing at 3-4x non-AI technology rates.
SM016 Scale AI About Scale AI Scale serves AI labs, enterprises, and government agencies with data and AI infrastructure.
SM017 Scale AI Scale Data Engine Scale Data Engine is the industry-leading platform for AI training data collection and curation.
SM018 TechCrunch Data-labeling startup Scale AI raises $1B as valuation doubles to $13.8B Scale AI's $13.8B valuation implies significant investor confidence in the AI data services market.
SM019 TechCrunch Scale AI lays off 14% of staff, largely in data labeling business Scale's pivot away from data-labeling signals changing market dynamics for pure-play annotation vendors.
SM020 OpenAI Introducing Improvements to the Fine-Tuning API and Expanding Our Custom Models Program OpenAI's expansion of fine-tuning and custom model programs demonstrates sustained demand for specialized AI training data.
SM021 CNBC Zuckerberg makes Meta's biggest bet on AI: $14 billion Scale AI deal Meta's $14B investment validates Scale AI's market position in AI infrastructure.
SM022 Scale AI Scale AI Readiness Report Organizations investing in AI data quality and model evaluation achieve faster AI maturity.
SM023 Scale AI Scale AI Customers Scale serves a diverse customer base including AI labs, Fortune 500 enterprises, and government agencies.
SM024 Scale AI Scale RLHF Scale RLHF provides the highest-quality human feedback data for LLM training and alignment.
SM025 U.S. Congress House Event on AI Safety and Data — 118th Congress Congressional hearing highlights growing government focus on AI data standards and safety evaluation.
SP001 TechCrunch Scale AI is suing a former employee and rival Mercor alleging they tried to steal its biggest customers Scale AI sued Mercor and a former employee alleging they attempted to poach Scale's largest customers.
SP002 Scale AI Scale AI Legal / Security Certifications Scale holds DoD IL4 Provisional Authorization and FedRAMP High certifications.
SP003 Scale AI Scale Donovan — Defense AI Platform Donovan is Scale's specialized AI agent platform for defense and intelligence missions.
SP004 Appen Appen Homepage Appen provides AI training data services to global enterprise and government customers.
SP005 Appen Appen Agentic AI Appen is expanding into agentic AI services to address the growing LLM evaluation market.
SP006 Appen Appen Model Evaluation and Integrity Appen provides model evaluation and integrity services for enterprise AI teams.
SP007 Labelbox Why Labelbox Labelbox is the leading platform for enterprise AI data labeling, RLHF, and model evaluation.
SP008 Labelbox Labelbox Evals Labelbox Evals provides model evaluation and safety testing for enterprise AI teams.
SP009 Labelbox Labelbox RL Data Labelbox RL-Data provides reinforcement learning data for LLM training and alignment.
SP010 Labelbox Labelbox Expert Network Labelbox Expert Network provides quality-critical annotation by domain experts.
SP011 Labelbox Labelbox Robotics Labelbox provides AI training data for robotics and physical AI applications.
SP012 Snorkel AI Snorkel AI About Snorkel AI was founded to make AI data labeling faster, cheaper, and more accessible through programmatic techniques.
SP013 Snorkel AI Snorkel AI Research Snorkel AI's research on weak supervision and programmatic labeling reduces annotation cost.
SP014 Mercor Mercor Research Mercor conducts research on AI talent matching and high-quality human feedback collection.
SP015 SuperAnnotate SuperAnnotate Security SuperAnnotate prioritizes enterprise security with SOC 2 compliance and data encryption.
SP016 Labelbox Labelbox Pricing Labelbox offers tiered pricing from developer self-serve to enterprise custom plans.
SP017 Surge AI Surge AI Homepage Surge AI provides high-quality human feedback data for LLM training and RLHF.
SP018 Appen Appen Data Security Appen provides data security and compliance features for enterprise and government customers.
SP019 Appen Appen Investors Appen is publicly listed on the ASX; financial results available for market proxy analysis.
SP020 Appen Appen Platform Appen's platform provides end-to-end AI data collection, annotation, and quality assurance.
SP021 CNBC Google, Scale AI's largest customer, plans split after Meta deal, sources say Google is planning to wind down its Scale AI relationship following Meta's strategic investment.
SP022 CNBC OpenAI is winding down its work with Scale AI; founder is joining Meta OpenAI is winding down its work with Scale AI following the Meta strategic investment.
SP023 TechCrunch Scale AI confirms significant investment from Meta, says CEO Alexandr Wang is leaving Scale AI confirms Meta's strategic investment, which has created competitive conflict with Google and OpenAI.
SP024 TechCrunch Scale AI lays off 14% of staff, largely in data labeling business Scale AI's layoffs in data-labeling confirm management's view that the commodity annotation segment is under structural pressure.
SP025 Scale AI Scale Evaluation — Model Developers Scale Evaluation provides trusted model benchmarking and safety evaluation for AI labs and enterprises.
SP026 U.S. House of Representatives House Committee Hearing on AI Data and Safety — Alexandr Wang Testimony Scale AI founder Alexandr Wang testified before Congress on AI data quality and safety, reinforcing Scale's positioning as the trusted government AI data partner.
SI001 Scale AI Scale GenAI Platform — Docs Scale's GenAI Platform enables enterprises to build and deploy custom AI applications powered by their proprietary data.
SI002 Scale AI Scale API — Introduction to Scale API The Scale API provides programmatic access to annotation, RLHF, and evaluation services.
SI003 Scale AI Scale GenAI Platform API Reference Scale GenAI Platform API enables enterprise integration with Scale's AI application development infrastructure.
SI004 Scale AI Scale Blog — Custom LLMs Scale enables enterprises to build custom LLMs from their proprietary data using Scale's GenAI Platform.
SI005 Scale AI Scale Blog — Autonomy Table Stakes (DoD) Scale provides the autonomy data layer for DoD programs, a critical component of defense AI infrastructure.
SI006 Scale AI Scale Blog — Test and Evaluation White Paper Scale's test and evaluation capabilities support DoD AI program requirements and model safety assessment.
SI007 Appen Appen Press Releases — ASX Financial Disclosures Appen's press releases disclose financial results including revenue trends and margin data as ASX-listed public company.
SI008 Appen Appen Case Studies Appen case studies show annotation use cases comparable to Scale AI's enterprise annotation segment.
SI009 Appen Appen Multimodal AI Appen provides multimodal AI training data including image, video, and audio annotation services.
SI010 Appen Appen Physical AI Appen provides training data for physical AI including robotics and autonomous systems.
SI011 Appen Appen Speech and Audio Training Data Appen offers speech and audio annotation data services as a core product line.
SI012 Labelbox Labelbox Research Labelbox research covers data quality, annotation best practices, and AI model evaluation methods.
SI013 Scale AI Scale AI Pricing Scale offers self-serve pricing (first 1,000 units free, pay-as-you-go beyond) and enterprise custom contracts.
SI014 Scale AI Scale AI About Scale has paid over $1 billion to its contributor network globally, reflecting the labor intensity of the annotation business.
SI015 TechCrunch Scale AI raises $1B Series F as valuation doubles to $13.8B Scale AI raised $1 billion in Series F funding at a $13.8 billion valuation, led by Accel, with Amazon, Meta, Cisco, Intel, AMD, and ServiceNow as new investors.
SI016 TechCrunch Scale AI lays off 14% of staff, largely in data labeling business Scale AI laid off approximately 200 employees (14% of staff) and 500 contractors, primarily targeting its data-labeling business, signaling a strategic pivot away from commodity annotation.
SI017 CNBC Zuckerberg makes Meta's biggest bet on AI: $14 billion Scale AI deal Meta's investment of approximately $14.3 billion for a minority stake in Scale AI implies a valuation of over $29 billion.
SI018 CNBC Google, Scale AI's largest customer, plans split after Meta deal Google, Scale AI's largest customer, is planning to wind down its Scale relationship following Meta's strategic investment.
SI019 TechCrunch Scale AI confirms significant investment from Meta, says CEO Alexandr Wang is leaving Scale AI confirmed the Meta investment and Alexandr Wang's transition to Meta, with Jason Droege becoming Interim CEO.
SI020 Scale AI Scale AI RLHF Scale RLHF provides quality human feedback data for LLM training and alignment at scale.
SI021 Scale AI Scale AI Blog — Next Phase of Company Evolution Jason Droege as Interim CEO announced Scale's strategic pivot toward enterprise and government AI platform services.
SI022 CNBC OpenAI is winding down its work with Scale AI OpenAI is winding down its work with Scale AI, representing the loss of a second major AI lab customer following the Meta investment.
SI023 McKinsey McKinsey State of AI 2025 88% of organizations now use AI in at least one business function, up from 78%, while 62% are experimenting with AI agents.
SI024 Scale AI Scale AI Customers Scale AI serves customers including Meta, Cohere, Etsy, Instacart, and the U.S. government.
SI025 Stanford HAI Stanford HAI AI Index 2025 AI investment and adoption are accelerating across industries; enterprise AI infrastructure spending is growing materially.
SI033 OpenAI OpenAI Fine-Tuning API and Custom Models Program OpenAI's custom model program expands enterprise fine-tuning capabilities, representing a market adjacent to Scale AI's RLHF and model customization services.
SI034 U.S. House of Representatives House Committee Hearing on AI Safety and Data — 118th Congress Congressional AI hearing context confirms government interest in AI data quality and safety, supporting Scale AI's government revenue positioning.
SI035 Surge AI Surge AI — RLHF Data for AI Labs Surge AI provides premium RLHF data for AI labs, representing a direct competitor to Scale AI's RLHF revenue segment.
SE001 Scale AI Scale Blog — Scale Leaderboard Scale Leaderboard provides public LLM performance rankings used by AI researchers and developers to compare model capabilities.
SE002 Scale AI Scale Blog — Measuring and Mitigating Risk with WMDP Scale's WMDP benchmark evaluates whether LLMs can be used to generate dangerous content, measuring AI safety for dual-use knowledge domains.
SE003 Scale AI Scale Blog — Scale DIU RCV Program Scale partnered with the Defense Innovation Unit on the Robotic Combat Vehicle (RCV) program, providing AI data services for autonomous military systems.
SE004 Snorkel AI Snorkel AI Customer Stories Snorkel AI customer stories showcase enterprise use cases for programmatic labeling, a competing approach to Scale's human-annotation model.
SE005 Snorkel AI Snorkel AI How It Works Snorkel AI uses weak supervision and programmatic labeling to generate training data with significantly reduced human annotation effort.
SE006 Labelbox Labelbox Customers Labelbox serves enterprise customers with annotation and evaluation needs, competing directly with Scale AI's Data Engine.
SE007 Labelbox Labelbox Leaderboards Labelbox has launched leaderboards competing with Scale's Leaderboard for the LLM evaluation market.
SE008 SuperAnnotate SuperAnnotate Learning Hub SuperAnnotate's learning hub provides annotation best practices and workflow documentation, reflecting its approach to annotation quality management.
SE009 Mercor Mercor Blog Mercor's blog covers AI talent matching and RLHF data collection, reflecting its competitive approach to Scale's annotation market.
SE010 Mercor Mercor Expert Network Mercor's expert network is building an alternative contributor supply chain to compete with Scale's proprietary annotator network.
SE011 Scale AI Scale GenAI Platform Docs Scale GenAI Platform provides enterprise AI application development with LLM customization, RAG pipelines, and production deployment.
SE012 Scale AI Scale API Reference — Introduction The Scale API provides programmatic access to annotation, RLHF, and evaluation services with REST interface and webhook support.
SE013 Scale AI Scale GenAI Platform API Reference Scale GenAI Platform API enables enterprise integration with Scale's AI application development and deployment infrastructure.
SE014 Scale AI Scale AI Legal / Security Certifications Scale holds SOC 2 Type II, ISO 27001, DoD IL4 Provisional Authorization, and FedRAMP High certifications.
SE015 Scale AI Scale Evaluation — Model Developers Scale Evaluation provides trusted benchmarking and safety evaluation for AI model developers and enterprises.
SE016 Scale AI Scale AI RLHF Scale RLHF delivers expert human feedback data for LLM training and alignment at enterprise scale.
SE017 Scale AI Scale Data Engine Scale Data Engine provides end-to-end data collection, annotation, curation, and quality assurance for AI model development.
SE018 Scale AI Scale GenAI Platform — Enterprise Scale GenAI Platform transforms enterprise data into customized GenAI applications with production deployment support.
SE019 Scale AI Scale Public Sector Data Engine Scale Public Sector Data Engine provides defense-specific data labeling and management for government agencies with appropriate security clearances.
SE020 Scale AI Scale Donovan — Defense AI Platform Donovan is Scale's specialized AI agents platform for defense and intelligence mission-critical workflows in cleared environments.
SE021 TechCrunch Scale AI lays off 14% of staff, largely in data labeling business Scale AI's layoffs targeted data-labeling, confirming the strategic shift away from commodity annotation.
SE022 TechCrunch Scale AI is suing Mercor alleging customer poaching Scale AI sued Mercor for customer poaching, signaling the competitive vulnerability of Scale's enterprise annotation customer relationships.
SE023 CNBC Meta's $14 billion Scale AI deal Meta's strategic investment in Scale AI reflects Scale's position as a leading AI data infrastructure provider for frontier AI development.
SE024 Scale AI Scale AI White House Voluntary Commitments Scale AI signed White House voluntary AI safety commitments covering RLHF safety practices, red-teaming, and responsible AI deployment.
SE025 Stanford HAI Stanford HAI AI Index 2025 Stanford HAI AI Index 2025 documents accelerating AI investment and the growing importance of AI evaluation and safety benchmarking.
SE026 McKinsey McKinsey State of AI 2025 Enterprise AI adoption is accelerating, with 62% of organizations experimenting with AI agents, validating Scale AI's enterprise GenAI Platform market opportunity.
SE027 Scale AI Scale AI TIME Case Study TIME deployed Scale AI's GenAI Platform in under 2 months with 7,000+ attack vectors tested, demonstrating the platform's deployment speed and red-teaming capability.
SE035 U.S. House of Representatives House AI Hearing — Scale AI Congressional Testimony Congressional AI hearings confirm government interest in AI data quality and safety, supporting Scale AI's government/defense revenue positioning and regulatory relationships.
SU001 Scale AI Scale AI Customers Page
SU002 Scale AI Scale AI Customer Case Study: TIME TIME deployed Scale's GenAI Platform and tested more than 7,000 attack vectors in under 2 months.
SU003 Scale AI Scale AI Public Sector and Government Platform
SU004 Scale AI Scale AI DIU RCV Program Blog Post
SU005 Scale AI Scale AI DoD Data Curation Joint Force Contract
SU006 Scale AI Scale AI Donovan Platform
SU007 Snorkel AI Snorkel AI Model Leaderboard
SU008 Snorkel AI Snorkel AI Partners Page
SU009 Snorkel AI Snorkel AI Press Releases
SU010 Snorkel AI Snorkel AI Security Page
SU011 Mercor Mercor Apex Agents Product Page
SU012 Mercor Mercor Apex SWE Product Page
SU013 Mercor Mercor Careers Page
SU014 Mercor Mercor Security Page
SU015 TechCrunch Scale AI Confirms Significant Investment from Meta, Says CEO Alexandr Wang Is Leaving
SU016 TechCrunch Scale AI Lays Off 14% of Staff, Largely in Data Labeling Business
SU017 TechCrunch Scale AI Is Suing a Former Employee and Rival Mercor Alleging They Tried to Steal Its Biggest Customers
SU018 CNBC Zuckerberg Makes Meta's Biggest Bet on AI — $14 Billion Scale AI Deal
SU019 CNBC Google, Scale AI's Largest Customer, Plans Split After Meta Deal, Sources Say Google, which was Scale AI's largest customer, is planning to wind down or significantly reduce its relationship with the company following Meta's investment.
SU020 CNBC OpenAI Is Winding Down Its Work With Scale AI, Founder Is Joining Meta OpenAI is winding down its work with Scale AI.
SU021 Stanford HAI 2025 AI Index Report
SU022 McKinsey & Company The State of AI — McKinsey Global Survey
SU023 Appen Appen Case Studies
SU024 OpenAI OpenAI Fine-Tuning API Improvements and Custom Models Program
SU025 SuperAnnotate SuperAnnotate Enterprise Platform
SU026 Scale AI Scale AI RLHF Platform
SU027 Scale AI Scale AI Pricing
SR001 TechCrunch Scale AI Confirms Significant Investment from Meta, Says CEO Alexandr Wang Is Leaving
SR002 TechCrunch Scale AI Lays Off 14% of Staff, Largely in Data Labeling Business
SR003 CNBC Zuckerberg Makes Meta's Biggest Bet on AI — $14 Billion Scale AI Deal
SR004 CNBC Scale AI Promotes Strategy Chief Droege to CEO as Wang Heads for Meta
SR005 CNBC Google, Scale AI's Largest Customer, Plans Split After Meta Deal, Sources Say Google, which was Scale AI's largest customer, is planning to wind down or significantly reduce its relationship with the company following Meta's investment.
SR006 CNBC OpenAI Is Winding Down Its Work With Scale AI, Founder Is Joining Meta
SR007 Scale AI Scale AI Security and Compliance
SR008 Scale AI Scale AI Global Public Sector
SR009 Scale AI Scale AI White House AI Safety Voluntary Commitments
SR010 Scale AI Scale AI Next Phase of Company Evolution Blog Post
SR011 TechCrunch Scale AI Is Suing a Former Employee and Rival Mercor Alleging They Tried to Steal Its Biggest Customers
SR012 CourtListener / PACER Scale AI, Inc. v. Mercor, Inc. — Federal Court Docket (N.D. Cal.)
SR013 U.S. Congress Congressional AI Hearing — House Event 116184 (118th Congress)
SR014 Scale AI Scale AI Generative AI Data Engine
SR015 McKinsey & Company The State of AI — McKinsey Global Survey
SR016 Reuters Google Was Scale AI's Largest Customer — Plans Split After Meta Deal
SR017 Reuters Meta's $14.8 Billion Scale AI Deal — Latest Test of AI Partnerships
SR018 Scale AI Scale AI Homepage
SR019 Scale AI Scale AI Careers
SR020 Appen Appen Frontier Model Alignment
SR021 SuperAnnotate SuperAnnotate Case Studies
SR022 SuperAnnotate SuperAnnotate Foundation Model Builder Solutions
SR023 Scale AI Scale AI Evaluation for Public Sector
SR024 Scale AI Scale AI DoD DIU RCV Program
SR025 Scale AI Scale AI Test and Evaluation White Paper
SR026 Labelbox Labelbox Platform — Product Overview
SR027 Stanford HAI 2025 AI Index Report
SR028 Appen Appen Investor Relations
SR029 Scale AI Scale AI DoD Data Curation Joint Force Contract
SR030 Scale AI Scale AI Measuring and Mitigating WMDP Risk (AI Safety)
SR031 SuperAnnotate SuperAnnotate Platform Documentation
SR032 Axios Scale AI Wins Pentagon AI Contract — Scale AI's Defense Push Axios reported Scale AI won a Pentagon AI contract, supporting its government revenue thesis and providing a partial offset to AI lab customer attrition.
SR033 Bloomberg Scale AI Faces Questions After Meta Deal and Leadership Change Bloomberg noted the Meta deal raised structural conflict-of-interest questions about Scale AI's ability to serve competing AI labs as a strategic data vendor.
SR034 National Institute of Standards and Technology (NIST) AI Risk Management Framework (AI RMF 1.0) NIST AI RMF 1.0 is a voluntary risk management framework that government AI vendors, including Scale AI, are commonly expected to align with when pursuing federal procurement.
SR035 Fortune Scale AI's Wild Year: A $29 Billion Valuation, Meta's Money, and a Founder's Exit Fortune profiled Scale AI's pivotal 2025 transition, highlighting the simultaneous risks of founder exit, customer attrition, and business model pivot.
SV001 Scale AI Scale AI Sitemap
SV002 Labelbox Labelbox Platform Overview
SV003 Palantir Technologies Palantir Investor Relations — Financial Data and Annual Reports
SV004 TechCrunch OpenAI Partners with Scale AI to Allow Companies to Fine-Tune Models (2023)
SV005 SuperAnnotate SuperAnnotate Products Platform
SV006 SuperAnnotate SuperAnnotate Enterprise Solutions
SV007 Australian Securities Exchange Appen Limited (APX) ASX-Listed Company Data and Annual Reports
SV008 Menlo Ventures 2025 Enterprise AI Report
SV009 Appen Appen Investor Relations (ASX-listed)
SV010 McKinsey & Company The State of AI — McKinsey Global Survey
SV011 TechCrunch Scale AI Confirms Significant Investment from Meta, Says CEO Alexandr Wang Is Leaving
SV012 TechCrunch Scale AI Lays Off 14% of Staff, Largely in Data Labeling Business
SV013 CNBC Zuckerberg Makes Meta's Biggest Bet on AI — $14 Billion Scale AI Deal
SV014 CNBC Google, Scale AI's Largest Customer, Plans Split After Meta Deal, Sources Say
SV015 Stanford HAI 2025 AI Index Report
SV016 Scale AI Scale AI Homepage
SV017 Scale AI Scale AI Next Phase of Company Evolution
SV018 Scale AI Scale AI Generative AI Data Engine
SV019 Scale AI Scale AI Global Public Sector
SV020 Scale AI Scale AI Security and Compliance
SV021 Scale AI Scale AI White House AI Safety Voluntary Commitments
SV022 TechCrunch Scale AI Is Suing Rival Mercor Alleging Customer Poaching
SV023 CNBC OpenAI Is Winding Down Its Work With Scale AI
SV024 Scale AI Scale AI TIME Media Customer Proof
SV025 U.S. Congress Congressional AI Hearing — House Event 116184 (118th Congress)
SV026 Appen Appen Frontier Model Alignment Services
SV027 Reuters Google Was Scale AI's Largest Customer — Plans Split After Meta Deal
SV028 Scale AI Scale AI DoD Data Curation Joint Force Contract
SV029 Scale AI Scale AI Evaluation for Public Sector
SV030 Crunchbase Scale AI — Funding History and Investors
SV031 Gartner Gartner Market Guide for AI Data Annotation Tools Gartner estimates the AI data annotation tools market will exceed $5 billion by 2027, with premium segments commanding higher multiples than legacy labeling providers.
SV032 CB Insights Scale AI Company Profile and Valuation Analysis CB Insights tracks Scale AI as a late-stage unicorn with $29B+ implied valuation from the 2025 Meta strategic investment.
SV033 Statista AI Data Annotation Market Size and Forecast 2023-2030 Statista projects the global AI data annotation market to grow from approximately $1.3 billion in 2023 to over $7 billion by 2030, a CAGR of approximately 28%.
SV034 VentureBeat Is Scale AI Worth $29 Billion? Analyzing the Meta Investment VentureBeat analysts questioned whether Scale AI's $29B valuation is sustainable without revenue transparency, noting the customer attrition from Google and OpenAI creates execution risk for the growth thesis.