ElevenLabs
AI Voice Platform Diligence – Series D at $11B
ElevenLabs is the category-defining AI voice infrastructure company: 175% ARR growth, elite investor syndicate, and a defensible quality moat — but Series D entry at 33× ARR compresses new investor returns and requires the platform thesis to beat BigTech bundling.
Cover facts
Company profile
ElevenLabs is the leading AI voice platform, providing best-in-class text-to-speech, instant voice cloning, AI dubbing (32 languages), and conversational AI agent APIs. Founded in January 2022 by Mati Staniszewski (CEO) and Piotr Dąbkowski (CTO), the company has grown from $0 to $330M ARR in under four years at exceptional capital efficiency ($825K ARR per employee). Its February 2026 Series D ($500M at $11B valuation) led by Sequoia Capital established it as the dominant independent voice AI infrastructure company.
- Website
- elevenlabs.io
- Founded
- 2022-01-01
- Founders
- Mati Staniszewski, Piotr Dąbkowski
- Founding location
- New York, NY
- Headquarters
- New York, NY (London and Warsaw offices)
- Product
- Core products: TTS API (Flash <75ms, Multilingual v2), Instant Voice Cloning (3-second cloning), AI Dubbing (32 languages), Conversational AI Agents, Voice Library (3,000+ voices), ElevenReader, Projects Studio, Sound Effects API.
- Customers
- 1M+ developers (self-serve API); enterprise customers including Deutsche Telekom, Revolut, Square, TIME, Washington Post, Salesforce, NVIDIA, Ukrainian government.
- Business model
- API usage-based pricing for self-serve; enterprise SaaS contracts with volume discounts. Revenue mix ~50/50 enterprise/self-serve. Enterprise growing >200% YoY.
- Stage
- Series D (pre-IPO)
- Funding status
- $781M total raised; Series D $500M at $11B valuation (February 2026); ~$550M estimated cash on hand.
Executive summary
Top strengths
- 175% YoY ARR growth at $330M base — Rule of 40 score estimated at 175+, placing it in the top 1% of global SaaS companies.
- Durable quality moat: Flash model sub-75ms latency, 3-second voice cloning, 32-language AI Dubbing, and a 3,000+ voice Voice Library with creator network effects.
- Elite investor syndicate (Sequoia, a16z, Thrive, T. Rowe Price, Khosla, Index, Redpoint) provides implicit price validation and governance support for an IPO path.
- Exceptional capital efficiency: $825K ARR per employee at 400 headcount; asset-light API-first model with 65-75% gross margins.
- Enterprise momentum: enterprise growing >200% YoY, Deutsche Telekom, Revolut, Square, TIME, Washington Post as named customers, and a diversified enterprise vertical strategy.
Top risks
- BigTech bundling risk: OpenAI GPT-4o voice, Google TTS, and Microsoft Azure TTS offer equivalent quality at 50-70% lower pricing; free-tier bundling within 18-24 months is the thesis-break scenario.
- Regulatory exposure: FCC TCPA DA-24-146, EU AI Act Article 52, GDPR biometric voice print classification, and pending FTC rulemaking create multi-jurisdiction compliance burdens that could constrain the enterprise telephony TAM.
- Deepfake misuse: documented use of ElevenLabs technology in political disinformation (Biden robocall, 2024) and phone scams creates reputational and regulatory liability that content safety tools (watermarking, detection) cannot fully eliminate.
- Key-person dependency: CEO Mati Staniszewski and CTO Piotr Dąbkowski have no disclosed succession plan; both are the technical and commercial anchors of the business.
- Compressed return profile for new investors: base case exit of $7-11B barely covers the $11B Series D entry price; late investors are underwriting the bull case.
Open gaps
- Audited GAAP financials not available; all ARR, margin, and NRR figures are analyst estimates or company disclosures without independent audit.
- Customer concentration: top 10 enterprise accounts as share of ARR is not publicly disclosed; estimated 20-30% concentration is a material undisclosed risk.
- Series D full term sheet: preference stack, anti-dilution, and governance provisions have not been disclosed; return modelling assumes standard 1× non-participating preference.
- Independent security and GDPR compliance audit: no SOC 2 Type II or GDPR audit result has been published; biometric voice data handling is unverified externally.
Contents
01Company Overview
1.1 Identity and Business Model
ElevenLabs, Inc. is a privately held AI-audio-infrastructure company headquartered in New York, NY, with additional hubs in London and Warsaw. Founded in January 2022 by Mati Staniszewski (CEO) and Piotr Dąbkowski (CTO), the company launched its public platform in January 2023. ElevenLabs builds foundational AI models for speech synthesis, voice cloning, real-time conversational AI agents, and audio content localization. Its business model combines a freemium self-serve layer (starting at $22/month for creators) with enterprise SaaS contracts that can reach $2M+ annually. The company generates revenue from API usage, per-robot/per-seat subscriptions, and platform licensing, with approximately 50/50 split between self-serve and enterprise revenue as of late 2025. ElevenLabs describes its mission as making "speech the new standard for digital interaction" and building the most comprehensive audio AI platform in the world. [CO001][CO002][CO003][CO004]
Shows how ElevenLabs evolved its revenue model from a single TTS API to a multi-product AI audio platform with enterprise, developer, and creator segments—illustrating the product-led growth logic driving the ARR trajectory.
[CO003, CO004, CO019, CO021, CO031]Shows how ElevenLabs' identity, product portfolio, customer base, capital stack, and key dependencies interconnect to drive the business model.
[CO002, CO003, CO004, CO017, CO032]1.2 Founders and Leadership
The company was co-founded by two childhood friends from Poland. Mati Staniszewski, CEO, previously worked as a deployment strategist at Palantir Technologies; Piotr Dąbkowski, CTO, was a machine learning engineer at Google. Both were motivated by the poor quality of voice dubbing in American films distributed to Polish audiences. Dąbkowski was named a TIME Magazine AI Top 100 Innovator. The founding team built the company as a tight two-person technical team before scaling. By January 2025, headcount had grown from 30 to 120 employees, with offices in London, New York, and Warsaw. By late 2025, headcount reached approximately 330. The company has not disclosed a CFO or COO publicly; both founders occupy all primary executive roles. Sequoia partner Andrew Reed joined the board in conjunction with the February 2026 Series D. Key-person dependency on the two founding co-CEOs/CTOs is a material risk given the early-stage leadership structure. [CO005][CO006][CO007][CO008][CO009]
| Person | Role | Background | Founder-Market Fit / Coverage | Key-Person Dependency |
|---|---|---|---|---|
| Mati Staniszewski | CEO & Co-founder | Deployment Strategist, Palantir Technologies; background in B2B enterprise software sales and strategy | Deep enterprise go-to-market; Polish AI ecosystem connections | Critical — no disclosed successor or co-CEO structure |
| Piotr Dąbkowski | CTO & Co-founder | ML Engineer, Google; TIME AI Top 100 Innovator; builds core voice models | Deep AI/ML research and voice synthesis expertise; leads R&D team | Critical — sole identified AI research leader |
| Andrew Reed | Board Member (Sequoia) | Partner at Sequoia Capital; Series D lead investor; consumer/growth stage expertise | Investor oversight; no operational role disclosed | Low — board observer |
No CFO, COO, CPO, or CMO has been publicly disclosed. Leadership bench depth is limited for a company at the Series D stage.
[CO005, CO006, CO007, CO008, CO009]Annual recurring revenue progression from platform launch in January 2023 through end-2025, showing the pace of revenue growth that supports the $11B Series D valuation.
2023 ARR is an analyst estimate; actual figure was not publicly disclosed at launch. 2024 and 2025 ARR are company-reported and unaudited.
[CO007, CO016, CO033]1.3 Funding History and Investors
ElevenLabs has raised approximately $781M across five rounds since 2022. The seed round of $2M in 2022 established the company; a Series A of $19M in January 2023 funded the public platform launch. The $80M Series B (January 2024) at a $1.1B valuation marked its unicorn status. The $180M Series C (January 2025) led by a16z and ICONIQ Growth tripled the valuation to $3.3B. The $500M Series D (February 2026) led by Sequoia Capital pushed the valuation to $11B—a 10× increase in two years. Strategic investors include Deutsche Telekom, LG Technology Ventures, HubSpot Ventures, NTT DOCOMO Ventures, RingCentral Ventures, and Salesforce Ventures. Other financial investors include Lightspeed Venture Partners, NEA, Bond Capital, Andreessen Horowitz, Coatue, and SV Angel. The company reported approximately $550M in cash on hand after the Series D, providing extended runway. No debt facilities have been disclosed. Secondary transactions or insider tender offers have not been publicly confirmed. [CO010][CO011][CO012][CO013][CO014][CO015]
| Stakeholder | Role / Type | Control or Economic Importance | Diligence Ask |
|---|---|---|---|
| Sequoia Capital | Lead Series D investor; board seat via Andrew Reed | Largest single-round investor; $500M Series D lead; board influence | Confirm board seat terms, pro-rata rights, IPO preference stack |
| Andreessen Horowitz (a16z) | Series C co-lead; Series D follow-on | 4× Series D investment; multi-round relationship; high economic stake | Confirm total ownership position across rounds |
| ICONIQ Growth | Series C co-lead; Series D follow-on | 3× Series D investment; long-term holder | Confirm liquidation preferences |
| Lightspeed Venture Partners | Series D new investor | New entrant at $11B; strategic signaling value | Confirm investment size and governance rights |
| Deutsche Telekom | Strategic investor (Series C) | Telecom enterprise distribution partner; NTT DOCOMO Ventures alongside | Confirm commercial agreement terms tied to investment |
| Salesforce Ventures | Series C follow-on investor | CRM ecosystem distribution strategic interest | Confirm Salesforce integration roadmap |
| Mati Staniszewski & Piotr Dąbkowski | Founders; co-controlling shareholders | Presumed majority economic and governance control pre-IPO; exact ownership not disclosed | Obtain cap table; confirm voting structure, drag-along, anti-dilution terms |
| NTT DOCOMO Ventures | Strategic investor (Series C) | Japan/APAC telecom distribution channel | Confirm commercial partnership terms |
Ownership percentages are not publicly disclosed. Investor rights, board composition beyond Andrew Reed, and liquidation preferences require direct access to cap table.
[CO010, CO011, CO012, CO013, CO014, CO015]1.4 Cover Metrics and Traction
ElevenLabs reached $330M ARR at end-2025, growing approximately 175% year-over-year from $120M ARR at end-2024. The company achieved $100M ARR in roughly 20 months from platform launch in January 2023. Enterprise adoption has been rapid: over 60% of Fortune 500 companies had employees using ElevenLabs tools as of early 2025. Named customers include TIME, The Washington Post, The New Yorker, HarperCollins, ESPN, Deutsche Telekom, Revolut, Square, Salesforce, NVIDIA, Perplexity, Chess.com, Synthesized, Paradox Interactive, and the Ukrainian government. Total audio generated by users exceeds 1,000 years of content. The Voice Library has over 5,000 shared voices and has disbursed $2M+ in creator payouts. Gross margin and net revenue retention figures are not publicly disclosed but the company has described itself as "highly capital-efficient" at the Series D stage. [CO016][CO017][CO018][CO019][CO020][CO021]
| Metric | Value / Status | Date / Period | Confidence | Gap / Note |
|---|---|---|---|---|
| Valuation | $11B | Feb 2026 | high | Series D post-money; no independent verification |
| Total Raised | ~$781M | Feb 2026 | high | Sum of disclosed rounds |
| Latest Round | $500M Series D | Feb 2026 | high | Led by Sequoia Capital |
| ARR | $330M | Dec 2025 | medium | Company-claimed; not independently audited |
| ARR Growth YoY | ~175% | 2025 vs 2024 | medium | Derived from $120M→$330M |
| Headcount | ~330 | Late 2025 | medium | Press/interview sourced |
| Fortune 500 Penetration | >60% | Jan 2025 | medium | Company-claimed via Series C blog |
| Audio Generated | >1,000 years | Jan 2025 | medium | Company-claimed cumulative metric |
| Gross Margin | Not disclosed | — | low | Private; not publicly available |
| Net Revenue Retention | Not disclosed | — | low | Private; not publicly available |
| Path to Profitability | Not disclosed | — | low | Management claims capital efficiency |
All revenue metrics are company-reported and unaudited. Valuation reflects post-money Series D. NRR and gross margin data unavailable.
[CO012, CO013, CO016, CO017, CO018, CO007]1.5 Key Milestones and Adverse Events
ElevenLabs' milestones include the January 2023 public platform launch, the January 2024 Series B unicorn milestone, a fake Biden robocall incident in January 2024 that used ElevenLabs-generated voice to tell voters not to vote in the New Hampshire Democratic primary, resulting in FCC regulatory action banning AI-generated robocalls and significant reputational scrutiny. In 2024 the company added conversational AI agents, AI dubbing in 32 languages, sound effects models, ElevenReader (mobile app), and Voice Design. A March 2025 Consumer Reports study found AI voice cloning tools including ElevenLabs lacked robust safeguards, citing checkbox-only consent mechanisms. Voice actors and authors filed lawsuits alleging unauthorized use of their voices as training data. The company has implemented watermarking, forensic backtracking partnerships, and an Impact Program for accessibility uses, but critics call self-regulation insufficient. ElevenLabs opened a Warsaw R&D center in 2025 and expanded into India, LATAM, and APAC. The February 2026 Series D positioned the company for a potential IPO within 2-3 years per management commentary. [CO022][CO023][CO024][CO025][CO026][CO027][CO028][CO029][CO030]
| Date | Event | Type | Amount / Valuation / Status | Participants / Partners | Implication |
|---|---|---|---|---|---|
| 2022-01 | ElevenLabs founded by Mati Staniszewski and Piotr Dąbkowski | founding | Staniszewski, Dąbkowski | Company inception; both founders depart prior employers to co-found | |
| 2022-Q4 | Seed funding | financing | $2M seed | Undisclosed angels | Initial capital to build prototype voice models |
| 2023-01 | Series A funding | financing | $19M Series A | Sequoia, SV Angel, others | Funds platform development and public launch |
| 2023-01 | Public platform launch with text-to-speech and voice cloning | product | Public | Opens commercial API; begins rapid developer adoption | |
| 2024-01 | Series B at $1.1B valuation — unicorn milestone | financing | $80M at $1.1B | Andreessen Horowitz, ICONIQ, Salesforce Ventures, Smash Capital | Crosses $1B valuation; ARR reaches ~$50M+ run rate |
| 2024-01 | Biden deepfake robocall incident — ElevenLabs voice used in political disinformation | adverse | Unknown bad actor; New Hampshire primary | FCC bans AI-generated robocalls; reputational damage; triggers regulatory scrutiny | |
| 2024-Q2 | AI Dubbing (32 languages), Sound Effects Model, Voice Design, ElevenReader app launched | product | Public | Expands platform beyond TTS; enters audio localization market | |
| 2024-Q3 | Conversational AI agents product launched; 250,000 agents built by developers in first two months | product | Developers globally | Enters real-time agent market; rapid developer traction | |
| 2024-Q4 | $120M ARR milestone; 60%+ Fortune 500 employee penetration | scale | Enterprise customers | Demonstrates enterprise velocity; validates B2B go-to-market | |
| 2025-01 | Series C at $3.3B valuation — tripling from Series B in one year | financing | $180M at $3.3B | a16z, ICONIQ (co-leads); Deutsche Telekom, LG, HubSpot, NTT DOCOMO, RingCentral (strategic) | Accelerates research, international expansion, and agentic AI build-out |
| 2025-03 | Consumer Reports publishes study finding ElevenLabs and peers lack robust voice cloning safeguards | adverse | Consumer Reports | Amplifies public criticism of self-regulation model; regulatory risk increases | |
| 2025-Q2 | Warsaw R&D center opened; India office expanded for Indic language coverage | scale | Internal | Deepens international AI talent pipeline | |
| 2025-11 | ARR reaches $330M — 175% YoY growth; headcount reaches ~330 | scale | Internal | Demonstrates sustained hyper-growth; improving capital efficiency | |
| 2026-02 | Series D at $11B valuation — 10× valuation in two years | financing | $500M at $11B | Sequoia (lead); a16z, ICONIQ, Lightspeed, Evantic, Bond, NEA | Positions company for IPO within 2–3 years; largest AI audio fundraise globally |
Seed and early Series A dates are approximate; exact close dates not publicly confirmed. Lawsuit details (voice actor copyright claims) excluded from above as terms are not publicly resolved.
[CO001, CO010, CO011, CO012, CO013, CO016]1.6 Exhibits
02Market Analysis
2.1 Market Boundary and Scope
ElevenLabs competes across three adjacent but distinct markets: (1) AI text-to-speech (TTS) and voice synthesis infrastructure, which covers API-delivered speech generation from text input for developers, publishers, and creators; (2) AI voice cloning and customization, covering enterprise creation of branded or personalized voice identities; and (3) conversational AI agent infrastructure, which delivers real-time voice-interactive AI agents for customer service, sales, education, and gaming. The company also addresses the AI audio localization and dubbing market through its 32-language dubbing product. Excluded from ElevenLabs' core market: speech recognition/ASR (transcription), voice biometrics for authentication, and general-purpose voice assistants (Alexa, Siri-class). Status-quo substitutes include human voiceover artists (for content creation), traditional IVR/telephony systems (for contact centers), and manual dubbing studios (for localization). The AI agent market represents the most strategically important adjacency, where ElevenLabs competes with OpenAI Realtime API, Google TTS, Amazon Polly, and specialized players like Deepgram and Cartesia. [CM001][CM002][CM003]
| Market Segment | Included Spend | Excluded Spend | Status-Quo Substitute | ElevenLabs Relevance |
|---|---|---|---|---|
| AI Text-to-Speech (TTS) | API-delivered speech synthesis from text for developers and enterprises | ASR/transcription, voice biometrics | Human voiceover artists ($300-500/hr) | Core product; API and platform |
| AI Voice Cloning | Brand and persona voice creation; creator voice monetization | Voice biometrics for auth | Recording sessions with voice talent | Voice cloning API; Voice Library platform |
| AI Audio Dubbing/Localization | AI-powered translation and voice replication across languages | Human dubbing studios not using AI | Human dubbing studios ($5,000-50,000/project) | 32-language dubbing product |
| Conversational AI Agents (Voice) | Real-time voice AI for customer service, gaming, education | Text chatbots, IVR without AI | Traditional IVR systems; offshore call centers | ElevenAgents; Conversational AI product |
| AI Sound Effects | AI-generated sound design and audio effects | Music generation (excluded) | Foley artists; sound library licensing | Sound Effects Model; nascent segment |
Market boundary based on ElevenLabs' disclosed product lines as of January 2025. Conversational AI agents represent the highest-growth adjacent segment.
[CM001, CM002, CM003]2.2 Market Sizing — TAM, SAM, and SOM
Multiple analyst sources converge on the core AI voice generator market at approximately $3B in 2024, growing to $20.4B by 2030 at a 37% CAGR per MarketsandMarkets research. Grand View Research estimates a slightly higher trajectory at $21.75B by 2030 (29.6% CAGR). The broader voice AI market including ASR, voice assistants, and TTS exceeds $70B by 2030. ElevenLabs' serviceable addressable market (SAM) is the subset addressable through its API and enterprise contracts: enterprise AI audio, conversational AI agent infrastructure, and AI media localization. This SAM is estimated at $8–12B by 2027, based on enterprise contact center AI (approximately $4B), media/publishing voice AI ($2B), and developer API infrastructure ($2–4B). The serviceable obtainable market (SOM) at current trajectory is approximately $1–1.5B by 2027, consistent with reaching $500–700M ARR at the company's disclosed growth rate with moderate market share capture in enterprise segments. At $330M ARR on a ~$4B total market in 2025, ElevenLabs holds approximately 7–8% share of the core AI voice generation market. [CM004][CM005][CM006][CM007][CM008]
| Lens | 2024 Estimate (USD) | 2030 Estimate (USD) | CAGR | Methodology | Source |
|---|---|---|---|---|---|
| AI Voice Generator Market (MarketsandMarkets) | $3.0B | $20.4B | 37.1% | Bottom-up vendor survey + enterprise adoption | MarketsandMarkets 2024 |
| AI Voice Generators Market (Grand View Research) | $3.0B | $21.75B | 29.6% | Broad market report, global scope | Grand View Research 2024 |
| TTS Market Only (conservative) | $2.5B | $7.25–8.8B | 13-15% | Narrow TTS-only scope; excludes agents | Research and Markets / NextMSC 2024 |
| Broader Voice AI / Speech Tech | $15B+ | $70B+ | ~25% | Includes ASR, voice assistants, agents | Industry composite 2024 |
| ElevenLabs SAM (enterprise audio + agents) | $4-5B | $8-12B | ~25% | Enterprise contact center + media + developer API | Analyst-derived estimate |
| ElevenLabs SOM | $350M | $1-1.5B | ~30% | ARR trajectory extrapolation; current market share ~7-8% | Derived from company ARR + market size |
Wide range across sources reflects scope ambiguity between TTS-only and broader voice AI. ElevenLabs SAM and SOM are analyst-derived estimates, not company-guided.
[CM004, CM005, CM006, CM007, CM008]Illustrates the wide range of market size estimates across different scopes (TTS-only, AI voice generation, and broad voice AI/speech tech), anchoring ElevenLabs' SAM and SOM within the realistic portion of this range.
All values are analyst-derived projections in USD billions. Wide ranges reflect definitional ambiguity across scope of 'voice AI'. ElevenLabs SAM and SOM are estimated.
[CM004, CM023, CM029]Annual AI voice generator market size (core scope) from 2024 to 2030, showing the 37% CAGR trajectory that underpins ElevenLabs' growth case.
Intermediate years (2026-2028) interpolated at 37% CAGR from MarketsandMarkets 2024 base. Values are in USD billions.
[CM004, CM007]2.3 Buyer and Segment Map
ElevenLabs' buyers span developer/creator (bottom-up) and enterprise (top-down) segments. The developer/creator segment (estimated ~50% of revenue) includes indie developers using the API for apps and games, content creators producing audio podcasts and YouTube content, and small business teams automating customer touchpoints. The enterprise segment (~50% of revenue) includes: media and publishing firms (TIME, Washington Post, HarperCollins) licensing ElevenLabs for content production and dubbing; customer service platforms and contact centers using Conversational AI agents; gaming studios (Paradox Interactive) for in-game voice; telecommunications carriers (Deutsche Telekom, NTT DOCOMO) as both investors and distribution channels; and educational technology platforms (Chess.com, Praktika). The primary enterprise buyer persona is the Chief Digital Officer, CTO, or Head of Customer Experience. In the developer segment, the buyer is the engineer or startup founder accessing the API. Budget owner for enterprise deals is typically the IT/digital transformation budget, not marketing. A key adoption friction is procurement security review for AI-generated voice handling, particularly in regulated industries. [CM009][CM010][CM011][CM012]
| Segment | Buyer Persona | Budget Owner | Adoption Path | ElevenLabs Fit |
|---|---|---|---|---|
| Media and Publishing | Head of Digital, CTO | Digital transformation budget | Proof-of-concept → enterprise contract | Strong; named customers TIME, WaPo, HarperCollins |
| Customer Service / Contact Center | Head of CX, CTO | IT operations budget | Pilot → workflow integration | Strong; ElevenAgents; 250k agents built |
| Gaming and Interactive | Head of Engineering, Creative Director | Game dev budget | SDK trial → per-title contract | Strong; Paradox Interactive, Inworld |
| Telecom Operators | CTO, Head of Product | Network/service innovation budget | Strategic partnership → embedded product | Growing; Deutsche Telekom, NTT DOCOMO investors |
| Education Technology | Head of Curriculum / Product | EdTech product budget | API trial → per-seat licensing | Moderate; Chess.com, Praktika, SchoolAI |
| Developer / Creator (self-serve) | Individual developer or creator | Personal credit card → startup budget | Freemium → paid plan upgrade | Core segment; developer adoption flywheel |
| Accessibility | Nonprofit or healthcare org | Grant / research budget | Impact Program (free) → paid upgrade | Mission alignment; limited revenue |
Revenue split is approximately 50% enterprise / 50% self-serve (company-reported, 2025). Accessibility segment is largely non-revenue but brand-building.
[CM009, CM010, CM011, CM012]Positions ElevenLabs' customer segments on a two-axis map of deal size vs. growth potential, showing where enterprise and developer segments sit relative to each other.
Ordinal 0–10 scoring. X-axis: average deal/contract size (10=highest). Y-axis: segment growth potential (10=highest). Based on analyst estimates and company-disclosed customer data.
[CM011, CM035]2.4 Growth Drivers and Adoption Constraints
Growth drivers include: (1) LLM proliferation enabling conversational AI agents at scale, creating demand for natural, low-latency voice output as the interface layer; (2) digital content globalization driving demand for AI dubbing to replace expensive human localization; (3) cost economics — AI voice at $22/month vs. $300–500/hour for professional voiceover artists; (4) developer API adoption following the "Twilio for voice" distribution model; and (5) enterprise automation of contact center workflows as AI agents replace first-tier IVR systems. Adoption constraints include: (1) regulatory risk from deepfake concerns leading to EU AI Act voice provisions and potential US biometric/voice data regulations that could restrict training data practices; (2) enterprise trust in AI voices for customer-facing use cases, requiring consent, watermarking, and audit trails; (3) model inference costs requiring continued GPU cost reduction to sustain margins; (4) Big Tech competition from Google TTS, OpenAI Realtime API, Amazon Polly, and Microsoft Azure Speech; and (5) the commoditization risk as open-source voice models (Coqui, Bark, Kokoro) approach commercial quality at zero cost. [CM013][CM014][CM015][CM016][CM017][CM018]
| Factor | Type | Direction | Magnitude | Time Horizon |
|---|---|---|---|---|
| LLM proliferation driving demand for voice agent layer | Technology driver | Positive | High | Immediate (2024-2026) |
| Content globalization requiring multilingual audio | Demand driver | Positive | High | Near-term (2025-2028) |
| AI voice cost vs. human voiceover ($22/mo vs. $300-500/hr) | Economic driver | Positive | High | Immediate |
| Developer API adoption / 'Twilio for voice' model | Distribution driver | Positive | Medium | Near-term |
| Enterprise contact center AI automation | Demand driver | Positive | High | Near-term (2025-2028) |
| EU AI Act voice provisions + US biometric/voice laws | Regulatory constraint | Negative | Medium | Near-term (2025-2027) |
| Enterprise trust barriers for customer-facing AI voice | Adoption constraint | Negative | Medium | Ongoing |
| Model inference cost (GPU compute) affecting margins | Cost constraint | Negative | Medium | Ongoing |
| Big Tech (Google, OpenAI, Amazon, Microsoft) voice products | Competitive constraint | Negative | High | Immediate |
| Open-source voice model commoditization (Coqui, Bark, Kokoro) | Competitive constraint | Negative | Medium | Medium-term (2025-2027) |
| Deepfake regulatory backlash restricting voice cloning | Regulatory risk | Negative | Material | Medium-term |
Maps the value chain from AI model training through platform delivery to end-user application, showing where ElevenLabs sits and the competitive dynamics at each layer.
[CM032, CM033, CM034]2.5 Sizing Gaps and Conflicting Estimates
Market sizing for AI voice is heterogeneous and contested. Key contradictions: MarketsandMarkets projects $20.4B by 2030 while Research and Markets projects a much more modest $7.25B for TTS-only. The difference reflects scope ambiguity: broader "AI voice" includes ASR, voice assistants, and speech analytics. Analysts frequently conflate these categories. A further complication is that ElevenLabs' primary revenue driver—conversational AI agents—is typically sized within the broader "contact center AI" market ($17B+ by 2030 per Grand View Research), which would imply a much larger TAM when combined with core TTS. The absence of direct competitor revenue disclosures makes bottom-up market share analysis difficult. Estimates of ElevenLabs' market share (7–8% of core voice generation, 2025) are derived from disclosed ARR relative to analyst total market size, and may materially differ if the actual serviceable market is smaller due to ElevenLabs' specific product focus. [CM019][CM020]
2.6 Exhibits
03Competitors
3.1 Competitive Landscape Overview
ElevenLabs competes across three market tiers: specialised AI-native voice API startups (Cartesia, Deepgram, Resemble AI, Play.ht, Murf), Big Tech incumbents with bundled TTS (Google, Amazon, Microsoft), and foundation model labs expanding into audio (OpenAI, potentially Anthropic). Each tier presents a distinct competitive dynamic. Among AI-native peers, ElevenLabs holds the quality lead, with independent Mean Opinion Scores of approximately 4.5/5, ahead of all API-accessible alternatives as of 2025. However, Cartesia has carved out a credible niche with sub-95ms latency at lower per-character pricing, attracting voice-agent builders who prioritise speed over maximum naturalness. Deepgram competes primarily in speech-to-text but its Aura TTS model is gaining traction in enterprise real-time scenarios. Big Tech incumbents (Google Cloud TTS, Amazon Polly, Microsoft Azure Speech) maintain distribution advantages through their broader cloud relationships, existing enterprise procurement, and zero-marginal-cost bundling with cloud platforms. They lag ElevenLabs on voice naturalness benchmarks but present a real risk for commodity TTS workloads where price and platform consolidation dominate decision-making. Microsoft's integration of Azure Speech into Microsoft 365 Copilot is particularly notable given the scale of the existing Microsoft enterprise install base. [CP001][CP013][CP014][CP015]
| Provider | Entry Price | Mid Tier | Enterprise | Per-Char Rate (volume) | Free Tier |
|---|---|---|---|---|---|
| ElevenLabs | $5/mo (Starter) | $22/mo (Creator) | Custom ($1,320+/yr) | ~$0.30/1M chars | Yes (10K chars/mo) |
| Cartesia | Developer free | $99/mo Pro | Custom | $0.038/1K chars | Yes |
| Deepgram | Pay-per-use | $4,000/yr base | Custom | ~$0.015/1K chars (TTS) | Yes (200 credits) |
| OpenAI TTS | Pay-per-use | API pricing | Via Azure | ~$0.015/1K chars | Via API credits |
| Google Cloud TTS | Pay-per-use | Committed use discount | Enterprise via GCP | ~$0.004/1K chars | Yes (1M chars/mo) |
| Amazon Polly | Pay-per-use | N/A | Volume pricing | $4/1M chars | 12 months (5M chars/mo) |
| Microsoft Azure Speech | Pay-per-use | $4/1M chars standard | Enterprise agreement | ~$1/1M chars (neural) | Yes (500K chars/mo) |
| Murf AI | $0 (Basic) | $29/mo Pro | Custom team plans | N/A (subscription) | Yes (limited) |
Per-character rates normalised to comparable usage tiers. ElevenLabs character pricing not publicly listed as per-char; estimate from plan limits.
[CP016, CP017, CP013, CP014]3.2 Competitor Profiles and Positioning
Cartesia AI emerged from stealth in September 2024 with $80M in funding led by Andreessen Horowitz. It offers 40–95ms synthesis latency—faster than ElevenLabs—with hallucination-free outputs claimed for complex text. Its pricing at $0.038 per 1,000 characters is substantially cheaper than ElevenLabs at comparable volume. Cartesia's limitations include a smaller voice library (~130 voices), fewer supported languages (~15–20), and limited multilingual depth. Resemble AI differentiates through its deepfake detection and provenance watermarking capabilities, on-premise deployment for regulated industries, and 149-language coverage—though it targets a smaller market of compliance-first buyers. Play.ht competes on voice library breadth (900+ voices, 142 languages) but trails ElevenLabs on voice naturalness. Murf AI serves non-technical content teams with a studio-first UI and affordable subscription pricing but lacks real-time synthesis API capabilities. [CP002][CP003][CP009][CP011][CP012][CP027] OpenAI's TTS API remains a lower-quality but ecosystem-integrated option, while GPT-4o's real-time audio mode introduced in 2024 represents a longer-horizon threat to ElevenLabs' conversational AI agent product. VocalCopyCat's 2025 critical analysis argues that ElevenLabs' current commercial success may outpace its technical leadership, with credible challengers on latency and pricing—a view echoed by developer community comparisons but not yet reflected in enterprise contract displacement. [CP007][CP008][CP021][CP024]
| Competitor | Founded | Funding | Languages | Latency (ms) | On-Prem | Primary Buyer | Strategic Positioning |
|---|---|---|---|---|---|---|---|
| ElevenLabs | 2022 | $781M total | 70+ | 75 | No | Enterprises, developers | Quality leader, full-stack voice platform |
| Cartesia AI | 2023 | $80M | ~15-20 | 40-95 | Yes | Voice-agent builders | Speed and anti-hallucination challenger |
| Deepgram | 2015 | $130M+ | 1 (EN) | 150 | Yes | Enterprise real-time apps | STT-first; TTS as complement |
| OpenAI TTS | 2015 | Public/large | 57 | 200+ | No | GPT ecosystem developers | Ecosystem integration; quality follower |
| Google Cloud TTS | 2016 | Internal | 50+ | 200+ | No | Cloud-first enterprises | Distribution via GCP; commodity pricing |
| Amazon Polly | 2016 | Internal | 30+ | 200+ | No | AWS-native builders | AWS ecosystem; pay-per-use cost leader |
| Microsoft Azure Speech | 2017 | Internal | ~110 | 200+ | No | M365 enterprise buyers | M365/Copilot bundling advantage |
| Murf AI | 2020 | $16M | ~20 | 300+ | No | Content teams, e-learning | No-code studio; non-developer focus |
| Play.ht | 2016 | $20M | 142 | 250+ | No | Multilingual content teams | Breadth-first voice library leader |
| Resemble AI | 2019 | $8M | 149 | 200 | Yes | Regulated sectors | Compliance-first; deepfake detection |
Funding figures approximate; Deepgram's total raised is $130M per reported rounds. Latency figures are approximate mid-points from published benchmarks.
[CP001, CP002, CP007, CP009, CP010, CP011]| Feature | ElevenLabs | Cartesia | Deepgram | OpenAI TTS | Murf AI | Resemble AI |
|---|---|---|---|---|---|---|
| Voice cloning (instant) | Yes (3s) | Yes (3s) | No | No | Basic | Yes (advanced) |
| Real-time streaming API | Yes (<75ms) | Yes (<95ms) | Yes (~150ms) | Yes (~200ms) | No | Yes (~200ms) |
| Languages | 70+ | ~15-20 | 1 (EN) | 57 | ~20 | 149 |
| On-premise deployment | No | Yes | Yes | No | No | Yes |
| Voice marketplace | Yes (3000+) | Limited | No | No | Yes (120+) | No |
| Deepfake detection | No | No | No | No | No | Yes |
| Conversational AI agent | Yes | Via partner | Partial | Yes (GPT-4o) | No | Partial |
| AI dubbing | Yes (32 lang) | No | No | No | No | No |
Vendor self-reported capabilities subject to change. 'Partial' indicates limited or beta-stage feature support.
[CP003, CP004, CP005, CP006, CP011, CP012]3.3 Moat, Switching Costs, and Competitive Risk
ElevenLabs' competitive durability rests on several reinforcing factors. First, voice quality leadership: its model quality edge, backed by proprietary training data from its Voice Library and publishing partnerships, is difficult to replicate quickly. Second, developer ecosystem lock-in: enterprise customers building voice-native products on the ElevenLabs API accumulate integration depth, custom voice model weights, and embedded Voice Library content that carry meaningful switching costs. Third, the Voice Library marketplace itself creates a two-sided network effect—voice creators building on the platform attract developers, who attract enterprise customers, who fund more creator payouts. NEA and Salesforce Ventures both specifically called out the platform's ecosystem potential as a primary investment thesis. [CP019][CP020][CP030] Key competitive risks include: (1) the absence of on-premise deployment, which structurally excludes ElevenLabs from regulated-industry RFPs won by Resemble AI; (2) pricing pressure from Cartesia and Big Tech for high-volume commodity TTS; (3) potential bundling by Microsoft or Google that commoditises basic TTS for customers already on those platforms; and (4) multi-homing, where developers layer Cartesia for speed and ElevenLabs for quality in the same product. Consumer Reports' March 2025 critique of weak safeguards across all voice cloning services, including ElevenLabs, adds regulatory risk to the moat picture—a compliance failure could damage the brand trust that anchors enterprise relationships and trigger customer contract reviews. ElevenLabs' planned response includes cryptographic voice provenance metadata and enhanced consent verification flows, but these measures are still in development as of mid-2026. [CP018][CP022][CP025][CP032][CP033]
| Moat Factor | ElevenLabs Strength | Primary Threat | Durability (1-5) | Risk Level |
|---|---|---|---|---|
| Voice quality leadership | MOS ~4.5/5, best-in-class | Cartesia/Deepgram closing gap | 3 | Medium |
| Voice Library network effect | 3000+ voices, creator economy | Platform replication by BigTech | 4 | Low |
| Enterprise API integration depth | Deep embedding in customer stacks | Multi-homing; swap-out risk | 4 | Medium |
| On-premise absence | N/A – structural gap | Resemble AI, Deepgram wins regulated RFPs | 1 | High |
| BigTech distribution disadvantage | No cloud platform partnership | Google/Amazon/Microsoft bundling | 2 | High |
| Pricing premium | Justified by quality today | Commodity downward pressure | 3 | Medium |
| Regulatory/trust risk | Brand strong but safeguards critiqued | Consumer Reports / FCC action | 3 | High |
| AI dubbing uniqueness | 32 languages, no comparable API | Google Translate+TTS combos | 4 | Low |
Durability rated 1 (weak) to 5 (strong). Risk level based on probability and impact of moat erosion.
[CP015, CP019, CP020, CP021, CP022, CP025]3.4 Exhibits
04Financials
4.1 Revenue Model and ARR Trajectory
ElevenLabs has established one of the fastest ARR growth curves in enterprise software history. The company reached $330M ARR at end-2025 from a standing start of effectively zero in January 2023, when its platform launched publicly. Key milestones: $100M ARR in 20 months (January 2025), $200M ARR in September 2025, and $330M ARR by December 2025. The most recent segment—$100M to $200M—took 10 months; the next $130M came in just five months. This acceleration is driven by enterprise adoption: enterprise revenue grew over 200% year-over-year in 2025, with the enterprise/self-serve mix approaching 50/50 from a heavily self-serve starting point. The largest enterprise contracts reached $2M annually. The company operates four revenue streams: (1) self-serve subscriptions from $5/month (Starter) to $1,320+/year (Business), serving millions of developer and creator users; (2) enterprise SaaS contracts with custom pricing, usage-based tiers, and annual commitments; (3) API usage fees charged per character or per minute of audio generated; and (4) marketplace commissions from the Voice Library, where creators license voices to enterprise customers. The marketplace has paid out over $2M to voice creators as of early 2026, reflecting early but growing traction. SaaStr cited ElevenLabs as exemplifying the "PLG-to-enterprise" motion, where a viral self-serve product converts into high-value enterprise relationships. [CI001][CI002][CI003][CI004][CI005][CI006][CI023]
| Revenue Stream | Description | Pricing Model | Estimated Revenue Share | Growth Driver |
|---|---|---|---|---|
| API Usage Fees | Per-character or per-minute audio generation via REST API | Usage-based ($0.30–$0.005/1K chars by tier) | ~35% | Developer and enterprise API adoption |
| Self-Serve Subscriptions | Monthly/annual plans: Starter to Business | $5/mo–$1,320+/yr | ~15% | Product virality, freemium conversion |
| Enterprise SaaS Contracts | Custom annual contracts with usage tiers and SLAs | Custom ($200K–$2M+/yr) | ~48% | Fortune 500 voice agent deployments |
| Voice Marketplace Commissions | Creator voice licensing fees from enterprise buyers | Revenue share on voice sales | ~2% | Voice Library creator ecosystem |
Revenue share estimates are approximate; enterprise SaaS is the fastest-growing segment.
[CI006, CI004, CI005, CI023]| Tier | Price | Monthly Characters | Voice Cloning | API Access | Target Buyer |
|---|---|---|---|---|---|
| Free | $0/mo | 10,000 | No | Limited | Casual users, evaluation |
| Starter | $5/mo | 30,000 | No | Basic | Hobbyists, indie developers |
| Creator | $22/mo | 100,000 | Yes (Instant) | Full | Content creators, YouTubers |
| Pro | $99/mo | 500,000 | Yes (Professional) | Full + Priority | Developers, small businesses |
| Scale | $330/mo | 2,000,000 | Yes | Full + SLA | Scale-up companies |
| Business | $1,320+/yr | Custom | Yes + Custom models | Enterprise SLA | Mid-market enterprises |
| Enterprise Custom | Custom | Custom | White-label + Custom | Dedicated SLA + support | Fortune 500, platform deals |
Pricing is subject to change; overages billed separately. Enterprise custom pricing negotiated per contract.
[CI004, CI005]4.2 Unit Economics and Capital Efficiency
ElevenLabs' unit economics are among the strongest reported by a late-stage AI venture. At $330M ARR with approximately 400 employees, the company generates roughly $825,000 in ARR per employee—a top-decile metric that compares favourably to the best-in-class public SaaS companies (Snowflake, Datadog) and well above typical venture-backed peers at the same stage. The company reportedly achieved operational breakeven at approximately $200M ARR in mid-2025, suggesting that incremental ARR is substantially accretive to operating income at current headcount. Gross margins are not publicly disclosed; analyst estimates range from 70% to 80%, consistent with model-weight-as-API-product economics where compute costs decline as model efficiency improves. Capital efficiency is similarly strong. ElevenLabs raised approximately $781M total through February 2026 against $330M ARR—a capital-raised-to-ARR ratio of approximately 2.4×, compared to 10×+ for Anthropic and 15×+ for OpenAI at equivalent revenue scale. After the $500M Series D, the company reported approximately $550M in cash, providing several years of operating runway even at growth-mode burn. The company has no disclosed debt obligations. ElevenLabs described its capital posture as intentionally building a sustainable business while pursuing product-led expansion rather than revenue-at-all-costs. [CI007][CI008][CI009][CI010][CI012][CI016]
| Metric | Estimated Value | Benchmark (top SaaS) | Evidence Quality | Notes |
|---|---|---|---|---|
| ARR per employee | $825K | $500K–$1M (top quartile) | Medium | 330 staff per TechFront360; $330M ARR |
| ARR (end-2025) | $330M | — | High | Widely corroborated across multiple sources |
| YoY ARR growth | 175% | >100% for hypergrowth | High | End-2024: $120M; End-2025: $330M |
| Enterprise ARR growth | >200% YoY | Top-quartile enterprise SaaS | Medium | Analyst-reported; not independently verified |
| Breakeven ARR | ~$200M (mid-2025) | — | Medium | Reportedly profitable; unaudited |
| Gross margin (est.) | 70-80% | 70-80% for AI API SaaS | Low | Not disclosed; analyst estimate |
| NRR | Not disclosed | >120% for elite SaaS | N/A | Critical diligence gap |
| CAC payback | Not disclosed | <12 months ideal | N/A | Not calculable from public data |
| Cash on hand | ~$550M | — | Medium | Post-Series D estimate |
| Total raised | ~$781M | — | High | Per Crunchbase and press releases |
| Item | Amount | Date | Source | Notes |
|---|---|---|---|---|
| Cash on hand (estimated) | ~$550M | Feb 2026 | Post-Series D estimate | Company-stated range; exact figure not confirmed |
| Series D raised | $500M | Feb 2026 | ElevenLabs official | Led by Sequoia; closes funding gap |
| Total raised (all rounds) | ~$781M | Cumulative to Feb 2026 | Public filings and press | Seed through Series D |
| Debt obligations | None disclosed | As of Feb 2026 | Company statements | No convertible notes or credit facilities confirmed |
| Annual burn (implied) | Undisclosed | 2025 | Not confirmed | Company claims profitability; burn likely below $100M/yr |
| Runway (estimated) | 5+ years at current efficiency | From Feb 2026 | Analyst estimate | Assumes continued profitability improvement |
4.3 Financial Gaps and Diligence Assessment
Despite the strong headline metrics, ElevenLabs' public financial disclosure has material gaps that a VC investor must resolve before commitment. Gross margin has not been disclosed publicly. Net revenue retention and gross revenue retention are absent from any investor materials in the public domain. Customer acquisition cost and payback period cannot be calculated without headcount-by-function data, and burn rate is not confirmed—the profitability claim is reported but not independently audited. Customer concentration is also unknown: if a handful of large enterprise clients account for more than 20% of ARR, the revenue quality assessment changes substantially. ElevenLabs' Voice Actor lawsuit (the Adam and Bella TTS voice litigation) represents a contingent liability that has not been quantified publicly. If the lawsuit results in damages or requires the company to retrain or remove those voice models, the cost and revenue impact would be material to the financials. Revenue recognition policy is inferred rather than verified—the mix of subscription and consumption revenue has different deferred revenue and cash-to-ARR timing characteristics. On balance, ElevenLabs presents exceptional growth and efficiency metrics, but the private financials required for diligence—margins, NRR, CAC, churn, burn—are not yet available through public sources and must be demanded as conditions of any investment process. [CI014][CI015][CI017][CI018][CI019][CI021][CI024][CI025]
| Metric | Availability | Severity | Why It Matters | Suggested Diligence Action |
|---|---|---|---|---|
| Gross margin | Not disclosed | Critical | Determines scalability and pricing power | Request management accounts |
| Net revenue retention (NRR) | Not disclosed | Critical | Key measure of enterprise stickiness and expansion | Request cohort revenue data |
| Customer acquisition cost (CAC) | Not disclosed | Moderate | Validates go-to-market efficiency | Headcount-by-function + marketing spend |
| Churn rate (monthly/annual) | Not disclosed | Critical | Validates self-serve revenue quality | Request monthly cohort retention data |
| Burn rate / cash consumption | Not disclosed | Moderate | Validates profitability claim and runway | Audited financials or cash flow statement |
| Revenue recognition policy | Not disclosed | Moderate | Subscription vs usage timing affects deferred revenue | Legal review of customer MSAs |
| Customer concentration | Not disclosed | Moderate | Top-client dependency risk | Request ARR by top-10 customers |
| Legal contingent liabilities | Partially disclosed | Moderate | Voice actor lawsuits; damages unquantified | Legal review and reserves assessment |
This table should form the basis of a pre-LOI financial data room request list.
[CI014, CI015, CI019, CI021, CI024, CI025]4.4 Exhibits
05Product & Technology
5.1 Product Portfolio and Customer Workflow
ElevenLabs has expanded from a single text-to-speech API into a comprehensive AI audio platform with eight distinct product lines. The core Text-to-Speech API processes natural-language text and outputs audio in seconds via REST or WebSocket, serving developers and enterprise applications from gaming to publishing. Voice Cloning lets users create personalised voices from audio samples (3 seconds for Instant Cloning, 30+ minutes for Professional Cloning), feeding enterprise use cases such as personalised customer service and branded content. AI Dubbing automatically translates and re-voices video content across 32 languages while preserving speaker identity, targeting media localisation workflows that previously required weeks of human voice work. The Conversational AI platform powers voice agents capable of multi-turn dialogue with interrupt detection, emotion-aware response, and Twilio telephony integration—enabling call centre automation and interactive customer experiences. The Voice Library marketplace aggregates 5,000+ licensed voices where creators earn revenue share and enterprise customers access diverse voice talent at scale. ElevenReader, Sound Effects AI, and the Projects studio extend the platform into consumer reading, audio production, and sound design. Together, these products integrate into a single API surface, enabling enterprise customers to build compound voice workflows without multiple vendor relationships. [CE001][CE005][CE006][CE019][CE021][CE022]
| Product Module | Primary Use Case | Target Buyer | API Access | GA Status | Key Differentiator |
|---|---|---|---|---|---|
| TTS API (Flash model) | Real-time voice agents, live apps | Developers, enterprise | REST + WebSocket | GA | Sub-75ms latency |
| TTS API (Multilingual v2) | High-quality audio content | Developers, publishers | REST + WebSocket | GA | 29+ languages, MOS 4.5 |
| Instant Voice Cloning | Personalised voice experiences | Developers, content creators | API | GA | 3-second sample cloning |
| Professional Voice Cloning | Branded enterprise voice | Enterprise, media | API + portal | GA | High-fidelity with 30+ min audio |
| AI Dubbing | Video localisation at scale | Media, streaming, marketing | Portal + API | GA | 32 languages, speaker preservation |
| Conversational AI Platform | Voice bots, customer support, IVR | Enterprise, telecoms | WebSocket + Twilio | GA | Multi-turn dialogue, telephony |
| Voice Library | Diverse voice talent marketplace | Enterprise, publishers | Integrated portal | GA | 3,000+ licensed voices |
| ElevenReader | Document read-aloud, accessibility | Consumers, enterprise | Mobile + web app | GA | AI voice personalisation |
| Projects Studio | Audiobook, podcast production | Publishers, content creators | Web portal | GA | Multi-voice, chapter editing |
| Sound Effects AI | Video and game audio production | Developers, game studios | API + portal | GA | Text-to-SFX generation |
| Speech-to-Speech (STS) | Live voice conversion | Streaming, gaming, entertainment | API | Beta | Real-time identity transfer |
5.2 Technology Architecture and Differentiation
ElevenLabs' underlying technology uses a proprietary flow-matching neural architecture that prioritises prosodic control and emotional expressiveness over raw generation speed. The Flash model variant sacrifices some quality for sub-75ms latency, enabling real-time conversational applications that transformer-based predecessors could not serve. The Multilingual v2 model uses a shared architecture across 29+ languages, enabling zero-shot cross-language voice transfer—a key enabler for the AI Dubbing product and a significant technical moat against single-language competitors. Voice cloning operates through few-shot learning: the model adapts to a new speaker's vocal characteristics from minimal sample data, a capability that requires large proprietary training datasets to calibrate reliably. ElevenLabs' 1,000+ years of user-generated audio provides a self-reinforcing data flywheel that improves model quality with scale. The platform's differentiation combines three layers: model quality (highest MOS scores in independent 2025 benchmarks), API depth (streaming, real-time, telephony, game engine integrations), and a two-sided Voice Library network effect. NVIDIA's strategic investment signals compute infrastructure partnership access, critical for maintaining model training at scale. The FCC's 2024 ruling on AI-generated voice robocalls creates a compliance layer for enterprise customers deploying Conversational AI agents—ElevenLabs' SOC 2 and GDPR certifications position it as a compliance-aware vendor. [CE002][CE004][CE007][CE008][CE014][CE017][CE023][CE024][CE028]
| Use Case | ElevenLabs Product | Deployment Method | Example Customers | Latency Requirement |
|---|---|---|---|---|
| AI customer support agent | Conversational AI + Flash TTS | WebSocket API + Twilio | Deutsche Telekom, Revolut | Real-time (<100ms) |
| Audiobook production | Projects + Pro TTS | Web portal | HarperCollins, The Atlantic | Batch (<5s) |
| Game NPC voices | TTS API + STS | Unity/Unreal SDK | Paradox Interactive, Chess.com | Near-real-time (<200ms) |
| Video dubbing/localisation | AI Dubbing | Portal + API | BILD, BurdaVerlag | Batch (<30s/min video) |
| Podcast narration | TTS API | REST API | Washington Post, TIME | Batch (<5s) |
| E-learning voiceover | TTS + Projects | Portal | Unnamed corporate clients | Batch (<5s) |
| Live interactive fiction | Conversational AI | WebSocket | Aug X Labs | Real-time (<100ms) |
| Component | Technology Type | Key Capability | Dependency | Moat Assessment |
|---|---|---|---|---|
| Flash TTS model | Flow-matching neural net | Sub-75ms synthesis | GPU clusters (NVIDIA) | Strong – best latency/quality tradeoff |
| Multilingual v2 model | Shared multilingual architecture | 29+ languages, cross-language voice transfer | Large multilingual training corpus | Strong – breadth at quality |
| Voice Cloning (IVC) | Few-shot speaker adaptation | 3-second clone creation | Proprietary large voice dataset | Strong – data flywheel |
| Conversational AI engine | LLM-orchestrated dialogue + TTS | Multi-turn, interrupt detection | Third-party LLM (OpenAI/Anthropic) | Moderate – LLM dependency |
| Sound Effects AI | Text-to-audio generative model | Controllable SFX from text | GPU clusters | Moderate – early mover |
| Voice Library platform | Two-sided marketplace | 3,000+ licensed voices | Creator ecosystem | Strong – network effect |
| API infrastructure | Cloud-hosted REST + WebSocket | High availability, global CDN | AWS/Google Cloud | Moderate – no on-premise |
Moat assessment is qualitative; rated Strong/Moderate/Weak based on replicability and competitive position.
[CE002, CE003, CE007, CE017, CE023]5.3 Trust, Safety, and Competitive Technology Risk
ElevenLabs' trust and safety posture is a significant ongoing risk. Consumer Reports' March 2025 assessment found that voice cloning safeguards across the industry—including ElevenLabs—relied primarily on consent checkboxes that could be easily bypassed. The No-Go Voice list protects named public figures but enforces via name matching rather than biometric fingerprinting, creating circumvention risk for voice fraud. The pending voice actor litigation over the Adam and Bella default TTS voices raises material questions about the provenance of ElevenLabs' training data. The company has announced planned cryptographic audio provenance metadata for AI-generated content, but this feature was not live as of mid-2025. The company's autonomous AI audio roadmap for 2026, announced in early 2026, suggests a strategic evolution from voice tool to audio agent—expanding the product surface but also expanding the misuse surface. Critical infrastructure dependencies on major cloud GPU providers represent a technical and operational risk: disruption to AWS or Google Cloud compute capacity would directly impact API availability. ElevenLabs' on-premise absence also means it cannot serve regulated industries where data sovereignty requirements prohibit third-party cloud processing. This is a structural gap that Resemble AI and Deepgram currently exploit. The competitive threat from Big Tech bundled TTS is real but currently mitigated by the significant voice naturalness gap—a gap ElevenLabs must sustain through continued model investment to remain defensible. [CE010][CE011][CE016][CE018][CE023][CE025][CE026]
| Trust Dimension | Current Status | Evidence | Risk Level | Planned Improvement |
|---|---|---|---|---|
| SOC 2 Type II | Certified | Company-disclosed | Low | Annual renewal |
| GDPR compliance | Compliant | Company-disclosed, EU customers | Low | Ongoing |
| HIPAA BAA | Available on enterprise plan | Company-disclosed | Low | Expanding to more tiers |
| Voice cloning consent | Checkbox-based | Consumer Reports 2025 | High | Biometric fingerprint matching planned |
| No-Go Voice list | Name-matching only | ElevenLabs safety page | Moderate | Biometric verification planned |
| Training data provenance | Disputed (voice actor lawsuits) | The Voice Realm, CR 2025 | High | Legal resolution pending |
| AI audio provenance metadata | In development | Company roadmap 2026 | Moderate | Cryptographic watermarking |
| FCC TCPA compliance | Customer responsibility | FCC ruling DA-24-146 | Moderate | Compliance guidance provided |
| Initiative | Stage | Target Date | Strategic Purpose | Risk |
|---|---|---|---|---|
| Autonomous AI audio models | Announced | 2026 H2 | Agent-layer product expansion | Technical execution risk |
| Biometric voice fingerprinting | In development | 2026 | Trust and safety improvement | Regulatory acceptance unclear |
| Cryptographic audio provenance | In development | 2026 | Deepfake labelling compliance | Adoption by platforms required |
| On-premise/private cloud option | Not announced | Unknown | Regulated-industry expansion | Architecture investment required |
| Expanded telephony integrations | Active | 2025–2026 | Call-centre AI agent growth | Carrier partnership complexity |
| Asian language expansion | Active | 2026 | Asia market penetration | Data availability and model training |
5.4 Exhibits
06Customers
6.1 Customer Base Segmentation and Scale
ElevenLabs' customer base spans individual developers through Fortune 500 enterprises across five primary verticals: media and publishing, gaming, enterprise technology, education and e-learning, and government. By early 2025, more than 60% of Fortune 500 companies had employees using ElevenLabs tools—up from 41% in 2024—with over 1 million registered developers in the community. The company served over 2 million conversational AI agent deployments by early 2026, a metric that illustrates adoption depth beyond passive API usage. Users have generated over 1,000 years of cumulative audio content on the platform, a proxy for engagement and stickiness that few software platforms at this age can claim. The customer mix by revenue is approximately 50/50 enterprise and self-serve. Enterprise revenue grew over 200% year-over-year in 2025, outpacing self-serve growth and driving average contract value higher. Geographically, the customer base is predominantly North American and European, with Asia-Pacific cited as a key expansion target for the Series D capital. Key enterprise segments include: telecom operators deploying AI phone agents (Deutsche Telekom), fintech companies building customer service automation (Revolut), media organisations automating audio content (Washington Post, TIME), and game studios generating NPC voice at scale (Paradox Interactive). [CU001][CU002][CU007][CU008][CU016][CU017][CU021]
| Metric | 2023 | 2024 | 2025 | 2026 (est.) | Source Quality |
|---|---|---|---|---|---|
| Fortune 500 adoption (%) | ~10% | 41% | >60% | ~75% (est.) | Medium |
| Registered developers | <200K | >500K | >1M | ~1.5M (est.) | Medium |
| Conversational AI agents deployed | ~10K | ~200K | ~1M | >2M | Medium |
| Audio generated (cumulative) | <50 years | <200 years | >500 years | >1,000 years | Medium |
| Enterprise ARR growth (YoY) | N/A | ~100% | >200% | Not disclosed | Medium |
| Metric | Value | Source | Evidence Quality | Diligence Action |
|---|---|---|---|---|
| Enterprise NRR | Not disclosed | Analyst estimates > 120% (unverified) | Low | Demand from company in data room |
| Self-serve monthly churn | Not disclosed | Estimated 3–6%/mo (unverified) | Low | Request cohort data |
| Enterprise renewal rate | Not disclosed | No evidence of non-renewal found | Absent | Verify in reference calls |
| Cumulative audio generated | >1,000 years | ElevenLabs official (Jan 2026) | High | Strong engagement proxy |
| Developer GitHub stars (SDK) | >5,000 | GitHub repository | Medium | Developer stickiness signal |
| Fortune 500 penetration growth | 41% to 60%+ (2024 to 2025) | Multiple analyst sources | Medium | Confirms enterprise expansion |
6.2 Named Customer Proof and Outcomes
ElevenLabs has assembled a reference-quality enterprise customer portfolio. In media and publishing: TIME Magazine, The Washington Post, The Atlantic, The New Yorker, HarperCollins, ESPN, and Perplexity's Discover Daily use ElevenLabs to generate audio content at scale—the Washington Post and TIME converting written articles to audio in seconds rather than hours. In gaming: Paradox Interactive and Chess.com embed ElevenLabs voices in interactive character experiences and AI chess coaching. In enterprise technology: Deutsche Telekom, Revolut, Square, Salesforce, and NVIDIA are named clients, with Salesforce Ventures being an active investor-customer. The Ukrainian government is a notable public-sector adopter. ElevenLabs' Chess.com deployment, where AI coaches guide players through chess games in natural-sounding voice, exemplifies the platform's ability to create deeply embedded, high-engagement customer experiences that are technically difficult to replace. These customer logos serve a dual purpose: revenue proof and marketing validation. The breadth of verticals (media, gaming, telecoms, fintech, government) signals product-market fit across multiple categories rather than single-niche dependence. However, individual case study depth and publicly documented ROI outcomes are limited—most customer evidence is logo-level disclosure rather than quantified outcome reporting. [CU003][CU004][CU005][CU006][CU019][CU020]
| Vertical | Primary Use Case | Representative Customers | Revenue Tier | Growth Signal |
|---|---|---|---|---|
| Media and Publishing | Article narration, podcast, audiobook | Washington Post, TIME, HarperCollins | Enterprise | Rapid: audio-first content trend |
| Gaming and Entertainment | NPC voice, interactive AI characters | Paradox Interactive, Chess.com, AMGI Studios | Mid-market to Enterprise | Growing: game studios scaling AI dialogue |
| Enterprise Technology | Customer service AI, internal tools | Deutsche Telekom, Revolut, Square, Salesforce | Enterprise | Rapid: voice agent adoption |
| Education and E-Learning | Courseware narration, language learning | Unnamed corporate L&D teams | Mid-market | Moderate |
| Government and Public Sector | Accessibility, content delivery | Ukrainian Government | Variable | Early: limited but high-profile |
| Developer and Startup | API experimentation, product prototyping | 1M+ registered developers | Self-serve | High volume; lower per-user revenue |
| Customer | Industry | Verified Use | Production vs Pilot | Reference Quality | Source |
|---|---|---|---|---|---|
| Washington Post | Media | AI-generated article audio narration | Production | High | ElevenLabs official |
| TIME Magazine | Media | Article narration, multilingual audio | Production | High | ElevenLabs official |
| HarperCollins | Publishing | Audiobook generation | Production | High | Investor materials |
| Deutsche Telekom | Telecoms | AI call centre voice agents | Production | High | CNBC, ElevenLabs blog |
| Revolut | Fintech | Customer support voice AI | Production | Medium | CNBC |
| Paradox Interactive | Gaming | NPC voice generation at scale | Production | Medium | ElevenLabs official |
| Chess.com | Gaming | AI chess coaching voice interface | Production | Medium | ElevenLabs official |
| Salesforce | Enterprise tech | Internal tools + investor-customer | Production | High | Salesforce Ventures |
| Ukrainian Government | Government | Public audio content generation | Production | Medium | Investor materials |
| Perplexity | AI Research | Discover Daily podcast voices | Production | Medium | Investor materials |
6.3 Retention, Expansion, and Concentration Risk
ElevenLabs' retention and expansion dynamics are largely opaque from public sources. Net revenue retention and gross revenue retention have not been disclosed. Enterprise contracts are primarily annual, limiting multi-year revenue visibility. The land-and-expand motion—developer adoption within an enterprise triggering a company-wide agreement—mirrors Twilio's growth model and is consistent with the 50/50 enterprise/self-serve revenue mix observed. No publicly documented cases of major enterprise contract terminations or competitor switches were identified, though the absence of adverse evidence does not confirm strong retention. Customer concentration is unknown; no disclosure of top-customer ARR percentage exists. Pricing friction on self-serve plans—particularly the credit-based model and rate limits—has generated developer complaints and potential churn risk. Enterprise customers who embed ElevenLabs' API directly into their product stacks (games, customer service platforms) accumulate switching costs through voice model integration, data dependency, and user experience continuity requirements. However, multi-homing remains a realistic pattern for enterprise buyers who use ElevenLabs for quality workloads and cheaper alternatives for high-volume simpler workloads. The FCC's 2024 ruling on AI voice calls creates a compliance burden for enterprise customers deploying Conversational AI agents over telephone networks, requiring ongoing regulatory monitoring. ElevenLabs provides compliance guidance and terms-of-service restrictions to address this, but enforcement relies on customer responsibility rather than technical controls. [CU009][CU010][CU012][CU013][CU014][CU018][CU022][CU024]
| Risk Factor | Current Status | Risk Level | Evidence | Mitigation |
|---|---|---|---|---|
| Customer concentration (top customer %) | Unknown | Moderate | No disclosure | Require top-10 customer ARR in data room |
| Vertical concentration | Media+Gaming+Enterprise split | Low-Medium | Public disclosures | Diversified across 5+ verticals |
| Geographic concentration | US/Europe dominant | Moderate | Series D cited Asia expansion | Active international sales investment |
| Self-serve churn friction | Pricing friction cited by developers | Moderate | Developer complaints | Monitor Hacker News, Reddit feedback |
| Enterprise multi-homing | Developers using Cartesia + ElevenLabs | Medium | Analyst reports | Deepen quality gap and API depth |
| Regulatory compliance burden | FCC TCPA voice agent rules | Moderate | FCC ruling 2024 | Customer guidance programs |
6.4 Exhibits
07Risks
7.1 Regulatory and Legal Risks
ElevenLabs operates at the intersection of AI, biometric data, and consumer communications — three of the most actively regulated technology domains globally. The FCC's January 2024 ruling (DA-24-146) declaring AI-generated voices in robocalls illegal under the TCPA is the most immediate regulatory risk, directly affecting ElevenLabs' enterprise telephony customers. Any customer deploying ElevenLabs' Conversational AI platform for outbound automated calls without prior written consumer consent is now in violation of federal law, creating compliance liability that flows through to ElevenLabs if customers rely on its API as an unguarded tool. [CR001][CR002] In Europe, the EU AI Act and GDPR create a dual compliance burden. The EU AI Act's Article 52 requires AI-generated voice outputs to be clearly labelled; any ElevenLabs customer distributing synthetic audio in the EU without such disclosure risks fines under the Act. Simultaneously, the European Data Protection Board has indicated that voice prints generated during voice cloning constitute biometric data under GDPR Article 9, requiring explicit informed consent. This significantly increases the overhead for ElevenLabs' European enterprise customers and could slow EU market adoption. [CR002][CR003] On the litigation front, ElevenLabs has not faced a major filed lawsuit as of early 2026, but the sector has entered a period of active legal action from voice actors and rights holders. The removal of ElevenLabs' 'Adam' default voice in 2023 following misuse to recreate non-consensual celebrity content demonstrated the company's reactive stance, raising questions about whether proactive IP clearance for default voice training data has been sufficient. SAG-AFTRA's ongoing AI voice negotiations with entertainment studios, and EFF's analysis highlighting fragmented right-of-publicity laws across 35 US states, confirm that the legal landscape is unstable. [CR004][CR014][CR023][CR025] The FTC's Voice Cloning Challenge (January 2024) signals regulatory intent to impose safety standards on AI voice APIs. US Congressional hearings in 2024-2025 following the Biden deepfake robocall incident have increased the probability of federal legislation within ElevenLabs' investment horizon. ElevenLabs' CEO has committed publicly to regulatory engagement and a compliance roadmap, but the adequacy of its trust-and-safety measures has not been independently audited. [CR005][CR015][CR021][CR028][CR039][CR040]
| Risk / Rule | Jurisdiction | Status | Likelihood | Severity | Mitigation | Residual Exposure |
|---|---|---|---|---|---|---|
| FCC TCPA DA-24-146 (AI robocalls illegal) | United States | In force Jan 2024 | High – affects US enterprise telephony customers | High – non-compliance triggers FCC enforcement, fines, carrier blocking | API usage terms prohibit TCPA-violating outbound calls; compliance guidance published | Medium – ElevenLabs liable if customer misuse is demonstrably enabled |
| EU AI Act Article 52 (synthetic voice labelling) | European Union | Enforcement begins 2026 | High – labelling obligation applies to all synthetic voice outputs | Medium – fines up to €15M or 3% global revenue for non-compliance | Watermarking and disclosure documentation provided; enterprise contracts updated | Medium – enforcement uncertainty; EU market growth at risk |
| GDPR / EDPB biometric voice print rules | European Union | Current | Medium – triggered when voice prints stored or processed | High – GDPR Article 9 requires explicit consent; fines up to 4% global revenue | Data processing agreements updated; explicit consent flows for voice cloning | Medium – compliance complexity may deter EU enterprise adoption |
| FTC Voice Cloning Challenge / potential rule | United States | 2024 challenge launched; rulemaking possible 2025-2026 | Medium – FTC rulemaking not yet final | High – mandatory safety standards could require API changes | Engaged FTC proceedings; safety whitepaper published | High – federal rule could impose material product changes |
| Voice actor right-of-publicity litigation | United States (state-level) | Active sector litigation; ElevenLabs not yet named in major filed case | Medium – growing wave of sector suits; ElevenLabs IP clearance not publicly verified | High – damages, injunctions, forced product changes | Training data consent processes updated; default voices reviewed | High – fragmented state laws, no federal safe harbour |
| SAG-AFTRA AI voice collective bargaining provisions | United States | Active negotiations; provisions in studio contracts 2024-2025 | Medium – entertainment vertical exposure | Medium – entertainment customers may need contract restructuring | Monitoring SAG-AFTRA guidance; enterprise legal review ongoing | Medium – entertainment vertical revenue at risk if compliance costly |
| Risk | Key Individuals | Severity | Mitigation | Residual Exposure |
|---|---|---|---|---|
| CEO key-person dependency | Mati Staniszewski | High – investor relationships, strategic vision, product direction | No disclosed succession plan; equity vesting incentives | High – no named successor or co-CEO structure |
| CTO key-person dependency | Piotr Dąbkowski | High – core model architecture and R&D roadmap | Engineering team depth growing; no disclosed CTO succession | High – single-point technical leadership |
| Geopolitical disruption to Poland/Warsaw engineering hub | ~150-200 Warsaw-based engineers (estimated) | Medium – engineering capacity reduction, talent dislocation | Distributed teams in New York and London; remote-capable | Low-Medium – EEA geopolitical risk elevated but manageable |
| Talent competition from BigTech for AI/ML engineers | Entire engineering team | Medium – AI talent market hypercompetitive | Competitive compensation, equity, mission-driven culture | Medium – risk of targeted recruitment by OpenAI, Google, Anthropic |
| Founder dilution and board control post-Series D | Staniszewski, Dąbkowski | Low-Medium – founder authority may dilute with large institutional investors | Founding team retains operational control; investor composition supportive | Low – well-aligned investor syndicate (Sequoia, Andreessen, NEA) |
Two-dimensional risk heatmap mapping likelihood (rows) against impact (columns) for ElevenLabs' key risk categories. Cell tone reflects residual severity after mitigations.
[CR006, CR007, CR009, CR012, CR013, CR020]7.2 Operational, Partner, and Competitive Risks
ElevenLabs' operational risk profile is dominated by its cloud infrastructure dependency, deepfake misuse exposure, and accelerating BigTech competition. The company operates entirely on cloud infrastructure, primarily AWS, making it vulnerable to GPU capacity constraints, pricing changes, and unplanned outages. Its Conversational AI product requires real-time voice synthesis with sub-75ms latency, meaning any sustained AWS infrastructure disruption could breach enterprise SLAs and trigger churn among the most valuable customer segment. The NVIDIA GPU supply chain dependency adds another layer of risk: export restrictions on advanced GPUs or supply shortfalls could increase ElevenLabs' inference costs or limit capacity expansion. [CR006][CR032][CR037] Deepfake misuse remains the most structurally difficult operational risk. Despite deploying content safety tools — including abuse detection, usage monitoring, and AI SynthID-compatible watermarking — MIT Technology Review's 2024 research found that AI-generated voice detection tools achieve less than 70% accuracy in blind tests, suggesting ElevenLabs' watermarking programme may be insufficient as a robust compliance or reputational defence. The documented misuse of ElevenLabs' technology in the 2024 Biden robocall deepfake and various phone scam campaigns demonstrates that even policy-compliant operations create reputational liability. [CR009][CR015][CR016][CR022] On the competitive side, OpenAI's launch of GPT-4o with native voice capabilities in May 2024 represents the single largest competitive threat to ElevenLabs' market position. OpenAI's TTS API was priced at $15/million characters in 2025 — roughly 50% cheaper than ElevenLabs' base pricing — with Microsoft and Google offering even lower rates. Open-source TTS models including Coqui XTTS and StyleTTS2 have reached near-commercial-grade quality, enabling self-hosted alternatives for price-sensitive or on-premise-requiring customers. MOS score convergence across providers indicates model commoditization is proceeding faster than ElevenLabs' premium pricing implies. [CR007][CR008][CR019][CR020] The risk of BigTech bundling TTS as a near-zero-marginal-cost feature within their AI developer platforms is the most consequential long-term scenario. ElevenLabs' current moat rests on voice quality leadership, low latency (Flash model), and a diversified ecosystem (Voice Library, AI Dubbing, multilingual support); if this quality gap closes within 18-24 months, the pricing premium for ElevenLabs' self-serve tier would be difficult to sustain. The absence of an on-premise deployment option further constrains ElevenLabs' ability to compete for government, healthcare, and financial-services customers with strict data residency requirements. [CR019][CR029][CR031]
| Failure Mode | Likelihood | Severity | Mitigation Maturity | Residual Exposure | Unresolved Gap |
|---|---|---|---|---|---|
| AWS cloud outage disrupting Conversational AI real-time streams | Low – AWS 99.99% SLA | High – enterprise SLA breaches, churn risk | Medium – multi-region AWS deployment reduces single-AZ risk | Medium – single-cloud dependency remains | No disclosed multi-cloud or on-premise fallback |
| Deepfake misuse damaging brand reputation and triggering regulation | High – ongoing misuse documented | High – regulatory action, enterprise customer defection | Medium – abuse detection, watermarking deployed | High – technical detection accuracy <70% | Independent audit of content safety efficacy not available |
| GPU capacity constraints increasing latency or inference costs | Low-Medium – NVIDIA supply chain risks | Medium – SLA violations, margin compression | Low – no disclosed capacity hedging strategy | Medium – NVIDIA dependency unhedged | No public multi-supplier GPU strategy disclosed |
| Self-serve API abuse generating excess GPU costs | Medium – API-first model structurally exposed | Medium – unit economics impacted below headline ARR | Medium – rate limiting and abuse detection in place | Medium – structural exposure ongoing | Abuse loss rate not publicly disclosed |
| Security breach of voice print biometric data | Low – no reported incidents to date | Very High – GDPR fines, customer loss, regulatory escalation | Low – no public independent security audit | High – biometric data high-value target | No SOC 2 or ISO 27001 certification status disclosed |
| Risk Category | Key Monitoring Indicator | Thesis-Break Threshold | Lead Time Signal | VC Action |
|---|---|---|---|---|
| BigTech competitive bundling | OpenAI/Google TTS API pricing vs. ElevenLabs | OpenAI or Google launches free TTS tier with >80% quality parity | GPT-4.x voice quality benchmarks published; pricing announcements | Reduce position; accelerate exit or bridge to acquirer |
| Regulatory / legal enforcement | FTC rulemaking progress; Congressional bill status; filed litigation | Federal mandatory consent rule enacted; ElevenLabs named in major class action | Bill passage; FTC NPRM published; first major settlement in sector | Increase compliance DD; model remediation timeline |
| Financial deterioration | ARR growth rate YoY; gross margin trend; NRR | ARR growth falls below 80% YoY AND gross margin declines below 60% | Quarterly ARR updates; Series E data room | Bridge or co-invest evaluation; exit option analysis |
| Reputational / deepfake crisis | Consumer Reports ratings; adverse media sentiment index; enterprise churn | High-profile deepfake incident directly attributed to ElevenLabs; NRR declines 20%+ | Emerging adverse media; customer support ticket spikes | Crisis response assessment; customer retention DD |
| Open-source commoditization | Hugging Face TTS leaderboard; community adoption metrics | Open-source model achieves >95% MOS parity with ElevenLabs at <$0.001/char | GitHub stars, API downloads, enterprise open-source adoption reports | Re-evaluate moat; assess enterprise switching velocity |
Directed acyclic graph showing how primary risk categories propagate through ElevenLabs' business to affect revenue, customers, and valuation.
[CR007, CR009, CR020, CR029, CR033]7.3 Financial, Execution, and Portfolio Kill Criteria
ElevenLabs' financial risk profile benefits from a strong cash position (~$550M post-Series D) and rapid ARR growth ($120M to $330M in 2025, 175% YoY). However, several financial risks warrant close monitoring. Gross margins are estimated at 65-75%, below typical SaaS benchmarks of 75-80%+, primarily because real-time and low-latency GPU inference is computationally intensive. As Conversational AI grows as a share of revenue, GPU cost per session could widen this gap unless hardware costs continue declining. Self-serve API abuse — where bad actors generate large volumes of audio on free or low-cost plans — is a structural cost leak that compresses unit economics below the headline ARR numbers suggest. [CR011][CR012][CR038] Key-person dependency is an execution risk that is difficult to hedge. Mati Staniszewski (CEO) and Piotr Dąbkowski (CTO) are the technical and commercial anchors of ElevenLabs; no succession plan or executive bench depth has been disclosed. Poland's Warsaw hub provides engineering depth, but the geopolitical risk from Eastern European instability, though currently low probability, is non-zero. Customer concentration adds financial fragility: an estimated 20-30% of ARR is attributable to a small number of anchor enterprise customers, meaning the loss of two or three accounts could trigger double-digit ARR decline and investor concern. Net revenue retention, estimated by analysts at 130-150%, is unverified; if it fell below 110%, the growth narrative would face serious pressure. [CR013][CR017][CR030][CR036] The thesis-break scenario for a VC investor is a conjunction of three concurrent events: (1) a major BigTech player bundles equivalent voice quality at near-zero marginal cost; (2) regulatory enforcement materially restricts ElevenLabs' enterprise telephony and media markets; and (3) a significant data breach or high-profile deepfake incident causes enterprise churn above 20% NRR decline. Any single trigger is insufficient alone, but two of three occurring within the same 12-month period would require a fundamental reassessment of the investment. Monitoring indicators include BigTech pricing actions, Congressional bill progress, Consumer Reports ratings, and ElevenLabs' quarterly ARR update (expected at Series E). [CR033][CR034]
| Dependency | Counterparty | Role | Concentration | Failure Scenario | Severity | Mitigation | Residual Exposure |
|---|---|---|---|---|---|---|---|
| Cloud compute / GPU inference | AWS (primary) | Core infrastructure for all model serving | Very High – single primary cloud vendor | AWS pricing increase +30% or extended outage | High | Multi-region AWS; monitoring and alerting | Medium – single-cloud risk unresolved |
| GPU hardware supply | NVIDIA | H100/A100 GPU provisioning via AWS | High – NVIDIA dominant GPU supplier | Export controls or supply constraints drive GPU cost spike | Medium | Spot instance flexibility; capacity reservation | Medium – NVIDIA monopoly dependency |
| API distribution / ecosystem | Salesforce, NVIDIA, Apple | Enterprise platform distribution partners | Medium – each represents <20% of ARR individually | Partner terminates integration or launches competing product | Medium | Diversified partner base; direct enterprise sales | Low-Medium – no single partner dominates |
| CDN / edge delivery | Cloudflare, AWS CloudFront | Sub-75ms latency delivery to end users | High – few CDN alternatives at required scale and latency | CDN pricing increase or capacity constraint violates SLAs | Medium | Multi-CDN strategy partially in place | Low – CDN market competitive |
| Enterprise anchor customers | Deutsche Telekom, Revolut, Square, TIME, others | ARR concentration; enterprise social proof | Medium – top 3 accounts estimated 10-15% ARR | Loss of anchor customer triggers ARR decline and investor concern | High | Customer success, multi-year contracts, product stickiness | Medium – concentration risk unquantified publicly |
Directed graph showing ElevenLabs' critical external dependencies and their connections to core business capabilities.
[CR006, CR026]7.4 Exhibits
08Valuation
8.1 Investment Thesis, Anti-Thesis, and Recommendation
ElevenLabs' investment thesis at $11B rests on four compounding pillars. First, the AI voice market is large and fast-growing: Grand View Research forecasts the TTS and voice AI market reaching approximately $50B by 2030 at a 23% CAGR. Second, ElevenLabs holds a durable quality and ecosystem moat: its Flash model's sub-75ms latency, the 3,000+ voice Voice Library, and 32-language AI Dubbing capability collectively represent a platform defensibility that a single-product TTS tool cannot match. Third, its growth rate of 175% YoY at $330M ARR is exceptional — placing it in the top 1% of SaaS companies globally by Rule of 40 score (estimated 170-200+). Fourth, the Series D investor syndicate — Sequoia, a16z, Thrive, T. Rowe Price, Khosla, Index, Redpoint, Lux — represents the highest-conviction AI investor set globally, providing not just capital but distribution and governance endorsement. [CV018][CV024][CV025][CV015] Sequoia frames ElevenLabs as the 'voice layer of the AI stack,' analogous to Twilio's communications layer role — a platform multiple rather than a product multiple — which justifies a premium above the SaaS peer group. Meritech's public cloud index confirms that AI-category SaaS companies with >100% NTM growth commanded 25-40× NTM revenue in late 2025; ElevenLabs' 33× is within this range and arguably modest given its growth rate and ecosystem breadth. [CV022][CV023][CV024] The anti-thesis centres on three vectors: (1) BigTech bundling equivalent TTS quality at near-zero cost within 18-24 months; (2) regulatory restrictions (FCC TCPA, EU AI Act, potential FTC rulemaking) materially constraining the enterprise telephony use case; (3) open-source quality convergence reducing the quality premium that underwrites ElevenLabs' pricing. The key test: if ElevenLabs' NRR remains above 130% through 2026-2027, the anti-thesis is losing. If NRR falls below 110%, it is winning. [CV019][CV028] Recommendation: **Watch / Conditional Buy** at current $11B valuation. The risk-adjusted return for new Series D investors is compressed versus earlier entry points — base case implies flat to modest upside versus current valuation — but the bull case (enterprise telephony and Conversational AI platform scaling) offers 2-3× return in 5 years for investors with conviction in the platform thesis and regulatory resilience. [CV034][CV039]
| Dimension | Assessment | Rationale |
|---|---|---|
| Overall Recommendation | Watch / Conditional Buy | Strong growth and investor syndicate justify valuation; return profile compressed at Series D entry for new investors |
| Confidence Level | Medium | ARR growth and quality moat are verifiable; margin, NRR, and customer concentration are estimated, not disclosed |
| Risk Rating | Medium-High | Regulatory (FCC/EU AI Act), competitive (BigTech TTS bundling), and reputational (deepfake) risks are material but not imminent |
| Valuation Stance | Fairly Valued at $11B (33× ARR) | Modest premium to AI SaaS public comps at 25-33× NTM; within range for a category leader at 175% growth |
| Target Return (Bull Case) | 1.5-2.2× in 5 years (IPO 2028-2029) | Requires $900M+ ARR and 18-22× multiple at IPO; achievable in bull case |
| Target Return (Base Case) | 0.7-1.0× (flat to slight upside) | Base case ARR of $600-750M at 12-15× multiple yields flat to modest uplift from $11B entry |
| Hold Recommendation | 5-7 years for current investors | Voice AI platform thesis needs full enterprise telephony and Conversational AI cycle to play out |
| Exit Recommendation | IPO preferred; strategic acquisition possible | M&A buyer (Microsoft, Apple, Salesforce) possible at $15-20B; IPO preferred for valuation maximization |
| Company | Type | Valuation | ARR (Est.) | ARR Multiple | Growth Rate | Relevance |
|---|---|---|---|---|---|---|
| ElevenLabs (current) | Private | $11B (Feb 2026) | $330M (2025) | ~33× | 175% YoY | Subject company |
| SoundHound AI (SOUN) | Public | $1.5-2B market cap | ~$90M | ~17-22× | 55-65% YoY | Closest public comparable; lower growth penalizes multiple |
| Nuance Communications (acq.) | M&A exit | $19.7B (Microsoft, 2022) | ~$1.6B | ~12× | Moderate (mature) | Strategic M&A floor at scale; implies ElevenLabs needs $900M+ ARR |
| Anthropic | Private | ~$60-75B (2025) | ~$1B+ | ~50-75× | >200% YoY | Upper bound for AI infra premium; LLM platform, not pure voice |
| Perplexity AI | Private | ~$9B (2025) | ~$100M | ~90× | >200% YoY | Upper bound for consumer AI premium; high multiple reflects search disruption narrative |
| Cartesia AI | Private | ~$400-600M (est., 2025) | <$10M | ~50-80× | Early stage | Lower-bound voice AI comp; early stage inflates multiple |
| Deepgram | Private | ~$700M-$1B (est., 2024) | ~$50-80M | ~10-15× | 50-70% YoY | Speech API comparable; lower quality differentiation drives lower multiple |
| Mistral AI | Private | ~$6B (2025) | ~$100-150M | ~40-60× | >200% YoY | European AI infra; premium reflects LLM/API platform, not voice |
Decision logic from initial thesis assessment through key diligence gates to final investment recommendation.
[CV034, CV039]Key investment metrics for ElevenLabs at the time of the $11B Series D, benchmarked against SaaS top-quartile norms.
[CV001, CV008, CV009, CV016, CV017]8.2 Valuation Context, Comparables, and Return Analysis
ElevenLabs' $11B valuation at $330M ARR represents a 33× ARR multiple. Within the private AI landscape, this positions ElevenLabs at the moderate end: Anthropic was valued at implied 50-60× ARR, Perplexity at ~90× ARR, and Mistral at 40-50× ARR in their respective 2025-2026 funding rounds. ElevenLabs' relative modesty on multiple (despite superior unit economics and ARR per employee) may reflect the market's recognition of voice AI's higher regulatory risk profile relative to LLM infrastructure plays. [CV001][CV002][CV033] On public comparables, SoundHound AI (SOUN) traded at 10-20× NTM revenue with slower ARR growth, providing a floor for ElevenLabs' premium. Bessemer's State of the Cloud 2025 benchmarks suggest top-quartile SaaS companies growing >100% YoY commanded 20-30× NTM revenue in public markets; ElevenLabs' 33× private multiple is only modestly above this range and defensible for a category leader. The Meritech public cloud index supports this: AI-category SaaS leaders with >100% growth traded at 25× median, with leaders at 40× — placing ElevenLabs in the 70th percentile of the AI multiple distribution. [CV003][CV022][CV036] On M&A comparables, Microsoft's $19.7B Nuance acquisition at ~12× ARR sets a strategic buyer reference point: ElevenLabs would need $900M-$1.6B ARR before a strategic buyer at equivalent terms would meet the $11B valuation. This is achievable in the bull case (2027-2028) but tight in the base case. [CV004] A new Series D investor at $11B requires a $22-27.5B exit to achieve 15-18% IRR over 5 years. In the bull case, an IPO at $18-22B in 2028-2029 at 15-20× $1.1B ARR is plausible; in the base case, a flat or modest uplift exit of $10-13B generates sub-10% IRR. Probability-weighted expected return is estimated at 1.8-2.2× capital, which is acceptable but not exceptional for late-stage VC. [CV017][CV030][CV031] ElevenLabs' preference stack of approximately $781M total raised, assuming standard 1× non-participating liquidation preference, protects Series D investors from downside but does not amplify common equity returns. T. Rowe Price's participation signals IPO proximity (2027-2028 window), with institutional investors typically entering 18-36 months before S-1 filing. [CV006][CV021][CV037]
| Thesis Pillar | Supporting Evidence | Anti-Thesis Risk | Monitoring Signal |
|---|---|---|---|
| Voice AI platform dominance | 175% YoY ARR, 3K+ voices, 32 langs, Flash <75ms latency | BigTech bundles TTS at near-zero cost within 18-24 months | OpenAI/Google TTS API pricing trend; MOS quality benchmarks |
| Durable quality moat | MOS benchmark leadership, instant voice cloning, AI Dubbing | Open-source TTS reaches 95%+ MOS parity by 2026 | Hugging Face TTS leaderboard; enterprise open-source adoption |
| Enterprise expansion flywheel | ARR per employee $825K; enterprise growing >200% YoY | NRR declines below 120% as enterprise churn accelerates | Quarterly NRR disclosure; Series E data room |
| Category leader multiple | Top-tier investor syndicate; 33× ARR within AI SaaS range | Regulatory restrictions constrain enterprise telephony TAM | FTC rulemaking progress; FCC TCPA enforcement actions |
| Platform network effects | Voice Library creator/user flywheel; 3000+ contributed voices | Competing Voice Libraries emerge (OpenAI, Google) with larger creator communities | OpenAI/Google Voice Library launch announcements |
| Trigger | Type | Probability (12 months) | Impact | Lead Signal | Response |
|---|---|---|---|---|---|
| OpenAI/Google launches free TTS tier at >80% ElevenLabs MOS quality | Competitive | 20% | High – self-serve ARR at risk; pricing power collapse | Public model quality benchmarks; pricing announcements | Reduce position; model exit timeline; bridge options |
| FTC mandatory AI voice safety rule enacted (NPRM → final rule) | Regulatory | 15% | High – API modifications required; enterprise telephony TAM constrained | FTC rulemaking calendar; Congressional bill passage | Compliance DD; product roadmap review; legal counsel engagement |
| ElevenLabs technology named in major class action or DOJ enforcement | Legal | 10% | Very High – reputational damage; enterprise churn; regulatory escalation | Court filings; PACER monitoring; adverse media spike | Crisis response; legal reserve adequacy assessment |
| NRR declines below 110% for two consecutive quarters | Financial | 15% | High – compounding ARR growth story breaks; down-round risk | Quarterly ARR disclosures; Series E data room access | Down-round modelling; exit option analysis |
| Open-source TTS achieves 95%+ MOS parity on Hugging Face leaderboard | Technical | 25% | Medium – self-serve pricing pressure; developer migration | Hugging Face TTS leaderboard; community adoption metrics | Re-evaluate moat; enterprise-only pivot assessment |
Exit valuation estimates for ElevenLabs across bull, base, and bear ARR scenarios at different revenue multiples, illustrating return range sensitivity.
All ARR and multiple values are analyst estimates based on comparable set and growth deceleration assumptions; actual exit values depend on market conditions, timing, and investor structure.
[CV017, CV015]8.3 Scenarios, Exit Readiness, and Final Diligence Asks
**Bull case (25% probability):** ARR reaches $900M-$1.1B by end-2027 on continued 80-100% YoY growth; gross margins improve to 73-76% as Conversational AI revenue scales on better GPU unit economics; IPO in 2028 at 18-22× ARR → $16.2B-$24.2B valuation. Implied 1.5-2.2× return for Series D investors. Key drivers: enterprise telephony adoption accelerates, EU AI Act compliance managed, Conversational AI scales to top-3 enterprise product globally. **Base case (55% probability):** ARR reaches $600M-$750M by end-2027 on 60-70% YoY growth with some BigTech pricing pressure; gross margins reach 70%; IPO or acquisition in 2028-2029 at 12-15× ARR → $7.2B-$11.25B valuation. Implied 0.7-1.0× return for Series D investors (modest value preservation, not return). Key risk: NRR declines from 140% to 120%, slowing the ARR compounding. **Bear case (20% probability):** BigTech commoditizes TTS within 18 months, ARR growth falls to 40-50% YoY; regulatory headwinds constrain US telephony vertical; flat or down round at $8-9B in 2027; limited IPO optionality. Series D investors hold at or below cost. [CV010][CV011][CV012][CV026] ElevenLabs is estimated to be 2-3 years from IPO readiness. Prerequisites include: audited GAAP financials (required for S-1), a CFO with public-company experience, demonstrated sustained gross margin improvement above 70%, and resolution of material regulatory uncertainty in the US and EU. The T. Rowe Price and institutional co-investors in Series D create a structural pressure toward IPO that is likely to accelerate governance improvements. [CV020][CV029][CV035] Final diligence asks: (1) Audited 2024-2025 financials with segmented gross margin by product line; (2) NRR by customer cohort vintage (enterprise vs. self-serve separately); (3) Customer concentration schedule — top 10 accounts as share of ARR; (4) Independent GDPR and SOC 2 audit results; (5) CPU/GPU infrastructure cost breakdown per product; (6) Regulatory engagement log with FCC, FTC, and EU AI Office; (7) Series D full term sheet including governance rights and anti-dilution provisions. [CV027][CV040]
| Scenario | Probability | ARR by 2027 (Year-End) | Growth Rate | Gross Margin | Exit Valuation | Exit Multiple | Return for Series D Investor |
|---|---|---|---|---|---|---|---|
| Bull | 25% | $900M-$1.1B | 80-100% YoY | 73-76% | $16B-$24B | 18-22× ARR (IPO 2028) | 1.5-2.2× capital |
| Base | 55% | $600M-$750M | 60-70% YoY | 69-72% | $7.2B-$11.25B | 12-15× ARR (IPO/M&A 2028-2029) | 0.65-1.0× capital |
| Bear | 20% | $400M-$500M | 40-50% YoY | 58-65% | $5B-$8B | 10-12× ARR (down/flat round 2027) | 0.45-0.7× capital |
| Probability-Weighted | 100% | $620M-$780M (weighted) | 65-75% YoY (weighted) | 67-71% (weighted) | $8.5B-$12B (weighted) | ~33× current ARR | ~1.8-2.2× capital (expected) |
| Diligence Ask | Priority | Why It Matters | If Not Provided |
|---|---|---|---|
| Audited 2024-2025 GAAP financial statements with segmented gross margin by product | Critical | All margin and unit economics estimates are unverified; public market investors will require audited financials | Do not commit at $11B without verifiable margin data |
| NRR by customer cohort: enterprise vs. self-serve separately, by vintage year | Critical | Platform thesis depends on strong expansion; estimated 130-150% NRR must be confirmed segment-by-segment | Adjust probability weight toward base case; lower conviction |
| Customer concentration schedule: top 10 accounts as % of ARR | High | Estimated 20-30% ARR concentration in anchor customers is a material undisclosed financial risk | Assume high concentration; haircut base-case valuation by 10-15% |
| Independent SOC 2 Type II and GDPR audit results | High | Biometric voice data requires validated security and compliance posture before EU enterprise rollout | Flag as blocker for EU enterprise thesis; raise compliance timeline concern |
| GPU/CPU infrastructure cost breakdown by product line | High | Gross margin by product determines which lines are value-accretive and which compress enterprise economics | Cannot underwrite margin improvement path to 75%+ without this |
| Series D full term sheet (preference, anti-dilution, governance rights) | Medium | Liquidation preferences and governance provisions affect return distribution in exit scenarios | Assume 1× non-participating preference; model conservative equity waterfall |
| Regulatory engagement log with FCC, FTC, EU AI Office | Medium | Proactive regulatory engagement reduces binary risk of enforcement action | Raise probability of adverse regulatory scenario from 15% to 25% |
Shows the low and high return range for Series D investors ($11B entry) across bear, base, and bull case exit scenarios.
All values in $B. Entry = $11B. Bear: 0.4-0.7× capital. Base: 0.65-1.0×. Bull: 1.3-2.4×. Weighted: 0.7-1.3× capital assuming 20/55/25 scenario weights.
[CV020, CV016]8.4 Exhibits
Disclaimer
This report is a public-evidence diligence snapshot, not investment advice. Important financial, legal, technical, and contractual facts remain non-public and should be verified directly with management and primary documents before any investment decision.
Evidence index
| ID | Statement | Confidence | Sources |
|---|---|---|---|
| CO001 | ElevenLabs was founded in January 2022 by Mati Staniszewski and Piotr Dąbkowski in New York. | High | SO001, SO012 |
| CO002 | ElevenLabs is headquartered in New York, NY, with additional offices in London and Warsaw. | Medium | SO001, SO012 |
| CO003 | ElevenLabs' business model combines a freemium self-serve tier starting at $22/month with enterprise contracts that can reach $2M+ annually. | Medium | SO003, SO013 |
| CO004 | ElevenLabs' enterprise and self-serve revenue segments were approximately 50/50 in split as of late 2025. | Medium | SO003 |
| CO005 | Mati Staniszewski, ElevenLabs CEO, previously worked as a deployment strategist at Palantir Technologies before co-founding the company. | Medium | SO011, SO013 |
| CO006 | Piotr Dąbkowski, ElevenLabs CTO, previously worked as a machine learning engineer at Google and was named a TIME Magazine AI Top 100 Innovator. | High | SO001, SO012 |
| CO007 | ElevenLabs headcount grew from 30 employees in January 2024 to approximately 120 by January 2025, and approximately 330 by late 2025. | Medium | SO001, SO003 |
| CO008 | ElevenLabs has not publicly disclosed a CFO, COO, CPO, or other C-suite executives beyond the two founding co-leaders. | Medium | SO001, SO014 |
| CO009 | Sequoia Capital partner Andrew Reed joined ElevenLabs' board in conjunction with the February 2026 Series D funding round. | Medium | SO004, SO014 |
| CO010 | ElevenLabs raised a $2M seed round in 2022 and a $19M Series A in January 2023, with Sequoia Capital participating in the Series A. | Medium | SO001, SO012 |
| CO011 | ElevenLabs raised an $80M Series B in January 2024 at a $1.1B valuation, led by Andreessen Horowitz with participation from ICONIQ, Salesforce Ventures, and Smash Capital. | High | SO002, SO003, SO012 |
| CO012 | ElevenLabs raised a $180M Series C in January 2025 at a $3.3B valuation, co-led by a16z and ICONIQ Growth, with strategic investors including Deutsche Telekom, LG Technology Ventures, HubSpot Ventures, NTT DOCOMO Ventures, and RingCentral Ventures. | High | SO001, SO002, SO010 |
| CO013 | ElevenLabs raised a $500M Series D in February 2026 at an $11B valuation, led by Sequoia Capital, with participation from a16z, ICONIQ, Lightspeed Venture Partners, Evantic Capital, Bond, and NEA. | High | SO004, SO005, SO014, SO017, SO021 |
| CO014 | ElevenLabs' total funding across all rounds is approximately $781M as of February 2026. | Medium | SO003, SO004 |
| CO015 | Strategic investors in ElevenLabs include Deutsche Telekom, NTT DOCOMO Ventures, HubSpot Ventures, Salesforce Ventures, LG Technology Ventures, and RingCentral Ventures, each with implied commercial partnership interests. | Medium | SO001, SO012 |
| CO016 | ElevenLabs reached $120M ARR at end-2024 and $330M ARR at end-2025, representing approximately 175% year-over-year growth. | Medium | SO003, SO013, SO021 |
| CO017 | ElevenLabs claims that employees at over 60% of Fortune 500 companies used its platform as of January 2025. | Medium | SO001 |
| CO018 | ElevenLabs users have collectively generated over 1,000 years of audio content since the platform launched in January 2023. | Medium | SO001 |
| CO019 | ElevenLabs' Voice Library has over 5,000 shared community voices and has distributed more than $2M in payouts to voice creators. | Medium | SO001 |
| CO020 | ElevenLabs' largest enterprise contracts reach up to $2M each, per third-party analysts. | Low | SO003 |
| CO021 | ElevenLabs' conversational AI agents product saw 250,000 agents built by developers within two months of launch in late 2024. | Medium | SO001 |
| CO022 | ElevenLabs technology was used in a January 2024 fake robocall impersonating President Biden to discourage voters in the New Hampshire Democratic primary. | Medium | SO006, SO012 |
| CO023 | The FCC banned AI-generated voices in robocalls following the Biden deepfake robocall incident in which ElevenLabs' technology was used. | Medium | SO006, SO008 |
| CO024 | Voice actors and authors have filed lawsuits against ElevenLabs alleging unauthorized use of their voices as AI training data. | Medium | SO007 |
| CO025 | A March 2025 Consumer Reports study found AI voice cloning tools including ElevenLabs lacked robust safeguards, with consent mechanisms limited to checkbox acknowledgment. | High | SO008, SO009, SO016 |
| CO026 | Russian influence operations allegedly used ElevenLabs' voice AI tools to generate convincing fake audio for propaganda purposes. | Low | SO018 |
| CO027 | ElevenLabs has implemented watermarking on generated voice samples and partnered with deepfake detection companies as safeguard responses to misuse. | Medium | SO006 |
| CO028 | ElevenLabs opened a Warsaw R&D center in 2025 and expanded its India team to focus on Indic language coverage. | High | SO001, SO019 |
| CO029 | ElevenLabs' Impact Program offers free access to over 80 organizations in accessibility, education, and culture, including ALS patient voice restoration. | Medium | SO001 |
| CO030 | ElevenLabs management indicated a potential IPO within 2-3 years as of the February 2026 Series D announcement. | Medium | SO021, SO004 |
| CO031 | ElevenLabs' AI dubbing product supports localization in 32 languages, localizing over 1 million hours of audio content. | Medium | SO001 |
| CO032 | Named enterprise customers include TIME, The Washington Post, The New Yorker, HarperCollins, ESPN, Deutsche Telekom, Revolut, Square, NVIDIA, Perplexity, Chess.com, and Paradox Interactive. | Medium | SO001 |
| CO033 | ElevenLabs' platform launch occurred in January 2023; prior to that the company was in stealth/private beta since founding in January 2022. | High | SO001, SO012 |
| CO034 | ElevenLabs reported approximately $550M in cash on hand following the February 2026 Series D, providing extended runway. | Low | SO003 |
| CO035 | ElevenLabs' gross margin and net revenue retention are not publicly disclosed; the company self-describes as 'highly capital-efficient' at the Series D stage. | Medium | SO003, SO014 |
| CM001 | ElevenLabs operates across three core market segments: AI text-to-speech API infrastructure, AI voice cloning and customization, and conversational AI agent infrastructure. | High | SM008, SM019 |
| CM002 | ElevenLabs also addresses the AI audio dubbing and localization market through its 32-language dubbing product, which has localized over 1 million hours of audio. | Medium | SM008 |
| CM003 | Status-quo substitutes for ElevenLabs include human voiceover artists (for content creation), traditional IVR/telephony (for contact centers), and human dubbing studios (for localization). | Medium | SM005, SM006 |
| CM004 | The AI voice generator market was approximately $3.0B in 2024 and is projected to reach $20.4B by 2030, representing a 37.1% CAGR, per MarketsandMarkets research. | High | SM001, SM005 |
| CM005 | Grand View Research estimates the AI voice generators market at $21.75B by 2030 at a 29.6% CAGR, consistent with but slightly higher than MarketsandMarkets. | Medium | SM002 |
| CM006 | The broader voice AI/speech technology market (including ASR, voice assistants, and agents) exceeds $70B by 2030, materially larger than the core AI voice synthesis market. | Low | SM004 |
| CM007 | ElevenLabs' SAM — covering enterprise AI audio, conversational agents, and developer API infrastructure — is estimated at $8–12B by 2027. | Low | SM005, SM010 |
| CM008 | At $330M ARR on a ~$4B total AI voice generator market in 2025, ElevenLabs holds approximately 7–8% market share in the core AI voice generation category. | Low | SM010, SM012 |
| CM009 | ElevenLabs' enterprise customer segments include media/publishing, customer service/contact centers, gaming, telecommunications, and educational technology platforms. | High | SM008, SM010 |
| CM010 | ElevenLabs' enterprise buyer persona is typically the Chief Digital Officer, CTO, or Head of Customer Experience; developer segment buyer is the individual engineer accessing the API. | Medium | SM005, SM010 |
| CM011 | Named enterprise customers include TIME, The Washington Post, HarperCollins, ESPN, Deutsche Telekom, Revolut, Square, NVIDIA, Perplexity, Chess.com, and Paradox Interactive. | Medium | SM008 |
| CM012 | ElevenLabs reports a ~50/50 split between enterprise and self-serve revenue as of late 2025, indicating meaningful penetration of both the developer bottom-up and enterprise top-down channels. | Medium | SM010, SM011 |
| CM013 | LLM proliferation is driving demand for natural, low-latency voice output as the interface layer for AI agents, creating structural pull for ElevenLabs' conversational AI product. | Medium | SM008, SM012 |
| CM014 | AI voice generation costs ($22/month for developer access) are orders of magnitude cheaper than professional voiceover artists ($300–500/hour), creating a powerful economic substitution incentive. | Medium | SM010, SM011 |
| CM015 | ElevenLabs' developer API adoption model parallels Twilio's distribution strategy, using a freemium bottom-up funnel to reach enterprise contracts. | Medium | SM010, SM012 |
| CM016 | Big Tech competitors Google TTS, OpenAI Realtime API, Amazon Polly, and Microsoft Azure Speech all compete in the AI voice infrastructure market with bundling advantages. | High | SM005, SM010 |
| CM017 | Regulatory constraints from the EU AI Act's voice provisions and potential US biometric/voice data laws represent material adoption constraints on AI voice cloning. | Medium | SM009, SM014 |
| CM018 | Open-source voice models including Coqui TTS, Bark, and Kokoro are improving rapidly and approaching commercial-grade quality at zero cost, creating commoditization risk for lower tiers of ElevenLabs' pricing. | Medium | SM016, SM019 |
| CM019 | MarketsandMarkets projects the AI voice market at $20.4B while Research and Markets projects $7.25–8.8B for TTS-only by 2030, reflecting definitional scope differences that create contradictory market size estimates. | Medium | SM001, SM003 |
| CM020 | Enterprise trust barriers—including consent for AI voices in customer interactions, audio watermarking requirements, and compliance with sector-specific regulations—limit AI voice adoption in healthcare and financial services. | Medium | SM009, SM014, SM025 |
| CM021 | The geographic AI voice market growth is global, with APAC and EMEA representing fast-growing segments driven by multilingual content demand; ElevenLabs has expanded to India, Poland, and is targeting LATAM and APAC. | Medium | SM018, SM024 |
| CM022 | ElevenLabs' conversational AI agent product reached 250,000 developer-built agents within two months of launch, indicating strong product-market fit in the contact center and interactive agent market. | Medium | SM008 |
| CM023 | ElevenLabs' SAM serviceable obtainable market (SOM) is estimated at $1–1.5B by 2027, consistent with reaching $500–700M ARR at current growth rates. | Low | SM010, SM012 |
| CM024 | The AI dubbing/localization market represents a distinct adjacent opportunity driven by global content localization demand; ElevenLabs has localized over 1 million hours of audio. | Medium | SM008 |
| CM025 | Enterprise trust in AI voices for customer-facing applications is lower than for internal productivity use cases due to consent, authenticity, and reputational risk concerns. | Medium | SM009, SM022 |
| CM026 | The ElevenLabs ARR per employee is approximately $1M (330M ARR / 330 employees), significantly above SaaS industry median, indicating capital efficiency. | Low | SM010, SM011 |
| CM027 | Model inference GPU compute costs represent an ongoing operational constraint that could compress gross margins if ElevenLabs scales faster than AI chip cost curves decline. | Medium | SM013, SM017 |
| CM028 | ElevenLabs' accessibility segment (Impact Program, 80+ partner orgs) is primarily brand-building and non-revenue generating, but creates social license and regulatory goodwill. | Medium | SM008 |
| CM029 | The contact center AI market is projected to exceed $17B by 2030, representing a large adjacent market for ElevenLabs' conversational agent product. | Low | SM004, SM005 |
| CM030 | ElevenLabs faces switching cost limitations for its API customers — while integration switching cost is moderate (API key change + code update), the re-training of custom voice clones is a meaningful retention factor. | Low | SM010 |
| CM031 | Enterprises in healthcare and finance face specific AI voice adoption constraints including HIPAA, GDPR, and sector-specific data handling requirements for AI-generated voice content. | Medium | SM014, SM020 |
| CM032 | The AI voice market had no dominant player with more than 15% market share as of 2024, suggesting the market remains fragmented and highly competitive. | Medium | SM001, SM005 |
| CM033 | ElevenLabs' market share of 7–8% in core AI voice generation makes it one of the largest pure-play vendors, but Big Tech bundling means total enterprise voice spend is dominated by Google, Amazon, and Microsoft. | Medium | SM010, SM016 |
| CM034 | The fraction of ElevenLabs' 60% Fortune 500 penetration that represents paying enterprise customers versus free-tier users is not publicly disclosed. | Medium | SM010 |
| CM035 | ElevenLabs' gaming segment customers including Paradox Interactive and Inworld demonstrate an adoption path: game developer SDK trial followed by a per-title or revenue-share contract. | Medium | SM008, SM019 |
| CP001 | ElevenLabs' primary direct competitors in the AI TTS API market include Cartesia, Deepgram, OpenAI TTS, Google Cloud TTS, Amazon Polly, Microsoft Azure Speech, Murf AI, Play.ht, and Resemble AI. | Medium | SP001, SP002, SP003 |
| CP002 | Cartesia AI achieved 40–95ms latency for voice synthesis in 2025, undercutting ElevenLabs' 75ms floor, and raised $80M in funding at an undisclosed valuation. | High | SP005, SP007, SP017 |
| CP003 | Cartesia supports approximately 15–20 languages and offers on-premise deployment, whereas ElevenLabs supports 70+ languages with no on-premise option as of 2025. | Medium | SP002, SP005 |
| CP004 | Deepgram's 'Aura' TTS model achieves approximately 150ms latency and is primarily English-focused with limited voice cloning capability compared to ElevenLabs. | Medium | SP005, SP014 |
| CP005 | Deepgram offers on-premise TTS deployment and specialises in speech-to-text alongside TTS, giving it stronger enterprise data-security positioning than ElevenLabs. | Medium | SP005, SP014 |
| CP006 | ElevenLabs achieved a Mean Opinion Score of approximately 4.5/5 in independent blind listening tests in 2025, ranking first among API-accessible TTS providers. | Medium | SP001, SP022 |
| CP007 | OpenAI's TTS API offers approximately 200ms latency, 57 languages, limited voice customisation and no voice cloning, competing primarily through GPT ecosystem integration. | Medium | SP002, SP008 |
| CP008 | OpenAI GPT-4o real-time audio mode, launched in 2024, enables latency below 300ms for voice-to-voice conversation, representing a longer-term competitive threat to ElevenLabs' conversational agent product. | Medium | SP008, SP020 |
| CP009 | Murf AI targets non-technical content creators and corporate e-learning teams, with a studio-centric UI, approximately 120+ preset voices, and pricing starting at $29/month per user. | Medium | SP002, SP015 |
| CP010 | Murf AI does not offer a real-time synthesis API for conversational AI agents, limiting its addressable market to pre-rendered content production versus ElevenLabs' broader use cases. | Medium | SP001, SP015 |
| CP011 | Resemble AI offers deepfake detection, on-premise deployment, and supports 149 languages, positioning it as the compliance-first alternative to ElevenLabs for regulated industries. | Medium | SP002, SP012 |
| CP012 | Play.ht offers 900+ voices across 142 languages, making it the broadest multilingual voice library, though it trails ElevenLabs in voice naturalness benchmarks. | Medium | SP001, SP016 |
| CP013 | Google Cloud Text-to-Speech offers WaveNet and Neural2 voice models with enterprise SLA coverage and deep integration into Google Cloud infrastructure, pricing at $0.000004 per character at volume. | High | SP009, SP003 |
| CP014 | Amazon Polly integrates natively with AWS and supports 30+ languages, targeting developers already in the AWS ecosystem with pay-per-use pricing around $4 per 1M characters. | High | SP010, SP003 |
| CP015 | Microsoft Azure Cognitive Speech integrates with Microsoft 365, Teams, and Azure OpenAI, giving it enterprise distribution advantages ElevenLabs cannot replicate without partnerships. | High | SP011, SP019 |
| CP016 | ElevenLabs' pricing ranges from $5/month (Starter) to $22/month (Creator) to enterprise custom pricing starting at $1,320/year, with usage-based overages on top. | Medium | SP001, SP013 |
| CP017 | Cartesia prices at approximately $0.038 per 1,000 characters, significantly cheaper than ElevenLabs' volume rates, creating downward pricing pressure in high-volume TTS workloads. | Medium | SP005, SP007 |
| CP018 | Enterprise customers can and do use multiple TTS vendors simultaneously (ElevenLabs for quality, Cartesia for speed, Deepgram for data privacy) indicating multi-homing as a real pattern. | Medium | SP002, SP018 |
| CP019 | ElevenLabs creates switching costs through enterprise API integration depth, custom voice model training data, and embedded Voice Library partnerships with content publishers. | Medium | SP020, SP021 |
| CP020 | ElevenLabs' Voice Library marketplace, where voice creators list licensed voices, creates a network effect: more voices attract more developers who attract more enterprise customers. | Medium | SP020, SP021, SP022 |
| CP021 | VocalCopyCat's 2025 analysis argues ElevenLabs' commercial success exceeds its technical leadership, with Cartesia and Deepgram achieving comparable or better latency at lower cost. | Medium | SP004 |
| CP022 | Consumer Reports' March 2025 assessment found most voice cloning platforms including ElevenLabs had weak safeguards, often limited to consent-checkbox verification, enabling non-consensual cloning. | High | SP025, SP024 |
| CP023 | Near-term entrants likely to disrupt the AI voice market include well-funded foundation model labs (Mistral, Cohere) adding native TTS, and Big Tech voice agent integrations bundled at no marginal cost. | Medium | SP019, SP022 |
| CP024 | Independent developer evaluations in 2025 show ElevenLabs and Cartesia each winning across different benchmarks: ElevenLabs on naturalness, Cartesia on latency and anti-hallucination consistency. | Medium | SP005, SP001 |
| CP025 | ElevenLabs lacks an on-premise or private-cloud deployment option as of 2025, creating a structural gap for regulated industries such as healthcare, defence, and financial services. | Medium | SP002, SP018 |
| CP026 | ElevenLabs' conversational AI agent platform competes with specialised agent infrastructure platforms; no independent benchmark comparing agent platforms head-to-head was publicly available as of mid-2025. | Low | |
| CP027 | Cartesia raised $80M in Series B funding in 2024 led by Andreessen Horowitz, reaching a reported $300M-range valuation and positioning as the speed-optimised challenger to ElevenLabs. | Medium | SP017, SP005 |
| CP028 | No publicly documented cases of ElevenLabs losing a major enterprise contract to a competitor due to pricing or features were identified; high churn risk remains a gap in the evidence base. | Low | |
| CP029 | Google Cloud TTS and Amazon Polly have large installed enterprise bases through their broader cloud relationships, but neither has matched ElevenLabs' voice quality as of 2025 independent evaluations. This quality gap is ElevenLabs' most defensible short-term competitive advantage against incumbent cloud providers. | Medium | SP009, SP010, SP003 |
| CP030 | ElevenLabs' moat assessment by NEA and Salesforce Ventures highlights its proprietary voice data, model quality, and developer ecosystem as durable differentiation factors. | Medium | SP020, SP021 |
| CP031 | Resemble AI's inclusion of deepfake detection and provenance metadata tools gives it a compliance positioning edge over ElevenLabs in industries facing emerging regulatory mandates. | Medium | SP012, SP011 |
| CP032 | ElevenLabs' pricing at premium tiers ($22+/month) is materially higher per character than Big Tech competitors (Google, Amazon) for simple TTS workloads, limiting pure-cost-driven enterprise deals. | Medium | SP016, SP009, SP010 |
| CP033 | Microsoft Azure Speech's direct integration with Microsoft 365 Copilot and Teams positions it as the default voice layer for the large existing Microsoft enterprise install base. | High | SP011, SP015 |
| CP034 | Play.ht's 142-language library outpaces ElevenLabs' 70+ languages in coverage breadth, though ElevenLabs claims deeper model quality per language rather than breadth-first expansion. | Medium | SP001, SP016, SP006 |
| CP035 | The AI TTS market shows clear segmentation: ElevenLabs leads on quality/enterprise; Cartesia on low-latency real-time; Resemble AI on compliance; Murf on no-code content teams; Big Tech on distribution. | Medium | SP003, SP002, SP019 |
| CI001 | ElevenLabs reached $330M ARR at end-2025, up 175% year-over-year from $120M at end-2024. | High | SI001, SI003, SI011 |
| CI002 | ElevenLabs' ARR milestones: $100M in January 2025 (20 months from platform launch), $200M in September 2025, and $330M by end-2025. | High | SI001, SI006, SI003 |
| CI003 | ElevenLabs' revenue split was approximately 50/50 between enterprise and self-serve customers as of late 2025, with enterprise revenue growing over 200% year-over-year. | Medium | SI004, SI017, SI023 |
| CI004 | ElevenLabs' largest enterprise contracts reach $2M per year; most enterprise deals are usage-based custom agreements without public pricing. | Medium | SI017, SI023, SI020 |
| CI005 | ElevenLabs' published self-serve pricing tiers: Free (10,000 characters/month), Starter ($5/month), Creator ($22/month), Pro ($99/month), Scale ($330/month), and Business ($1,320/year base). | Medium | SI021, SI017, SI001 |
| CI006 | ElevenLabs generates four primary revenue streams: API usage fees, self-serve subscriptions, enterprise SaaS contracts, and voice marketplace commissions from the Voice Library. | Medium | SI023, SI019, SI017 |
| CI007 | ElevenLabs does not publicly disclose gross margins; analyst estimates range from 70% to 80% gross margin based on AI SaaS comparables, though the inference compute cost base is uncertain. | Low | SI004, SI017, SI018 |
| CI008 | ElevenLabs reportedly achieved operational breakeven at approximately $200M ARR in mid-2025, earlier than typical enterprise SaaS benchmarks. | Medium | SI001, SI004, SI007 |
| CI009 | ElevenLabs generates approximately $825,000 in ARR per employee with ~400 staff and $330M ARR—a top-decile metric among venture-backed AI companies. | Medium | SI004, SI005, SI017 |
| CI010 | After the $500M Series D in February 2026, ElevenLabs had approximately $550M in cash on hand, providing a multi-year runway at the current burn rate. | Medium | SI009, SI013, SI014 |
| CI011 | ElevenLabs disclosed use of Series D proceeds for: global expansion (Europe, Asia), product R&D including autonomous AI models, and enterprise sales team hiring. | Medium | SI009, SI012, SI010 |
| CI012 | ElevenLabs has no publicly disclosed debt facilities, convertible notes, or project-finance obligations as of the February 2026 Series D. | Medium | SI009, SI025 |
| CI013 | ElevenLabs' investors have signaled an IPO as a likely exit path; insiders and analysts cite late 2026 or 2027 as a plausible window given revenue scale. | Medium | SI010, SI014, SI024 |
| CI014 | ElevenLabs has not disclosed net revenue retention (NRR) or gross revenue retention (GRR); the absence is a diligence gap for assessing enterprise stickiness. | Medium | SI017, SI018 |
| CI015 | ElevenLabs has not disclosed customer acquisition cost (CAC) or payback period; the metric is not calculable from public data given no headcount-by-function breakdown. | Medium | SI017, SI018 |
| CI016 | ElevenLabs' capital intensity is materially lower than AI model peers: it raised $781M total vs OpenAI's $12B+ at similar revenue milestones, implying better capital efficiency. | Medium | SI001, SI005, SI004 |
| CI017 | ElevenLabs' inference compute costs on GPU clusters are not publicly disclosed; industry estimates suggest AI voice synthesis costs are declining as newer architectures (SSMs) replace transformers. | Low | SI018, SI017 |
| CI018 | ElevenLabs' working capital is expected to be minimal due to prepaid annual subscriptions and enterprise upfront billings typical of SaaS; no working capital challenges were publicly flagged. | Low | SI023, SI017 |
| CI019 | ElevenLabs' revenue concentration risk is unknown; no public disclosure exists of top-customer ARR contribution or dependency on any single client. | Medium | SI018, SI017 |
| CI020 | ElevenLabs' ARR growth rate is showing acceleration rather than deceleration: $100M took 20 months, $200M took another 10, and $330M came in just 5 more months as of end-2025. | High | SI001, SI002, SI006 |
| CI021 | ElevenLabs' revenue recognition is a mix of subscription (monthly/annual SaaS) and usage-consumption (API calls billed per character or per minute); the exact split is undisclosed. | Low | SI023, SI017 |
| CI022 | Enterprise contracts at ElevenLabs are typically annual, custom-priced agreements with usage-based tiers, enabling predictable cash flow from upfront billing. | Low | SI020, SI023 |
| CI023 | ElevenLabs' Voice Library marketplace has paid out over $2M to voice creators; marketplace commission revenue is an emerging but not yet material revenue stream. | Medium | SI022, SI023 |
| CI024 | Material financial diligence blockers for ElevenLabs include: undisclosed gross margin, NRR, CAC, churn rate, and burn rate; a VC investor should demand these before making a commitment decision. | Medium | SI018, SI017, SI024 |
| CI025 | ElevenLabs' voice actor lawsuits (Adam and Bella TTS voices) create a contingent liability that has not been quantified publicly and could affect revenue from the accused voice products. | Medium | SI016, SI018 |
| CI026 | ElevenLabs' ARR crossed $100M in January 2025, making it one of the fastest AI companies to reach that milestone in the API-as-a-product category. | High | SI001, SI027 |
| CI027 | ElevenLabs' Series D investors include Sequoia, Andreessen Horowitz, ICONIQ Growth, Lightspeed, and Coatue—a coalition of top-tier growth investors signalling confidence in the company's financial trajectory. | High | SI013, SI014, SI025 |
| CI028 | Enterprise voice AI deployments at ElevenLabs typically include dedicated SLAs, custom voice model training, and integration support—creating higher annual contract values than standard API subscription models. | Low | SI020, SI023 |
| CI029 | ElevenLabs' freemium tier generates platform virality and developer trial at scale; conversion to paid tiers is the primary self-serve revenue engine. | Low | SI001, SI021 |
| CI030 | ElevenLabs' revenue per employee of $825K compares to Databricks ($450K/employee in 2024) and Snowflake ($430K in 2024), demonstrating superior capital-light model scaling. | Medium | SI004, SI005 |
| CI031 | ElevenLabs' enterprise revenue growing >200% YoY in 2025 while maintaining headcount discipline resulted in significant operating leverage compared to its Series B cohort peers. | Medium | SI001, SI007 |
| CI032 | The FTC has identified AI voice cloning as a consumer deception risk; regulatory penalties against voice AI companies could create contingent financial liabilities for ElevenLabs. | Medium | SI026, SI016 |
| CI033 | ElevenLabs disclosed plans to use Series D proceeds to expand its sales force targeting European and Asian enterprise markets, adding incremental sales payroll cost. | Medium | SI009, SI011, SI012 |
| CI034 | Voice marketplace creator payouts ($2M+ cumulative by early 2026) represent a customer acquisition cost substitute—creators who list voices build traffic and enterprise client interest at low cash cost. | Low | SI022, SI019 |
| CI035 | ElevenLabs' CNBC coverage in February 2026 cited Deutsche Telekom, Revolut, and Square as notable enterprise customers, indicating blue-chip logo penetration that supports contract renewal confidence. | Medium | SI027, SI020 |
| CE001 | ElevenLabs' core product suite in 2025 includes: TTS API, Instant Voice Cloning, Professional Voice Cloning, AI Dubbing, Conversational AI Agents, Voice Library marketplace, ElevenReader, Projects studio, and Sound Effects AI. | High | SE001, SE002, SE006, SE007, SE012, SE013, SE014 |
| CE002 | ElevenLabs' Flash model achieves sub-75ms latency for real-time voice synthesis, while its Multilingual v2 model prioritises quality in 29+ languages at approximately 200ms latency. | High | SE001, SE004 |
| CE003 | ElevenLabs' Instant Voice Cloning creates a voice clone from as little as 3 seconds of audio; Professional Voice Cloning requires 30+ minutes of high-quality audio for superior accuracy. | High | SE002, SE005 |
| CE004 | ElevenLabs supports text-to-speech in 70+ languages and AI dubbing across 32 languages as of mid-2025, with deep per-language model quality rather than breadth-first translation. | High | SE001, SE006, SE016 |
| CE005 | ElevenLabs' Conversational AI platform enables developers to deploy voice agents that handle multi-turn dialogue, interrupt detection, and emotion-aware synthesis through a WebSocket-based real-time API. | High | SE003, SE017 |
| CE006 | ElevenLabs' Voice Library marketplace allows creators to list licensed voices for sale; enterprise customers pay per use and creators receive a revenue share; over $2M paid to creators as of early 2026. | High | SE007, SE018 |
| CE007 | ElevenLabs uses a proprietary flow-matching neural architecture (not transformer-only) for its TTS models, enabling faster synthesis and better prosodic control than earlier attention-based approaches. | Medium | SE010, SE016 |
| CE008 | ElevenLabs offers deployment via REST API, Python/TypeScript SDKs, WebSocket for real-time streaming, Twilio and telephony integrations, Unity SDK, and Unreal Engine plugin. | High | SE001, SE003, SE021 |
| CE009 | ElevenLabs has filed provisional patents related to neural voice synthesis methods but no granted patents have been publicly confirmed in the USPTO database as of mid-2025. | Low | SE010 |
| CE010 | ElevenLabs' trust and safety controls include: consent-based cloning verification, a No-Go Voice list for famous individuals, abuse monitoring, and a reported plan for cryptographic audio provenance metadata. | Medium | SE008, SE009, SE020 |
| CE011 | Consumer Reports' March 2025 assessment found ElevenLabs' voice cloning safeguards were inadequate—often limited to a consent checkbox that could be bypassed—creating ongoing misuse risk. | High | SE009, SE020 |
| CE012 | No major publicly documented API downtime incidents affecting ElevenLabs enterprise customers were identified; the company offers SLA guarantees to enterprise tier customers but specific uptime percentages are not published. | Low | SE019, SE022 |
| CE013 | ElevenLabs' ElevenReader is a mobile and web application allowing users to listen to any document (PDFs, articles, books) in their chosen AI voice; it targets accessibility and content consumption use cases. | Medium | SE013 |
| CE014 | ElevenLabs holds SOC 2 Type II certification and is GDPR-compliant; HIPAA compliance is available under Business Associate Agreements (BAAs) for healthcare enterprise contracts. | Medium | SE008, SE022 |
| CE015 | ElevenLabs' AI Dubbing product translates and re-voices video content in 32 languages while preserving original speaker characteristics, lip-sync timing, and emotional tone. | Medium | SE006 |
| CE016 | ElevenLabs announced development of autonomous AI audio models in early 2026, representing a roadmap shift from tool-layer to agent-layer product strategy. | Medium | SE015 |
| CE017 | ElevenLabs' multilingual model uses a shared architecture across languages rather than separate per-language models, enabling zero-shot cross-language voice transfer for dubbing. | Medium | SE010, SE016 |
| CE018 | Training data provenance is a risk for ElevenLabs: the Adam and Bella default voice lawsuits allege that voice actor audio was used without consent to train commercial products. | High | SE011, SE009 |
| CE019 | ElevenLabs' Projects feature provides a full audiobook and podcast production studio within the browser, enabling chapter-level navigation, multi-voice casting, and batch TTS generation. | Medium | SE014 |
| CE020 | ElevenLabs' speech-to-speech (STS) feature enables real-time voice conversion: a user speaks, and the output is synthesised in the target voice without intermediate text transcription. | Medium | SE001, SE003 |
| CE021 | ElevenLabs' Sound Effects AI, released in 2024, generates sound effects from text descriptions, expanding the product from voice synthesis toward full audio content generation. | Medium | SE012 |
| CE022 | ElevenLabs mobile apps are available for iOS and Android, centred on ElevenReader functionality; full API developer access remains web-only. | Medium | SE013, SE021 |
| CE023 | ElevenLabs relies on major cloud providers (AWS, Google Cloud) for GPU inference infrastructure; NVIDIA is a strategic investor, providing technology partnership access. | Medium | SE025, SE019 |
| CE024 | ElevenLabs' FCC-relevant risk: the FCC's February 2024 declaratory ruling declared AI-generated voice robocalls illegal under TCPA, creating compliance obligations for ElevenLabs' Conversational AI agent customers. | High | SE023, SE008 |
| CE025 | ElevenLabs' conversational AI agent platform competes directly with Bland AI, Retell AI, and Vocode in the voice agent infrastructure space, though ElevenLabs differentiates through superior voice quality rather than orchestration depth. | Low | SE017, SE016 |
| CE026 | ElevenLabs offers a No-Go Voice list that prevents cloning of high-profile public figures by name; however, the enforcement relies on name matching rather than biometric voice fingerprinting. | Medium | SE008, SE020 |
| CE027 | ElevenLabs' Multilingual v2 model achieves state-of-the-art performance on cross-language prosody transfer, enabling dubbing that preserves speaker identity across 32 language pairs. | Medium | SE006, SE004 |
| CE028 | ElevenLabs' platform processes over 1,000 years of audio content generated by users as of early 2026, providing a proprietary usage data flywheel for model improvement. | Medium | SE024, SE016 |
| CE029 | ElevenLabs' enterprise deployment includes dedicated SLA support, custom voice model training, private API endpoints, and compliance package add-ons for regulated industries. | Medium | SE022, SE024 |
| CE030 | ElevenLabs' integration with Twilio and native telephony protocols enables its Conversational AI agents to handle inbound and outbound phone calls at scale, expanding the customer support and sales automation use case. | Medium | SE003, SE017 |
| CE031 | ElevenLabs' Python and TypeScript SDKs on GitHub have accumulated over 5,000 GitHub stars combined, indicating strong developer community adoption and integration momentum. | Medium | SE026, SE027 |
| CE032 | ElevenLabs' API documentation covers TTS, voice cloning, agents, and dubbing with code samples in Python, TypeScript, and cURL; developer onboarding from account creation to first voice generation takes under 5 minutes. | Medium | SE026, SE001 |
| CE033 | ElevenLabs' WebSocket streaming API enables real-time audio generation with first-audio-chunk latency under 75ms for Flash model, critical for interactive voice applications. | Medium | SE003, SE001 |
| CE034 | ElevenLabs' default TTS voices (Adam, Bella) are pre-trained on licensed voice data; the pending litigation challenges whether the licensing was obtained with full consent. | Medium | SE011, SE009 |
| CE035 | ElevenLabs' product expansion from voice API to full audio platform (TTS + dubbing + agents + sound effects + marketplace) is designed to increase revenue per customer and reduce single-product churn risk. | Medium | SE016, SE024 |
| CU001 | ElevenLabs' platform was used by employees at over 60% of Fortune 500 companies by early 2025, up from 41% in 2024. | High | SU001, SU012, SU014 |
| CU002 | ElevenLabs serves over 2 million conversational AI agent deployments as of early 2026, reflecting rapid adoption in enterprise voice automation. | Medium | SU012, SU010 |
| CU003 | ElevenLabs' named media and publishing customers include TIME, The Washington Post, The Atlantic, The New Yorker, HarperCollins, ESPN, Perplexity, and BILD. | High | SU001, SU002, SU004 |
| CU004 | ElevenLabs' named gaming customers include Paradox Interactive, Chess.com, and AMGI Studios, using the platform for NPC voice generation and interactive game character dialogue. | High | SU004, SU006, SU007 |
| CU005 | ElevenLabs' named enterprise SaaS and telecoms customers include Deutsche Telekom, Revolut, Square, Salesforce, and NVIDIA, indicating blue-chip enterprise adoption. | High | SU008, SU002, SU015 |
| CU006 | The Ukrainian government uses ElevenLabs for audio content generation and accessibility services, demonstrating public sector adoption beyond commercial verticals. | Medium | SU001, SU007 |
| CU007 | ElevenLabs users have generated over 1,000 years of audio content on the platform as of early 2026, indicating deep engagement and platform stickiness beyond trial usage. | High | SU012, SU001 |
| CU008 | ElevenLabs' customer base spans five primary verticals: media and publishing, gaming, enterprise technology, education and e-learning, and government/public sector. | Medium | SU007, SU001, SU004 |
| CU009 | ElevenLabs' self-serve free tier functions as a top-of-funnel acquisition channel; enterprise conversion typically begins with departmental API trials that expand to platform-wide agreements. | Medium | SU011, SU007 |
| CU010 | ElevenLabs' enterprise contracts are primarily annual agreements; no publicly disclosed multi-year contract structure exists that would provide multi-year revenue visibility. | Low | SU007, SU009 |
| CU011 | No publicly documented cases of major ElevenLabs enterprise customers departing for competitors or publicly terminating contracts were identified as of mid-2026. | Low | SU018, SU016 |
| CU012 | ElevenLabs' customer concentration is not disclosed; no single named customer represents more than 10% of ARR based on public information, though this cannot be confirmed. | Low | SU009, SU003 |
| CU013 | ElevenLabs has integration partnerships with Twilio (telephony), Salesforce (CRM), and major cloud providers; partner-sourced revenue is not material as a disclosed percentage but growing. | Medium | SU002, SU024 |
| CU014 | Some developers have reported friction with ElevenLabs' credit-based pricing model and rate limits on the self-serve tier, which can create unexpected cost overruns for high-volume API users. | Medium | SU018, SU023 |
| CU015 | ElevenLabs' Voice Library marketplace creator community has grown to thousands of voice creators, creating a customer loyalty mechanism beyond the core API relationship. | Medium | SU004, SU007 |
| CU016 | ElevenLabs' enterprise ARR grew over 200% year-over-year in 2025, outpacing self-serve growth and suggesting improving revenue quality through larger contract sizes. | Medium | SU011, SU013, SU022 |
| CU017 | ElevenLabs' geographic revenue mix is predominantly North American and European; the company cited Asia-Pacific expansion as a major use of Series D proceeds. | Medium | SU025, SU010 |
| CU018 | ElevenLabs does not publish NPS or customer satisfaction metrics; the absence of disclosed retention data is a diligence gap for assessing enterprise relationship quality. | Medium | SU009, SU017 |
| CU019 | ElevenLabs' Chess.com deployment serves interactive chess coaching use cases where AI voices guide players, demonstrating engagement-deepening applications in consumer gaming. | Medium | SU004, SU006 |
| CU020 | The Washington Post and TIME both use ElevenLabs to automatically generate audio versions of written articles, cutting production time from hours to seconds and enabling audio content at scale. | Medium | SU004, SU005 |
| CU021 | ElevenLabs' customer base includes over 1 million registered developers as of 2024, creating a broad funnel from which enterprise deals emerge. | Medium | SU001, SU007 |
| CU022 | Enterprise customers who embed ElevenLabs TTS into their own products (e.g., games, customer service platforms) create a switching cost: all downstream end-user voice quality is dependent on ElevenLabs continuing to provide superior output. | Medium | SU007, SU009 |
| CU023 | ElevenLabs' Conversational AI agent platform saw customer deployments grow from near-zero in early 2024 to over 2 million agents by early 2026, indicating rapid enterprise adoption of the voice-agent paradigm. | Medium | SU012, SU010 |
| CU024 | ElevenLabs' FCC-regulated customer deployments (AI phone agents) must comply with the 2024 TCPA ruling on AI voice calls; non-compliance by customers creates reputational and platform risk for ElevenLabs. | Medium | SU019, SU018 |
| CU025 | ElevenLabs' self-serve to enterprise conversion motion mirrors Twilio's developer-led sales model, where individual developer adoption within a company eventually triggers an enterprise-level agreement. | Low | SU011, SU002 |
| CU026 | G2 and Trustpilot reviews of ElevenLabs in 2025 average 4.4-4.7 stars out of 5, with users citing voice quality and ease of API integration as primary strengths. | Medium | SU026, SU027 |
| CU027 | Capterra reviewers of ElevenLabs highlight the product's ease of use for non-technical enterprise users and the breadth of voice library as key differentiators. | Medium | SU028 |
| CU028 | TechRadar's 2025 review rated ElevenLabs as the best AI text-to-speech tool tested, citing MOS scores, voice cloning quality, and developer API documentation. | Medium | SU030, SU020 |
| CU029 | Forbes' 2024 feature on ElevenLabs highlighted TIME and The Washington Post as anchor customers, providing primary-tier editorial validation of the enterprise customer proof. | Medium | SU031, SU003 |
| CU030 | ElevenLabs' Product Hunt launch received strong developer community reception, generating high upvotes and positive feedback on API quality and pricing access. | Low | SU029 |
| CU031 | ElevenLabs targets enterprise buyers with dedicated customer success managers, implementation support, and SLA packages; smaller customers rely on self-serve documentation. | Low | SU032, SU002 |
| CU032 | Wired's 2024 coverage noted that ElevenLabs' voice cloning technology had become accessible enough for individual creators and bad actors alike, raising dual-use concerns relevant to enterprise customer trust. | Medium | SU033, SU016 |
| CU033 | ElevenLabs' AI Business coverage in 2025 highlighted the company's deliberate vertical focus on media, gaming, and enterprise telephony as customer acquisition strategy. | Medium | SU032, SU007 |
| CU034 | Enterprise customers integrating ElevenLabs into production workflows (audiobooks, call centres, games) face voice quality degradation if they switch providers, as end-user expectations are calibrated to ElevenLabs' benchmark. | Low | SU028, SU030 |
| CU035 | ElevenLabs' Voice Library has over 5,000 creator-contributed voices available to enterprise customers, creating a diversity of voice options that no single enterprise could source from a traditional voice agency at comparable cost. | Medium | SU004, SU015 |
| CR001 | The FCC ruled in January 2024 (DA-24-146) that AI-generated voices in robocalls are illegal under the TCPA, directly implicating any ElevenLabs customer using its API for automated outbound telephony without prior written consumer consent. | High | SR001, SR002 |
| CR002 | EU AI Act Article 52 requires providers of AI systems that generate synthetic voice output to clearly label such output as AI-generated; failure exposes ElevenLabs' European customers to fines, creating compliance overhead that could slow enterprise adoption in the EU. | Medium | SR003 |
| CR003 | The European Data Protection Board has indicated that voice prints created during voice cloning qualify as biometric data under GDPR Article 9, requiring explicit consent and triggering stricter processing requirements that ElevenLabs must comply with for EU users. | Medium | SR004, SR003 |
| CR004 | Multiple voice actors filed or threatened lawsuits against AI voice companies in 2024, alleging unauthorized training on voice samples; while ElevenLabs has not been named in major filed litigation as of early 2026, it operates in a legal environment where such exposure is actively materializing for the sector. | Medium | SR005, SR006 |
| CR005 | The FTC launched a Voice Cloning Challenge in January 2024, signalling regulatory intent to combat AI voice fraud; this implies potential future enforcement actions or mandatory safety standards that could require API modifications at ElevenLabs. | Medium | SR007, SR001 |
| CR006 | ElevenLabs operates entirely on cloud infrastructure (primarily AWS), making it vulnerable to cloud vendor price increases, unplanned outages, and terms-of-service changes that could disrupt real-time Conversational AI products with strict latency SLAs. | Medium | SR008, SR009 |
| CR007 | OpenAI's launch of GPT-4o in May 2024 with native voice capabilities, including low-latency emotional speech synthesis, represents direct competition for ElevenLabs' core TTS and Conversational AI products from a company with a far larger customer base and distribution. | High | SR010, SR011 |
| CR008 | Open-source TTS models such as Coqui XTTS and StyleTTS2 achieved near-commercial-grade voice quality benchmarks by 2024, enabling self-hosted alternatives to ElevenLabs for price-sensitive or on-premise-requiring enterprise customers. | Medium | SR012, SR013 |
| CR009 | ElevenLabs technology has been documented in use by bad actors for political deepfake robocalls (including a Biden voice clone in January 2024 New Hampshire primary) and phone scams, creating reputational and regulatory risk for the company regardless of its intent. | High | SR014, SR015, SR021 |
| CR010 | SAG-AFTRA negotiated AI voice provisions in studio contracts through 2024 and is actively pursuing protections that could restrict how entertainment companies use synthetic voice platforms like ElevenLabs, adding compliance complexity for ElevenLabs' media vertical customers. | Medium | SR016, SR006 |
| CR011 | ElevenLabs raised $500M in February 2026 at $11B valuation, bringing its estimated cash position to approximately $550M with a total raised of ~$781M; assuming a $20-30M monthly operating cost, this implies 18-24 months of runway before the next capital event. | Medium | SR017, SR018 |
| CR012 | ElevenLabs' gross margins are estimated at 65-75% at its current scale, below SaaS benchmarks of 75-80%+, primarily due to GPU inference costs for real-time and low-latency voice generation workloads. | Medium | SR017, SR018 |
| CR013 | ElevenLabs is substantially key-person dependent on CEO Mati Staniszewski and CTO Piotr Dąbkowski, both co-founders who jointly own the technical vision and investor relationships; no disclosed successor plan or executive redundancy has been publicized. | Medium | SR018, SR030 |
| CR014 | ElevenLabs removed its 'Adam' default voice in 2023 following reports it was being misused to clone celebrity voices without consent, demonstrating both the reputational risk of default voice assets and the company's reactive (rather than proactive) content policy enforcement. | High | SR027, SR024 |
| CR015 | ElevenLabs has deployed content safety measures including abuse detection, usage monitoring, voice verification requirements for high-risk use cases, and AI SynthID-compatible watermarking on generated audio as mitigation against deepfake misuse. | Medium | SR019, SR029 |
| CR016 | MIT Technology Review (2024) reported that AI-generated voice detection tools achieve less than 70% accuracy in blind tests, meaning ElevenLabs' watermarking may be insufficient as a complete compliance or reputational defence against deepfake misuse. | Medium | SR020 |
| CR017 | ElevenLabs' Poland and Warsaw engineering hub and support for Ukrainian government use cases creates geopolitical exposure to Eastern European conflict; a significant escalation could disrupt engineering capacity or create sanctions-related compliance issues. | Low | SR018, SR030 |
| CR018 | No public record of a significant data breach or security incident at ElevenLabs had been disclosed as of early 2026, though the API-first architecture and storage of voice print biometric data represent a high-value target for malicious actors. | Medium | SR022, SR019 |
| CR019 | OpenAI TTS via the API was priced at $15 per million characters in 2025, approximately 50% cheaper than ElevenLabs' base pricing; Google and Microsoft Azure TTS are even lower at $4-8 per million characters, creating ongoing price compression pressure. | Medium | SR011, SR010 |
| CR020 | AI voice synthesis quality benchmarks (MOS scores) have been converging rapidly since 2022; multiple vendors including Microsoft and Google reached statistically equivalent MOS to human voice by 2024, suggesting the product category is commoditizing faster than ElevenLabs' pricing implies. | Medium | SR013, SR012 |
| CR021 | US Congress introduced at least three AI voice-related legislative proposals in 2024 following the Biden robocall deepfake incident; if enacted, new federal disclosure requirements could mandate costly API changes for ElevenLabs and its customers. | Medium | SR021, SR007 |
| CR022 | ElevenLabs' terms of service explicitly prohibit impersonation and non-consensual voice cloning, but enforcement relies on user reporting and post-hoc detection rather than technical prevention, meaning reputational liability exists even for policy-compliant operations. | Medium | SR022, SR020 |
| CR023 | EFF analysis in 2024 concluded that current US right-of-publicity laws are fragmented across 35 states and do not uniformly cover AI voice cloning, leaving ElevenLabs exposed to varying state-level litigation risk with no federal safe harbour. | Medium | SR023, SR005 |
| CR024 | Bloomberg reported that ElevenLabs faced significant backlash in late 2023 when users weaponized its default voices to create non-consensual celebrity audio content, forcing the company to update its abuse policies and invest in moderating the Voice Library. | High | SR024, SR027 |
| CR025 | Professional voice actor communities have organized boycotts and public awareness campaigns against ElevenLabs, generating adverse media coverage and creating a sector of adverse stakeholders who may support restrictive regulation. | Medium | SR025, SR006 |
| CR026 | ElevenLabs relies on third-party CDN and edge delivery infrastructure including AWS CloudFront and Cloudflare to achieve its sub-75ms latency promise; any CDN pricing change or capacity constraint could affect enterprise SLA compliance. | Low | SR026, SR008 |
| CR027 | Gartner's 2025 Market Guide for AI Voice Technologies flagged vendor lock-in risk and compliance uncertainty as primary reasons enterprises are delaying large-scale voice AI deployments, which could limit ElevenLabs' enterprise land-and-expand velocity. | Medium | SR028, SR018 |
| CR028 | ElevenLabs CEO Mati Staniszewski publicly committed in early 2025 to proactive regulatory engagement and a compliance roadmap for enterprise customers including GDPR-compliant data residency options and FCC-compliant API guardrails. | Medium | SR030, SR029 |
| CR029 | If OpenAI, Google, or Microsoft were to bundle high-quality TTS as a free feature within their AI developer platforms (similar to how Google Maps APIs have been threatened by BigTech bundling), ElevenLabs' self-serve pricing tier would face immediate revenue pressure. | Medium | SR010, SR011 |
| CR030 | Revenue concentration is a material risk for ElevenLabs: an estimated 20-30% of ARR is attributable to a small number of large enterprise contracts; loss of two or three anchor customers could trigger significant ARR decline and investor concern. | Low | SR017, SR018 |
| CR031 | ElevenLabs does not yet offer an on-premise or private-cloud deployment option, meaning enterprise customers with strict data sovereignty requirements (government, healthcare, finance) face a compliance barrier to adoption that limits ElevenLabs' total addressable enterprise market. | Medium | SR022, SR028 |
| CR032 | ElevenLabs' real-time Conversational AI product requires ultra-low latency at scale; a sustained AWS infrastructure incident affecting GPU availability could interrupt live deployments and cause enterprise customers to incur SLA penalties or seek alternative providers. | Medium | SR008, SR026 |
| CR033 | The thesis-break trigger for ElevenLabs is a combination of: (1) a major BigTech player (OpenAI, Google) bundling equivalent voice quality at near-zero marginal cost, AND (2) regulatory enforcement that materially restricts the US enterprise telephony market, AND (3) a significant data breach or deepfake incident that triggers customer churn above 20% NRR decline. | Medium | SR018, SR030 |
| CR034 | ElevenLabs has published monitoring indicators for its trust-and-safety program including abuse reports per million API calls, takedown response time under 24 hours, and annual third-party audits of moderation systems, providing some measurable accountability. | Medium | SR019, SR029 |
| CR035 | ElevenLabs' IP portfolio as of early 2026 consisted primarily of trade secrets and proprietary training datasets rather than an extensive patent portfolio; this limits its ability to use IP litigation as a defensive moat against well-resourced BigTech competitors. | Low | SR018, SR023 |
| CR036 | ElevenLabs' net revenue retention is estimated by analysts at 130-150%, driven by strong expansion within existing accounts; however, this estimate is unverified and would materially weaken the financial risk profile if retention fell below 110%. | Low | SR017, SR018 |
| CR037 | ElevenLabs partners with NVIDIA for GPU inference acceleration; any disruption to the NVIDIA supply chain or export control restrictions on GPU hardware could increase ElevenLabs' inference costs or limit its capacity expansion plans. | Low | SR008, SR028 |
| CR038 | ElevenLabs' self-serve business is exposed to credit card fraud and account abuse at scale; without enterprise contract protections, API misuse by free or low-cost subscribers can generate GPU costs that exceed subscription revenue, compressing unit economics. | Medium | SR017, SR022 |
| CR039 | Rising political scrutiny of AI voice in the US (multiple Congressional hearings in 2024-2025) increases the probability of federal legislation imposing mandatory disclosures or consent requirements on AI voice APIs within ElevenLabs' 3-5 year investment horizon. | Medium | SR021, SR007, SR001 |
| CR040 | ElevenLabs has engaged external legal counsel and published a trust-and-safety whitepaper addressing deepfake concerns, signalling awareness of legal risk and a proactive strategy to pre-empt regulatory action, though the adequacy of these measures has not been independently audited. | Medium | SR029, SR030 |
| CV001 | ElevenLabs' $11B valuation in February 2026 represents a revenue multiple of approximately 33× its $330M ARR, which is at the lower end of the 30-80× range commanded by high-growth AI infrastructure companies in 2025-2026 private rounds. | High | SV001, SV003 |
| CV002 | Private AI companies with >150% YoY ARR growth in 2025-2026 commanded 40-90× ARR multiples on average; Perplexity raised at ~90× ARR, Anthropic at implied 50-60× ARR, and Mistral at 40-50× ARR, making ElevenLabs' 33× relatively modest for its growth profile. | Medium | SV003, SV002 |
| CV003 | SoundHound AI (SOUN), the most directly comparable public voice AI company, traded at 10-20× NTM revenue in 2024-2025 with significantly lower ARR growth than ElevenLabs, suggesting ElevenLabs' premium multiple is predicated on a growth-rate and quality-differentiation premium that must be sustained. | Medium | SV005, SV006 |
| CV004 | Microsoft's 2022 acquisition of Nuance Communications at $19.7B for approximately $1.6B ARR implies a 12× exit multiple; for ElevenLabs to achieve comparable M&A economics, its ARR would need to reach $900M-$1.6B before a strategic buyer would justify the current $11B valuation. | Medium | SV007, SV002 |
| CV005 | ElevenLabs' Series D was led by Sequoia Capital with participation from a16z, Thrive Capital, T. Rowe Price, Khosla, Lux, Index, Redpoint, Bond, and others; the breadth and reputation of the investor syndicate serves as implicit price validation from the highest-conviction AI investors globally. | High | SV008, SV009, SV027 |
| CV006 | ElevenLabs' total raised is approximately $781M including the $500M Series D; assuming a standard 1× non-participating liquidation preference, the preference overhang is approximately $781M, meaning an exit below $781M would return zero to common equity holders. | Medium | SV004, SV008 |
| CV007 | ElevenLabs' valuation has stepped up approximately 12-15× from its estimated early 2023 valuation, representing a 2-year implied CAGR of ~250%; early seed investors (NEA, Andreessen Horowitz) are sitting on roughly 100-200× paper returns at the current $11B valuation. | Medium | SV004, SV017 |
| CV008 | ElevenLabs' Rule of 40 score is estimated at approximately 170-200 (175% ARR growth + estimated -5% to +25% FCF margin), placing it in the top 1% of SaaS companies globally and warranting a premium multiple versus the SaaS market median. | Medium | SV011, SV023 |
| CV009 | ElevenLabs' ARR per employee of approximately $825K is above the SaaS upper quartile benchmark of $400-600K per employee, indicating exceptional capital efficiency and an asset-light model that argues for a higher revenue multiple than less efficient AI peers. | Medium | SV011, SV012 |
| CV010 | The bull case for ElevenLabs assumes ARR grows to $900M-$1.1B by end-2027 (roughly 80% YoY deceleration from 175%), driven by enterprise expansion in telephony, media dubbing, and Conversational AI, enabling an IPO at 15-20× ARR in 2028 at $13.5B-$22B valuation. | Low | SV023, SV009 |
| CV011 | The base case for ElevenLabs assumes ARR grows to $600M-$750M by end-2027 (roughly 60-70% YoY deceleration), with margins improving to 70-72% gross, enabling an IPO or acquisition in 2028-2029 at 12-15× ARR, implying a $7.2B-$11.25B exit range — flat to modest upside from current $11B valuation for late-entry investors. | Medium | SV023, SV024 |
| CV012 | The bear case assumes BigTech commoditizes TTS pricing within 24 months, ARR growth decelerates to 40-50% YoY, margins remain compressed at 60-65%, and ElevenLabs must raise a flat or down round at $8-9B valuation, generating minimal returns for Series D investors. | Low | SV024, SV002 |
| CV013 | Cartesia AI's $80M raise (2025) at an estimated $400-600M valuation with sub-$10M ARR implies an even higher ARR multiple than ElevenLabs, suggesting the voice AI sector is attracting premium capital despite the competitive risks; this is a double-edged indicator (sector excitement but also irrational pricing). | Low | SV019, SV002 |
| CV014 | Deepgram's Series C funding (2024) at an estimated valuation of $700M-$1B with approximately $50-80M ARR implies a 10-15× ARR multiple for an enterprise voice API company at lower growth rates, providing a lower-bound comparable to ElevenLabs' premium positioning. | Low | SV020, SV002 |
| CV015 | Grand View Research forecasts the text-to-speech market to reach approximately $50B TAM by 2030 at a 23% CAGR; at its current ARR of $330M, ElevenLabs holds less than 1% of its estimated 2025 TAM, implying substantial market share expansion headroom that supports a premium valuation. | Medium | SV015, SV016 |
| CV016 | ElevenLabs' net revenue retention is estimated by independent analysts at 130-150%, which is consistent with top-quartile SaaS benchmarks and implies ARR doubling every 2-3 years from existing customers alone at stable NRR, before new customer acquisition. | Low | SV023, SV011 |
| CV017 | A new investor entering at $11B in the Series D (February 2026) requires ElevenLabs to achieve an exit valuation of $22-27.5B (2-2.5× entry) within 4-5 years to match a 15-18% IRR threshold; this requires either an IPO at 15× $1.5B+ ARR or a strategic acquisition above $22B. | Medium | SV024, SV012 |
| CV018 | The investment thesis rests on four pillars: (1) voice AI is a large, fast-growing market; (2) ElevenLabs has a durable quality and ecosystem moat (Voice Library, multilingual, conversational AI platform); (3) 175% ARR growth at $330M base is exceptional; (4) the leading investor syndicate has underwritten the business at $11B. | Medium | SV009, SV010, SV021, SV022 |
| CV019 | The investment anti-thesis is: (1) BigTech (OpenAI/Google/Microsoft) bundles equivalent TTS quality at near-zero cost within 24 months; (2) regulatory restrictions (FCC/FTC/EU AI Act) materially constrain the enterprise telephony use case that drives enterprise ARR; (3) quality commoditization reduces ElevenLabs' pricing power and NRR. | Medium | SV024, SV030 |
| CV020 | ElevenLabs is likely 2-3 years from IPO readiness as of early 2026, needing to reach approximately $600-800M ARR, establish audited financials and a CFO-led finance function, and demonstrate sustained gross margin improvement before an S-1 filing would attract institutional public market demand. | Medium | SV013, SV028 |
| CV021 | The T. Rowe Price co-investment in ElevenLabs' Series D signals proximity to public market events: T. Rowe Price systematically participates in late-stage rounds for companies within 2-4 years of IPO, and its presence is a meaningful signal of ElevenLabs' public market readiness trajectory. | Medium | SV027, SV009 |
| CV022 | Bessemer's State of the Cloud 2025 showed that top-quartile SaaS companies growing >100% YoY traded at 20-30× NTM revenue in the public markets; ElevenLabs' 33× private round multiple is only modestly above this range, suggesting the premium is partially justified by its exceptional growth rate. | Medium | SV011, SV012 |
| CV023 | Meritech's public cloud index showed that AI-category SaaS companies with >100% NTM growth commanded a median of 25× NTM revenue in late 2025, with top performers at 40×; ElevenLabs' 33× private round is within this publicly-grounded range. | Medium | SV012, SV026 |
| CV024 | Sequoia's Series D thesis for ElevenLabs frames the company as the 'voice layer of the AI stack' — analogous to how Twilio became the communications layer of the internet — implying a platform multiple rather than a single-product SaaS multiple. | Medium | SV009, SV022 |
| CV025 | Index Ventures' early-stage investment thesis for ElevenLabs emphasized the compounding effects of the Voice Library (network effect from creator-contributed voices) and the multilingual moat; if these network effects sustain, they justify a multiple above pure revenue-growth comparable sets. | Medium | SV018, SV021 |
| CV026 | If ElevenLabs remains private beyond 2028 and the IPO window closes (e.g., due to market correction or sustained AI regulatory headwinds), early investors face a 2-3 year delay on distributions, which would reduce IRRs by 300-500 basis points versus a 2028 exit scenario. | Low | SV014, SV024 |
| CV027 | Key diligence asks before a new investor commits at $11B: (1) audited 2025 financials and unit economics by cohort; (2) NRR by customer segment (enterprise vs. self-serve); (3) independent security and GDPR compliance audit; (4) CPU/GPU infrastructure cost breakdown by product; (5) customer concentration > 5% ARR list. | Medium | SV023, SV024 |
| CV028 | ElevenLabs' thesis-break triggers include: (1) OpenAI or Google announcing a free-tier TTS product matching >80% of ElevenLabs' MOS quality; (2) a federal TCPA enforcement action against an ElevenLabs customer generating material headline risk; (3) NRR declining below 110% for two consecutive quarters. | Medium | SV024, SV030 |
| CV029 | ElevenLabs' gross margin improvement from an estimated 65% today to 75%+ (required for a public-market SaaS premium multiple) hinges on a combination of GPU hardware cost declines (Moore's Law trajectory) and increasing software/enterprise mix reducing per-unit compute intensity. | Medium | SV023, SV011 |
| CV030 | ElevenLabs' estimated probability-weighted expected return for a Series D investor is approximately 1.8-2.2× invested capital over 5 years in a base+bull blended scenario, assuming 60% base case probability at 1.5× and 30% bull case at 3×, with a 10% bear case at 0.5×. | Low | SV024, SV023 |
| CV031 | ElevenLabs' Series D at $11B implies the company must achieve approximately $700M-$1B ARR by 2028 to trade at a reasonable 11-15× public market multiple at IPO; at its current 175% growth rate decelerating to 80-100%, this target is achievable but not guaranteed. | Medium | SV023, SV012 |
| CV032 | Thrive Capital's participation in the Series D, combined with its track record of investments in Stripe, GitHub, and OpenAI, is a qualitative endorsement of ElevenLabs' platform potential and signals conviction that the company can sustain its growth trajectory. | Low | SV029, SV009 |
| CV033 | TechCrunch coverage of the ElevenLabs Series D noted that the round implies 25-30% ARR multiple premium over comparable AI infrastructure companies at the time, reflecting the market's willingness to pay a category-leader premium for ElevenLabs' dominant voice AI position. | Medium | SV017, SV003 |
| CV034 | ElevenLabs' valuation recommendation is conditional: at $11B and $330M ARR with 175% growth, the entry is justifiable for investors with a 5-7 year horizon and high risk tolerance if — and only if — the enterprise telephony and media dubbing verticals continue to scale without regulatory disruption. | Medium | SV024, SV009 |
| CV035 | ElevenLabs' confidence in its valuation is reinforced by public institutional backing (T. Rowe Price, Goldman Sachs asset management equivalent), which typically requires audited financials and board governance reforms compatible with eventual S-1 preparation. | Medium | SV027, SV013 |
| CV036 | Yahoo Finance and Morningstar comparable data for AI voice and speech analytics public companies shows a wide EV/NTM revenue range of 3-25× depending on growth rate, with category leaders (SoundHound at high-growth periods) touching 20-25×; ElevenLabs' private premium is justified only if it sustains >100% growth through IPO. | Medium | SV026, SV005 |
| CV037 | ElevenLabs' Series D preference stack includes a $500M Series D tranche at $11B valuation (implied ~$27.50/share equivalent); in a flat or moderate down round scenario, the Series D investors have first-call on proceeds, protecting them while earlier-round commons may be diluted. | Low | SV006, SV004 |
| CV038 | BVP's State of the Cloud 2025 confirms that the Rule of 40 premium is most pronounced for companies above 150% growth: each 10-point improvement in Rule of 40 above 100 yields approximately 2-4× additional EV/NTM multiple in the public markets — directly supporting ElevenLabs' 33× private valuation. | Medium | SV011, SV022 |
| CV039 | ElevenLabs' recommendation stance is: Watch / Conditional Buy — strong growth profile and best-in-class investor syndicate justify the valuation at current growth rates, but the risk-adjusted return for new investors at $11B is compressed, with limited upside relative to downside unless the voice AI platform thesis materialises. | Medium | SV024, SV023 |
| CV040 | ElevenLabs' Series D in February 2026 represents a major fundraising milestone validating the $11B valuation; however, the 12-18 month window following a large funding event is typically when growth-stage companies face the most execution pressure, making 2026-2027 the critical validation period for investors. | Medium | SV030, SV013 |