Diligence report AI language learning / consumer subscription edtech / enterprise language training Series C private / growth-stage edtech unicorn 2026-05-05

Speak

Fast-growing AI speaking tutor with real demand signals, but still too opaque for conviction pricing at the last unicorn mark

Speak has credible AI-language-learning traction and real unicorn momentum, but public evidence still undersupports the $1B mark.

Cover facts

Valuation 01

1000 USD M

Total Raised 02

162 USD M

Enterprise customers 03

200 organizations+

Downloads 04

15 M+

Recommendation 05

research-more

Company profile

Speak is a private AI language-learning company building a voice-first tutor for spoken fluency across consumer and employer use cases. The company pairs a differentiated speaking-focused curriculum with generative AI and speech technologies, giving it stronger product proof than many education apps. Public evidence shows meaningful global distribution and investor support, but still leaves too many financial, retention, and cap-table questions unanswered for a high-conviction price call.

Website: www.speak.com
Founded: 2016-01-01
Founders: Connor Zwick, Andrew Hsu
Founding location: San Francisco, California, USA
Headquarters: San Francisco, California, USA
Product: Speak delivers AI-guided speaking practice, pronunciation feedback, structured lessons, live roleplays, and multilingual tutoring through mobile apps and an enterprise training product.
Customers: Consumers learning spoken English and other languages, plus employers using English upskilling and workplace language practice.
Business model: Consumer subscriptions with in-app pricing plus enterprise seat-based or contract language-training offerings.
Stage: Series C private / growth-stage edtech unicorn
Funding status: Raised $78M Series C in December 2024 at a $1.0B valuation and roughly $162M cumulative funding, after a June 2024 B-extension at a $500M valuation.

Executive summary

Top strengths

Product positioning is differentiated around spoken fluency rather than passive language study.
Consumer traction is real, with strong app-store ratings, millions of downloads, and visible international reach.
The company proved repeat investor demand in 2024, doubling valuation from $500M to $1.0B within six months.
Speak for Business creates a second revenue surface beyond consumer subscriptions.
Engineering and product execution appear strong enough to ship advanced AI tutor and voice-agent features quickly.

Top risks

Public disclosure is too thin on ARR, margins, retention, and payback to defend the current valuation with confidence.
Competitive pressure from Duolingo, ELSA, Babbel, OpenAI-enabled entrants, and low-cost AI tutors can compress pricing power.
Consumer billing friction, refund complaints, and pronunciation-quality criticism show product-risk edges beneath the growth story.
Dependence on third-party model and app-store ecosystems can weaken moat durability and economics.
Regulatory and privacy exposure rises because the product processes voice data across many geographies and may touch younger learners.

Open gaps

Current ARR, gross margin, consumer conversion funnels, and enterprise revenue mix.
Renewal, churn, NRR, and concentration for Speak for Business.
Full cap-table terms, preferences, secondary rights, and investor protections at the Series C mark.
Current headcount by function, current board composition, and deeper governance controls.
Segment-level cohort retention and whether premium pricing is durable outside early adopter markets.

Chapter 01

01Company Overview

1.1 Identity, product scope, and business model

Speak is the consumer-facing brand of Speakeasy Labs, a San Francisco-headquartered startup focused on spoken-language fluency rather than textbook-style memorization. Across its homepage, app-store listings, and late-2024 fundraising materials, the company consistently describes itself as an AI language tutor that pushes learners to speak out loud, receive instant feedback, and build conversational confidence. That positioning matters because it is narrower than general language-learning incumbents: Speak competes first on speaking efficacy, not on broad gamified content breadth. By May 2026, Speak’s public consumer surfaces show a broader product than the English-only wedge that powered its early growth. The homepage and Apple listing advertise six live learning tracks (French, Spanish, English, Korean, Italian, and Japanese), while the June 2025 product post states that Spanish launched first for English speakers and four more languages followed shortly after. This evolution supports a hybrid model: consumer subscriptions remain the core monetization engine, but the same voice-first tutoring stack now also supports a business offering for employers. TechCrunch’s December 2024 reporting gives the clearest public list-price anchor for the consumer product at $20 per month or $99 per year. The public disclosure profile is still that of a private growth startup. Speak says enough to establish category, product strategy, and top-line traction, but not enough to validate unit economics, current ARR, or the exact customer mix between consumer and enterprise.[CO001, CO002, CO003, CO004, CO037]

1.2 Founders, leadership, and organizational footprint

The public leadership picture is founder-led and still highly concentrated. Late-2024 fundraising coverage consistently names Connor Zwick as CEO and Andrew Hsu as CTO/co-founder, with Zwick serving as the primary external spokesperson in both official company posts and independent press. That concentration is positive for speed and product coherence, but it also creates clear key-person dependence because the company’s narrative, investor messaging, and product vision all run heavily through Zwick. Speak’s operating footprint is global for a company of its size. TechCrunch reported a 75-person workforce across San Francisco, Seoul, Tokyo, and Ljubljana in June 2024, while the current careers page still highlights Seoul, Ljubljana, and San Francisco and presents the company as internationally distributed. The careers page is useful but internally stale: it also includes an older historical snapshot referring to a 60-person team and more than $60 million raised, showing that the page was not fully updated after the late-2024 financing jump. That inconsistency is a small but real diligence signal: even company-controlled surfaces should be treated as time-stamped claims, not timeless facts. Governance disclosure is limited. The best publicly supported board datapoint is that Accel partner Ben Quazzo joined the board at Series C. Beyond that, public materials identify related persons in the Form D filing but do not disclose a full current board roster, observer rights, or control provisions.[CO002, CO014, CO015, CO016, CO024, CO029]

Leadership and founder table
Person	Role	Background / public signal	Founder-market fit or functional coverage	Key-person dependency
Connor Zwick	CEO, co-founder	Primary external spokesperson across official posts and TechCrunch fundraising coverage	Owns product vision, fundraising narrative, and category positioning	High
Andrew Hsu	CTO, co-founder	Named in fundraising coverage and co-credited for technical direction	Owns speech / AI stack and technical credibility	High
Ben Quazzo	Accel partner, board member	Joined board at Series C according to official and TechCrunch coverage	Adds late-stage venture governance and recruiting leverage	Medium
Colton Gyulay	Related person in Form D	Listed in Form D related persons set	Suggests finance or legal leadership role, but public scope unclear	Low
Alex Berkenkamp	Related person in Form D	Listed in Form D related persons set	Suggests operating leadership, but public scope unclear	Low

Only roles directly observable in reviewed public sources are included; broader executive bench is not fully disclosed.

[CO024, CO029, CO030]

1.3 Funding history, valuation step-up, and chronology of record

Speak’s public funding story accelerated sharply in 2024. The company announced a $20 million Series B-3 in June 2024 at a $500 million valuation, then followed six months later with a $78 million Series C at a $1 billion valuation. The Series C post says total capital reached $162 million and frames the financing as the second preempted round of the year. Independent coverage from TechCrunch, Tech Funding News, and HolonIQ corroborates the valuation jump strongly enough to treat the unicorn milestone as well supported. The securities-filing trail is directionally consistent with the press narrative. A Form D filed on December 11, 2024 lists a $77.7 million equity offering under Rule 506(b), with the first sale dated November 13, 2024. The filing list also shows additional Form D events on August 12, 2024 and October 17, 2023, which likely map to intermediate financing activity around the B-extension period and prior private issuance activity. Those filings do not reveal ownership terms, preferences, or all governance rights, but they do add useful primary-document corroboration that real capital moved on the timeline the company describes. HolonIQ’s EdTech unicorn ledger then recorded Speak as joining the global EdTech unicorn list in December 2024 at a $1 billion valuation. That makes the Series C not just a financing milestone but a category-status milestone with real signaling value for future fundraising, recruiting, and enterprise sales.[CO008, CO009, CO010, CO011, CO012, CO023]

Stakeholder or investor map
Stakeholder	Role	Control or economic importance	Diligence ask
Accel	Lead Series C investor; board seat via Ben Quazzo	Anchors unicorn round and likely holds meaningful governance rights from Dec 2024 onward	Request board documents, pro rata rights, and ownership percentage.
OpenAI Startup Fund	Repeat investor	Strategic AI ecosystem signal and product-validation partner	Clarify any model-access, branding, or exclusivity arrangements.
Khosla Ventures	Repeat investor	Long-term financial backer across multiple rounds	Confirm ownership, preferences, and follow-on intentions.
Y Combinator / Paul Graham network	Early investor and signal amplifier	High reputational value and founder-network support	Clarify whether YC rights differ from standard seed investors.
Buckley Ventures	Led June 2024 Series B-3	Key bridge investor before the Series C step-up	Review whether the B-3 introduced special terms before Accel led Series C.
Employers using Speak for Business	Commercial stakeholders rather than cap-table owners	Potentially meaningful design partners if the 200+ customer claim is real	Request top-10 customers by seats, geography, and renewal status.

Public sources identify major financing participants and the enterprise customer class, but not the full capitalization table or ownership splits.

[CO008, CO009, CO011, CO017, CO018, CO023]

Milestone table
Date	Event	Type	Amount / valuation / status	Participants	Implication
2016	Speak founded / formal company creation enters public chronology	founding		Connor Zwick; Andrew Hsu	Anchors company identity used by official and third-party profiles.
2019	English learning launched first in Korea, beginning Asian market wedge	product	Initial market launch	Speak	Established the Korea-first distribution strategy later referenced in official posts.
2023-10	Public filing trail shows a Form D dated 2023-10-17	financing	Form D event	Speakeasy Labs	Earliest reviewed filing evidence of private capital activity in this run.
2024-06-18	Series B-3 announced	financing	$20M at $500M valuation	Buckley Ventures; OpenAI Startup Fund; Khosla; Paul Graham; Jeff Weiner	Valuation doubled in less than a year and set up the unicorn jump.
2024-08-12	Additional Form D appears on public filing list	financing	Form D event	Speakeasy Labs	Suggests further private issuance activity before Series C closing.
2024-12-10	Series C announced; Speak for Business highlighted; Ben Quazzo joins board	financing / governance	$78M at $1B valuation	Accel; OpenAI Startup Fund; Khosla; YC	Unicorn milestone and governance step-up with board formalization.
2024-12	HolonIQ adds Speak to the EdTech unicorn list	scale	$1B valuation status	HolonIQ	External category validation beyond company PR.
2025-06-17	Four new languages for English speakers launch and learner count crosses 15M+	product / scale	15M+ learners claimed	Speak	Confirms multilingual expansion beyond the original English wedge.
2026-02-23	Android Police publishes a critical review on pronunciation leniency	adverse	Independent adverse product review	Android Police	Shows scaled visibility but also real quality risk around over-permissive feedback.

Founding year is standardized to 2016 despite one conflicting TechCrunch line describing a 2014 launch; the conflict is preserved in evidence gaps rather than left unresolved in the chronology of record.

[CO001, CO008, CO009, CO012, CO021, CO023]

FO001: Strategic inflection timeline

Public chronology of Speak’s identity, capital formation, product expansion, and external validation from 2016 through February 2026.

[CO008, CO009]

FO003: Investability scorecard

Compact view of Speak’s public maturity, traction, and diligence risk as of May 2026.

[CO012, CO032]

1.4 Traction signals, enterprise expansion, and risk flags

Speak has enough public traction data to support “scaled private company,” but not enough to underwrite operating performance. On the consumer side, official surfaces claim 15M+ downloads and a 4.8 rating, while app stores independently confirm strong rating depth (44K ratings on Apple; 112K reviews on Google Play). Earlier 2024 fundraising materials said Speak had more than 10 million learners in 40+ countries and that users had already spoken more than one billion sentences that year. Those are all company-centered metrics, but they are at least consistent with the product’s visible distribution footprint. The most interesting non-consumer signal is the enterprise product. Speak for Business appears to have launched in earnest between the June and December 2024 rounds, and the Series C announcement claims more than 200 customers and an 85% employee adoption rate. The current B2B page similarly says 200+ brands rely on Speak for Business. That suggests a real second act beyond subscriptions, although public disclosures still do not break out B2B revenue, contract size, or retention. Risk flags remain mostly product-quality and disclosure-related rather than legal or financial. Independent reviewers praised the polish and speaking-first design, but adverse reviews also point to lenient pronunciation scoring, confusing premium packaging, refunds and auto-renew complaints, and incomplete disclosure on current headcount and board composition. Those issues do not invalidate the growth narrative, but they do make the next layer of diligence operational rather than purely promotional.[CO005, CO006, CO007, CO013, CO017, CO018]

Snapshot KPI table
Metric	Value / status	Date	Confidence	Gap / diligence ask
Latest valuation	$1.0B (Series C)	2024-12	high	Supported by official announcement, TechCrunch, and HolonIQ.
Total raised	$162M cumulative	2024-12	medium	Company claim; reconcile against full cap table and any undisclosed bridge capital.
Consumer scale	15M+ downloads claimed; 10M+ users reported mid-2024	2025-06 / 2024-06	medium	Downloads and users are different denominators; request MAU, DAU, and paying subscribers.
Enterprise traction	200+ customers / brands; 85% adoption rate claimed	2024-12 / 2026-05	medium	Need customer list, seat count, and renewal data to validate enterprise depth.
Current headcount	75 in Jun 2024; 253 in 2025 third-party estimate	2024-06 / 2025-11	low	Request current employee count by function; public sources conflict.
Consumer list price	$20/month or $99/year	2024-12	high	Verify current in-app pricing by geography and Premium Plus upsell rules.
App reputation	4.8 on Apple; 4.7 on Google Play	2026-05	high	Review cohort of low-star complaints by locale and version.
Debt / credit facilities			low	No public debt or credit facility found in reviewed sources; confirm directly with management.
Board composition	Ben Quazzo added at Series C; full board not public	2024-12	low	Request current board roster, observer rights, and governance documents.

Unsupported private metrics are left null or described as conflicting public estimates rather than forced into false precision.

[CO005, CO006, CO007, CO009, CO010, CO018]

FO002: Company snapshot logic

How Speak’s core AI tutor, subscription model, enterprise expansion, and funding momentum connect.

[CO003, CO009, CO017, CO018, CO019, CO032]

1.5 Exhibits

Chapter 02

02Market Analysis

2.1 Market boundary: where Speak competes and where it does not

Speak sits inside a narrower segment than “language learning” in general. The relevant boundary is digital, speaking-first language learning: products that use AI or live conversation to help users practice spoken output in realistic contexts, rather than primarily drilling reading, grammar, or flash-card memorization. Speak’s own fundraising materials make that framing explicit by contrasting its speaking-first tutor with older app models and by calling out the much broader $100B+ online-and-offline language-learning market only as background context. That means the closest substitutes are not every educational app, but a narrower bundle: broad language platforms such as Duolingo, Babbel, and Busuu; specialized speaking apps such as ELSA and Praktika; live-tutor networks such as Cambly; and, in enterprise settings, employer-sponsored English training or in-house tutoring. Spend that belongs inside the market includes consumer subscriptions, employer-funded English upskilling, and digital spoken-language products. Spend outside the market includes generalized test-prep, textbook-led classroom instruction that does not emphasize speaking, or enterprise training categories unrelated to language fluency. The key diligence point is that Speak is best understood as a spoken-fluency wedge inside a large but heterogeneous market. A very broad TAM exists, but the underwritable market depends on who actually pays for repeated speaking practice and how much of that demand can be digitized.[CM001, CM002, CM023, CM030, CM031, CM033]

Market definition table
Segment / category	Included spend	Excluded spend	Buyer / payer	Relevance to Speak
AI speaking-first language apps	Consumer subscriptions and employer-funded speaking practice	General classroom tuition and unrelated workforce training	Learner or employer	Core market
Broader digital English learning	Apps, online tutoring, test-linked practice, digital English courses	Offline-only classroom instruction without digital component	Learner, parent, employer, school	Important adjacent pool
Broad online language learning	All online language-learning subscriptions across many languages	Offline tutoring and textbook sales	Learner, employer, school	Upper-bound context, too broad for SAM
Human tutor and live conversation services	Live sessions and recurring tutor memberships	Self-serve AI-only subscriptions	Learner or employer	Primary substitute, not direct AI market
Community / structured course apps	Subscription courses with reading, grammar, and community correction	Pure speaking-only or pure tutoring models	Learner	Primary substitute set for consumer budgets

Boundary logic separates spoken-fluency products from the broader education and tutoring universe.

[CM001, CM023, CM030, CM031, CM033]

FM001: Buyer and adoption path flow

How demand moves from global English-learning need to consumer subscription or employer-funded adoption.

[CM004, CM023]

2.2 Sizing lenses: broad demand is real, but the lenses are not interchangeable

The strongest public evidence supports a large and growing digital English-learning market, but not a single clean number. Technavio estimates the digital English language learning market will grow by $39.46B between 2024 and 2029 at a 24.5% CAGR, with APAC contributing 39% of growth. Preply, using a different methodology and broader online-learning lens, estimates the global online language-learning market reaches about $115B in 2025, with English alone worth roughly $43.51B and growing around 22% annually. Speak’s own June 2024 post uses the broadest lens of all, describing a $100B+ online-and-offline market. These figures should not be averaged together. Technavio is reporting forecast growth for a specific digital-English segment; Preply is reporting a broader platform-led online market; Speak is using a company-authored framing that includes offline spend. The right interpretation is directional rather than exact: all credible sources point to a very large and still-growing market, but public data does not isolate a clean, source-verified SAM or SOM for Speak. The more actionable sizing conclusion is qualitative: English remains the single largest language-learning category, APAC remains the most important growth geography, and speaking-first digital products are riding both the long-term demand for English and the short-term adoption of AI-assisted learning.[CM001, CM003, CM004, CM006, CM007, CM010]

TAM / SAM / sizing lens table
Publisher / lens	Year	Geography	Value	Methodology	Confidence	Limitation
Technavio digital English market growth	2024-2029	Global	USD 39.46B growth; 24.5% CAGR	Forecast growth for digital English learning segment	medium	Growth increment, not current market size
Technavio APAC contribution	2024-2029	APAC	39% of growth	Regional contribution to forecast market growth	medium	Shows geography, not Speak share
Preply online language learning market	2025	Global	USD 115B	Platform-led report using broader online language-learning lens	low	Competitor-authored and broader than Speak’s exact wedge
Preply English learning segment	2025	Global	USD 43.51B; ~22% annual growth	English-only segment estimate within broader online market	low	Methodology not directly comparable to Technavio
MarketsandMarkets AI in education	2024-2030	Global	USD 2.21B to USD 5.82B; 17.5% CAGR	Adjacent AI-in-education market sizing	medium	Too broad to serve as Speak TAM
Speak company framing	2024	Global	USD 100B+	Company-authored framing of online and in-person language learning	low	Broadest and least precise lens
TechCrunch English learners	2024	Global	~1.5B learners	CEO quote used as demand-side population lens	medium	Learners are not paying customers

The table preserves scope differences instead of collapsing them into a single blended TAM.

[CM001, CM003, CM006, CM007, CM010, CM021]

FM002: Market estimate reference bands

Source-backed reference points for the size of markets adjacent to Speak; repeated low/mid/high values reflect point estimates from different lenses rather than probabilistic scenarios.

These rows compare differently scoped market lenses in the same currency units. They should not be summed or blended into a single TAM for Speak.

[CM006, CM010]

FM003: Spoken-language purchase funnel

Conceptual conversion funnel from broad global English demand to the narrower universe of paid speaking-first products like Speak.

This funnel mixes population and monetization reference points to show narrowing commercial reality rather than a strict unitary forecast. It is a conceptual sizing aid, not a revenue model.

[CM003, CM022]

2.3 Buyer, user, and payer segmentation

Speak’s buyer map splits into two materially different motions. In the consumer motion, the user and payer are usually the same person: a learner paying a recurring subscription for self-directed speaking practice. TechCrunch’s reported $20/month or $99/year list price fits this self-serve model. In the enterprise motion, however, the learner is the user but the employer is the payer, usually buying English training as a productivity, recruiting, mobility, or retention benefit. Speak’s Series C post is explicit that the business product is industry agnostic and already claims 200+ customers. Public market research aligns with that split. Technavio says corporate non-academic learners are a major segment for digital English learning, reflecting the role of English in international business communication. Preply’s 2026 report likewise says English remains the default second language because of business and education demand. Together, these sources suggest that consumer demand is emotion- and aspiration-led, while enterprise demand is budget- and productivity-led. The adoption path also differs by segment. Consumers can trial and subscribe immediately. Employers require HR, L&D, or departmental budget approval and need proof of adoption, seat utilization, and skill improvement. That makes enterprise economics potentially larger per account, but slower and more dependent on renewal evidence than consumer subscriptions.[CM009, CM023, CM024, CM025]

Segment / buyer map
Segment	Buyer	User	Payer	Workflow / budget owner	Adoption trigger
Consumer self-serve	Individual learner	Individual learner	Individual learner	App-store subscription or direct in-app upgrade	Travel, career mobility, confidence building
Employer-sponsored English training	L&D lead / HR / business unit manager	Employee learner	Employer	Training budget, productivity, retention, or global mobility budget	Cross-border communication need or workforce upskilling
Student / exam-adjacent user	Student or parent	Student learner	Student / parent / school	Personal education budget	English for education access or credential goals
Live tutor alternative buyer	Individual learner or employer	Learner	Learner or employer	Tutor marketplace or contract budget	Need for conversation practice without building internal program
Community-course app user	Individual learner	Individual learner	Individual learner	General learning-app subscription budget	Cheaper or more structured alternative to AI-speaking-first products

Public data is strong on consumer and employer use cases but weak on formal school procurement specific to Speak.

[CM009, CM023, CM024, CM025]

FM004: Buyer budget authority matrix

Maps who approves spend, who uses the product, and what proof is needed across the main buyer segments relevant to Speak.

[CM009, CM023]

2.4 Growth drivers and adoption constraints

The demand case is unusually strong. English remains the most learned language globally, with roughly 1.5 billion total speakers and clear utility in business, education, and mobility. Technavio attributes market growth partly to the flexibility of digital courses, while WEF and MarketsandMarkets emphasize the broader shift toward personalized learning, adaptive content, and AI-powered educational tools. Speak-specific materials add a more practical angle: users want to actually speak, not just memorize. For employers, English-learning budgets can be justified as productivity or workforce-development spend, which strengthens the enterprise thesis. But public sources also show real constraints. WEF highlights access, privacy, bias, and displacement concerns in AI-enhanced learning. World Bank argues that scalable AI adoption depends on connectivity, compute, local data, and skills—conditions that are unevenly distributed across emerging markets. The academic review of AI language tools reaches a similar conclusion: personalized feedback helps, but privacy, evidence quality, and teacher readiness lag. Product-specific reviews sharpen the risk further: both Android Police and Languatalk found Speak strong at motivation but weaker at rigorous pronunciation correction and advanced-level feedback. The practical takeaway is that demand growth is not the same as frictionless adoption. Speak benefits from macro demand, but its segment still has to prove learning efficacy, trusted feedback quality, and acceptable privacy norms to convert interest into durable paid usage.[CM004, CM008, CM012, CM015, CM016, CM017]

Growth drivers and constraints table
Driver / constraint	Direction	Timing	Implication	Diligence ask
English remains the default language of business and education	Driver	Current	Sustains the largest single language-learning demand pool	Validate segment demand by geography and language pair
Mobile / self-paced flexibility	Driver	Current	Helps self-serve subscriptions scale without human tutor cost	Measure session frequency and retention by cohort
AI-driven personalization and adaptive feedback	Driver	Current	Improves perceived relevance and can raise engagement	Request efficacy studies and completion-rate data
Employer upskilling demand	Driver	Current	Supports B2B motion and larger ACVs than consumer subscriptions	Request enterprise seat counts and renewal rates
APAC growth concentration	Driver	Current	Favors companies with strong Asian market fit and distribution	Assess localization and payment-stack strength by market
Digital divide / infrastructure gaps	Constraint	Current	Limits adoption in lower-connectivity and lower-income markets	Review offline product capability and country mix
Data privacy, bias, and regulation	Constraint	Current	Can slow institutional or employer adoption of AI-based tools	Review privacy architecture and jurisdiction-specific policies
Weak pronunciation accuracy or shallow feedback	Constraint	Current	Can hurt retention once learners move past beginner novelty	Commission independent efficacy and pronunciation benchmarking
No clean public SAM / market-share data	Constraint	Current	Makes valuation narratives easier than underwriting narratives	Request internal market-share and CAC data

The core diligence divide is not whether demand exists, but whether the market can be monetized efficiently and defensibly.

[CM004, CM008, CM009, CM015, CM016, CM017]

2.5 Exhibits

Chapter 03

03Competitors

3.1 Landscape overview: many ways to solve the same job

Buyers can solve the “help me speak a language fluently” job through at least four classes of products: broad freemium language platforms, specialized AI-speaking apps, live human-tutor networks, and structured course or community-led apps. Speak belongs in the specialized speaking-first cluster, but it competes against all four classes because consumers and employers do not buy product categories—they buy outcomes. That matters: even if Speak’s product is distinctive, it still loses share whenever a buyer decides that Duolingo is good enough, Cambly is more trusted, or Babbel / Busuu are more structured. The public scale hierarchy is clear. Duolingo is the category incumbent with 100M+ MAUs and platform-level bundle power. Babbel and Busuu have much larger installed bases than Speak and compete on structured pedagogy rather than AI novelty. Cambly owns the live-tutor substitute lane. ELSA, Praktika, and Loora prove that AI-speaking competition is now crowded, not greenfield. In this landscape, Speak’s pitch is not “we are the only one,” but rather “we are the most compelling combination of speaking-first UX, AI tutoring, and emerging enterprise distribution.” That is a real position, but not yet a dominant one.[CP001, CP005, CP008, CP011, CP012, CP014]

Competitor profile table
Competitor	Category	Scale / funding signal	Target segment	Differentiation	Limitation
Duolingo	Incumbent broad platform	100M+ MAUs; top-grossing education app	Mass-market global learners	Scale, freemium funnel, 40+ languages	Not purely speaking-first; can be less specialized on conversation depth
ELSA Speak	Direct specialized speaking rival	18M+ downloads; 109K App Store ratings	English learners seeking pronunciation and speaking practice	Pronunciation-first AI coaching	Primarily English-focused rather than broad multilingual platform
Cambly	Live human-tutor substitute	24/7 human tutor network	Learners who value native-speaker conversations	Human accountability and trust	Human cost model can be pricier and less scalable
Busuu	Community-led structured app	120M+ users; 50M+ Play downloads	Learners wanting structure plus community feedback	Native-speaker community and structured review	Less AI-conversation-centric than Speak
Babbel	Structured course app	25M subscriptions sold; 50M+ Play downloads	Learners wanting expert-built lessons	Pedagogical structure and review loops	Less conversation-first, more curriculum-led
Praktika	AI tutor challenger	20M+ learners; low price framing	Cost-sensitive learners wanting AI-tutor style practice	Cheap AI tutor substitute	Public enterprise distribution not evident
Loora	AI English tutor challenger	English-only AI tutor positioning	Professionals and business English learners	Always-on English conversation coach	Narrower brand footprint than larger incumbents

Where funding or revenue is not publicly disclosed in reviewed sources, the table uses scale proxies such as MAUs, downloads, or subscriptions sold.

[CP005, CP008, CP011, CP012, CP014, CP016]

Feature / capability matrix
Buying criterion	Speak	Duolingo	ELSA	Cambly	Busuu	Babbel	Praktika / Loora
Speaking-first workflow	Strong	Moderate	Strong	Strong	Moderate	Moderate	Strong
Explicit pronunciation correction	Moderate / contested	Moderate	Strong	Tutor-dependent	Moderate	Moderate	Strong
Curriculum structure	Moderate	Strong	Moderate	Low	Strong	Strong	Moderate
Human conversation	No	No	No	Yes	Community only	No	No
Multilingual breadth	Moderate	Very strong	Low	Low	Strong	Strong	Low to moderate
Enterprise / employer motion	Present	Limited public evidence	Present but not central	Present	Limited	Limited	Some business messaging, less proven
Feedback depth for advanced learners	Contested	Moderate	Moderate to strong	Tutor-dependent	Moderate	Strong	Moderate

Ordinal assessments synthesize official positioning and reviewed independent commentary. Unknowns were converted into conservative mid-level assessments only when public evidence existed directionally.

[CP001, CP008, CP011, CP023, CP033]

FP001: Competitive positioning map

Ordinal map of scale/distribution power versus speaking-specialization depth across Speak’s main competitor classes.

Axes are evidence-based ordinal scores derived from public scale metrics, product positioning, and review-based specialization signals; they are not survey data.

[CP005, CP029]

3.2 Competitor profiles and business-model differences

Duolingo is the incumbent benchmark because it combines huge free-user scale, a top-grossing app-store position, and broad language breadth. It is not the closest product analog to Speak’s speaking-first tutoring, but it is the most powerful default substitute when buyers want “an app to learn a language.” ELSA is the more relevant direct benchmark: it is also AI-led, English-focused, and centered on speaking and pronunciation. Its 18M+ download claim and 109K App Store ratings put it much closer to Speak’s scale band than Duolingo’s. Cambly competes from a different labor model. It offers real native-speaker tutors around the clock, which makes it more expensive operationally but also more credible for learners who trust humans over AI. Busuu and Babbel are different again: they sell structure, community reinforcement, and curriculum depth. They are less conversation-first than Speak but more clearly pedagogical. Praktika and Loora show how fast the AI-speaking niche itself is filling in, with lower-price or more English-specific tutor framing. The important pattern is not that any one rival dominates every dimension. It is that Speak faces a portfolio of substitutes, each strong in one area: scale, tutor trust, structure, community, or cheap AI tutoring.[CP005, CP008, CP011, CP012, CP014, CP016]

FP002: Capability breadth map

Shows which competitor class is strongest on scale, tutor trust, structure, community, and AI-speaking specialization.

[CP011, CP032]

3.3 Capability, pricing, and distribution comparison

Public pricing data is incomplete, but the directional picture is enough to matter. Speak’s publicly reported consumer price is roughly $20/month or $99/year. Praktika markets an AI-tutor alternative at around $8/month. ELSA offers monthly and yearly memberships. Duolingo uses freemium scale and only converts about 9% of MAUs into subscribers, which gives it broad reach and pricing flexibility. That mix implies pricing pressure from both ends: low-cost AI tutoring below Speak and gigantic freemium funnel economics above it. On capability, Speak is strongest when the buyer values a smooth speaking-first experience and low social friction. On explicit correction, however, ELSA’s pronunciation-first positioning and Speak’s own adverse reviews suggest ELSA may be stronger. On curriculum depth and review systems, Babbel and Busuu look stronger. On live accountability, Cambly is stronger. On pure distribution, Duolingo is clearly stronger. Speak’s one clearly differentiating commercial angle is Speak for Business. Most public rivals here are still primarily consumer products. But because public materials do not disclose customer names, seat counts, or renewals, the competitive importance of that channel is still more promising than proven.[CP003, CP004, CP006, CP007, CP010, CP030]

Pricing / packaging comparison
Company	Public price / model	Included positioning	Unknowns	Implication
Speak	$20/month or $99/year (reported)	Consumer subscription; speaking-first AI tutor	Current geo pricing and enterprise ASP not public	Mid-priced consumer AI tutor
Duolingo	Freemium with paid upsell	Huge free funnel; paid removes ads and adds features	Exact current Super/Max pricing varies by region	Puts ceiling pressure on acquisition and willingness to pay
ELSA	Monthly and yearly memberships	English speaking and pronunciation coaching	Current realized ASP unclear	Direct subscription benchmark in English niche
Cambly	Subscription for live tutoring	Real native-speaker conversations	Exact plan pricing not captured in reviewed source set	Human-trust premium alternative
Busuu	Freemium + subscription	Structured lessons plus community	Current plan pricing not captured here	Competes as a broader value bundle
Babbel	Subscription required for full courses	Expert-built structured lessons	Exact current checkout prices vary by plan	Structured-course alternative with strong brand
Praktika	~$8/month marketing claim	AI tutor substitute for human tutor	Realized plan mix and annual pricing unclear	Low-price AI pressure on Speak

This table compares public list-price anchors and packaging structure, not realized net pricing or enterprise contract value.

[CP003, CP006, CP010, CP016, CP034]

3.4 Moat durability, switching costs, and the main threats

The publicly visible moat for Speak is moderate. The company has a real product identity—speaking-first AI tutoring—and a potentially valuable second channel in business accounts. Those are positives. But the moat is not obviously durable because every one of those strengths is attackable. Duolingo can outspend and out-distribute. Cambly can win on human trust. ELSA can win on explicit pronunciation coaching. Busuu and Babbel can win on structure and retention loops. Praktika and Loora can compress the AI-speaking niche with cheaper or more focused positioning. Switching costs also look limited from public evidence. Consumers can test multiple apps cheaply, and many likely do. Employers may face more friction once they deploy, but there is no public proof that Speak has exclusive contracts or deep workflow lock-in. Public reviews reinforce the risk: if advanced users feel feedback is shallow or overly forgiving, they have plenty of substitutes. The main near-term threats are therefore not existential technology leaps but slower-moving commercial forces: pricing pressure, distribution bundling, and category commoditization. Speak can still win, but it needs outcomes and enterprise proof more than marketing novelty.[CP020, CP022, CP024, CP025, CP026, CP027]

Moat durability / competitive risk register
Moat claim	Threat	Severity	Mitigation / diligence ask
Speaking-first AI UX	Rivals match UX patterns and lesson flow quickly	material	Test real learner outcomes rather than relying on UX distinctiveness
Business distribution channel	Employers multi-home or use Speak as one tool among several	high	Request customer references, seat counts, and exclusivity terms
Multilingual expansion	Broad incumbents still offer more languages and larger ecosystems	material	Track launch cadence and attach rates by new language
Pronunciation and feedback credibility	Independent reviews say Speak can be too forgiving	high	Benchmark speaking accuracy and feedback quality against ELSA and tutors
Brand trust and billing experience	Refund / support complaints weaken willingness to pay	medium	Audit support SLAs, refund rates, and complaint-resolution metrics
AI niche defensibility	Praktika, Loora, and future entrants compress AI-speaking differentiation	high	Monitor pricing moves and feature parity across AI tutor rivals
Incumbent scale pressure	Duolingo’s free funnel and app-store dominance inflate acquisition costs	critical	Measure CAC, payback, and comparative organic acquisition resilience

Severity reflects commercial risk to Speak’s differentiation over a two- to three-year horizon.

[CP020, CP025, CP026, CP028, CP030, CP034]

FP003: Moat and readiness KPI scorecard

Compact scorecard for the core strengths and risks behind Speak’s competitive durability.

[CP020, CP041]

3.5 Exhibits

Chapter 04

04Financials

4.1 Revenue model: hybrid on the surface, opaque underneath

Speak’s public financial story is straightforward at a high level and thin underneath. Public evidence supports two monetization rails: consumer subscriptions sold through mobile app stores and enterprise contracts sold through Speak for Business. TechCrunch consistently reports a consumer price point of roughly $20 per month or $99 per year, and the App Store corroborates recurring monthly and annual subscriptions. Meanwhile the business site and Series C materials show that Speak has built an employer-facing offer and claims 200+ brands or customers. That is enough to conclude Speak is not just a consumer app. But it is not enough to quantify revenue mix, realized pricing, or revenue quality. No reviewed public source discloses ARR, payer count, conversion rate, refund rate, churn, or the share of revenue coming from business accounts. The result is a common diligence trap: visible monetization architecture without visible economics. The category context matters. Language learning buyers can spend nothing on a free or exchange-based product, buy a mid-priced subscription like Speak, pay per lesson for human tutoring, or choose other AI tutors. That means Speak’s public price point sits inside a crowded pricing corridor, not above it.[CI001, CI002, CI003, CI004, CI013, CI015]

Revenue streams table
Revenue stream	Mechanism	Unit	Current public status	Quality	Diligence ask
Consumer subscription	Monthly / annual subscription sold via app stores	subscriber	Confirmed	Medium: list price public, payer count private	Subscriber count, conversion, churn, refunds, share billed on-web vs in-app
Speak for Business	Employer / brand contracts	account / seat	Confirmed	Low-to-medium: buyer count claimed, economics undisclosed	ACV, seats, term length, renewal, implementation cost
Language expansion upside	Additional course catalog monetized through same subscription	language attach / ARPU	Plausible but unquantified	Low	ARPU by language, launch cohorts, attach by geography
Ads / sponsorship	Advertising monetization	n/a	Not surfaced in reviewed sources	Low	Confirm whether any ad revenue exists
Marketplace / transaction fees	Take rate on tutors or other third parties	n/a	Not surfaced in reviewed sources	Low	Confirm whether any take-rate or partner revenue exists

Rows distinguish confirmed revenue mechanisms from plausible but unsupported ones; missing streams are kept explicit rather than inferred away.

[CI001, CI004, CI013, CI034]

Pricing / monetization table
Company / model	Public price / unit	List vs realized	Unknowns	Source / implication
Speak consumer	~$20/month or $99/year	List pricing	Promo discounts, geo pricing, realized net pricing unknown	Mid-priced AI subscription benchmark
Speak business	Contracted enterprise pricing	Not public	Seat pricing, contract length, services, discounts unknown	Potentially higher-quality revenue if enterprise is material
Duolingo	Freemium with paid upsell	List and model known, realized mix partly public	Current regional prices and margin by tier unknown	Strong low-price anchor for the category
Cambly	Per-lesson tutoring plans from about $8.12/lesson annually	List pricing	Tutor mix, utilization, net take unknown	Human tutoring can command higher unit spend
italki	Marketplace lessons with visible trial prices from $5	List pricing per tutor	Take rate, repeat frequency, full lesson ASP unknown	Tutor marketplace creates flexible spend ladder
HelloTalk	Free exchange / VIP ecosystem	Free front door emphasized	VIP monetization details not captured in reviewed set	Zero-price substitute caps casual willingness to pay
Praktika	~$8/month AI tutor claim	Marketing price point	Annual plan and realized ASP unknown	Low-price AI pressure on Speak

This table compares public price anchors and monetization structures, not realized ARPU or contribution margin.

[CI002, CI019, CI020, CI021, CI025, CI038]

Unit economics table
Metric	Public value / status	Confidence	Why it matters	Diligence ask
Consumer list price	~$20/month or $99/year	medium	Starting point for LTV but not enough on its own	Confirm current pricing by geo and discount path
Payer count	Unavailable	low	Needed to convert downloads into revenue	Request current and trailing-12-month payer counts
Gross margin	Unavailable	low	Core test of AI delivery economics	Request margin by channel and compute / platform-fee breakdown
CAC / payback	Unavailable	low	Determines efficiency of growth	Request acquisition channel mix, CAC, payback, and enterprise sales cycle
Refund rate / support drag	Unavailable publicly; adverse anecdotal signals exist	low	Important for consumer revenue quality	Request refund rate, chargebacks, support backlog, and SLA metrics
Enterprise ACV / seat economics	Unavailable	low	Determines whether business improves the model	Request ACV, seats, term, implementation cost, and renewal rate
Retention / NRR	Unavailable	low	Most important revenue-quality test	Request cohort retention, gross retention, and NRR by segment

The sparse public table is the point: most economically decisive metrics remain private.

[CI002, CI015, CI016, CI027, CI031]

FI001: Revenue model bridge

Qualitative bridge from user acquisition and employer acquisition into Speak’s two visible revenue rails.

The flow is qualitative because public sources confirm the rails but not conversion rates, seat counts, or realized net revenue.

[CI001, CI004, CI030]

4.2 Public traction and pricing proxies: useful signals, weak underwriting inputs

Public traction is real. Speak advertises 15M+ downloads, the App Store shows a large ratings base, and Google Play shows six-figure review volume. TechCrunch also reports 10-20 minutes of average daily usage. These are meaningful signals that the product is widely distributed and actively used. But public traction is not the same thing as revenue. Download counts do not tell us how many users are active, how many convert, or how long they stay. Review counts do not show refund rates or net billings. Daily usage does not reveal whether engagement is coming from free-trial users, paying subscribers, or enterprise-sponsored seats. The same caution applies to peer pricing pages. Competitor list prices are useful for triangulating category willingness to pay, but they do not show realized ASPs or contribution margins. Still, the pricing corridor is informative. Speak looks mid-priced relative to low-cost AI tutors, clearly below the spending required for repeated human tutoring, and above free or freemium substitutes. That supports monetization plausibility while also highlighting the limits of pricing power.[CI005, CI006, CI015, CI017, CI018, CI019]

4.3 Cost structure, unit economics, and capital adequacy

From public evidence, Speak appears capital-light in the manufacturing sense but not necessarily cheap to operate. There is no visible inventory, hardware, or project-finance burden. The company looks like a software and content business. However, AI tutoring introduces cost layers that a static course app does not fully bear: inference, speech processing, feedback generation, localization, support, and continued content expansion. Those cost drivers likely matter more than capex. App stores also introduce platform dependency into both distribution and collection. On the business side, any meaningful enterprise motion likely adds sales, onboarding, and customer-success costs. None of that is inherently problematic—but none of it is directly quantified in public sources either. CAC, payback, gross margin, and retention remain private. Capital adequacy is therefore a scenario question. The company has clearly raised meaningful recent capital and has not surfaced public distress signals. But because current cash and burn remain undisclosed, the public record cannot distinguish between “comfortably funded” and “funded but still dependent on another round once hiring or enterprise expansion accelerates.”[CI008, CI010, CI011, CI012, CI027, CI028]

Capital adequacy table
Item	Public status	Current view	Why it matters	Next-round / diligence trigger
Recent funding	Confirmed	~$98M gross announced across recent Series B-3 and C	Provides near-term financing support	Reconcile announced rounds with cap table and net proceeds
Form D corroboration	Partially confirmed	Form D tracker shows $77.7M offering related to late-2024 financing	Improves confidence that the announced raise closed materially	Request official closing docs and proceeds schedule
Cash on hand	Unavailable	Unknown	Most direct runway input	Request current balance and restricted cash
Monthly burn	Unavailable	Unknown	Determines runway speed	Request monthly burn and budget by function
Runway months	Unavailable	Scenario only	Public sources cannot underwrite it	Build base / downside / expansion cases from actual burn
Debt / project finance	No public evidence surfaced	Likely none, but not fully verified	Debt can change risk and flexibility	Request debt schedule, leases, and contingent obligations

This chapter intentionally focuses on adequacy, not a full historical round chronology already covered elsewhere.

[CI008, CI009, CI010, CI012, CI033, CI035]

Public financial gaps table
Missing metric	Impact on diligence	Exact diligence path
Revenue / ARR / billings	Cannot underwrite scale or growth quality	Request audited or board-level trailing-12-month revenue bridge
Subscriber count and conversion	Cannot link downloads to revenue	Request subscriber funnel, trial conversion, and plan mix
Gross margin and platform-fee mix	Cannot assess AI delivery economics	Request COGS by compute, content, support, and app-store fees
Enterprise ACV / seats / renewals	Cannot assess quality of B2B stream	Request top-customer contract summary and renewal history
Burn and runway	Cannot assess financing dependency	Request monthly cash balance and forecast burn scenarios
Refund / churn / support quality	Cannot assess consumer revenue leakage	Request refund rate, churn by plan, and complaint-resolution metrics

The exact diligence path is part of the artifact so the missing-data problem is operational, not rhetorical.

[CI007, CI014, CI027, CI036]

FI002: Unit economics bridge

Shows the public inputs that matter economically even when values are missing.

Most nodes are publicly known concepts rather than public numbers; the bridge exists to make the missing metrics explicit.

[CI015, CI027, CI029]

FI003: Public disclosure completeness range

Ordinal range view showing where Speak’s public financial story is relatively visible and where it is almost blank.

These are evidence-backed ordinal scores, not management KPIs. They summarize how much public disclosure exists: pricing and fundraising are relatively visible, while revenue and runway remain largely undisclosed.

[CI002, CI008, CI007, CI012, CI036]

FI004: Capital intensity / cash-flow map

Maps the main cash drivers that appear likely from public evidence.

The matrix is qualitative and evidence-backed by product and distribution model, not by disclosed cost accounts.

[CI028, CI033, CI037]

4.4 Financial verdict: promising monetization architecture, limited revenue proof

The financial verdict is cautiously positive on architecture and cautious-to-negative on transparency. Speak clearly has monetizable consumer and enterprise surfaces, recent institutional financing, and product engagement that should support some recurring revenue. It is also operating in a category where consumers already pay for tutoring, subscriptions, and exam-prep outcomes, so the existence of demand is not the question. The problem is that the public record validates structure more than performance. Valuation climbed from $500M to $1B on the back of rapid fundraising momentum, but financial disclosure did not keep pace. Public evidence does not tell us how much revenue the business generates, what margins look like after app-store fees and AI delivery costs, how sticky subscribers are, or whether business accounts materially improve economics. That leaves too much of the underwriting case resting on funding headlines, user proxies, and market enthusiasm. For diligence, the next step is not finding more marketing pages. It is obtaining the internal financial packet: ARR and billings, payer counts, gross margin by channel, refund and churn data, enterprise ACVs, sales efficiency, and a real runway model.[CI032, CI034, CI035, CI036]

4.5 Exhibits

Chapter 05

05Product & Technology

5.1 Product surface and learner workflow

Speak’s public product story is unusually coherent: the app is sold as a speaking-first AI tutor, not a vocabulary game or grammar-reference tool. Across the homepage, app-store listings, and product posts, the same workflow repeats: learners enter a structured lesson, speak target phrases out loud, receive immediate corrective feedback, and then apply what they practiced in freer AI conversations. That workflow is now formalized as the Speak Method’s Learn → Practice → Apply loop. In practice, the visible module set includes Tutor Lessons, speaking drills / speaking cards, roleplays, free-form conversations, progress tracking, and newer review or proficiency features such as unit refreshers and Speak Level. The product surface is still primarily mobile-first, but it is no longer just an English-learning app; public help content shows six target languages for learners and 15 native-language entry points for English instruction. The main strategic takeaway is that Speak has a tight workflow fit between pedagogy and product packaging: every major module is ultimately in service of getting the learner to speak, compare output, and try again rather than passively consume content.[CE001, CE002, CE004, CE005, CE006, CE008]

Product module / asset matrix
Module / asset	Primary user	Status / maturity	Differentiation	Diligence gap
Tutor Lessons	New and progressing learners	Core / shipping	Dynamic AI tutor can correct, answer questions, and redirect rather than just play canned audio	No public completion or learning-outcome benchmarks by module
Speaking drills / speaking cards	All learners in structured lessons	Core / shipping	Speaking-first repetition plus in-house matching and phonetic alignment instead of generic quiz mechanics	Public evidence is internal-only on matching accuracy and false-positive control
Live Roleplays	Learners moving into open-ended practice	Shipping, but originally limited rollout in late 2024	Realtime voice conversations combine OpenAI speech-to-speech with Speak proficiency graph, hints, and objectives	Public rollout breadth by market / plan is not quantified
Free Talk / custom conversations	Self-directed learners and higher-intent practice	Shipping	Open-ended AI conversations personalize scenarios and let users create practice around their own contexts	Independent reviews say conversation control and feedback depth still trail serious-learner needs
Progress / review features (Unit Refreshers, Speak Level, side quests)	Active learners returning between sessions	Shipping / staged by market	Adds retention loops and measurable fluency tracking on top of speaking workflow	Speak Level was still limited to English learners in select markets entering 2026

Public surfaces show a coherent module stack, but several higher-order quality claims remain internal or rollout-limited.

[CE004, CE005, CE006, CE008, CE016, CE027]

Workflow / use-case table
User job	Current workflow / pain point	Speak solution	Publicly observable benefit	Limitation / caveat
Start speaking quickly	Traditional apps over-index on reading, grammar, or vocabulary recognition	Speaking-first onboarding with immediate repetition and feedback	Homepage and reviews repeatedly describe learners speaking from day 1	Benefit is mostly anecdotal rather than third-party efficacy-tested
Practice realistic conversations	Human tutors are expensive and hard to schedule	Roleplays and Free Talk simulate common scenarios with AI partners	Users and independent reviews both highlight real-world conversation practice	Some reviewers say the AI can feel repetitive or over-questioning
Correct pronunciation or delivery	Generic ASR misses accented learner speech	Custom ASR plus matching stack, with speech-to-speech where audio nuance matters	Speak reports faster feedback and lower WER after backend upgrades	Most performance numbers are internal and not externally benchmarked
Resume after breaks / stay motivated	Language apps often lose users between sessions	Unit Refreshers, streak systems, side quests, and Speak Level progress views	Winter 2025 release explicitly added retention and progress surfaces	Some app-store reviews still ask for richer rewards and more gamified loops
Learn another language as an English speaker	Prior Speak product focused on English learners in Asia	2025 language expansion added French, Japanese, Korean, and Italian after Spanish	Public help pages now document six target languages	Intermediate depth outside flagship tracks remains incomplete

The workflow is strongest where Speak can keep the learner in a tight speak-hear-correct loop; open-ended mastery and breadth remain less verified.

[CE001, CE004, CE008, CE009, CE015, CE018]

FE001: Learner experience stack

This stack focuses on how learner-facing modules, pedagogy logic, speech feedback systems, and support operations layer together to create Speak’s speaking-first UX.

[CE001, CE004, CE008, CE016, CE029, CE033]

5.2 Architecture and operating model

Speak now discloses more technical detail than many consumer education apps. The 2024 ASR overhaul moved the company away from fragmented on-device and third-party speech systems toward a unified backend stack. Speak says that stack fine-tunes Conformer-CTC on learner speech, serves inference with Nvidia Riva / Triton on Kubernetes and Google Cloud, and uses gRPC plus websocket streaming to return speech feedback quickly enough for lesson interactions. In 2025, Speak added Matching v2, which combines streaming ASR with a phonetic model and forced alignment; this matters because learner speech often breaks the assumptions built into generic ASR. By early 2026, Speak layered a broader voice-agent platform on top: mobile apps connect over WebRTC via LiveKit Cloud; voice-agent servers orchestrate external ASR, LLM, TTS, and speech-to-speech providers; and the system chooses cascade or speech-to-speech by feature rather than forcing one universal stack. That architecture implies real technical depth, but also real operating complexity. Speak is explicitly multi-provider, region-aware, and latency-obsessed, with public discussion of failover, tail-latency monitoring, semantic turn detection, and provider switching when quality or availability degrades.[CE010, CE011, CE012, CE013, CE014, CE015]

Technology / operating architecture table
Layer / process / component	Role	Key dependency	Principal risk
Mobile iOS / Android clients	Capture learner audio, play responses, host lesson UX	Native mobile apps; OS permissions; current versions	No desktop fallback and microphone permissions can break voice workflows
Realtime transport	Low-latency bidirectional audio between clients and backend	WebRTC via LiveKit Cloud; regional routing	External platform dependency and cross-region latency variation
Voice-agent runtime	Runs lesson logic, turn-taking, hints, corrections, and orchestration	LiveKit Agents plus Speak application logic	Complexity grows as feature count and provider permutations expand
Speech recognition and matching	Transcribe learner speech and map it to lesson targets	Fine-tuned Conformer-CTC, Nvidia Riva/Triton, wav2vec2 phonetic model, forced alignment	Internal metrics are strong but externally unaudited; accent edge cases still exist
LLM / TTS / speech-to-speech providers	Generate tutor responses and voice output	OpenAI Realtime API, external LLM/TTS providers, backup-provider routing	Multi-provider cost, reliability, and model-quality variance
Learning engine and content systems	Store lessons, proficiency graph, conversation state, analytics	Speak backend services and curriculum systems	Limited public evidence on data governance, evals, and versioning discipline
Observability and failover	Measure latency / errors and reroute traffic when providers degrade	Per-provider metrics, backup routing, region-aware operations	Public disclosures do not quantify uptime or incident history

Speak’s technical disclosures indicate a real production ML stack rather than a thin wrapper around one model vendor.

[CE011, CE012, CE016, CE017, CE019, CE020]

Roadmap / release / development-stage table
Date / stage	Feature or milestone	Status	Implication	Source
2024-06	Backend ASR overhaul	Shipped	Core speech feedback moved to a stronger unified backend and internal learner-speech tuning	Speak ASR blog
2024-10	Live Roleplays with Realtime API	Shipped, initially limited rollout	Moves Speak closer to natural real-time conversation and heavier OpenAI dependency	Speak Live Roleplays blog
2024-11	Google Play Best App recognition in HK / Korea / Taiwan	Shipped / external recognition	Suggests traction in core Asian markets where Speak started	Speak Google Play blog
2025-06	French, Japanese, Korean, and Italian launches for English speakers	Shipped	Broadens TAM beyond English-learning use case	Speak new languages blog
2025-12	Winter release: adaptive lessons, refreshers, side quests, audio roleplays, Speak Level	Shipped / partially staged	Adds retention and proficiency surfaces rather than only new lessons	Speak winter release blog
2026-03	Voice Agent Platform disclosure	Shipped platform capability, still evolving	Signals continuing investment in infra, turn detection, and richer speech-to-speech uses	Speak voice agent platform blog
2026 ongoing	More intermediate courses, more languages, broader Speak Level rollout	In development	Breadth and maturity remain moving targets rather than fully complete products	Speak help center and release notes

Speak ships frequently, but several high-value roadmap items remain explicitly in development or market-limited.

[CE008, CE009, CE015, CE019, CE027, CE049]

5.3 Trust, quality, and support controls

On trust and operational quality, Speak’s public evidence is mixed but directionally better than marketing-only surfaces. The company exposes concrete support workflows: in-app issue flags, troubleshooting guides, microphone-permission checklists, and email escalation for unresolved issues. Public store listings also disclose basic privacy and platform controls such as encrypted-in-transit data handling, deletion requests, minimum OS support, and mobile-only availability. Curriculum quality control is also more explicit than usual for an AI app: Speak says lessons are authored by learning designers, AI is used to accelerate production, and humans still review the end product. The weaker side of the trust picture is external friction. Independent review sources praise voice quality and speaking-centric design, but they also report shallow feedback depth, confusing premium tiering, laggy voice recognition episodes, refund complaints, and support delays. More importantly for diligence, Speak’s public privacy and terms URLs did not render readable policy text in text-only review, and we did not locate public SOC 2, ISO 27001, or similar security assurance artifacts. That leaves a real enterprise-trust gap even though consumer support mechanics are clearly visible.[CE029, CE030, CE031, CE032, CE033, CE034]

Trust / quality / compliance table
Control / disclosure / quality signal	Status	Scope	Gap or implication
Curriculum written by learning designers and human-reviewed	Present	Applies to lessons and curriculum creation	Good quality signal for content integrity, but no external pedagogy audit found
In-app bug report flag plus support email workflow	Present	Lesson issues, account issues, troubleshooting escalations	Shows operational support process, but support is email-first and may scale unevenly
Voice-recognition troubleshooting guidance	Present	Microphone permissions, cache clearing, OS updates, Samsung/Bixby conflicts	Confirms the company expects speech capture failures to happen in the field
Data handling disclosure on Google Play	Present	States sharing of app activity/device IDs, collection of personal and financial info, encryption in transit, deletion requests	Useful baseline disclosure, but not a substitute for a detailed privacy or security program
Privacy and terms web pages	Partially inspectable	Public URLs exist from store listings	Text-only review returned JS shells, reducing direct diligence visibility
Security certifications / audits / trust center	Not found publicly	No public SOC 2, ISO 27001, penetration-test summary, or uptime page retained in chapter evidence	Material diligence gap if underwriting enterprise expansion or regulated buyers

Trust evidence is adequate for a consumer app review but incomplete for deeper enterprise or privacy diligence.

[CE029, CE033, CE034, CE035, CE036, CE043]

FE003: External platform dependency map

The most material public dependencies are not every internal component, but the external platforms and interfaces that can bottleneck performance, privacy review, or scale.

[CE019, CE027, CE031, CE034, CE036, CE047]

FE004: Product maturity / capability map

Speak looks strongest in core speaking UX and underlying speech infrastructure; public evidence is weaker on formal trust artifacts and advanced-learner depth.

[CE018, CE033, CE034, CE036, CE042, CE043]

5.4 Differentiation, maturity, and open diligence asks

Speak’s strongest product-level differentiation is not just “AI tutor” branding; it is a purpose-built speech stack paired with pedagogy that appears customized for language learners rather than retrofitted from a generic chatbot. Public claims around learner-accent ASR, phonetic matching, multi-provider TTS selection, and feature-specific voice-pipeline choices support that view. External corroboration from OpenAI, TechCrunch, app stores, and independent reviewers suggests the market sees the same pattern: a highly polished speaking product with strong user love and genuine technical ambition. The maturity signals are credible but should not be over-read. Ratings and download counts are strong, app versions are shipping frequently, and Google Play recognition suggests real distribution in Asia. At the same time, several underwriting questions remain open: how deep the company’s security/compliance program is, whether the richer voice stack is economically efficient at scale, how much of the roadmap is still rollout-limited, and whether independent learning-outcome evidence exists beyond internal product metrics and customer anecdotes. In short, Speak looks product-strong and technically differentiated, but public evidence is much stronger on capability and UX than on formal trust/compliance disclosure.[CE023, CE024, CE037, CE038, CE039, CE040]

FE002: Learn → Practice → Apply operating flow

This flow isolates Speak’s pedagogical loop rather than inventorying modules: introduce language, drill it aloud, then apply it in live conversation and review.

[CE005, CE031, CE039, CE045]

5.5 Exhibits

Chapter 06

06Customers

6.1 Customer base and segmentation

Public evidence suggests that Speak still looks first and foremost like a self-serve consumer language app, with an emerging employer-paid overlay rather than a purely enterprise business. The homepage and app-store listings emphasize direct individual learning, six core language tracks, AI speaking practice, and app-store scale. That self-serve layer appears broad: official and third-party surfaces point to more than 10 million learners by mid-2024 and roughly 15 million downloads by late 2025 to 2026. Geography also matters. South Korea was Speak's inaugural market, the company has publicly said nearly 6% of Korea's population was learning English with the app in 2024, and TechCrunch reported well over 100,000 Korean subscribers as early as 2023. At the same time, Speak has expanded far beyond that origin, with official and third-party reporting pointing to 40+ countries and language offerings spanning English, Spanish, French, Italian, Japanese, and Korean. The newer B2B motion appears to use employers as payer, employees as primary users, and English upskilling as the flagship use case.[CU001, CU002, CU003, CU004, CU005, CU006]

Customer segmentation table
Segment	Buyer / user / payer	Use case	Scale evidence	Revenue / strategic value	Gap
Self-serve consumer learners	Individual / learner / individual	Speaking practice for travel, work, school, and everyday conversation	15M+ downloads on Speak homepage; 44K iOS ratings; 112K Google Play reviews	Core consumer subscription engine and top-of-funnel for brand spread	No public free-to-paid conversion or payer mix
Korea English-learning base	Individual or employer / learner / individual or employer	English fluency and confidence in Speak's origin market	Nearly 6% of Korea population claim in 2024; well over 100K Korean subscribers in 2023	Explains early density, review volume, and enterprise seed demand	Current Korea share of revenue and users is undisclosed
International multi-language consumers	Individual / learner / individual	Spanish, French, English, Italian, Japanese, and Korean speaking practice	40+ countries by 2024 plus six core course tracks on the homepage	Supports global growth beyond Korea and broader cross-sell	No country-by-country user or revenue breakout
Employer-sponsored B2B teams	Employer or L&D / employee / employer	Business English and workforce fluency	200+ brands on B2B page; 200+ business customers and 85% employee adoption in Series C post	Higher-ARPU expansion path layered on top of consumer demand	No public ACV, seat count, or renewal data
Named corporate subscriber base	Employer / employee / employer	Employee language benefit or upskilling program	Forbes says ~500 companies including KPMG and HD Hyundai offered subscriptions, mostly in Korea	Suggests real procurement penetration beyond pilots	No public roster, case studies, or deployment outcomes for most logos

Public evidence points to a consumer-first base with a meaningful but still under-disclosed employer-sponsored layer.

[CU002, CU004, CU005, CU006, CU009, CU012]

Customer growth / adoption trajectory table
Metric	Value	Date	Source	Confidence	Implication	Missing denominator
Learners / users	10M+	2024-06-18 to 2024-06-20	Speak Series B + TechCrunch	medium	By mid-2024 Speak had already reached meaningful global scale	No split between free, paid, or active users
Downloads	15M+	2026-05-05 access	Speak homepage	medium	Current top-of-funnel scale appears materially larger than the 2024 user disclosure	No install-to-activation denominator
App Store rating volume	44K ratings	2026-05-05 access	Apple App Store	high	Large iOS review surface indicates ongoing consumer reach	No rating distribution by country or payer type
Google Play review volume	112K reviews / 10M+ downloads	2026-05-05 access	Google Play	high	Android adds large-scale public validation of adoption	No DAU/MAU or paid-user denominator
Lines spoken	3.74B	2025-12-19	Speak Wrapped 2025	medium	Strong signal of repeat speaking activity rather than one-time installs	No unique-user denominator
Hours practiced in app	19.6M hours	2025-12-19	Speak Wrapped 2025	medium	Shows large aggregate learning time and habit formation	No hours per paying user or per cohort
Lessons started	231M	2025-12-19	Speak Wrapped 2025	medium	Indicates scaled recurring lesson consumption	No completion-rate denominator
Personalized lessons created	80.3M	2025-12-19	Speak Wrapped 2025	medium	Suggests users actively use adaptive or personalized paths	No share of users engaging with personalization
Business customers	200+	2024-12-10	Speak Series C + Dataconomy	medium	Confirms B2B is more than a landing page experiment	No breakdown by segment, geography, or contract size
Companies offering employee subscriptions	~500	2025-11-12	Forbes	low	Implies wider corporate distribution than official case-study count suggests	No active-seat or paid-logo denominator

These metrics are adoption and activity proxies, not retention cohorts or revenue-quality metrics.

[CU004, CU009, CU010, CU015, CU017, CU018]

FU001: Customer journey map

Speak's public journey runs from individual discovery and first speaking session to habit formation, app-store proof, and eventual employer-sponsored expansion for some users.

[CU001, CU004, CU007, CU026, CU028]

FU002: Adoption / deployment flow

Public evidence supports a repeatable flow from consumer discovery to repeated speaking activity, with a narrower branch into employer-sponsored deployment.

[CU004, CU010, CU013, CU017, CU018, CU019]

6.2 Named proof and adoption signals

The strongest public proof is not a roster of enterprise case studies; it is a combination of scaled review surfaces and duplicated named learner testimony. Apple and Google show large rating and review counts, while Speak's own review page republishes identifiable customer comments that also appear on public app-store surfaces. That duplication matters because it reduces the risk that every quote is wholly synthetic, even if the official page is still positively selected. The clearest named examples are consumer learners. j herronov says months of French use improved comprehension of French media and made free talk and bookmarking valuable. Dan S says more than six months of Spanish use beat prior tools because feedback was detailed and instantaneous. Rosalyn Mulder says Speak's tutor-like feedback and streaks made it her preferred option over Duolingo and Mango. These are still self-reported outcomes, but they are specific, recent, and visible across independent distribution channels. On the enterprise side, the proof is thinner but not nonexistent: Speak says it has 200+ business customers, and Forbes reported roughly 500 companies, including KPMG and HD Hyundai, offered subscriptions to employees, mostly in South Korea.[CU005, CU008, CU015, CU016, CU023, CU024]

Named customer proof table
Customer	Segment	Deployment / use case	Production vs pilot	Outcome	Limitation
j herronov	Consumer French learner	Used Speak for a few months to improve listening, free talk, and bookmarks for real conversation	Active consumer use	Says the app improved ability to catch words and sayings in French media and made personalized practice useful	Self-reported outcome; no independent proficiency audit
Dan S	Consumer Spanish learner	Used Speak for 6+ months as a primary speaking-feedback tool	Active consumer use	Says detailed and instantaneous feedback made Speak better than prior Spanish-learning options	Single reviewer and no quantified before/after benchmark
Rosalyn Mulder	Consumer Spanish learner	Android user comparing Speak with Duolingo and Mango	Active consumer use	Says tutor-like feedback and streaks improved motivation and made Speak her preferred option	Motivational proof only; no retention or spend disclosure

Sample only. Publicly accessible named proof is much richer for consumer reviewers than for enterprise logos.

[CU031, CU032, CU033, CU048]

FU003: Customer proof matrix

Speak has strong public consumer proof and aggregate enterprise counts, but weak public visibility into retention and concentration.

[CU008, CU016, CU029, CU043, CU044, CU045]

6.3 Durability, expansion, and concentration risk

The key diligence problem is that customer-quality evidence is much weaker than adoption evidence. Public materials show downloads, ratings, lessons, lines spoken, and even an employer adoption percentage, but they do not disclose NRR, GRR, churn, contract length, cohort retention, or top-customer concentration. That means Speak can credibly show who tries and likes the product, but not yet how durable those customers are in revenue terms. The monetization surface also introduces friction points that matter for churn analysis. App-store pricing shows self-serve subscriptions ranging from roughly $18 monthly to $84 annual premium plans and higher premium-plus tiers, while independent complaint pages surface refund, auto-renewal, language-coverage, and speech-recognition complaints. None of that proves systemic weakness, but it does mean the public record is still better at showing acquisition and usage than renewal quality. The enterprise expansion story is also plausible but incomplete: Forbes says the push began when consumers asked employers to pay, which is a healthy land-and-expand signal, yet the public corpus still lacks a named case-study library, contract terms, or a geography split that would show whether B2B revenue is still heavily concentrated in Korea. Public evidence also does not bridge from headline activity metrics to active payer counts, so investors still need the basics: paid-user cohorts, annual-plan renewal curves, refund rates, and logo-level enterprise retention. Until those are disclosed privately, the right reading is that Speak has demonstrated demand and habit, but not yet fully auditable customer durability across cohorts, logos, or geographies globally.[CU016, CU026, CU028, CU036, CU037, CU039]

Retention / repeat usage / satisfaction table
Metric	Value	Segment	Confidence	Diligence ask
App Store satisfaction	4.8	iOS consumer users	high	Request rating distribution by country, language track, and payer status
Google Play satisfaction	4.7	Android consumer users	high	Request rating distribution and uninstall / refund linkage
Employee adoption	85	Speak for Business employees	medium	Request cohort definition, seat base, and time window behind the metric
Growth durability proxy	Users doubled yearly for five years through 2024	All users	medium	Request paid-user retention to separate growth from churn masking
Repeat usage proxy	3.74B lines spoken in 2025	All users	medium	Request active-user denominator and cohort retention curve
Public NRR		Enterprise	low	Request NRR by contract cohort and geography
Public GRR / churn		All segments	low	Request logo churn, seat churn, and involuntary billing churn
Public contract length / renewals		Speak for Business	low	Request median term, renewal rate, and expansion rate by logo

This table substitutes for the planned cohort figure because no reviewed public source provided time-bucket retention percentages suitable for a cohort chart.

[CU016, CU023, CU024, CU040, CU045, CU046]

Expansion and concentration risk table
Expansion driver	Concentration / friction risk	Impact	Diligence path
Consumer-to-employer upsell	B2B adoption may still be heavily seeded by Korea-based consumer demand	Could overstate global enterprise repeatability if Korea remains the dominant proving ground	Request enterprise ARR, seats, and renewals by country
Speaking-first differentiation and strong ratings	Public ratings do not show whether paid cohorts renew at attractive rates	Adoption quality may look better than revenue durability	Request annual-plan renewal curves and refund rates
More countries and more languages	Language-catalog complaints suggest some users want unsupported languages or better localization	Can cap TAM in adjacent learner segments and raise churn risk after initial trial	Request language roadmap, waitlist size, and retention by new language
Employer-sponsored adoption metric	85% employee adoption is promising but lacks context on seat counts and contract value	Could reflect a small, highly engaged subset rather than scaled enterprise expansion	Request logo-level seat activation and ACV distribution
Self-serve subscription pricing	Auto-renewing subscriptions and refund complaints can create support and involuntary churn burden	Consumer churn and brand sentiment may be more volatile than ratings imply	Request refund, cancellation, and chargeback metrics by store
Aggregate enterprise count	No public roster for the 200+/500-company base means concentration cannot be checked	A few large logos could drive a disproportionate share of B2B revenue	Request top-10 logo revenue share and sponsor concentration

The public record supports expansion narratives, but not a clean concentration bridge.

[CU026, CU028, CU029, CU037, CU041, CU042]

Independent review and complaint surface table
Surface	Signal type	What it shows	Balance	Limitation
Apple App Store reviews	Customer-proof	Large recent review stream with strong ratings and some concrete feature feedback	Mostly positive in the accessible sample	No full rating-distribution or refund data
Google Play reviews	Customer-proof	Large Android review base plus visible complaints on recognition and language handling	Mixed positive and negative examples are visible	No payer-status or retention linkage
Speak official review page	Curated customer-proof	Duplicates real named reviews and makes them easy to inspect	Positively selected because it is limited to 5-star App Store reviews	Cannot stand alone as a balanced satisfaction read
LanguaTalk review	Independent review	Structured critique of feedback depth, lesson variety, and speech-recognition leniency	Adverse	Reviewer sample size is small and editorial
JustUseApp reviews	Independent complaint aggregation	Refund, language-coverage, and missing-content complaints appear publicly	Adverse	Aggregation quality is weaker than first-party store data

This table isolates review-surface quality so the retention table can stay focused on durability metrics.

[CU008, CU036, CU037, CU039, CU040, CU041]

6.4 Exhibits

Chapter 07

07Risks

7.1 Legal, regulatory, and consumer-protection risk

Speak is not facing an obvious public enforcement event today, but its legal surface is more complicated than a generic language app because the product is voice-first, AI-mediated, global, and subscription-driven. Google Play says the app may share app activity and device identifiers with third parties and may collect personal and financial information, while also offering encryption in transit and deletion requests. Apple rates the app 13+, but Google rates it Everyone, which creates an ambiguous minor-facing posture for a service that invites users to speak naturally into an AI tutor. That matters because FTC guidance says COPPA obligations attach if a service is directed to children under 13 or has actual knowledge of collection, and because AI Act and GDPR obligations can intensify when AI systems become more consequential or less transparent. On top of privacy, billing rights are fragmented: Apple controls App Store refunds, Google imposes strict transparency and cancellation rules, and Speak’s own help center draws fine distinctions between cancellation and refunds. The residual legal risk is therefore manageable but real: one part regulation, one part consumer-trust plumbing.[CR003, CR004, CR005, CR008, CR010, CR012]

Regulatory / legal risk register
rule / case	jurisdiction	status	likelihood	severity	mitigation	residual exposure	diligence path
Voice-data, minor-facing, and parental-consent exposure	US / global	Active if Speak is used by under-13 users or knowingly collects their personal data	medium	high	Apple and Google surface age / privacy disclosures; Google says data can be deleted and is encrypted in transit	medium-high	Request age-screening logic, child-data controls, voice-retention schedule, and parent-consent workflow by market.
AI Act / GDPR transparency and assessment obligations	EU	Phased in from Aug 2024 onward	medium	high	Speak clearly markets the product as AI-powered and current use appears focused on tutoring rather than regulated decisions	medium	Map every EU-facing AI feature to transparency, logging, DPIA / FRIA, and special-category-data assumptions before scaling assessments or richer speech analytics.
Auto-renew, refund, and recurring-billing compliance	US / EU / app stores	Current and ongoing	medium-high	medium-high	Help-center flows exist and Apple / Google each provide their own cancellation and refund controls	medium	Test actual disclosure screens, trial-to-paid conversion notices, store-specific chargeback rates, and whether cancellation is obvious in each client.
Cross-border transfer and third-party disclosure risk	EU / global	Ongoing	medium	high	Store disclosures already surface some sharing, collection, encryption, and deletion controls	medium-high	Obtain the full privacy policy, terms, DPA, subprocessor list, SCC or transfer framework, and any model-training exclusions for voice transcripts.

Rows are ordered by residual severity and focus on public legal exposures most likely to matter to an investor in a global voice-AI subscription app.

[CR003, CR004, CR005, CR008, CR010, CR012]

7.2 Operational quality and platform-dependency risk

Operationally, Speak depends on the quality of speech recognition and conversational latency in a product where users notice every miss immediately. Speak built custom ASR because earlier third-party speech-recognition services struggled with accented learner speech, and it now runs that stack on Nvidia GPU inference inside Kubernetes on Google Cloud. That creates a stronger product than a purely off-the-shelf stack, but also a narrower operational base: latency, throughput, or cloud-vendor issues can translate directly into weaker tutoring quality. The same pattern appears in OpenAI dependence. Speak publicly uses GPT-4, GPT-4o, and the Realtime API; it also admits speech-to-speech models still lag text models on instruction following and nuanced coaching. Independent review sources reinforce that this is not just a theoretical risk. Complaints cite laggy recognition, missed words in the middle of sentences, noisy-environment failures, battery drain, device heating, and occasional recording or progress-saving bugs. Strong product love exists, but the operational bar is high because users are paying for an experience that feels immediate, accurate, and fair.[CR022, CR023, CR024, CR025, CR026, CR027]

Operational / quality / security risk register
failure mode	likelihood	severity	mitigation maturity	residual exposure	unresolved gap
Recognition misses accented or continuous speech, undermining feedback quality	medium-high	high	medium — Speak built custom ASR after third-party systems underperformed	high	No public accuracy / error-budget disclosure by language or environment.
Realtime roleplays depend on speech-to-speech models that Speak still says are weaker than text models for nuanced coaching	medium	high	low-medium — strong product work, but admitted model limits remain	medium-high	No public fallback routing or quality-threshold logic by feature.
Long sessions can overheat devices or degrade in noisy environments	medium	medium	low — review aggregators surface the issue but no telemetry is public	medium	No hardware-session or battery-performance data is public.
Billing and support disputes can turn product love into chargebacks and trust damage	medium-high	medium-high	medium — extensive help-center coverage exists	medium-high	No public refund-rate, chargeback-rate, or first-response-time metrics.
Localization and onboarding glitches can blunt expansion into new languages and markets	medium	medium	medium — demand is strong and rollout is ongoing	medium	No public launch-readiness scorecards or country-level retention breakdowns.

This register focuses on issues that can degrade the lived learning experience even when the underlying product concept is strong.

[CR023, CR024, CR025, CR026, CR027, CR036]

Partner / dependency risk register
dependency	counterparty	role	concentration	failure scenario	severity	mitigation	residual exposure
Conversational model stack	OpenAI	GPT-4, GPT-4o, Realtime API, and related tutoring features	high	API pricing, policy, or uptime changes degrade roleplays, feedback quality, or margins	high	Speak has its own learning engine and some custom ASR, not a pure wrapper	high
Inference / ASR infrastructure	Google Cloud + Nvidia	Custom ASR serving, GPUs, Kubernetes runtime	high	Cloud outage, GPU constraint, or inference-cost spike slows or weakens feedback loops	high	Custom model ownership gives some flexibility	high
iOS distribution and billing	Apple App Store	Acquisition, ranking, and in-app subscription rail	medium-high	Policy or billing friction reduces conversion, refunds, or visibility on a critical platform	medium-high	Apple handles refunds and cancellations through standard flows	medium-high
Android distribution and billing	Google Play	Android acquisition, ranking, data-safety disclosures, and subscriptions	medium-high	Policy shifts or disclosure problems slow Android growth or increase trust friction	medium-high	Google provides clear cancellation expectations and verified reviews	medium-high
Private capital support	Venture investors	Funding and valuation support for scale and model investment	medium	If growth or economics soften, next financing may become harder at a premium mark	high	Speak still has active backers and recent capital	medium-high

Even strong product companies become fragile when billing, AI quality, infrastructure, and financing all route through a small set of counterparties.

[CR002, CR022, CR023, CR026, CR029, CR033]

7.3 Financial-model and execution risk

Speak’s financial risk is less about visible distress than about opaque economics behind an expensive-to-run product. Public sources show impressive momentum: over 10 million users by mid-2024, more than 40 countries, a Series C at a $1 billion valuation by late 2024, and continued investor support from OpenAI, Khosla, Accel, and Y Combinator. But the same public record says Speak sometimes builds experiences that are cost-prohibitive today in anticipation of cheaper models later, and independent reporting notes that custom LLM ambitions can be expensive. That means gross margin, inference cost per active learner, refund leakage, and support intensity matter a great deal. Those figures are not public. NicheMetric offers directional app-revenue and download estimates, but not with enough transparency to close underwriting questions. Execution risk compounds this uncertainty: the company scaled from South Korea into more than 20 and then 40-plus countries while still expanding languages, hiring, and product scope. Positive reviews suggest the bet is working, yet requests for more languages and better support imply the operating system around the tutor may still be catching up to demand.[CR002, CR029, CR030, CR031, CR032, CR033]

People / execution risk register
role / function	dependency or gap	likelihood	severity	mitigation	diligence path
ML / ASR / LLM engineering	Custom ASR plus OpenAI-powered tutoring requires scarce systems and model talent	medium	high	Recent capital and active hiring suggest the company is still investing behind the stack	Request org chart, attrition, on-call design, and vendor-ownership map across ASR and roleplay features.
Support and billing operations	Apple, Google, and direct-web payment flows create multilingual support complexity	medium-high	high	Help-center coverage is broad and current	Review first-response times, chargeback handling, escalation runbooks, and localization coverage.
Content and localization operations	More languages and more countries increase lesson QA, policy, and onboarding complexity	medium	high	User love is strong and expansion has already worked in multiple countries	Review retention and refund cohorts by language, region, and app-store channel.
Enterprise / business offering	Speak for Business expands support, contract, and reliability expectations beyond the consumer app	medium	medium-high	Public evidence shows a business edition exists	Request enterprise pipeline, SLA commitments, security questionnaire responses, and support staffing.

This table ranks execution risk by the degree to which scaling complexity can outpace the current public operating system.

[CR033, CR048, CR049, CR052, CR053]

7.4 Mitigations, monitoring indicators, and thesis-break triggers

The good news for Speak is that the product already shows evidence of real user love, large-scale adoption, and a willingness to publish some customer-support and technical detail. That lowers the chance that the main risks are purely hidden disasters. The more realistic underwriting question is whether those mitigations mature as quickly as the company’s ambition. Strong ratings and positive speaking-feedback reviews help, encryption and deletion controls help, and documented cancellation / refund flows help. But none of those substitute for full privacy-policy visibility, enterprise reliability commitments, or hard unit-economics evidence. The most important monitors are practical: store-review clusters around recognition or billing, chargeback and refund rates, any inability to show clear retention and data-use controls for voice data, and any OpenAI or cloud cost shift that forces a weaker product or margin reset. For an investor, the thesis does not break on one bad review week. It breaks if recurring operational complaints, compliance gaps, or missing economics start to overwhelm a product that is currently winning because it feels better than the alternatives.[CR005, CR008, CR010, CR013, CR022, CR033]

Mitigation and kill criteria table
risk	monitorable trigger	threshold / event	action implication
Subscription trust / billing friction	Refund requests, chargebacks, and billing-related review spikes	Meaningful multi-week jump after a pricing or trial-flow change	Pause aggressive acquisition and rework store / onboarding disclosures before scaling spend.
Recognition quality	Clusters of reviews citing missed words, lag, or rushed speaking windows	Persistent issues across more than one major language or platform release	Hold language expansion until accuracy and pacing recover.
Voice-data / child-data compliance	Inability to furnish retention, deletion, or age-screening controls to diligence or regulators	Missing DPIA / retention package or a regulator / platform notice	Downgrade the underwriting case until compliance documentation is complete.
OpenAI / model dependence	Material API price increase, latency regression, or policy constraint	A sustained margin hit or degraded live-roleplay experience	Reset valuation assumptions and require fallback / routing proof.
Cloud / inference concentration	Repeated Sev-1 inference or speech-serving disruptions	More than one major outage without credible failover evidence	Require multi-region or multi-vendor resilience plan before underwriting premium growth.
Financing / valuation support	New round or secondary process implies flat / down pricing without metric proof	Unable to show durable retention, margins, and refund discipline	Treat the current valuation as fragile and avoid paying ahead of verified economics.

These triggers translate a broad diligence narrative into a concrete monitoring framework for investors.

[CR005, CR010, CR014, CR023, CR026, CR033]

FR001: Risk heatmap

Speak’s highest residual risks sit where AI/voice regulation, app-store billing, and model / cloud dependence overlap with a premium consumer experience.

[CR003, CR005, CR008, CR010, CR016, CR019]

FR002: Risk transmission map

Speak’s main risks transmit through a handful of channels: compliance, billing trust, model quality, margin, and valuation support.

[CR003, CR008, CR010, CR022, CR023, CR024]

FR003: Dependency map

Speak’s consumer AI product depends on a compact set of external rails for models, cloud inference, distribution, billing, and support resolution.

[CR001, CR002, CR009, CR022, CR026, CR033]

7.5 Exhibits

Chapter 08

08Valuation

8.1 Recommendation and price discipline

Speak has enough public operating proof to justify serious diligence, but not enough price support to justify a clean buy at the last public $1B mark. Official disclosures show valuation moved from $500M in June 2024 to $1B in December 2024, while later reporting points to more than $100M of annualized revenue, 15M downloads, 10M+ Google Play installs, and a real enterprise footprint. That is materially stronger evidence than most consumer-edtech startups can show. The problem is that the available price anchor still implies a high-single-digit to roughly 10x revenue multiple, which remains richer than current public education and subscription-learning comps. Duolingo is the only public peer with comparable growth and product quality, yet even it screens near 4x EV/revenue on current market data. Because the published valuation is older than the latest revenue milestone and the cap table, preference stack, and retention profile remain undisclosed, the prudent call is research-more with medium confidence, high risk, and a stretched valuation stance. Entry only becomes attractive if updated diligence shows stronger recurring economics than public evidence currently proves, or if entry price is materially below the last public mark.[CV001, CV006, CV012, CV015, CV017, CV021]

Recommendation summary table
Dimension	Assessment	Evidence-backed reason	Decision implication
Recommendation	research-more	Real product traction exists, but public evidence does not yet fully support price-sensitive underwriting at the last $1B mark.	Proceed only with full private diligence package.
Confidence	medium	Funding, revenue, pricing, app traction, and comp data are all visible, but retention and cap-table economics are not.	Treat public view as a screen, not an IC-ready close.
Risk rating	high	Outcome depends on continued premium growth plus undisclosed dilution / preference terms.	Underwrite downside explicitly before any term-sheet decision.
Valuation stance	stretched	Implied private multiple sits above public edtech comp ranges and requires a premium for better growth quality.	Seek price concession or unusually clean economics.
Target underwriting	>=3x gross MOIC over 4-5 years	At the last public mark, that target likely requires a faster scale-up than current public evidence alone proves.	Prefer entry below the last round or with strong downside protections.

Target return / hold framing is an underwriting discipline, not a sourced market quote; it is included because chapter 08 requires a decision implication.

[CV006, CV012, CV050, CV051, CV053, CV057]

FV001: Recommendation logic

Flow from observed traction and market tailwind to valuation caution and a research-more recommendation.

[CV012, CV017, CV021, CV041, CV042, CV051]

FV004: Investment KPIs

IC-style 0-10 scorecard translating the chapter evidence into decision factors.

Scores are ordinal analyst judgments (0-10) synthesized from the cited claims. They are not sourced company KPIs and are used only to summarize investment quality across chapter dimensions.

[CV012, CV017, CV021, CV041, CV042, CV046]

8.2 Scenario range and comparable frame

The public-market anchor says Speak deserves a premium to struggling edtech names, but the premium must be earned through unusually durable growth and monetization. Duolingo still posts healthy growth, positive margin, and a multi-billion market cap, yet its EV/revenue is roughly 4x; Coursera, Udemy, and Chegg sit far below 1x EV/revenue. That spread matters because Speak is still private and investors are being asked to fund into a company with less disclosure than any of those names. The upside case is still meaningful: AI tutors and online language learning both remain large and growing markets, and Speak’s app rankings, pricing, and enterprise traction suggest it is building a more serious learning product than a generic vocabulary game. Still, MMR’s market work flags freemium competition and CAC pressure, and Oliver Wyman shows how quickly AI-exposed software multiples can reprice when markets question seat growth, moat durability, or revenue quality. That is why the scenario frame uses conservative public-comp ranges rather than extrapolating an open-ended AI premium. A bear outcome compresses toward $400M-$650M, a base case holds around $800M-$1.0B, and a bull case requires revenue to move well beyond $150M with broader enterprise conversion and continued app-store strength.[CV025, CV029, CV033, CV037, CV041, CV042]

Thesis / anti-thesis table
Lens	Thesis	Anti-thesis	What would change the view
Demand	Large AI-tutor and language-learning markets support continued category growth.	Big markets do not prevent CAC inflation or competition from free / freemium tools.	Show cohort-level retention and efficient acquisition by geography.
Monetization	Storefront pricing and >$100M annualized revenue suggest consumers will pay for a premium speaking product.	Public evidence does not show conversion durability or whether revenue concentration is too consumer-heavy.	Provide conversion funnels, payback, and renewal metrics.
Enterprise	200+ customers in Dec 2024 and ~500 employers in late 2025 imply B2B expansion optionality.	Logo counts are not the same as durable ARR, expansion, or gross-margin proof.	Disclose enterprise ARR, renewal rates, and top-customer concentration.
Relative valuation	A premium to weaker public edtech peers can be justified if Speak behaves more like a category leader than a commodity app.	Current comp ranges still imply the $1B mark bakes in a lot of future execution.	An updated financing at similar price with better disclosure would improve the call.

[CV009, CV012, CV041, CV042, CV043, CV044]

Bull / base / bear scenario table
Scenario	Assumptions	Valuation / return logic	Key risks	Probability signal
Bear	Growth slows, app momentum softens, and investors value Speak closer to public edtech ranges.	$400M-$650M valuation; a $1B entry would likely lose capital before dilution and preferences.	Multiple compression, paywall friction, weak enterprise conversion.	More likely if app-store momentum fades or sector sentiment weakens again.
Base	Revenue scale holds around current run-rate trajectory, enterprise expands but remains unproven, and premium over public comps narrows but does not vanish.	$800M-$1.0B valuation; a last-round entry produces limited upside unless terms are unusually clean.	Mix / retention opacity, financing need, inability to defend premium.	Most consistent with current public evidence.
Bull	Revenue scales well beyond $150M, enterprise penetration broadens, and Speak keeps premium app momentum with stronger disclosed economics.	$1.1B-$1.4B valuation; upside exists, but still depends on private data confirming durable monetization.	Execution miss, competitive imitation, weaker-than-expected renewal quality.	Needs evidence beyond what public sources currently show.

Scenario ranges are analytical estimates derived from published revenue milestones, current public comps, and 2026 software multiple risk commentary; they exclude undisclosed preference effects.

[CV012, CV021, CV025, CV029, CV033, CV037]

Comparable valuation table
Comparable	Metric	Multiple / valuation / status	Relevance	Limitation
Duolingo	TTM revenue / EV-revenue / market cap	$1.04B revenue; 4.01x EV/revenue; $5.15B market cap	Best public quality benchmark for premium language-learning software.	Much more mature, profitable, and disclosed than Speak.
Coursera	TTM revenue / EV-revenue / market cap	$789.84M revenue; 0.42x EV/revenue; $0.98B market cap	Shows how general online-learning platforms price when growth and margins are weaker.	Business mix is broader than speaking-led consumer language learning.
Udemy	TTM revenue / EV-revenue / market cap	$773.9M revenue; 0.38x EV/revenue; $0.68B market cap	Useful consumer/creator education benchmark for slower-growth subscription learning.	Marketplace economics differ from Speak’s owned-curriculum model.
Chegg	TTM revenue / EV-revenue / market cap	$376.91M revenue; 0.32x EV/revenue; $0.12B market cap	Hard downside anchor for what education assets can trade at after product / moat deterioration.	Chegg is an adverse comp, not a target peer.

Sample of current public anchors used for price discipline; private AI-tutor comparables remain too opaque to enumerate exhaustively from open sources.

[CV025, CV027, CV028, CV029, CV031, CV032]

FV002: Valuation sensitivity

Illustrative enterprise-value outcomes from different revenue / multiple combinations anchored to public evidence.

Values are illustrative USD millions. They are derived from the publicly reported >$100M annualized revenue milestone plus observed public comp multiples, then rounded to decision-useful anchors. They are not company guidance or transaction quotes.

[CV012, CV025, CV029, CV033, CV037, CV050]

FV003: Valuation / return range

Bear/base/bull valuation ranges for Speak using public evidence only.

Values are illustrative USD millions based on public revenue milestones, current listed-peer multiples, and 2026 software multiple-risk commentary. They exclude undisclosed dilution, debt, or liquidation preference effects, so investor return ranges remain partial.

[CV050, CV051, CV054, CV055, CV056]

8.3 Upgrade triggers, thesis breaks, and final asks

The path from research-more to buy is straightforward but evidence-heavy. Speak needs to show that enterprise traction is not just logo count but recurring, expanding revenue with solid renewal quality. It also needs to prove that consumer scale is translating into retention and cohort economics strong enough to defend a premium multiple above public edtech. The main thesis-breaks are observable: app-store momentum rolling over, consumer paywall friction worsening, enterprise adoption failing to translate into disclosed ARR support, or the broader AI-software multiple resetting lower again. Regulatory and market-structure risk also matter because generative-AI platforms can become concentrated around key model, data, or compute owners, limiting downstream moats for application-layer companies. A financing window can also close quickly if markets stop rewarding AI narratives before Speak has disclosed enough economics to separate itself from weaker subscription-learning assets. Put differently, Speak looks like a credible business, but the underwriting gap is around return architecture rather than product existence. Without the cap table, unit economics, and segment mix, the investor is still pricing a narrative premium. With them, the company could merit track or better; without them, the correct posture is disciplined diligence rather than conviction pricing. The upgrade path is therefore concrete today: prove segment-level retention, show enterprise renewal quality, and demonstrate that premium pricing survives competitive pressure.[CV009, CV013, CV022, CV046, CV047, CV053]

Thesis-break and kill triggers table
Trigger	Threshold	Transmission to thesis	Action implication
App momentum fades	Download / ranking data lose premium positioning across core markets for multiple months	Weakens evidence that Speak is still winning consumer attention at premium price points.	Re-cut base case toward public-comp ranges.
Enterprise proof stalls	No disclosed ARR, renewals, or expansion despite growing employer footprint	Turns enterprise story into a narrative add-on rather than a value-supporting second engine.	Do not pay a premium for B2B optionality.
Sector multiple compression deepens	AI/software risk reprices again and premium subscription names de-rate further	Shrinks valuation ceiling even if operations remain solid.	Demand wider discount to last round or stand aside.
Terms are investor-unfriendly	Heavy preferences, ratchets, or hidden dilution appear in the 2024-2026 financing stack	Can eliminate return even if enterprise value holds near base case.	Pass or insist on protective entry terms.

[CV020, CV022, CV048, CV049, CV053, CV058]

Final diligence asks table
Topic	Missing evidence	Why it matters	Owner or diligence path
Cap table / preferences	No public waterfall, option pool, or liquidation preference detail	Return math can diverge sharply from enterprise value growth.	Company counsel + lead investor data room.
Segment mix	No public split of consumer versus enterprise revenue or geography	Premium valuation depends on quality and diversification of revenue.	Finance team revenue bridge and board materials.
Cohort economics	No public conversion, retention, or CAC payback disclosure	Needed to judge whether revenue can compound efficiently.	Growth / finance analytics export by cohort.
Enterprise quality	No public contract duration, renewal, or expansion data for 200-500 employer footprint	Needed to validate bull-case durability and downside protection.	Sales ops pipeline review plus top-customer diligence calls.

[CV009, CV013, CV022, CV046, CV047, CV053]

8.4 Exhibits

Disclaimer

This report is a public-evidence diligence snapshot, not investment advice. Important financial, legal, technical, and contractual facts remain non-public and should be verified directly with management and primary documents before any investment decision.

Evidence index

Claims
ID	Statement	Confidence	Sources
CO001	Official and third-party company profiles place Speak (Speakeasy Labs) founding in 2016.	Medium	SO007, SO012, SO016
CO002	Speak is headquartered in San Francisco, California.	Medium	SO009, SO011, SO016, SO013
CO003	Speak markets itself as an AI language tutor centered on getting learners speaking out loud with instant feedback.	Medium	SO001, SO004
CO004	By May 2026, Speak publicly promoted courses for French, Spanish, English, Korean, Italian, and Japanese.	Medium	SO001, SO004, SO008
CO005	Speak homepage marketing in May 2026 claimed 15M+ downloads and a 4.8 rating.	Medium	SO001
CO006	The iOS App Store listing showed 44K ratings and a 4.8 score on 2026-05-05.	Medium	SO004
CO007	The Google Play listing showed 112K reviews and a 4.7 score on 2026-05-05.	Medium	SO005
CO008	Speak announced a $20M Series B-3 financing at a $500M valuation on 2024-06-18.	High	SO006, SO009
CO009	Speak announced a $78M Series C financing at a $1B valuation on 2024-12-10.	High	SO007, SO010, SO011
CO010	Speak said total funding reached $162M after the Series C round.	Medium	SO007, SO010
CO011	Accel led Speak’s Series C, with OpenAI Startup Fund, Khosla Ventures, and Y Combinator among participating investors.	High	SO007, SO010, SO011
CO012	HolonIQ listed Speak as joining the global EdTech unicorn list in December 2024 at a $1B valuation.	High	SO015, SO007
CO013	By June 2024, Speak said it had more than 10 million learners in over 40 countries.	Medium	SO006, SO009
CO014	TechCrunch reported Speak had a 75-person workforce across San Francisco, Seoul, Tokyo, and Ljubljana in June 2024.	Medium	SO009
CO015	Speak’s careers page names Seoul, Ljubljana, and San Francisco offices and presents the company as globally distributed.	Medium	SO003
CO016	The careers page also contains a historical snapshot claiming a 60-person team and more than $60M raised.	Low	SO003
CO017	Speak said it launched Speak for Business between the June and December 2024 funding rounds.	Medium	SO007, SO002
CO018	The current B2B page says 200+ brands rely on Speak for Business.	Medium	SO002
CO019	Speak’s Series C post said its enterprise offering had more than 200 customers and an 85% employee adoption rate.	Medium	SO007
CO020	Speak reported that users had already spoken more than one billion sentences in 2024.	Medium	SO007
CO021	Speak’s June 2025 product update announced four new languages for English speakers—French, Japanese, Korean, and Italian—after the earlier Spanish release.	Medium	SO008
CO022	The June 2025 post says Speak started by teaching English in Korea, Japan, and Taiwan and had more than 15 million learners globally.	Medium	SO008
CO023	Speakeasy Labs filed a Form D on 2024-12-11 for a $77,699,277 equity offering under Rule 506(b), with first sale on 2024-11-13.	High	SO013, SO014
CO024	The filing page lists Connor Zwick, Andrew Hsu, Colton Gyulay, Alex Berkenkamp, and Ben Quazzo among related persons and shows a 100 Pine Street San Francisco address.	Medium	SO013
CO025	The public filing list shows additional Speakeasy Labs Form D filings dated 2024-08-12 and 2023-10-17.	Medium	SO014
CO026	Speak’s June 2024 announcement said learners speak about 1,000 times on average in their first week.	Medium	SO006
CO027	Speak’s June 2024 announcement said an updated speech recognition model reduced word error rate by over 60% and improved speed by 20% versus existing commercial systems.	Medium	SO006
CO028	Speak’s December 2024 announcement said it created more than 25 million personalized lessons during 2024.	Medium	SO007
CO029	The founders publicly identified in late-2024 coverage are Connor Zwick (CEO/co-founder) and Andrew Hsu (CTO/co-founder).	High	SO010, SO012
CO030	Accel partner Ben Quazzo joined Speak’s board as part of the Series C round.	High	SO007, SO010
CO031	TechCrunch’s June 2024 article described Speak as launched in 2014, conflicting with the company’s 2016 founding references elsewhere.	Low	SO009
CO032	Android Police concluded in February 2026 that Speak’s voice-driven learning can give users a false sense of mastery because it misses basic pronunciation errors.	Medium	SO017, SO005
CO033	JustUseApp’s review aggregation labeled overall customer experience 67.1% negative and highlighted recurring complaints about billing, lag, and recognition.	Low	SO018
CO034	Languatalk’s 2026 review said Speak is polished for beginners but feedback depth, lesson variety, and premium tier clarity remain weak points.	Medium	SO019
CO035	GetLatka reports Speak reached $100M revenue and 253 employees in 2025, but those figures are not corroborated by official Speak disclosures.	Low	SO016
CO036	The current official homepage and app listings show Speak has expanded from an English-only product into a broader multi-language learning app.	High	SO001, SO004, SO008
CO037	Speak positions English learning as the initial wedge but now frames the product as a broader language tutoring platform for consumers and employers.	Medium	SO002, SO007, SO008
CM001	Speak’s June 2024 financing announcement described the addressable market as a $100B+ online and in-person language learning market.	Medium	SM002
CM002	TechCrunch reported Speak initially focused on English because it is the world’s most popular language for learning.	Medium	SM013
CM003	TechCrunch quoted Speak’s CEO saying roughly 1.5 billion people are trying to learn English.	Medium	SM013
CM004	Preply’s 2026 report says English is the most learned language because of its global role in business and education.	Medium	SM011
CM005	Preply states English has about 1.5 billion total speakers worldwide.	Medium	SM011
CM006	Technavio says the digital English language learning market will grow by USD 39.46B from 2024 to 2029 at a 24.5% CAGR.	Medium	SM005
CM007	Technavio identifies APAC as the largest regional market and says it will contribute 39% of forecast growth.	Medium	SM005
CM008	Technavio says increased flexibility from digital language courses is a primary growth driver for the market.	Medium	SM005
CM009	Technavio says corporate non-academic learners are a major digital English customer segment, with roughly 30% opting for digital courses.	Medium	SM005
CM010	MarketsandMarkets projects the broader AI-in-education market to grow from USD 2.21B in 2024 to USD 5.82B by 2030 at a 17.5% CAGR.	Medium	SM006
CM011	MarketsandMarkets says North America held a 43% share of the AI-in-education market in 2024.	Medium	SM006
CM012	MarketsandMarkets says personalized learning and content management accounted for 34.5% of the AI-in-education market in 2024.	Medium	SM006
CM013	The same report names Duolingo and ELSA Speak among notable AI-in-education players.	Medium	SM006
CM014	Stanford HAI reports four out of five U.S. high school and college students now use AI for schoolwork.	Medium	SM007
CM015	WEF says AI can automate or augment up to 20% of educator clerical tasks.	Medium	SM008
CM016	WEF also says equitable access, data privacy, bias, and teacher displacement are major constraints on AI-in-education adoption.	Medium	SM008
CM017	World Bank says low- and middle-income countries face steep challenges adapting or deploying AI at scale.	Medium	SM009
CM018	World Bank frames the foundations of scalable AI adoption as connectivity, compute, context, and competency.	Medium	SM009
CM019	The 2021 systematic review says most AI language tools used machine learning and natural language processing for error identification, feedback, and assessment.	Medium	SM010
CM020	The same review concludes AI language tools improved learner abilities but raised privacy and teacher-preparation concerns.	Medium	SM010
CM021	Preply estimates the global online language learning market reaches about $115B by the end of 2025.	Low	SM011
CM022	Preply estimates the English learning segment is worth about $43.51B in 2025 and growing around 22% annually.	Low	SM011
CM023	Speak’s current product spans both consumer self-serve subscriptions and employer-sponsored learning through Speak for Business.	Medium	SM001, SM003
CM024	Speak’s Series C post says English learning is industry agnostic and its business product already had 200+ customers.	Medium	SM003
CM025	TechCrunch reported Speak’s consumer list price as $20/month or $99/year in 2024.	High	SM014, SM013
CM026	Duolingo’s 2024 annual report says it serves more than 100M monthly active users across 40+ languages and only about 9% of MAUs are paid subscribers.	Medium	SM017
CM027	Duolingo’s 2024 annual report says the app is the top-grossing education app globally on both Apple and Google app stores.	Medium	SM017
CM028	ELSA positions itself as a specialized AI English speaking coach with 18M+ downloads and 460K+ ratings on its website.	Medium	SM018
CM029	ELSA’s App Store listing shows 109K ratings and monthly and yearly memberships, reinforcing a premium subscription model for speaking practice.	Medium	SM019
CM030	Cambly competes with Speak from a different labor model: real conversations with native speakers available 24/7 rather than AI-only tutoring.	Medium	SM020
CM031	Busuu emphasizes community feedback from native speakers and says it has 120M+ registered users.	Medium	SM022
CM032	Busuu’s App Store listing positions the product as community-driven learning rather than AI-first conversation tutoring.	Medium	SM021
CM033	Babbel’s app-store surfaces emphasize expert-built structured lessons, with 25M subscriptions sold and 50M+ Google Play downloads.	Medium	SM023, SM024
CM034	Praktika markets an AI-tutor model with 20M+ learners and a much lower-cost alternative to private human tutors.	Medium	SM025
CM035	Android Police argued Speak can give learners a false sense of mastery because its pronunciation scoring is overly lenient.	Medium	SM015
CM036	Languatalk found Speak strongest for early learners but weaker for serious learners who need deeper feedback and broader lesson variety.	Medium	SM016
CM037	Speak’s June 2025 post says summer travel increases language-learning interest and ties learning motivation to travel identity.	Medium	SM004
CM038	Preply says progress still depends on access, indicating affordability and digital infrastructure remain barriers even as demand rises.	Medium	SM011
CP001	Speak’s core competitive claim is speaking-first AI tutoring rather than broad textbook-style language learning.	Medium	SP001, SP002
CP002	By 2026 Speak was no longer English-only: official surfaces advertise six live learning tracks and a broader multi-language ambition.	Medium	SP001, SP033
CP003	Speak’s current consumer price anchor remains about $20/month or $99/year in public reporting.	High	SP005, SP006
CP004	Speak for Business claims 200+ customers or brands and gives Speak a second distribution path beyond direct-to-consumer subscriptions.	Medium	SP004, SP023
CP005	Speak has materially smaller public scale than Duolingo, which reported 100M+ MAUs and 40+ languages in its 2024 annual report.	Medium	SP009, SP001
CP006	Duolingo’s freemium model relies on huge free-user scale with only about 9% of MAUs paying, making it a powerful low-cost substitute for Speak.	Medium	SP009
CP007	Duolingo is the top-grossing education app globally on Apple and Google app stores, reinforcing its distribution advantage over smaller challengers.	Medium	SP009
CP008	ELSA is a direct specialized speaking competitor focused specifically on English pronunciation and conversation rather than broad multilingual learning.	Medium	SP010, SP011
CP009	ELSA publicly claims 18M+ downloads and 460K+ ratings on its website.	Medium	SP010
CP010	ELSA’s App Store listing shows 109K ratings and monthly/yearly memberships, signaling a scaled subscription business in the same English-speaking niche.	Medium	SP011
CP011	Cambly competes using human native-speaker conversations available 24/7, making it a live-tutor substitute rather than an AI-only product.	Medium	SP012
CP012	Busuu competes with a community-feedback model and says it has more than 120M registered users.	Medium	SP013, SP014
CP013	Busuu’s Play listing shows 50M+ downloads and 1.13M reviews, reflecting broad consumer reach even without an AI-first pitch.	Medium	SP015
CP014	Babbel competes from the structured-course end of the market and says it has sold 25M subscriptions.	Medium	SP016
CP015	Babbel’s Play listing shows 50M+ downloads and 1.12M reviews, giving it much larger installed-base distribution than Speak.	Medium	SP017
CP016	Praktika is an AI-tutor competitor claiming 20M+ learners and a roughly $8/month price point positioned against human tutors.	Medium	SP018
CP017	Loora competes as an always-available AI English tutor with business-English and real-time feedback positioning.	Medium	SP019
CP018	MarketsandMarkets names both Duolingo and ELSA as AI-in-education players, suggesting the competitive set spans broad platforms and specialized speaking tools.	Medium	SP020
CP019	HolonIQ’s January 2026 addition of Preply to the EdTech unicorn list signals that language learning remains a venture-funded, still-fragmenting category.	Medium	SP021
CP020	Speak’s strongest relative differentiation is still its speaking-first UX and AI tutoring focus, not raw scale.	Medium	SP001, SP005, SP007
CP021	Speak’s multi-language expansion narrows one historical weakness against incumbents, but its public breadth still trails broader platforms like Duolingo, Busuu, and Babbel.	Medium	SP001, SP009, SP013, SP016, SP033
CP022	Cambly’s human-tutor model means Speak is not only fighting apps; it is also fighting live conversation as the trusted premium substitute.	Medium	SP012, SP005
CP023	Busuu and Babbel compete on structure, community, and breadth rather than pure AI conversation, giving budget-conscious users alternatives to Speak.	Medium	SP013, SP016, SP017
CP024	Praktika and Loora show that AI-speaking competition is no longer niche; multiple challengers now market conversation practice as a human-tutor replacement.	Medium	SP018, SP019
CP025	Android Police described Speak as heavily inspired by Duolingo, reducing the novelty moat around its course structure and gamification.	Medium	SP007
CP026	Android Police also found Speak’s pronunciation scoring too forgiving, which weakens one of the product claims that should be most defensible versus broad language apps.	Medium	SP007
CP027	Languatalk judged Speak polished for beginners but weaker for serious learners who want deeper feedback, better review loops, and richer advanced practice.	Medium	SP008
CP028	JustUseApp’s complaint aggregation indicates billing, refund, and support issues can damage trust even when the top-line app rating remains strong.	Medium	SP024, SP002, SP003
CP029	ELSA’s and Speak’s app-store footprints are much closer to each other than either is to Duolingo’s scale, making ELSA a more relevant specialized benchmark than Duolingo alone.	Medium	SP002, SP003, SP010, SP011, SP009
CP030	Speak’s B2B motion is a real differentiator versus most consumer-only rivals, but the public record still lacks seat counts, customer names, and renewal rates.	Medium	SP004, SP023
CP031	Preply’s 2026 report reinforces that English remains the most learned language because of business and education demand, which favors all major competitors rather than Speak alone.	Medium	SP022
CP032	Babbel and Busuu both emphasize more structured pedagogy and community reinforcement than Speak’s free-conversation-centered positioning.	Medium	SP013, SP014, SP016
CP033	ELSA emphasizes pronunciation, role-plays, and bilingual support, making it especially strong where buyers want more explicit correction than Speak’s reviews suggest it delivers.	Medium	SP010, SP011, SP007
CP034	Praktika uses a lower-price, AI-tutor framing that could pressure Speak if speaking practice becomes commoditized.	Medium	SP018, SP005
CP035	Duolingo’s enormous free-user funnel and top-grossing status mean it can pressure smaller players on both acquisition cost and user expectations.	Medium	SP009
CP036	Cambly, Busuu, Babbel, Duolingo, ELSA, Praktika, and Loora together cover live tutoring, community correction, structured coursework, and AI conversation—meaning buyers have multiple non-Speak ways to solve the same job.	Medium	SP009, SP010, SP012, SP013, SP016, SP018, SP019
CP037	There is no reviewed public source giving a clean market-share ranking for Speak versus these rivals, which is itself a diligence gap.	Low
CP038	Duolingo’s official homepage still leads with a free value proposition and very broad course catalog, reinforcing its role as the default low-cost substitute.	Medium	SP025, SP032
CP039	Cambly’s own pricing page highlights one-on-one and Pro tutoring plans with native speakers, showing that the human-tutor substitute is productized rather than bespoke.	Medium	SP027, SP028
CP040	Preply and italki extend the substitute set beyond apps into tutor marketplaces, while HelloTalk extends it into peer-to-peer exchange, increasing buyer choice around the same fluency job.	Medium	SP029, SP030, SP031
CP041	The most durable moat visible publicly is distribution into business accounts plus speaking-first UX, but that moat looks moderate rather than dominant because rivals match on either scale or tutor quality.	Medium	SP004, SP009, SP012, SP018
CI001	Speak’s public monetization architecture is hybrid: consumer subscriptions in the app stores plus employer-facing contracts through Speak for Business.	Medium	SI002, SI004
CI002	Speak’s public consumer price anchor is consistently described as about $20 per month or $99 per year.	High	SI007, SI008
CI003	The App Store listing corroborates that Speak sells monthly and annual auto-renewing subscriptions rather than a pure one-time purchase.	Medium	SI002
CI004	Speak for Business publicly claims 200+ brands rely on the product, indicating a real but still sparsely disclosed B2B revenue line.	Medium	SI004, SI006
CI005	Speak has strong public engagement proxies—15M+ downloads, 44K App Store ratings, and 112K Google Play reviews—but none of these disclose paying subscribers or ARR.	Medium	SI001, SI002, SI003
CI006	TechCrunch reported users spend roughly 10-20 minutes per day in Speak, which supports engagement but not monetized retention.	Medium	SI008
CI007	There is no reviewed public source disclosing Speak revenue, ARR, gross margin, net retention, or cash balance.	Low
CI008	Recent officially announced fundraising totals roughly $98M gross across the June 2024 Series B-3 and December 2024 Series C rounds.	High	SI005, SI006
CI009	The Series B-3 announcement attached a $500M valuation to Speak, and the Series C announcement attached a $1B valuation six months later.	High	SI005, SI006
CI010	The Speakeasy Labs Form D reviewed via FilingFlow lists a total offering amount of $77.7M with first sale on November 13, 2024, broadly matching the announced Series C size.	Medium	SI009
CI011	The SEC-filing list implies recurring private financing activity, but it does not disclose cash still on hand after those raises.	Medium	SI010
CI012	Because Speak does not publish balance-sheet data, any runway assessment remains scenario-based rather than underwritten from public statements.	Medium	SI005, SI006, SI009
CI013	Speak’s revenue mix between consumer subscriptions and enterprise contracts is not publicly quantified.	Low
CI014	Enterprise pricing, seat counts, contract length, and renewal data for Speak for Business are absent from reviewed public sources.	Low
CI015	Revenue quality cannot be judged confidently from list price alone because public sources do not reveal free-trial conversion, refund rate, churn, or realized discounts.	Medium	SI002, SI007, SI008
CI016	Public complaint aggregation points to refund and support friction, which could weigh on net consumer revenue quality even if headline app ratings stay high.	Medium	SI024, SI002, SI003
CI017	Duolingo’s 2024 annual report is the clearest public-company proxy that language-learning at scale monetizes through a freemium funnel with only a minority of MAUs paying.	Medium	SI011
CI018	Duolingo’s official homepage still leads with a free value proposition, reinforcing how strong the low-price anchor is for the category.	Medium	SI023
CI019	Compared with Speak’s monthly subscription, Cambly monetizes live tutoring in higher-value lesson units rather than flat app access.	Medium	SI012, SI002
CI020	italki monetizes as a tutor marketplace: its teachers page exposes thousands of English tutors and visible trial prices as low as USD 5.	Medium	SI017
CI021	HelloTalk’s messaging emphasizes free language exchange at 70M+ registered users and 260+ languages, creating a zero-price substitute for casual practice.	Medium	SI018
CI022	Preply’s discovered pricing page still points to subscriptions and corporate language training, showing that tutor-led models also package recurring revenue and B2B offers.	Medium	SI016
CI023	ELSA’s discovered pricing page points to Pro Memberships, Business, and Schools, highlighting that specialized speaking apps can monetize across consumer and institutional channels.	Medium	SI015
CI024	Busuu’s premium page title and Babbel’s pricing/business links confirm that broad-course competitors use explicit paid upsells rather than purely free distribution.	Medium	SI013, SI014
CI025	Praktika markets roughly $8/month AI tutoring, well below Speak’s public price anchor, which pressures category willingness to pay if AI practice commoditizes.	Medium	SI025, SI002
CI026	Loora’s business-facing and app-download surface suggests another AI tutor pursuing both consumer and business monetization, further compressing differentiation.	Medium	SI020, SI026
CI027	Speak’s unit economics remain largely opaque because public sources do not reveal CAC, payback, gross margin, or retention by cohort.	Low
CI028	A reasonable public reading is that Speak has moderate capital intensity: software, AI inference, content/localization, and app-store distribution costs, but no visible inventory or hardware capex burden.	Medium	SI001, SI002, SI003
CI029	AI inference and feedback quality likely make Speak’s delivery costs higher than static course apps, even though public sources do not quantify the margin impact.	Medium	SI001, SI008
CI030	App-store dependence implies revenue collection and discovery are mediated by platform policies and fees rather than wholly controlled by Speak.	Medium	SI002, SI003
CI031	Speak’s sales-efficiency profile on the enterprise side cannot be publically modeled because buyer count is disclosed but no seat counts, ACVs, or sales-cycle data are provided.	Medium	SI004, SI006
CI032	The company’s valuation has risen faster than its public operating disclosure, creating a gap between financing confidence and financial transparency.	Medium	SI005, SI006, SI007, SI008
CI033	There is no reviewed public sign of debt, project-finance obligations, or acute capital distress, but there is also no evidence to prove strong runway.	Medium	SI005, SI006, SI009, SI010
CI034	New-language expansion broadens what Speak can sell, but no public source quantifies attach rate, ARPU by language, or whether new courses monetize at the same price.	Medium	SI001, SI005, SI006
CI035	The strongest public case for Speak’s near-term financing adequacy is simply that it raised significant capital recently and has not publicly disclosed distress signals.	Medium	SI005, SI006, SI010
CI036	The weakest part of the financial story is revenue quality: users, downloads, and valuations are public, but realized revenue, margins, and retention are not.	Medium	SI001, SI005, SI006
CI037	If Speak’s enterprise motion is meaningful, working-capital needs likely come from sales, onboarding, and localization rather than physical fulfillment.	Medium	SI004, SI006
CI038	Category benchmarks show consumers can choose among subscription apps, freemium bundles, live tutoring, tutor marketplaces, and free exchange communities, which caps pricing power for a mid-priced app like Speak.	Medium	SI012, SI017, SI018, SI023, SI025
CE001	Speak publicly positions itself as an AI language tutor centered on speaking out loud and instant feedback.	High	SE001, SE017
CE002	Public product surfaces show Speak teaching six target languages: Spanish, English, French, Italian, Japanese, and Korean.	High	SE001, SE017, SE019
CE003	Speak’s homepage says the app has 15M+ downloads.	Medium	SE001
CE004	Speak’s proprietary learning method is organized around Learn, Practice, and Apply phases.	High	SE005, SE006, SE007
CE005	Tutor Lessons use an AI voice agent that can advance, correct mistakes, answer clarifying questions, and redirect off-topic responses.	Medium	SE005
CE006	Apply-phase roleplays are open-ended conversations where learners complete objectives without one fixed correct answer.	Medium	SE005, SE003
CE007	Speak said in March 2023 that GPT-4 had already been in production for two months powering parts of AI Tutor.	Medium	SE029
CE008	Speak’s winter 2025 release added adaptive lessons, vocab side quests, unit refreshers, audio-first roleplays, and Speak Level.	Medium	SE007
CE009	Speak launched French, Japanese, Korean, and Italian for English speakers in June 2025 after an earlier Spanish release.	High	SE008, SE010
CE010	Before its 2024 ASR overhaul, Speak operated fragmented speech systems across iOS, Android, on-device models, and third-party recognition services.	Medium	SE002
CE011	Speak’s 2024 ASR upgrade fine-tuned Conformer-CTC on many thousands of hours of heavily accented learner speech.	Medium	SE002
CE012	Speak’s 2024 speech stack used Nvidia Riva and Triton on Kubernetes, Google Cloud, gRPC between services, and websockets for client-server streaming.	Medium	SE002
CE013	Speak reported a greater than 60% word-error-rate reduction versus its pre-trained Conformer baseline on internal learner data.	Medium	SE002
CE014	Speak reported a 45% WER improvement versus its earlier fine-tuned on-device Android model.	Medium	SE002
CE015	Speak said first-word feedback latency averaged about 1.6 seconds after the 2024 ASR upgrade, about 20% faster than its prior third-party service.	Medium	SE002
CE016	Matching v2 replaces bag-of-words matching with a combined ASR-plus-phonetic pipeline and forced alignment.	Medium	SE004
CE017	Speak’s matching pipeline uses a customized wav2vec2 family model for phonetic streaming inference and updates transcripts every 200–300 milliseconds while a user speaks.	Medium	SE004
CE018	Speak reported that Matching v2 reduced false negatives by about 40% without increasing false positives on internal labeled data.	Medium	SE004
CE019	Speak’s 2026 voice-agent platform uses WebRTC via LiveKit Cloud between mobile clients and its backend.	Medium	SE005
CE020	Speak’s voice-agent servers are built on LiveKit Agents and call external ASR, LLM, TTS, and speech-to-speech providers alongside Speak backend services.	Medium	SE005
CE021	Speak says its Kubernetes clusters run across multiple regions and route learners to geographically close LiveKit edges and Speak clusters.	Medium	SE005
CE022	Speak uses both cascade (ASR→LLM→TTS) and speech-to-speech pipelines rather than standardizing on one architecture.	Medium	SE005
CE023	Speak says cascade fits roleplays and free-form conversations, while speech-to-speech fits pronunciation feedback and tutor lessons where tone or accent matters.	Medium	SE005
CE024	Speak evaluates and mixes multiple TTS providers by language pair, code-switching quality, latency, and custom-voice fit because no single provider works best everywhere.	Medium	SE005
CE025	Speak tracks end-to-end latency, ASR time-to-final-transcript, TTS time-to-first-byte, and provider performance by region, language, and tail latency percentiles.	Medium	SE005
CE026	Speak says it automatically shifts traffic to backup providers when latency or error thresholds are exceeded in a region-language pair.	Medium	SE005
CE027	Live Roleplays combine OpenAI Realtime API with Speak’s proprietary learning engine, proficiency graph, objectives, and hints.	Medium	SE003
CE028	Speak acknowledged in October 2024 that new speech-to-speech models still lag text models on instruction following and nuanced pronunciation coaching.	Medium	SE003
CE029	Speak says all lessons are written by learning designers and then human-reviewed even when AI is used to speed up workflow.	Medium	SE011
CE030	Speak says a single four-unit course can include 50+ lessons and takes weeks of writing, revising, filming, testing, and polishing.	Medium	SE006
CE031	Speak publicly supports iOS and Android mobile devices but not desktop PCs.	Medium	SE012
CE032	As of March 2026, Speak lists minimum support at app version 4.35.0, iOS 16, and Android 8.	Medium	SE012
CE033	Speak exposes both an in-lesson issue-report flag and email-based support escalation workflows.	High	SE014, SE015
CE034	Google Play says Speak may share app activity and device IDs with third parties, may collect personal and financial information, encrypts data in transit, and supports data deletion requests.	Medium	SE019
CE035	Apple’s listing rates Speak 13+ and links users to a privacy policy and terms page hosted on usespeak.com/speak.com domains.	Medium	SE017
CE036	Speak’s public privacy and terms URLs returned only a JavaScript shell in text-only review, limiting direct inspection of policy details.	Medium	SE030, SE031
CE037	OpenAI’s April 2025 interview says Speak uses OpenAI models across audio and text and that Connor Zwick viewed the Realtime API plus multimodal audio as a key breakthrough for the product.	Medium	SE023
CE038	TechCrunch reported in December 2024 that Speak had more than 10 million downloads and over 200 Speak for Business customers.	Medium	SE024
CE039	SiliconANGLE reported that Live Roleplays rolled out in late 2024 and that Speak also supports business-oriented conversations with suppliers and customers.	Medium	SE025
CE040	Apple’s App Store page shows Speak at 4.8 stars from 44K ratings.	Medium	SE017
CE041	Google Play shows Speak at 4.7 stars from roughly 112K reviews and 10M+ downloads as of the May 2026 listing.	Medium	SE019
CE042	Recent Apple reviews consistently praise Speak’s immediate speaking practice, replay/pronunciation comparison, and AI conversations, but also ask for more languages and richer rewards.	Medium	SE018
CE043	JustUseApp surfaces public complaints about laggy voice recognition, refund friction, support delays, and unfinished lesson availability despite strong headline ratings.	Medium	SE020
CE044	LanguaTalk’s 2026 review says Speak’s voices and speech recognition are strong, but feedback depth, lesson variety at higher levels, and pricing clarity lag serious-learner expectations.	Medium	SE021
CE045	AppsHunter shows the iOS app updated April 30, 2026 at version 4.46.0 with iOS 16+ compatibility and 16 interface languages.	Medium	SE026
CE046	OpenAI’s developer community contains active 2025 implementation guidance for realtime voice agents using Twilio, FastAPI, and OpenAI’s Realtime API.	Medium	SE027
CE047	The public GitHub repository openai/openai-realtime-agents showed about 6.8k stars and 1.1k forks at review time, indicating an active ecosystem around a key Speak dependency.	Medium	SE028
CE048	Speak says its English-learning product now supports 15 different native-language entry points.	Medium	SE010
CE049	Speak said in November 2024 that Google Play named it Best App of 2024 in Hong Kong, Korea, and Taiwan.	Medium	SE009
CE050	Speak’s help center says more intermediate courses and broader non-English access are still in development.	Medium	SE010, SE011
CU001	Speak's homepage positions the product as an AI language tutor focused on speaking out loud with instant feedback.	Medium	SU001
CU002	Speak's homepage lists French, Spanish, English, Korean, Italian, and Japanese as supported learning languages.	Medium	SU001
CU003	Speak's homepage advertises a 4.8 rating.	Medium	SU001
CU004	Speak's homepage advertises 15M+ downloads.	Medium	SU001
CU005	Speak for Business says 200+ brands rely on the product.	Medium	SU002, SU005
CU006	Speak for Business frames employers as the buyer/payer and employees as the primary users of English-learning content.	Medium	SU002
CU007	Speak for Business says learners speak hundreds of sentences per week.	Medium	SU002
CU008	Speak's official review page is explicitly limited to real 5-star App Store reviews from U.S. learners.	Medium	SU003
CU009	Speak said in June 2024 that it had more than 10 million learners in 40+ countries.	Medium	SU004, SU007
CU010	Speak said in June 2024 that learners had more than doubled year over year for the previous five years.	Medium	SU004, SU007
CU011	Speak said learners speak 1,000 times on average in their first week.	Medium	SU004
CU012	Speak said nearly 6% of Korea's population was learning English with the app in 2024.	Medium	SU004, SU008
CU013	Speak said users had already spoken more than one billion sentences in 2024.	Medium	SU005
CU014	Speak said it created 25 million personalized lessons in 2024.	Medium	SU005
CU015	Speak said Speak for Business had more than 200 customers across industries in December 2024.	Medium	SU005, SU012
CU016	Speak said enterprise deployments were seeing an 85% employee adoption rate in December 2024.	Medium	SU005
CU017	Speak Wrapped 2025 says learners spoke 3.74 billion lines in 2025, up 111% year over year.	Medium	SU006
CU018	Speak Wrapped 2025 says learners spent 19.6 million hours practicing in app in 2025, up 85% year over year.	Medium	SU006
CU019	Speak Wrapped 2025 says learners started 231 million lessons in 2025, up 108% year over year.	Medium	SU006
CU020	Speak Wrapped 2025 says learners created 80.3 million personalized lessons in 2025, up 154% year over year.	Medium	SU006
CU021	TechCrunch reported in August 2023 that Speak was live in around 20 countries.	Medium	SU008
CU022	TechCrunch reported in August 2023 that Speak had well over 100,000 subscribers in South Korea.	Medium	SU008
CU023	Apple's App Store listing shows Speak at 4.8 out of 5 from 44K ratings.	High	SU009, SU010
CU024	Google Play shows Speak at 4.7 stars from 112K reviews and 10M+ downloads.	Medium	SU011
CU025	The Google Play listing shows the Android app was updated on 2026-05-01.	Medium	SU011
CU026	The iOS App Store listing shows monthly Premium pricing at $17.99, annual Premium at $83.99, and annual Premium Plus at $164.99.	Medium	SU018
CU027	Forbes reported in November 2025 that about 15 million people had downloaded Speak and the company had surpassed $100 million in annualized revenue, largely from consumers.	Low	SU013
CU028	Forbes reported that Speak began pushing into enterprise in 2024 after some consumers asked employers to cover subscriptions.	Low	SU013
CU029	Forbes reported that roughly 500 companies, including KPMG and HD Hyundai, offered Speak subscriptions to employees primarily in South Korea by late 2025.	Low	SU013
CU030	KPMG describes itself as a global organization of independent professional services firms providing audit, tax, and advisory services.	Medium	SU017
CU031	The App Store and Speak's official review page both preserve j herronov's review saying months of French use improved comprehension of French media and made free-talk and bookmark features valuable.	Medium	SU019, SU026
CU032	Google Play and Speak's official review page both preserve Dan S's review saying more than six months of Spanish use beat prior tools because feedback was detailed and instantaneous.	Medium	SU021, SU024
CU033	Google Play and Speak's official review page both preserve Rosalyn Mulder's review saying tutor-like feedback and streaks made Speak preferable to Duolingo and Mango.	Medium	SU022, SU025
CU034	The App Store reviews page shows Chuck_Ellis saying eight days of use had him forming Spanish sentences with confidence.	Medium	SU010
CU035	The App Store reviews page shows dollargills saying Speak helps with speaking and listening rather than only vocabulary memorization.	Medium	SU010
CU036	The App Store listing includes brombres' review praising focused speaking practice while calling the UI and HFM auto-advance behavior unintuitive.	Medium	SU020
CU037	The Google Play listing includes Danielle Chavez's April 2026 review saying speech recognition misses words mid-sentence and female verb forms can be mishandled.	Medium	SU023
CU038	MWM's editorial app page reports 10M+ downloads, a 4.8 out of 5 user rating, and 354.9K total ratings across locales.	Low	SU016
CU039	LanguaTalk rated Speak 3 out of 5 in February 2026 and said the app works best for beginners while feedback depth and lesson variety remain weaker for serious learners.	Medium	SU015
CU040	LanguaTalk said Speak's speech recognition can be overly lenient and may give learners a false sense of mastery.	Medium	SU015
CU041	JustUseApp surfaces complaints about refund friction, language availability, voice recognition, and missing Spanish-content access.	Low	SU014
CU042	App-store listings on Apple and Google both use auto-renewing subscriptions, indicating Speak's public monetization flow is optimized for self-serve consumer conversion.	Medium	SU009, SU011
CU043	Because Speak's official review page only republishes 5-star App Store reviews, it cannot by itself prove balanced customer satisfaction or retention.	Medium	SU003
CU044	Speak's public customer proof is much stronger for consumer learners and aggregate enterprise counts than for named enterprise case studies.	Medium	SU002, SU003, SU013
CU045	No reviewed public source disclosed NRR, GRR, churn percentages, or cohort-retention tables for Speak.	Medium	SU001, SU002, SU005, SU013, SU015
CU046	No reviewed public source disclosed contract lengths, renewal rates, or seat-expansion data for Speak for Business.	Medium	SU002, SU005, SU013
CU047	No reviewed public source disclosed top-customer concentration, regional revenue mix, or partner concentration for Speak.	Medium	SU002, SU005, SU013, SU016
CU048	Public named-customer proof is sampled rather than exhaustive: official and press sources cite enterprise counts but do not provide a public roster or case-study library for most employer logos.	Medium	SU002, SU005, SU013
CU049	Dataconomy reported that Speak users spend about 10 to 20 minutes per day in the app.	Low	SU012
CU050	Dataconomy reported that Speak for Business served over 200 corporate customers in December 2024.	Medium	SU012
CR001	Apple’s App Store listing shows Speak with a 4.8 rating from 44K ratings and a 13+ age rating.	Medium	SR004
CR002	Google Play shows Speak with 10M+ downloads, 112K reviews, and a 4.7 star rating.	Medium	SR013
CR003	Google Play’s data-safety disclosure says Speak may share app activity and device or other IDs with third parties.	Medium	SR013
CR004	Google Play’s data-safety disclosure says Speak may collect personal info, financial info, and four other data categories.	Medium	SR013
CR005	Google Play says Speak encrypts data in transit and lets users request deletion.	Medium	SR013
CR006	Speak’s Google Play listing says memberships auto-renew unless cancelled at least 24 hours before renewal.	Medium	SR013
CR007	Speak’s Apple App Store listing says memberships auto-renew unless cancelled at least 24 hours before renewal.	Medium	SR004
CR008	Speak’s cancellation article says cancelling a subscription prevents the next renewal but does not generate a refund for the current period.	Medium	SR010
CR009	Speak routes cancellation differently for Apple App Store, Google Play, and direct website purchases.	Medium	SR010
CR010	Speak’s refund policy gives Google Play and website subscribers a refund window of seven days from purchase.	Medium	SR011
CR011	Speak’s refund policy says no refunds are available after 30 days from payment for Google Play and website purchases.	Medium	SR011
CR012	Speak’s refund policy says Apple controls App Store refunds and Speak cannot process those refunds directly.	Medium	SR011
CR013	Apple Support says users can cancel in-app subscriptions and request refunds through Apple.	Medium	SR014
CR014	Google Play tells developers to be transparent about subscription terms, billing frequency, and cancellation methods.	Medium	SR015
CR015	Google Play says subscription apps must include an easy-to-use online method to cancel the subscription.	Medium	SR015
CR016	COPPA applies when an online service is directed to children under 13 or has actual knowledge that it collects personal information from children under 13.	Medium	SR008, SR020
CR017	FTC COPPA guidance says covered services must post a privacy policy and obtain verifiable parental consent before collecting personal information from children.	Medium	SR020
CR018	CNIL says the AI Act was published in the Official Journal in July 2024 and entered into force in stages from 1 August 2024.	Medium	SR018
CR019	CNIL says the AI Act is risk-based and includes high-risk examples such as biometric systems plus specific transparency obligations for some AI systems.	Medium	SR018
CR020	IAPP’s AI Act/GDPR mapping says some high-risk AI deployments require a fundamental-rights impact assessment that complements GDPR impact assessments.	Medium	SR019
CR021	IAPP says GDPR Articles 13-14, 15, and 22 create transparency and meaningful-information duties around automated decision-making.	Medium	SR019
CR022	OpenAI says Speak uses OpenAI models across audio and text modalities for interactive speaking exercises and tutors.	Medium	SR002
CR023	Speak’s Live Roleplays feature is built with OpenAI’s Realtime API and GPT-4o speech-to-speech capabilities.	Medium	SR024, SR007
CR024	Speak says current speech-to-speech models are still weaker than text models on instruction following and nuanced language-learning tasks such as pronunciation coaching.	Medium	SR024
CR025	Speak’s ASR team says earlier third-party speech-recognition services often struggled with heavily accented learner speech.	Medium	SR001
CR026	Speak runs its fine-tuned Conformer-CTC ASR stack with Nvidia Riva and Triton in Kubernetes on Google Cloud Platform.	Medium	SR001
CR027	Speak says the revamped ASR system delivers first-word feedback in about 1.6 seconds on average.	Medium	SR001
CR028	Speak said in March 2023 that GPT-4 had already powered parts of AI Tutor in production for more than two months and over 2M lessons had used the feature.	Medium	SR025
CR029	TechCrunch reported in June 2024 that Speak had grown to over 10 million users and customers in more than 40 countries.	Medium	SR030
CR030	TechCrunch reported in June 2024 that Speak’s user base had doubled every year for the prior five years.	Medium	SR030
CR031	Speak said in August 2023 that nearly 6% of South Korea’s population had used the app.	Medium	SR026
CR032	Speak said in August 2023 that it had begun international expansion and was live in more than 20 countries.	Medium	SR026
CR033	TechCrunch reported in December 2024 that Speak raised a $78M Series C at a $1B valuation.	Medium	SR027, SR007
CR034	SiliconANGLE reported that Speak’s plan to build custom LLMs could create significant costs that the new financing helps absorb.	Medium	SR007
CR035	JustUseApp’s review page says 67.1% of the combined experience it analyzed was negative.	Low	SR005
CR036	JustUseApp includes complaints that Speak’s voice recognition became laggy and did not allow enough time to finish some spoken sentences.	Low	SR005
CR037	JustUseApp includes complaints about unauthorized or unexpected charges after free-trial cancellation and about support not responding for a week on refund requests.	Low	SR005
CR038	JustUseApp includes a complaint that Speak required a Korean phone number during onboarding and another that the interface launched in a language the user did not understand.	Low	SR005
CR039	AppsHunter summarizes negative user themes as expensive subscriptions, inaccurate voice recognition, rushed lessons, limited languages, and occasional app crashes or bugs.	Low	SR006
CR040	AppsHunter lists reported bugs including inability to record during lessons, lesson progress not saving properly, and billing problems.	Low	SR006
CR041	Tooliverse flags premium pricing, pronunciation misreads, battery heating during long sessions, and background-noise sensitivity among its key watch-outs for Speak.	Low	SR022
CR042	A 2026 Google Play review says the app sometimes fails to hear everything a user says in the middle of a sentence.	Low	SR013
CR043	Apple App Store reviews and Speak’s curated review page both show strong user praise for speaking confidence gains and immediate feedback, but repeated requests for more languages.	Medium	SR012, SR023
CR044	Apple rates Speak 13+ while Google Play rates it Everyone, creating an ambiguous minor-facing posture for a voice-first tutoring product.	Medium	SR004, SR013, SR008
CR045	Because billing and refunds are split across Apple, Google Play, and Speak’s direct website, subscription-trust failures can be operationally fragmented and harder to resolve quickly.	Medium	SR010, SR011, SR013, SR014
CR046	OpenAI price, policy, or uptime changes can affect both feature quality and gross-margin assumptions because Speak publicly relies on GPT-4, GPT-4o, and Realtime API capabilities.	Medium	SR002, SR024, SR027
CR047	Speak’s cloud dependency is concentrated in a narrow infrastructure stack because its custom ASR backend runs on Google Cloud with Nvidia GPU inference components.	Medium	SR001
CR048	Rapid growth from South Korea into 20-plus and then 40-plus countries raises localization, support, and content-operations complexity even if demand remains strong.	Medium	SR026, SR030, SR029
CR049	The public review and help-center record shows billing friction is not a one-off issue because refunds, cancellation, payment status, and free-trial questions occupy a dedicated 11-article help collection alongside multiple complaint sources.	Medium	SR003, SR005, SR010, SR011
CR050	Public review evidence supports the idea that Speak has real product value, but premium pricing makes customer expectations for recognition quality and billing fairness materially higher.	Medium	SR012, SR022, SR005
CR051	NicheMetric estimates Speak generated about $10.0M of iOS revenue and more than 2.0M iOS downloads in the last 30 days, but the methodology is not transparent enough for high-confidence underwriting.	Low	SR028
CR052	SiliconANGLE says Speak now offers an enterprise edition, Speak for Business, with business-conversation features, increasing the need for support and contract maturity beyond the consumer app.	Medium	SR007
CR053	Y Combinator’s company page lists Speak as an active San Francisco company with multiple open jobs, which is directionally consistent with ongoing staffing needs rather than a fully stabilized operating model.	Low	SR029
CR054	Speak’s publicly reviewable privacy and terms pages were JavaScript-gated during chapter preparation, so voice-retention, transcript-retention, and model-training specifics could not be fully verified from machine-readable text.	Low
CR055	Public sources did not disclose Speak’s burn rate, gross margin, inference cost per lesson, customer concentration, or enterprise SLA credits.	Low
CR056	No comprehensive cross-jurisdiction litigation or enforcement package was publicly reviewable during chapter preparation, so legal-overhang risk remains only partially closed.	Low
CV001	Speak announced a $20M Series B-3 on June 18, 2024 at a $500M valuation.	Medium	SV001, SV003
CV002	Speak said the June 2024 financing brought total funding to $84M.	Medium	SV001, SV003
CV003	Speak said it had more than 10 million learners in 40+ countries by June 2024.	Medium	SV001, SV003
CV004	Speak said its learner base had more than doubled year over year for five straight years by June 2024.	Medium	SV001, SV003
CV005	TechCrunch reported Speak charged $20 per month or $99 per year in mid-2024.	Medium	SV003
CV006	Speak announced a $78M Series C on December 10, 2024 at a $1B valuation.	Medium	SV002, SV006, SV004
CV007	Speak said the Series C brought lifetime funding to $162M.	Medium	SV002, SV004
CV008	Speak said users had already spoken more than one billion sentences with the product in 2024.	Medium	SV002
CV009	Speak said Speak for Business had 200+ customers and an 85% employee adoption rate in December 2024.	Medium	SV002, SV006
CV010	Forbes said Speak’s valuation doubled to $1B after the December 2024 $78M round.	Medium	SV004, SV002
CV011	Forbes reported roughly 15 million people had downloaded Speak by November 2025.	Medium	SV005
CV012	Forbes reported Speak had surpassed $100M in annualized revenue by November 2025.	Medium	SV005, SV007
CV013	Forbes reported about 500 companies, including KPMG and HD Hyundai, offered Speak subscriptions to employees by late 2025.	Medium	SV005
CV014	Forbes reported paid Speak access ranged from roughly $80 to $200 for consumers in late 2025.	Medium	SV005, SV009
CV015	Apple’s App Store listed Speak at 4.8 stars with 44K ratings when accessed on 2026-05-05.	Medium	SV009
CV016	Apple’s App Store listed current U.S. in-app prices including $17.99 monthly premium and $83.99 annual premium, with higher Plus tiers.	Medium	SV009
CV017	Google Play listed Speak at 10M+ downloads and 112K reviews when accessed on 2026-05-05.	Medium	SV010
CV018	Google Play showed Speak was updated on May 1, 2026.	Medium	SV010
CV019	An AppBrain snapshot from February 2025 estimated 5.8M Android lifetime downloads and about 350K recent 30-day downloads for Speak.	Medium	SV011
CV020	An AppBrain snapshot showed Speak ranked #1 top grossing in South Korea Education and #2 top grossing in Japan Education in early 2025.	Medium	SV011
CV021	NicheMetric estimated Speak generated about $10.0M in iOS revenue and more than 2.0M iOS downloads in the last 30 days.	Medium	SV008
CV022	NicheMetric surfaced critical 2026 user complaints about an immediate paywall, poor support responsiveness, and inability to switch languages.	Medium	SV008
CV023	GetLatka reported Speak reached $100M revenue in 2025, up from $15M in 2024.	Medium	SV007
CV024	GetLatka listed Speak at 253 employees.	Medium	SV007
CV025	Yahoo Finance listed Duolingo at $1.04B trailing revenue and 4.01x EV/Revenue.	Medium	SV014
CV026	Yahoo Finance listed Duolingo at 39.91% profit margin and 35.0% quarterly revenue growth.	Medium	SV014
CV027	CompaniesMarketCap listed Duolingo at a $5.15B market cap in May 2026.	Medium	SV015
CV028	SEC EDGAR showed Duolingo had filed a current 10-Q on 2026-05-05 and a 10-K for 2025.	Medium	SV013
CV029	Yahoo Finance listed Coursera at $789.84M trailing revenue and 0.42x EV/Revenue.	Medium	SV017
CV030	Yahoo Finance listed Coursera at -3.0% quarterly revenue growth and 0.48% profit margin.	Medium	SV017
CV031	CompaniesMarketCap listed Coursera at a $0.98B market cap in May 2026.	Medium	SV018
CV032	SEC EDGAR showed Coursera had a 10-Q on 2026-04-30 and recent annual-report filings on file.	Medium	SV016
CV033	Yahoo Finance listed Udemy at $773.9M trailing revenue and 0.38x EV/Revenue.	Medium	SV020
CV034	Yahoo Finance listed Udemy at 9.1% quarterly revenue growth and -8.23% profit margin.	Medium	SV020
CV035	CompaniesMarketCap listed Udemy at a $0.68B market cap in May 2026.	Medium	SV021
CV036	SEC EDGAR showed Udemy had a 2026 10-K/A and prior 10-K disclosures on file.	Medium	SV019
CV037	Yahoo Finance listed Chegg at $376.91M trailing revenue and 0.32x EV/Revenue.	Medium	SV023
CV038	Yahoo Finance listed Chegg at -49.4% quarterly revenue growth and -27.44% profit margin.	Medium	SV023
CV039	CompaniesMarketCap listed Chegg at a $0.12B market cap in May 2026.	Medium	SV024
CV040	SEC EDGAR showed Chegg had 2026 10-Qs and a 2025 10-K on file.	Medium	SV022
CV041	Grand View Research estimated the global AI tutors market at $2.11B in 2025 and $17.72B by 2033, a 30.5% CAGR.	Medium	SV027
CV042	MMR Statistics estimated the global online language learning market at $24.56B in 2025 and $63.43B by 2032, a 14.52% CAGR.	Medium	SV028
CV043	MMR Statistics said more than 60% of leading online-language platforms had integrated AI-driven adaptive learning, speech recognition, or personalized lesson pathways in 2025.	Medium	SV028
CV044	MMR Statistics said freemium pricing and free content were intensifying competition and raising customer acquisition costs, especially for earlier-stage platforms.	Medium	SV028
CV045	Fortune Business Insights sized the private tutoring market at $66.96B in 2025 and said Asia Pacific held a 60.85% share.	Medium	SV029
CV046	The FTC warned that control over key generative-AI inputs can create barriers to entry and distort competition.	Medium	SV026
CV047	The FTC warned network effects can help generative-AI leaders entrench market power and reduce entrant competitiveness.	Medium	SV026
CV048	Oliver Wyman said public markets began repricing software risk in early 2026 as agentic AI challenged seat-based pricing and durable product differentiation.	Medium	SV025
CV049	Oliver Wyman said valuation multiples are becoming more sensitive to perceived AI exposure and revenue durability.	Medium	SV025
CV050	Using Speak’s $1.0B December 2024 valuation and the later reported >$100M annualized revenue milestone implies a high-single-digit to roughly 10x revenue multiple before later balance-sheet adjustments.	Medium	SV002, SV005
CV051	Speak’s implied private multiple sits above current public-peer EV/revenue levels of 4.01x for Duolingo, 0.42x for Coursera, 0.38x for Udemy, and 0.32x for Chegg.	Medium	SV014, SV017, SV020, SV023
CV052	The public-comp spread suggests Speak’s premium valuation only holds if it sustains materially faster growth and monetization than listed edtech peers.	Medium	SV014, SV017, SV020, SV023, SV005
CV053	Public evidence supports real traction, but absent cap-table, preference-stack, retention, and segment-mix disclosure makes the last public $1B mark hard to underwrite for a new investor.	Medium	SV002, SV005, SV007
CV054	A reasonable bear case is valuation compression toward roughly $400M-$650M if growth slows and public-like multiples dominate.	Medium	SV014, SV017, SV020, SV023, SV025
CV055	A reasonable base case of roughly $800M-$1.0B requires Speak to defend current scale and convert enterprise traction without requiring multiple expansion.	Medium	SV002, SV005, SV009, SV010
CV056	A reasonable bull case above $1.1B requires revenue scaling well beyond $150M plus broader enterprise adoption and continued app-ranking strength.	Medium	SV005, SV011, SV027, SV028
CV057	Because market growth is real but AI competition and multiple compression are also real, a research-more recommendation is better supported than a buy at the last public $1B mark.	Medium	SV025, SV026, SV028, SV029, SV005
CV058	Monitorable downside triggers include weakening app-store momentum, failure to convert enterprise footprint into disclosed recurring economics, and continued sector multiple compression.	Medium	SV010, SV011, SV013, SV025, SV005

Sources
ID	Publisher	Title	Quote
SO001	Speak	Speak homepage	15M+ downloads; 4.8 rating; AI language tutor focused on speaking.
SO002	Speak	Speak for Business	200+ Brands Rely on Speak for Business.
SO003	Speak	Careers page	The team is based in San Francisco, Seoul, Tokyo, and Ljubljana.
SO004	Apple App Store	Speak: Language Learning App Store listing	44K Ratings; 4.8; monthly and annual auto-renewing subscriptions.
SO005	Google Play	Speak: Language Learning Google Play listing	4.7 star; 112K reviews.
SO006	Speak	Series B-3 announcement	Speak raised $20M in Series B-3 financing, doubling its valuation to $500 million.
SO007	Speak	Series C announcement	Speak raised $78M in Series C funding at a $1 billion valuation.
SO008	Speak	New languages on Speak	Four new languages launched; more than 15 million learners around the world.
SO009	TechCrunch	Language learning app Speak nets $20M, doubles valuation	Speak has grown to over 10 million users and now has customers in more than 40 countries.
SO010	TechCrunch	OpenAI-backed Speak raises $78M at $1B valuation	Speak provides average usage around 10-20 minutes/day, paying $20 per month or $99 per year.
SO011	Tech Funding News	OpenAI-backed Speak closes $78M at $1B valuation	Speak, a San Francisco-based language-learning startup, has closed $78 million in Series C funding.
SO012	Unite.AI	Speak secures $78M Series C funding at $1B valuation	Founded in 2016 by Connor Zwick and Andrew Hsu.
SO013	FilingFlow	Speakeasy Labs Form D filing	Total offering $77,699,277; first sale Nov 13, 2024; Rule 506(b).
SO014	SEC Filing Data	Speakeasy Labs SEC filings list	Form D filings appear on 12/11/2024, 08/12/2024, and 10/17/2023.
SO015	HolonIQ	The Complete List of Global EdTech Unicorns	Speak, Language Learning App, has joined the list in Dec 2024 at $1B valuation.
SO016	GetLatka	How Speak hit $100M revenue with a 253 person team in 2025	Speak hit $100M in revenue in November 2025 and had 253 total employees.
SO017	Android Police	I ditched Duolingo for this language app, and it was a total reality check	Speak continued to praise my speaking ability even when I deliberately mispronounced phrases.
SO018	JustUseApp	Speak reviews (2026)	Negative experience 67.1%; complaints include auto-renewal, lag, and speech recognition issues.
SO019	Languatalk	Speak app review	Feedback is brief and lacks depth; premium tier structure is confusing and potentially expensive.
SO020	Crunchbase News	AI language startup Speak hits unicorn status after raising Series C	Speak hit unicorn status after its Series C round.
SO021	Inc.	This AI language learning platform is now a unicorn	Speak translated AI speech technology into a $1 billion valuation.
SO022	Similarweb	Speak app overview
SO023	Sensor Tower	Speak app overview	Speak: Language Learning - Google Play Store - US - Category Rankings and Growth Metrics.
SO024	Speak	Privacy policy route
SO025	Speak	Terms route
SM001	Speak	Speak homepage	The most effective way to learn a language.
SM002	Speak	Series B-3 announcement	Disrupting the $100 billion+ online and in-person language learning market.
SM003	Speak	Series C announcement	English learning is industry agnostic and Speak for Business had 200+ customers.
SM004	Speak	New languages on Speak	Speak started by teaching English in Korea, Japan, and Taiwan.
SM005	Technavio	Digital English language learning market report	Market size to increase USD 39.46 billion at a CAGR of 24.5% from 2024 to 2029.
SM006	MarketsandMarkets	AI in Education Market Forecast & Size	The AI in Education market is projected to grow from USD 2.21 billion in 2024 to USD 5.82 billion by 2030.
SM007	Stanford HAI	AI Index 2026 education chapter	Four out of five U.S. high school and college students now use AI for schoolwork.
SM008	World Economic Forum	AI reshaping global education	AI can automate up to 20% of educator clerical tasks but raises access, privacy, and bias concerns.
SM009	World Bank	Digital Progress and Trends 2025: AI Foundations	Low- and middle-income countries face steep AI adoption challenges; small AI and the four Cs matter.
SM010	arXiv	Systematic Review for AI-based Language Learning Tools	AI-based language learning tools improved learner outcomes but raised privacy and teacher-preparation concerns.
SM011	Preply	Global language learning statistics and trends	English is the most learned language and the English learning market is worth about $43.51B in 2025.
SM012	HolonIQ	Global EdTech Unicorns list	Preply joined the EdTech unicorn list in Jan 2026.
SM013	TechCrunch	OpenAI-backed Speak raises $78M at $1B valuation	For the one and a half billion people out there trying to learn English, the issue is speaking it.
SM014	TechCrunch	Speak nets $20M, doubles valuation	Speak makes money by charging $20 per month, or $99 per year.
SM015	Android Police	Speak review and AI limitations	Speak cannot identify basic pronunciation errors and can create a false sense of mastery.
SM016	Languatalk	Speak app review	Feedback is brief and lacks depth; lesson variety becomes repetitive.
SM017	SEC	Duolingo 2024 annual report	Duolingo offers courses in over 40 languages to more than 100 million monthly active users.
SM018	ELSA Speak	ELSA homepage	18M+ downloads; AI English speaking coach.
SM019	Apple App Store	ELSA App Store listing	109K ratings; yearly and monthly memberships are available.
SM020	Cambly	Cambly homepage	Real conversations with native speakers, anytime, anywhere, 24/7.
SM021	Busuu	Busuu App Store listing	Busuu helps you communicate with confidence from day one and connects learners with native speakers.
SM022	Busuu	Busuu website snapshot	120+ million registered users.
SM023	Babbel	Babbel App Store listing	25 million subscriptions sold.
SM024	Google Play	Babbel Play listing	50M+ downloads; 1.12M reviews.
SM025	Praktika	Praktika homepage	20M+ learners and private-tutor results without the private-tutor price.
SP001	Speak	Speak homepage	15M+ downloads; AI language tutor.
SP002	Apple App Store	Speak App Store listing	44K ratings; 4.8; free with in-app purchases.
SP003	Google Play	Speak Google Play listing	4.7 star; 112K reviews.
SP004	Speak	Series C announcement	200+ customers across various industries.
SP005	TechCrunch	Speak Series B extension article	Speak makes money by charging $20 per month or $99 per year.
SP006	TechCrunch	Speak Series C article	Speak for Business has over 200 customers and consumers typically pay $20 per month or $99 per year.
SP007	Android Police	Speak review and AI limitations	Speak is heavily inspired by Duolingo and misses basic pronunciation errors.
SP008	Languatalk	Speak app review	Speak works well for early learners but not for those seeking deeper adaptive practice.
SP009	SEC	Duolingo 2024 annual report	Duolingo offers 40+ languages to 100M+ MAUs and ~9% of MAUs are paid subscribers.
SP010	ELSA Speak	ELSA homepage	18M+ downloads and 460K+ ratings.
SP011	Apple App Store	ELSA App Store listing	109K ratings; yearly and monthly memberships available.
SP012	Cambly	Cambly homepage	Real conversations with native speakers, anytime, anywhere, 24/7.
SP013	Busuu	Busuu website snapshot	120+ million registered Busuu users.
SP014	Apple App Store	Busuu App Store listing	98K ratings; community feedback from native speakers.
SP015	Google Play	Busuu Play listing	50M+ downloads; 1.13M reviews.
SP016	Apple App Store	Babbel App Store listing	25 million subscriptions sold.
SP017	Google Play	Babbel Play listing	50M+ downloads; 1.12M reviews.
SP018	Praktika	Praktika homepage	20M+ learners and ~$8/month versus a ~$400/month private tutor.
SP019	Loora	Loora homepage	Loora is an always-available AI English tutor focused on real-time feedback.
SP020	MarketsandMarkets	AI in Education Market	Duolingo and ELSA Speak are named among the AI-in-education market participants.
SP021	HolonIQ	Global EdTech Unicorns list	Preply joined the EdTech unicorn list in Jan 2026 at a $1.2B valuation.
SP022	Preply	Global language learning report	English is the most learned language because of business and education demand.
SP023	Speak	Speak for Business page	200+ brands rely on Speak for Business.
SP024	JustUseApp	Speak reviews aggregation	67.1% negative experience according to review aggregation.
SP025	Duolingo	Duolingo homepage	The free, fun, and effective way to learn a language.
SP026	Duolingo	Duolingo Super page	Super Duolingo is the premium upsell path from the free product.
SP027	Cambly	Cambly pricing page	Pricing page shows private lessons and Pro plans from US$8.12 per lesson on annual terms.
SP028	Apple App Store	Cambly App Store listing	Cambly – Learn English App Store listing.
SP029	Preply	Preply homepage	Preply is an online language tutoring marketplace.
SP030	italki	italki homepage	italki is a language-learning marketplace with certificated tutors.
SP031	HelloTalk	HelloTalk homepage	HelloTalk is a language exchange and learning platform.
SP032	Google Play	Duolingo Play listing	Duolingo: Language Lessons - Apps on Google Play.
SP033	Speak	New languages post	French, Japanese, Korean, and Italian launched after Spanish.
SI001	Speak	Speak homepage	15M+ downloads; AI language tutor.
SI002	Apple App Store	Speak App Store listing	44K ratings; monthly and annual auto-renewing subscriptions.
SI003	Google Play	Speak Google Play listing	4.7 star; 112K reviews.
SI004	Speak	Speak for Business page	200+ brands rely on Speak for Business.
SI005	Speak	Series B-3 announcement	Speak raised $20M in Series B-3 financing, doubling its valuation to $500 million.
SI006	Speak	Series C announcement	Speak raised $78M in Series C funding at a $1 billion valuation.
SI007	TechCrunch	Speak Series B extension article	Speak makes money by charging $20 per month or $99 per year.
SI008	TechCrunch	Speak Series C article	Speak users spend roughly 10-20 minutes per day and pay $20 per month or $99 per year.
SI009	FilingFlow	Speakeasy Labs Form D filing	Total offering $77,699,277; first sale Nov 13, 2024; Rule 506(b).
SI010	SECFilingData	Speakeasy Labs SEC filings list	Form D filings appear on 12/11/2024, 08/12/2024, and 10/17/2023.
SI011	SEC	Duolingo 2024 annual report	Duolingo offers 40+ languages to 100M+ MAUs and about 9% of MAUs are paid subscribers.
SI012	Cambly	Cambly pricing page	Private and Pro tutoring plans run from US$8.12 per lesson on annual terms.
SI013	Busuu	Busuu premium page	Premium - Busuu.
SI014	Apple App Store	Babbel App Store listing	25 million subscriptions sold.
SI015	ELSA Speak	ELSA pricing page	The fetched page exposes Pro Memberships, ELSA for Business, and ELSA for Schools.
SI016	Preply	Preply pricing page	The fetched page points users to Preply Subscription and Corporate language training.
SI017	italki	italki teachers page	4339 English tutors available; visible trial pricing starts at USD 5.00.
SI018	HelloTalk	HelloTalk VIP page	HelloTalk says it helps users learn a language for free and has 70M+ registered users across 260+ languages.
SI019	Praktika	Praktika pricing page	The fetched page shows app-download CTAs and a For business link even though the pricing URL 404s.
SI020	Loora	Loora pricing page	The fetched page shows Loora for Business plus app download links even though the pricing URL 404s.
SI021	SEC	Duolingo Q1 2025 10-Q viewer	SEC XBRL viewer for Duolingo quarter ended March 31, 2025.
SI022	ELSA Speak	ELSA homepage	18M+ downloads and 460K+ ratings.
SI023	Duolingo	Duolingo homepage	The free, fun, and effective way to learn a language.
SI024	Apple App Store	Speak App Store reviews aggregation	67.1% negative experience according to review aggregation.
SI025	Praktika	Praktika homepage	20M+ learners and roughly $8/month versus a private tutor.
SI026	Loora	Loora homepage	Loora is an always-available AI English tutor.
SE001	Speak	Speak - The language learning app that gets you speaking	Talk out loud, get instant feedback, and become fluent with the world’s most advanced AI language tutor.
SE002	Speak	Leveling up our core speech recognition systems at Speak	This fine-tuned model dramatically outperforms the pretrained model with a >60% reduction in word error rate for our learners and task type.
SE003	Speak	Live Roleplays powered by OpenAI Realtime API	Today, we’re announcing Live Roleplays, a new Speak experience that combines Realtime API with Speak’s learning engine to enable immersive, life-like speaking practice in a variety of roleplay scenarios.
SE004	Speak	Designing a High-Accuracy Speech Matching Pipeline with ASR and Phonetic Models	By introducing a phonetic model alongside the ASR model, we reduced false negatives by approximately 40% without making our algorithm more lenient.
SE005	Speak	Building Speak's Voice Agent Platform	Audio transport: WebRTC via LiveKit.
SE006	Speak	How Speak reinvents language learning	The Speak Method is our proprietary learning method built around three phases: Learn, Practice, and Apply.
SE007	Speak	Your most personalized Speak yet: What’s new in our winter release	We’re launching new features to deepen your learning experience with Speak, all designed to help you speak more, learn faster, and stay engaged every step of the way.
SE008	Speak	New languages on Speak, just in time for the summer	Today we officially launch four new languages for English speakers to accompany our recent Spanish release: French, Japanese, Korean, and Italian.
SE009	Speak	Speak named Google Play’s “Best App of 2024” in Hong Kong, Korea and Taiwan	Speak named Google Play’s Best App of 2024 in Hong Kong, Korea and Taiwan.
SE010	Speak Help Center	What Languages Can I Learn with Speak?	Speak currently offers English learning courses for speakers of 15 native languages.
SE011	Speak Help Center	How Does Speak Curate Its Content and Curriculum?	All of our lessons are written by learning designers—a team of educators, linguists, and translators.
SE012	Speak Help Center	What devices and operating systems does the Speak app support?	The Speak app is available on both iOS and Android devices ... Speak is not available on desktop (PC).
SE013	Speak Help Center	Voice recognition isn’t working	If Speak isn’t picking up your voice, please try the steps below.
SE014	Speak Help Center	How can I report an issue in the app?	You can report an issue directly from the screen where the problem occurs.
SE015	Speak Help Center	How can I contact Speak Support?	You can contact the Speak Support team by email only.
SE016	Speak Help Center	Troubleshooting Guides	Common troubleshooting methods for when unexpected issues.
SE017	Apple App Store	Speak: Language Learning App - App Store	Powered by cutting-edge AI technology, Speak ensures that you gain fluency by engaging in real-life conversations and receiving instant feedback.
SE018	Apple App Store	Speak: Language Learning - Ratings & Reviews - App Store	I love that the lessons give you the phrase, have you repeat it and then quiz you.
SE019	Google Play	Speak: Language Learning - Apps on Google Play	This app may share these data types with third parties: App activity and Device or other IDs.
SE020	JustUseApp	Speak Reviews (2026) \| Check if app is safe or legit	Overall Customer Experience: Negative experience 67.1% ... The voice recognition became laggy after this update.
SE021	LanguaTalk	Speak App Review: Is It Worth It in 2026?	Speak’s audio quality is strong ... but the app’s feedback and customization features are unlikely to satisfy serious learners.
SE022	Y Combinator	Speak: A superhuman, AI-powered language tutor in your pocket	Applied ML Engineer, Speech.
SE023	OpenAI	Speak is personalizing language learning with AI	That’s easy—OpenAI’s real-time API and multimodality for audio.
SE024	TechCrunch	OpenAI-backed Speak raises $78M at $1B valuation to help users learn languages by talking out loud	Speak has built a platform to teach languages by focusing on how native speakers learn: Using AI, the startup generates audio conversations and listens to users’ responses.
SE025	SiliconANGLE	OpenAI backs $78M round for AI language learning startup Speak	One of the latest additions to Speak’s feature set, Live Roleplays, rolled out a few weeks ago.
SE026	AppsHunter	Speak: Language Learning App - AI Speaking Practice	Updated April 30, 2026. Version 4.46.0. Compatibility: iOS 16.0+.
SE027	OpenAI Developer Community	Voice Agent using Realtime API	The agent-builder package provides a streamlined way to create real-time voice agents powered by OpenAI’s Realtime API.
SE028	GitHub	openai/openai-realtime-agents	This is a simple demonstration of more advanced, agentic patterns built on top of the Realtime API.
SE029	Speak	Speak Shares Details of AI Tutor, Built on Top of OpenAI’s GPT-4	Speak has used GPT-4 in production to power parts of its AI Tutor feature.
SE030	Speak	Speak privacy policy landing page	Speak - The languages learning app that gets you speaking.
SE031	Speak	Speak terms landing page	Speak - The languages learning app that gets you speaking.
SU001	Speak	Speak homepage	15M+ Downloads
SU002	Speak	Speak for Business \| Enterprise Language learning	200+ Brands Rely on Speak for Business
SU003	Speak	Why Learners Love Speak: Real App Store Feedback	This page contains a complete, unedited collection of real 5-star App Store reviews from Speak learners in the United States.
SU004	Speak	Speak Hits $500M Valuation, Expands Rapidly Across Markets	Speak now has more than 10 million learners in 40+ countries, with learners more than doubling year-over-year for the last five years.
SU005	Speak	A new milestone as we bring language learning to all: Raising $78M Series C at a $1B valuation	Our momentum is clear with more than 200+ customers across various industries, and an 85 percent adoption rate among employees.
SU006	Speak	A year in conversation \| Speak Wrapped 2025	In 2025, learners spoke 3.74 billion lines on Speak, an increase of 111 percent from 2024.
SU007	TechCrunch	Language learning app Speak nets $20M, doubles valuation	Its user base has doubled every year for the last five years, and Speak now has customers in more than 40 countries.
SU008	TechCrunch	OpenAI-backed language learning app Speak raises $16M to expand to the US	Speak has managed to hold its own despite the competition, becoming one of the top-downloaded education apps in South Korea, where it first launched, with well over 100,000 subscribers.
SU009	Apple App Store	Speak: Language Learning App - App Store listing	44K Ratings
SU010	Apple App Store	Speak: Language Learning - Ratings & Reviews	4.8 out of 5
SU011	Google Play	Speak: Language Learning - Apps on Google Play	4.7star 112K reviews 10M+ Downloads
SU012	Dataconomy	AI-driven language learning startup Speak raises $78M at a $1B valuation	An enterprise tier, Speak for Business, currently serves over 200 corporate customers.
SU013	Forbes	How AI Language Learning App Speak Is Taking On Duolingo	Now, some 500 companies including KPMG and HD Hyundai offer Speak subscriptions to employees primarily in South Korea.
SU014	JustUseApp	Speak Reviews (2026) \| Check if app is safe or legit	Paid for a subscription for Spanish and now only half of the Spanish lessons are available.
SU015	LanguaTalk	Speak App Review: Is It Worth It in 2026?	Feedback is one of Speak's weakest points.
SU016	MWM	Speak: Language Learning - Education App	Downloads 10M+; User Rating 4.8/5; Total Ratings 354.9K
SU017	KPMG	About KPMG	KPMG is a global organization of independent professional services firms providing Audit, Tax, and Advisory services.
SU018	Apple App Store	Speak pricing snapshot	Monthly Premium $17.99; Annual Premium $83.99; Annual Premium Plus $164.99
SU019	Apple App Store	j herronov App Store review	This app has improved that aspect so much for me. I am not yet fluent, but am able to pick out words and sayings on French videos and French hockey broadcasts now.
SU020	Apple App Store	brombres App Store review	One small gripe: the UI doesn't feel as intuitive as it could.
SU021	Google Play	Dan S Google Play review	The AI is phenominal and language recognition is dead-on.
SU022	Google Play	Rosalyn Mulder Google Play review	They also have a daily streak that me, personally, I enjoy because I feel motivated to get that streak as high as possible.
SU023	Google Play	Danielle Chavez Google Play review	I do run into an issue where it does not hear everything I say particularly in the middle of a sentence.
SU024	Speak	Dan S review excerpt on official Speak review page	I've been using this app for more than six months now. It's simply the best Spanish language learning app that exists.
SU025	Speak	Rosalyn Mulder review excerpt on official Speak review page	I absolutely love this app. There have been so many other apps that I've tried like Duolingo and Mango. They help me learn sure, but I feel this one is the best so far.
SU026	Speak	j herronov review excerpt on official Speak review page	This has become by far my favorite app for a number of reasons.
SR001	Speak	Leveling up our core speech recognition systems at Speak	We deployed Riva and Triton in our existing Kubernetes cluster... Our backend is deployed with Kubernetes on Google Cloud Platform.
SR002	OpenAI	Speak is personalizing language learning with AI	Speak leverages OpenAI models to power its language learning curriculum across modalities such as audio and text.
SR003	Speak Help Center	Subscription/Billing	Subscription/Billing ... 11 articles.
SR004	Apple App Store	Speak: Language Learning App - App Store	44K Ratings ... Age Rating 13+ ... Speak offers both monthly and annual auto-renewing subscriptions.
SR005	JustUseApp	Speak Reviews (2026) \| Check if app is safe or legit	Negative experience 67.1% ... The voice recognition became laggy after this update.
SR006	AppsHunter	Speak: Language Learning App - AI Speaking Practice	Negative things ... Expensive subscription cost ... Voice recognition can be inaccurate ... Lesson progress not saving properly.
SR007	SiliconANGLE	OpenAI backs $78M round for AI language learning startup Speak	Building such models can incur significant costs. The $78 million funding round announced today could make it easier for the company to balance those expenses with growth investments.
SR008	Federal Trade Commission	Children's Online Privacy Protection Rule ("COPPA")	COPPA imposes certain requirements on operators ... directed to children under 13 years of age.
SR009	European Data Protection Board	AI Privacy Risks & Mitigations Large Language Models (LLMs)	AI Privacy Risks & Mitigations Large Language Models (LLMs).
SR010	Speak Help Center	How can I cancel the subscription?	If you cancel your subscription, you can continue using your subscription until the current subscription period ends, but you will not receive a refund.
SR011	Speak Help Center	Refund Policy	Full refund available within 7 days of purchase ... After 30 days from the payment date: No refunds are available.
SR012	Apple App Store	Speak: Language Learning - Ratings & Reviews	4.8 out of 5 ... 44K Ratings.
SR013	Google Play	Speak: Language Learning - Apps on Google Play	10M+ Downloads ... This app may share these data types with third parties ... Data is encrypted in transit.
SR014	Apple Support	Subscriptions and Billing - Official Apple Support	You can cancel a subscription from Apple ... App Store and iTunes Store purchases may be eligible for a refund.
SR015	Google Play Console Help	Create and manage subscriptions	You must be transparent with users about your offer terms ... and how a user can manage or cancel their subscription.
SR016	EUR-Lex	Regulation (EU) 2016/679 (GDPR)	Regulation - 2016/679 - EN - gdpr - EUR-Lex.
SR017	European Data Protection Board	Artificial intelligence	Artificial intelligence \| European Data Protection Board.
SR018	CNIL	Entry into force of the European AI Regulation: the first questions and answers from the CNIL	The European AI Act has just been published ... and will gradually come into force as of 1 August 2024.
SR019	IAPP	EU AI Act: Mapping the Interplays with the GDPR	The AI Act and the GDPR ... map interplays between the AI Act and the GDPR.
SR020	Federal Trade Commission	Children’s Online Privacy Protection Rule: A Six-Step Compliance Plan for Your Business	Before collecting, using or disclosing personal information from a child, you must get their parent’s verifiable consent.
SR021	Android Developers	Google Play Policies	Google Play Policies \| Android Developers.
SR022	Tooliverse	Speak Review 2026 - AI Language Learning	Premium pricing and battery consumption during long sessions require consideration.
SR023	Speak	Why Learners Love Speak: Real App Store Feedback	This page contains a complete, unedited collection of real 5-star App Store reviews from Speak learners in the United States.
SR024	Speak	Live Roleplays powered by OpenAI Realtime API	These new speech-to-speech models aren’t as good as text models on instruction following, and they’re not great yet at more nuanced language learning specific tasks.
SR025	Speak	Speak Shares Details of AI Tutor, Built on Top of OpenAI’s GPT-4	Speak has used GPT-4 in production to power parts of its AI Tutor feature.
SR026	Speak	OpenAI Startup Fund-Backed Speak Announces $16m Series B-2 Financing & Rapid International Expansion	Nearly 6% of the population has turned to Speak ... now live in more than 20 countries.
SR027	TechCrunch	OpenAI-backed Speak raises $78M at $1B valuation to help users learn languages by talking out loud	Speak is using the company’s technology to power its platform.
SR028	NicheMetric	Speak: Language Learning - Revenue, Downloads & Market Analysis	Revenue Last 30 days $10.0M ... Downloads Last 30 days > 2.0M.
SR029	Y Combinator	Speak: A superhuman, AI-powered language tutor in your pocket	Active ... San Francisco ... Jobs 10.
SR030	TechCrunch	Language learning app Speak nets $20M, doubles valuation	Speak has grown to over 10 million users ... customers in more than 40 countries.
SV001	Speak	Speak Hits $500M Valuation, Expands Rapidly Across Markets
SV002	Speak	A new milestone as we bring language learning to all: Raising $78M Series C at a $1B valuation
SV003	TechCrunch	Language learning app Speak nets $20M, doubles valuation
SV004	Forbes via Internet Archive	Speak \| Company Overview & News
SV005	Forbes via Internet Archive	This Startup Is Racing Duolingo To Replace Human Language Tutors With AI
SV006	AIbase	AI Language Learning Platform Speak Raises $78 Million, Valuation Exceeds $1 Billion
SV007	GetLatka	How Speak hit $100M revenue with a 253 person team in 2025.
SV008	NicheMetric	Speak: Language Learning - Revenue, Downloads & Market Analysis
SV009	Apple App Store	Speak: Language Learning App - App Store
SV010	Google Play	Speak: Language Learning - Apps on Google Play
SV011	AppBrain via Internet Archive	Speak - Language Learning for Android - Free App Download
SV012	Apptopia	About: Speak: Language Learning (Google Play version)
SV013	SEC EDGAR	EDGAR Company Search Results - Duolingo, Inc.
SV014	Yahoo Finance	Duolingo, Inc. (DUOL) Valuation Measures & Financial Statistics
SV015	CompaniesMarketCap	Duolingo (DUOL) - Market capitalization
SV016	SEC EDGAR	EDGAR Company Search Results - Coursera, Inc.
SV017	Yahoo Finance	Coursera, Inc. (COUR) Valuation Measures & Financial Statistics
SV018	CompaniesMarketCap	Coursera (COUR) - Market capitalization
SV019	SEC EDGAR	EDGAR Company Search Results - Udemy, Inc.
SV020	Yahoo Finance	Udemy, Inc. (UDMY) Valuation Measures & Financial Statistics
SV021	CompaniesMarketCap	Udemy (UDMY) - Market capitalization
SV022	SEC EDGAR	EDGAR Company Search Results - Chegg, Inc.
SV023	Yahoo Finance	Chegg, Inc. (CHGG) Valuation Measures & Financial Statistics
SV024	CompaniesMarketCap	Chegg (CHGG) - Market capitalization
SV025	Oliver Wyman	How AI is reshaping SaaS valuations: a guide for investors
SV026	Federal Trade Commission	Generative AI Raises Competition Concerns
SV027	Grand View Research	AI Tutors Market Size, Share & Trends \| Industry Report 2033
SV028	MMR Statistics	Online Language Learning Market Insights 2025–2032
SV029	Fortune Business Insights	Private Tutoring Market Size, Share & Industry Growth, 2034
SV030	Sensor Tower	Speak: Language Learning - Apple App Store - US - Category Rankings, Keyword Rankings, Sales Rankings, Research, Performance, and Growth Metrics.