Startup Diligence
Diligence report AI language learning / consumer subscription edtech / enterprise language training Series C private / growth-stage edtech unicorn 2026-05-05

Speak

Fast-growing AI speaking tutor with real demand signals, but still too opaque for conviction pricing at the last unicorn mark

Speak has credible AI-language-learning traction and real unicorn momentum, but public evidence still undersupports the $1B mark.

Cover facts

Valuation 01
1000 USD M
Total Raised 02
162 USD M
Enterprise customers 03
200 organizations+
Downloads 04
15 M+
Recommendation 05
research-more

Company profile

Speak is a private AI language-learning company building a voice-first tutor for spoken fluency across consumer and employer use cases. The company pairs a differentiated speaking-focused curriculum with generative AI and speech technologies, giving it stronger product proof than many education apps. Public evidence shows meaningful global distribution and investor support, but still leaves too many financial, retention, and cap-table questions unanswered for a high-conviction price call.

Website
www.speak.com
Founded
2016-01-01
Founders
Connor Zwick, Andrew Hsu
Founding location
San Francisco, California, USA
Headquarters
San Francisco, California, USA
Product
Speak delivers AI-guided speaking practice, pronunciation feedback, structured lessons, live roleplays, and multilingual tutoring through mobile apps and an enterprise training product.
Customers
Consumers learning spoken English and other languages, plus employers using English upskilling and workplace language practice.
Business model
Consumer subscriptions with in-app pricing plus enterprise seat-based or contract language-training offerings.
Stage
Series C private / growth-stage edtech unicorn
Funding status
Raised $78M Series C in December 2024 at a $1.0B valuation and roughly $162M cumulative funding, after a June 2024 B-extension at a $500M valuation.

Executive summary

Top strengths

  • Product positioning is differentiated around spoken fluency rather than passive language study.
  • Consumer traction is real, with strong app-store ratings, millions of downloads, and visible international reach.
  • The company proved repeat investor demand in 2024, doubling valuation from $500M to $1.0B within six months.
  • Speak for Business creates a second revenue surface beyond consumer subscriptions.
  • Engineering and product execution appear strong enough to ship advanced AI tutor and voice-agent features quickly.

Top risks

  • Public disclosure is too thin on ARR, margins, retention, and payback to defend the current valuation with confidence.
  • Competitive pressure from Duolingo, ELSA, Babbel, OpenAI-enabled entrants, and low-cost AI tutors can compress pricing power.
  • Consumer billing friction, refund complaints, and pronunciation-quality criticism show product-risk edges beneath the growth story.
  • Dependence on third-party model and app-store ecosystems can weaken moat durability and economics.
  • Regulatory and privacy exposure rises because the product processes voice data across many geographies and may touch younger learners.

Open gaps

  • Current ARR, gross margin, consumer conversion funnels, and enterprise revenue mix.
  • Renewal, churn, NRR, and concentration for Speak for Business.
  • Full cap-table terms, preferences, secondary rights, and investor protections at the Series C mark.
  • Current headcount by function, current board composition, and deeper governance controls.
  • Segment-level cohort retention and whether premium pricing is durable outside early adopter markets.

Contents

Chapter 01

01Company Overview

1.1 Identity, product scope, and business model

Speak is the consumer-facing brand of Speakeasy Labs, a San Francisco-headquartered startup focused on spoken-language fluency rather than textbook-style memorization. Across its homepage, app-store listings, and late-2024 fundraising materials, the company consistently describes itself as an AI language tutor that pushes learners to speak out loud, receive instant feedback, and build conversational confidence. That positioning matters because it is narrower than general language-learning incumbents: Speak competes first on speaking efficacy, not on broad gamified content breadth. By May 2026, Speak’s public consumer surfaces show a broader product than the English-only wedge that powered its early growth. The homepage and Apple listing advertise six live learning tracks (French, Spanish, English, Korean, Italian, and Japanese), while the June 2025 product post states that Spanish launched first for English speakers and four more languages followed shortly after. This evolution supports a hybrid model: consumer subscriptions remain the core monetization engine, but the same voice-first tutoring stack now also supports a business offering for employers. TechCrunch’s December 2024 reporting gives the clearest public list-price anchor for the consumer product at $20 per month or $99 per year. The public disclosure profile is still that of a private growth startup. Speak says enough to establish category, product strategy, and top-line traction, but not enough to validate unit economics, current ARR, or the exact customer mix between consumer and enterprise.[CO001, CO002, CO003, CO004, CO037]

1.2 Founders, leadership, and organizational footprint

The public leadership picture is founder-led and still highly concentrated. Late-2024 fundraising coverage consistently names Connor Zwick as CEO and Andrew Hsu as CTO/co-founder, with Zwick serving as the primary external spokesperson in both official company posts and independent press. That concentration is positive for speed and product coherence, but it also creates clear key-person dependence because the company’s narrative, investor messaging, and product vision all run heavily through Zwick. Speak’s operating footprint is global for a company of its size. TechCrunch reported a 75-person workforce across San Francisco, Seoul, Tokyo, and Ljubljana in June 2024, while the current careers page still highlights Seoul, Ljubljana, and San Francisco and presents the company as internationally distributed. The careers page is useful but internally stale: it also includes an older historical snapshot referring to a 60-person team and more than $60 million raised, showing that the page was not fully updated after the late-2024 financing jump. That inconsistency is a small but real diligence signal: even company-controlled surfaces should be treated as time-stamped claims, not timeless facts. Governance disclosure is limited. The best publicly supported board datapoint is that Accel partner Ben Quazzo joined the board at Series C. Beyond that, public materials identify related persons in the Form D filing but do not disclose a full current board roster, observer rights, or control provisions.[CO002, CO014, CO015, CO016, CO024, CO029]

Leadership and founder table
PersonRoleBackground / public signalFounder-market fit or functional coverageKey-person dependency
Connor ZwickCEO, co-founderPrimary external spokesperson across official posts and TechCrunch fundraising coverageOwns product vision, fundraising narrative, and category positioningHigh
Andrew HsuCTO, co-founderNamed in fundraising coverage and co-credited for technical directionOwns speech / AI stack and technical credibilityHigh
Ben QuazzoAccel partner, board memberJoined board at Series C according to official and TechCrunch coverageAdds late-stage venture governance and recruiting leverageMedium
Colton GyulayRelated person in Form DListed in Form D related persons setSuggests finance or legal leadership role, but public scope unclearLow
Alex BerkenkampRelated person in Form DListed in Form D related persons setSuggests operating leadership, but public scope unclearLow

Only roles directly observable in reviewed public sources are included; broader executive bench is not fully disclosed.

[CO024, CO029, CO030]

1.3 Funding history, valuation step-up, and chronology of record

Speak’s public funding story accelerated sharply in 2024. The company announced a $20 million Series B-3 in June 2024 at a $500 million valuation, then followed six months later with a $78 million Series C at a $1 billion valuation. The Series C post says total capital reached $162 million and frames the financing as the second preempted round of the year. Independent coverage from TechCrunch, Tech Funding News, and HolonIQ corroborates the valuation jump strongly enough to treat the unicorn milestone as well supported. The securities-filing trail is directionally consistent with the press narrative. A Form D filed on December 11, 2024 lists a $77.7 million equity offering under Rule 506(b), with the first sale dated November 13, 2024. The filing list also shows additional Form D events on August 12, 2024 and October 17, 2023, which likely map to intermediate financing activity around the B-extension period and prior private issuance activity. Those filings do not reveal ownership terms, preferences, or all governance rights, but they do add useful primary-document corroboration that real capital moved on the timeline the company describes. HolonIQ’s EdTech unicorn ledger then recorded Speak as joining the global EdTech unicorn list in December 2024 at a $1 billion valuation. That makes the Series C not just a financing milestone but a category-status milestone with real signaling value for future fundraising, recruiting, and enterprise sales.[CO008, CO009, CO010, CO011, CO012, CO023]

Stakeholder or investor map
StakeholderRoleControl or economic importanceDiligence ask
AccelLead Series C investor; board seat via Ben QuazzoAnchors unicorn round and likely holds meaningful governance rights from Dec 2024 onwardRequest board documents, pro rata rights, and ownership percentage.
OpenAI Startup FundRepeat investorStrategic AI ecosystem signal and product-validation partnerClarify any model-access, branding, or exclusivity arrangements.
Khosla VenturesRepeat investorLong-term financial backer across multiple roundsConfirm ownership, preferences, and follow-on intentions.
Y Combinator / Paul Graham networkEarly investor and signal amplifierHigh reputational value and founder-network supportClarify whether YC rights differ from standard seed investors.
Buckley VenturesLed June 2024 Series B-3Key bridge investor before the Series C step-upReview whether the B-3 introduced special terms before Accel led Series C.
Employers using Speak for BusinessCommercial stakeholders rather than cap-table ownersPotentially meaningful design partners if the 200+ customer claim is realRequest top-10 customers by seats, geography, and renewal status.

Public sources identify major financing participants and the enterprise customer class, but not the full capitalization table or ownership splits.

[CO008, CO009, CO011, CO017, CO018, CO023]
Milestone table
DateEventTypeAmount / valuation / statusParticipantsImplication
2016Speak founded / formal company creation enters public chronologyfoundingConnor Zwick; Andrew HsuAnchors company identity used by official and third-party profiles.
2019English learning launched first in Korea, beginning Asian market wedgeproductInitial market launchSpeakEstablished the Korea-first distribution strategy later referenced in official posts.
2023-10Public filing trail shows a Form D dated 2023-10-17financingForm D eventSpeakeasy LabsEarliest reviewed filing evidence of private capital activity in this run.
2024-06-18Series B-3 announcedfinancing$20M at $500M valuationBuckley Ventures; OpenAI Startup Fund; Khosla; Paul Graham; Jeff WeinerValuation doubled in less than a year and set up the unicorn jump.
2024-08-12Additional Form D appears on public filing listfinancingForm D eventSpeakeasy LabsSuggests further private issuance activity before Series C closing.
2024-12-10Series C announced; Speak for Business highlighted; Ben Quazzo joins boardfinancing / governance$78M at $1B valuationAccel; OpenAI Startup Fund; Khosla; YCUnicorn milestone and governance step-up with board formalization.
2024-12HolonIQ adds Speak to the EdTech unicorn listscale$1B valuation statusHolonIQExternal category validation beyond company PR.
2025-06-17Four new languages for English speakers launch and learner count crosses 15M+product / scale15M+ learners claimedSpeakConfirms multilingual expansion beyond the original English wedge.
2026-02-23Android Police publishes a critical review on pronunciation leniencyadverseIndependent adverse product reviewAndroid PoliceShows scaled visibility but also real quality risk around over-permissive feedback.

Founding year is standardized to 2016 despite one conflicting TechCrunch line describing a 2014 launch; the conflict is preserved in evidence gaps rather than left unresolved in the chronology of record.

[CO001, CO008, CO009, CO012, CO021, CO023]
FO001: Strategic inflection timeline

Public chronology of Speak’s identity, capital formation, product expansion, and external validation from 2016 through February 2026.

[CO008, CO009]
FO003: Investability scorecard

Compact view of Speak’s public maturity, traction, and diligence risk as of May 2026.

[CO012, CO032]

1.4 Traction signals, enterprise expansion, and risk flags

Speak has enough public traction data to support “scaled private company,” but not enough to underwrite operating performance. On the consumer side, official surfaces claim 15M+ downloads and a 4.8 rating, while app stores independently confirm strong rating depth (44K ratings on Apple; 112K reviews on Google Play). Earlier 2024 fundraising materials said Speak had more than 10 million learners in 40+ countries and that users had already spoken more than one billion sentences that year. Those are all company-centered metrics, but they are at least consistent with the product’s visible distribution footprint. The most interesting non-consumer signal is the enterprise product. Speak for Business appears to have launched in earnest between the June and December 2024 rounds, and the Series C announcement claims more than 200 customers and an 85% employee adoption rate. The current B2B page similarly says 200+ brands rely on Speak for Business. That suggests a real second act beyond subscriptions, although public disclosures still do not break out B2B revenue, contract size, or retention. Risk flags remain mostly product-quality and disclosure-related rather than legal or financial. Independent reviewers praised the polish and speaking-first design, but adverse reviews also point to lenient pronunciation scoring, confusing premium packaging, refunds and auto-renew complaints, and incomplete disclosure on current headcount and board composition. Those issues do not invalidate the growth narrative, but they do make the next layer of diligence operational rather than purely promotional.[CO005, CO006, CO007, CO013, CO017, CO018]

Snapshot KPI table
MetricValue / statusDateConfidenceGap / diligence ask
Latest valuation$1.0B (Series C)2024-12highSupported by official announcement, TechCrunch, and HolonIQ.
Total raised$162M cumulative2024-12mediumCompany claim; reconcile against full cap table and any undisclosed bridge capital.
Consumer scale15M+ downloads claimed; 10M+ users reported mid-20242025-06 / 2024-06mediumDownloads and users are different denominators; request MAU, DAU, and paying subscribers.
Enterprise traction200+ customers / brands; 85% adoption rate claimed2024-12 / 2026-05mediumNeed customer list, seat count, and renewal data to validate enterprise depth.
Current headcount75 in Jun 2024; 253 in 2025 third-party estimate2024-06 / 2025-11lowRequest current employee count by function; public sources conflict.
Consumer list price$20/month or $99/year2024-12highVerify current in-app pricing by geography and Premium Plus upsell rules.
App reputation4.8 on Apple; 4.7 on Google Play2026-05highReview cohort of low-star complaints by locale and version.
Debt / credit facilitieslowNo public debt or credit facility found in reviewed sources; confirm directly with management.
Board compositionBen Quazzo added at Series C; full board not public2024-12lowRequest current board roster, observer rights, and governance documents.

Unsupported private metrics are left null or described as conflicting public estimates rather than forced into false precision.

[CO005, CO006, CO007, CO009, CO010, CO018]
FO002: Company snapshot logic

How Speak’s core AI tutor, subscription model, enterprise expansion, and funding momentum connect.

[CO003, CO009, CO017, CO018, CO019, CO032]

1.5 Exhibits

Chapter 02

02Market Analysis

2.1 Market boundary: where Speak competes and where it does not

Speak sits inside a narrower segment than “language learning” in general. The relevant boundary is digital, speaking-first language learning: products that use AI or live conversation to help users practice spoken output in realistic contexts, rather than primarily drilling reading, grammar, or flash-card memorization. Speak’s own fundraising materials make that framing explicit by contrasting its speaking-first tutor with older app models and by calling out the much broader $100B+ online-and-offline language-learning market only as background context. That means the closest substitutes are not every educational app, but a narrower bundle: broad language platforms such as Duolingo, Babbel, and Busuu; specialized speaking apps such as ELSA and Praktika; live-tutor networks such as Cambly; and, in enterprise settings, employer-sponsored English training or in-house tutoring. Spend that belongs inside the market includes consumer subscriptions, employer-funded English upskilling, and digital spoken-language products. Spend outside the market includes generalized test-prep, textbook-led classroom instruction that does not emphasize speaking, or enterprise training categories unrelated to language fluency. The key diligence point is that Speak is best understood as a spoken-fluency wedge inside a large but heterogeneous market. A very broad TAM exists, but the underwritable market depends on who actually pays for repeated speaking practice and how much of that demand can be digitized.[CM001, CM002, CM023, CM030, CM031, CM033]

Market definition table
Segment / categoryIncluded spendExcluded spendBuyer / payerRelevance to Speak
AI speaking-first language appsConsumer subscriptions and employer-funded speaking practiceGeneral classroom tuition and unrelated workforce trainingLearner or employerCore market
Broader digital English learningApps, online tutoring, test-linked practice, digital English coursesOffline-only classroom instruction without digital componentLearner, parent, employer, schoolImportant adjacent pool
Broad online language learningAll online language-learning subscriptions across many languagesOffline tutoring and textbook salesLearner, employer, schoolUpper-bound context, too broad for SAM
Human tutor and live conversation servicesLive sessions and recurring tutor membershipsSelf-serve AI-only subscriptionsLearner or employerPrimary substitute, not direct AI market
Community / structured course appsSubscription courses with reading, grammar, and community correctionPure speaking-only or pure tutoring modelsLearnerPrimary substitute set for consumer budgets

Boundary logic separates spoken-fluency products from the broader education and tutoring universe.

[CM001, CM023, CM030, CM031, CM033]
FM001: Buyer and adoption path flow

How demand moves from global English-learning need to consumer subscription or employer-funded adoption.

[CM004, CM023]

2.2 Sizing lenses: broad demand is real, but the lenses are not interchangeable

The strongest public evidence supports a large and growing digital English-learning market, but not a single clean number. Technavio estimates the digital English language learning market will grow by $39.46B between 2024 and 2029 at a 24.5% CAGR, with APAC contributing 39% of growth. Preply, using a different methodology and broader online-learning lens, estimates the global online language-learning market reaches about $115B in 2025, with English alone worth roughly $43.51B and growing around 22% annually. Speak’s own June 2024 post uses the broadest lens of all, describing a $100B+ online-and-offline market. These figures should not be averaged together. Technavio is reporting forecast growth for a specific digital-English segment; Preply is reporting a broader platform-led online market; Speak is using a company-authored framing that includes offline spend. The right interpretation is directional rather than exact: all credible sources point to a very large and still-growing market, but public data does not isolate a clean, source-verified SAM or SOM for Speak. The more actionable sizing conclusion is qualitative: English remains the single largest language-learning category, APAC remains the most important growth geography, and speaking-first digital products are riding both the long-term demand for English and the short-term adoption of AI-assisted learning.[CM001, CM003, CM004, CM006, CM007, CM010]

TAM / SAM / sizing lens table
Publisher / lensYearGeographyValueMethodologyConfidenceLimitation
Technavio digital English market growth2024-2029GlobalUSD 39.46B growth; 24.5% CAGRForecast growth for digital English learning segmentmediumGrowth increment, not current market size
Technavio APAC contribution2024-2029APAC39% of growthRegional contribution to forecast market growthmediumShows geography, not Speak share
Preply online language learning market2025GlobalUSD 115BPlatform-led report using broader online language-learning lenslowCompetitor-authored and broader than Speak’s exact wedge
Preply English learning segment2025GlobalUSD 43.51B; ~22% annual growthEnglish-only segment estimate within broader online marketlowMethodology not directly comparable to Technavio
MarketsandMarkets AI in education2024-2030GlobalUSD 2.21B to USD 5.82B; 17.5% CAGRAdjacent AI-in-education market sizingmediumToo broad to serve as Speak TAM
Speak company framing2024GlobalUSD 100B+Company-authored framing of online and in-person language learninglowBroadest and least precise lens
TechCrunch English learners2024Global~1.5B learnersCEO quote used as demand-side population lensmediumLearners are not paying customers

The table preserves scope differences instead of collapsing them into a single blended TAM.

[CM001, CM003, CM006, CM007, CM010, CM021]
FM002: Market estimate reference bands

Source-backed reference points for the size of markets adjacent to Speak; repeated low/mid/high values reflect point estimates from different lenses rather than probabilistic scenarios.

These rows compare differently scoped market lenses in the same currency units. They should not be summed or blended into a single TAM for Speak.

[CM006, CM010]
FM003: Spoken-language purchase funnel

Conceptual conversion funnel from broad global English demand to the narrower universe of paid speaking-first products like Speak.

This funnel mixes population and monetization reference points to show narrowing commercial reality rather than a strict unitary forecast. It is a conceptual sizing aid, not a revenue model.

[CM003, CM022]

2.3 Buyer, user, and payer segmentation

Speak’s buyer map splits into two materially different motions. In the consumer motion, the user and payer are usually the same person: a learner paying a recurring subscription for self-directed speaking practice. TechCrunch’s reported $20/month or $99/year list price fits this self-serve model. In the enterprise motion, however, the learner is the user but the employer is the payer, usually buying English training as a productivity, recruiting, mobility, or retention benefit. Speak’s Series C post is explicit that the business product is industry agnostic and already claims 200+ customers. Public market research aligns with that split. Technavio says corporate non-academic learners are a major segment for digital English learning, reflecting the role of English in international business communication. Preply’s 2026 report likewise says English remains the default second language because of business and education demand. Together, these sources suggest that consumer demand is emotion- and aspiration-led, while enterprise demand is budget- and productivity-led. The adoption path also differs by segment. Consumers can trial and subscribe immediately. Employers require HR, L&D, or departmental budget approval and need proof of adoption, seat utilization, and skill improvement. That makes enterprise economics potentially larger per account, but slower and more dependent on renewal evidence than consumer subscriptions.[CM009, CM023, CM024, CM025]

Segment / buyer map
SegmentBuyerUserPayerWorkflow / budget ownerAdoption trigger
Consumer self-serveIndividual learnerIndividual learnerIndividual learnerApp-store subscription or direct in-app upgradeTravel, career mobility, confidence building
Employer-sponsored English trainingL&D lead / HR / business unit managerEmployee learnerEmployerTraining budget, productivity, retention, or global mobility budgetCross-border communication need or workforce upskilling
Student / exam-adjacent userStudent or parentStudent learnerStudent / parent / schoolPersonal education budgetEnglish for education access or credential goals
Live tutor alternative buyerIndividual learner or employerLearnerLearner or employerTutor marketplace or contract budgetNeed for conversation practice without building internal program
Community-course app userIndividual learnerIndividual learnerIndividual learnerGeneral learning-app subscription budgetCheaper or more structured alternative to AI-speaking-first products

Public data is strong on consumer and employer use cases but weak on formal school procurement specific to Speak.

[CM009, CM023, CM024, CM025]
FM004: Buyer budget authority matrix

Maps who approves spend, who uses the product, and what proof is needed across the main buyer segments relevant to Speak.

[CM009, CM023]

2.4 Growth drivers and adoption constraints

The demand case is unusually strong. English remains the most learned language globally, with roughly 1.5 billion total speakers and clear utility in business, education, and mobility. Technavio attributes market growth partly to the flexibility of digital courses, while WEF and MarketsandMarkets emphasize the broader shift toward personalized learning, adaptive content, and AI-powered educational tools. Speak-specific materials add a more practical angle: users want to actually speak, not just memorize. For employers, English-learning budgets can be justified as productivity or workforce-development spend, which strengthens the enterprise thesis. But public sources also show real constraints. WEF highlights access, privacy, bias, and displacement concerns in AI-enhanced learning. World Bank argues that scalable AI adoption depends on connectivity, compute, local data, and skills—conditions that are unevenly distributed across emerging markets. The academic review of AI language tools reaches a similar conclusion: personalized feedback helps, but privacy, evidence quality, and teacher readiness lag. Product-specific reviews sharpen the risk further: both Android Police and Languatalk found Speak strong at motivation but weaker at rigorous pronunciation correction and advanced-level feedback. The practical takeaway is that demand growth is not the same as frictionless adoption. Speak benefits from macro demand, but its segment still has to prove learning efficacy, trusted feedback quality, and acceptable privacy norms to convert interest into durable paid usage.[CM004, CM008, CM012, CM015, CM016, CM017]

Growth drivers and constraints table
Driver / constraintDirectionTimingImplicationDiligence ask
English remains the default language of business and educationDriverCurrentSustains the largest single language-learning demand poolValidate segment demand by geography and language pair
Mobile / self-paced flexibilityDriverCurrentHelps self-serve subscriptions scale without human tutor costMeasure session frequency and retention by cohort
AI-driven personalization and adaptive feedbackDriverCurrentImproves perceived relevance and can raise engagementRequest efficacy studies and completion-rate data
Employer upskilling demandDriverCurrentSupports B2B motion and larger ACVs than consumer subscriptionsRequest enterprise seat counts and renewal rates
APAC growth concentrationDriverCurrentFavors companies with strong Asian market fit and distributionAssess localization and payment-stack strength by market
Digital divide / infrastructure gapsConstraintCurrentLimits adoption in lower-connectivity and lower-income marketsReview offline product capability and country mix
Data privacy, bias, and regulationConstraintCurrentCan slow institutional or employer adoption of AI-based toolsReview privacy architecture and jurisdiction-specific policies
Weak pronunciation accuracy or shallow feedbackConstraintCurrentCan hurt retention once learners move past beginner noveltyCommission independent efficacy and pronunciation benchmarking
No clean public SAM / market-share dataConstraintCurrentMakes valuation narratives easier than underwriting narrativesRequest internal market-share and CAC data

The core diligence divide is not whether demand exists, but whether the market can be monetized efficiently and defensibly.

[CM004, CM008, CM009, CM015, CM016, CM017]

2.5 Exhibits

Chapter 03

03Competitors

3.1 Landscape overview: many ways to solve the same job

Buyers can solve the “help me speak a language fluently” job through at least four classes of products: broad freemium language platforms, specialized AI-speaking apps, live human-tutor networks, and structured course or community-led apps. Speak belongs in the specialized speaking-first cluster, but it competes against all four classes because consumers and employers do not buy product categories—they buy outcomes. That matters: even if Speak’s product is distinctive, it still loses share whenever a buyer decides that Duolingo is good enough, Cambly is more trusted, or Babbel / Busuu are more structured. The public scale hierarchy is clear. Duolingo is the category incumbent with 100M+ MAUs and platform-level bundle power. Babbel and Busuu have much larger installed bases than Speak and compete on structured pedagogy rather than AI novelty. Cambly owns the live-tutor substitute lane. ELSA, Praktika, and Loora prove that AI-speaking competition is now crowded, not greenfield. In this landscape, Speak’s pitch is not “we are the only one,” but rather “we are the most compelling combination of speaking-first UX, AI tutoring, and emerging enterprise distribution.” That is a real position, but not yet a dominant one.[CP001, CP005, CP008, CP011, CP012, CP014]

Competitor profile table
CompetitorCategoryScale / funding signalTarget segmentDifferentiationLimitation
DuolingoIncumbent broad platform100M+ MAUs; top-grossing education appMass-market global learnersScale, freemium funnel, 40+ languagesNot purely speaking-first; can be less specialized on conversation depth
ELSA SpeakDirect specialized speaking rival18M+ downloads; 109K App Store ratingsEnglish learners seeking pronunciation and speaking practicePronunciation-first AI coachingPrimarily English-focused rather than broad multilingual platform
CamblyLive human-tutor substitute24/7 human tutor networkLearners who value native-speaker conversationsHuman accountability and trustHuman cost model can be pricier and less scalable
BusuuCommunity-led structured app120M+ users; 50M+ Play downloadsLearners wanting structure plus community feedbackNative-speaker community and structured reviewLess AI-conversation-centric than Speak
BabbelStructured course app25M subscriptions sold; 50M+ Play downloadsLearners wanting expert-built lessonsPedagogical structure and review loopsLess conversation-first, more curriculum-led
PraktikaAI tutor challenger20M+ learners; low price framingCost-sensitive learners wanting AI-tutor style practiceCheap AI tutor substitutePublic enterprise distribution not evident
LooraAI English tutor challengerEnglish-only AI tutor positioningProfessionals and business English learnersAlways-on English conversation coachNarrower brand footprint than larger incumbents

Where funding or revenue is not publicly disclosed in reviewed sources, the table uses scale proxies such as MAUs, downloads, or subscriptions sold.

[CP005, CP008, CP011, CP012, CP014, CP016]
Feature / capability matrix
Buying criterionSpeakDuolingoELSACamblyBusuuBabbelPraktika / Loora
Speaking-first workflowStrongModerateStrongStrongModerateModerateStrong
Explicit pronunciation correctionModerate / contestedModerateStrongTutor-dependentModerateModerateStrong
Curriculum structureModerateStrongModerateLowStrongStrongModerate
Human conversationNoNoNoYesCommunity onlyNoNo
Multilingual breadthModerateVery strongLowLowStrongStrongLow to moderate
Enterprise / employer motionPresentLimited public evidencePresent but not centralPresentLimitedLimitedSome business messaging, less proven
Feedback depth for advanced learnersContestedModerateModerate to strongTutor-dependentModerateStrongModerate

Ordinal assessments synthesize official positioning and reviewed independent commentary. Unknowns were converted into conservative mid-level assessments only when public evidence existed directionally.

[CP001, CP008, CP011, CP023, CP033]
FP001: Competitive positioning map

Ordinal map of scale/distribution power versus speaking-specialization depth across Speak’s main competitor classes.

Axes are evidence-based ordinal scores derived from public scale metrics, product positioning, and review-based specialization signals; they are not survey data.

[CP005, CP029]

3.2 Competitor profiles and business-model differences

Duolingo is the incumbent benchmark because it combines huge free-user scale, a top-grossing app-store position, and broad language breadth. It is not the closest product analog to Speak’s speaking-first tutoring, but it is the most powerful default substitute when buyers want “an app to learn a language.” ELSA is the more relevant direct benchmark: it is also AI-led, English-focused, and centered on speaking and pronunciation. Its 18M+ download claim and 109K App Store ratings put it much closer to Speak’s scale band than Duolingo’s. Cambly competes from a different labor model. It offers real native-speaker tutors around the clock, which makes it more expensive operationally but also more credible for learners who trust humans over AI. Busuu and Babbel are different again: they sell structure, community reinforcement, and curriculum depth. They are less conversation-first than Speak but more clearly pedagogical. Praktika and Loora show how fast the AI-speaking niche itself is filling in, with lower-price or more English-specific tutor framing. The important pattern is not that any one rival dominates every dimension. It is that Speak faces a portfolio of substitutes, each strong in one area: scale, tutor trust, structure, community, or cheap AI tutoring.[CP005, CP008, CP011, CP012, CP014, CP016]

FP002: Capability breadth map

Shows which competitor class is strongest on scale, tutor trust, structure, community, and AI-speaking specialization.

[CP011, CP032]

3.3 Capability, pricing, and distribution comparison

Public pricing data is incomplete, but the directional picture is enough to matter. Speak’s publicly reported consumer price is roughly $20/month or $99/year. Praktika markets an AI-tutor alternative at around $8/month. ELSA offers monthly and yearly memberships. Duolingo uses freemium scale and only converts about 9% of MAUs into subscribers, which gives it broad reach and pricing flexibility. That mix implies pricing pressure from both ends: low-cost AI tutoring below Speak and gigantic freemium funnel economics above it. On capability, Speak is strongest when the buyer values a smooth speaking-first experience and low social friction. On explicit correction, however, ELSA’s pronunciation-first positioning and Speak’s own adverse reviews suggest ELSA may be stronger. On curriculum depth and review systems, Babbel and Busuu look stronger. On live accountability, Cambly is stronger. On pure distribution, Duolingo is clearly stronger. Speak’s one clearly differentiating commercial angle is Speak for Business. Most public rivals here are still primarily consumer products. But because public materials do not disclose customer names, seat counts, or renewals, the competitive importance of that channel is still more promising than proven.[CP003, CP004, CP006, CP007, CP010, CP030]

Pricing / packaging comparison
CompanyPublic price / modelIncluded positioningUnknownsImplication
Speak$20/month or $99/year (reported)Consumer subscription; speaking-first AI tutorCurrent geo pricing and enterprise ASP not publicMid-priced consumer AI tutor
DuolingoFreemium with paid upsellHuge free funnel; paid removes ads and adds featuresExact current Super/Max pricing varies by regionPuts ceiling pressure on acquisition and willingness to pay
ELSAMonthly and yearly membershipsEnglish speaking and pronunciation coachingCurrent realized ASP unclearDirect subscription benchmark in English niche
CamblySubscription for live tutoringReal native-speaker conversationsExact plan pricing not captured in reviewed source setHuman-trust premium alternative
BusuuFreemium + subscriptionStructured lessons plus communityCurrent plan pricing not captured hereCompetes as a broader value bundle
BabbelSubscription required for full coursesExpert-built structured lessonsExact current checkout prices vary by planStructured-course alternative with strong brand
Praktika~$8/month marketing claimAI tutor substitute for human tutorRealized plan mix and annual pricing unclearLow-price AI pressure on Speak

This table compares public list-price anchors and packaging structure, not realized net pricing or enterprise contract value.

[CP003, CP006, CP010, CP016, CP034]

3.4 Moat durability, switching costs, and the main threats

The publicly visible moat for Speak is moderate. The company has a real product identity—speaking-first AI tutoring—and a potentially valuable second channel in business accounts. Those are positives. But the moat is not obviously durable because every one of those strengths is attackable. Duolingo can outspend and out-distribute. Cambly can win on human trust. ELSA can win on explicit pronunciation coaching. Busuu and Babbel can win on structure and retention loops. Praktika and Loora can compress the AI-speaking niche with cheaper or more focused positioning. Switching costs also look limited from public evidence. Consumers can test multiple apps cheaply, and many likely do. Employers may face more friction once they deploy, but there is no public proof that Speak has exclusive contracts or deep workflow lock-in. Public reviews reinforce the risk: if advanced users feel feedback is shallow or overly forgiving, they have plenty of substitutes. The main near-term threats are therefore not existential technology leaps but slower-moving commercial forces: pricing pressure, distribution bundling, and category commoditization. Speak can still win, but it needs outcomes and enterprise proof more than marketing novelty.[CP020, CP022, CP024, CP025, CP026, CP027]

Moat durability / competitive risk register
Moat claimThreatSeverityMitigation / diligence ask
Speaking-first AI UXRivals match UX patterns and lesson flow quicklymaterialTest real learner outcomes rather than relying on UX distinctiveness
Business distribution channelEmployers multi-home or use Speak as one tool among severalhighRequest customer references, seat counts, and exclusivity terms
Multilingual expansionBroad incumbents still offer more languages and larger ecosystemsmaterialTrack launch cadence and attach rates by new language
Pronunciation and feedback credibilityIndependent reviews say Speak can be too forgivinghighBenchmark speaking accuracy and feedback quality against ELSA and tutors
Brand trust and billing experienceRefund / support complaints weaken willingness to paymediumAudit support SLAs, refund rates, and complaint-resolution metrics
AI niche defensibilityPraktika, Loora, and future entrants compress AI-speaking differentiationhighMonitor pricing moves and feature parity across AI tutor rivals
Incumbent scale pressureDuolingo’s free funnel and app-store dominance inflate acquisition costscriticalMeasure CAC, payback, and comparative organic acquisition resilience

Severity reflects commercial risk to Speak’s differentiation over a two- to three-year horizon.

[CP020, CP025, CP026, CP028, CP030, CP034]
FP003: Moat and readiness KPI scorecard

Compact scorecard for the core strengths and risks behind Speak’s competitive durability.

[CP020, CP041]

3.5 Exhibits

Chapter 04

04Financials

4.1 Revenue model: hybrid on the surface, opaque underneath

Speak’s public financial story is straightforward at a high level and thin underneath. Public evidence supports two monetization rails: consumer subscriptions sold through mobile app stores and enterprise contracts sold through Speak for Business. TechCrunch consistently reports a consumer price point of roughly $20 per month or $99 per year, and the App Store corroborates recurring monthly and annual subscriptions. Meanwhile the business site and Series C materials show that Speak has built an employer-facing offer and claims 200+ brands or customers. That is enough to conclude Speak is not just a consumer app. But it is not enough to quantify revenue mix, realized pricing, or revenue quality. No reviewed public source discloses ARR, payer count, conversion rate, refund rate, churn, or the share of revenue coming from business accounts. The result is a common diligence trap: visible monetization architecture without visible economics. The category context matters. Language learning buyers can spend nothing on a free or exchange-based product, buy a mid-priced subscription like Speak, pay per lesson for human tutoring, or choose other AI tutors. That means Speak’s public price point sits inside a crowded pricing corridor, not above it.[CI001, CI002, CI003, CI004, CI013, CI015]

Revenue streams table
Revenue streamMechanismUnitCurrent public statusQualityDiligence ask
Consumer subscriptionMonthly / annual subscription sold via app storessubscriberConfirmedMedium: list price public, payer count privateSubscriber count, conversion, churn, refunds, share billed on-web vs in-app
Speak for BusinessEmployer / brand contractsaccount / seatConfirmedLow-to-medium: buyer count claimed, economics undisclosedACV, seats, term length, renewal, implementation cost
Language expansion upsideAdditional course catalog monetized through same subscriptionlanguage attach / ARPUPlausible but unquantifiedLowARPU by language, launch cohorts, attach by geography
Ads / sponsorshipAdvertising monetizationn/aNot surfaced in reviewed sourcesLowConfirm whether any ad revenue exists
Marketplace / transaction feesTake rate on tutors or other third partiesn/aNot surfaced in reviewed sourcesLowConfirm whether any take-rate or partner revenue exists

Rows distinguish confirmed revenue mechanisms from plausible but unsupported ones; missing streams are kept explicit rather than inferred away.

[CI001, CI004, CI013, CI034]
Pricing / monetization table
Company / modelPublic price / unitList vs realizedUnknownsSource / implication
Speak consumer~$20/month or $99/yearList pricingPromo discounts, geo pricing, realized net pricing unknownMid-priced AI subscription benchmark
Speak businessContracted enterprise pricingNot publicSeat pricing, contract length, services, discounts unknownPotentially higher-quality revenue if enterprise is material
DuolingoFreemium with paid upsellList and model known, realized mix partly publicCurrent regional prices and margin by tier unknownStrong low-price anchor for the category
CamblyPer-lesson tutoring plans from about $8.12/lesson annuallyList pricingTutor mix, utilization, net take unknownHuman tutoring can command higher unit spend
italkiMarketplace lessons with visible trial prices from $5List pricing per tutorTake rate, repeat frequency, full lesson ASP unknownTutor marketplace creates flexible spend ladder
HelloTalkFree exchange / VIP ecosystemFree front door emphasizedVIP monetization details not captured in reviewed setZero-price substitute caps casual willingness to pay
Praktika~$8/month AI tutor claimMarketing price pointAnnual plan and realized ASP unknownLow-price AI pressure on Speak

This table compares public price anchors and monetization structures, not realized ARPU or contribution margin.

[CI002, CI019, CI020, CI021, CI025, CI038]
Unit economics table
MetricPublic value / statusConfidenceWhy it mattersDiligence ask
Consumer list price~$20/month or $99/yearmediumStarting point for LTV but not enough on its ownConfirm current pricing by geo and discount path
Payer countUnavailablelowNeeded to convert downloads into revenueRequest current and trailing-12-month payer counts
Gross marginUnavailablelowCore test of AI delivery economicsRequest margin by channel and compute / platform-fee breakdown
CAC / paybackUnavailablelowDetermines efficiency of growthRequest acquisition channel mix, CAC, payback, and enterprise sales cycle
Refund rate / support dragUnavailable publicly; adverse anecdotal signals existlowImportant for consumer revenue qualityRequest refund rate, chargebacks, support backlog, and SLA metrics
Enterprise ACV / seat economicsUnavailablelowDetermines whether business improves the modelRequest ACV, seats, term, implementation cost, and renewal rate
Retention / NRRUnavailablelowMost important revenue-quality testRequest cohort retention, gross retention, and NRR by segment

The sparse public table is the point: most economically decisive metrics remain private.

[CI002, CI015, CI016, CI027, CI031]
FI001: Revenue model bridge

Qualitative bridge from user acquisition and employer acquisition into Speak’s two visible revenue rails.

The flow is qualitative because public sources confirm the rails but not conversion rates, seat counts, or realized net revenue.

[CI001, CI004, CI030]

4.2 Public traction and pricing proxies: useful signals, weak underwriting inputs

Public traction is real. Speak advertises 15M+ downloads, the App Store shows a large ratings base, and Google Play shows six-figure review volume. TechCrunch also reports 10-20 minutes of average daily usage. These are meaningful signals that the product is widely distributed and actively used. But public traction is not the same thing as revenue. Download counts do not tell us how many users are active, how many convert, or how long they stay. Review counts do not show refund rates or net billings. Daily usage does not reveal whether engagement is coming from free-trial users, paying subscribers, or enterprise-sponsored seats. The same caution applies to peer pricing pages. Competitor list prices are useful for triangulating category willingness to pay, but they do not show realized ASPs or contribution margins. Still, the pricing corridor is informative. Speak looks mid-priced relative to low-cost AI tutors, clearly below the spending required for repeated human tutoring, and above free or freemium substitutes. That supports monetization plausibility while also highlighting the limits of pricing power.[CI005, CI006, CI015, CI017, CI018, CI019]

4.3 Cost structure, unit economics, and capital adequacy

From public evidence, Speak appears capital-light in the manufacturing sense but not necessarily cheap to operate. There is no visible inventory, hardware, or project-finance burden. The company looks like a software and content business. However, AI tutoring introduces cost layers that a static course app does not fully bear: inference, speech processing, feedback generation, localization, support, and continued content expansion. Those cost drivers likely matter more than capex. App stores also introduce platform dependency into both distribution and collection. On the business side, any meaningful enterprise motion likely adds sales, onboarding, and customer-success costs. None of that is inherently problematic—but none of it is directly quantified in public sources either. CAC, payback, gross margin, and retention remain private. Capital adequacy is therefore a scenario question. The company has clearly raised meaningful recent capital and has not surfaced public distress signals. But because current cash and burn remain undisclosed, the public record cannot distinguish between “comfortably funded” and “funded but still dependent on another round once hiring or enterprise expansion accelerates.”[CI008, CI010, CI011, CI012, CI027, CI028]

Capital adequacy table
ItemPublic statusCurrent viewWhy it mattersNext-round / diligence trigger
Recent fundingConfirmed~$98M gross announced across recent Series B-3 and CProvides near-term financing supportReconcile announced rounds with cap table and net proceeds
Form D corroborationPartially confirmedForm D tracker shows $77.7M offering related to late-2024 financingImproves confidence that the announced raise closed materiallyRequest official closing docs and proceeds schedule
Cash on handUnavailableUnknownMost direct runway inputRequest current balance and restricted cash
Monthly burnUnavailableUnknownDetermines runway speedRequest monthly burn and budget by function
Runway monthsUnavailableScenario onlyPublic sources cannot underwrite itBuild base / downside / expansion cases from actual burn
Debt / project financeNo public evidence surfacedLikely none, but not fully verifiedDebt can change risk and flexibilityRequest debt schedule, leases, and contingent obligations

This chapter intentionally focuses on adequacy, not a full historical round chronology already covered elsewhere.

[CI008, CI009, CI010, CI012, CI033, CI035]
Public financial gaps table
Missing metricImpact on diligenceExact diligence path
Revenue / ARR / billingsCannot underwrite scale or growth qualityRequest audited or board-level trailing-12-month revenue bridge
Subscriber count and conversionCannot link downloads to revenueRequest subscriber funnel, trial conversion, and plan mix
Gross margin and platform-fee mixCannot assess AI delivery economicsRequest COGS by compute, content, support, and app-store fees
Enterprise ACV / seats / renewalsCannot assess quality of B2B streamRequest top-customer contract summary and renewal history
Burn and runwayCannot assess financing dependencyRequest monthly cash balance and forecast burn scenarios
Refund / churn / support qualityCannot assess consumer revenue leakageRequest refund rate, churn by plan, and complaint-resolution metrics

The exact diligence path is part of the artifact so the missing-data problem is operational, not rhetorical.

[CI007, CI014, CI027, CI036]
FI002: Unit economics bridge

Shows the public inputs that matter economically even when values are missing.

Most nodes are publicly known concepts rather than public numbers; the bridge exists to make the missing metrics explicit.

[CI015, CI027, CI029]
FI003: Public disclosure completeness range

Ordinal range view showing where Speak’s public financial story is relatively visible and where it is almost blank.

These are evidence-backed ordinal scores, not management KPIs. They summarize how much public disclosure exists: pricing and fundraising are relatively visible, while revenue and runway remain largely undisclosed.

[CI002, CI008, CI007, CI012, CI036]
FI004: Capital intensity / cash-flow map

Maps the main cash drivers that appear likely from public evidence.

The matrix is qualitative and evidence-backed by product and distribution model, not by disclosed cost accounts.

[CI028, CI033, CI037]

4.4 Financial verdict: promising monetization architecture, limited revenue proof

The financial verdict is cautiously positive on architecture and cautious-to-negative on transparency. Speak clearly has monetizable consumer and enterprise surfaces, recent institutional financing, and product engagement that should support some recurring revenue. It is also operating in a category where consumers already pay for tutoring, subscriptions, and exam-prep outcomes, so the existence of demand is not the question. The problem is that the public record validates structure more than performance. Valuation climbed from $500M to $1B on the back of rapid fundraising momentum, but financial disclosure did not keep pace. Public evidence does not tell us how much revenue the business generates, what margins look like after app-store fees and AI delivery costs, how sticky subscribers are, or whether business accounts materially improve economics. That leaves too much of the underwriting case resting on funding headlines, user proxies, and market enthusiasm. For diligence, the next step is not finding more marketing pages. It is obtaining the internal financial packet: ARR and billings, payer counts, gross margin by channel, refund and churn data, enterprise ACVs, sales efficiency, and a real runway model.[CI032, CI034, CI035, CI036]

4.5 Exhibits

Chapter 05

05Product & Technology

5.1 Product surface and learner workflow

Speak’s public product story is unusually coherent: the app is sold as a speaking-first AI tutor, not a vocabulary game or grammar-reference tool. Across the homepage, app-store listings, and product posts, the same workflow repeats: learners enter a structured lesson, speak target phrases out loud, receive immediate corrective feedback, and then apply what they practiced in freer AI conversations. That workflow is now formalized as the Speak Method’s Learn → Practice → Apply loop. In practice, the visible module set includes Tutor Lessons, speaking drills / speaking cards, roleplays, free-form conversations, progress tracking, and newer review or proficiency features such as unit refreshers and Speak Level. The product surface is still primarily mobile-first, but it is no longer just an English-learning app; public help content shows six target languages for learners and 15 native-language entry points for English instruction. The main strategic takeaway is that Speak has a tight workflow fit between pedagogy and product packaging: every major module is ultimately in service of getting the learner to speak, compare output, and try again rather than passively consume content.[CE001, CE002, CE004, CE005, CE006, CE008]

Product module / asset matrix
Module / assetPrimary userStatus / maturityDifferentiationDiligence gap
Tutor LessonsNew and progressing learnersCore / shippingDynamic AI tutor can correct, answer questions, and redirect rather than just play canned audioNo public completion or learning-outcome benchmarks by module
Speaking drills / speaking cardsAll learners in structured lessonsCore / shippingSpeaking-first repetition plus in-house matching and phonetic alignment instead of generic quiz mechanicsPublic evidence is internal-only on matching accuracy and false-positive control
Live RoleplaysLearners moving into open-ended practiceShipping, but originally limited rollout in late 2024Realtime voice conversations combine OpenAI speech-to-speech with Speak proficiency graph, hints, and objectivesPublic rollout breadth by market / plan is not quantified
Free Talk / custom conversationsSelf-directed learners and higher-intent practiceShippingOpen-ended AI conversations personalize scenarios and let users create practice around their own contextsIndependent reviews say conversation control and feedback depth still trail serious-learner needs
Progress / review features (Unit Refreshers, Speak Level, side quests)Active learners returning between sessionsShipping / staged by marketAdds retention loops and measurable fluency tracking on top of speaking workflowSpeak Level was still limited to English learners in select markets entering 2026

Public surfaces show a coherent module stack, but several higher-order quality claims remain internal or rollout-limited.

[CE004, CE005, CE006, CE008, CE016, CE027]
Workflow / use-case table
User jobCurrent workflow / pain pointSpeak solutionPublicly observable benefitLimitation / caveat
Start speaking quicklyTraditional apps over-index on reading, grammar, or vocabulary recognitionSpeaking-first onboarding with immediate repetition and feedbackHomepage and reviews repeatedly describe learners speaking from day 1Benefit is mostly anecdotal rather than third-party efficacy-tested
Practice realistic conversationsHuman tutors are expensive and hard to scheduleRoleplays and Free Talk simulate common scenarios with AI partnersUsers and independent reviews both highlight real-world conversation practiceSome reviewers say the AI can feel repetitive or over-questioning
Correct pronunciation or deliveryGeneric ASR misses accented learner speechCustom ASR plus matching stack, with speech-to-speech where audio nuance mattersSpeak reports faster feedback and lower WER after backend upgradesMost performance numbers are internal and not externally benchmarked
Resume after breaks / stay motivatedLanguage apps often lose users between sessionsUnit Refreshers, streak systems, side quests, and Speak Level progress viewsWinter 2025 release explicitly added retention and progress surfacesSome app-store reviews still ask for richer rewards and more gamified loops
Learn another language as an English speakerPrior Speak product focused on English learners in Asia2025 language expansion added French, Japanese, Korean, and Italian after SpanishPublic help pages now document six target languagesIntermediate depth outside flagship tracks remains incomplete

The workflow is strongest where Speak can keep the learner in a tight speak-hear-correct loop; open-ended mastery and breadth remain less verified.

[CE001, CE004, CE008, CE009, CE015, CE018]
FE001: Learner experience stack

This stack focuses on how learner-facing modules, pedagogy logic, speech feedback systems, and support operations layer together to create Speak’s speaking-first UX.

[CE001, CE004, CE008, CE016, CE029, CE033]

5.2 Architecture and operating model

Speak now discloses more technical detail than many consumer education apps. The 2024 ASR overhaul moved the company away from fragmented on-device and third-party speech systems toward a unified backend stack. Speak says that stack fine-tunes Conformer-CTC on learner speech, serves inference with Nvidia Riva / Triton on Kubernetes and Google Cloud, and uses gRPC plus websocket streaming to return speech feedback quickly enough for lesson interactions. In 2025, Speak added Matching v2, which combines streaming ASR with a phonetic model and forced alignment; this matters because learner speech often breaks the assumptions built into generic ASR. By early 2026, Speak layered a broader voice-agent platform on top: mobile apps connect over WebRTC via LiveKit Cloud; voice-agent servers orchestrate external ASR, LLM, TTS, and speech-to-speech providers; and the system chooses cascade or speech-to-speech by feature rather than forcing one universal stack. That architecture implies real technical depth, but also real operating complexity. Speak is explicitly multi-provider, region-aware, and latency-obsessed, with public discussion of failover, tail-latency monitoring, semantic turn detection, and provider switching when quality or availability degrades.[CE010, CE011, CE012, CE013, CE014, CE015]

Technology / operating architecture table
Layer / process / componentRoleKey dependencyPrincipal risk
Mobile iOS / Android clientsCapture learner audio, play responses, host lesson UXNative mobile apps; OS permissions; current versionsNo desktop fallback and microphone permissions can break voice workflows
Realtime transportLow-latency bidirectional audio between clients and backendWebRTC via LiveKit Cloud; regional routingExternal platform dependency and cross-region latency variation
Voice-agent runtimeRuns lesson logic, turn-taking, hints, corrections, and orchestrationLiveKit Agents plus Speak application logicComplexity grows as feature count and provider permutations expand
Speech recognition and matchingTranscribe learner speech and map it to lesson targetsFine-tuned Conformer-CTC, Nvidia Riva/Triton, wav2vec2 phonetic model, forced alignmentInternal metrics are strong but externally unaudited; accent edge cases still exist
LLM / TTS / speech-to-speech providersGenerate tutor responses and voice outputOpenAI Realtime API, external LLM/TTS providers, backup-provider routingMulti-provider cost, reliability, and model-quality variance
Learning engine and content systemsStore lessons, proficiency graph, conversation state, analyticsSpeak backend services and curriculum systemsLimited public evidence on data governance, evals, and versioning discipline
Observability and failoverMeasure latency / errors and reroute traffic when providers degradePer-provider metrics, backup routing, region-aware operationsPublic disclosures do not quantify uptime or incident history

Speak’s technical disclosures indicate a real production ML stack rather than a thin wrapper around one model vendor.

[CE011, CE012, CE016, CE017, CE019, CE020]
Roadmap / release / development-stage table
Date / stageFeature or milestoneStatusImplicationSource
2024-06Backend ASR overhaulShippedCore speech feedback moved to a stronger unified backend and internal learner-speech tuningSpeak ASR blog
2024-10Live Roleplays with Realtime APIShipped, initially limited rolloutMoves Speak closer to natural real-time conversation and heavier OpenAI dependencySpeak Live Roleplays blog
2024-11Google Play Best App recognition in HK / Korea / TaiwanShipped / external recognitionSuggests traction in core Asian markets where Speak startedSpeak Google Play blog
2025-06French, Japanese, Korean, and Italian launches for English speakersShippedBroadens TAM beyond English-learning use caseSpeak new languages blog
2025-12Winter release: adaptive lessons, refreshers, side quests, audio roleplays, Speak LevelShipped / partially stagedAdds retention and proficiency surfaces rather than only new lessonsSpeak winter release blog
2026-03Voice Agent Platform disclosureShipped platform capability, still evolvingSignals continuing investment in infra, turn detection, and richer speech-to-speech usesSpeak voice agent platform blog
2026 ongoingMore intermediate courses, more languages, broader Speak Level rolloutIn developmentBreadth and maturity remain moving targets rather than fully complete productsSpeak help center and release notes

Speak ships frequently, but several high-value roadmap items remain explicitly in development or market-limited.

[CE008, CE009, CE015, CE019, CE027, CE049]

5.3 Trust, quality, and support controls

On trust and operational quality, Speak’s public evidence is mixed but directionally better than marketing-only surfaces. The company exposes concrete support workflows: in-app issue flags, troubleshooting guides, microphone-permission checklists, and email escalation for unresolved issues. Public store listings also disclose basic privacy and platform controls such as encrypted-in-transit data handling, deletion requests, minimum OS support, and mobile-only availability. Curriculum quality control is also more explicit than usual for an AI app: Speak says lessons are authored by learning designers, AI is used to accelerate production, and humans still review the end product. The weaker side of the trust picture is external friction. Independent review sources praise voice quality and speaking-centric design, but they also report shallow feedback depth, confusing premium tiering, laggy voice recognition episodes, refund complaints, and support delays. More importantly for diligence, Speak’s public privacy and terms URLs did not render readable policy text in text-only review, and we did not locate public SOC 2, ISO 27001, or similar security assurance artifacts. That leaves a real enterprise-trust gap even though consumer support mechanics are clearly visible.[CE029, CE030, CE031, CE032, CE033, CE034]

Trust / quality / compliance table
Control / disclosure / quality signalStatusScopeGap or implication
Curriculum written by learning designers and human-reviewedPresentApplies to lessons and curriculum creationGood quality signal for content integrity, but no external pedagogy audit found
In-app bug report flag plus support email workflowPresentLesson issues, account issues, troubleshooting escalationsShows operational support process, but support is email-first and may scale unevenly
Voice-recognition troubleshooting guidancePresentMicrophone permissions, cache clearing, OS updates, Samsung/Bixby conflictsConfirms the company expects speech capture failures to happen in the field
Data handling disclosure on Google PlayPresentStates sharing of app activity/device IDs, collection of personal and financial info, encryption in transit, deletion requestsUseful baseline disclosure, but not a substitute for a detailed privacy or security program
Privacy and terms web pagesPartially inspectablePublic URLs exist from store listingsText-only review returned JS shells, reducing direct diligence visibility
Security certifications / audits / trust centerNot found publiclyNo public SOC 2, ISO 27001, penetration-test summary, or uptime page retained in chapter evidenceMaterial diligence gap if underwriting enterprise expansion or regulated buyers

Trust evidence is adequate for a consumer app review but incomplete for deeper enterprise or privacy diligence.

[CE029, CE033, CE034, CE035, CE036, CE043]
FE003: External platform dependency map

The most material public dependencies are not every internal component, but the external platforms and interfaces that can bottleneck performance, privacy review, or scale.

[CE019, CE027, CE031, CE034, CE036, CE047]
FE004: Product maturity / capability map

Speak looks strongest in core speaking UX and underlying speech infrastructure; public evidence is weaker on formal trust artifacts and advanced-learner depth.

[CE018, CE033, CE034, CE036, CE042, CE043]

5.4 Differentiation, maturity, and open diligence asks

Speak’s strongest product-level differentiation is not just “AI tutor” branding; it is a purpose-built speech stack paired with pedagogy that appears customized for language learners rather than retrofitted from a generic chatbot. Public claims around learner-accent ASR, phonetic matching, multi-provider TTS selection, and feature-specific voice-pipeline choices support that view. External corroboration from OpenAI, TechCrunch, app stores, and independent reviewers suggests the market sees the same pattern: a highly polished speaking product with strong user love and genuine technical ambition. The maturity signals are credible but should not be over-read. Ratings and download counts are strong, app versions are shipping frequently, and Google Play recognition suggests real distribution in Asia. At the same time, several underwriting questions remain open: how deep the company’s security/compliance program is, whether the richer voice stack is economically efficient at scale, how much of the roadmap is still rollout-limited, and whether independent learning-outcome evidence exists beyond internal product metrics and customer anecdotes. In short, Speak looks product-strong and technically differentiated, but public evidence is much stronger on capability and UX than on formal trust/compliance disclosure.[CE023, CE024, CE037, CE038, CE039, CE040]

FE002: Learn → Practice → Apply operating flow

This flow isolates Speak’s pedagogical loop rather than inventorying modules: introduce language, drill it aloud, then apply it in live conversation and review.

[CE005, CE031, CE039, CE045]

5.5 Exhibits

Chapter 06

06Customers

6.1 Customer base and segmentation

Public evidence suggests that Speak still looks first and foremost like a self-serve consumer language app, with an emerging employer-paid overlay rather than a purely enterprise business. The homepage and app-store listings emphasize direct individual learning, six core language tracks, AI speaking practice, and app-store scale. That self-serve layer appears broad: official and third-party surfaces point to more than 10 million learners by mid-2024 and roughly 15 million downloads by late 2025 to 2026. Geography also matters. South Korea was Speak's inaugural market, the company has publicly said nearly 6% of Korea's population was learning English with the app in 2024, and TechCrunch reported well over 100,000 Korean subscribers as early as 2023. At the same time, Speak has expanded far beyond that origin, with official and third-party reporting pointing to 40+ countries and language offerings spanning English, Spanish, French, Italian, Japanese, and Korean. The newer B2B motion appears to use employers as payer, employees as primary users, and English upskilling as the flagship use case.[CU001, CU002, CU003, CU004, CU005, CU006]

Customer segmentation table
SegmentBuyer / user / payerUse caseScale evidenceRevenue / strategic valueGap
Self-serve consumer learnersIndividual / learner / individualSpeaking practice for travel, work, school, and everyday conversation15M+ downloads on Speak homepage; 44K iOS ratings; 112K Google Play reviewsCore consumer subscription engine and top-of-funnel for brand spreadNo public free-to-paid conversion or payer mix
Korea English-learning baseIndividual or employer / learner / individual or employerEnglish fluency and confidence in Speak's origin marketNearly 6% of Korea population claim in 2024; well over 100K Korean subscribers in 2023Explains early density, review volume, and enterprise seed demandCurrent Korea share of revenue and users is undisclosed
International multi-language consumersIndividual / learner / individualSpanish, French, English, Italian, Japanese, and Korean speaking practice40+ countries by 2024 plus six core course tracks on the homepageSupports global growth beyond Korea and broader cross-sellNo country-by-country user or revenue breakout
Employer-sponsored B2B teamsEmployer or L&D / employee / employerBusiness English and workforce fluency200+ brands on B2B page; 200+ business customers and 85% employee adoption in Series C postHigher-ARPU expansion path layered on top of consumer demandNo public ACV, seat count, or renewal data
Named corporate subscriber baseEmployer / employee / employerEmployee language benefit or upskilling programForbes says ~500 companies including KPMG and HD Hyundai offered subscriptions, mostly in KoreaSuggests real procurement penetration beyond pilotsNo public roster, case studies, or deployment outcomes for most logos

Public evidence points to a consumer-first base with a meaningful but still under-disclosed employer-sponsored layer.

[CU002, CU004, CU005, CU006, CU009, CU012]
Customer growth / adoption trajectory table
MetricValueDateSourceConfidenceImplicationMissing denominator
Learners / users10M+2024-06-18 to 2024-06-20Speak Series B + TechCrunchmediumBy mid-2024 Speak had already reached meaningful global scaleNo split between free, paid, or active users
Downloads15M+2026-05-05 accessSpeak homepagemediumCurrent top-of-funnel scale appears materially larger than the 2024 user disclosureNo install-to-activation denominator
App Store rating volume44K ratings2026-05-05 accessApple App StorehighLarge iOS review surface indicates ongoing consumer reachNo rating distribution by country or payer type
Google Play review volume112K reviews / 10M+ downloads2026-05-05 accessGoogle PlayhighAndroid adds large-scale public validation of adoptionNo DAU/MAU or paid-user denominator
Lines spoken3.74B2025-12-19Speak Wrapped 2025mediumStrong signal of repeat speaking activity rather than one-time installsNo unique-user denominator
Hours practiced in app19.6M hours2025-12-19Speak Wrapped 2025mediumShows large aggregate learning time and habit formationNo hours per paying user or per cohort
Lessons started231M2025-12-19Speak Wrapped 2025mediumIndicates scaled recurring lesson consumptionNo completion-rate denominator
Personalized lessons created80.3M2025-12-19Speak Wrapped 2025mediumSuggests users actively use adaptive or personalized pathsNo share of users engaging with personalization
Business customers200+2024-12-10Speak Series C + DataconomymediumConfirms B2B is more than a landing page experimentNo breakdown by segment, geography, or contract size
Companies offering employee subscriptions~5002025-11-12ForbeslowImplies wider corporate distribution than official case-study count suggestsNo active-seat or paid-logo denominator

These metrics are adoption and activity proxies, not retention cohorts or revenue-quality metrics.

[CU004, CU009, CU010, CU015, CU017, CU018]
FU001: Customer journey map

Speak's public journey runs from individual discovery and first speaking session to habit formation, app-store proof, and eventual employer-sponsored expansion for some users.

[CU001, CU004, CU007, CU026, CU028]
FU002: Adoption / deployment flow

Public evidence supports a repeatable flow from consumer discovery to repeated speaking activity, with a narrower branch into employer-sponsored deployment.

[CU004, CU010, CU013, CU017, CU018, CU019]

6.2 Named proof and adoption signals

The strongest public proof is not a roster of enterprise case studies; it is a combination of scaled review surfaces and duplicated named learner testimony. Apple and Google show large rating and review counts, while Speak's own review page republishes identifiable customer comments that also appear on public app-store surfaces. That duplication matters because it reduces the risk that every quote is wholly synthetic, even if the official page is still positively selected. The clearest named examples are consumer learners. j herronov says months of French use improved comprehension of French media and made free talk and bookmarking valuable. Dan S says more than six months of Spanish use beat prior tools because feedback was detailed and instantaneous. Rosalyn Mulder says Speak's tutor-like feedback and streaks made it her preferred option over Duolingo and Mango. These are still self-reported outcomes, but they are specific, recent, and visible across independent distribution channels. On the enterprise side, the proof is thinner but not nonexistent: Speak says it has 200+ business customers, and Forbes reported roughly 500 companies, including KPMG and HD Hyundai, offered subscriptions to employees, mostly in South Korea.[CU005, CU008, CU015, CU016, CU023, CU024]

Named customer proof table
CustomerSegmentDeployment / use caseProduction vs pilotOutcomeLimitation
j herronovConsumer French learnerUsed Speak for a few months to improve listening, free talk, and bookmarks for real conversationActive consumer useSays the app improved ability to catch words and sayings in French media and made personalized practice usefulSelf-reported outcome; no independent proficiency audit
Dan SConsumer Spanish learnerUsed Speak for 6+ months as a primary speaking-feedback toolActive consumer useSays detailed and instantaneous feedback made Speak better than prior Spanish-learning optionsSingle reviewer and no quantified before/after benchmark
Rosalyn MulderConsumer Spanish learnerAndroid user comparing Speak with Duolingo and MangoActive consumer useSays tutor-like feedback and streaks improved motivation and made Speak her preferred optionMotivational proof only; no retention or spend disclosure

Sample only. Publicly accessible named proof is much richer for consumer reviewers than for enterprise logos.

[CU031, CU032, CU033, CU048]
FU003: Customer proof matrix

Speak has strong public consumer proof and aggregate enterprise counts, but weak public visibility into retention and concentration.

[CU008, CU016, CU029, CU043, CU044, CU045]

6.3 Durability, expansion, and concentration risk

The key diligence problem is that customer-quality evidence is much weaker than adoption evidence. Public materials show downloads, ratings, lessons, lines spoken, and even an employer adoption percentage, but they do not disclose NRR, GRR, churn, contract length, cohort retention, or top-customer concentration. That means Speak can credibly show who tries and likes the product, but not yet how durable those customers are in revenue terms. The monetization surface also introduces friction points that matter for churn analysis. App-store pricing shows self-serve subscriptions ranging from roughly $18 monthly to $84 annual premium plans and higher premium-plus tiers, while independent complaint pages surface refund, auto-renewal, language-coverage, and speech-recognition complaints. None of that proves systemic weakness, but it does mean the public record is still better at showing acquisition and usage than renewal quality. The enterprise expansion story is also plausible but incomplete: Forbes says the push began when consumers asked employers to pay, which is a healthy land-and-expand signal, yet the public corpus still lacks a named case-study library, contract terms, or a geography split that would show whether B2B revenue is still heavily concentrated in Korea. Public evidence also does not bridge from headline activity metrics to active payer counts, so investors still need the basics: paid-user cohorts, annual-plan renewal curves, refund rates, and logo-level enterprise retention. Until those are disclosed privately, the right reading is that Speak has demonstrated demand and habit, but not yet fully auditable customer durability across cohorts, logos, or geographies globally.[CU016, CU026, CU028, CU036, CU037, CU039]

Retention / repeat usage / satisfaction table
MetricValueSegmentConfidenceDiligence ask
App Store satisfaction4.8iOS consumer usershighRequest rating distribution by country, language track, and payer status
Google Play satisfaction4.7Android consumer usershighRequest rating distribution and uninstall / refund linkage
Employee adoption85Speak for Business employeesmediumRequest cohort definition, seat base, and time window behind the metric
Growth durability proxyUsers doubled yearly for five years through 2024All usersmediumRequest paid-user retention to separate growth from churn masking
Repeat usage proxy3.74B lines spoken in 2025All usersmediumRequest active-user denominator and cohort retention curve
Public NRREnterpriselowRequest NRR by contract cohort and geography
Public GRR / churnAll segmentslowRequest logo churn, seat churn, and involuntary billing churn
Public contract length / renewalsSpeak for BusinesslowRequest median term, renewal rate, and expansion rate by logo

This table substitutes for the planned cohort figure because no reviewed public source provided time-bucket retention percentages suitable for a cohort chart.

[CU016, CU023, CU024, CU040, CU045, CU046]
Expansion and concentration risk table
Expansion driverConcentration / friction riskImpactDiligence path
Consumer-to-employer upsellB2B adoption may still be heavily seeded by Korea-based consumer demandCould overstate global enterprise repeatability if Korea remains the dominant proving groundRequest enterprise ARR, seats, and renewals by country
Speaking-first differentiation and strong ratingsPublic ratings do not show whether paid cohorts renew at attractive ratesAdoption quality may look better than revenue durabilityRequest annual-plan renewal curves and refund rates
More countries and more languagesLanguage-catalog complaints suggest some users want unsupported languages or better localizationCan cap TAM in adjacent learner segments and raise churn risk after initial trialRequest language roadmap, waitlist size, and retention by new language
Employer-sponsored adoption metric85% employee adoption is promising but lacks context on seat counts and contract valueCould reflect a small, highly engaged subset rather than scaled enterprise expansionRequest logo-level seat activation and ACV distribution
Self-serve subscription pricingAuto-renewing subscriptions and refund complaints can create support and involuntary churn burdenConsumer churn and brand sentiment may be more volatile than ratings implyRequest refund, cancellation, and chargeback metrics by store
Aggregate enterprise countNo public roster for the 200+/500-company base means concentration cannot be checkedA few large logos could drive a disproportionate share of B2B revenueRequest top-10 logo revenue share and sponsor concentration

The public record supports expansion narratives, but not a clean concentration bridge.

[CU026, CU028, CU029, CU037, CU041, CU042]
Independent review and complaint surface table
SurfaceSignal typeWhat it showsBalanceLimitation
Apple App Store reviewsCustomer-proofLarge recent review stream with strong ratings and some concrete feature feedbackMostly positive in the accessible sampleNo full rating-distribution or refund data
Google Play reviewsCustomer-proofLarge Android review base plus visible complaints on recognition and language handlingMixed positive and negative examples are visibleNo payer-status or retention linkage
Speak official review pageCurated customer-proofDuplicates real named reviews and makes them easy to inspectPositively selected because it is limited to 5-star App Store reviewsCannot stand alone as a balanced satisfaction read
LanguaTalk reviewIndependent reviewStructured critique of feedback depth, lesson variety, and speech-recognition leniencyAdverseReviewer sample size is small and editorial
JustUseApp reviewsIndependent complaint aggregationRefund, language-coverage, and missing-content complaints appear publiclyAdverseAggregation quality is weaker than first-party store data

This table isolates review-surface quality so the retention table can stay focused on durability metrics.

[CU008, CU036, CU037, CU039, CU040, CU041]

6.4 Exhibits

Chapter 07

07Risks

7.1 Legal, regulatory, and consumer-protection risk

Speak is not facing an obvious public enforcement event today, but its legal surface is more complicated than a generic language app because the product is voice-first, AI-mediated, global, and subscription-driven. Google Play says the app may share app activity and device identifiers with third parties and may collect personal and financial information, while also offering encryption in transit and deletion requests. Apple rates the app 13+, but Google rates it Everyone, which creates an ambiguous minor-facing posture for a service that invites users to speak naturally into an AI tutor. That matters because FTC guidance says COPPA obligations attach if a service is directed to children under 13 or has actual knowledge of collection, and because AI Act and GDPR obligations can intensify when AI systems become more consequential or less transparent. On top of privacy, billing rights are fragmented: Apple controls App Store refunds, Google imposes strict transparency and cancellation rules, and Speak’s own help center draws fine distinctions between cancellation and refunds. The residual legal risk is therefore manageable but real: one part regulation, one part consumer-trust plumbing.[CR003, CR004, CR005, CR008, CR010, CR012]

Regulatory / legal risk register
rule / casejurisdictionstatuslikelihoodseveritymitigationresidual exposurediligence path
Voice-data, minor-facing, and parental-consent exposureUS / globalActive if Speak is used by under-13 users or knowingly collects their personal datamediumhighApple and Google surface age / privacy disclosures; Google says data can be deleted and is encrypted in transitmedium-highRequest age-screening logic, child-data controls, voice-retention schedule, and parent-consent workflow by market.
AI Act / GDPR transparency and assessment obligationsEUPhased in from Aug 2024 onwardmediumhighSpeak clearly markets the product as AI-powered and current use appears focused on tutoring rather than regulated decisionsmediumMap every EU-facing AI feature to transparency, logging, DPIA / FRIA, and special-category-data assumptions before scaling assessments or richer speech analytics.
Auto-renew, refund, and recurring-billing complianceUS / EU / app storesCurrent and ongoingmedium-highmedium-highHelp-center flows exist and Apple / Google each provide their own cancellation and refund controlsmediumTest actual disclosure screens, trial-to-paid conversion notices, store-specific chargeback rates, and whether cancellation is obvious in each client.
Cross-border transfer and third-party disclosure riskEU / globalOngoingmediumhighStore disclosures already surface some sharing, collection, encryption, and deletion controlsmedium-highObtain the full privacy policy, terms, DPA, subprocessor list, SCC or transfer framework, and any model-training exclusions for voice transcripts.

Rows are ordered by residual severity and focus on public legal exposures most likely to matter to an investor in a global voice-AI subscription app.

[CR003, CR004, CR005, CR008, CR010, CR012]

7.2 Operational quality and platform-dependency risk

Operationally, Speak depends on the quality of speech recognition and conversational latency in a product where users notice every miss immediately. Speak built custom ASR because earlier third-party speech-recognition services struggled with accented learner speech, and it now runs that stack on Nvidia GPU inference inside Kubernetes on Google Cloud. That creates a stronger product than a purely off-the-shelf stack, but also a narrower operational base: latency, throughput, or cloud-vendor issues can translate directly into weaker tutoring quality. The same pattern appears in OpenAI dependence. Speak publicly uses GPT-4, GPT-4o, and the Realtime API; it also admits speech-to-speech models still lag text models on instruction following and nuanced coaching. Independent review sources reinforce that this is not just a theoretical risk. Complaints cite laggy recognition, missed words in the middle of sentences, noisy-environment failures, battery drain, device heating, and occasional recording or progress-saving bugs. Strong product love exists, but the operational bar is high because users are paying for an experience that feels immediate, accurate, and fair.[CR022, CR023, CR024, CR025, CR026, CR027]

Operational / quality / security risk register
failure modelikelihoodseveritymitigation maturityresidual exposureunresolved gap
Recognition misses accented or continuous speech, undermining feedback qualitymedium-highhighmedium — Speak built custom ASR after third-party systems underperformedhighNo public accuracy / error-budget disclosure by language or environment.
Realtime roleplays depend on speech-to-speech models that Speak still says are weaker than text models for nuanced coachingmediumhighlow-medium — strong product work, but admitted model limits remainmedium-highNo public fallback routing or quality-threshold logic by feature.
Long sessions can overheat devices or degrade in noisy environmentsmediummediumlow — review aggregators surface the issue but no telemetry is publicmediumNo hardware-session or battery-performance data is public.
Billing and support disputes can turn product love into chargebacks and trust damagemedium-highmedium-highmedium — extensive help-center coverage existsmedium-highNo public refund-rate, chargeback-rate, or first-response-time metrics.
Localization and onboarding glitches can blunt expansion into new languages and marketsmediummediummedium — demand is strong and rollout is ongoingmediumNo public launch-readiness scorecards or country-level retention breakdowns.

This register focuses on issues that can degrade the lived learning experience even when the underlying product concept is strong.

[CR023, CR024, CR025, CR026, CR027, CR036]
Partner / dependency risk register
dependencycounterpartyroleconcentrationfailure scenarioseveritymitigationresidual exposure
Conversational model stackOpenAIGPT-4, GPT-4o, Realtime API, and related tutoring featureshighAPI pricing, policy, or uptime changes degrade roleplays, feedback quality, or marginshighSpeak has its own learning engine and some custom ASR, not a pure wrapperhigh
Inference / ASR infrastructureGoogle Cloud + NvidiaCustom ASR serving, GPUs, Kubernetes runtimehighCloud outage, GPU constraint, or inference-cost spike slows or weakens feedback loopshighCustom model ownership gives some flexibilityhigh
iOS distribution and billingApple App StoreAcquisition, ranking, and in-app subscription railmedium-highPolicy or billing friction reduces conversion, refunds, or visibility on a critical platformmedium-highApple handles refunds and cancellations through standard flowsmedium-high
Android distribution and billingGoogle PlayAndroid acquisition, ranking, data-safety disclosures, and subscriptionsmedium-highPolicy shifts or disclosure problems slow Android growth or increase trust frictionmedium-highGoogle provides clear cancellation expectations and verified reviewsmedium-high
Private capital supportVenture investorsFunding and valuation support for scale and model investmentmediumIf growth or economics soften, next financing may become harder at a premium markhighSpeak still has active backers and recent capitalmedium-high

Even strong product companies become fragile when billing, AI quality, infrastructure, and financing all route through a small set of counterparties.

[CR002, CR022, CR023, CR026, CR029, CR033]

7.3 Financial-model and execution risk

Speak’s financial risk is less about visible distress than about opaque economics behind an expensive-to-run product. Public sources show impressive momentum: over 10 million users by mid-2024, more than 40 countries, a Series C at a $1 billion valuation by late 2024, and continued investor support from OpenAI, Khosla, Accel, and Y Combinator. But the same public record says Speak sometimes builds experiences that are cost-prohibitive today in anticipation of cheaper models later, and independent reporting notes that custom LLM ambitions can be expensive. That means gross margin, inference cost per active learner, refund leakage, and support intensity matter a great deal. Those figures are not public. NicheMetric offers directional app-revenue and download estimates, but not with enough transparency to close underwriting questions. Execution risk compounds this uncertainty: the company scaled from South Korea into more than 20 and then 40-plus countries while still expanding languages, hiring, and product scope. Positive reviews suggest the bet is working, yet requests for more languages and better support imply the operating system around the tutor may still be catching up to demand.[CR002, CR029, CR030, CR031, CR032, CR033]

People / execution risk register
role / functiondependency or gaplikelihoodseveritymitigationdiligence path
ML / ASR / LLM engineeringCustom ASR plus OpenAI-powered tutoring requires scarce systems and model talentmediumhighRecent capital and active hiring suggest the company is still investing behind the stackRequest org chart, attrition, on-call design, and vendor-ownership map across ASR and roleplay features.
Support and billing operationsApple, Google, and direct-web payment flows create multilingual support complexitymedium-highhighHelp-center coverage is broad and currentReview first-response times, chargeback handling, escalation runbooks, and localization coverage.
Content and localization operationsMore languages and more countries increase lesson QA, policy, and onboarding complexitymediumhighUser love is strong and expansion has already worked in multiple countriesReview retention and refund cohorts by language, region, and app-store channel.
Enterprise / business offeringSpeak for Business expands support, contract, and reliability expectations beyond the consumer appmediummedium-highPublic evidence shows a business edition existsRequest enterprise pipeline, SLA commitments, security questionnaire responses, and support staffing.

This table ranks execution risk by the degree to which scaling complexity can outpace the current public operating system.

[CR033, CR048, CR049, CR052, CR053]

7.4 Mitigations, monitoring indicators, and thesis-break triggers

The good news for Speak is that the product already shows evidence of real user love, large-scale adoption, and a willingness to publish some customer-support and technical detail. That lowers the chance that the main risks are purely hidden disasters. The more realistic underwriting question is whether those mitigations mature as quickly as the company’s ambition. Strong ratings and positive speaking-feedback reviews help, encryption and deletion controls help, and documented cancellation / refund flows help. But none of those substitute for full privacy-policy visibility, enterprise reliability commitments, or hard unit-economics evidence. The most important monitors are practical: store-review clusters around recognition or billing, chargeback and refund rates, any inability to show clear retention and data-use controls for voice data, and any OpenAI or cloud cost shift that forces a weaker product or margin reset. For an investor, the thesis does not break on one bad review week. It breaks if recurring operational complaints, compliance gaps, or missing economics start to overwhelm a product that is currently winning because it feels better than the alternatives.[CR005, CR008, CR010, CR013, CR022, CR033]

Mitigation and kill criteria table
riskmonitorable triggerthreshold / eventaction implication
Subscription trust / billing frictionRefund requests, chargebacks, and billing-related review spikesMeaningful multi-week jump after a pricing or trial-flow changePause aggressive acquisition and rework store / onboarding disclosures before scaling spend.
Recognition qualityClusters of reviews citing missed words, lag, or rushed speaking windowsPersistent issues across more than one major language or platform releaseHold language expansion until accuracy and pacing recover.
Voice-data / child-data complianceInability to furnish retention, deletion, or age-screening controls to diligence or regulatorsMissing DPIA / retention package or a regulator / platform noticeDowngrade the underwriting case until compliance documentation is complete.
OpenAI / model dependenceMaterial API price increase, latency regression, or policy constraintA sustained margin hit or degraded live-roleplay experienceReset valuation assumptions and require fallback / routing proof.
Cloud / inference concentrationRepeated Sev-1 inference or speech-serving disruptionsMore than one major outage without credible failover evidenceRequire multi-region or multi-vendor resilience plan before underwriting premium growth.
Financing / valuation supportNew round or secondary process implies flat / down pricing without metric proofUnable to show durable retention, margins, and refund disciplineTreat the current valuation as fragile and avoid paying ahead of verified economics.

These triggers translate a broad diligence narrative into a concrete monitoring framework for investors.

[CR005, CR010, CR014, CR023, CR026, CR033]
FR001: Risk heatmap

Speak’s highest residual risks sit where AI/voice regulation, app-store billing, and model / cloud dependence overlap with a premium consumer experience.

[CR003, CR005, CR008, CR010, CR016, CR019]
FR002: Risk transmission map

Speak’s main risks transmit through a handful of channels: compliance, billing trust, model quality, margin, and valuation support.

[CR003, CR008, CR010, CR022, CR023, CR024]
FR003: Dependency map

Speak’s consumer AI product depends on a compact set of external rails for models, cloud inference, distribution, billing, and support resolution.

[CR001, CR002, CR009, CR022, CR026, CR033]

7.5 Exhibits

Chapter 08

08Valuation

8.1 Recommendation and price discipline

Speak has enough public operating proof to justify serious diligence, but not enough price support to justify a clean buy at the last public $1B mark. Official disclosures show valuation moved from $500M in June 2024 to $1B in December 2024, while later reporting points to more than $100M of annualized revenue, 15M downloads, 10M+ Google Play installs, and a real enterprise footprint. That is materially stronger evidence than most consumer-edtech startups can show. The problem is that the available price anchor still implies a high-single-digit to roughly 10x revenue multiple, which remains richer than current public education and subscription-learning comps. Duolingo is the only public peer with comparable growth and product quality, yet even it screens near 4x EV/revenue on current market data. Because the published valuation is older than the latest revenue milestone and the cap table, preference stack, and retention profile remain undisclosed, the prudent call is research-more with medium confidence, high risk, and a stretched valuation stance. Entry only becomes attractive if updated diligence shows stronger recurring economics than public evidence currently proves, or if entry price is materially below the last public mark.[CV001, CV006, CV012, CV015, CV017, CV021]

Recommendation summary table
DimensionAssessmentEvidence-backed reasonDecision implication
Recommendationresearch-moreReal product traction exists, but public evidence does not yet fully support price-sensitive underwriting at the last $1B mark.Proceed only with full private diligence package.
ConfidencemediumFunding, revenue, pricing, app traction, and comp data are all visible, but retention and cap-table economics are not.Treat public view as a screen, not an IC-ready close.
Risk ratinghighOutcome depends on continued premium growth plus undisclosed dilution / preference terms.Underwrite downside explicitly before any term-sheet decision.
Valuation stancestretchedImplied private multiple sits above public edtech comp ranges and requires a premium for better growth quality.Seek price concession or unusually clean economics.
Target underwriting>=3x gross MOIC over 4-5 yearsAt the last public mark, that target likely requires a faster scale-up than current public evidence alone proves.Prefer entry below the last round or with strong downside protections.

Target return / hold framing is an underwriting discipline, not a sourced market quote; it is included because chapter 08 requires a decision implication.

[CV006, CV012, CV050, CV051, CV053, CV057]
FV001: Recommendation logic

Flow from observed traction and market tailwind to valuation caution and a research-more recommendation.

[CV012, CV017, CV021, CV041, CV042, CV051]
FV004: Investment KPIs

IC-style 0-10 scorecard translating the chapter evidence into decision factors.

Scores are ordinal analyst judgments (0-10) synthesized from the cited claims. They are not sourced company KPIs and are used only to summarize investment quality across chapter dimensions.

[CV012, CV017, CV021, CV041, CV042, CV046]

8.2 Scenario range and comparable frame

The public-market anchor says Speak deserves a premium to struggling edtech names, but the premium must be earned through unusually durable growth and monetization. Duolingo still posts healthy growth, positive margin, and a multi-billion market cap, yet its EV/revenue is roughly 4x; Coursera, Udemy, and Chegg sit far below 1x EV/revenue. That spread matters because Speak is still private and investors are being asked to fund into a company with less disclosure than any of those names. The upside case is still meaningful: AI tutors and online language learning both remain large and growing markets, and Speak’s app rankings, pricing, and enterprise traction suggest it is building a more serious learning product than a generic vocabulary game. Still, MMR’s market work flags freemium competition and CAC pressure, and Oliver Wyman shows how quickly AI-exposed software multiples can reprice when markets question seat growth, moat durability, or revenue quality. That is why the scenario frame uses conservative public-comp ranges rather than extrapolating an open-ended AI premium. A bear outcome compresses toward $400M-$650M, a base case holds around $800M-$1.0B, and a bull case requires revenue to move well beyond $150M with broader enterprise conversion and continued app-store strength.[CV025, CV029, CV033, CV037, CV041, CV042]

Thesis / anti-thesis table
LensThesisAnti-thesisWhat would change the view
DemandLarge AI-tutor and language-learning markets support continued category growth.Big markets do not prevent CAC inflation or competition from free / freemium tools.Show cohort-level retention and efficient acquisition by geography.
MonetizationStorefront pricing and >$100M annualized revenue suggest consumers will pay for a premium speaking product.Public evidence does not show conversion durability or whether revenue concentration is too consumer-heavy.Provide conversion funnels, payback, and renewal metrics.
Enterprise200+ customers in Dec 2024 and ~500 employers in late 2025 imply B2B expansion optionality.Logo counts are not the same as durable ARR, expansion, or gross-margin proof.Disclose enterprise ARR, renewal rates, and top-customer concentration.
Relative valuationA premium to weaker public edtech peers can be justified if Speak behaves more like a category leader than a commodity app.Current comp ranges still imply the $1B mark bakes in a lot of future execution.An updated financing at similar price with better disclosure would improve the call.
[CV009, CV012, CV041, CV042, CV043, CV044]
Bull / base / bear scenario table
ScenarioAssumptionsValuation / return logicKey risksProbability signal
BearGrowth slows, app momentum softens, and investors value Speak closer to public edtech ranges.$400M-$650M valuation; a $1B entry would likely lose capital before dilution and preferences.Multiple compression, paywall friction, weak enterprise conversion.More likely if app-store momentum fades or sector sentiment weakens again.
BaseRevenue scale holds around current run-rate trajectory, enterprise expands but remains unproven, and premium over public comps narrows but does not vanish.$800M-$1.0B valuation; a last-round entry produces limited upside unless terms are unusually clean.Mix / retention opacity, financing need, inability to defend premium.Most consistent with current public evidence.
BullRevenue scales well beyond $150M, enterprise penetration broadens, and Speak keeps premium app momentum with stronger disclosed economics.$1.1B-$1.4B valuation; upside exists, but still depends on private data confirming durable monetization.Execution miss, competitive imitation, weaker-than-expected renewal quality.Needs evidence beyond what public sources currently show.

Scenario ranges are analytical estimates derived from published revenue milestones, current public comps, and 2026 software multiple risk commentary; they exclude undisclosed preference effects.

[CV012, CV021, CV025, CV029, CV033, CV037]
Comparable valuation table
ComparableMetricMultiple / valuation / statusRelevanceLimitation
DuolingoTTM revenue / EV-revenue / market cap$1.04B revenue; 4.01x EV/revenue; $5.15B market capBest public quality benchmark for premium language-learning software.Much more mature, profitable, and disclosed than Speak.
CourseraTTM revenue / EV-revenue / market cap$789.84M revenue; 0.42x EV/revenue; $0.98B market capShows how general online-learning platforms price when growth and margins are weaker.Business mix is broader than speaking-led consumer language learning.
UdemyTTM revenue / EV-revenue / market cap$773.9M revenue; 0.38x EV/revenue; $0.68B market capUseful consumer/creator education benchmark for slower-growth subscription learning.Marketplace economics differ from Speak’s owned-curriculum model.
CheggTTM revenue / EV-revenue / market cap$376.91M revenue; 0.32x EV/revenue; $0.12B market capHard downside anchor for what education assets can trade at after product / moat deterioration.Chegg is an adverse comp, not a target peer.

Sample of current public anchors used for price discipline; private AI-tutor comparables remain too opaque to enumerate exhaustively from open sources.

[CV025, CV027, CV028, CV029, CV031, CV032]
FV002: Valuation sensitivity

Illustrative enterprise-value outcomes from different revenue / multiple combinations anchored to public evidence.

Values are illustrative USD millions. They are derived from the publicly reported >$100M annualized revenue milestone plus observed public comp multiples, then rounded to decision-useful anchors. They are not company guidance or transaction quotes.

[CV012, CV025, CV029, CV033, CV037, CV050]
FV003: Valuation / return range

Bear/base/bull valuation ranges for Speak using public evidence only.

Values are illustrative USD millions based on public revenue milestones, current listed-peer multiples, and 2026 software multiple-risk commentary. They exclude undisclosed dilution, debt, or liquidation preference effects, so investor return ranges remain partial.

[CV050, CV051, CV054, CV055, CV056]

8.3 Upgrade triggers, thesis breaks, and final asks

The path from research-more to buy is straightforward but evidence-heavy. Speak needs to show that enterprise traction is not just logo count but recurring, expanding revenue with solid renewal quality. It also needs to prove that consumer scale is translating into retention and cohort economics strong enough to defend a premium multiple above public edtech. The main thesis-breaks are observable: app-store momentum rolling over, consumer paywall friction worsening, enterprise adoption failing to translate into disclosed ARR support, or the broader AI-software multiple resetting lower again. Regulatory and market-structure risk also matter because generative-AI platforms can become concentrated around key model, data, or compute owners, limiting downstream moats for application-layer companies. A financing window can also close quickly if markets stop rewarding AI narratives before Speak has disclosed enough economics to separate itself from weaker subscription-learning assets. Put differently, Speak looks like a credible business, but the underwriting gap is around return architecture rather than product existence. Without the cap table, unit economics, and segment mix, the investor is still pricing a narrative premium. With them, the company could merit track or better; without them, the correct posture is disciplined diligence rather than conviction pricing. The upgrade path is therefore concrete today: prove segment-level retention, show enterprise renewal quality, and demonstrate that premium pricing survives competitive pressure.[CV009, CV013, CV022, CV046, CV047, CV053]

Thesis-break and kill triggers table
TriggerThresholdTransmission to thesisAction implication
App momentum fadesDownload / ranking data lose premium positioning across core markets for multiple monthsWeakens evidence that Speak is still winning consumer attention at premium price points.Re-cut base case toward public-comp ranges.
Enterprise proof stallsNo disclosed ARR, renewals, or expansion despite growing employer footprintTurns enterprise story into a narrative add-on rather than a value-supporting second engine.Do not pay a premium for B2B optionality.
Sector multiple compression deepensAI/software risk reprices again and premium subscription names de-rate furtherShrinks valuation ceiling even if operations remain solid.Demand wider discount to last round or stand aside.
Terms are investor-unfriendlyHeavy preferences, ratchets, or hidden dilution appear in the 2024-2026 financing stackCan eliminate return even if enterprise value holds near base case.Pass or insist on protective entry terms.
[CV020, CV022, CV048, CV049, CV053, CV058]
Final diligence asks table
TopicMissing evidenceWhy it mattersOwner or diligence path
Cap table / preferencesNo public waterfall, option pool, or liquidation preference detailReturn math can diverge sharply from enterprise value growth.Company counsel + lead investor data room.
Segment mixNo public split of consumer versus enterprise revenue or geographyPremium valuation depends on quality and diversification of revenue.Finance team revenue bridge and board materials.
Cohort economicsNo public conversion, retention, or CAC payback disclosureNeeded to judge whether revenue can compound efficiently.Growth / finance analytics export by cohort.
Enterprise qualityNo public contract duration, renewal, or expansion data for 200-500 employer footprintNeeded to validate bull-case durability and downside protection.Sales ops pipeline review plus top-customer diligence calls.
[CV009, CV013, CV022, CV046, CV047, CV053]

8.4 Exhibits

Disclaimer

This report is a public-evidence diligence snapshot, not investment advice. Important financial, legal, technical, and contractual facts remain non-public and should be verified directly with management and primary documents before any investment decision.

Evidence index

Claims
IDStatementConfidenceSources
CO001 Official and third-party company profiles place Speak (Speakeasy Labs) founding in 2016. Medium SO007, SO012, SO016
CO002 Speak is headquartered in San Francisco, California. Medium SO009, SO011, SO016, SO013
CO003 Speak markets itself as an AI language tutor centered on getting learners speaking out loud with instant feedback. Medium SO001, SO004
CO004 By May 2026, Speak publicly promoted courses for French, Spanish, English, Korean, Italian, and Japanese. Medium SO001, SO004, SO008
CO005 Speak homepage marketing in May 2026 claimed 15M+ downloads and a 4.8 rating. Medium SO001
CO006 The iOS App Store listing showed 44K ratings and a 4.8 score on 2026-05-05. Medium SO004
CO007 The Google Play listing showed 112K reviews and a 4.7 score on 2026-05-05. Medium SO005
CO008 Speak announced a $20M Series B-3 financing at a $500M valuation on 2024-06-18. High SO006, SO009
CO009 Speak announced a $78M Series C financing at a $1B valuation on 2024-12-10. High SO007, SO010, SO011
CO010 Speak said total funding reached $162M after the Series C round. Medium SO007, SO010
CO011 Accel led Speak’s Series C, with OpenAI Startup Fund, Khosla Ventures, and Y Combinator among participating investors. High SO007, SO010, SO011
CO012 HolonIQ listed Speak as joining the global EdTech unicorn list in December 2024 at a $1B valuation. High SO015, SO007
CO013 By June 2024, Speak said it had more than 10 million learners in over 40 countries. Medium SO006, SO009
CO014 TechCrunch reported Speak had a 75-person workforce across San Francisco, Seoul, Tokyo, and Ljubljana in June 2024. Medium SO009
CO015 Speak’s careers page names Seoul, Ljubljana, and San Francisco offices and presents the company as globally distributed. Medium SO003
CO016 The careers page also contains a historical snapshot claiming a 60-person team and more than $60M raised. Low SO003
CO017 Speak said it launched Speak for Business between the June and December 2024 funding rounds. Medium SO007, SO002
CO018 The current B2B page says 200+ brands rely on Speak for Business. Medium SO002
CO019 Speak’s Series C post said its enterprise offering had more than 200 customers and an 85% employee adoption rate. Medium SO007
CO020 Speak reported that users had already spoken more than one billion sentences in 2024. Medium SO007
CO021 Speak’s June 2025 product update announced four new languages for English speakers—French, Japanese, Korean, and Italian—after the earlier Spanish release. Medium SO008
CO022 The June 2025 post says Speak started by teaching English in Korea, Japan, and Taiwan and had more than 15 million learners globally. Medium SO008
CO023 Speakeasy Labs filed a Form D on 2024-12-11 for a $77,699,277 equity offering under Rule 506(b), with first sale on 2024-11-13. High SO013, SO014
CO024 The filing page lists Connor Zwick, Andrew Hsu, Colton Gyulay, Alex Berkenkamp, and Ben Quazzo among related persons and shows a 100 Pine Street San Francisco address. Medium SO013
CO025 The public filing list shows additional Speakeasy Labs Form D filings dated 2024-08-12 and 2023-10-17. Medium SO014
CO026 Speak’s June 2024 announcement said learners speak about 1,000 times on average in their first week. Medium SO006
CO027 Speak’s June 2024 announcement said an updated speech recognition model reduced word error rate by over 60% and improved speed by 20% versus existing commercial systems. Medium SO006
CO028 Speak’s December 2024 announcement said it created more than 25 million personalized lessons during 2024. Medium SO007
CO029 The founders publicly identified in late-2024 coverage are Connor Zwick (CEO/co-founder) and Andrew Hsu (CTO/co-founder). High SO010, SO012
CO030 Accel partner Ben Quazzo joined Speak’s board as part of the Series C round. High SO007, SO010
CO031 TechCrunch’s June 2024 article described Speak as launched in 2014, conflicting with the company’s 2016 founding references elsewhere. Low SO009
CO032 Android Police concluded in February 2026 that Speak’s voice-driven learning can give users a false sense of mastery because it misses basic pronunciation errors. Medium SO017, SO005
CO033 JustUseApp’s review aggregation labeled overall customer experience 67.1% negative and highlighted recurring complaints about billing, lag, and recognition. Low SO018
CO034 Languatalk’s 2026 review said Speak is polished for beginners but feedback depth, lesson variety, and premium tier clarity remain weak points. Medium SO019
CO035 GetLatka reports Speak reached $100M revenue and 253 employees in 2025, but those figures are not corroborated by official Speak disclosures. Low SO016
CO036 The current official homepage and app listings show Speak has expanded from an English-only product into a broader multi-language learning app. High SO001, SO004, SO008
CO037 Speak positions English learning as the initial wedge but now frames the product as a broader language tutoring platform for consumers and employers. Medium SO002, SO007, SO008
CM001 Speak’s June 2024 financing announcement described the addressable market as a $100B+ online and in-person language learning market. Medium SM002
CM002 TechCrunch reported Speak initially focused on English because it is the world’s most popular language for learning. Medium SM013
CM003 TechCrunch quoted Speak’s CEO saying roughly 1.5 billion people are trying to learn English. Medium SM013
CM004 Preply’s 2026 report says English is the most learned language because of its global role in business and education. Medium SM011
CM005 Preply states English has about 1.5 billion total speakers worldwide. Medium SM011
CM006 Technavio says the digital English language learning market will grow by USD 39.46B from 2024 to 2029 at a 24.5% CAGR. Medium SM005
CM007 Technavio identifies APAC as the largest regional market and says it will contribute 39% of forecast growth. Medium SM005
CM008 Technavio says increased flexibility from digital language courses is a primary growth driver for the market. Medium SM005
CM009 Technavio says corporate non-academic learners are a major digital English customer segment, with roughly 30% opting for digital courses. Medium SM005
CM010 MarketsandMarkets projects the broader AI-in-education market to grow from USD 2.21B in 2024 to USD 5.82B by 2030 at a 17.5% CAGR. Medium SM006
CM011 MarketsandMarkets says North America held a 43% share of the AI-in-education market in 2024. Medium SM006
CM012 MarketsandMarkets says personalized learning and content management accounted for 34.5% of the AI-in-education market in 2024. Medium SM006
CM013 The same report names Duolingo and ELSA Speak among notable AI-in-education players. Medium SM006
CM014 Stanford HAI reports four out of five U.S. high school and college students now use AI for schoolwork. Medium SM007
CM015 WEF says AI can automate or augment up to 20% of educator clerical tasks. Medium SM008
CM016 WEF also says equitable access, data privacy, bias, and teacher displacement are major constraints on AI-in-education adoption. Medium SM008
CM017 World Bank says low- and middle-income countries face steep challenges adapting or deploying AI at scale. Medium SM009
CM018 World Bank frames the foundations of scalable AI adoption as connectivity, compute, context, and competency. Medium SM009
CM019 The 2021 systematic review says most AI language tools used machine learning and natural language processing for error identification, feedback, and assessment. Medium SM010
CM020 The same review concludes AI language tools improved learner abilities but raised privacy and teacher-preparation concerns. Medium SM010
CM021 Preply estimates the global online language learning market reaches about $115B by the end of 2025. Low SM011
CM022 Preply estimates the English learning segment is worth about $43.51B in 2025 and growing around 22% annually. Low SM011
CM023 Speak’s current product spans both consumer self-serve subscriptions and employer-sponsored learning through Speak for Business. Medium SM001, SM003
CM024 Speak’s Series C post says English learning is industry agnostic and its business product already had 200+ customers. Medium SM003
CM025 TechCrunch reported Speak’s consumer list price as $20/month or $99/year in 2024. High SM014, SM013
CM026 Duolingo’s 2024 annual report says it serves more than 100M monthly active users across 40+ languages and only about 9% of MAUs are paid subscribers. Medium SM017
CM027 Duolingo’s 2024 annual report says the app is the top-grossing education app globally on both Apple and Google app stores. Medium SM017
CM028 ELSA positions itself as a specialized AI English speaking coach with 18M+ downloads and 460K+ ratings on its website. Medium SM018
CM029 ELSA’s App Store listing shows 109K ratings and monthly and yearly memberships, reinforcing a premium subscription model for speaking practice. Medium SM019
CM030 Cambly competes with Speak from a different labor model: real conversations with native speakers available 24/7 rather than AI-only tutoring. Medium SM020
CM031 Busuu emphasizes community feedback from native speakers and says it has 120M+ registered users. Medium SM022
CM032 Busuu’s App Store listing positions the product as community-driven learning rather than AI-first conversation tutoring. Medium SM021
CM033 Babbel’s app-store surfaces emphasize expert-built structured lessons, with 25M subscriptions sold and 50M+ Google Play downloads. Medium SM023, SM024
CM034 Praktika markets an AI-tutor model with 20M+ learners and a much lower-cost alternative to private human tutors. Medium SM025
CM035 Android Police argued Speak can give learners a false sense of mastery because its pronunciation scoring is overly lenient. Medium SM015
CM036 Languatalk found Speak strongest for early learners but weaker for serious learners who need deeper feedback and broader lesson variety. Medium SM016
CM037 Speak’s June 2025 post says summer travel increases language-learning interest and ties learning motivation to travel identity. Medium SM004
CM038 Preply says progress still depends on access, indicating affordability and digital infrastructure remain barriers even as demand rises. Medium SM011
CP001 Speak’s core competitive claim is speaking-first AI tutoring rather than broad textbook-style language learning. Medium SP001, SP002
CP002 By 2026 Speak was no longer English-only: official surfaces advertise six live learning tracks and a broader multi-language ambition. Medium SP001, SP033
CP003 Speak’s current consumer price anchor remains about $20/month or $99/year in public reporting. High SP005, SP006
CP004 Speak for Business claims 200+ customers or brands and gives Speak a second distribution path beyond direct-to-consumer subscriptions. Medium SP004, SP023
CP005 Speak has materially smaller public scale than Duolingo, which reported 100M+ MAUs and 40+ languages in its 2024 annual report. Medium SP009, SP001
CP006 Duolingo’s freemium model relies on huge free-user scale with only about 9% of MAUs paying, making it a powerful low-cost substitute for Speak. Medium SP009
CP007 Duolingo is the top-grossing education app globally on Apple and Google app stores, reinforcing its distribution advantage over smaller challengers. Medium SP009
CP008 ELSA is a direct specialized speaking competitor focused specifically on English pronunciation and conversation rather than broad multilingual learning. Medium SP010, SP011
CP009 ELSA publicly claims 18M+ downloads and 460K+ ratings on its website. Medium SP010
CP010 ELSA’s App Store listing shows 109K ratings and monthly/yearly memberships, signaling a scaled subscription business in the same English-speaking niche. Medium SP011
CP011 Cambly competes using human native-speaker conversations available 24/7, making it a live-tutor substitute rather than an AI-only product. Medium SP012
CP012 Busuu competes with a community-feedback model and says it has more than 120M registered users. Medium SP013, SP014
CP013 Busuu’s Play listing shows 50M+ downloads and 1.13M reviews, reflecting broad consumer reach even without an AI-first pitch. Medium SP015
CP014 Babbel competes from the structured-course end of the market and says it has sold 25M subscriptions. Medium SP016
CP015 Babbel’s Play listing shows 50M+ downloads and 1.12M reviews, giving it much larger installed-base distribution than Speak. Medium SP017
CP016 Praktika is an AI-tutor competitor claiming 20M+ learners and a roughly $8/month price point positioned against human tutors. Medium SP018
CP017 Loora competes as an always-available AI English tutor with business-English and real-time feedback positioning. Medium SP019
CP018 MarketsandMarkets names both Duolingo and ELSA as AI-in-education players, suggesting the competitive set spans broad platforms and specialized speaking tools. Medium SP020
CP019 HolonIQ’s January 2026 addition of Preply to the EdTech unicorn list signals that language learning remains a venture-funded, still-fragmenting category. Medium SP021
CP020 Speak’s strongest relative differentiation is still its speaking-first UX and AI tutoring focus, not raw scale. Medium SP001, SP005, SP007
CP021 Speak’s multi-language expansion narrows one historical weakness against incumbents, but its public breadth still trails broader platforms like Duolingo, Busuu, and Babbel. Medium SP001, SP009, SP013, SP016, SP033
CP022 Cambly’s human-tutor model means Speak is not only fighting apps; it is also fighting live conversation as the trusted premium substitute. Medium SP012, SP005
CP023 Busuu and Babbel compete on structure, community, and breadth rather than pure AI conversation, giving budget-conscious users alternatives to Speak. Medium SP013, SP016, SP017
CP024 Praktika and Loora show that AI-speaking competition is no longer niche; multiple challengers now market conversation practice as a human-tutor replacement. Medium SP018, SP019
CP025 Android Police described Speak as heavily inspired by Duolingo, reducing the novelty moat around its course structure and gamification. Medium SP007
CP026 Android Police also found Speak’s pronunciation scoring too forgiving, which weakens one of the product claims that should be most defensible versus broad language apps. Medium SP007
CP027 Languatalk judged Speak polished for beginners but weaker for serious learners who want deeper feedback, better review loops, and richer advanced practice. Medium SP008
CP028 JustUseApp’s complaint aggregation indicates billing, refund, and support issues can damage trust even when the top-line app rating remains strong. Medium SP024, SP002, SP003
CP029 ELSA’s and Speak’s app-store footprints are much closer to each other than either is to Duolingo’s scale, making ELSA a more relevant specialized benchmark than Duolingo alone. Medium SP002, SP003, SP010, SP011, SP009
CP030 Speak’s B2B motion is a real differentiator versus most consumer-only rivals, but the public record still lacks seat counts, customer names, and renewal rates. Medium SP004, SP023
CP031 Preply’s 2026 report reinforces that English remains the most learned language because of business and education demand, which favors all major competitors rather than Speak alone. Medium SP022
CP032 Babbel and Busuu both emphasize more structured pedagogy and community reinforcement than Speak’s free-conversation-centered positioning. Medium SP013, SP014, SP016
CP033 ELSA emphasizes pronunciation, role-plays, and bilingual support, making it especially strong where buyers want more explicit correction than Speak’s reviews suggest it delivers. Medium SP010, SP011, SP007
CP034 Praktika uses a lower-price, AI-tutor framing that could pressure Speak if speaking practice becomes commoditized. Medium SP018, SP005
CP035 Duolingo’s enormous free-user funnel and top-grossing status mean it can pressure smaller players on both acquisition cost and user expectations. Medium SP009
CP036 Cambly, Busuu, Babbel, Duolingo, ELSA, Praktika, and Loora together cover live tutoring, community correction, structured coursework, and AI conversation—meaning buyers have multiple non-Speak ways to solve the same job. Medium SP009, SP010, SP012, SP013, SP016, SP018, SP019
CP037 There is no reviewed public source giving a clean market-share ranking for Speak versus these rivals, which is itself a diligence gap. Low
CP038 Duolingo’s official homepage still leads with a free value proposition and very broad course catalog, reinforcing its role as the default low-cost substitute. Medium SP025, SP032
CP039 Cambly’s own pricing page highlights one-on-one and Pro tutoring plans with native speakers, showing that the human-tutor substitute is productized rather than bespoke. Medium SP027, SP028
CP040 Preply and italki extend the substitute set beyond apps into tutor marketplaces, while HelloTalk extends it into peer-to-peer exchange, increasing buyer choice around the same fluency job. Medium SP029, SP030, SP031
CP041 The most durable moat visible publicly is distribution into business accounts plus speaking-first UX, but that moat looks moderate rather than dominant because rivals match on either scale or tutor quality. Medium SP004, SP009, SP012, SP018
CI001 Speak’s public monetization architecture is hybrid: consumer subscriptions in the app stores plus employer-facing contracts through Speak for Business. Medium SI002, SI004
CI002 Speak’s public consumer price anchor is consistently described as about $20 per month or $99 per year. High SI007, SI008
CI003 The App Store listing corroborates that Speak sells monthly and annual auto-renewing subscriptions rather than a pure one-time purchase. Medium SI002
CI004 Speak for Business publicly claims 200+ brands rely on the product, indicating a real but still sparsely disclosed B2B revenue line. Medium SI004, SI006
CI005 Speak has strong public engagement proxies—15M+ downloads, 44K App Store ratings, and 112K Google Play reviews—but none of these disclose paying subscribers or ARR. Medium SI001, SI002, SI003
CI006 TechCrunch reported users spend roughly 10-20 minutes per day in Speak, which supports engagement but not monetized retention. Medium SI008
CI007 There is no reviewed public source disclosing Speak revenue, ARR, gross margin, net retention, or cash balance. Low
CI008 Recent officially announced fundraising totals roughly $98M gross across the June 2024 Series B-3 and December 2024 Series C rounds. High SI005, SI006
CI009 The Series B-3 announcement attached a $500M valuation to Speak, and the Series C announcement attached a $1B valuation six months later. High SI005, SI006
CI010 The Speakeasy Labs Form D reviewed via FilingFlow lists a total offering amount of $77.7M with first sale on November 13, 2024, broadly matching the announced Series C size. Medium SI009
CI011 The SEC-filing list implies recurring private financing activity, but it does not disclose cash still on hand after those raises. Medium SI010
CI012 Because Speak does not publish balance-sheet data, any runway assessment remains scenario-based rather than underwritten from public statements. Medium SI005, SI006, SI009
CI013 Speak’s revenue mix between consumer subscriptions and enterprise contracts is not publicly quantified. Low
CI014 Enterprise pricing, seat counts, contract length, and renewal data for Speak for Business are absent from reviewed public sources. Low
CI015 Revenue quality cannot be judged confidently from list price alone because public sources do not reveal free-trial conversion, refund rate, churn, or realized discounts. Medium SI002, SI007, SI008
CI016 Public complaint aggregation points to refund and support friction, which could weigh on net consumer revenue quality even if headline app ratings stay high. Medium SI024, SI002, SI003
CI017 Duolingo’s 2024 annual report is the clearest public-company proxy that language-learning at scale monetizes through a freemium funnel with only a minority of MAUs paying. Medium SI011
CI018 Duolingo’s official homepage still leads with a free value proposition, reinforcing how strong the low-price anchor is for the category. Medium SI023
CI019 Compared with Speak’s monthly subscription, Cambly monetizes live tutoring in higher-value lesson units rather than flat app access. Medium SI012, SI002
CI020 italki monetizes as a tutor marketplace: its teachers page exposes thousands of English tutors and visible trial prices as low as USD 5. Medium SI017
CI021 HelloTalk’s messaging emphasizes free language exchange at 70M+ registered users and 260+ languages, creating a zero-price substitute for casual practice. Medium SI018
CI022 Preply’s discovered pricing page still points to subscriptions and corporate language training, showing that tutor-led models also package recurring revenue and B2B offers. Medium SI016
CI023 ELSA’s discovered pricing page points to Pro Memberships, Business, and Schools, highlighting that specialized speaking apps can monetize across consumer and institutional channels. Medium SI015
CI024 Busuu’s premium page title and Babbel’s pricing/business links confirm that broad-course competitors use explicit paid upsells rather than purely free distribution. Medium SI013, SI014
CI025 Praktika markets roughly $8/month AI tutoring, well below Speak’s public price anchor, which pressures category willingness to pay if AI practice commoditizes. Medium SI025, SI002
CI026 Loora’s business-facing and app-download surface suggests another AI tutor pursuing both consumer and business monetization, further compressing differentiation. Medium SI020, SI026
CI027 Speak’s unit economics remain largely opaque because public sources do not reveal CAC, payback, gross margin, or retention by cohort. Low
CI028 A reasonable public reading is that Speak has moderate capital intensity: software, AI inference, content/localization, and app-store distribution costs, but no visible inventory or hardware capex burden. Medium SI001, SI002, SI003
CI029 AI inference and feedback quality likely make Speak’s delivery costs higher than static course apps, even though public sources do not quantify the margin impact. Medium SI001, SI008
CI030 App-store dependence implies revenue collection and discovery are mediated by platform policies and fees rather than wholly controlled by Speak. Medium SI002, SI003
CI031 Speak’s sales-efficiency profile on the enterprise side cannot be publically modeled because buyer count is disclosed but no seat counts, ACVs, or sales-cycle data are provided. Medium SI004, SI006
CI032 The company’s valuation has risen faster than its public operating disclosure, creating a gap between financing confidence and financial transparency. Medium SI005, SI006, SI007, SI008
CI033 There is no reviewed public sign of debt, project-finance obligations, or acute capital distress, but there is also no evidence to prove strong runway. Medium SI005, SI006, SI009, SI010
CI034 New-language expansion broadens what Speak can sell, but no public source quantifies attach rate, ARPU by language, or whether new courses monetize at the same price. Medium SI001, SI005, SI006
CI035 The strongest public case for Speak’s near-term financing adequacy is simply that it raised significant capital recently and has not publicly disclosed distress signals. Medium SI005, SI006, SI010
CI036 The weakest part of the financial story is revenue quality: users, downloads, and valuations are public, but realized revenue, margins, and retention are not. Medium SI001, SI005, SI006
CI037 If Speak’s enterprise motion is meaningful, working-capital needs likely come from sales, onboarding, and localization rather than physical fulfillment. Medium SI004, SI006
CI038 Category benchmarks show consumers can choose among subscription apps, freemium bundles, live tutoring, tutor marketplaces, and free exchange communities, which caps pricing power for a mid-priced app like Speak. Medium SI012, SI017, SI018, SI023, SI025
CE001 Speak publicly positions itself as an AI language tutor centered on speaking out loud and instant feedback. High SE001, SE017
CE002 Public product surfaces show Speak teaching six target languages: Spanish, English, French, Italian, Japanese, and Korean. High SE001, SE017, SE019
CE003 Speak’s homepage says the app has 15M+ downloads. Medium SE001
CE004 Speak’s proprietary learning method is organized around Learn, Practice, and Apply phases. High SE005, SE006, SE007
CE005 Tutor Lessons use an AI voice agent that can advance, correct mistakes, answer clarifying questions, and redirect off-topic responses. Medium SE005
CE006 Apply-phase roleplays are open-ended conversations where learners complete objectives without one fixed correct answer. Medium SE005, SE003
CE007 Speak said in March 2023 that GPT-4 had already been in production for two months powering parts of AI Tutor. Medium SE029
CE008 Speak’s winter 2025 release added adaptive lessons, vocab side quests, unit refreshers, audio-first roleplays, and Speak Level. Medium SE007
CE009 Speak launched French, Japanese, Korean, and Italian for English speakers in June 2025 after an earlier Spanish release. High SE008, SE010
CE010 Before its 2024 ASR overhaul, Speak operated fragmented speech systems across iOS, Android, on-device models, and third-party recognition services. Medium SE002
CE011 Speak’s 2024 ASR upgrade fine-tuned Conformer-CTC on many thousands of hours of heavily accented learner speech. Medium SE002
CE012 Speak’s 2024 speech stack used Nvidia Riva and Triton on Kubernetes, Google Cloud, gRPC between services, and websockets for client-server streaming. Medium SE002
CE013 Speak reported a greater than 60% word-error-rate reduction versus its pre-trained Conformer baseline on internal learner data. Medium SE002
CE014 Speak reported a 45% WER improvement versus its earlier fine-tuned on-device Android model. Medium SE002
CE015 Speak said first-word feedback latency averaged about 1.6 seconds after the 2024 ASR upgrade, about 20% faster than its prior third-party service. Medium SE002
CE016 Matching v2 replaces bag-of-words matching with a combined ASR-plus-phonetic pipeline and forced alignment. Medium SE004
CE017 Speak’s matching pipeline uses a customized wav2vec2 family model for phonetic streaming inference and updates transcripts every 200–300 milliseconds while a user speaks. Medium SE004
CE018 Speak reported that Matching v2 reduced false negatives by about 40% without increasing false positives on internal labeled data. Medium SE004
CE019 Speak’s 2026 voice-agent platform uses WebRTC via LiveKit Cloud between mobile clients and its backend. Medium SE005
CE020 Speak’s voice-agent servers are built on LiveKit Agents and call external ASR, LLM, TTS, and speech-to-speech providers alongside Speak backend services. Medium SE005
CE021 Speak says its Kubernetes clusters run across multiple regions and route learners to geographically close LiveKit edges and Speak clusters. Medium SE005
CE022 Speak uses both cascade (ASR→LLM→TTS) and speech-to-speech pipelines rather than standardizing on one architecture. Medium SE005
CE023 Speak says cascade fits roleplays and free-form conversations, while speech-to-speech fits pronunciation feedback and tutor lessons where tone or accent matters. Medium SE005
CE024 Speak evaluates and mixes multiple TTS providers by language pair, code-switching quality, latency, and custom-voice fit because no single provider works best everywhere. Medium SE005
CE025 Speak tracks end-to-end latency, ASR time-to-final-transcript, TTS time-to-first-byte, and provider performance by region, language, and tail latency percentiles. Medium SE005
CE026 Speak says it automatically shifts traffic to backup providers when latency or error thresholds are exceeded in a region-language pair. Medium SE005
CE027 Live Roleplays combine OpenAI Realtime API with Speak’s proprietary learning engine, proficiency graph, objectives, and hints. Medium SE003
CE028 Speak acknowledged in October 2024 that new speech-to-speech models still lag text models on instruction following and nuanced pronunciation coaching. Medium SE003
CE029 Speak says all lessons are written by learning designers and then human-reviewed even when AI is used to speed up workflow. Medium SE011
CE030 Speak says a single four-unit course can include 50+ lessons and takes weeks of writing, revising, filming, testing, and polishing. Medium SE006
CE031 Speak publicly supports iOS and Android mobile devices but not desktop PCs. Medium SE012
CE032 As of March 2026, Speak lists minimum support at app version 4.35.0, iOS 16, and Android 8. Medium SE012
CE033 Speak exposes both an in-lesson issue-report flag and email-based support escalation workflows. High SE014, SE015
CE034 Google Play says Speak may share app activity and device IDs with third parties, may collect personal and financial information, encrypts data in transit, and supports data deletion requests. Medium SE019
CE035 Apple’s listing rates Speak 13+ and links users to a privacy policy and terms page hosted on usespeak.com/speak.com domains. Medium SE017
CE036 Speak’s public privacy and terms URLs returned only a JavaScript shell in text-only review, limiting direct inspection of policy details. Medium SE030, SE031
CE037 OpenAI’s April 2025 interview says Speak uses OpenAI models across audio and text and that Connor Zwick viewed the Realtime API plus multimodal audio as a key breakthrough for the product. Medium SE023
CE038 TechCrunch reported in December 2024 that Speak had more than 10 million downloads and over 200 Speak for Business customers. Medium SE024
CE039 SiliconANGLE reported that Live Roleplays rolled out in late 2024 and that Speak also supports business-oriented conversations with suppliers and customers. Medium SE025
CE040 Apple’s App Store page shows Speak at 4.8 stars from 44K ratings. Medium SE017
CE041 Google Play shows Speak at 4.7 stars from roughly 112K reviews and 10M+ downloads as of the May 2026 listing. Medium SE019
CE042 Recent Apple reviews consistently praise Speak’s immediate speaking practice, replay/pronunciation comparison, and AI conversations, but also ask for more languages and richer rewards. Medium SE018
CE043 JustUseApp surfaces public complaints about laggy voice recognition, refund friction, support delays, and unfinished lesson availability despite strong headline ratings. Medium SE020
CE044 LanguaTalk’s 2026 review says Speak’s voices and speech recognition are strong, but feedback depth, lesson variety at higher levels, and pricing clarity lag serious-learner expectations. Medium SE021
CE045 AppsHunter shows the iOS app updated April 30, 2026 at version 4.46.0 with iOS 16+ compatibility and 16 interface languages. Medium SE026
CE046 OpenAI’s developer community contains active 2025 implementation guidance for realtime voice agents using Twilio, FastAPI, and OpenAI’s Realtime API. Medium SE027
CE047 The public GitHub repository openai/openai-realtime-agents showed about 6.8k stars and 1.1k forks at review time, indicating an active ecosystem around a key Speak dependency. Medium SE028
CE048 Speak says its English-learning product now supports 15 different native-language entry points. Medium SE010
CE049 Speak said in November 2024 that Google Play named it Best App of 2024 in Hong Kong, Korea, and Taiwan. Medium SE009
CE050 Speak’s help center says more intermediate courses and broader non-English access are still in development. Medium SE010, SE011
CU001 Speak's homepage positions the product as an AI language tutor focused on speaking out loud with instant feedback. Medium SU001
CU002 Speak's homepage lists French, Spanish, English, Korean, Italian, and Japanese as supported learning languages. Medium SU001
CU003 Speak's homepage advertises a 4.8 rating. Medium SU001
CU004 Speak's homepage advertises 15M+ downloads. Medium SU001
CU005 Speak for Business says 200+ brands rely on the product. Medium SU002, SU005
CU006 Speak for Business frames employers as the buyer/payer and employees as the primary users of English-learning content. Medium SU002
CU007 Speak for Business says learners speak hundreds of sentences per week. Medium SU002
CU008 Speak's official review page is explicitly limited to real 5-star App Store reviews from U.S. learners. Medium SU003
CU009 Speak said in June 2024 that it had more than 10 million learners in 40+ countries. Medium SU004, SU007
CU010 Speak said in June 2024 that learners had more than doubled year over year for the previous five years. Medium SU004, SU007
CU011 Speak said learners speak 1,000 times on average in their first week. Medium SU004
CU012 Speak said nearly 6% of Korea's population was learning English with the app in 2024. Medium SU004, SU008
CU013 Speak said users had already spoken more than one billion sentences in 2024. Medium SU005
CU014 Speak said it created 25 million personalized lessons in 2024. Medium SU005
CU015 Speak said Speak for Business had more than 200 customers across industries in December 2024. Medium SU005, SU012
CU016 Speak said enterprise deployments were seeing an 85% employee adoption rate in December 2024. Medium SU005
CU017 Speak Wrapped 2025 says learners spoke 3.74 billion lines in 2025, up 111% year over year. Medium SU006
CU018 Speak Wrapped 2025 says learners spent 19.6 million hours practicing in app in 2025, up 85% year over year. Medium SU006
CU019 Speak Wrapped 2025 says learners started 231 million lessons in 2025, up 108% year over year. Medium SU006
CU020 Speak Wrapped 2025 says learners created 80.3 million personalized lessons in 2025, up 154% year over year. Medium SU006
CU021 TechCrunch reported in August 2023 that Speak was live in around 20 countries. Medium SU008
CU022 TechCrunch reported in August 2023 that Speak had well over 100,000 subscribers in South Korea. Medium SU008
CU023 Apple's App Store listing shows Speak at 4.8 out of 5 from 44K ratings. High SU009, SU010
CU024 Google Play shows Speak at 4.7 stars from 112K reviews and 10M+ downloads. Medium SU011
CU025 The Google Play listing shows the Android app was updated on 2026-05-01. Medium SU011
CU026 The iOS App Store listing shows monthly Premium pricing at $17.99, annual Premium at $83.99, and annual Premium Plus at $164.99. Medium SU018
CU027 Forbes reported in November 2025 that about 15 million people had downloaded Speak and the company had surpassed $100 million in annualized revenue, largely from consumers. Low SU013
CU028 Forbes reported that Speak began pushing into enterprise in 2024 after some consumers asked employers to cover subscriptions. Low SU013
CU029 Forbes reported that roughly 500 companies, including KPMG and HD Hyundai, offered Speak subscriptions to employees primarily in South Korea by late 2025. Low SU013
CU030 KPMG describes itself as a global organization of independent professional services firms providing audit, tax, and advisory services. Medium SU017
CU031 The App Store and Speak's official review page both preserve j herronov's review saying months of French use improved comprehension of French media and made free-talk and bookmark features valuable. Medium SU019, SU026
CU032 Google Play and Speak's official review page both preserve Dan S's review saying more than six months of Spanish use beat prior tools because feedback was detailed and instantaneous. Medium SU021, SU024
CU033 Google Play and Speak's official review page both preserve Rosalyn Mulder's review saying tutor-like feedback and streaks made Speak preferable to Duolingo and Mango. Medium SU022, SU025
CU034 The App Store reviews page shows Chuck_Ellis saying eight days of use had him forming Spanish sentences with confidence. Medium SU010
CU035 The App Store reviews page shows dollargills saying Speak helps with speaking and listening rather than only vocabulary memorization. Medium SU010
CU036 The App Store listing includes brombres' review praising focused speaking practice while calling the UI and HFM auto-advance behavior unintuitive. Medium SU020
CU037 The Google Play listing includes Danielle Chavez's April 2026 review saying speech recognition misses words mid-sentence and female verb forms can be mishandled. Medium SU023
CU038 MWM's editorial app page reports 10M+ downloads, a 4.8 out of 5 user rating, and 354.9K total ratings across locales. Low SU016
CU039 LanguaTalk rated Speak 3 out of 5 in February 2026 and said the app works best for beginners while feedback depth and lesson variety remain weaker for serious learners. Medium SU015
CU040 LanguaTalk said Speak's speech recognition can be overly lenient and may give learners a false sense of mastery. Medium SU015
CU041 JustUseApp surfaces complaints about refund friction, language availability, voice recognition, and missing Spanish-content access. Low SU014
CU042 App-store listings on Apple and Google both use auto-renewing subscriptions, indicating Speak's public monetization flow is optimized for self-serve consumer conversion. Medium SU009, SU011
CU043 Because Speak's official review page only republishes 5-star App Store reviews, it cannot by itself prove balanced customer satisfaction or retention. Medium SU003
CU044 Speak's public customer proof is much stronger for consumer learners and aggregate enterprise counts than for named enterprise case studies. Medium SU002, SU003, SU013
CU045 No reviewed public source disclosed NRR, GRR, churn percentages, or cohort-retention tables for Speak. Medium SU001, SU002, SU005, SU013, SU015
CU046 No reviewed public source disclosed contract lengths, renewal rates, or seat-expansion data for Speak for Business. Medium SU002, SU005, SU013
CU047 No reviewed public source disclosed top-customer concentration, regional revenue mix, or partner concentration for Speak. Medium SU002, SU005, SU013, SU016
CU048 Public named-customer proof is sampled rather than exhaustive: official and press sources cite enterprise counts but do not provide a public roster or case-study library for most employer logos. Medium SU002, SU005, SU013
CU049 Dataconomy reported that Speak users spend about 10 to 20 minutes per day in the app. Low SU012
CU050 Dataconomy reported that Speak for Business served over 200 corporate customers in December 2024. Medium SU012
CR001 Apple’s App Store listing shows Speak with a 4.8 rating from 44K ratings and a 13+ age rating. Medium SR004
CR002 Google Play shows Speak with 10M+ downloads, 112K reviews, and a 4.7 star rating. Medium SR013
CR003 Google Play’s data-safety disclosure says Speak may share app activity and device or other IDs with third parties. Medium SR013
CR004 Google Play’s data-safety disclosure says Speak may collect personal info, financial info, and four other data categories. Medium SR013
CR005 Google Play says Speak encrypts data in transit and lets users request deletion. Medium SR013
CR006 Speak’s Google Play listing says memberships auto-renew unless cancelled at least 24 hours before renewal. Medium SR013
CR007 Speak’s Apple App Store listing says memberships auto-renew unless cancelled at least 24 hours before renewal. Medium SR004
CR008 Speak’s cancellation article says cancelling a subscription prevents the next renewal but does not generate a refund for the current period. Medium SR010
CR009 Speak routes cancellation differently for Apple App Store, Google Play, and direct website purchases. Medium SR010
CR010 Speak’s refund policy gives Google Play and website subscribers a refund window of seven days from purchase. Medium SR011
CR011 Speak’s refund policy says no refunds are available after 30 days from payment for Google Play and website purchases. Medium SR011
CR012 Speak’s refund policy says Apple controls App Store refunds and Speak cannot process those refunds directly. Medium SR011
CR013 Apple Support says users can cancel in-app subscriptions and request refunds through Apple. Medium SR014
CR014 Google Play tells developers to be transparent about subscription terms, billing frequency, and cancellation methods. Medium SR015
CR015 Google Play says subscription apps must include an easy-to-use online method to cancel the subscription. Medium SR015
CR016 COPPA applies when an online service is directed to children under 13 or has actual knowledge that it collects personal information from children under 13. Medium SR008, SR020
CR017 FTC COPPA guidance says covered services must post a privacy policy and obtain verifiable parental consent before collecting personal information from children. Medium SR020
CR018 CNIL says the AI Act was published in the Official Journal in July 2024 and entered into force in stages from 1 August 2024. Medium SR018
CR019 CNIL says the AI Act is risk-based and includes high-risk examples such as biometric systems plus specific transparency obligations for some AI systems. Medium SR018
CR020 IAPP’s AI Act/GDPR mapping says some high-risk AI deployments require a fundamental-rights impact assessment that complements GDPR impact assessments. Medium SR019
CR021 IAPP says GDPR Articles 13-14, 15, and 22 create transparency and meaningful-information duties around automated decision-making. Medium SR019
CR022 OpenAI says Speak uses OpenAI models across audio and text modalities for interactive speaking exercises and tutors. Medium SR002
CR023 Speak’s Live Roleplays feature is built with OpenAI’s Realtime API and GPT-4o speech-to-speech capabilities. Medium SR024, SR007
CR024 Speak says current speech-to-speech models are still weaker than text models on instruction following and nuanced language-learning tasks such as pronunciation coaching. Medium SR024
CR025 Speak’s ASR team says earlier third-party speech-recognition services often struggled with heavily accented learner speech. Medium SR001
CR026 Speak runs its fine-tuned Conformer-CTC ASR stack with Nvidia Riva and Triton in Kubernetes on Google Cloud Platform. Medium SR001
CR027 Speak says the revamped ASR system delivers first-word feedback in about 1.6 seconds on average. Medium SR001
CR028 Speak said in March 2023 that GPT-4 had already powered parts of AI Tutor in production for more than two months and over 2M lessons had used the feature. Medium SR025
CR029 TechCrunch reported in June 2024 that Speak had grown to over 10 million users and customers in more than 40 countries. Medium SR030
CR030 TechCrunch reported in June 2024 that Speak’s user base had doubled every year for the prior five years. Medium SR030
CR031 Speak said in August 2023 that nearly 6% of South Korea’s population had used the app. Medium SR026
CR032 Speak said in August 2023 that it had begun international expansion and was live in more than 20 countries. Medium SR026
CR033 TechCrunch reported in December 2024 that Speak raised a $78M Series C at a $1B valuation. Medium SR027, SR007
CR034 SiliconANGLE reported that Speak’s plan to build custom LLMs could create significant costs that the new financing helps absorb. Medium SR007
CR035 JustUseApp’s review page says 67.1% of the combined experience it analyzed was negative. Low SR005
CR036 JustUseApp includes complaints that Speak’s voice recognition became laggy and did not allow enough time to finish some spoken sentences. Low SR005
CR037 JustUseApp includes complaints about unauthorized or unexpected charges after free-trial cancellation and about support not responding for a week on refund requests. Low SR005
CR038 JustUseApp includes a complaint that Speak required a Korean phone number during onboarding and another that the interface launched in a language the user did not understand. Low SR005
CR039 AppsHunter summarizes negative user themes as expensive subscriptions, inaccurate voice recognition, rushed lessons, limited languages, and occasional app crashes or bugs. Low SR006
CR040 AppsHunter lists reported bugs including inability to record during lessons, lesson progress not saving properly, and billing problems. Low SR006
CR041 Tooliverse flags premium pricing, pronunciation misreads, battery heating during long sessions, and background-noise sensitivity among its key watch-outs for Speak. Low SR022
CR042 A 2026 Google Play review says the app sometimes fails to hear everything a user says in the middle of a sentence. Low SR013
CR043 Apple App Store reviews and Speak’s curated review page both show strong user praise for speaking confidence gains and immediate feedback, but repeated requests for more languages. Medium SR012, SR023
CR044 Apple rates Speak 13+ while Google Play rates it Everyone, creating an ambiguous minor-facing posture for a voice-first tutoring product. Medium SR004, SR013, SR008
CR045 Because billing and refunds are split across Apple, Google Play, and Speak’s direct website, subscription-trust failures can be operationally fragmented and harder to resolve quickly. Medium SR010, SR011, SR013, SR014
CR046 OpenAI price, policy, or uptime changes can affect both feature quality and gross-margin assumptions because Speak publicly relies on GPT-4, GPT-4o, and Realtime API capabilities. Medium SR002, SR024, SR027
CR047 Speak’s cloud dependency is concentrated in a narrow infrastructure stack because its custom ASR backend runs on Google Cloud with Nvidia GPU inference components. Medium SR001
CR048 Rapid growth from South Korea into 20-plus and then 40-plus countries raises localization, support, and content-operations complexity even if demand remains strong. Medium SR026, SR030, SR029
CR049 The public review and help-center record shows billing friction is not a one-off issue because refunds, cancellation, payment status, and free-trial questions occupy a dedicated 11-article help collection alongside multiple complaint sources. Medium SR003, SR005, SR010, SR011
CR050 Public review evidence supports the idea that Speak has real product value, but premium pricing makes customer expectations for recognition quality and billing fairness materially higher. Medium SR012, SR022, SR005
CR051 NicheMetric estimates Speak generated about $10.0M of iOS revenue and more than 2.0M iOS downloads in the last 30 days, but the methodology is not transparent enough for high-confidence underwriting. Low SR028
CR052 SiliconANGLE says Speak now offers an enterprise edition, Speak for Business, with business-conversation features, increasing the need for support and contract maturity beyond the consumer app. Medium SR007
CR053 Y Combinator’s company page lists Speak as an active San Francisco company with multiple open jobs, which is directionally consistent with ongoing staffing needs rather than a fully stabilized operating model. Low SR029
CR054 Speak’s publicly reviewable privacy and terms pages were JavaScript-gated during chapter preparation, so voice-retention, transcript-retention, and model-training specifics could not be fully verified from machine-readable text. Low
CR055 Public sources did not disclose Speak’s burn rate, gross margin, inference cost per lesson, customer concentration, or enterprise SLA credits. Low
CR056 No comprehensive cross-jurisdiction litigation or enforcement package was publicly reviewable during chapter preparation, so legal-overhang risk remains only partially closed. Low
CV001 Speak announced a $20M Series B-3 on June 18, 2024 at a $500M valuation. Medium SV001, SV003
CV002 Speak said the June 2024 financing brought total funding to $84M. Medium SV001, SV003
CV003 Speak said it had more than 10 million learners in 40+ countries by June 2024. Medium SV001, SV003
CV004 Speak said its learner base had more than doubled year over year for five straight years by June 2024. Medium SV001, SV003
CV005 TechCrunch reported Speak charged $20 per month or $99 per year in mid-2024. Medium SV003
CV006 Speak announced a $78M Series C on December 10, 2024 at a $1B valuation. Medium SV002, SV006, SV004
CV007 Speak said the Series C brought lifetime funding to $162M. Medium SV002, SV004
CV008 Speak said users had already spoken more than one billion sentences with the product in 2024. Medium SV002
CV009 Speak said Speak for Business had 200+ customers and an 85% employee adoption rate in December 2024. Medium SV002, SV006
CV010 Forbes said Speak’s valuation doubled to $1B after the December 2024 $78M round. Medium SV004, SV002
CV011 Forbes reported roughly 15 million people had downloaded Speak by November 2025. Medium SV005
CV012 Forbes reported Speak had surpassed $100M in annualized revenue by November 2025. Medium SV005, SV007
CV013 Forbes reported about 500 companies, including KPMG and HD Hyundai, offered Speak subscriptions to employees by late 2025. Medium SV005
CV014 Forbes reported paid Speak access ranged from roughly $80 to $200 for consumers in late 2025. Medium SV005, SV009
CV015 Apple’s App Store listed Speak at 4.8 stars with 44K ratings when accessed on 2026-05-05. Medium SV009
CV016 Apple’s App Store listed current U.S. in-app prices including $17.99 monthly premium and $83.99 annual premium, with higher Plus tiers. Medium SV009
CV017 Google Play listed Speak at 10M+ downloads and 112K reviews when accessed on 2026-05-05. Medium SV010
CV018 Google Play showed Speak was updated on May 1, 2026. Medium SV010
CV019 An AppBrain snapshot from February 2025 estimated 5.8M Android lifetime downloads and about 350K recent 30-day downloads for Speak. Medium SV011
CV020 An AppBrain snapshot showed Speak ranked #1 top grossing in South Korea Education and #2 top grossing in Japan Education in early 2025. Medium SV011
CV021 NicheMetric estimated Speak generated about $10.0M in iOS revenue and more than 2.0M iOS downloads in the last 30 days. Medium SV008
CV022 NicheMetric surfaced critical 2026 user complaints about an immediate paywall, poor support responsiveness, and inability to switch languages. Medium SV008
CV023 GetLatka reported Speak reached $100M revenue in 2025, up from $15M in 2024. Medium SV007
CV024 GetLatka listed Speak at 253 employees. Medium SV007
CV025 Yahoo Finance listed Duolingo at $1.04B trailing revenue and 4.01x EV/Revenue. Medium SV014
CV026 Yahoo Finance listed Duolingo at 39.91% profit margin and 35.0% quarterly revenue growth. Medium SV014
CV027 CompaniesMarketCap listed Duolingo at a $5.15B market cap in May 2026. Medium SV015
CV028 SEC EDGAR showed Duolingo had filed a current 10-Q on 2026-05-05 and a 10-K for 2025. Medium SV013
CV029 Yahoo Finance listed Coursera at $789.84M trailing revenue and 0.42x EV/Revenue. Medium SV017
CV030 Yahoo Finance listed Coursera at -3.0% quarterly revenue growth and 0.48% profit margin. Medium SV017
CV031 CompaniesMarketCap listed Coursera at a $0.98B market cap in May 2026. Medium SV018
CV032 SEC EDGAR showed Coursera had a 10-Q on 2026-04-30 and recent annual-report filings on file. Medium SV016
CV033 Yahoo Finance listed Udemy at $773.9M trailing revenue and 0.38x EV/Revenue. Medium SV020
CV034 Yahoo Finance listed Udemy at 9.1% quarterly revenue growth and -8.23% profit margin. Medium SV020
CV035 CompaniesMarketCap listed Udemy at a $0.68B market cap in May 2026. Medium SV021
CV036 SEC EDGAR showed Udemy had a 2026 10-K/A and prior 10-K disclosures on file. Medium SV019
CV037 Yahoo Finance listed Chegg at $376.91M trailing revenue and 0.32x EV/Revenue. Medium SV023
CV038 Yahoo Finance listed Chegg at -49.4% quarterly revenue growth and -27.44% profit margin. Medium SV023
CV039 CompaniesMarketCap listed Chegg at a $0.12B market cap in May 2026. Medium SV024
CV040 SEC EDGAR showed Chegg had 2026 10-Qs and a 2025 10-K on file. Medium SV022
CV041 Grand View Research estimated the global AI tutors market at $2.11B in 2025 and $17.72B by 2033, a 30.5% CAGR. Medium SV027
CV042 MMR Statistics estimated the global online language learning market at $24.56B in 2025 and $63.43B by 2032, a 14.52% CAGR. Medium SV028
CV043 MMR Statistics said more than 60% of leading online-language platforms had integrated AI-driven adaptive learning, speech recognition, or personalized lesson pathways in 2025. Medium SV028
CV044 MMR Statistics said freemium pricing and free content were intensifying competition and raising customer acquisition costs, especially for earlier-stage platforms. Medium SV028
CV045 Fortune Business Insights sized the private tutoring market at $66.96B in 2025 and said Asia Pacific held a 60.85% share. Medium SV029
CV046 The FTC warned that control over key generative-AI inputs can create barriers to entry and distort competition. Medium SV026
CV047 The FTC warned network effects can help generative-AI leaders entrench market power and reduce entrant competitiveness. Medium SV026
CV048 Oliver Wyman said public markets began repricing software risk in early 2026 as agentic AI challenged seat-based pricing and durable product differentiation. Medium SV025
CV049 Oliver Wyman said valuation multiples are becoming more sensitive to perceived AI exposure and revenue durability. Medium SV025
CV050 Using Speak’s $1.0B December 2024 valuation and the later reported >$100M annualized revenue milestone implies a high-single-digit to roughly 10x revenue multiple before later balance-sheet adjustments. Medium SV002, SV005
CV051 Speak’s implied private multiple sits above current public-peer EV/revenue levels of 4.01x for Duolingo, 0.42x for Coursera, 0.38x for Udemy, and 0.32x for Chegg. Medium SV014, SV017, SV020, SV023
CV052 The public-comp spread suggests Speak’s premium valuation only holds if it sustains materially faster growth and monetization than listed edtech peers. Medium SV014, SV017, SV020, SV023, SV005
CV053 Public evidence supports real traction, but absent cap-table, preference-stack, retention, and segment-mix disclosure makes the last public $1B mark hard to underwrite for a new investor. Medium SV002, SV005, SV007
CV054 A reasonable bear case is valuation compression toward roughly $400M-$650M if growth slows and public-like multiples dominate. Medium SV014, SV017, SV020, SV023, SV025
CV055 A reasonable base case of roughly $800M-$1.0B requires Speak to defend current scale and convert enterprise traction without requiring multiple expansion. Medium SV002, SV005, SV009, SV010
CV056 A reasonable bull case above $1.1B requires revenue scaling well beyond $150M plus broader enterprise adoption and continued app-ranking strength. Medium SV005, SV011, SV027, SV028
CV057 Because market growth is real but AI competition and multiple compression are also real, a research-more recommendation is better supported than a buy at the last public $1B mark. Medium SV025, SV026, SV028, SV029, SV005
CV058 Monitorable downside triggers include weakening app-store momentum, failure to convert enterprise footprint into disclosed recurring economics, and continued sector multiple compression. Medium SV010, SV011, SV013, SV025, SV005
Sources
IDPublisherTitleQuote
SO001 Speak Speak homepage 15M+ downloads; 4.8 rating; AI language tutor focused on speaking.
SO002 Speak Speak for Business 200+ Brands Rely on Speak for Business.
SO003 Speak Careers page The team is based in San Francisco, Seoul, Tokyo, and Ljubljana.
SO004 Apple App Store Speak: Language Learning App Store listing 44K Ratings; 4.8; monthly and annual auto-renewing subscriptions.
SO005 Google Play Speak: Language Learning Google Play listing 4.7 star; 112K reviews.
SO006 Speak Series B-3 announcement Speak raised $20M in Series B-3 financing, doubling its valuation to $500 million.
SO007 Speak Series C announcement Speak raised $78M in Series C funding at a $1 billion valuation.
SO008 Speak New languages on Speak Four new languages launched; more than 15 million learners around the world.
SO009 TechCrunch Language learning app Speak nets $20M, doubles valuation Speak has grown to over 10 million users and now has customers in more than 40 countries.
SO010 TechCrunch OpenAI-backed Speak raises $78M at $1B valuation Speak provides average usage around 10-20 minutes/day, paying $20 per month or $99 per year.
SO011 Tech Funding News OpenAI-backed Speak closes $78M at $1B valuation Speak, a San Francisco-based language-learning startup, has closed $78 million in Series C funding.
SO012 Unite.AI Speak secures $78M Series C funding at $1B valuation Founded in 2016 by Connor Zwick and Andrew Hsu.
SO013 FilingFlow Speakeasy Labs Form D filing Total offering $77,699,277; first sale Nov 13, 2024; Rule 506(b).
SO014 SEC Filing Data Speakeasy Labs SEC filings list Form D filings appear on 12/11/2024, 08/12/2024, and 10/17/2023.
SO015 HolonIQ The Complete List of Global EdTech Unicorns Speak, Language Learning App, has joined the list in Dec 2024 at $1B valuation.
SO016 GetLatka How Speak hit $100M revenue with a 253 person team in 2025 Speak hit $100M in revenue in November 2025 and had 253 total employees.
SO017 Android Police I ditched Duolingo for this language app, and it was a total reality check Speak continued to praise my speaking ability even when I deliberately mispronounced phrases.
SO018 JustUseApp Speak reviews (2026) Negative experience 67.1%; complaints include auto-renewal, lag, and speech recognition issues.
SO019 Languatalk Speak app review Feedback is brief and lacks depth; premium tier structure is confusing and potentially expensive.
SO020 Crunchbase News AI language startup Speak hits unicorn status after raising Series C Speak hit unicorn status after its Series C round.
SO021 Inc. This AI language learning platform is now a unicorn Speak translated AI speech technology into a $1 billion valuation.
SO022 Similarweb Speak app overview
SO023 Sensor Tower Speak app overview Speak: Language Learning - Google Play Store - US - Category Rankings and Growth Metrics.
SO024 Speak Privacy policy route
SO025 Speak Terms route
SM001 Speak Speak homepage The most effective way to learn a language.
SM002 Speak Series B-3 announcement Disrupting the $100 billion+ online and in-person language learning market.
SM003 Speak Series C announcement English learning is industry agnostic and Speak for Business had 200+ customers.
SM004 Speak New languages on Speak Speak started by teaching English in Korea, Japan, and Taiwan.
SM005 Technavio Digital English language learning market report Market size to increase USD 39.46 billion at a CAGR of 24.5% from 2024 to 2029.
SM006 MarketsandMarkets AI in Education Market Forecast & Size The AI in Education market is projected to grow from USD 2.21 billion in 2024 to USD 5.82 billion by 2030.
SM007 Stanford HAI AI Index 2026 education chapter Four out of five U.S. high school and college students now use AI for schoolwork.
SM008 World Economic Forum AI reshaping global education AI can automate up to 20% of educator clerical tasks but raises access, privacy, and bias concerns.
SM009 World Bank Digital Progress and Trends 2025: AI Foundations Low- and middle-income countries face steep AI adoption challenges; small AI and the four Cs matter.
SM010 arXiv Systematic Review for AI-based Language Learning Tools AI-based language learning tools improved learner outcomes but raised privacy and teacher-preparation concerns.
SM011 Preply Global language learning statistics and trends English is the most learned language and the English learning market is worth about $43.51B in 2025.
SM012 HolonIQ Global EdTech Unicorns list Preply joined the EdTech unicorn list in Jan 2026.
SM013 TechCrunch OpenAI-backed Speak raises $78M at $1B valuation For the one and a half billion people out there trying to learn English, the issue is speaking it.
SM014 TechCrunch Speak nets $20M, doubles valuation Speak makes money by charging $20 per month, or $99 per year.
SM015 Android Police Speak review and AI limitations Speak cannot identify basic pronunciation errors and can create a false sense of mastery.
SM016 Languatalk Speak app review Feedback is brief and lacks depth; lesson variety becomes repetitive.
SM017 SEC Duolingo 2024 annual report Duolingo offers courses in over 40 languages to more than 100 million monthly active users.
SM018 ELSA Speak ELSA homepage 18M+ downloads; AI English speaking coach.
SM019 Apple App Store ELSA App Store listing 109K ratings; yearly and monthly memberships are available.
SM020 Cambly Cambly homepage Real conversations with native speakers, anytime, anywhere, 24/7.
SM021 Busuu Busuu App Store listing Busuu helps you communicate with confidence from day one and connects learners with native speakers.
SM022 Busuu Busuu website snapshot 120+ million registered users.
SM023 Babbel Babbel App Store listing 25 million subscriptions sold.
SM024 Google Play Babbel Play listing 50M+ downloads; 1.12M reviews.
SM025 Praktika Praktika homepage 20M+ learners and private-tutor results without the private-tutor price.
SP001 Speak Speak homepage 15M+ downloads; AI language tutor.
SP002 Apple App Store Speak App Store listing 44K ratings; 4.8; free with in-app purchases.
SP003 Google Play Speak Google Play listing 4.7 star; 112K reviews.
SP004 Speak Series C announcement 200+ customers across various industries.
SP005 TechCrunch Speak Series B extension article Speak makes money by charging $20 per month or $99 per year.
SP006 TechCrunch Speak Series C article Speak for Business has over 200 customers and consumers typically pay $20 per month or $99 per year.
SP007 Android Police Speak review and AI limitations Speak is heavily inspired by Duolingo and misses basic pronunciation errors.
SP008 Languatalk Speak app review Speak works well for early learners but not for those seeking deeper adaptive practice.
SP009 SEC Duolingo 2024 annual report Duolingo offers 40+ languages to 100M+ MAUs and ~9% of MAUs are paid subscribers.
SP010 ELSA Speak ELSA homepage 18M+ downloads and 460K+ ratings.
SP011 Apple App Store ELSA App Store listing 109K ratings; yearly and monthly memberships available.
SP012 Cambly Cambly homepage Real conversations with native speakers, anytime, anywhere, 24/7.
SP013 Busuu Busuu website snapshot 120+ million registered Busuu users.
SP014 Apple App Store Busuu App Store listing 98K ratings; community feedback from native speakers.
SP015 Google Play Busuu Play listing 50M+ downloads; 1.13M reviews.
SP016 Apple App Store Babbel App Store listing 25 million subscriptions sold.
SP017 Google Play Babbel Play listing 50M+ downloads; 1.12M reviews.
SP018 Praktika Praktika homepage 20M+ learners and ~$8/month versus a ~$400/month private tutor.
SP019 Loora Loora homepage Loora is an always-available AI English tutor focused on real-time feedback.
SP020 MarketsandMarkets AI in Education Market Duolingo and ELSA Speak are named among the AI-in-education market participants.
SP021 HolonIQ Global EdTech Unicorns list Preply joined the EdTech unicorn list in Jan 2026 at a $1.2B valuation.
SP022 Preply Global language learning report English is the most learned language because of business and education demand.
SP023 Speak Speak for Business page 200+ brands rely on Speak for Business.
SP024 JustUseApp Speak reviews aggregation 67.1% negative experience according to review aggregation.
SP025 Duolingo Duolingo homepage The free, fun, and effective way to learn a language.
SP026 Duolingo Duolingo Super page Super Duolingo is the premium upsell path from the free product.
SP027 Cambly Cambly pricing page Pricing page shows private lessons and Pro plans from US$8.12 per lesson on annual terms.
SP028 Apple App Store Cambly App Store listing Cambly – Learn English App Store listing.
SP029 Preply Preply homepage Preply is an online language tutoring marketplace.
SP030 italki italki homepage italki is a language-learning marketplace with certificated tutors.
SP031 HelloTalk HelloTalk homepage HelloTalk is a language exchange and learning platform.
SP032 Google Play Duolingo Play listing Duolingo: Language Lessons - Apps on Google Play.
SP033 Speak New languages post French, Japanese, Korean, and Italian launched after Spanish.
SI001 Speak Speak homepage 15M+ downloads; AI language tutor.
SI002 Apple App Store Speak App Store listing 44K ratings; monthly and annual auto-renewing subscriptions.
SI003 Google Play Speak Google Play listing 4.7 star; 112K reviews.
SI004 Speak Speak for Business page 200+ brands rely on Speak for Business.
SI005 Speak Series B-3 announcement Speak raised $20M in Series B-3 financing, doubling its valuation to $500 million.
SI006 Speak Series C announcement Speak raised $78M in Series C funding at a $1 billion valuation.
SI007 TechCrunch Speak Series B extension article Speak makes money by charging $20 per month or $99 per year.
SI008 TechCrunch Speak Series C article Speak users spend roughly 10-20 minutes per day and pay $20 per month or $99 per year.
SI009 FilingFlow Speakeasy Labs Form D filing Total offering $77,699,277; first sale Nov 13, 2024; Rule 506(b).
SI010 SECFilingData Speakeasy Labs SEC filings list Form D filings appear on 12/11/2024, 08/12/2024, and 10/17/2023.
SI011 SEC Duolingo 2024 annual report Duolingo offers 40+ languages to 100M+ MAUs and about 9% of MAUs are paid subscribers.
SI012 Cambly Cambly pricing page Private and Pro tutoring plans run from US$8.12 per lesson on annual terms.
SI013 Busuu Busuu premium page Premium - Busuu.
SI014 Apple App Store Babbel App Store listing 25 million subscriptions sold.
SI015 ELSA Speak ELSA pricing page The fetched page exposes Pro Memberships, ELSA for Business, and ELSA for Schools.
SI016 Preply Preply pricing page The fetched page points users to Preply Subscription and Corporate language training.
SI017 italki italki teachers page 4339 English tutors available; visible trial pricing starts at USD 5.00.
SI018 HelloTalk HelloTalk VIP page HelloTalk says it helps users learn a language for free and has 70M+ registered users across 260+ languages.
SI019 Praktika Praktika pricing page The fetched page shows app-download CTAs and a For business link even though the pricing URL 404s.
SI020 Loora Loora pricing page The fetched page shows Loora for Business plus app download links even though the pricing URL 404s.
SI021 SEC Duolingo Q1 2025 10-Q viewer SEC XBRL viewer for Duolingo quarter ended March 31, 2025.
SI022 ELSA Speak ELSA homepage 18M+ downloads and 460K+ ratings.
SI023 Duolingo Duolingo homepage The free, fun, and effective way to learn a language.
SI024 Apple App Store Speak App Store reviews aggregation 67.1% negative experience according to review aggregation.
SI025 Praktika Praktika homepage 20M+ learners and roughly $8/month versus a private tutor.
SI026 Loora Loora homepage Loora is an always-available AI English tutor.
SE001 Speak Speak - The language learning app that gets you speaking Talk out loud, get instant feedback, and become fluent with the world’s most advanced AI language tutor.
SE002 Speak Leveling up our core speech recognition systems at Speak This fine-tuned model dramatically outperforms the pretrained model with a >60% reduction in word error rate for our learners and task type.
SE003 Speak Live Roleplays powered by OpenAI Realtime API Today, we’re announcing Live Roleplays, a new Speak experience that combines Realtime API with Speak’s learning engine to enable immersive, life-like speaking practice in a variety of roleplay scenarios.
SE004 Speak Designing a High-Accuracy Speech Matching Pipeline with ASR and Phonetic Models By introducing a phonetic model alongside the ASR model, we reduced false negatives by approximately 40% without making our algorithm more lenient.
SE005 Speak Building Speak's Voice Agent Platform Audio transport: WebRTC via LiveKit.
SE006 Speak How Speak reinvents language learning The Speak Method is our proprietary learning method built around three phases: Learn, Practice, and Apply.
SE007 Speak Your most personalized Speak yet: What’s new in our winter release We’re launching new features to deepen your learning experience with Speak, all designed to help you speak more, learn faster, and stay engaged every step of the way.
SE008 Speak New languages on Speak, just in time for the summer Today we officially launch four new languages for English speakers to accompany our recent Spanish release: French, Japanese, Korean, and Italian.
SE009 Speak Speak named Google Play’s “Best App of 2024” in Hong Kong, Korea and Taiwan Speak named Google Play’s Best App of 2024 in Hong Kong, Korea and Taiwan.
SE010 Speak Help Center What Languages Can I Learn with Speak? Speak currently offers English learning courses for speakers of 15 native languages.
SE011 Speak Help Center How Does Speak Curate Its Content and Curriculum? All of our lessons are written by learning designers—a team of educators, linguists, and translators.
SE012 Speak Help Center What devices and operating systems does the Speak app support? The Speak app is available on both iOS and Android devices ... Speak is not available on desktop (PC).
SE013 Speak Help Center Voice recognition isn’t working If Speak isn’t picking up your voice, please try the steps below.
SE014 Speak Help Center How can I report an issue in the app? You can report an issue directly from the screen where the problem occurs.
SE015 Speak Help Center How can I contact Speak Support? You can contact the Speak Support team by email only.
SE016 Speak Help Center Troubleshooting Guides Common troubleshooting methods for when unexpected issues.
SE017 Apple App Store Speak: Language Learning App - App Store Powered by cutting-edge AI technology, Speak ensures that you gain fluency by engaging in real-life conversations and receiving instant feedback.
SE018 Apple App Store Speak: Language Learning - Ratings & Reviews - App Store I love that the lessons give you the phrase, have you repeat it and then quiz you.
SE019 Google Play Speak: Language Learning - Apps on Google Play This app may share these data types with third parties: App activity and Device or other IDs.
SE020 JustUseApp Speak Reviews (2026) | Check if app is safe or legit Overall Customer Experience: Negative experience 67.1% ... The voice recognition became laggy after this update.
SE021 LanguaTalk Speak App Review: Is It Worth It in 2026? Speak’s audio quality is strong ... but the app’s feedback and customization features are unlikely to satisfy serious learners.
SE022 Y Combinator Speak: A superhuman, AI-powered language tutor in your pocket Applied ML Engineer, Speech.
SE023 OpenAI Speak is personalizing language learning with AI That’s easy—OpenAI’s real-time API and multimodality for audio.
SE024 TechCrunch OpenAI-backed Speak raises $78M at $1B valuation to help users learn languages by talking out loud Speak has built a platform to teach languages by focusing on how native speakers learn: Using AI, the startup generates audio conversations and listens to users’ responses.
SE025 SiliconANGLE OpenAI backs $78M round for AI language learning startup Speak One of the latest additions to Speak’s feature set, Live Roleplays, rolled out a few weeks ago.
SE026 AppsHunter Speak: Language Learning App - AI Speaking Practice Updated April 30, 2026. Version 4.46.0. Compatibility: iOS 16.0+.
SE027 OpenAI Developer Community Voice Agent using Realtime API The agent-builder package provides a streamlined way to create real-time voice agents powered by OpenAI’s Realtime API.
SE028 GitHub openai/openai-realtime-agents This is a simple demonstration of more advanced, agentic patterns built on top of the Realtime API.
SE029 Speak Speak Shares Details of AI Tutor, Built on Top of OpenAI’s GPT-4 Speak has used GPT-4 in production to power parts of its AI Tutor feature.
SE030 Speak Speak privacy policy landing page Speak - The languages learning app that gets you speaking.
SE031 Speak Speak terms landing page Speak - The languages learning app that gets you speaking.
SU001 Speak Speak homepage 15M+ Downloads
SU002 Speak Speak for Business | Enterprise Language learning 200+ Brands Rely on Speak for Business
SU003 Speak Why Learners Love Speak: Real App Store Feedback This page contains a complete, unedited collection of real 5-star App Store reviews from Speak learners in the United States.
SU004 Speak Speak Hits $500M Valuation, Expands Rapidly Across Markets Speak now has more than 10 million learners in 40+ countries, with learners more than doubling year-over-year for the last five years.
SU005 Speak A new milestone as we bring language learning to all: Raising $78M Series C at a $1B valuation Our momentum is clear with more than 200+ customers across various industries, and an 85 percent adoption rate among employees.
SU006 Speak A year in conversation | Speak Wrapped 2025 In 2025, learners spoke 3.74 billion lines on Speak, an increase of 111 percent from 2024.
SU007 TechCrunch Language learning app Speak nets $20M, doubles valuation Its user base has doubled every year for the last five years, and Speak now has customers in more than 40 countries.
SU008 TechCrunch OpenAI-backed language learning app Speak raises $16M to expand to the US Speak has managed to hold its own despite the competition, becoming one of the top-downloaded education apps in South Korea, where it first launched, with well over 100,000 subscribers.
SU009 Apple App Store Speak: Language Learning App - App Store listing 44K Ratings
SU010 Apple App Store Speak: Language Learning - Ratings & Reviews 4.8 out of 5
SU011 Google Play Speak: Language Learning - Apps on Google Play 4.7star 112K reviews 10M+ Downloads
SU012 Dataconomy AI-driven language learning startup Speak raises $78M at a $1B valuation An enterprise tier, Speak for Business, currently serves over 200 corporate customers.
SU013 Forbes How AI Language Learning App Speak Is Taking On Duolingo Now, some 500 companies including KPMG and HD Hyundai offer Speak subscriptions to employees primarily in South Korea.
SU014 JustUseApp Speak Reviews (2026) | Check if app is safe or legit Paid for a subscription for Spanish and now only half of the Spanish lessons are available.
SU015 LanguaTalk Speak App Review: Is It Worth It in 2026? Feedback is one of Speak's weakest points.
SU016 MWM Speak: Language Learning - Education App Downloads 10M+; User Rating 4.8/5; Total Ratings 354.9K
SU017 KPMG About KPMG KPMG is a global organization of independent professional services firms providing Audit, Tax, and Advisory services.
SU018 Apple App Store Speak pricing snapshot Monthly Premium $17.99; Annual Premium $83.99; Annual Premium Plus $164.99
SU019 Apple App Store j herronov App Store review This app has improved that aspect so much for me. I am not yet fluent, but am able to pick out words and sayings on French videos and French hockey broadcasts now.
SU020 Apple App Store brombres App Store review One small gripe: the UI doesn't feel as intuitive as it could.
SU021 Google Play Dan S Google Play review The AI is phenominal and language recognition is dead-on.
SU022 Google Play Rosalyn Mulder Google Play review They also have a daily streak that me, personally, I enjoy because I feel motivated to get that streak as high as possible.
SU023 Google Play Danielle Chavez Google Play review I do run into an issue where it does not hear everything I say particularly in the middle of a sentence.
SU024 Speak Dan S review excerpt on official Speak review page I've been using this app for more than six months now. It's simply the best Spanish language learning app that exists.
SU025 Speak Rosalyn Mulder review excerpt on official Speak review page I absolutely love this app. There have been so many other apps that I've tried like Duolingo and Mango. They help me learn sure, but I feel this one is the best so far.
SU026 Speak j herronov review excerpt on official Speak review page This has become by far my favorite app for a number of reasons.
SR001 Speak Leveling up our core speech recognition systems at Speak We deployed Riva and Triton in our existing Kubernetes cluster... Our backend is deployed with Kubernetes on Google Cloud Platform.
SR002 OpenAI Speak is personalizing language learning with AI Speak leverages OpenAI models to power its language learning curriculum across modalities such as audio and text.
SR003 Speak Help Center Subscription/Billing Subscription/Billing ... 11 articles.
SR004 Apple App Store Speak: Language Learning App - App Store 44K Ratings ... Age Rating 13+ ... Speak offers both monthly and annual auto-renewing subscriptions.
SR005 JustUseApp Speak Reviews (2026) | Check if app is safe or legit Negative experience 67.1% ... The voice recognition became laggy after this update.
SR006 AppsHunter Speak: Language Learning App - AI Speaking Practice Negative things ... Expensive subscription cost ... Voice recognition can be inaccurate ... Lesson progress not saving properly.
SR007 SiliconANGLE OpenAI backs $78M round for AI language learning startup Speak Building such models can incur significant costs. The $78 million funding round announced today could make it easier for the company to balance those expenses with growth investments.
SR008 Federal Trade Commission Children's Online Privacy Protection Rule ("COPPA") COPPA imposes certain requirements on operators ... directed to children under 13 years of age.
SR009 European Data Protection Board AI Privacy Risks & Mitigations Large Language Models (LLMs) AI Privacy Risks & Mitigations Large Language Models (LLMs).
SR010 Speak Help Center How can I cancel the subscription? If you cancel your subscription, you can continue using your subscription until the current subscription period ends, but you will not receive a refund.
SR011 Speak Help Center Refund Policy Full refund available within 7 days of purchase ... After 30 days from the payment date: No refunds are available.
SR012 Apple App Store Speak: Language Learning - Ratings & Reviews 4.8 out of 5 ... 44K Ratings.
SR013 Google Play Speak: Language Learning - Apps on Google Play 10M+ Downloads ... This app may share these data types with third parties ... Data is encrypted in transit.
SR014 Apple Support Subscriptions and Billing - Official Apple Support You can cancel a subscription from Apple ... App Store and iTunes Store purchases may be eligible for a refund.
SR015 Google Play Console Help Create and manage subscriptions You must be transparent with users about your offer terms ... and how a user can manage or cancel their subscription.
SR016 EUR-Lex Regulation (EU) 2016/679 (GDPR) Regulation - 2016/679 - EN - gdpr - EUR-Lex.
SR017 European Data Protection Board Artificial intelligence Artificial intelligence | European Data Protection Board.
SR018 CNIL Entry into force of the European AI Regulation: the first questions and answers from the CNIL The European AI Act has just been published ... and will gradually come into force as of 1 August 2024.
SR019 IAPP EU AI Act: Mapping the Interplays with the GDPR The AI Act and the GDPR ... map interplays between the AI Act and the GDPR.
SR020 Federal Trade Commission Children’s Online Privacy Protection Rule: A Six-Step Compliance Plan for Your Business Before collecting, using or disclosing personal information from a child, you must get their parent’s verifiable consent.
SR021 Android Developers Google Play Policies Google Play Policies | Android Developers.
SR022 Tooliverse Speak Review 2026 - AI Language Learning Premium pricing and battery consumption during long sessions require consideration.
SR023 Speak Why Learners Love Speak: Real App Store Feedback This page contains a complete, unedited collection of real 5-star App Store reviews from Speak learners in the United States.
SR024 Speak Live Roleplays powered by OpenAI Realtime API These new speech-to-speech models aren’t as good as text models on instruction following, and they’re not great yet at more nuanced language learning specific tasks.
SR025 Speak Speak Shares Details of AI Tutor, Built on Top of OpenAI’s GPT-4 Speak has used GPT-4 in production to power parts of its AI Tutor feature.
SR026 Speak OpenAI Startup Fund-Backed Speak Announces $16m Series B-2 Financing & Rapid International Expansion Nearly 6% of the population has turned to Speak ... now live in more than 20 countries.
SR027 TechCrunch OpenAI-backed Speak raises $78M at $1B valuation to help users learn languages by talking out loud Speak is using the company’s technology to power its platform.
SR028 NicheMetric Speak: Language Learning - Revenue, Downloads & Market Analysis Revenue Last 30 days $10.0M ... Downloads Last 30 days > 2.0M.
SR029 Y Combinator Speak: A superhuman, AI-powered language tutor in your pocket Active ... San Francisco ... Jobs 10.
SR030 TechCrunch Language learning app Speak nets $20M, doubles valuation Speak has grown to over 10 million users ... customers in more than 40 countries.
SV001 Speak Speak Hits $500M Valuation, Expands Rapidly Across Markets
SV002 Speak A new milestone as we bring language learning to all: Raising $78M Series C at a $1B valuation
SV003 TechCrunch Language learning app Speak nets $20M, doubles valuation
SV004 Forbes via Internet Archive Speak | Company Overview & News
SV005 Forbes via Internet Archive This Startup Is Racing Duolingo To Replace Human Language Tutors With AI
SV006 AIbase AI Language Learning Platform Speak Raises $78 Million, Valuation Exceeds $1 Billion
SV007 GetLatka How Speak hit $100M revenue with a 253 person team in 2025.
SV008 NicheMetric Speak: Language Learning - Revenue, Downloads & Market Analysis
SV009 Apple App Store Speak: Language Learning App - App Store
SV010 Google Play Speak: Language Learning - Apps on Google Play
SV011 AppBrain via Internet Archive Speak - Language Learning for Android - Free App Download
SV012 Apptopia About: Speak: Language Learning (Google Play version)
SV013 SEC EDGAR EDGAR Company Search Results - Duolingo, Inc.
SV014 Yahoo Finance Duolingo, Inc. (DUOL) Valuation Measures & Financial Statistics
SV015 CompaniesMarketCap Duolingo (DUOL) - Market capitalization
SV016 SEC EDGAR EDGAR Company Search Results - Coursera, Inc.
SV017 Yahoo Finance Coursera, Inc. (COUR) Valuation Measures & Financial Statistics
SV018 CompaniesMarketCap Coursera (COUR) - Market capitalization
SV019 SEC EDGAR EDGAR Company Search Results - Udemy, Inc.
SV020 Yahoo Finance Udemy, Inc. (UDMY) Valuation Measures & Financial Statistics
SV021 CompaniesMarketCap Udemy (UDMY) - Market capitalization
SV022 SEC EDGAR EDGAR Company Search Results - Chegg, Inc.
SV023 Yahoo Finance Chegg, Inc. (CHGG) Valuation Measures & Financial Statistics
SV024 CompaniesMarketCap Chegg (CHGG) - Market capitalization
SV025 Oliver Wyman How AI is reshaping SaaS valuations: a guide for investors
SV026 Federal Trade Commission Generative AI Raises Competition Concerns
SV027 Grand View Research AI Tutors Market Size, Share & Trends | Industry Report 2033
SV028 MMR Statistics Online Language Learning Market Insights 2025–2032
SV029 Fortune Business Insights Private Tutoring Market Size, Share & Industry Growth, 2034
SV030 Sensor Tower Speak: Language Learning - Apple App Store - US - Category Rankings, Keyword Rankings, Sales Rankings, Research, Performance, and Growth Metrics.