Startup Diligence
Diligence report AI Software / Developer Productivity Late Private (post-Series A) 2026-05-09

Cognition AI

Cognition AI is a category-defining autonomous software engineering platform with exceptional ARR growth ($1M → $73M in 9 months), but the $10.2B valuation at ~140× ARR prices in near-flawless execution in a market facing rapid benchmark commoditization and critical undisclosed financial metrics.

Cover facts

Last Round Valuation 01
$10.2B (September 2025) [CV004]
Total Funding 02
~$1.575B (seed + Series A + $400M round) [CV010]
Estimated ARR 03
~$73M (April 2025 est.; third-party, unconfirmed) [CV030]
ARR Growth 04
$1M → $73M in 9 months (Dec 2024 – Apr 2025) [CV001]
Lead Investor 05
Founders Fund IX (CIK 0001971631) [CV007]
Key Product 06
Devin — autonomous AI software engineer (cloud + VPC deployment) [CO002]
Windsurf Acquisition 07
Acquired July 2025; 350+ enterprise accounts, 250K+ DAU [CV006]
Named Customers 08
Nubank (12× efficiency, 20× cost savings), Mercedes-Benz, Cognizant [CU003]
Security Incident 09
Prompt-injection vulnerability disclosed Dec 2024; patched [CR006]
Benchmark Gap 10
Devin 13.86% SWE-bench Full (2024); Claude Code 72.5% Verified (2025) [CR018]

Company profile

Cognition AI is a San Francisco-based AI startup founded in 2023 by Scott Wu, Steven Hao, and Walden Yan—competitive programming champions and former olympiad medalists. The company's flagship product, Devin, is a cloud-hosted autonomous AI software engineer that plans, writes, tests, and deploys code end-to-end without constant human input. Devin is accessed via API, Slack, Jira, or a web interface and priced on a usage-based model (ACUs). In July 2025, Cognition acquired Windsurf IDE, adding 250K+ daily active users and 350+ enterprise accounts. The company has raised approximately $1.575B, with the most recent $400M round in September 2025 valuing it at $10.2B.

Website
cognition.ai
Founded
2023-01-01
Founders
Scott Wu, Steven Hao, Walden Yan
Founding location
San Francisco, CA
Headquarters
San Francisco, CA
Product
Devin is a cloud-hosted autonomous AI software engineer that accepts tasks via Slack, Jira, or web interface, then plans, writes code, tests, and deploys independently using Agent Compute Units (ACUs). Devin 2.0 (April 2025) added direct PR merging at a 3× lower effective price per task. Windsurf IDE (acquired July 2025) adds 250K+ daily active users and 350+ enterprise accounts to the platform.
Customers
Enterprise engineering teams (1,000+ developers), mid-market technology companies, and individual professional developers. Named enterprise customers: Nubank, Mercedes-Benz, Cognizant. US Government vertical launched February 2026.
Business model
Usage-based SaaS: $500/month Team plan plus $2.25/ACU overage; Enterprise plan requires direct negotiation. Windsurf IDE adds free-to-paid conversion funnel. Cognizant channel partnership adds reseller revenue layer.
Stage
Late Private (post-Series A)
Funding status
$21M seed (2023), $175M Series A at $2B (Founders Fund IX, Jan/Apr 2025), $400M round at $10.2B (Sep 2025). Total ~$1.575B disclosed.
[CO001, CO002, CO003, CI001, CI002]

Executive summary

Top strengths

  • Exceptional ARR growth: $1M to $73M in 9 months represents a generational growth sprint with no direct analog in enterprise SaaS history
  • Category pioneer: Devin established 'autonomous AI software engineer' as an enterprise procurement line item before most competitors recognized the category
  • Windsurf IDE integration: 250K+ DAU and 350+ enterprise accounts create a unique combined agent-plus-IDE distribution moat not available to single-product competitors
  • Lighthouse customer proof: Nubank (12× efficiency, 20× cost savings, published case study) is a high-quality reference for enterprise ROI in financial services
  • Founder pedigree: Scott Wu, Steven Hao, Walden Yan are competitive programming champions with deep AI/ML engineering intuition relevant to autonomous coding
  • Devin eats its own dog food: 659 Devin PRs/week merged internally, providing rapid model improvement feedback loop

Top risks

  • Benchmark commoditization: Claude Code Opus 4 at 72.5% SWE-bench Verified vs. Devin's 13.86% Full demonstrates a 5× competitive improvement in 15 months; technical differentiation may erode by late 2026
  • Security overhang: December 2024 prompt-injection vulnerability; no published post-Devin-2.0 penetration test or bug-bounty program; expanded autonomous PR-merge capability increases attack surface
  • Valuation multiple: $10.2B at ~140× estimated ARR is a 7× premium to Cursor/Anysphere; risk-adjusted return at this entry is negative in the base case and catastrophic in the bear case
  • LLM provider conflict-of-interest: Anthropic and OpenAI are both primary LLM API suppliers and direct competitors via Claude Code and OpenAI Codex; pricing leverage and terms are undisclosed
  • Customer concentration: Only one quantified enterprise case study (Nubank); $73M ARR from an undisclosed small number of accounts creates high single-customer churn sensitivity
  • EU AI Act GPAI compliance: August 2026 deadline; no disclosed GPAI registration, EU DPO appointment, or transparency documentation; EU expansion announced without compliance infrastructure

Open gaps

  • Net Revenue Retention (NRR) by cohort: undisclosed; the single most critical metric for validating ARR quality at $10.2B valuation; must be ≥110% to justify premium
  • Gross margin by product line: undisclosed; LLM inference cost structure unknown; path to profitability cannot be modeled without this metric
  • Top-5 customer ARR concentration: undisclosed; Nubank is the only quantified reference; concentration above 30% in a single customer is a thesis-break risk
  • Post-Devin-2.0 security audit: no public penetration test report or red-team disclosure post-April 2025 expanded agentic capabilities; SOC 2 Type II (March 2024) predates current attack surface
  • Windsurf integration customer retention: 350+ enterprise accounts acquired July 2025; 6-month retention rate (January 2026) unknown; critical proof point for IDE moat thesis
  • LLM API contract terms: volume pricing, agentic use-case restrictions, and change-of-control provisions with Anthropic/OpenAI are undisclosed; margin floor and supplier conflict-of-interest cannot be assessed

Contents

Chapter 01

01Company Overview

1.1 Identity and Business Model

Cognition AI is a San Francisco-based AI-native software company building autonomous agents that perform end-to-end software engineering tasks. Its flagship product, Devin, is marketed as the world's first AI software engineer: an agent that can interpret tickets, plan solutions, write and debug code, run tests, and deploy to production with minimal human intervention. The company operates a SaaS subscription model for individual developers and small teams (Core plan at $20/month), a Team plan at $500/month including 250 Agent Compute Units, and custom enterprise contracts with VPC deployment options. Following the July 2025 acquisition of Windsurf—an AI-native IDE—Cognition expanded its product portfolio to serve developers who prefer a more interactive, IDE-centered workflow rather than a fully autonomous agent. The dual-product strategy allows Cognition to address both the agentic automation segment and the traditional coding-assistant market that GitHub Copilot and Cursor dominate. Revenue is primarily usage-based: additional Agent Compute Units beyond plan inclusions are priced at roughly $2.25 per ACU, creating a metered model that scales with enterprise task volume. The company's website and blog are public-facing channels; it does not publish GAAP financials and reports only via investor communications and company-controlled press releases.[CO001, CO002, CO003, CO028, CO029]

1.2 Founders and Leadership

Cognition AI was co-founded by three elite competitive programmers, each an International Olympiad in Informatics gold medalist, giving the company an exceptionally deep technical foundation from inception. Scott Wu, CEO, is a three-time IOI gold medalist, a 2011 MATHCOUNTS national champion, and holds a Harvard economics degree. On the Codeforces competitive programming platform, Wu holds the top designation of Legendary Grandmaster, reflecting continued excellence in algorithmic problem-solving. Prior to Cognition, he co-founded Lunchclub, an AI-powered professional networking product. Steven Hao, CTO, is also an IOI gold medalist with prior experience at Scale AI, DeepMind, Waymo, Nuro, Modal, and Cursor—a background spanning both frontier AI research and production engineering systems. Walden Yan, CPO, brings product leadership and IOI-level algorithmic depth. The founding team of approximately 10 individuals collectively held 10 IOI gold medals at launch, an extraordinary concentration of competitive-programming talent rarely seen at an early-stage startup. This profile has attracted top-tier investors and talent but also creates a meaningful key-person dependency: the founders' technical credibility is inseparable from the company's market positioning. No board composition or independent director information has been publicly disclosed.[CO004, CO005, CO006, CO007, CO026, CO027]

Leadership and Founder Table
NameRoleEducational/Competitive BackgroundPrior ExperienceKey-Person Dependency
Scott WuCEO & Co-founder3× IOI gold medal; 2011 MATHCOUNTS national champion; Harvard (economics); Codeforces Legendary GrandmasterLunchclub co-founderCritical — brand and technical credibility inseparable from product
Steven HaoCTO & Co-founderIOI gold medalScale AI, DeepMind, Waymo, Nuro, Modal, CursorCritical — core model and system architecture
Walden YanCPO & Co-founderIOI gold medalProduct leadership in AIHigh — product vision and roadmap

Board composition, independent directors, and broader leadership team (VP-level and below) are not publicly disclosed. Only co-founders have been publicly identified.

[CO004, CO005, CO006, CO007, CO026, CO027]

1.3 Funding History and Investors

Cognition AI's capital formation has been among the fastest in recent venture history. An initial ~$21M seed/Series A round in March 2024 valued the company at approximately $350M before any meaningful revenue existed, reflecting investor conviction in the founding team's ability to execute. Just weeks later in April 2024, Cognition closed $175M at a $2B valuation, crossing into unicorn territory within six months of founding. Founders Fund—Peter Thiel's venture firm—led both rounds, providing not only capital but reputational signal to the broader market. By September 2025, following the Windsurf acquisition and rapid ARR growth, Cognition raised $400M at a post-money valuation of $10.2B, bringing total capital raised to approximately $696M. Co-investors in the September 2025 round included Lux Capital, 8VC, Elad Gil (prominent angel and operator), Bain Capital Ventures, D1 Capital, Definition Capital, and Swish Ventures. As of May 2026, the company is reportedly in discussions for a new financing round at a potential $25B+ valuation, implying continued strong investor appetite for agentic AI coding infrastructure. Notably, the company has disclosed a total net cash burn of under $20M from founding through Q3 2025, suggesting the rapid growth was achieved with remarkable capital efficiency prior to the acquisition.[CO008, CO009, CO010, CO020, CO030, CO031]

Cognition AI Snapshot KPI Table
MetricValue / StatusAs-Of DateConfidenceGap / Caveat
Valuation (last round)$10.2BSep 2025highPost-money; pre-money not disclosed
Total capital raised~$696MSep 2025highSum of known disclosed rounds
ARR (Devin, pre-acquisition)$73MJun 2025highCompany-disclosed via blog
ARR (post-Windsurf combined)~$155MJul 2025mediumPartly company-claimed; Windsurf component is estimate
Initial funding (seed/Series A)$21M at $350MMar 2024highFirst disclosed institutional round
Series A extension$175M at $2BApr 2024highCompany announcement + press
Net cash burn (inception to Q3 2025)<$20MSep 2025mediumCompany-claimed; not audited
Headcount (est.)~50-2502025lowPre-acquisition ~49; Windsurf integration unclear
Enterprise customers (Windsurf)350+Jul 2025mediumWindsurf-only figure at acquisition
Price to ARR multiple (Sep 2025)~65-140xSep 2025mediumDepends on which ARR base is used

Revenue and valuation figures are company-reported or analyst-estimated; no audited financials are publicly available. ARR figures represent annualized subscription and usage revenue. Multiple ranges reflect pre- vs. post-Windsurf ARR bases.

[CO008, CO009, CO010, CO014, CO015, CO016]
Stakeholder or investor map
InvestorRound(s)RoleControl / Economic ImportanceDiligence Ask
Founders FundSeed/Series A + $400MLead investorLargest single institutional holder; likely board representationConfirm board seats, anti-dilution, pro-rata rights
Lux Capital$400M (Sep 2025)Co-investorDeep-tech focus; likely minority stakeConfirm ownership %
8VC$400M (Sep 2025)Co-investorTech-focused growth fundConfirm ownership %
D1 Capital$400M (Sep 2025)Co-investorLate-stage growth equityConfirm ownership % and governance rights
Bain Capital Ventures$400M (Sep 2025)Co-investorEnterprise SaaS experience; minorityConfirm ownership %
Elad GilMultiple roundsIndividual investorProminent angel; advisory relationship likelyConfirm advisory vs. formal board role
Definition Capital$400M (Sep 2025)Co-investorTechnology focusedConfirm size of check
Swish Ventures$400M (Sep 2025)Co-investorEarly-stage / growth technologyConfirm role

Ownership percentages are not publicly disclosed. Round participation derived from press releases and news reports. Board rights, anti-dilution terms, and liquidation preferences are unknown.

[CO008, CO009, CO010, CO020, CO036]
FO002: Cognition AI ARR and Valuation Snapshot KPIs

Key financial and operational metrics as of the most recent disclosed data points.

ARR is a combination of company-disclosed Devin ARR ($73M Jun 2025) and Windsurf ARR ($82M at acquisition Jul 2025). Valuation is post-money from the September 2025 financing. Lifetime burn is company-stated and unaudited.

[CO037, CO042, CO015, CO016]

1.4 Scale and Milestones

Cognition AI's revenue trajectory is extraordinary by any SaaS benchmark. The company exited 2024 at approximately $1M ARR (September 2024) and accelerated to $73M ARR by June 2025—a roughly 73x increase in nine months driven by enterprise adoption of Devin for code migration, bug-fixing, and feature automation tasks. The July 2025 acquisition of Windsurf, which contributed $82M ARR from 350+ enterprise accounts, pushed the combined business to approximately $155M ARR within weeks. Major enterprise clients including Goldman Sachs (reportedly piloting Devin with 12,000 developers), Citi, Dell, Cisco, Ramp, Palantir, Nubank, and Mercado Libre have been publicly cited. Nubank reportedly achieved a 12x efficiency improvement in code migration workflows using Devin. In April 2026, Cognition opened a Singapore APAC headquarters, signaling international expansion intent. The same month, a Mercedes-Benz partnership was announced. Headcount data is not publicly disclosed in detail; pre-acquisition estimates placed the Cognition team at roughly 49 employees, while the Windsurf acquisition added potentially 200 individuals before layoffs and buyout departures.[CO011, CO012, CO013, CO014, CO015, CO021]

Milestone table
DateEventTypeAmount / Valuation / StatusParticipantsImplication
Nov 2023Company founded by Scott Wu, Steven Hao, and Walden YanfoundingN/AThree IOI gold medalistsLaunch of first purpose-built agentic AI coding company
Mar 2024$21M seed/Series A at $350M valuationfinancing$21M / $350MFounders FundInstitutional validation before first product release
Mar 2024Devin publicly announced; SWE-bench 13.86% score publishedproductN/ACognition AIWorld's first AI software engineer claim; generated global press
Apr 2024$175M raise at $2B valuationfinancing$175M / $2BFounders Fund + othersUnicorn status within ~5 months of founding
Sep 2024$1M ARR milestonescale$1M ARRN/AEarly commercial traction from self-serve enterprise
Jan 2025Devin 2.0 launched with multi-agent parallelizationproductN/ACognition AI83% efficiency gain claimed vs. v1.x; parallel task execution
Jun 2025$73M ARR reached (Devin only)scale$73M ARRN/A73x ARR growth in 9 months; breakthrough enterprise adoption
Jul 2025Windsurf acquired (~$250M est.)product~$250M / +$82M ARR / 350+ customersCognition AI / WindsurfAdded AI-native IDE product and enterprise customer base
Aug 202530 Windsurf employees laid off; buyouts offered to ~200adverse9-month salary buyout packagesCognition AI CEO Scott WuCulture integration controversy; talent attrition risk
Sep 2025$400M raise at $10.2B valuationfinancing$400M / $10.2BFounders Fund, Lux, 8VC, D1, Bain, Elad Gil + othersDecacorn status; 5x valuation expansion in 18 months
Apr 2026Singapore APAC headquarters openedscaleN/ACognition AIFirst international office; APAC enterprise penetration
Apr 2026Mercedes-Benz partnership announcedpartnershipN/ACognition AI, Mercedes-BenzEuropean automotive enterprise entry point

Dates and amounts derived from company announcements and press coverage. Windsurf acquisition price is an industry estimate; official terms not disclosed. Milestone table reflects publicly known events only; internal pivots and regulatory activity are not known.

[CO001, CO008, CO009, CO011, CO014, CO017]
FO001: Cognition AI Company Milestone Timeline

Key founding, financing, product, scale, and adverse events from November 2023 through April 2026.

[CO034, CO037, CO041, CO042, CO025, CO033]
FO003: Cognition AI Business System Flow

How Cognition AI's identity, products, customers, and capital relate to each other.

[CO003, CO011, CO021]

1.5 Governance and Key-Person Risk

Cognition AI is a privately held C-corporation with no public financial disclosures, no publicly identified board composition, and no independent directors known to investors or analysts outside the cap table. The company's governance structure remains opaque: investor rights, board seats granted to Founders Fund and other lead investors, and secondary market transaction terms have not been disclosed. The extreme concentration of product vision and technical credibility in the three co-founders—particularly Scott Wu, whose public persona and coding legend status are inseparable from Devin's brand—creates substantial key-person risk. The mandatory 80-hour, six-day workweek culture that prompted voluntary buyout offers to Windsurf employees in 2025 raises questions about talent retention, especially for acquired personnel who did not self-select for that environment. Post-acquisition attrition among Windsurf engineers could erode the IDE product capabilities that justified the ~$250M deal. Critics and independent reviewers have noted that Devin's real-world task success rates are substantially lower than company-curated benchmark demonstrations suggest, and that the 'fully autonomous software engineer' framing may oversell current capabilities, creating future investor and customer expectation risk.[CO034, CO035, CO041, CO022, CO023]

1.6 Exhibits

Chapter 02

02Market Analysis

2.1 Market Definition and Boundaries

The market for Cognition AI's Devin product sits at the intersection of two overlapping categories: AI-powered developer tools (copilots, code completion, review automation) and autonomous software engineering agents (end-to-end task execution requiring planning, multi-step reasoning, and self-correction). The former is a large established market anchored by GitHub Copilot with $2B+ ARR; the latter is an emerging sub-segment where Devin occupies a pioneer position. For sizing purposes, the relevant Total Addressable Market spans tools that professional software development teams might purchase to increase engineering velocity—from autocomplete plugins through fully autonomous agents that can execute tickets without human oversight. The included spend is: enterprise and SMB subscriptions to AI coding assistants, autonomous agent platforms sold to engineering teams, and API consumption for programmatic code generation. Excluded spend is: general large language model API access not sold for code use-cases, low-code/no-code application builders targeting business users (distinct buyer and workflow), infrastructure software (compute, storage, networking), and traditional outsourced software development services. Status-quo substitutes include offshore staff augmentation, traditional IDEs without AI, internal developer tooling built on base LLM APIs, and cloud-vendor bundled IDE extensions such as AWS CodeWhisperer and Azure GitHub Copilot bundles. The market boundary is contested across analyst firms. Grand View Research segments "generative AI coding assistants" narrowly at $92–98M by 2030 using a restrictive scope, while Mordor Intelligence defines "AI code tools" broadly at $24B by 2030. Cognition's Devin occupies the premium agentic sub-segment with $500/month Team plans and enterprise contracts in the $500K+ per year range—far above the $10–$19/seat copilot tier. The serviceable addressable market thus comprises large enterprises willing to pay a productivity premium for autonomous ticket resolution, estimated at 5–15% of the global developer population of 27–28.7 million professional engineers. [CM001, CM003, CM007, CM021, CM023]

Market definition table
CategoryIncluded SpendExcluded SpendBuyer / PayerRelevance to Cognition
AI Coding Assistants (copilot tier)IDE plugin subscriptions, autocomplete APIs, code review botsGeneral LLM API access not sold for codeDeveloper / Engineering budgetIndirect: establishes category norm Devin upgrades from
Autonomous Software Engineering Agents (agent tier)End-to-end ticket resolution platforms, agent-as-engineer subscriptions, enterprise agent deploymentsRPA tools, test automation (non-LLM)VP Eng / CTO / IT budgetDirect: Devin's primary TAM
Low-Code / No-Code PlatformsVisual app builders, citizen-developer tools, spreadsheet automationExcluded entirely from Cognition's TAMBusiness users (not engineers)Not addressable by Devin
Traditional Outsourced Dev ServicesStaff augmentation, consulting, offshore engineering teamsIn-house headcount salariesProcurement / FinanceSubstitution target: Devin competes for this budget
Cloud IDE and Infrastructure BundlesAWS CodeWhisperer, Azure Dev tools, GCP AI bundled with cloud servicesCore cloud compute, storage, networkingEngineering / Cloud buyer (bundled)Partially competitive with Cognition for enterprise engineering wallet

Market boundary is disputed across analyst firms; this table reflects Cognition's serviceable scope, not industry consensus definitions. Row 3 (low-code) is explicitly excluded from Cognition's TAM despite some analyst reports conflating it with AI code tools.

[CM001, CM021, CM023]

2.2 Market Sizing — TAM, SAM, and SOM

Multiple independent sizing lenses are applied to bound the opportunity and reconcile widely varying analyst estimates. The lens approach is preferred over a single top-down estimate given material methodological variation across sources. Lens 1 — AI code tools market (Mordor Intelligence, 2025): The broad AI code tools market is valued at $7.37B in 2025 and forecast to reach $23.97B by 2030 at a 26.6% CAGR. This covers all AI-assisted coding tools from IDE plugins through full agents. TAM candidate: $7.4B (2025), $24B (2030). Limitation: overly broad; includes low-code tools, documentation AI, and testing automation not directly comparable to Devin's buyer profile. Lens 2 — Developer seats × willingness-to-pay: 27–28.7 million professional developers globally (Evans Data / Statista 2024). GitHub Copilot at $10–$19/month captures approximately 1.3 million paid seats from this pool in 2025. Autonomous agent tools targeting enterprise buyers at $500–$5,000/month per team are addressable to perhaps 10% of enterprise engineering teams—roughly 500K–1M developer seats × $2,000/year average contract value = $1B–$2B SAM in 2026, expanding to $5B–$8B by 2030 as adoption broadens. Lens 3 — Enterprise software engineering spend share: Global software spending reached approximately $675– $700B in 2024 (WIPO). Even a 1% shift of software project labor costs to AI tools implies a $6.7B market; a 3% shift implies $20B. This is an upper-bound lens corroborating the Mordor $24B 2030 figure. Lens 4 — Gartner adoption curve: Gartner forecasts 90% of enterprise engineers using AI code assistants by 2028 versus 14% in early 2024. Total estimated enterprise engineering headcount globally approximately 15 million. At 90% penetration × $500/year average price = $6.75B enterprise sub-segment alone by 2028. These estimates conflict materially: the narrow Grand View Research figure of $92–98M by 2030 excludes coding-adjacent AI tools entirely, while the $97.9B ResearchAndMarkets figure appears to use an atypically broad scope that inflates the addressable market. Diligence should weight the $1–8B SAM range most heavily. [CM001, CM002, CM003, CM004, CM005, CM006]

TAM/SAM/SOM or sizing lens table
PublisherYearGeographyValueCAGRMethodologyConfidenceLimitation
Mordor Intelligence2025→2030Global$7.37B→$23.97B26.6%Bottom-up vendor revenues + adoption modelingmediumBroad scope includes low-code tools; overstates Devin-comparable market
Grand View Research2024→2030Global$25.9M→$92.5M (narrow)24.8%Narrow 'generative AI coding assistants' onlymediumLikely understates by excluding broader developer AI tools
ResearchAndMarkets / BusinessWire2025→2030Global→$97.9B24.8%Broad AI tools including dev infrastructurelowTAM figure implausibly large; includes non-comparable segments
Developer seats lens (this analysis)2025→2030Global$1–2B SAM→$5–8B~40%27M developers × enterprise penetration × ASPmediumASP assumption ($2K/yr avg) sensitive to enterprise/SMB mix
Gartner adoption lens (this analysis)2028Enterprise global~$6.75B enterprise segmentN/A90% enterprise penetration × 15M eng × $500/yrmediumAssumes full Gartner adoption curve materializes by 2028
Agentic AI market (CMR / VC Cafe)2025→2034Global$4.35B→$103B>40%Cross-sector autonomous agent spendlowIncludes non-software verticals; directionally supportive only

Wide TAM range reflects fundamentally different scope definitions. SAM lens ($1–2B today) is the most actionable figure for Cognition planning. Low/base/high scenario: $1B / $7B / $24B by 2030 for the broad AI code tools market.

[CM001, CM002, CM003, CM004, CM005, CM024]
FM001: Market sizing lens

TAM/SAM/SOM layers for the AI autonomous software engineering market as of 2025, showing addressable market funnel from broad AI tools to Cognition's target SOM

SAM and SOM are analyst estimates based on developer-seat and enterprise-spend lenses; actual market boundaries disputed across research firms.

[CM001, CM005, CM007, CM035]
FM002: Market estimate range

Low / base / high estimates for the AI code tools market by 2030, showing analyst disagreement across scope definitions

All values are 2030 projections in USD billions. Wide range reflects fundamentally different scope definitions across analyst sources. ResearchAndMarkets $97.9B figure excluded as outlier with implausibly broad scope.

[CM002, CM003, CM004, CM005, CM024]

2.3 Buyer, User, and Payer Segmentation

Cognition AI targets three principal buyer segments with distinct budget ownership and adoption triggers. The primary segment, enterprise software teams, sees VP Engineering or CTO act as buyer with software engineers as users and IT budget as payer. The adoption trigger is a measurable productivity gap—typically revealed by benchmarking AI tool ROI against headcount costs. Reference customers Goldman Sachs (12K developer pilot), Citi, Cisco, Dell, and Nubank illustrate this segment. Deal sizes range from $100K to $2M+ annually for enterprise contracts, with sales cycles of 3–9 months including legal and security review. The secondary segment, high-growth technology startups, sees CTOs or founding engineers as both buyer and user, with founders allocating seed capital as payer. The adoption trigger is velocity: Devin allows a 10-person team to execute like a 50-person team. These buyers typically start on the $20/month Core plan and upgrade to Team. Key reference customers include Ramp and Nubank, which reported a 12× efficiency gain. Deal value is lower at $500–$6,000/year but volume is high and conversion to enterprise contracts is likely as companies scale. The tertiary segment, individual developers and freelancers, presents lower strategic priority. Buyers and users are the same person; payers are self-employed developers billing hourly. The adoption trigger is competitive necessity as AI tooling becomes table-stakes for professional developers. The Core plan at $20/month targets this group, which has high churn sensitivity and primarily serves as a product feedback loop and lead-generation funnel rather than a primary revenue driver. Financial services enterprises represent a cross-cutting priority segment given Goldman Sachs' 12K-developer Devin pilot and Citi's reported deployment—these buyers require SOC 2 Type II, data residency, and regulatory compliance commitments before enterprise sign-off. [CM010, CM011, CM018, CM019, CM022, CM025]

Segment / buyer map
SegmentBuyerUserPayerWorkflowBudget OwnerAdoption Trigger
Enterprise F500 EngineeringVP Engineering / CTOSoftware engineers, DevOpsEngineering / IT budgetTicket resolution, code review, new feature developmentCTO or VP EngDeveloper headcount cost pressure; AI ROI demonstrated by pilot
Mid-market SaaS CompaniesCTO / Eng ManagerBackend + frontend developersEngineering budgetFeature velocity, tech debt paydownCTOSpeed-to-market competitive pressure; fundraise-to-launch cycle time
High-Growth Startups (Seed–Series B)Founding engineer / CTODevelopers + CTOFounder budget / seed capitalFull-stack feature development, MVP buildsFoundersCapital efficiency: replace 2–3 developer headcount cost
Financial Services EnterprisesCIO / Head of Platform EngCompliance-conscious engineersIT / digital transformation budgetInternal tooling, regulatory reporting codeCIORegulatory tech modernization + developer cost reduction
Individual Developers / FreelancersSelfSelfSelfClient project execution, side projectsIndividualCompetitive necessity; peers using AI tools

Budget ownership varies: enterprise deals flow through formal procurement with security review lasting 3–9 months; startup deals are founder-approved in days. Financial services buyers require SOC 2 Type II and data residency commitments before enterprise sign-off.

[CM018, CM019, CM025, CM026]
FM003: Buyer / segment map

Buyer-user-payer relationships and value flow across Cognition AI's key market segments from budget owner through to enterprise deployment

[CM018, CM019, CM025, CM026, CM028]

2.4 Growth Drivers and Adoption Constraints

The market is subject to strong structural tailwinds as well as specific constraints governing adoption velocity and the pace at which Cognition can monetize the opportunity. Growth drivers include documented developer productivity gains—GitHub Copilot users complete tasks 51–55% faster with 46% of code AI-generated, and organizations using AI tools report 3.2× developer productivity improvements—which validate the ROI case and lower the procurement approval bar. The rapid shift to agentic AI is particularly significant: enterprise spend on agentic AI systems is projected to surge from under $1B in 2024 to $51B+ by 2028 at approximately 150% CAGR, normalizing autonomous agent budgets across enterprise technology organizations. Labor cost and talent scarcity create a compelling economic case. The median US software engineer salary exceeds $130K annually, meaning a $500/month tool that autonomously resolves even 10–20% of engineering tickets has a self-funding ROI measurable in weeks rather than quarters. Gartner's Magic Quadrant coverage of AI code assistants provides enterprise procurement legitimacy, and the Gartner forecast of 90% enterprise adoption by 2028 creates a top-down organizational mandate supporting category-level spending approval. The primary constraints are trust and governance: enterprises universally require role-based access controls, audit trails, IP ownership clarity, and security review before autonomous deployment, adding 3–9 months to enterprise sales cycles. Hallucination and code quality risk mean autonomous agents can introduce hard-to-catch bugs; buyers insist on review layers that limit full automation scope. Integration with legacy toolchains—on-premise Git servers, air-gapped CI/CD pipelines, proprietary IDEs—increases deployment cost and complexity. The EU AI Act and US executive orders on AI safety impose documentation and explainability requirements on enterprise AI systems in critical workflows. Competitive fragmentation, with GitHub Copilot (42% market share), Cursor ($9B valuation), Windsurf (now part of Cognition), and OpenAI Codex all competing, can produce buyer choice overload that stalls procurement decisions. [CM008, CM009, CM012, CM013, CM014, CM015]

Growth drivers and constraints table
FactorDirectionTimingImplicationDiligence Ask
Developer productivity ROI documented (51–55% faster with AI)TailwindPresent (2024–2026)Validates economic case; accelerates procurement approval for AI toolsVerify Cognition-specific ROI data from customers, not just GitHub Copilot proxies
Gartner: 90% enterprise AI code adoption by 2028TailwindNear-term (2026–2028)Category legitimacy; creates top-down mandate in large enterprisesConfirm enterprise customers are accelerating commitments in 2026
Agentic AI spend: $1B→$51B by 2028 (150% CAGR)TailwindNear-term (2026–2028)Massive budget reallocation toward autonomous tools expands TAM rapidlyMonitor whether enterprise CFOs approve agentic budgets separately from copilot budgets
Software engineer talent scarcity and salary inflationTailwindOngoingMakes $500/month Devin compelling vs. $130K+ engineer hire economicallyTrack developer salary surveys; assess if AI tools slow hiring demand in target segments
Hallucination / code quality trust deficitHeadwindNear-term (2025–2027)Slows autonomous deployment; requires human review layers limiting full TAM captureAsk customers what % of Devin output is deployed without human review in production
Enterprise security and IP ownership concernsHeadwindPresent and ongoingExtends sales cycles 3–9 months; restricts Fortune 500 deployments pending legal reviewReview Cognition's IP terms, data handling, and SOC 2 / ISO 27001 certifications
EU AI Act and US AI governance regulationHeadwindMedium-term (2026–2028)Adds compliance documentation burden; may require explainability features Devin lacksReview Cognition's regulatory compliance roadmap; assess EU market readiness
Competitive fragmentation (Copilot, Cursor, Windsurf, Codex)HeadwindPresentBuyer choice overload stalls procurement; risk of commoditization at copilot tierTrack whether Devin's agentic differentiation sustains or competitors close the gap

Timing reflects estimated onset or peak relevance of each factor. All tailwinds are structural (secular trends) rather than cyclical, suggesting the favorable dynamics are durable over the 3–5 year investment horizon.

[CM008, CM009, CM012, CM013, CM014, CM020]
FM004: Adoption funnel or value-chain map

Adoption journey from developer awareness to full autonomous deployment of Cognition AI Devin in enterprise environments

Funnel stage percentages are estimates based on Gartner adoption data and reported customer counts; intermediate stages (evaluation, departmental) are analyst estimates not directly published by any research firm.

[CM008, CM010, CM015, CM017, CM019]

2.5 Exhibits

Chapter 03

03Competitors

3.1 Competitive Landscape Overview

The AI-assisted software engineering market in 2026 is stratified into three broad competitive tiers. The first tier—established workflow co-pilots—includes GitHub Copilot (Microsoft) and Amazon Q Developer, both backed by cloud hyperscaler distribution. GitHub Copilot commands an estimated 42% category market share with 20 million users and over $2 billion in annual recurring revenue as of early 2025, deeply embedded across enterprise developer toolchains. Amazon Q Developer provides similar completion and chat capabilities optimized for AWS workloads, targeting the large installed base of AWS enterprise customers with SOC 2 Type II compliance and native IAM integrations. The second tier—IDE-native AI coding assistants—is led by Cursor (Anysphere), which reached $2 billion ARR by February 2026 after launching its first paid tiers in mid-2024 and now operates at a $29.3 billion valuation. Cursor's VS Code fork model allows deep editor integration unavailable to plugin-based competitors, and its multi-agent parallel execution architecture (up to eight concurrent agents) increasingly overlaps with Devin's autonomous coding use case. Windsurf (formerly Codeium), now acquired by Cognition, previously competed in this tier before integration. The third tier—fully autonomous AI software engineers—is where Cognition's Devin directly positions. Claude Code (Anthropic), OpenAI Codex (re-launched as a web-based agent), and SWE-agent (open-source academic benchmark harness) all compete here to varying degrees. Claude Code's terminal-native approach and superior code quality on SWE-bench evaluations represent a quality threat, while OpenAI Codex's re-launch as an agentic execution environment targets Devin's core positioning. The differentiation frontier is shifting from benchmark scores toward measurable enterprise workflow integration and verifiable ROI metrics—terrain where Cognition's deployment evidence with Goldman Sachs and others provides a current advantage. [CP001, CP002, CP003, CP033]

Competitor profile table
CompetitorCompanyFoundedTierARR (est.)ValuationKey Investors
GitHub CopilotMicrosoft/GitHub2021Hyperscaler co-pilot$2B+N/A (MSFT)Microsoft
CursorAnysphere2022IDE-native assistant$2B$29.3B (Nov 2025)Thrive, a16z, Accel, Nvidia, Google
Claude CodeAnthropic2023Autonomous agentN/A (bundled)$60B+Google, Amazon, Salesforce
Amazon Q DeveloperAmazon Web Services2023Hyperscaler co-pilotN/A (AWS bundle)N/A (Amazon)Amazon
OpenAI CodexOpenAI2025 (re-launch)Autonomous agentN/A (bundled)$300B+Microsoft, a16z, Thrive
ReplitReplit Inc.2016Cloud IDE platform$80M est.$1.2BAndreessen Horowitz, Coatue
SWE-agentPrinceton NLP2024Open-source frameworkFree/open-sourceN/AAcademic
Windsurf (pre-acq.)Codeium (acquired)2021IDE-native assistant$82M (at acq.)$250M est.General Catalyst, Kleiner

ARR and valuation figures are estimates from secondary sources as of early 2026 where primary sources unavailable.

[CP001, CP002, CP003, CP005, CP006, CP016]
FP001: Competitive positioning map
[CP025, CP033, CP037]

3.2 Tier-One Hyperscaler-Backed Competitors

GitHub Copilot, owned by Microsoft since the GitHub acquisition, benefits from unmatched distribution embedded in the world's largest developer platform. With 90% of Fortune 100 companies using Copilot and over 20 million active developers, its scale creates high switching-cost inertia. Copilot's pricing ranges from $10 per month for individuals to $19 per month for Business tier and $39 per month for Enterprise, making it the default low-cost entry point for most developer teams. The platform has progressively expanded from inline code completions to chat, pull-request summarization, and workspace-level Copilot Workspace—a multi-step planning and execution agent that increasingly encroaches on Devin's autonomous territory. GitHub Copilot reached over $2 billion in annual recurring revenue by early 2025, making it the largest pure-play AI coding revenue line globally. Amazon Q Developer addresses the enterprise segment's compliance and security requirements most directly, offering a Free tier and a Pro tier at $19 per user per month. Its native AWS service integrations, 200,000-token context window, and Agents for Amazon CodeWhisperer capability set position it as the preferred choice for organizations standardized on AWS. Q Developer achieved SOC 2 Type II certification and supports VPC isolation—security postures unavailable from most startup competitors. The hyperscaler tier's primary limitation versus Cognition is the absence of full-cycle autonomous task completion: neither Copilot nor Q Developer autonomously plans, implements, tests, and deploys end-to-end without human-in-the-loop confirmation at each stage. This architectural constraint creates the addressable gap Devin is designed to exploit, though both hyperscalers are actively extending agent capabilities toward this frontier. [CP004, CP009, CP010, CP011, CP019, CP022]

Feature / capability matrix
FeatureDevin (Cognition)GitHub CopilotCursorClaude CodeAmazon Q Dev
Autonomous end-to-end executionYes (full cycle)Partial (Workspace)Partial (8 agents)Partial (human-supervised)Partial (Agents feature)
IDE integrationSaaS + Windsurf IDEVS Code, JetBrains, VimVS Code forkTerminal / CLIVS Code, JetBrains, CLI
Context windowNot disclosedNot disclosedNot disclosed200K tokens200K tokens
Multi-agent parallelYes (Devin Teams)NoYes (up to 8)NoNo
Open-source model optionNoNoModel routing (multiple)NoNo
SOC 2 / Enterprise compliancePartialEnterprise tierEnterprise tierEnterprise via APISOC 2 Type II
Free tierNoFree (limited)Free (hobby)Yes (limited)Free (50 req/mo)
VPC / on-prem deploymentNot documentedNoNoVia API gatewayYes (VPC)

Capability ratings are based on publicly documented features as of May 2026; roadmap features excluded.

[CP008, CP010, CP012, CP022, CP025, CP030]
Pricing / packaging comparison
ToolFree TierEntry PaidMid TierEnterprise
Devin (Cognition)No$20/mo (5 ACUs)$500/mo (250 ACUs, Team)Custom
GitHub CopilotYes (2k completions)$10/mo (Individual)$19/mo (Business)$39/mo (Enterprise)
CursorYes (hobby)$20/mo (Pro)$40/mo (Business)Custom
Claude CodeYes (limited)$10/mo (Pro)$100/mo (Max)Enterprise API
Amazon Q DeveloperYes (50 req/mo)$19/mo (Pro)N/ACustom / Marketplace
ReplitYes$25/mo (Core)$40/mo (Teams)Custom

Pricing as of May 2026 from official product pages; subject to frequent change.

[CP009, CP015, CP019, CP023]
FP002: Feature breadth / capability map
[CP008, CP010, CP022, CP029]

3.3 IDE-Native Competitors — Cursor and Windsurf

Cursor (Anysphere) is the most commercially formidable competitor to Cognition in the pure-ARR sense, having scaled from zero to $2 billion ARR within approximately 24 months of its public launch. Its Series D funding round in November 2025 raised $2.3 billion at a $29.3 billion valuation—the largest venture financing in AI coding history at the time. Investors include Thrive Capital, Andreessen Horowitz, Accel, Nvidia, and Google. Cursor's architecture relies on a VS Code fork with deep token-budget controls, model-agnostic routing (Sonnet, GPT-4o, Gemini), and a multi-agent background agents feature that runs up to eight parallel task threads—narrowing the functionality gap with Devin's full-autonomy proposition. Cursor's 1 million-plus paying customers and 50,000 enterprise teams demonstrate product-market fit across the individual developer and team segments. Cursor is growing faster than Cognition on ARR metrics and has more paying customers, providing proportionally more training signal and pricing power. Windsurf (formerly Codeium) built strong developer mindshare with a free-tier, VS Code compatible AI IDE and over 800,000 users before the Cognition acquisition in July 2025. Post-acquisition, Windsurf's enterprise book ($82M ARR contributed, 350-plus enterprise customers) was folded into Cognition's revenue base, converting a direct competitor into a distribution channel. The integration is not yet complete, and retention of Windsurf's enterprise customers under the Cognition brand is a key execution risk. The acquisition also demonstrated Cognition's willingness to pursue inorganic growth to defend against IDE-layer encroachment—a signal that the competitive pressure from Cursor in the IDE tier was viewed as existential to distribution strategy. [CP005, CP006, CP007, CP008, CP018, CP020]

3.4 Autonomous Agent Competitors — Claude Code, OpenAI Codex, and SWE-agent

Claude Code (Anthropic) is arguably Cognition's most direct quality-based competitor in the autonomous agent tier. Claude Code operates as a terminal-native agentic coding assistant layered on top of Anthropic's Claude 3.7 Sonnet and Claude 4 models, which have consistently ranked among the highest performers on SWE-bench Verified—an independent benchmark measuring an agent's ability to resolve real GitHub issues. Devin's initial SWE-bench score of 13.86% (March 2024) was a breakthrough at the time, but subsequent model improvements from Anthropic pushed Claude-based agents above 50%, raising questions about Devin's benchmark differentiation. Claude Code's pricing ($10 per month on Pro, $100 per month on Max) undercuts or matches Devin's Core tier ($20 per month for 5 ACUs) on entry price, though it offers different economics for agentic-scale autonomous use. The key architectural gap is that Claude Code still requires a human operator to supervise at the terminal, whereas Devin is designed to run fully unattended in enterprise cloud environments—a meaningful operational distinction for large-scale automation. OpenAI re-launched Codex in 2025 as a web-based agentic coding environment backed by the o3 model family, positioning it as a direct Devin competitor. Codex Workspace allows users to assign high-level feature tasks and receive completed pull requests, mirroring Devin's core value proposition. Given OpenAI's $300 billion-plus valuation, API distribution, and ChatGPT developer ecosystem, Codex's competitive entry represents a fundamental threat to Cognition's differentiation if OpenAI prioritizes execution quality and aggressive pricing. SWE-agent (Princeton NLP Group) is an open-source research framework rather than a commercial product; it is significant primarily as a benchmark comparator and as a recruitment signal that world-class ML researchers are actively working on autonomous code execution frameworks outside commercial settings, which could accelerate open-source commoditization of the core technology. [CP012, CP013, CP014, CP015, CP016, CP017]

Moat durability / competitive risk register
Moat FactorCognition StrengthDurability (1-5)Primary ThreatThreat Source
Autonomous execution depthHigh – full unattended task cycles3Rapid capability parity from labsAnthropic, OpenAI
Enterprise integrationsMedium – Goldman, Citi, Nubank proofs4Hyperscaler bundling incentivesMicrosoft, Amazon
Windsurf IDE distributionMedium – 350+ enterprise clients3Cursor IDE market dominanceCursor (Anysphere)
Founder talent + networkHigh – 3x IOI gold medalists, ex-Scale/DeepMind4Talent poaching by frontier labsOpenAI, Google, Anthropic
Training data flywheelMedium – speculative, unverified2Larger model labs have more training dataOpenAI, Google
Benchmark leadershipLow – lost within months of launch1All major competitors surpassed SWE-bench scoreAll peers
Pricing competitivenessLow – most expensive per-task2Commoditization pressure from free/cheap tiersClaude Code, Copilot

Durability scores are qualitative analyst assessments; scale 1 (weakest) to 5 (strongest).

[CP026, CP027, CP028, CP032, CP034]

3.5 Moat Assessment and Competitive Dynamics

Cognition's defensible competitive advantages break down into three categories: execution depth, enterprise data flywheel, and distribution via Windsurf. Execution depth—the ability to complete multi-hour, multi-file agentic coding sessions with planning, sandboxed execution, testing, and PR creation—remains ahead of most competitors in verified enterprise deployments (Goldman Sachs 12K developer pilot, Nubank 12x efficiency claim). The enterprise data flywheel from each completed task creates training signal for model fine-tuning, which potentially compounds over time—but this advantage is speculative until Cognition's model quality can be independently verified versus frontier labs. Windsurf integration adds IDE-layer distribution that Devin alone could not have quickly organically scaled. Counterarguments to moat durability are significant. Cursor is growing faster than Cognition on pure ARR metrics and has more paying customers. Frontier model labs (Anthropic, OpenAI) control the underlying reasoning capabilities and can deploy equivalent agentic pipelines without licensing dependencies—giving them structural cost advantages. Benchmark credibility questions persist: Cognition's March 2024 Devin demo was partially disputed, and independent evaluations of SWE-bench performance showed other tools surpassing Devin within months of its launch. The competitive window for pure agentic differentiation is narrowing, making Cognition's enterprise go-to-market execution and customer retention metrics the true leading indicators of sustainable competitive position. Cognition's compliance posture relative to enterprise-grade competitors like Amazon Q Developer (SOC 2 Type II, VPC isolation) is another documented gap that could slow enterprise contract cycles for security-sensitive verticals. [CP026, CP028, CP031, CP035]

FP003: Moat / readiness KPIs
[CP026, CP028, CP035]
Chapter 04

04Financials

4.1 Revenue Model and ARR Trajectory

Cognition AI operates a consumption-based SaaS revenue model anchored to Agent Compute Units (ACUs)—a proprietary token of compute capacity consumed per autonomous task. Individual developers subscribe at $20 per month (Core plan: 5 ACUs), teams at $500 per month (Team plan: 250 ACUs), and enterprises under custom contracts with volume discounts and additional ACU pools. The pricing ladder is designed to monetize proportionally to task complexity and volume, making it structurally different from seat-based tools like GitHub Copilot or Cursor. However, this model creates lumpy revenue: enterprise customers with variable sprint workloads may have uneven monthly consumption patterns, requiring careful cohort-level analysis to distinguish durable ARR from one-time expansion. Cognition's ARR trajectory has been one of the most rapid in enterprise AI history. The company exited the first six months of operations with negligible revenue, crossed $1M ARR in September 2024, reached $73M ARR by June 2025—a 73x increase in nine months—and then jumped to approximately $155M combined ARR following the July 2025 Windsurf acquisition ($82M ARR contributed). The Cognition AI blog post from September 2024 ("Funding, Growth, and the Next Frontier of AI Coding Agents") confirmed the $73M ARR figure and outlined the company's two-product strategy. Post-Windsurf, the combined entity has a bifurcated revenue base: Devin's enterprise automation revenue (high ACV, low seat count) and Windsurf's IDE subscription revenue (lower ACV, higher seat count). Separating and growing both lines while integrating the teams is a material execution risk. As of April 2026, reports suggest the company is in discussions for additional financing at a potential $25B+ valuation, implying investors believe ARR will continue growing materially above $155M. However, post-acquisition ARR disclosed figures should be interpreted cautiously: the $82M Windsurf ARR contribution was measured at time of acquisition and may not reflect post-migration retention rates.

Revenue streams table
Revenue StreamProductModelACV RangeLaunchedEst. Contribution to ARR
Individual subscriptionsDevin CoreUsage-based, $20/mo (5 ACUs)$240/yr2024~$5M est.
Team subscriptionsDevin TeamsUsage-based, $500/mo (250 ACUs)$6,000/yr2024~$25M est.
Enterprise contractsDevin EnterpriseCustom, VPC deployment$50K–$1M+ /yr2024~$43M est.
IDE subscriptions (Windsurf)Windsurf Free/ProSeat-based, $0–$15/mo~$120/yr paid seatPre-acq. (2024)~$30M est.
IDE enterprise (Windsurf)Windsurf EnterpriseCustom enterprise contracts$50K–$500K/yrPre-acq. (2024)~$52M est.
Usage overagesDevin all plansPer-ACU metered, ~$2.25/ACUVaries2024Not disclosed

ARR contribution estimates are analyst approximations; Cognition does not disclose per-stream revenue breakdown.

[CI001, CI002, CI003, CI008]
FI001: Revenue model bridge
[CI003, CI004, CI007]

4.2 Funding History and Capital Structure

Cognition AI has raised approximately $696 million in venture capital across three primary rounds in under 18 months, making it one of the fastest-funded AI infrastructure companies on record. The seed/Series A in March 2024 raised approximately $21 million at a $350 million pre-money valuation from Founders Fund, signaling early conviction in the founding team before any product revenue. In April 2024, just weeks after launch, Cognition closed $175 million at a $2 billion post-money valuation—again led by Founders Fund—crossing unicorn status within six months of founding. The September 2025 Series B raised $400 million at a $10.2 billion post-money valuation, with co-investors including Lux Capital, 8VC, Elad Gil, Bain Capital Ventures, D1 Capital, Definition Capital, and Swish Ventures. VentureBeat and TechCrunch confirmed this round in September 2025. The company disclosed a remarkably low net cash burn of under $20 million from founding through Q3 2025—extraordinary given the fundraising scale—suggesting the bulk of capital was held in reserve rather than spent on headcount or infrastructure at a rate typical for similar-stage companies. The Windsurf acquisition in July 2025 was estimated at approximately $250 million, funded from existing capital reserves without a new primary raise—a sign of financial discipline given the scale of the transaction relative to employee count. Post-acquisition headcount reportedly reached approximately 249 before layoffs of 30 employees and the departure of Windsurf personnel who accepted the 9-month buyout offer.

Pricing / monetization table
PlanProductPriceACU AllowanceTarget SegmentOverage Rate
CoreDevin$20/month5 ACUsIndividual developers~$2.25/ACU
TeamDevin$500/month250 ACUsEngineering teams~$2.25/ACU
EnterpriseDevinCustomCustom poolEnterprise orgsNegotiated
FreeWindsurf IDE$0/monthLimited AI requestsHobbyist/studentN/A
ProWindsurf IDE~$15/monthStandard AI creditsIndividual devsVaries
EnterpriseWindsurf IDECustomCustomEnterprise orgsNegotiated

Windsurf pricing as of acquisition July 2025; subject to change post-integration with Cognition pricing structure.

[CI001, CI002, CI005]
FI002: Unit economics bridge
[CI006, CI035, CI011]

4.3 Unit Economics and Margin Structure

Cognition's unit economics are difficult to model precisely because the company does not disclose gross margins, customer acquisition cost (CAC), or cohort-level retention data. However, structural inferences are possible. The ACU model implies a cost-of-revenue that is partially variable (inference compute per task) and partially fixed (model hosting, sandbox environments, CI/CD infrastructure). Enterprise AI inference at frontier model scale (likely GPT-4 or Claude API calls per task) can cost $5–$50 per hour of agent runtime at current cloud spot prices, while enterprise ACU packages price at roughly $2.25 per ACU with 250 ACUs per Team plan month—creating a gross margin that depends heavily on task length and model call efficiency. High-value enterprise contracts (Goldman Sachs, Citi, Nubank) likely have custom pricing with volume discounts that could improve margin at scale if infrastructure costs are amortized across large task volumes. The Nubank case study reporting a 12x efficiency improvement in code migration workflows suggests that customers perceive strong value relative to developer hours saved—a favorable indicator for pricing power. However, the inherent challenge for fully autonomous coding agents is that task complexity variance is high: simple bug fixes cost little to execute but complex multi-week feature builds can consume compute budget disproportionately. Unless Cognition gates task scope at the enterprise level, margin management at scale will require sophisticated compute forecasting. The company's capital efficiency pre-acquisition (sub-$20M burn on $350M+ raised before revenue scale) contrasts sharply with the likely post-acquisition cost structure. Adding 250 employees (Windsurf personnel) to a ~49-person team more than quadruples headcount, compressing per-employee revenue significantly in the near term unless post-integration synergies materialize quickly.

Unit economics table
MetricEstimateBasisConfidenceCaveat
Gross margin (est.)50–70%Inferred from compute cost vs. ACU pricinglowNot disclosed; highly variable by task complexity
Inference cost per ACU (est.)$0.50–$1.50Market rates for frontier model inferencelowDepends on model mix and task length
Customer ACV (enterprise)$50K–$1M+Industry comp, customer case studiesmediumRange is wide; no primary disclosure
Net revenue retention (est.)100–130%Inferred from ARR growth trajectorylowNo NRR data publicly disclosed
CAC (est.)Not disclosedN/AunknownNo marketing spend or sales data published
ARR per employee (pre-Windsurf)~$1.5M~49 employees, $73M ARR as of June 2025mediumHeadcount estimate is approximate
ARR per employee (post-Windsurf)~$623K~249 employees, $155M ARR as of July 2025mediumIncludes all acquired Windsurf staff before departures

All unit economics are analyst estimates; no primary financial disclosure exists. Treat all values as directional only.

[CI006, CI009, CI010, CI011]
FI003: Financial estimate range
[CI012, CI013, CI014, CI016]

4.4 Public Financial Data Gaps and Disclosure Limitations

Cognition AI is a privately held Delaware C-corporation with no public filing obligations, no disclosed GAAP financial statements, and no investor day or earnings calls. All financial data in this chapter relies on: (1) company-authored blog posts; (2) third-party news reports citing unnamed sources; (3) investor communications and press release summaries; and (4) secondary market research estimates. This disclosure posture is typical for pre-IPO AI companies but creates material risk for investors and customers attempting to verify sustainability of the ARR trajectory. The most significant undisclosed financial metrics are: gross margin by product line (Devin vs. Windsurf), customer churn and net revenue retention (NRR), CAC and payback period, enterprise deal sizes and contract lengths (ACV), and post-Windsurf integration costs. Without NRR data, it is impossible to distinguish whether the $73M to $155M ARR jump reflects organic customer expansion or purely inorganic acquisition contribution. Enterprise AI tool contracts in other companies typically show NRR of 100–140% when adoption is strong; if Cognition's NRR is below 100%, the underlying Devin-only ARR may be stagnating despite the Windsurf injection. Sacra, CB Insights, and other secondary research platforms provide partial revenue and headcount estimates that must be treated as approximations. A secondary concern is the reported $25B+ valuation discussions in early 2026, which would imply a forward revenue multiple of approximately 160x on $155M ARR. While extreme multiples are historically common for hypergrowth AI companies in their early stages, they are sustainable only if ARR continues growing at 2x+ annually—a bar that requires substantial new enterprise customer acquisition and retention of the Windsurf base. Any deceleration in ARR growth would make the valuation multiple compress significantly.

Capital adequacy table
RoundDateAmountValuation (post-money)Lead InvestorsConfirmed Source
Seed/Series AMar 2024$21M$350MFounders FundMultiple news reports
Series A extensionApr 2024$175M$2BFounders FundAxios, CNBC, multiple
Series BSep 2025$400M$10.2BLux Capital, 8VC, Elad Gil, BCV, D1TechCrunch, VentureBeat, CNBC
Total raised (through Series B)Sep 2025~$596M$10.2BMultipleSummed from above
Windsurf acquisitionJul 2025~$250M (est.)N/ACognition (acquirer)Widely reported estimate
Potential new round (rumored)2026Not disclosed$25B+ (reported)Not disclosedPress reports, not confirmed

Series B total excludes the Windsurf acquisition cost; combined capital deployed (raised + acquisition) is approximately $846M. Founders Fund IX Form D/A (SEC, Oct 2025) confirms $972M LP capital base.

[CI012, CI013, CI014, CI015, CI016, CI024]
FI004: Capital intensity / cash-flow map
[CI015, CI019, CI020]

4.5 Capital Adequacy and Financial Runway

With approximately $696M raised and sub-$20M cash burn disclosed through Q3 2025, Cognition had substantial financial runway heading into 2026 even before any new financing. The Windsurf acquisition consumed an estimated $250M of that reserve, leaving approximately $426M available minus ongoing operating expenses. At a conservative 200-employee burn rate of $400K per employee-year (blended fully-loaded cost), annual operating costs could be $80M or higher excluding compute. If the company is generating $155M ARR with moderate gross margins (estimated 50–70% given inference costs), gross profit could fund ongoing operations without immediate new capital. However, the $25B+ valuation discussions suggest the company is seeking capital to accelerate—not merely to sustain—operations. Expected uses of new capital include R&D for Devin next-generation models, enterprise sales force expansion, global data center capacity for inference at scale, and potential further acquisitions to consolidate the AI coding infrastructure market. The Founders Fund lead investor relationship provides additional runway optionality: Founders Fund has historically shown willingness to lead follow-on rounds for breakout portfolio companies, reducing financing risk. The presence of large institutional co-investors (D1 Capital, Bain Capital Ventures) signals access to crossover and late-stage capital. Overall capital adequacy is rated medium-high: the company has sufficient runway for at least 24 months at current burn estimates, but aggressive growth execution and integration costs from Windsurf could accelerate burn faster than current estimates suggest.

Public financial gaps table
Missing MetricWhy It MattersSeverityProxy Used
Gross margin by product lineDetermines profitability and long-term unit economicsHighIndustry comp: 50–70% est.
Net revenue retention (NRR)Indicates whether ARR is organically durableHighNot available; assume 100–130%
Customer churn rateValidates stickiness of enterprise contractsHighNot disclosed; inferred from case studies
CAC and payback periodDetermines GTM efficiency at scaleMediumNot available
GAAP revenue vs. ARRDeferred revenue may differ from ARRMediumARR used as proxy
Windsurf post-acquisition retentionCritical for combined ARR durabilityHighNot yet disclosed

Gaps listed represent the most decision-relevant undisclosed financial data as of May 2026 research date.

[CI017, CI018]
Chapter 05

05Product & Technology

5.1 Product Definition and Customer Workflow

Cognition AI offers two core products: Devin, a cloud-hosted autonomous AI software engineer, and Windsurf, an agentic IDE for local and cloud-assisted development. Devin accepts natural-language task descriptions—submitted via the web dashboard, Slack, Linear, or Jira—and executes full end-to-end software engineering tasks autonomously: planning the approach, writing and modifying code, running tests, iterating on failures, and opening pull requests with reviewable diffs and confidence indicators. Unlike code-completion tools (GitHub Copilot, Cursor), Devin takes ownership of entire tasks rather than assisting a human in writing individual lines. In a typical enterprise workflow, a developer or engineering manager assigns a Jira ticket or Slack message to Devin. Devin reads the ticket, explores the repository context using its DeepWiki server, sets up the necessary environment, writes the implementation, runs the test suite, and reports back with a pull request. The PR includes Devin's internal trace of decisions for human review. The Devin 2.0 release added a confidence meter that quantifies the probability of task success before committing, allowing teams to triage tasks efficiently. Devin can handle tasks concurrently across multiple sessions, enabling teams to delegate entire sprint backlogs rather than individual tickets. Windsurf IDE complements Devin by providing a local development environment with AI Cascade agent integration. Developers can start a coding session locally in Windsurf and hand off complex, compute-intensive work to Devin in the cloud mid-session, with state preserved across the transition. The "Devin in Windsurf" feature announced in April 2026 formalizes this local-to-cloud handoff workflow. Together, the two products target the full developer lifecycle: Windsurf for daily coding assistance and Devin for delegated autonomous task execution.

Product/SKU map table
ProductDelivery ModeTarget UserKey FeaturesPricing TierLaunched
Devin FreeCloud agentIndividual devLimited ACUs, web dashboard, basic integrationsFreeApr 2026
Devin ProCloud agentIndividual devMore ACUs, API access, confidence meterPaid ($20/mo base)Apr 2025
Devin MaxCloud agentPower userHigh ACU pool, priority queuePaid (custom)Apr 2026
Devin TeamsCloud agentDev teamsShared ACU pool, team management, Slack/Jira integrationTeam pricing2025
Devin EnterpriseCloud or VPCEnterprise orgVPC deploy, custom-trained Devin, dedicated supportCustom contract2024
Windsurf IDE (Free/Pro)Desktop IDEIndividual devCascade AI agent, in-editor completions, CodemapsFree/$15/mo2024
Windsurf EnterpriseDesktop IDE + cloudEnterprise orgSSO, centralized billing, enterprise supportCustom contract2024
Devin APIREST APIDevOps/CIProgrammatic session management, CI/CD integrationIncluded in paid plans2025

Plan structure as of May 2026 research date; April 2026 plan refresh replaced legacy Core/Team plans with Free/Pro/Max/Teams/Enterprise structure.

[CE001, CE008, CE009, CE025, CE026]
FE001: Devin task lifecycle flow
[CE003, CE010, CE017, CE021]

5.2 Product and Technology Map

Cognition AI's product portfolio as of May 2026 comprises four distinct product units: (1) Devin Core/Pro/Max/Enterprise—cloud agent plans for individual, team, and enterprise customers; (2) Windsurf IDE—the agentic desktop IDE with Cascade AI agent, acquired from Codeium in July 2025; (3) the Devin API—a RESTful API for programmatic session creation and CI/CD pipeline automation; and (4) Windsurf Enterprise—the enterprise tier of the IDE with SSO, centralized billing, and advanced admin controls. A fifth element, Windsurf Codemaps, is an AI-annotated structured codebase map powered by SWE-1.5 and Claude Sonnet 4.5, enabling rapid onboarding and debugging by grounding navigation to exact lines and visual node graphs. The April 2026 plan revision retired the original Core and Team plans, replacing them with a five-tier structure: Free (limited ACUs), Pro, Max, Teams, and Enterprise. This signals a land-and-expand strategy: the Free tier provides entry-level exposure for individual developers and bootcamp students, while Pro and Max serve power users, and Teams/Enterprise capture organizational deployment. SWE-Check, introduced in collaboration with Applied Compute in April 2026, is a specialized bug detection model trained via reinforcement learning that matches Opus 4.6 on internal evals while running approximately 10x faster—positioned as a cost-efficient quality gate within the Devin agent loop. The company's in-house model, SWE-1.5, powers both SWE-Check and Windsurf Codemaps, representing Cognition's first proprietary model release beyond the Devin runtime ensemble. All products are delivered cloud-first, with VPC deployment available for enterprise customers requiring on-premise or isolated cloud infrastructure. The Devin for Terminal feature, released in 2026, enables developers to start agent sessions from their local terminal and escalate to the cloud when the task outgrows local resources, bridging the CLI and cloud deployment modes.

Integration ecosystem table
CategoryPlatformIntegration TypeCapabilityStatus
Source controlGitHubNativePR creation, repo access, issue trackingGA
Source controlGitLabNativeBranch management, MR creationGA
Source controlBitbucketNativePR creation, code contributionGA
Project managementJiraNativeTicket assignment, status syncGA
Project managementLinearNativeIssue tracking, sprint delegationGA
CommunicationSlackNativeSession initiation, progress updatesGA
CommunicationMicrosoft TeamsNativeSession initiation, notificationsGA
MonitoringSentry / Datadog / PagerDutyMCPAlert-triggered sessions, log contextMCP marketplace
DatabasesPostgreSQL / MySQL / MongoDBMCPRead/write access for data tasksMCP marketplace
DocumentationNotion / ConfluenceMCPContext retrieval for tasksMCP marketplace

MCP Marketplace integrations are third-party configured; availability depends on customer configuration. Native integrations are Cognition-maintained.

[CE005, CE006, CE007, CE018, CE030]
FE002: SWE-bench performance bar chart
[CE002, CE016, CE020, CE023]

5.3 Technical Architecture and Agent Runtime

Devin's core technical architecture centers on an agent runtime that manages long-horizon planning and multi-step execution within a sandboxed cloud environment. The agent receives a natural-language task specification and decomposes it into an ordered sequence of sub-steps: repository exploration, environment setup, implementation, test execution, and pull request creation. At each step, the agent calls specialized tools—a code editor, a Unix shell, and a headless browser— within an isolated compute container that prevents lateral access to other sessions or customer data. This sandboxed design separates Devin's execution context from the customer's production environment, with code changes surfaced only through pull requests that require human approval before merge. The DeepWiki server enables large-codebase comprehension by building vectorized project graphs that represent relationships between files, functions, and modules. This graph-based representation allows Devin to navigate million-line codebases more effectively than purely token-based context windows. BlockDiff snapshotting records incremental state checkpoints during task execution, enabling rapid rollback when a test fails or an approach is abandoned. These two features—DeepWiki and BlockDiff—are among Cognition's most differentiating technical capabilities, as they address the two primary failure modes of LLM-based coding agents: context loss in large repositories and unrecoverable error cascades. The proprietary SWE-1.5 model, released in 2025-2026, underlies SWE-Check and Windsurf Codemaps. It was trained via reinforcement learning specifically for software engineering tasks, contrasting with the general-purpose LLMs used as base layers in most competitor products. The agent runtime also supports Model Context Protocol (MCP), allowing Devin to connect to external tooling including Sentry, Datadog, PostgreSQL, MongoDB, Notion, and hundreds of other services through an extensible plugin marketplace. ACUs (Agent Compute Units) are the metered token of computation consumed per task, priced at approximately $2.25 per ACU with plan-included allocations varying by tier.

SWE-bench performance comparison table
Agent / ModelBenchmark VersionScore (% Resolved)Evaluation ModeDateSource
Devin (Cognition AI)Full (25% subset)13.86%UnassistedMar 2024Cognition AI blog
Prior SOTA (best LLM)Full1.96%UnassistedMar 2024SWE-bench leaderboard
Best assisted LLMFull4.80%Assisted (files given)Mar 2024SWE-bench leaderboard
SWE-agent (open source)Lite12.47%UnassistedMar 2024SWE-bench / Princeton
Claude Code (Opus 4)Verified (500)72.5%Unassisted2025Anthropic
mini-SWE-agentVerified (500)65%UnassistedJul 2025SWE-bench update
Devin PR acceptance trendAIDev dataset (7,156 PRs)+0.77%/weekReal-world PRs32 weeksArxiv 2026

Different benchmark versions (Full vs. Verified vs. Lite) are not directly comparable. Cognition's 13.86% used a 25% random subset of the Full benchmark. Claude Code's 72.5% uses the 500-instance Verified subset.

[CE002, CE020, CE023, CE024, CE035]
FE003: Product capability KPIs
[CE002, CE004, CE014, CE016, CE026]

5.4 Deployment, Integration, and Reliability

Devin supports two primary deployment modes: cloud-hosted (the default, operating in Cognition-managed infrastructure) and VPC deployment for enterprise customers requiring data residency, network isolation, or regulatory compliance. VPC mode allows Devin to operate within the customer's AWS, GCP, or Azure environment, eliminating the need to transmit source code to Cognition's servers. Custom-trained Devin instances are available at the enterprise tier, allowing customers to fine-tune the agent on their proprietary codebases and internal conventions. The platform integrates natively with GitHub, GitLab, and Bitbucket for source control (PR creation, code review automation, branch management); with Jira and Linear for ticket assignment and status tracking; and with Slack and Microsoft Teams for communication-channel session initiation and progress reporting. The REST API enables programmatic session management for CI/CD pipelines, allowing engineering organizations to trigger Devin sessions automatically on specific events (e.g., failing test suite, customer bug report). The MCP Marketplace extends integration to monitoring platforms (Sentry, Datadog, PagerDuty), databases (PostgreSQL, MySQL, MongoDB), and documentation tools (Notion, Confluence). The Windsurf IDE integrates Cascade AI agent locally and the "Devin in Windsurf" handoff for cloud-based tasks. Reliability is not formally disclosed through uptime SLAs or status page data in Cognition's public materials. Task execution time is bounded by a 45-minute maximum runtime per session, though tasks can be broken into sequential sub-sessions for longer projects. The confidence meter in Devin 2.0 provides a pre-execution estimate of task success probability, enabling teams to filter out low-probability tasks before consuming ACUs. These quality controls reduce waste but also signal that Devin's reliability on complex tasks remains probabilistic rather than guaranteed.

Technical architecture components table
ComponentFunctionInputsOutputsProprietary?
Agent runtimeTask decomposition, multi-step planningNatural language task specOrdered sub-task planYes
Sandboxed environmentIsolated compute container per sessionAgent commandsCode edits, shell output, PR diffYes (cloud-hosted)
DeepWiki serverVectorized codebase graph for contextRepository filesGraph-indexed code mapYes
BlockDiff snapshotsIncremental state checkpoints for rollbackSession state at each stepCheckpoint index, rollback targetYes
SWE-1.5 modelBug detection (SWE-Check) + CodemapsCode snippets, repository graphsBug likelihood score, annotated mapYes
MCP layerExternal tool connectivityMCP-compliant tool specsLive tool data in agent contextOpen protocol
Devin APIProgrammatic session managementHTTP requests with task specSession ID, status, PR URLYes (REST)

Proprietary components are Cognition-developed and not open-sourced; MCP is an open protocol developed by Anthropic and adopted by Cognition.

[CE003, CE013, CE021, CE032, CE033]
FE004: Technical architecture layers flow
[CE003, CE008, CE013, CE032, CE033]

5.5 Technology Differentiation and Competitive Moats

Cognition AI's primary technical differentiator is the end-to-end autonomous task ownership model: Devin does not suggest code for a human to accept or reject, but independently plans, executes, and delivers a complete task artifact. This positions Devin against code-completion tools (GitHub Copilot, Cursor, Claude Code) rather than as a marginal improvement over them. The SWE-bench Full score of 13.86% set a new state-of-the-art for unassisted autonomous agents in March 2024, exceeding the prior best of 1.96% by 7x and exceeding the prior assisted SOTA of 4.80% by 3x. Devin's PR acceptance rate showed a consistent positive trend of +0.77% per week over 32 weeks, the only agent in a comparative 7,156-PR study to exhibit sustained improvement (Arxiv, 2026). The in-house SWE-1.5 model and SWE-Check tool represent Cognition's nascent vertical integration in model training. Most competitor AI coding agents (GitHub Copilot, Cursor, Windsurf pre-acquisition) are built on top of third-party foundation models (GPT-4, Claude, Gemini), giving those providers structural leverage over tool vendors. By training SWE-1.5 for software engineering tasks specifically, Cognition is building a model layer that is not dependent on API pricing from OpenAI or Anthropic for its most specialized use cases. The Windsurf acquisition added significant IP: the Cascade agentic workflow engine, the Windsurf IDE user base, and 350+ enterprise accounts, accelerating the product suite without requiring multi-year IDE development. Data flywheel advantages are present but not publicly quantified. Each Devin task generates execution traces that can be used to fine-tune future model versions. As enterprise customers use custom-trained Devin instances on their proprietary codebases, Cognition accumulates task-specific performance data that competitors without the same enterprise relationship cannot easily replicate. The DeepWiki codebase graph representation and BlockDiff snapshotting are proprietary engineering innovations that provide functional advantages in large-codebase contexts where pure-attention LLMs lose coherence.

Trust, security, and compliance table
ControlTypeStatusDetailsDisclosed?
Session sandbox isolationArchitectureConfirmedEach session runs in an ephemeral containerYes (official docs)
PR-gated code changesProcessConfirmedAll code changes require human approval before mergeYes
VPC deploymentArchitectureAvailable (Enterprise)Customer-controlled cloud perimeterYes
Execution trace loggingAuditConfirmedFull log of shell commands, file edits, API callsYes
SOC 2 Type IICertificationNot disclosedNo public attestation as of May 2026No
ISO 27001CertificationNot disclosedNo public attestationNo
Prompt-injection defense (patched)SecurityPatched Dec 2024Critical vuln found live-streamed; Cognition patchedAdverse
Data retention policyPrivacyNot disclosedCode submitted to cloud sessions; retention unclearNo

Absence of disclosed SOC 2 does not mean the certification does not exist; it may exist under NDA for enterprise customers. Cognition's trust.devin.ai page returned limited information at research date.

[CE015, CE028, CE029, CE031]

5.6 Trust, Safety, Security, and Compliance

Devin's primary security architecture relies on sandboxed compute isolation: each session runs in an ephemeral container with no access to other sessions or the customer's production environment unless explicitly granted through integration credentials. Code changes are surfaced as pull requests requiring human approval, and the Devin execution trace provides an auditable log of every shell command, file edit, and API call made during the session. VPC deployment adds an additional isolation layer by keeping source code within the customer's own cloud perimeter. A significant security incident emerged in late 2024 when a live-streamed demonstration exposed a major vulnerability in Devin's system prompt handling—effectively allowing prompt-injection attacks to manipulate Devin's behavior. Cognition acknowledged the issue and patched it rapidly, but the incident highlighted the inherent risks of deploying autonomous agents that can execute shell commands based on text inputs. Hacker News commentary characterized the failure as "amateurish given the severity," noting that prompt-injection defenses should have been a first-priority security control for an agent with shell execution capabilities. Cognition has not publicly disclosed SOC 2 Type II certification, ISO 27001, or equivalent enterprise security certifications as of May 2026. The trust.devin.ai subdomain exists but returned limited public information at time of research. Enterprise customers relying on VPC deployment and custom-trained models are likely subject to separate security agreements, but these are not publicly auditable. Data privacy and retention policies for code executed in Devin sessions are not fully disclosed in Cognition's public-facing documentation, representing a material compliance gap for financial-services and regulated-industry customers. The Mercedes-Benz and Goldman Sachs partnerships suggest compliance barriers have been addressed in custom agreements, but without public disclosure, independent assessment is not possible.

Chapter 06

06Customers

6.1 Customer Base Segmentation

Cognition AI targets professional engineering teams at technology-forward enterprises, mid-market companies, and developer-first organizations. The buyer profile is an engineering leader or CTO seeking to multiply developer throughput without proportional headcount growth; the user is typically an individual software engineer or engineering manager who delegates tasks directly via Slack, Jira, or the Devin web interface. The payer is the engineering budget owner, whether a department budget or a centralized IT procurement function at large enterprises. Verticals with confirmed production deployments include financial services and fintech (Nubank—LatAm's largest neobank), automotive/manufacturing (Mercedes-Benz), IT services and outsourcing (Cognizant), enterprise technology, and the US Federal Government. The COBOL modernization blog post confirmed Fortune 500 deployments in sectors that still run significant COBOL workloads: financial services, insurance, and public utilities. In 2026, Cognition launched a Government vertical to target legacy software modernization in US federal agencies. Japan and Singapore enterprise markets were opened in April 2026, suggesting Southeast Asia financial services, manufacturing, and technology firms are the next target segments. The geographic distribution is US-first (the majority of revenue and named customers as of early 2026), with Europe (London office opened January 2026) and APAC (Japan and Singapore offices opened April 2026) serving as growth centers. Channel distribution includes direct enterprise sales, the Windsurf IDE free-to-paid conversion funnel (250K+ daily active users at acquisition), and the Cognizant reseller partnership that deploys Devin and Windsurf across Cognizant's own engineering teams and its global client base. The Windsurf integration creates a bottom-up, developer-led adoption vector in addition to top-down enterprise procurement.

Customer segmentation table
SegmentBuyer TypeUser TypeUse CaseScale / ValueEvidence Level
LatAm Fintech EnterpriseCTO / Eng LeaderSWE / EMLegacy ETL migration, code modernization100M users, 6M+ LoC codebaseNamed (Nubank), quantified
Global Automotive / ManufacturingCTO / VP EngSWE teamsLegacy modernization, cloud-native dev, logisticsGlobal enterpriseNamed (Mercedes-Benz), announced
IT Services / OutsourcingIT Exec / ChannelSWE teams + client engineeringDeploy Devin to own and client engineering teams300+ client organizationsNamed (Cognizant), partner
Fortune 500 COBOL LegacyVP Eng / CIOLegacy SWECOBOL modernization to modern stacksMulti-decade COBOL systemsUnnamed, blog-disclosed
US Federal GovernmentAgency IT execAgency SWECritical infrastructure modernizationFederal agency scaleVertical launched (Feb 2026), no named agencies
APAC EnterpriseCTO / Eng LeaderSWE teamsSoftware production across SE Asia, JapanLarge-enterpriseMarket opened Apr 2026, no named customers
Individual Developer (SMB/indie)SelfSelfCoding automation, personal projectsLow value, high volumeSelf-serve Free/Pro tier

Evidence level reflects public disclosure depth. Mercedes-Benz, Cognizant, and COBOL Fortune 500 are at announcement stage without published outcome metrics.

[CU001, CU002, CU003, CU004, CU005]
FU001: ARR growth trajectory bar chart
[CU006, CU007, CU023]

6.2 Adoption Trajectory and Growth Metrics

Cognition AI's most striking adoption signal is the ARR growth trajectory: from approximately $1M ARR at general availability (December 2024) to approximately $73M ARR by April 2025—a roughly 73x increase in four months. This rate of growth is consistent with Sacra's estimate of ~$15M ARR in October 2024 and the Growjo platform's current estimate of $73M, though both figures are third-party estimates rather than audited or company-disclosed financials. The April 2025 pricing reset (Core plan from $500/month to $20/month base) accelerated individual developer adoption while maintaining enterprise revenue through volume. Internal usage signals are revealing. By February 2026, Cognition's own engineering team was merging 659 Devin PRs per week—four times the 154 per week achieved at their best week in 2025. This "dog-fooding" signal is significant: the team building Devin is also its most intensive user, providing a high-quality feedback loop that likely accelerates iteration velocity. The PR merge rate improved from 34% at Devin's launch to 67% by April 2025—a proxy for output quality improvement—though the metric is self-reported and denominator methodology is not specified. Geographic expansion (London, Tokyo, Singapore) in the first half of 2026 suggests management confidence in revenue sufficiency to support multi-office overhead. The Cognizant partnership is particularly significant as a channel amplifier: Cognizant's global client base spans 300+ organizations across financial services, manufacturing, and healthcare, meaning even partial penetration could materially accelerate Devin's enterprise account count without proportional Cognition sales headcount. However, no public data exists on the number of active paying accounts, the average contract value, or the customer growth rate beyond the ARR estimates.

Customer growth and adoption trajectory table
MetricValueDateSourceConfidenceImplication
ARR at GA launch~$1MDec 2024Third-party estimate (Sacra/imseankim)LowEarly adopter only; pricing was $500/month
ARR after pricing reset~$73MApr 2025Third-party estimate (Growjo / imseankim)Low-medium70x+ growth in ~4 months with price cut to $20/month
ARR per Growjo (2025)$73M estimatedSep 2025Growjo revenue estimateLowThird-party model; not company-disclosed
PR merge rate at launch34%Mar 2024Company-claimed (imseankim/VentureBeat)Medium2 in 3 initial PRs were rejected; low initial quality
PR merge rate Apr 202567%Apr 2025Company-claimed (imseankim)MediumQuality doubled in ~1 year; still 33% rejection
Cognition's own Devin PRs/week659Feb 2026Company blog (official)HighInternal dog-fooding at high intensity; 4x growth from best 2025 week
Windsurf DAU at acquisition250,000+Jul 2025Company-claimed (Cognition blog)HighLarge Windsurf user base as acquisition conversion funnel
Employee count growth~102% YoY2025Growjo estimateLowFast headcount growth consistent with revenue acceleration

ARR figures are third-party estimates, not company-disclosed. PR merge rate metrics are company-claimed. Missing denominator: total account count and ACV breakdown unknown.

[CU006, CU007, CU008, CU009, CU010]
FU002: Key customer and adoption KPIs
[CU006, CU007, CU008, CU009, CU010, CU012]

6.3 Named Customer Proof and Reference Quality

The highest-quality named customer proof is the Nubank case study, the only publicly detailed and quantified production deployment. Nubank, Latin America's largest neobank (approximately 100 million customers), used Devin to migrate a 6-million-line ETL monolith to sub-modules—a task originally estimated to require 1,000+ engineers over 18 months. With Devin, the migration progressed in weeks for each business unit (Data, Collections, Risk), achieving 12x engineering efficiency improvement in hours saved and 20x cost savings versus the all-human baseline. Crucially, the case study documents fine-tuning on the customer's specific migration patterns, which led to a 4x speed improvement (40 minutes per task to 10 minutes) and a 2x accuracy improvement on the internal benchmark. Mercedes-Benz is named in a blog post (April 27, 2026) as deploying Devin and Windsurf "across its global engineering organization" for legacy modernization, cloud-native development, and logistics. No outcome metrics are publicly disclosed; the announcement is at the partnership/announcement stage rather than a post-hoc case study. Cognizant (announced January 28, 2026) is deploying Devin and Windsurf "across its engineering organization and global clients"—this is a channel partnership rather than a named end-customer, and no production outcome data is available. Additional unnamed customer evidence: Fortune 500 COBOL modernization deployments (blog post April 8, 2026, no named customers), US Government (Cognition for Government launched February 25, 2026, no named agencies), Japan enterprise (launched April 9, 2026, no specific customers named). The depth and freshness of proof varies enormously: Nubank is production with quantified outcomes; others are at announcement stage. This creates a concentration of evidence quality risk—if Nubank is not representative of typical deployments, the customer proof base is shallow.

Named customer proof table
CustomerSegmentUse CaseProduction vs PilotOutcomeLimitation
Nubank (LatAm neobank)FintechETL monolith migration (6M LoC, 100K data classes)Production12x efficiency gain, 20x cost savings; weeks vs monthsFine-tuning required; Cognition-published case study
Mercedes-BenzAutomotiveLegacy modernization, cloud-native dev, logisticsAnnounced (Apr 2026)Not disclosedAnnouncement only; no outcome metrics
CognizantIT Services / ChannelDeploy Devin + Windsurf across own org + client basePartnership (Jan 2026)Not disclosedChannel partnership; no end-customer or outcome data
Fortune 500 COBOL (unnamed)Multiple legacy verticalsCOBOL modernization to modern stacksProduction (implied)Not quantified; blog describes ongoing deploymentsUnnamed; blog-only disclosure
Cognition AI itselfAI SaaS / internalDevin builds Devin; 659 PRs/week mergedProduction (internal)4x growth in weekly Devin PRs (2025→2026)Self-referential; no independent verification
US Government (agency unnamed)Federal governmentCritical infrastructure software modernizationVertical launchedNot disclosedNo named agencies; FedRAMP status unknown

Only Nubank has quantified production outcomes in a published case study. All other named relationships are at announcement stage or are channel partnerships without end-customer disclosure.

[CU003, CU004, CU005, CU011, CU012]
FU003: Customer acquisition and expansion flow
[CU017, CU018, CU019, CU021]

6.4 Retention, Satisfaction, and Durability

Publicly available retention and satisfaction metrics for Cognition AI are limited. The company has not disclosed Net Revenue Retention (NRR), Gross Revenue Retention (GRR), customer churn rate, or cohort analysis data. The most proximate public indicator of satisfaction is the PR merge rate (67% as of April 2025, up from 34% at launch), which measures the share of Devin-opened pull requests that are accepted by human reviewers—a reasonable proxy for output quality but not a standard customer satisfaction metric. A 67% PR acceptance rate implies that one-third of Devin's work products require rejection or significant rework, which may be acceptable for high-volume autonomous tasks but is a notable quality gap versus human engineers. Developer sentiment from independent sources is mixed. Hacker News communities have noted that early Devin demos appeared polished but real-world performance on complex tasks lagged expectations set by the original 13.86% SWE-bench announcement. The independent reviewer at imseankim.com—after waiting six weeks to observe the product in production—noted that "independent testers tell a more complicated story" relative to Cognition's official performance numbers, and flagged that ACU costs accumulate quickly in practice, with moderate usage easily exceeding $100-200 per month on the nominally $20/month Core plan. The security vulnerability disclosed in December 2024 temporarily damaged trust among security-conscious enterprise customers. On durability signals, the Nubank deployment describes continued and expanding use (data, collections, and risk business units progressively adopting Devin). Internal Cognition usage of Devin (659 PRs/week merged in Feb 2026) suggests strong product-led retention. The Cognizant and Mercedes-Benz partnerships represent structural switching costs once integrated into enterprise workflows. However, the contract length, renewal rates, and post-deployment satisfaction at named enterprise customers are not publicly known.

Retention, repeat usage, and satisfaction table
MetricValueSegmentConfidenceDiligence Ask
NRR (Net Revenue Retention)Not disclosedAll segmentsUnknownRequest from Cognition: NRR by cohort and ACV band
GRR (Gross Revenue Retention)Not disclosedAll segmentsUnknownRequest from Cognition: annual contract renewal rate
Churn rateNot disclosedAll segmentsUnknownRequest: monthly and annual subscriber churn by plan tier
PR merge rate67% (Apr 2025)All Devin usersMedium (company-claimed)Verify with independent PR audit or customer interviews
Developer satisfaction (HN)Mixed—some enthusiastic, some disappointedDeveloper communityLowCommission independent satisfaction survey of enterprise users
Nubank expansion depth3+ business units using DevinFintech enterpriseHigh (case study)Request: duration of Nubank engagement, current ACU spend
Cognition internal Devin usage659 PRs/week (Feb 2026)AI SaaS (internal)High (blog)Cross-check with Git stats; confirm PR type distribution
Average task success rate~67% PR acceptance (proxy)GeneralLowDisaggregate by task type, language, and complexity band

Retention metrics are not publicly disclosed. PR merge rate is the only available public quality proxy, and it is company-reported without denominator methodology. All retention figures require direct diligence requests.

[CU013, CU014, CU015, CU016]
FU004: Customer segmentation pyramid
[CU001, CU002, CU004, CU005, CU022]

6.5 Expansion and Concentration Risk

Cognition AI's land-and-expand strategy is evident: the Nubank deployment started with one business unit's ETL migration, then expanded to the Data, Collections, and Risk units. The Cognizant partnership implies a similar structure—Cognizant begins with internal deployment and then upsells to its global client base. The Free and Pro plan tiers are designed for developer-led adoption: individual developers experiment at low cost, drive internal adoption, and create bottom-up enterprise procurement pressure. The Windsurf IDE's 250,000+ daily active users provide a large organic install base from which to convert to paid Devin usage. Concentration risks are significant and not publicly disclosed. With ARR of ~$73M and a small set of confirmed named accounts, there is a material risk that a handful of large enterprise contracts (Nubank-scale, Mercedes-Benz-scale) constitute a dominant share of revenue. If the Cognizant channel partnership is performing well, it may itself represent disproportionate revenue concentration through a single reseller rather than diversified direct customer relationships. The COBOL modernization vertical targets high-value projects (multi-year legacy systems) that are structurally one-time engagements unless renegotiated as long-term support contracts, creating potential revenue cliff risk after initial migrations complete. Procurement friction at regulated industries is high. Financial services, healthcare, and government customers require security certifications (SOC 2, FedRAMP) and data residency controls that Cognition has not publicly disclosed meeting. VPC deployment mitigates the data residency concern, but the absence of published compliance attestations limits the addressable market in the most regulated industry verticals. The Government vertical launch in February 2026 suggests Cognition is actively addressing FedRAMP-equivalent certification requirements, but timelines and current status are not public. This compliance gap represents the primary enterprise procurement barrier for Cognition's target large-enterprise segment.

Expansion and concentration risk table
Driver / RiskTypeDescriptionImpactDiligence Path
Land-and-expand (use case depth)Expansion driverNubank started with 1 business unit, expanded to 3+; Cognizant starts internal then sells to clientsHighMap expansion timeline and ACU growth per account
Windsurf to Devin conversionExpansion driver250K+ daily Windsurf users as conversion funnel for cloud agent upsellMediumTrack conversion rate from Windsurf Free to paid Devin sessions
COBOL modernization (project-based)Concentration riskHigh-value one-time migrations; no structural recurring need after completionHighAssess whether COBOL accounts contract for ongoing support or leave post-migration
Top-customer concentration (unknown)Concentration riskWith ~$73M ARR and few named accounts, one Nubank-scale account ≈ material shareHighRequest top-10 customer concentration; ask for top-customer revenue share
Cognizant channel dependencyChannel riskSingle reseller could dominate new enterprise account acquisitionMediumUnderstand revenue share terms; assess lock-in vs. direct sales competitive dynamics
Compliance gap (SOC 2, FedRAMP)Procurement frictionRegulated industries cannot procure without certificationsHighConfirm SOC 2 Type II and FedRAMP In Process status; understand timeline
Developer fatigue / ACU cost sensitivityChurn riskPro/Core plans costly in practice ($100-200+/month for moderate use); developers may downgradeMediumRequest Pro-to-Free downgrade rate and ACU utilization distribution

Top-customer concentration is a critical diligence gap; no public data is available. COBOL project-based nature creates revenue cliff risk that requires probing.

[CU017, CU018, CU019, CU020]
Chapter 07

07Risks

7.1 Regulatory and Legal Risk Landscape

Cognition operates in a rapidly evolving regulatory environment. Three major frameworks create material near-term obligations. The EU AI Act (Regulation 2024/1689), effective August 2026 for GPAI model providers, imposes transparency documentation, copyright compliance policies, and EU AI Office registration. The UK GDPR—as administered by the ICO—applies to all data processing of UK/EU residents and requires Data Protection Impact Assessments for high-risk AI processing, which Devin's autonomous PR-merge and code-deployment capabilities likely trigger under Article 35. The January 2025 White House Executive Order revoked prior Biden-era AI safety obligations, creating a deregulatory US federal environment, but successor agency-level rules (particularly for government AI procurement) remain active. California SB 1047, which would have imposed frontier model accountability requirements, was vetoed in September 2024; successor legislation (AB 2013, SB 1235) is active through the 2025–2026 session. Separately, Doe v. GitHub, Microsoft, and OpenAI copyright litigation—consolidated in the Northern District of California—alleges AI code model training on open-source repositories constitutes copyright infringement; Cognition has not disclosed its model training data provenance, creating analogous latent IP exposure. Cognition's terms of service restrict output from being used to train competing models, but the upstream model training data copyright status is not independently verifiable. Compliance monitoring, cross-jurisdictional DPA coverage, and GPAI transparency documentation are each materially underfunded relative to Cognition's scale and enterprise customer base across the EU, Japan, and Singapore. Absent a dedicated regulatory affairs function, Cognition risks enforcement action, GPAI non-registration penalties, or restricted EU market access by Q3 2026. A proactive regulatory program should be treated as a business-critical investment at this stage of international expansion, not an optional legal overhead item.

Regulatory / legal risk register
Rule / CaseJurisdictionStatusLikelihoodSeverityMitigationResidual ExposureDiligence Path
EU AI Act GPAI provisions (Reg. 2024/1689)European UnionIn force Aug 2026MediumHighPublish GPAI transparency doc; register with EU AI OfficeMediumVerify GPAI registration; obtain AI transparency attestation
GDPR / UK GDPR (AI data processing)EU / UKActiveMediumHighDPA agreements; no training on Enterprise data (company-claimed)MediumAudit DPA coverage; verify ICO DPIA compliance for agentic processing
Doe v. GitHub / Microsoft / OpenAI (AI copyright)US N.D. Cal.Ongoing consolidated litigationMediumMediumToS restricts output for competing model training; training data provenance undisclosedMediumObtain legal opinion on Devin model training data provenance and OSS license exposure
CA SB 1047 successor bills (AB 2013)California, USAMonitoring 2025–2026LowMediumNo current binding obligation; SB 1047 vetoed Sep 2024LowMonitor quarterly; reassess if AB 2013 passes committee
US Federal AI EO — Jan 2025 (deregulatory)United StatesActiveLowLowBenefits from deregulatory stance; watch agency AI procurement rulesLowTrack OSTP AI Action Plan (due Jul 2025); monitor FedRAMP requirements for government deployments

Likelihood and severity ratings are qualitative diligence assessments. Residual exposure after mitigation reflects analyst judgment, not Cognition disclosures.

[CR001, CR002, CR003, CR004, CR005]
FR001: Risk Heatmap — Impact vs. Likelihood
[CR006, CR007, CR011, CR018, CR019, CR024]

7.2 Security and Technical Risk

Cognition's core product risk is that Devin operates with elevated system privileges— writing, executing, testing, and deploying code autonomously—which dramatically expands the attack surface compared to passive coding assistants. In December 2024, a live-streamed demonstration publicly exposed a prompt-injection vulnerability: an adversary could embed malicious instructions in a repository's README file, causing Devin to exfiltrate secrets, make unauthorized API calls, or plant backdoors in customer codebases. Cognition acknowledged and patched the issue. However, the incident revealed that prompt-injection defenses—top of the OWASP Top 10 for LLM Applications (LLM01)—were inadequately hardened prior to GA launch. Cognition's security page confirms SOC 2 Type II certification (audited March 2024), data encrypted in transit and at rest, and MFA for all employees. The Trust Center requires NDA execution before disclosing penetration test reports, audit scope, and third-party assessment results—limiting enterprise procurement teams' independent assessment of control effectiveness. Key unresolved gaps include: no published bug-bounty program; no formal Agent Security Framework; no public penetration test scope or red-team disclosure; and no published SLA or historical uptime data. Devin 2.0 (April 2025) expanded capabilities to merge PRs directly and schedule agents on customer infrastructure, materially increasing the blast radius of any future prompt-injection exploit. The OWASP GenAI Security Project identifies agent autonomy as an emerging, distinct risk category beyond classical LLM vulnerabilities. As enterprise adoption grows to VPC deployments at hundreds of companies, an undetected Devin exploit could simultaneously contaminate multiple enterprise codebases—a systemic supply-chain tail risk that individual customer security teams cannot independently mitigate.

Operational and security risk register
Failure ModeLikelihoodSeverityMitigation MaturityResidual ExposureUnresolved Gap
Prompt-injection / agent manipulation (OWASP LLM01)HighCriticalPartial — patched Dec 2024; no public bug-bounty or agentic security frameworkHighNo published bug-bounty; no independent red-team disclosure; no post-Devin-2.0 pen test summary
Supply-chain code contamination via autonomous PR mergeMediumCriticalPartial — SOC 2 Type II; code review recommended but not enforcedHighNo published penetration test scope; trust center details NDA-gated; blast radius expanded by Devin 2.0
SOC 2 Type II scope gap — no public report accessMediumHighPartial — SOC 2 Type II obtained Mar 2024; public report requires NDAMediumObtain NDA and review SOC 2 report scope, control descriptions, and exceptions
AI hallucination / insecure code generation in productionHighHighPartial — code review and branch protection recommended; not enforced by defaultMediumNo independent study of Devin-generated code vulnerability rate versus human-written code
Platform outage / ACU quota exhaustionLowMediumPartial — AWS-hosted; no published SLA or uptime history disclosedLowRequest SLA and historical uptime data; obtain enterprise credit terms for outages

Failure modes ordered by severity. Mitigation maturity reflects publicly available evidence; undisclosed internal controls may be more advanced.

[CR006, CR007, CR008, CR009, CR010]
FR002: Risk Transmission Map
[CR006, CR007, CR011, CR018, CR019, CR020]

7.3 Competitive and Market Risk

Cognition's most acute competitive risk is the rapid commoditization of agentic coding benchmarks that historically defined its leadership. At Devin's March 2024 launch, its SWE-bench Full score of 13.86% was groundbreaking. By mid-2025, Claude Code Opus 4 achieved 72.5% on SWE-bench Verified—a five-fold improvement in roughly 15 months. OpenAI Codex and GPT-5 achieve comparable Verified scores. This rate of improvement threatens Devin's technical differentiation: its multi-step planning, execution, and testing loop may be replicated by foundation model providers within 12–18 months. GitHub Copilot (250M+ installs, Microsoft distribution), Cursor ($500M+ ARR, $9B valuation, $40/month pricing), and Claude Code (Anthropic) compete for the same enterprise developer productivity budget as Devin ($500/month Team plan). The FTC has raised concerns that concentrated control of foundational AI inputs—including the models Devin depends on—could allow those providers to distort competition in downstream AI application markets, a structural risk for Cognition. Cognition's acquisition of Windsurf (July 2025) added 350+ enterprise accounts and 250K+ DAU, but also created integration risk: two separate AI agent codebases, model pipelines, billing systems, and sales motions must be merged simultaneously while maintaining ARR growth. Developer community skepticism at Devin's launch—focusing on benchmark reliability, closed demos, and misleading superiority claims—created an adverse reputation dynamic that increased adoption friction among technically sophisticated buyers and remains a headwind as enterprise evaluation rigor grows. Pricing compression is evident: Devin 2.0 reduced effective prices by approximately 3×; future competitive pressure may require further price reductions that compress revenue per session.

Partner and dependency risk register
DependencyCounterpartyRoleConcentrationFailure ScenarioSeverityResidual Exposure
Foundation LLM provider (primary)Anthropic / OpenAICore model intelligence for all Devin sessionsVery HighProvider raises API pricing 3–5× or restricts agentic use-case termsCriticalHigh — no disclosed multi-model fallback or proprietary model timeline
Cloud infrastructureAmazon Web ServicesVPC compute; enterprise deploymentHighAWS terms change for AI agents; pricing increase materiallyHighMedium — no public multi-cloud or on-premises fallback path
Developer workflow integrationsGitHub (Microsoft)PR review; code diff; CI integration; Copilot distributionHighMicrosoft restricts GitHub API access for AI competitors or modifies Copilot integration policyHighMedium — API-level integration; no exclusive arrangement
AI IDE platform (Windsurf, acquired Jul 2025)Internal (Cognition)Enterprise IDE distribution; 350+ enterprise accounts; 250K+ DAUHighIntegration failure; talent attrition post-acquisition; product cannibalization of Devin sessionsMediumMedium — controlled integration; leadership retained at acquisition

All counterparties listed have some degree of conflicting competitive interests. Dependency severity rated by revenue impact of failure or material price increase.

[CR011, CR012, CR013, CR014]
FR003: Critical Partner and Regulatory Dependency Map
[CR011, CR012, CR013, CR014, CR001, CR002]

7.4 Financial and Operational Risk

Cognition has raised approximately $1.575B in disclosed funding through mid-2026. With 222 employees (102% annual headcount growth, per Growjo) and a compute- intensive product, estimated monthly burn is $5–15M, implying 18–36 months of runway under conservative scenarios. Revenue growth from ~$1M ARR at GA (December 2024) to ~$73M ARR (April 2025 estimate) is exceptional but introduces customer concentration risk: the $73M ARR figure across a handful of named enterprise accounts suggests a small number of very large contracts that create material churn risk at renewal. Nubank accounts for the only disclosed quantified production case study. If Nubank or a comparably large customer churns—due to a security incident, budget reallocation, or competitive switch—the ARR decline could impair future fundraising at an assumed $10.2B valuation. ACU pricing models ($2.25 per ACU overage) create unpredictable usage-based revenue and potential bill-shock churn for customers who underestimate consumption. Margin compression from LLM inference costs—third-party API calls at commercial Anthropic/OpenAI rates—limits gross margin expansion until Cognition builds proprietary models or achieves volume pricing. No path to profitability, EBITDA, or gross margin targets has been publicly disclosed. Windsurf integration introduces additional operational complexity: merging billing systems, enterprise contracts, and product teams simultaneously increases execution risk and may slow the strategic roadmap. The January 2025 Series A at $2B valuation and post-Windsurf round imply a $10.2B Growjo valuation estimate; any material miss on ARR growth trajectory, a churn event, or a down round from a next capital raise could impair team motivation and trigger key employee departures. Financial risk is elevated by the company's refusal to disclose gross margins, NRR, or cohort retention metrics.

People and execution risk register
Role / FunctionDependency or GapLikelihoodSeverityMitigationDiligence Path
CEO / Co-founder (Scott Wu)Single point of product vision; investor credibility; technical leadership; competitive programmer identityLowCriticalCo-founder team; board oversight; no public succession planRequest governance structure; confirm board independence; review co-founder vesting cliff schedule
AI research / ML engineering talentExtreme competition from Anthropic, OpenAI, Google DeepMind, and Microsoft for the same competitive programming and ML talent poolMediumHighCompetitive compensation (est. $500K+ TC); strong Cognition brand in AI communityRequest headcount breakdown by function; review senior ML engineer tenure and equity refresh policy
International legal / regulatory operationsExpanding to EU, Japan, Singapore, and US Government with no disclosed dedicated legal infrastructure; FedRAMP not obtainedHighMediumNascent; likely ad hoc outside counsel for each jurisdictionVerify EU DPO appointment; request Japan PIPL and Singapore PDPA compliance roadmap; confirm FedRAMP timeline

Risk ratings based on publicly disclosed organizational structure. Internal succession plans, bench depth, and retention programs not publicly disclosed.

[CR015, CR016, CR017]

7.5 Execution, People, and Partner Risk

Cognition's three co-founders—Scott Wu (CEO), Steven Hao, and Walden Yan—are competitive programming champions with exceptional technical credentials but no prior track record building enterprise software companies beyond early-stage startups. Key-person risk is concentrated in Scott Wu, who drives technical vision, investor relationships, and product strategy. The company's AI talent pipeline is stressed by intense competition from Anthropic, OpenAI, Google DeepMind, and Microsoft—all operating frontier AI programs at compensation packages exceeding $500K total compensation, directly competing for the same competitive programming and ML engineering talent pool that Cognition relies on. Devin's platform depends critically on third-party partners: AWS (VPC compute), GitHub (PR workflow integrations), and Slack/Teams (enterprise communication). Any of these platforms modifying API access, pricing, or integration policy could disrupt Devin's enterprise value proposition. Anthropic and OpenAI present a dual risk as both supplier and competitor: they supply the underlying models Devin orchestrates, but compete via Claude Code and OpenAI Codex respectively, creating conflict-of-interest incentives. Geographic expansion to Japan, Singapore, and the EU—announced in Q1–Q2 2026—introduces jurisdiction-specific compliance obligations that require legal and operations hiring ahead of revenue materialization. US Government customers require FedRAMP or equivalent authorization, typically a 12–24 month process; the gap between announced government vertical and actual FedRAMP authorization creates a window of reputational and contractual risk. International legal infrastructure—EU DPO appointment, Japan PIPL compliance, Singapore PDPA— represents an emerging gap for a company that has prioritized product velocity over regulatory scaffolding.

Mitigation and kill criteria table
RiskMonitorable TriggerKill-Criteria ThresholdAction Implication
Prompt-injection / supply-chain breachDisclosed CVE; customer codebase contamination; second security incidentAny confirmed supply-chain contamination of production customer codebase OR second public security incident within 12 monthsImmediate thesis review; evaluate churn impact; assess remediation credibility; consider position exit
Competitor commoditization of agentic codingMonthly SWE-bench leaderboard; Cursor, GitHub Copilot, Claude Code pricing announcementsCompetitor at ≥80% SWE-bench Verified at ≤50% Devin pricing OR Devin YoY ARR growth <50%Reduce position; reassess moat thesis beyond benchmark scores; evaluate Windsurf-led differentiation
Customer concentration / Nubank churnNRR disclosed at renewal; top-customer ARR share; pipeline diversification evidenceSingle customer >30% ARR OR NRR <90% in any trailing quarterImmediate ARR quality review; model churn scenarios; request pipeline diversification evidence
LLM provider pricing shock or API restrictionAnthropic / OpenAI pricing announcements; API terms updates for agentic use casesLLM API cost increases >3× without corresponding Devin price increase OR API terms restrict autonomous PR mergesModel margin compression exposure; evaluate proprietary model timeline and cost pass-through feasibility

Kill criteria are thesis-break triggers. Thresholds reflect current consensus; reassess at each monitoring cycle.

[CR018, CR019, CR020, CR021]
Chapter 08

08Valuation

8.1 Investment Thesis and Anti-Thesis

The bull thesis for Cognition AI rests on three mutually reinforcing propositions. First, autonomous software engineering represents a genuine market category shift— not an incremental productivity improvement. The total addressable market for software development labor exceeds $650B globally; even a 10% displacement of developer time through autonomous agents implies a $65B+ revenue opportunity. Cognition's Devin product is the category pioneer, establishing the autonomous coding agent as a procurement line item at leading enterprises before most buyers understand the category. First-mover brand recognition, lighthouse customer proof (Nubank), and the Windsurf IDE network (250K+ DAU) create a platform moat that sustains pricing power beyond pure benchmark performance. Second, the ARR trajectory from $1M (GA, December 2024) to an estimated $73M (April 2025) in nine months implies a growth rate that is structurally disconnected from historical SaaS benchmarks. At this growth rate, the company could plausibly reach $500M ARR within 18 months of the April 2025 data point, which would place it in the cohort of fastest-growing enterprise SaaS companies in history and justify a $10B+ valuation on forward ARR multiple compression. Third, the co-founders' deep technical backgrounds and Founders Fund's conviction investment create a signal that the founding team can sustain the product velocity needed to maintain differentiation. The anti-thesis is equally forceful. Benchmark commoditization is the existential risk: Claude Code Opus 4's 72.5% SWE-bench Verified score in 2025 versus Devin's 13.86% Full score at launch demonstrates that the performance gap is closing at a rate that may eliminate Cognition's technical differentiation by late 2026. The $10.2B valuation at ~140× estimated trailing ARR prices in near-flawless execution on a trajectory that has no published corroboration beyond Growjo estimates. Customer concentration risk is unquantified: $73M ARR from an undisclosed number of enterprise accounts could mean three to five contracts, where a single defection triggers a material ARR decline. The December 2024 prompt-injection security incident—discovered during a live stream— damaged Cognition's credibility among technically sophisticated buyers at a critical early-adoption inflection point. LLM provider dependency on Anthropic and OpenAI, both of whom compete directly with Devin, introduces a structural conflict-of-interest that could manifest as sudden API pricing increases or feature restrictions that raise Devin's marginal cost of delivery. Without NRR, gross margin, or cohort retention data, the financial quality of the reported ARR cannot be independently assessed.

Recommendation summary table
DimensionAssessmentEvidence BaseConfidenceImplication
RecommendationWatchlist — conditional invest below $5B entryARR growth exceptional; valuation multiple extreme at 140× ARR; security/concentration risks unresolvedMediumMonitor NRR, margin, and integration progress; entry discipline required
Risk ratingHighSecurity incident history; benchmark commoditization; LLM dependency; customer concentration unknownHighHigh risk of capital impairment at $10.2B entry if execution falters
Valuation stanceSpeculative premium — current $10.2B at ~140× ARR is 7× Cursor multipleCursor $9.9B / $500M ARR = 20× ARR; Cognition $10.2B / $73M ARR = 140× ARRMediumFair value range $3–7B on base-case assumptions; entry above $7B requires bull case proof points
Bull case target$18–25B exit (3-year hold, $700M+ ARR, 25–35× forward ARR)Growth rate sustainability; Windsurf integration success; NRR ≥120%Low-Medium1.5–2.5× return from $10.2B entry; adequate for venture IRR only with near-flawless execution
Bear case target$800M–$1.8B exit (second security incident OR major churn OR >3× competitor pricing undercut)Security overhang; benchmark commoditization; LLM pricing riskMedium90%+ capital loss at $10.2B entry; unacceptable asymmetry without higher-confidence NRR data

Recommendation is for institutional investors evaluating a secondary or primary position in Cognition at current valuation. This is not investment advice.

[CV001, CV002, CV003]
Final diligence asks table
Diligence RequestPriorityWhy CriticalAcceptable Threshold
Net Revenue Retention by cohort (quarterly, trailing 4Q)CriticalARR quality; determines if growth is expansion- or churn-maskedNRR ≥110% confirms health; <90% is thesis-break
Gross margin by product line (Devin ACU, Windsurf, Enterprise)CriticalLLM cost structure; path to profitability; margin compression riskTarget 50%+ gross margin; <30% raises sustainability questions at scale
Top-5 customer ARR and share of total ARRCriticalConcentration risk; single churn event impactTop-5 <40% total ARR; any individual >15% ARR is high concentration
Post-Devin-2.0 penetration test report or red-team summary (post-December 2024 patch)HighSecurity credibility; expanded agentic capabilities require re-auditNDA access to SOC 2 Type II + at least 1 post-Devin-2.0 independent pen test
Windsurf integration customer retention rate (6 months post-acquisition)HighIntegration execution proof; IDE customer attrition riskWindsurf enterprise account retention ≥85% at 6 months post-acquisition
LLM API contract terms (Anthropic, OpenAI): volume pricing, agentic use-case clauses, change-of-controlHighSupplier conflict-of-interest; margin floor determinationLocked-in pricing through 2027 OR multi-model diversification strategy in place
EU GPAI compliance plan and regulatory affairs staffingMediumAugust 2026 compliance deadline; EU revenue at risk if non-compliantNamed EU regulatory affairs hire; GPAI transparency doc timeline <6 months
FedRAMP authorization status and timeline for US Government verticalMediumGovernment ARR pipeline; credibility of government vertical launch (Feb 2026)Clear FedRAMP authorization timeline; interim use-case confirmation from named agency

Priority levels: Critical = must-have before investment decision; High = must-have before term sheet; Medium = must-have before close.

[CV023, CV024, CV025, CV026, CV027, CV028]
FV001: Recommendation Decision Logic
[CV001, CV002, CV018, CV019]

8.2 Valuation Context and Financing History

Cognition has completed three disclosed financing rounds totaling approximately $1.575B. The $21M seed round established the founding team in 2023. The $175M Series A (January to April 2025, Founders Fund IX lead) set the initial institutional valuation at $2B post-money. SEC EDGAR records confirm Founders Fund IX (CIK 0001971631) as a large institutional vehicle that committed capital to Cognition; the Cognition Capital SPV I (CIK 0002072175, Form D filed 2025-06-11) indicates a follow-on structured vehicle was established concurrent with or shortly after the Windsurf acquisition close. The $400M round (September 2025) valued the company at $10.2B post-money, a 5.1× step-up from the Series A in approximately nine months—one of the steepest step-up trajectories in AI software history. The total financing of ~$1.575B implies approximately 15%+ dilution on a $10.2B post-money basis, though exact preference stack and liquidation waterfall are not disclosed. At the $10.2B valuation, the primary revenue multiple is approximately 140× the April 2025 ~$73M ARR estimate—a premium to all disclosed AI software peers. Cursor/Anysphere closed its Series D at $9.9B on approximately $500M ARR (September 2025), implying a 20× ARR multiple. Cognition's premium is partially justified by its superior growth rate and category-pioneer positioning but represents execution risk priced in for investors entering at $10B+. The Windsurf acquisition (July 2025) was completed via Cognition equity and cash for an undisclosed purchase price; the Cognition Capital SPV I suggests a structured investor-level vehicle, possibly used to fund the acquisition consideration or backstop the combined entity's operating plan. A "culture reset" reported by Winbuzzer in August 2025—Cognition offering nine-month buyouts to Windsurf staff and reportedly requiring 80-hour workweeks—introduces talent integration risk not reflected in the headline valuation. Any new institutional investor at $10.2B would face a meaningful preference overhang from the seed, Series A, and $400M round before reaching par on any return calculation below a $10.2B exit value.

Thesis / anti-thesis table
Thesis PillarSupporting EvidenceCountervailing EvidenceConviction
Autonomous engineering as a category creation eventARR growth from $1M to $73M in 9 months; Nubank 12× efficiency; Cognition uses Devin internally (659 PRs/week)Benchmark commoditization accelerating; peers may replicate orchestration layer without needing Cognition's toolingMedium
Network moat from Windsurf IDE + Devin cloud agent integrationWindsurf 250K+ DAU and 350+ enterprise accounts added at acquisition (Jul 2025)Integration execution risk; culture reset reported Aug 2025; two codebases to merge simultaneouslyLow-Medium
Founders Fund IX conviction as quality signalFounders Fund IX (CIK 0001971631) as lead Series A investor; Peter Thiel-associated track recordFounders Fund portfolio concentration; past thesis misses have occurred; not a guarantee of outcomeMedium
Price reduction as volume expansion strategyDevin 2.0 reduced effective price 3×; 83% more tasks per ACU; implies cost curve droppingGross margin unknown; 3× price cut suggests either cost structure improvement OR competitive response; either reduces total revenue headroomLow

Conviction rating is analyst judgment based on publicly available evidence. Internal financial metrics (NRR, gross margin) are not available to modify these assessments.

[CV004, CV005, CV006, CV007]
FV002: Valuation Sensitivity to ARR and Multiple
[CV008, CV009, CV010, CV011]

8.3 Bull, Base, and Bear Scenario Analysis

The bull scenario assumes Devin's ARR continues at a sustained 50% quarterly growth rate through 2027, reaching approximately $700M–$800M ARR by end of 2027. At this scale, with improving gross margins from model cost optimization and volume contracts, Cognition would likely command an IPO or acquisition valuation of $15–25B—a 1.5–2.5× return from $10.2B entry. The bull case requires: no material security incidents post- December 2024; successful Windsurf integration adding a credible IDE channel; NRR disclosed at ≥120%; and competitor models failing to commoditize at Devin's enterprise workflow integration depth. The base scenario assumes moderate ARR growth (30–40% quarterly decay from the 2025 sprint) reaching $200–350M ARR by end of 2027, with gross margin improvement to 50–60%. At a normalized 25–35× forward ARR multiple (consistent with high-growth AI SaaS), this implies a $5–12B valuation—at or near flat to entry at $10.2B, suggesting negligible return for investors at that entry. The base case requires no major churn events, reasonable Windsurf integration, and sustained AI infrastructure investment by enterprises. The bear scenario assumes a material adverse event within 18 months: a second security incident, Nubank churn, or a Devin price war triggered by Claude Code or OpenAI Codex commoditization. Under the bear case, ARR plateaus at $80–120M, NRR is disclosed below 90%, and the valuation compresses to 10–15× ARR implying an $800M–$1.8B exit value—a 90%+ capital loss for investors entering at $10.2B. The bear trigger probability is estimated at 20–30% given the concentration of known risk factors, particularly the security overhang and the structural benchmark commoditization trend. Return expectations are asymmetric: the bull upside (1.5–2.5× in 3 years) does not adequately compensate for a 30%+ probability of severe capital loss at a $10.2B entry valuation. This is the core investment thesis challenge: the risk-adjusted return at current entry is unattractive unless the investor has high-conviction, differentiated information about NRR and gross margin that the market does not.

Bull / base / bear scenario table
ScenarioARR (End 2027 est.)ARR MultipleImplied ValuationProbability SignalKey Assumption
Bull$700–800M25–35× forward ARR$18–28B20–25% (requires near-perfect execution)No major security incidents; NRR ≥120%; Windsurf integration successful; LLM pricing stable
Base$200–350M20–25× forward ARR$4–9B45–55% (moderate-growth execution)Moderate churn; Windsurf partial integration; 30–40% quarterly ARR growth rate decay
Bear$80–120M8–12× trailing ARR$640M–$1.4B25–30% (materially negative event)Second security incident OR major customer churn OR competitor price undercut >3× OR regulatory enforcement

Scenario ranges are analyst estimates based on comparable company benchmarks and disclosed growth metrics. Probabilities are qualitative signals, not quantitative models.

[CV008, CV009, CV010]
FV003: Valuation and Return Range by Scenario
[CV008, CV009, CV010, CV003]

8.4 Comparable Company Valuation Analysis

Cognition's peer group for comparables purposes includes AI-native developer productivity tools (Cursor, Replit), foundation model providers with coding products (Anthropic/Claude Code, OpenAI/Codex), and broader enterprise AI SaaS companies at comparable ARR run-rates. Cursor/Anysphere is the closest comparator: both are AI coding tools targeting enterprise developers, both are VC-backed, and both reached their most recent valuations in the second half of 2025. Cursor closed its Series D at $9.9B (Bloomberg, November 2025) on approximately $500M ARR—an 18–20× ARR multiple. Cognition at $10.2B on ~$73M ARR implies a ~140× ARR multiple, a 7–8× premium to Cursor on ARR-basis. The Cognition premium is partially justified by higher growth rate (Devin grew faster from launch) but is structurally unsustainable as Cursor's ARR ($500M) already demonstrates greater revenue quality and breadth. Harvey AI (legal AI, $3B valuation, ~$100M ARR, ~30× ARR) represents a vertical AI SaaS comparator where high-quality NRR justifies premium multiples. Glean (enterprise AI search, $4.6B, ~$100M ARR, ~46× ARR) demonstrates that enterprise AI SaaS can sustain 40–50× ARR multiples at $100M ARR, but NRR at Glean is reported above 150%. Replit ($1.16B valuation, 2023, development platform with AI pivot) represents the category risk: a pioneer pivot story that lost market share to more focused AI-native competitors. The implied IPO entry multiple for AI SaaS companies at or near profitability in 2025 is approximately 15–25× forward ARR; Cognition at $10.2B and ~$73M ARR would need to reach $400–680M ARR before an IPO at standard multiples would preserve the $10.2B entry valuation. No public market comparator exists: GitHub Copilot is embedded in Microsoft Azure revenue, Amazon Q in AWS, and Claude Code in Anthropic's API revenue. Private market premium over public comps is historically 20–40% for growth-stage AI; this suggests a fair-value range for Cognition (on base-case assumptions) of $3–7B, below the $10.2B last round, implying the current round was priced at a 1.5–3× speculative premium to intrinsic value. Entry discipline—waiting for either ARR confirmation at $300M+ or valuation correction to $4–6B—is the appropriate investor response.

Comparable valuation table
CompanyValuationARR / RevenueARR MultipleStageComparability to Cognition
Cursor (Anysphere)$9.9B (Nov 2025 Series D)~$500M ARR (2025 est.)~20× ARRLate private, pre-IPOClosest comp: AI coding tool, enterprise, developer-productivity focus; 7× lower ARR multiple
Glean$4.6B (2024)~$100M ARR (est.)~46× ARRLate privateEnterprise AI search; high NRR (≥150%) justifies premium; Cognition NRR unknown
Harvey AI$3B (2025)~$100M ARR (est.)~30× ARRGrowth stageVertical AI SaaS; legal focus; strong enterprise NRR; narrower TAM than Cognition
Cohere$5.5B (2024)~$100M ARR (est.)~55× ARRLate privateFoundation model provider; infrastructure-layer; less exposed to benchmark commoditization
Replit$1.16B (2023)Sub-$50M ARR (est.)~25× ARRGrowth stageCategory risk precedent: developer tool that lost share to AI-native entrants; watch for pattern
GitHub CopilotN/A (embedded in Microsoft)~$200M+ ARR (est.)N/APublic (Microsoft)Strongest distribution moat (250M+ GitHub users); pricing $10–$39/user/month; commoditizes Devin's $500/month price point
Amazon Q DeveloperN/A (embedded in AWS)~$50M ARR (est.)N/APublic (Amazon)AWS bundled pricing; relevant for Devin's enterprise cloud deployment competition

All non-public valuations are based on disclosed or reported funding rounds. Revenue multiples for private companies are analyst estimates; actual metrics not confirmed by companies.

[CV011, CV012, CV013, CV014, CV015, CV016]
FV004: Investment Quality KPIs — Cognition vs. Thresholds
[CV001, CV030, CV031, CV032, CV033]

8.5 Exit Readiness and Final Diligence Asks

Cognition is 2–4 years from an IPO readiness event given current ARR trajectory and the disclosure gap that public markets require. For IPO at $8B+ market capitalization, the company needs approximately $400–500M ARR, positive unit economics (40%+ gross margin), disclosed NRR above 120%, and a cleared SOC 2 Type II + GDPR compliance infrastructure. M&A exit paths exist: Microsoft (GitHub Copilot/Azure positioning), Alphabet (Google Cloud developer tools), Salesforce (developer productivity within Einstein platform), or Snowflake/Databricks (data engineering automation) are logical acquirers. The $10.2B valuation makes a strategic acquirer path challenging: Microsoft's prior GitHub acquisition at $7.5B (2018) was at scale, and a $10.2B bid for Cognition would be among the largest Microsoft AI acquisitions—requiring board-level conviction. The most likely exit path remains late-stage private equity / pre-IPO crossover participation, with an IPO window in 2028–2030 if ARR trajectory sustains. Final diligence priorities before investment: (1) NRR by cohort—disclosed above 110% is a green flag; below 90% is thesis-break; (2) gross margin at scale—target 50%+ gross; (3) penetration test report from post- Devin-2.0 audit; (4) Windsurf integration roadmap with customer retention data from the acquired base; (5) LLM provider contract terms including agentic use-case pricing floors; (6) GPAI compliance plan and EU regulatory timeline; (7) top-5 customer ARR concentration (>50% from top-5 is a material concentration risk); and (8) headcount by function to assess burn relative to ARR coverage ratio. Thesis-break conditions: a second disclosed security incident within 12 months, NRR disclosed below 90%, ARR growth rate deceleration to <50% YoY, or a confirmed major enterprise customer churn event.

Thesis-break and kill triggers table
Trigger EventThresholdMonitoring IndicatorAction Implication
Second security incident or supply-chain breachAny confirmed second Devin security incident within 18 months of December 2024 patchCVE disclosures; enterprise security advisories; HN/security communityImmediate thesis review; evaluate credibility of remediation; likely exit
Major customer churnTop customer ARR decline >25% YoY OR any individual customer ARR >$10M lostARR disclosure at funding; press announcements; customer reference interviewsReduce position; re-model ARR trajectory with churn scenario
Competitor price undercut >3× Devin equivalent at comparable capabilityCompetitor at SWE-bench ≥80% Verified at ≤$160/month (Devin equivalent)SWE-bench leaderboard; competitor pricing pages monthlyStructural differentiation review; accelerate Windsurf moat assessment
NRR disclosed below 90%NRR < 90% at any disclosed or estimated cohort measurementFinancial disclosure at next funding; data roomRe-model ARR growth; high churn at scale is thesis-break for $10B+ entry
EU regulatory enforcement action for GPAI non-complianceEU AI Office initiates formal investigation or issues compliance order against CognitionEU AI Office announcements; DPC press releases; Reuters/FT regulatory coverageAssess EU revenue at risk; model enforcement cost; could trigger broader enterprise procurement pause

Kill criteria are not legal thresholds; they are investment risk management indicators for monitoring committees.

[CV018, CV019, CV020, CV021, CV022]

Disclaimer

This report is a public-evidence diligence snapshot, not investment advice. Important financial, legal, technical, and contractual facts remain non-public and should be verified directly with management and primary documents before any investment decision.

Evidence index

Claims
IDStatementConfidenceSources
CO001 Cognition AI was founded in November 2023 in San Francisco, California. High SO002, SO005, SO010
CO002 Cognition AI is headquartered in San Francisco, California. High SO001, SO005
CO003 Cognition AI's flagship product, Devin, is an autonomous AI software engineer capable of interpreting tickets, planning, writing code, debugging, testing, and deploying software with minimal human oversight. High SO002, SO025
CO004 Scott Wu is the co-founder and CEO of Cognition AI. High SO001, SO005, SO010, SO011
CO005 Steven Hao is the co-founder and CTO of Cognition AI. High SO001, SO005, SO010
CO006 Walden Yan is the co-founder and CPO (Chief Product Officer) of Cognition AI. High SO001, SO005, SO010
CO007 Scott Wu is a three-time IOI (International Olympiad in Informatics) gold medalist. Medium SO011, SO010
CO008 Cognition AI raised $175M at a $2B valuation in April 2024, led by Founders Fund. High SO006, SO007, SO008
CO009 Cognition AI raised $400M at a post-money valuation of $10.2B in September 2025, led by Founders Fund. High SO006, SO007, SO008, SO009
CO010 Cognition AI's total capital raised is approximately $696M as of September 2025. High SO008, SO014
CO011 Cognition AI acquired Windsurf, an AI-native IDE company, in July 2025. High SO004, SO009, SO015
CO012 Windsurf had approximately $82M ARR at the time of its acquisition by Cognition AI. High SO003, SO008, SO009
CO013 Windsurf had more than 350 enterprise customers at the time of its acquisition by Cognition AI. Medium SO004, SO009
CO014 Cognition AI's ARR grew from $1M in September 2024 to $73M in June 2025, representing approximately 73x growth in nine months. High SO008, SO024
CO015 Following the Windsurf acquisition, Cognition AI's combined ARR reached approximately $155M in July 2025. High SO003, SO008
CO016 Cognition AI's net cash burn was under $20M from founding through Q3 2025, indicating high capital efficiency. Medium SO003, SO014
CO017 Devin was publicly announced in March 2024. High SO002, SO005
CO018 Cognition AI initially explored cryptocurrency before pivoting to building autonomous AI coding agents. Medium SO005, SO010
CO019 The founding team of approximately 10 people collectively held 10 IOI gold medals at company launch. Medium SO002, SO011
CO020 Founders Fund led both the initial seed/Series A and the $400M Series B round for Cognition AI. High SO006, SO007, SO008
CO021 Goldman Sachs is among Cognition AI's enterprise customers and is reportedly running a large-scale Devin pilot with approximately 12,000 developers. Medium SO012, SO003
CO022 Cognition AI laid off 30 former Windsurf employees following the July 2025 acquisition. High SO013, SO016
CO023 Cognition AI offered nine-month salary buyout packages to approximately 200 remaining Windsurf employees who did not want to commit to the company's 80-hour, six-day workweek culture. High SO013, SO016, SO017
CO024 Cognition AI's enterprise ARR grew more than 30% in seven weeks following the Windsurf acquisition. Medium SO003
CO025 Cognition AI opened its Singapore APAC headquarters in April 2026. High SO001, SO003
CO026 Scott Wu studied economics at Harvard University. Medium SO011, SO010
CO027 Scott Wu previously co-founded Lunchclub, an AI-powered professional networking platform. Medium SO011, SO010
CO028 Devin's Core plan is priced at $20/month for individual developers with pay-as-you-go compute at approximately $2.25 per ACU. High SO026, SO025
CO029 Devin's Team plan costs $500/month and includes 250 Agent Compute Units plus full API access. High SO025, SO026
CO030 Cognition AI's $175M Series A was conducted at a $2B post-money valuation in April 2024. High SO006, SO008
CO031 Cognition AI raised approximately $21M in March 2024 at a valuation of approximately $350M in its initial institutional round. Medium SO008, SO021
CO032 Scott Wu holds the 'Legendary Grandmaster' designation on the Codeforces competitive programming platform. Medium SO011, SO010
CO033 Mercedes-Benz announced a partnership with Cognition AI in April 2026. High SO001, SO025
CO034 Independent product reviewers have found that Devin's real-world task success rates require significant human oversight on complex tasks, suggesting performance falls below the 'fully autonomous' marketing framing. Medium SO018, SO019
CO035 CEO Scott Wu has stated publicly that Cognition AI's culture requires six-day workweeks of more than 80 hours, framing it as a company-wide commitment to the mission. High SO013, SO016, SO017
CO036 Investors in the September 2025 $400M round included Lux Capital, 8VC, Elad Gil, Definition Capital, Swish Ventures, Bain Capital Ventures, and D1 Capital. High SO006, SO009, SO014
CO037 As of April 2026, Cognition AI is reportedly in discussions for a new financing round that could value the company at over $25 billion. Medium SO008
CO038 Walden Yan is an IOI gold medalist who leads product management and the product roadmap at Cognition AI. Medium SO010, SO005
CO039 Steven Hao previously held roles at Scale AI, Cursor, Modal, DeepMind, Waymo, and Nuro before co-founding Cognition AI. Medium SO010, SO011
CO040 Cognition AI had approximately 49 employees before the Windsurf acquisition according to revenue-per-employee analyses. Low SO008
CO041 Windsurf had been in acquisition discussions with OpenAI for approximately $3B and was the subject of a Google talent licensing deal worth approximately $2.4B before Cognition acquired the remaining team and IP. Medium SO015, SO009
CO042 Cognition AI's price-to-ARR multiple at the September 2025 valuation was approximately 65x (using $155M combined ARR) to 140x (using $73M pre-acquisition Devin ARR). Medium SO023, SO008
CO043 Devin operates within a sandboxed environment that includes a shell, code editor, and browser, using long-context model inference to sequence planning, implementation, and iterative testing steps autonomously. High SO002, SO025
CM001 The broad AI code tools market was valued at $7.37 billion in 2025, according to Mordor Intelligence, and is forecast to reach $23.97 billion by 2030 at a 26.6% CAGR. Medium SM001
CM002 Grand View Research forecasts the narrow generative AI coding assistants market at $92.5 million by 2030 at a 24.8% CAGR—a significantly smaller scope than Mordor's $24B figure. Medium SM002
CM003 ResearchAndMarkets published a $97.9 billion by 2030 forecast for the generative AI coding assistants space, though this figure likely uses an atypically broad scope including non-comparable segments. Low SM003
CM004 No single analyst consensus exists for the AI coding tools TAM: estimates range from $92M (Grand View, narrow) to $97.9B (ResearchAndMarkets, broad) for 2030, with the most operationally relevant SAM estimate at $1–8B. Medium SM001, SM002, SM003
CM005 The estimated serviceable addressable market for premium agentic developer tools (autonomous tier) is $1–2B in 2025–2026, growing to $5–8B by 2030 based on developer-seat and enterprise-spend lenses. Medium SM006, SM007, SM008
CM006 Global software spending reached approximately $675–$700 billion in 2024, according to WIPO's Global Innovation Index and multiple market research sources. High SM008, SM007
CM007 The number of professional software developers worldwide in 2024 is estimated at 27–28.7 million by Evans Data, Statista, and JetBrains Research. High SM006, SM007, SM020
CM008 Gartner forecasts that 90% of enterprise software engineers will use AI code assistants by 2028, up from less than 14% in early 2024. High SM004, SM005, SM009
CM009 GitHub Copilot holds approximately 42% market share among paid AI coding tools in 2025, with $2 billion in ARR and over 20 million all-time users. Medium SM017, SM018, SM019
CM010 GitHub Copilot is used by over 50,000 organizations including 90% of Fortune 100 companies in 2025, with enterprise adoption up 75% quarter-over-quarter. Medium SM017, SM019
CM011 Developers using AI coding assistants complete code tasks 51–55% faster, with AI-generated code constituting 46% of all code written by Copilot users on GitHub. Medium SM017, SM018
CM012 Enterprise spend on agentic AI systems is projected to surge from less than $1 billion in 2024 to over $51 billion by 2028 at approximately 150% CAGR. Medium SM023, SM024
CM013 Enterprise buyers universally require sandbox environments, audit logs, and human-in-the-loop review gates before deploying autonomous coding agents in production workflows. High SM013, SM014, SM015
CM014 The biggest barriers to enterprise adoption of agentic AI include trust deficits from hallucination risk, legacy system integration complexity, and unclear ROI measurement frameworks. High SM013, SM014, SM015, SM016
CM015 65% of enterprises were regularly using generative AI as of 2025, up from 33% in 2023, indicating rapid mainstream adoption of AI tools in enterprise workflows. Medium SM012, SM023
CM016 Only a minority of AI use cases in enterprises have reached full production in 2025; primary barriers are legacy systems, data silos, compliance concerns, and talent shortages. High SM024, SM013
CM017 72% of enterprises plan to deploy AI copilot and agent technologies by 2026, according to aggregated industry surveys. Medium SM023
CM018 The primary enterprise buyer for autonomous software engineering agents is VP Engineering or CTO, with budget sourced from developer productivity or digital transformation allocations. Medium SM012, SM024
CM019 Goldman Sachs (12,000-developer pilot), Citi, Cisco, Dell, Nubank, and Ramp are among Cognition AI's enterprise reference customers, representing the financial services and high-growth tech segments. Medium SM011, SM023
CM020 The EU AI Act and evolving US AI governance regulations impose documentation and explainability requirements that will affect enterprise deployment of autonomous coding agents beginning 2026–2027. Medium SM013, SM015
CM021 Status-quo substitutes for autonomous AI coding agents include offshore staff augmentation, traditional IDEs, internal developer tooling built on base LLM APIs, and cloud-vendor bundled IDE extensions. Medium SM001, SM011
CM022 Switching costs for enterprise AI coding tools include integration migration effort, process overhaul, vendor lock-in risks, and cultural reorientation, making incumbent vendor stickiness a meaningful competitive moat. Medium SM015, SM016
CM023 North America leads the AI coding tools market with the largest share; cloud deployment dominates though enterprise demand for on-premises solutions is growing, particularly among regulated industries. Medium SM001, SM002
CM024 The autonomous agents market (cross-sector) is forecast to grow from $4.35B in 2025 to $103B by 2034 at over 40% CAGR, providing a directionally supportive upper-bound context for the AI software engineering agent TAM. Low SM015, SM025
CM025 Gartner's Magic Quadrant for AI Code Assistants provides enterprise procurement legitimacy for the category, with GitHub Copilot named a Leader in both 2024 and 2025 editions. High SM004, SM010
CM026 87% of developers are using AI tools daily as of 2025, indicating that AI-assisted development has reached mainstream adoption among professional software engineers. Medium SM011, SM023
CM027 Developers using AI tools report 3.2× productivity improvements on average, with 67% using AI tools at least five days per week according to 2025 surveys. Medium SM011, SM017
CM028 Financial services enterprises present a high-value segment for Cognition AI given Goldman Sachs' 12,000-developer Devin pilot; these buyers require SOC 2 Type II, data residency, and regulatory compliance before enterprise sign-off. Medium SM013, SM024
CM029 The global software development market is valued at $675–$730B in 2024; a 1% shift of project labor costs to AI tools implies a $6.7B+ market, corroborating the Mordor $24B broad TAM estimate for 2030. Medium SM008, SM021
CM030 AI-assisted pull request cycle times have been reduced by up to 75% (from 9.6 days to 2.4 days) for organizations using GitHub Copilot, per reported enterprise metrics. Medium SM017, SM018
CM031 Unpredictable agent behavior including hallucinations, lack of transparency, and security concerns are the primary trust barriers slowing enterprise deployment of autonomous AI coding agents. High SM014, SM016
CM032 77% of companies are currently using or actively testing AI tools, with the global AI market estimated at approximately $298 billion in 2025. Medium SM023
CM033 Agentic AI adoption in software engineering is constrained not just by technology but by organizational readiness: workforce resistance, lack of AI talent, and unclear business cases are the dominant enterprise blockers as of 2025. High SM013, SM015, SM024
CM034 Competitive fragmentation in AI coding tools—GitHub Copilot (42% market share), Cursor ($9B valuation), Windsurf (acquired by Cognition), and OpenAI Codex—can produce buyer choice overload that stalls enterprise procurement decisions. Medium SM011, SM017
CM035 Cognition AI's $155M ARR as of July 2025 represents approximately 2% penetration of the estimated $7.37B AI code tools TAM, suggesting material white space remains if the market definition is correct. Low SM001
CP001 GitHub Copilot holds approximately 42% of the AI coding assistant market share as of 2025. Medium SP005, SP017
CP002 GitHub Copilot had over 20 million active users as of early 2025. Medium SP005, SP012
CP003 GitHub Copilot exceeded $2 billion in annual recurring revenue by early 2025. Medium SP002, SP004
CP004 GitHub Copilot is used by 90% of Fortune 100 companies. Medium SP012, SP021
CP005 Cursor (Anysphere) reached $2 billion ARR by February 2026. High SP001, SP003, SP004
CP006 Cursor raised a $2.3 billion Series D in November 2025 at a $29.3 billion valuation. High SP001, SP003
CP007 Cursor has over 1 million paying customers and 50,000 enterprise teams. Medium SP003, SP006
CP008 Cursor's multi-agent background agents feature runs up to eight parallel task threads simultaneously. Medium SP005, SP016
CP009 Amazon Q Developer offers a Free tier and a Pro tier priced at $19 per user per month. High SP013, SP018
CP010 Amazon Q Developer achieved SOC 2 Type II certification with VPC isolation support. Medium SP007, SP013
CP011 Amazon Q Developer has a 200,000 token context window. Medium SP007, SP018
CP012 Claude Code operates as a terminal-native agentic coding assistant without requiring a custom IDE. High SP008, SP009
CP013 Devin's initial SWE-bench score in March 2024 was 13.86%, a breakthrough at the time of announcement. High SP010, SP015, SP025
CP014 Subsequent model improvements from Anthropic pushed Claude-based agents above 50% on SWE-bench Verified. Medium SP010, SP015
CP015 Claude Code is priced at $10 per month on Pro and $100 per month on Max tiers. High SP027, SP008
CP016 OpenAI relaunched Codex as a web-based agentic coding environment backed by the o3 model family in 2025. Medium SP014, SP011
CP017 SWE-agent is an open-source research framework developed at Princeton NLP Group, not a commercial product. High SP025, SP010
CP018 Cursor investors include Thrive Capital, Andreessen Horowitz, Accel, Nvidia, and Google. High SP001, SP003
CP019 GitHub Copilot individual pricing is $10 per month; Business tier is $19 per month; Enterprise tier is $39 per month. High SP021, SP012
CP020 Windsurf (formerly Codeium) was acquired by Cognition in July 2025, adding approximately $82M ARR and 350-plus enterprise customers. Medium SP024, SP006
CP021 Cognition's acquisition of Windsurf converted a direct IDE-tier competitor into a distribution channel for Devin. Medium SP024, SP019
CP022 GitHub Copilot expanded from inline code completions to Copilot Workspace, a multi-step planning and execution agent. Medium SP012, SP005
CP023 Cursor pricing is $20 per month for Pro tier and $40 per month for Business tier as of 2025. Medium SP023, SP005
CP024 Replit offers an AI-native development platform with cloud execution environments targeting student and hobbyist developers. Medium SP022
CP025 The core architectural gap between Devin and Claude Code is that Claude Code still requires a human operator at the terminal, while Devin runs fully unattended. Medium SP009, SP008
CP026 Cognition's defensible competitive advantages include execution depth, enterprise data flywheel from completed tasks, and Windsurf distribution. Medium SP019, SP011
CP027 Benchmark credibility concerns about Devin emerged after independent evaluations showed other tools surpassing it within months of the March 2024 announcement. Medium SP010, SP015, SP025
CP028 The competitive window for pure agentic differentiation is narrowing as frontier model labs deploy equivalent agentic pipelines. Medium SP011, SP019
CP029 Cursor is growing faster than Cognition on ARR metrics and has more paying customers, providing proportionally more training signal. Medium SP001, SP003, SP004
CP030 Cursor uses a VS Code fork architecture that provides deep editor integration unavailable to plugin-based competitors. Medium SP005, SP016, SP023
CP031 Amazon Q Developer's native AWS service integrations position it as preferred for organizations standardized on AWS infrastructure. High SP007, SP013
CP032 GitHub Copilot's enterprise distribution advantages create high switching-cost inertia across Fortune 100 organizations. Medium SP012, SP021
CP033 The AI coding tools market in 2026 is stratified into hyperscaler-backed co-pilots, IDE-native assistants, and fully autonomous agent tiers. Medium SP011, SP017, SP019
CP034 Frontier model labs such as Anthropic and OpenAI control the underlying reasoning capabilities and can deploy equivalent agentic pipelines without licensing dependencies. High SP008, SP014
CP035 Cognition's enterprise go-to-market execution and customer retention metrics are the true leading indicators of sustainable competitive position, more so than benchmark scores. Medium SP009, SP011
CP036 Cursor's growth from zero to $2B ARR in approximately 24 months represents the fastest known ARR scale-up in AI coding tools. Medium SP001, SP003
CP037 Both GitHub Copilot and Amazon Q Developer lack end-to-end autonomous task completion without human-in-the-loop confirmation at each stage. Medium SP007, SP012
CI001 Cognition AI's Core plan is priced at $20 per month for 5 ACUs; Team plan at $500 per month for 250 ACUs. High SI004, SI022
CI002 Enterprise ACU overages are priced at approximately $2.25 per ACU beyond the plan inclusion. Medium SI004, SI001
CI003 Cognition AI reached approximately $1M ARR in September 2024. Medium SI001, SI005
CI004 Cognition AI grew from $1M to $73M ARR between September 2024 and June 2025—a 73x increase in nine months. High SI001, SI003, SI005
CI005 Windsurf (formerly Codeium) was priced at $0 (Free) and approximately $15 per month (Pro) before the Cognition acquisition. Medium SI006, SI007
CI006 Cognition AI's gross margin is estimated at 50–70%, based on inference compute costs versus ACU pricing—no primary disclosure exists. Low SI011, SI005
CI007 The Windsurf acquisition added approximately $82M ARR and 350-plus enterprise customers to Cognition's combined revenue base. Medium SI006, SI010, SI012
CI008 Cognition AI's combined ARR reached approximately $155M post-Windsurf acquisition in July 2025. Medium SI009, SI010, SI013
CI009 ARR per employee at Cognition was approximately $1.5M prior to the Windsurf acquisition (49 employees, $73M ARR in June 2025). Low SI003, SI011
CI010 ARR per employee dropped to approximately $623K post-Windsurf, reflecting the quadrupling of headcount to ~249 employees. Low SI019, SI003, SI011
CI011 Net revenue retention rate for Cognition AI is not publicly disclosed; analyst estimates range from 100–130% based on ARR growth trajectory. Low SI005, SI011
CI012 Cognition AI's seed/Series A in March 2024 raised approximately $21M at a $350M valuation, led by Founders Fund. High SI002, SI023, SI020
CI013 Cognition AI's April 2024 funding round raised $175M at a $2B post-money valuation, led by Founders Fund. High SI002, SI014, SI020
CI014 Cognition AI's September 2025 Series B raised $400M at a $10.2B post-money valuation with Lux Capital, 8VC, Elad Gil, Bain Capital Ventures, and D1 Capital as co-investors. High SI008, SI009, SI010
CI015 Cognition AI disclosed a net cash burn of under $20M from founding through Q3 2025—extraordinary capital efficiency for a company at this ARR growth rate. Medium SI001, SI009
CI016 Reports as of April 2026 indicate Cognition AI is in discussions for new financing at a $25B+ valuation. Low SI011, SI012
CI017 Cognition AI does not disclose gross margin, customer churn, NRR, CAC, or GAAP financial statements. High SI005, SI007, SI016
CI018 All ARR figures for Cognition AI rely on company blog posts, third-party news citing unnamed sources, or secondary market research—no audited financial data is available. High SI003, SI005, SI016
CI019 The Windsurf acquisition was estimated at approximately $250M, funded from Cognition's existing capital reserves without a new primary fundraise. Low SI006, SI010
CI020 Founders Fund led both the March 2024 seed/Series A and the April 2024 Series A extension, providing deep capital commitment and reputational signal. High SI002, SI008, SI020
CI021 A $25B+ valuation on $155M ARR implies a forward revenue multiple of approximately 160x, elevated even by hypergrowth AI company standards. Medium SI011, SI012
CI022 Cognition AI's total capital raised through Series B is approximately $596M excluding the Windsurf acquisition; total capital deployed is approximately $846M when including the acquisition. Medium SI008, SI009, SI010
CI023 Post-Windsurf headcount reportedly reached approximately 249 employees before layoffs of 30 and additional voluntary departures from buyout offers. Medium SI019, SI023
CI024 SEC Form D/A for Founders Fund IX, LP (CIK 0001971631, filed October 2025) discloses total capital raised of approximately $972M across the LP and principals fund, with Peter Thiel as managing member. Medium SI026
CI025 Founders Fund IX Principals Fund (CIK 0002090410) is a co-issuer with Founders Fund IX, LP per the amended SEC Form D/A, indicating a typical GP co-investment vehicle structure. Medium SI026
CI026 Cognition AI opened its Singapore APAC headquarters in April 2026, hiring Richard Spence as VP and General Manager APAC, targeting Southeast Asia, Australia, India, and South Korea. High SI027, SI018
CI027 Mercedes-Benz announced deployment of Devin and Windsurf across its global engineering organization in April 2026, focusing on legacy modernization, cloud-native development, and logistics applications. Medium SI028, SI018
CI028 Cognition AI introduced a revised pricing structure in April 2026, replacing the Core and Team plans with Free, Pro, Max, Teams, and Enterprise tiers, signaling a land-and-expand monetization strategy. High SI018, SI004
CI029 Cognition AI's annual Team plan pricing of $6,000 per developer is approximately 15x more expensive than GitHub Copilot Enterprise ($390/yr) and 25x more than Cursor Pro ($240/yr), justified by task-completion vs. code-suggestion value. Medium SI004, SI011
CI030 Total capital raised ($596M through Series B) relative to ARR at time of last raise ($73M Devin ARR at Series B in Sep 2025) implies a capital-to-ARR ratio of approximately 8x—high but declining as ARR compounds. Medium SI008, SI009, SI003
CI031 The 9-month severance buyout offered to approximately 200 Windsurf employees represents an estimated one-time integration cost of $15–30M, calculated on average startup engineer salaries of $100–150K. Low SI019
CI032 Cognition AI's April 2026 APAC office opening and Mercedes-Benz partnership signal material increases to operating expenditure in H1 2026 through regional headcount and GTM investment. Low SI027, SI028
CI033 Enterprise AI SaaS tools with variable compute cost structures typically achieve gross margins of 55–75% at scale, per industry benchmarks; Cognition's agentic model adds inference cost variance absent from seat-based tools. Low SI011, SI015
CI034 Cognition AI has not filed Form D or equivalent exempt offering notice with the SEC as of May 2026; the company's capital raises are documented only through investor disclosures and third-party press reports. High SI016, SI023, SI026
CI035 If the Windsurf $82M ARR churns by 30% post-acquisition migration, combined Cognition ARR falls to approximately $128M—a scenario not publicly addressed by company management. Low SI006, SI003, SI011
CE001 Devin is a cloud-hosted autonomous AI software engineer that accepts natural-language task specifications and executes complete software engineering tasks—planning, coding, testing, and deploying—without human intervention during execution. High SE002, SE004
CE002 In the original SWE-bench evaluation (March 2024), Devin resolved 13.86% of issues unassisted—far exceeding the prior state-of-the-art of 1.96% for unassisted agents and 4.80% for assisted LLMs. High SE003, SE006, SE020
CE003 Devin's agent runtime executes tasks within a sandboxed cloud environment equipped with a code editor, Unix shell, and headless browser—isolated from other sessions and the customer's production environment. High SE002, SE001
CE004 Each Devin session is bounded by a maximum runtime of 45 minutes; longer tasks must be broken into sequential sub-sessions. Medium SE003, SE001
CE005 Devin integrates natively with GitHub, GitLab, and Bitbucket for source control, enabling automated PR creation, branch management, and code contributions. High SE007, SE001
CE006 Devin integrates natively with Jira and Linear for project management ticket assignment and sprint delegation. High SE007, SE001
CE007 Devin integrates natively with Slack and Microsoft Teams for session initiation via chat messages and progress reporting. High SE007, SE001
CE008 Agent Compute Units (ACUs) are the proprietary metered compute token consumed per Devin task, priced at approximately $2.25 per ACU with plan-included allocations varying by tier. High SE004, SE008
CE009 Devin Enterprise tier offers VPC deployment allowing the agent to operate within customer-controlled cloud infrastructure, and custom-trained Devin instances tuned to proprietary codebases. High SE001, SE004
CE010 Devin accepts task inputs from multiple channels: web dashboard, Slack messages, Jira tickets, and Linear issues—translating natural-language instructions into executable multi-step plans. High SE001, SE002
CE011 Nubank achieved 12x engineering efficiency improvement and 20x cost savings using Devin to migrate a 6-million-line ETL monolith; Data, Collections, and Risk business units completed migrations in weeks instead of months. Medium SE016, SE014
CE012 SWE-Check, released April 2026 in collaboration with Applied Compute, is a reinforcement-learning-trained bug detection model that matches Anthropic Opus 4.6 performance while running approximately 10x faster. Medium SE024, SE002
CE013 The DeepWiki server builds vectorized project graphs of entire codebases, enabling Devin to navigate and reason about multi-million-line repositories beyond standard LLM context window limits. Medium SE001, SE002
CE014 The Windsurf acquisition (July 2025) added 350+ enterprise customers, 250,000+ daily active users, and the Windsurf IDE product including its Cascade agentic workflow engine. High SE015, SE022
CE015 In late 2024, a live-streamed demonstration publicly exposed a major security vulnerability in Devin's system prompt handling, potentially enabling prompt-injection attacks. Cognition acknowledged and patched the issue. Medium SE017, SE009
CE016 Devin 2.0 (April 2025) achieved 83% more tasks completed per ACU compared to Devin v1.x, representing a significant efficiency improvement for existing subscribers. Medium SE008, SE019
CE017 Devin 2.0 introduced a confidence meter that quantifies the probability of task success before and during execution, allowing teams to filter low-probability tasks and prioritize high-value delegations. Medium SE008, SE001
CE018 Model Context Protocol (MCP) integration allows Devin to connect to hundreds of external tools including monitoring platforms, databases, and documentation systems through the MCP Marketplace. Medium SE007, SE001
CE019 The Devin REST API enables programmatic session creation, status retrieval, and CI/CD pipeline integration for DevOps automation workflows. Medium SE007, SE001
CE020 A 32-week empirical study of 7,156 real-world pull requests showed Devin's PR acceptance rate increased by +0.77% per week—the only agent in the study with a consistent positive trend. Medium SE011, SE010
CE021 BlockDiff snapshotting records incremental state checkpoints during Devin's execution, enabling rapid rollback to a known-good state when tests fail or an approach proves unworkable. Medium SE001, SE002
CE022 Devin supports parallel task execution across multiple concurrent sessions, allowing enterprise teams to delegate entire sprint backlogs simultaneously rather than one task at a time. Medium SE001, SE004
CE023 Claude Code (Opus 4) achieved 72.5% on SWE-bench Verified (500 instances) in 2025, far exceeding Devin's original 13.86% benchmark score—though the two benchmarks use different task sets. Medium SE010, SE012
CE024 mini-SWE-agent achieved 65% on SWE-bench Verified in July 2025, matching most commercial agents at a fraction of the computational cost—a signal that benchmark performance is rapidly commoditizing. Medium SE006, SE020
CE025 Devin for Terminal (2026) allows developers to start an agent session locally from the terminal and escalate to cloud execution when the task outgrows local compute—preserving session state across the transition. Medium SE024, SE001
CE026 The CognitionAI/devin-swebench-results GitHub repository has 124 stars and 20 forks as of research date, indicating modest developer interest in the benchmark methodology transparency. Medium SE005
CE027 Cognition AI's 2026 product roadmap includes Devin-in-Windsurf integration, SWE-1.5 model improvements, SWE-Check bug detection, APAC regional expansion (Singapore), and Mercedes-Benz enterprise deployment. Medium SE024, SE015
CE028 Cognition AI has not publicly disclosed SOC 2 Type II, ISO 27001, or equivalent compliance certifications as of May 2026, representing a gap for regulated-industry enterprise procurement. High SE001, SE004, SE021
CE029 All Devin sessions run in isolated ephemeral compute containers, and code changes are surfaced only as pull requests requiring human approval before merging—preventing unauthorized production deployments. High SE001, SE004
CE030 The MCP Marketplace integrates Devin with monitoring tools (Sentry, Datadog, PagerDuty), databases (PostgreSQL, MySQL, MongoDB), and documentation (Notion, Confluence), enabling context-rich agent execution. Medium SE007, SE001
CE031 Enterprise VPC deployment allows Devin to execute tasks within the customer's own AWS, GCP, or Azure environment, eliminating source-code transmission to Cognition's external servers. Medium SE001, SE004
CE032 Windsurf Codemaps, powered by SWE-1.5 and Claude Sonnet 4.5, provides AI-annotated structured maps of codebases with 'trace guide' expansions, enabling navigation to exact file and line references. Medium SE024, SE013
CE033 The Devin REST API uses standardized HTTP endpoints for session creation, status polling, result retrieval, and webhook-based notifications, enabling integration into CI/CD pipelines. Medium SE007, SE001
CE034 GitHub Copilot was ranked a Leader in Gartner's 2025 Magic Quadrant for AI Code Assistants, representing the current benchmark for enterprise adoption against which Devin competes. Medium SE010, SE012
CE035 A 2026 empirical study of AI coding PRs across 7,156 tasks found documentation tasks achieved 82.1% acceptance versus 66.1% for new feature development—a 16-percentage-point gap exceeding inter-agent variance. Medium SE011, SE018
CU001 Cognition AI targets engineering teams at technology-forward enterprises, midmarket companies, and developer-first organizations, with buyer profiles including CTOs and VPs of Engineering seeking developer throughput multiplication. High SU001, SU021
CU002 Confirmed enterprise customer verticals include fintech/financial services (Nubank), automotive/manufacturing (Mercedes-Benz), IT services/outsourcing (Cognizant), US Federal Government (Feb 2026), and COBOL-heavy Fortune 500 legacy sectors. High SU002, SU003, SU004, SU006
CU003 Mercedes-Benz was announced in April 2026 as a production customer deploying Devin and Windsurf across its global engineering organization for legacy modernization, cloud-native development, and logistics. Medium SU003
CU004 Cognizant partnered with Cognition in January 2026 to deploy Devin and Windsurf across its own engineering organization and its global client base—a channel partnership that could expose Cognition to Cognizant's 300+ clients. Medium SU004, SU014
CU005 Cognition launched a Government vertical (Cognition for Government) in February 2026 to target US federal agencies for critical infrastructure software modernization; no named agencies have been publicly disclosed. Medium SU006
CU006 Cognition AI's estimated ARR grew from approximately $1M at general availability (December 2024) to approximately $73M by April 2025—a roughly 70x increase in approximately four months, coinciding with the price cut from $500/month to $20/month. Low SU007, SU008
CU007 Devin's PR merge rate (fraction of Devin-opened PRs accepted by human reviewers) improved from 34% at launch (March 2024) to 67% by April 2025—a doubling of accepted output quality in approximately 12 months. Medium SU008, SU009
CU008 By February 2026, Cognition's own engineering team was merging 659 Devin-authored pull requests per week—a 4x increase from their best week of 154 PRs in 2025. High SU017, SU025
CU009 Independent reviewers note that real-world ACU consumption on the $20/month Core plan can easily result in monthly bills of $100-200 for moderate usage, contradicting the headline price reduction's accessibility promise. Medium SU008, SU009
CU010 Windsurf IDE had 250,000+ daily active users and 350+ enterprise customers at the time of acquisition (July 2025), providing Cognition with a large top-of-funnel developer install base for Devin conversion. High SU010, SU014
CU011 Nubank used a fine-tuned custom Devin instance for ETL migration; fine-tuning doubled task completion scores and reduced per-task time from 40 minutes to 10 minutes (4x speed improvement). Medium SU002
CU012 Cognition opened offices in London (January 2026), Tokyo (April 2026), and Singapore (April 2026) within a four-month period, signaling management's confidence in revenue sufficiency for global expansion. High SU005, SU006
CU013 Net Revenue Retention (NRR), Gross Revenue Retention (GRR), and customer churn rate have not been publicly disclosed by Cognition AI; retention durability cannot be independently assessed. Medium SU007, SU008
CU014 Hacker News developer community expressed skepticism about Devin's real-world performance versus demo quality; early reviews noted that complex or novel tasks frequently required significant rework, exceeding expectations set by the SWE-bench launch announcement. Medium SU011, SU023
CU015 The security vulnerability disclosed in December 2024—a live-streamed prompt-injection attack on Devin—temporarily damaged trust among security-conscious enterprise buyers, requiring Cognition to accelerate security remediation communication. Medium SU018, SU011
CU016 Nubank's use of Devin expanded from the initial Data unit pilot to Data, Collections, and Risk business units, confirming land-and-expand progression within a single named enterprise customer. Medium SU002
CU017 Cognition AI's land-and-expand strategy relies on starting with a discrete project (migration, modernization) within one business unit, demonstrating ROI, then expanding to adjacent teams—as demonstrated by the Nubank case. Medium SU002, SU004
CU018 Windsurf's 250,000+ daily active users serve as a top-of-funnel for Devin cloud agent adoption: Windsurf Free users experiencing AI-assisted local development are natural conversion targets for cloud-based autonomous task delegation. Medium SU010, SU014
CU019 Devin's COBOL modernization use case is inherently project-based—once a legacy migration is complete, there is no structural recurring need unless customers contract for ongoing support or continued modernization work. Medium SU004, SU016
CU020 Top-customer concentration is a material but undisclosed diligence gap: with ~$73M ARR and a small set of confirmed named accounts, a single Nubank-scale contract could represent more than 10% of ARR. Medium SU007, SU008
CU021 Cognizant's reseller partnership creates a single-channel concentration risk: if Cognizant accounts for a material share of Cognition's enterprise pipeline, the loss of that relationship would disproportionately impact new account growth. Medium SU004, SU022
CU022 The self-serve developer tier (Free and Pro plans) provides a low-ACV but high-volume customer base that can generate bottom-up enterprise procurement pressure, analogous to Slack's or Atlassian's PLG motion. Medium SU008, SU009, SU015
CU023 Growjo estimates Cognition AI's total funding at $1.4B and current valuation at $10.2B as of September 2025, consistent with reports of a $400M fundraise following the Windsurf acquisition. Low SU007, SU010
CU024 Cognition AI had approximately 222 employees as of 2025 and grew headcount by approximately 102% year-over-year—a pace consistent with rapid revenue scaling but also requiring significant talent investment. Low SU007, SU020
CU025 The absence of SOC 2 Type II and FedRAMP compliance attestations in public documentation limits Cognition's addressable market among regulated enterprises (financial services, healthcare, US government) that require these certifications for procurement. High SU016, SU021
CU026 Devin's PR merge rate of 67% implies a 33% rejection or rework rate on AI-generated pull requests, which may be acceptable for high-volume routine tasks but represents a significant quality gap versus a senior human engineer. Medium SU008, SU019
CU027 Fortune 500 COBOL modernization deployments are confirmed via Cognition's April 2026 blog post, but no named customers, outcome metrics, or contract structures are disclosed—limiting diligence quality. Medium SU004
CU028 Geographic expansion from US-only to Europe and APAC in Q1-Q2 2026 within four months demonstrates aggressive international GTM investment, creating both market opportunity and potential cash burn risk from multi-office overhead. High SU005, SU006
CU029 The imseankim.com review noted ARR growth from $1M to $73M in nine months while also identifying that 'independent testers tell a more complicated story' compared to official benchmarks—suggesting a gap between marketing and user-level satisfaction at the margin. Medium SU008
CU030 Devin supports concurrent multi-session task execution, allowing enterprise teams to delegate entire sprint backlogs simultaneously, which is cited as a key adoption driver for teams managing high-volume routine work. Medium SU021, SU024
CU031 The Devin PR acceptance rate empirical study (Arxiv 2026) showed a +0.77%/week positive trend—the only agent with sustained improvement—suggesting iterative model quality gains that support long-term customer retention. Medium SU019
CU032 Cognition AI's Government vertical launch (February 2026) positions Devin as a tool for US federal legacy software modernization, a multi-billion dollar addressable budget segment, but no government contract or agency name has been publicly disclosed. Medium SU006
CU033 Nubank's ETL migration case study is the only published, quantified production customer proof; Mercedes-Benz, Cognizant, and Fortune 500 COBOL deployments lack independent third-party verification or outcome data. High SU002, SU003, SU004
CU034 Devin Autofix Review Comments (January 2026) closes a key customer workflow loop—Devin automatically addresses pull request review feedback—substantially improving the PR review cycle productivity for enterprise teams. Medium SU029, SU017
CU035 The Cognizant channel partnership bypasses individual enterprise sales cycles by embedding Devin into Cognizant's managed services offering—a go-to-market shortcut that scales account coverage rapidly but creates single-reseller revenue concentration risk. Medium SU004, SU021
CR001 The EU AI Act (Regulation 2024/1689) imposes GPAI transparency and documentation obligations on providers of general-purpose AI models effective August 2026; Cognition has not publicly disclosed its GPAI registration status, model transparency documentation, or training data copyright compliance policy. High SR009, SR007
CR002 Under GDPR and UK GDPR, organizations must conduct Data Protection Impact Assessments for high-risk AI processing; Devin's autonomous code writing, PR merging, and production deployment capabilities likely qualify as high-risk processing under GDPR Article 35, requiring DPIAs from EU/UK enterprise customers. High SR010, SR009
CR003 California SB 1047, which would have imposed safety requirements on frontier AI developers, was vetoed by Governor Newsom in September 2024; however, successor legislation including AB 2013 remained active in the 2025–2026 California legislative session, creating ongoing monitoring obligation. Medium SR006, SR008
CR004 The Doe v. GitHub, Microsoft, and OpenAI class action alleges that training AI coding models on public GitHub repositories without license compliance constitutes copyright infringement; Cognition has not disclosed its model training data provenance, creating analogous latent IP exposure under the same copyright theory. Medium SR011, SR007
CR005 The January 2025 White House Executive Order revoked prior Biden-era AI safety mandates, creating a deregulatory US federal environment for AI developers; however, agency-level rules for government AI procurement—relevant to Cognition's announced Government vertical—remain under development. High SR008, SR006
CR006 In December 2024, Cognition publicly disclosed a prompt-injection vulnerability in Devin discovered during a live-streamed demonstration; the vulnerability could enable an adversary to embed malicious instructions in repository content, causing Devin to exfiltrate credentials or insert backdoors in customer codebases. Medium SR001, SR013
CR007 The OWASP Top 10 for LLM Applications lists Prompt Injection (LLM01) as the highest-severity risk for LLM-based applications; for autonomous agents with code execution, PR merge, and terminal access permissions, a successful prompt-injection exploit has a blast radius materially larger than for passive coding assistants. High SR003, SR004
CR008 Cognition obtained SOC 2 Type II certification in March 2024, prior to the December 2024 prompt-injection incident; the SOC 2 report is not publicly accessible without executing an NDA through the Cognition Trust Center, limiting independent assessment of control effectiveness by enterprise procurement teams. High SR001, SR002
CR009 Devin 2.0 (April 2025) introduced the ability to directly merge pull requests and schedule autonomous agent runs on customer infrastructure, materially expanding the attack surface and potential blast radius of any future prompt-injection or supply-chain compromise beyond what existed at the December 2024 incident. Medium SR016, SR003
CR010 Cognition's security documentation acknowledges that Devin code output may contain hallucinations, bugs, or insecure code, and recommends code reviews and branch protections as mitigations; this acknowledged limitation creates ongoing liability exposure for enterprise customers deploying Devin in production code pipelines without mandatory human review gates. High SR001, SR004
CR011 Cognition is structurally dependent on Anthropic and OpenAI as foundation model API providers; these same companies compete with Devin through Claude Code and OpenAI Codex, creating a supply chain where the primary suppliers have economic incentives to disadvantage a dependent customer. High SR023, SR015
CR012 AWS provides the cloud infrastructure for Devin's VPC deployment option; Devin's enterprise architecture is AWS-native with no disclosed multi-cloud or on-premises fallback, creating material infrastructure concentration risk in a single cloud vendor. Medium SR001, SR016
CR013 GitHub (Microsoft) provides the primary code repository integration for Devin's PR workflow; Microsoft also distributes GitHub Copilot as a direct Devin competitor, meaning Cognition depends on a competitor-owned platform for core product functionality. High SR015, SR007
CR014 Cognition's acquisition of Windsurf (July 2025) added 350+ enterprise accounts and 250K+ daily active users but introduced integration execution risk: two AI agent codebases, model pipelines, enterprise sales motions, and billing systems must be combined simultaneously while maintaining ARR growth momentum. Medium SR018, SR030
CR015 Cognition's three co-founders have exceptional technical credentials as competitive programmers but no prior track record building enterprise software companies beyond early-stage startups; execution risk is elevated as the company scales from startup to enterprise sales motion across multiple geographies simultaneously. Medium SR025, SR017
CR016 AI engineering talent competition is severe; Anthropic, OpenAI, Google DeepMind, and Microsoft actively recruit competitive programming and ML talent at packages estimated above $500K total compensation, directly competing with Cognition's talent pool and creating ongoing retention risk. Medium SR023, SR015
CR017 Cognition's international expansion to Japan, Singapore, EU, and US Government sectors requires jurisdiction-specific compliance infrastructure—EU DPO appointment, Japan PIPL compliance, FedRAMP authorization—with no publicly disclosed plans or timelines, creating a gap between announced customer segments and compliance readiness. Medium SR009, SR010
CR018 At Devin's March 2024 launch its SWE-bench Full score of 13.86% was the then-best result for autonomous coding agents; by mid-2025 Claude Code Opus 4 achieved 72.5% on SWE-bench Verified—a competitive improvement rate that threatens to eliminate Devin's technical differentiation within 12–18 months. High SR015, SR023
CR019 The FTC stated that concentrated control of foundational AI inputs—including foundation models and cloud compute—could allow providers to distort competition in downstream AI application markets; this is a direct structural risk for Cognition given its dependence on Anthropic/OpenAI (models) and AWS (compute), both of which are also competitive threats. High SR007, SR011
CR020 ARR growth from ~$1M at GA (December 2024) to ~$73M estimate (April 2025) across a small number of named enterprise customers implies high per-customer ARR concentration; individual churn events at top customers could cause material ARR declines capable of impairing the $10.2B valuation estimate. Medium SR019, SR020
CR021 Devin's ACU pricing model ($2.25 per ACU overage) creates unpredictable usage-based cost exposure for enterprise customers; 'bill shock' from unexpected ACU consumption is a cited churn risk factor in independent community analysis of early Devin adopters. Medium SR020, SR028
CR022 Cognition has raised approximately $1.575B in disclosed funding through mid-2026; with 222 employees, 102% annual headcount growth, and compute-intensive AI infrastructure, estimated monthly burn is $5–15M, implying 18–36 months of runway under conservative scenarios assuming $73M ARR and aggressive growth continuation. Medium SR024, SR021
CR023 LLM inference costs for Devin sessions—paid at commercial API rates to Anthropic and OpenAI—compress gross margins structurally; no path to proprietary model development or renegotiated volume pricing is publicly disclosed, leaving margin compression risk unaddressed. Medium SR023, SR016
CR024 GitHub Copilot's distribution through Microsoft enterprise sales channels—250M+ installs, Microsoft 365 bundling potential, Azure integration—represents a structural distribution advantage that benchmark performance alone cannot overcome for Cognition. High SR015, SR027
CR025 Cursor's approximately $500M ARR at $9B valuation with $40/month pricing represents a 12.5× price advantage over Devin's Team plan at $500/month; this price sensitivity gap creates pricing risk as competitors improve and enterprise buyer ROI scrutiny increases. Medium SR027, SR019
CR026 Nubank is the only named customer with published and quantified production deployment outcomes; $73M ARR from an undisclosed number of additional enterprise accounts implies high revenue concentration risk and limits independent verification of Cognition's enterprise product-market fit beyond one case study. Medium SR026, SR020
CR027 Developer community analysis of the December 2024 prompt-injection incident described Cognition's prior credibility as damaged, noting: Devin's claimed product superiority was inconsistent with observed PR quality; the security gap was described as 'amateurish' for a product in development for over a year. Medium SR013, SR014
CR028 The OWASP GenAI Security Project identifies agent autonomy as an emerging, distinct risk category beyond classical LLM vulnerabilities; agents capable of modifying code, merging PRs, and executing shell commands face compound risks where a single exploit can traverse multiple security boundaries simultaneously. High SR004, SR003
CR029 CVE-2024-5185 (EmbedAI CSRF data-poisoning) demonstrates that AI application platforms face real-world vulnerability classes—CSRF, data poisoning—beyond prompt injection; the NVD tracks similar classes across AI applications, indicating a broad and evolving vulnerability landscape for agentic coding platforms like Devin. Medium SR005, SR003
CR030 EU AI Act GPAI provisions require frontier model providers to publish summary information about training data, establish copyright compliance policies, and publish technical documentation; Cognition has not publicly disclosed model training data provenance, open-source license compliance scope, or copyright policy. Medium SR009, SR001
CR031 The ICO's AI and data protection risk toolkit requires organizations to conduct DPIAs for high-risk AI processing; Devin's autonomous PR merges and production deployments on customer infrastructure in EU member states likely qualify under GDPR Article 35, creating DPIA obligations for UK/EU customers. Medium SR010, SR009
CR032 Doe v. GitHub and OpenAI consolidated copyright litigation (In Re: OpenAI Copyright Infringement, S.D.N.Y. 2025) involves claims that AI model training on copyrighted code and text violates copyright law; an adverse ruling could require Cognition to disclose or remediate its training data, imposing significant operational and legal costs. Medium SR011, SR004
CR033 Cognition's January 2025 Series A was led by Founders Fund at a $2B post-money valuation; a security incident, major regulatory enforcement action, or sustained competitor improvement could trigger down-round financing pressure at the next capital raise given the high valuation multiple relative to current ARR. Medium SR017, SR024
CR034 Developer community skepticism at Devin's launch—documented on HN and via independent benchmarking—highlighted concerns about benchmark reliability, limited demo scope, and performance gaps for non-standard tasks; this adverse reputation dynamic increases adoption friction among technically sophisticated enterprise evaluators. Medium SR014, SR013
CR035 Cognition's headcount grew 102% year-over-year to approximately 222 employees (Growjo, April 2025); this rapid hiring pace, without disclosed revenue-per-employee targets or profitability milestones, increases burn rate risk if ARR growth decelerates below the ~70×/year pace observed in the early 2025 growth sprint. Medium SR021, SR020
CR036 arXiv research on AI-assisted PR acceptance rates over 32 weeks shows a measurable +0.77%/week improvement trend, validating that autonomous coding agent productivity is improving; however, the long-run acceptance ceiling and transition from supervised to fully autonomous PR merges at enterprise scale remains empirically unproven. Medium SR022, SR015
CR037 Under GDPR Article 35, automated decision-making with significant effects on individuals requires a DPIA; Devin's autonomous code deployments affecting production systems could qualify in EU deployments, creating DPIA obligations that enterprise customers must fulfill before deploying Devin at scale. Medium SR010, SR002
CR038 Cognition's pricing dropped by approximately 3× with Devin 2.0 (April 2025), from higher effective ACU pricing to lower tiers; while improving adoption, this pricing compression requires proportional volume growth to sustain ARR trajectory and reduces the pricing power available to offset future LLM API cost increases. Medium SR028, SR019
CR039 The White House AI EO of January 2025 creates a deregulatory federal environment that reduces US-federal compliance burden for Cognition in the near term; but agency rules for government AI procurement—relevant to Cognition's announced Government customer vertical—may introduce FedRAMP and NIST AI RMF requirements on a 12–24 month timeline. Medium SR008, SR006
CR040 Independent developer analysis consistently identifies Devin's ACU cost model opacity and unpredictable session-level billing as adoption barriers; enterprise customers in community analysis report difficulty estimating total cost of ownership, creating adoption friction beyond the pilot phase. Medium SR013, SR020
CR041 SWE-bench Verified (500 human-filtered instances) is considered more reliable than SWE-bench Full (2294 instances); Claude Code Opus 4's 72.5% Verified score versus Devin's 13.86% Full score reflects both a benchmark type difference and a fundamental performance gap that, unless Cognition closes it, will increasingly drive enterprise procurement decisions toward Anthropic's product. High SR015, SR027
CR042 Cognition's terms of service prohibit using Devin outputs to train competing models, but do not publicly disclose the provenance of Devin's own training data; this asymmetry creates potential legal exposure under the same copyright theories being litigated against GitHub and OpenAI in the Doe v. GitHub class action. Medium SR001, SR011
CV001 Cognition AI's $10.2B September 2025 post-money valuation divided by an estimated $73M trailing ARR (April 2025) implies approximately a 140× ARR multiple—a 7× premium to the closest comparable Cursor/Anysphere, which closed a $9.9B Series D at approximately $500M ARR, implying ~20× ARR. Medium SV011, SV012, SV010
CV002 The recommendation for Cognition AI at $10.2B valuation is watchlist / conditional invest: the bull thesis (70×/year ARR growth, Windsurf integration moat, enterprise lighthouse customers) is structurally compelling but insufficient to justify a risk-adjusted positive return at 140× ARR without NRR, gross margin, and top-customer concentration data. Medium SV012, SV013, SV017
CV003 Entry discipline recommendation: investors should target a Cognition AI entry valuation below $5B (implying ≤70× current estimated ARR), at which the bull case returns are 3.5–5× in 3 years and the bear case returns imply losses limited to 50–80% of investment, reflecting a more acceptable risk-reward asymmetry. Low SV012, SV013
CV004 Cognition AI raised $175M at a $2B valuation in its Series A round led by Founders Fund IX (CIK 0001971631, EDGAR); this represents a 5.1× step-up to the September 2025 $10.2B round in approximately nine months—one of the fastest large-round step-up trajectories in AI software startup history. High SV002, SV011
CV005 Cognition Capital SPV I (CIK 0002072175, Form D filed June 2025) was structured contemporaneously with the Windsurf acquisition; the SPV structure suggests one or more institutional investors required a structured co-investment vehicle, which is a common signal of concentration risk in a single institution funding the Windsurf consideration. Medium SV001, SV003
CV006 A 'culture reset' at Cognition post-Windsurf acquisition—reported as Cognition offering nine-month buyouts to Windsurf staff and requiring 80-hour workweeks—introduces talent attrition risk that is not reflected in the $10.2B headline valuation and could impair the 250K+ DAU Windsurf user base that underpins the IDE moat thesis. Medium SV021, SV014
CV007 Founders Fund IX's portfolio page confirms Cognition AI as a portfolio company; Founders Fund IX filed an amended Form D (D/A) in October 2025, indicating continued capital activity in the fund consistent with ongoing Cognition investment or follow-on management activity. High SV003, SV007
CV008 Under the bull scenario, Cognition AI reaches $700–800M ARR by end-2027 (implying ~45% quarterly growth sustained for 10 quarters from the April 2025 base), at which a 25–35× forward ARR exit multiple would imply an exit valuation of $18–28B, representing a 1.5–2.5× return from the $10.2B September 2025 round. Low SV012, SV013
CV009 Under the base scenario, ARR growth decelerates from 9-month 70×/year pace to 30–40% quarterly growth, reaching $200–350M ARR by end-2027; at 20–25× forward ARR multiple, the implied exit valuation is $4–9B—flat to slight loss relative to the $10.2B entry, implying negative risk-adjusted returns for investors entering at $10.2B. Medium SV012, SV013
CV010 Under the bear scenario, a material adverse event (second security incident, major customer churn, or competitor price undercut >3×) causes ARR to plateau at $80–120M with NRR below 90%; at 8–12× trailing ARR, the implied exit valuation is $640M–$1.4B, representing a 90%+ capital loss for investors at $10.2B entry. Medium SV017, SV019
CV011 Cursor/Anysphere's Series D at $9.9B on approximately $500M ARR implies a 20× ARR multiple; Cognition's $10.2B valuation at approximately $73M ARR implies a 140× multiple, representing a 7× premium to Cursor on a comparable product category—enterprise AI coding tools—that is only sustainable if Cognition grows 7× faster to ARR parity with Cursor within 12–18 months. Medium SV010, SV012
CV012 Harvey AI (legal AI) at approximately $3B valuation and ~$100M ARR implies a 30× ARR multiple; high NRR (reportedly above 150% for legal enterprise accounts) justifies this premium. Cognition at 140× ARR requires NRR disclosure at a comparable level to avoid multiple compression, but NRR is currently undisclosed. Low SV018, SV013
CV013 Glean (enterprise AI search) at $4.6B valuation and approximately $100M ARR implies a 46× ARR multiple; its NRR above 150% and lack of benchmark commoditization risk distinguish it from Cognition. Cognition's 140× ARR multiple exceeds even high-NRR AI SaaS peers, suggesting the market is pricing in either a unique category premium or future ARR disclosure that narrows the apparent gap. Low SV018, SV004
CV014 Replit (developer platform, $1.16B valuation, 2023) represents the category risk for Cognition: a developer productivity pioneer that lost relative market share to more focused AI-native competitors (Cursor, Devin) during the 2024–2025 agentic coding transition. The Replit trajectory is a base-case downside reference for Cognition's own category risk. Medium SV018, SV017
CV015 GitHub Copilot, embedded in Microsoft's GitHub Enterprise product at $19–$39 per seat per month, represents a structural pricing threat: Microsoft can subsidize Copilot through Azure and M365 bundling, enabling sustained price pressure at a price point approximately 15–25× below Devin's $500/month Team plan while offering comparable pass-level coding assistance. Medium SV009, SV017
CV016 OpenAI Codex positions directly as an autonomous coding agent—similar to Devin—in Q2 2025, representing a new competitive threat from a foundation model provider that both supplies Devin's underlying intelligence and competes for the same enterprise developer productivity budget. High SV009, SV020
CV017 Amazon Q Developer (AWS-bundled AI coding assistant) competes with Devin for enterprise developer productivity budget within AWS cloud deployments; its bundling within AWS Enterprise Support creates lock-in dynamics that disadvantage Devin for customers already committed to the AWS ecosystem. Medium SV009, SV018
CV018 A second security incident within 18 months of the December 2024 prompt-injection disclosure is the primary thesis-break trigger; the probability is moderate given Devin 2.0's expanded attack surface (autonomous PR merges, scheduled agents) and the absence of a published post-Devin-2.0 penetration test or bug-bounty program. Medium SV019, SV020
CV019 Customer concentration risk—undisclosed top-customer ARR share—is the primary financial thesis-break variable. If Nubank accounts for greater than 30% of the estimated $73M ARR, a Nubank renewal risk is a material single-event that could reduce ARR by $20M+ without any other change in business trajectory. Medium SV027, SV012
CV020 Benchmark commoditization risk is the primary competitive thesis-break: Claude Code Opus 4's 72.5% SWE-bench Verified score (2025) versus Devin's 13.86% Full score (2024 launch) demonstrates a 5× improvement rate in 15 months that, if sustained, eliminates Devin's technical differentiation by late 2026. High SV017, SV020
CV021 NRR below 90% would be a thesis-break for any investment at $10.2B; NRR below 90% implies net negative ARR cohort contribution, meaning the existing customer base is contracting, and sustaining headline ARR growth requires ever-larger new logo acquisition to compensate—a structurally unsustainable growth model at enterprise SaaS scale. High SV013, SV012
CV022 EU regulatory enforcement action under the EU AI Act GPAI provisions (effective August 2026) could restrict Cognition's EU market access; Cognition expanded to Europe (London office, January 2026) without disclosed GPAI compliance infrastructure, creating a potential Q3 2026 enforcement risk. Medium SV022, SV023
CV023 Net Revenue Retention (NRR) by cohort is the single most critical undisclosed financial metric for valuation purposes; it determines whether the $73M ARR estimate reflects durable enterprise contracts or early-adopter pilots with high churn, a distinction worth $2–5B in implied valuation. High SV012, SV013
CV024 Gross margin is the second critical undisclosed metric; Devin's per-session LLM API costs (Anthropic/OpenAI commercial rates) are substantial for compute-intensive multi-hour sessions, and gross margin below 30% would signal an unsustainable cost structure that requires either proprietary model development or volume pricing renegotiation before profitability. Medium SV020, SV028
CV025 Post-Devin-2.0 penetration testing and red-team disclosure is required for enterprise procurement security sign-offs; the December 2024 incident occurred before Devin 2.0's expanded agentic capabilities (PR merges, scheduling), meaning the existing SOC 2 Type II audit (March 2024) does not cover the current expanded attack surface. High SV019, SV028
CV026 Windsurf customer retention at 6 months post-acquisition (January 2026) is a critical proof point for the IDE moat thesis; if the 350 enterprise accounts acquired with Windsurf show greater than 15% churn, the Windsurf contribution to ARR is overstated and the platform differentiation thesis weakens. Medium SV021, SV014
CV027 LLM API contract terms with Anthropic and OpenAI—specifically volume pricing floors, agentic use-case restrictions, and change-of-control provisions—are undisclosed but material; a scenario where Anthropic raises API pricing 3× as Claude Code gains enterprise share would compress Devin's gross margin materially and impair the $10.2B valuation. Medium SV020, SV009
CV028 FedRAMP authorization status for the US Government vertical (launched February 2026) is a credibility question for government procurement; without FedRAMP In Process or Authorized status, government agency sales are limited to informal pilots, capping US Government revenue contribution at a level well below the implied market opportunity. Medium SV022, SV006
CV029 The Founders Fund IX Form D/A amendment (October 2025, EDGAR) confirms continued fund activity consistent with Cognition follow-on capital management or new investment in the post-$400M round period; combined with the Cognition Capital SPV I (June 2025), the structured capital base is complex and the preference stack is undisclosed. High SV003, SV001
CV030 The $73M ARR estimate at April 2025, growing from $1M at GA (December 2024) in nine months, represents a 73× top-line growth rate—exceptional even by AI-native startup standards—but is based on third-party estimates (Growjo, Sacra) that have not been confirmed by Cognition in any official disclosure. Medium SV012, SV013
CV031 Devin's pricing dropped approximately 3× with Devin 2.0 (April 2025), with the Team plan remaining at $500/month but providing 83% more tasks per ACU; this effective price reduction improves adoption but compresses the revenue-per-session metric, requiring proportionally greater volume to maintain ARR trajectory. High SV028, SV012
CV032 Total disclosed funding across all rounds is approximately $1.575B; at a $10.2B post-money valuation, early rounds are already at 60–140× capital returned on paper, creating exit pressure for early investors that may accelerate secondary activity and reduce inside-round price discovery for new institutional investors. Medium SV011, SV023
CV033 Mercedes-Benz announced a partnership with Cognition AI in April 2026, adding a second named enterprise customer alongside Nubank; while terms and ARR contribution are undisclosed, the automotive/manufacturing vertical validates enterprise cross-sector demand and reduces the market risk of over-dependence on financial services verticals. High SV008, SV022
CV034 a16z's 100 Gen AI Apps report identifies coding tools and developer productivity as the highest-revenue AI application category, with enterprise developer tools generating $5B+ in collective ARR across the cohort as of their most recent analysis; this market context supports the bull-case premise that Cognition is entering the most valuable AI application segment. Medium SV004, SV018
CV035 Replit's valuation of $1.16B in 2023 as a developer platform that failed to capture the AI coding wave represents the primary bear-case precedent for Cognition; developer tool platforms with strong user bases but insufficient enterprise NRR can see rapid valuation compression when more focused AI-native competitors commoditize their core value proposition. Medium SV018, SV017
CV036 Cognition AI's geographic expansion to Europe (London, January 2026), Japan, and Singapore (April 2026) adds TAM and demonstrates customer pull outside the US but introduces compliance infrastructure gaps—EU GPAI registration, UK GDPR DPO appointment, Japan PIPL—that create near-term regulatory and reputational risk before those markets generate material ARR. Medium SV022, SV025
CV037 The Cognition Capital SPV I Form D (CIK 0002072175, filed 2025-06-11, Claymont, DE) is consistent with a structured co-investment vehicle used to fund the Windsurf acquisition consideration or provide a liquidity backstop to Windsurf investors, adding a layer of financial structure complexity to the Cognition cap table beyond a standard venture equity round. Medium SV001, SV003
CV038 The Dealroom.co listing for Cognition AI reflects the company's early financing stages at a $8M valuation figure—consistent with seed-round entry—while TechCrunch and CNBC confirm the September 2025 round at $10.2B; this 1,275× valuation step-up across approximately 24 months is unprecedented in enterprise SaaS history and indicates market expectation of near-monopoly market capture. Medium SV005, SV029
CV039 For an IPO at $8B+ market capitalization on conventional public market multiples (15–25× forward ARR), Cognition would need to demonstrate approximately $320–535M ARR in the 12 months preceding the IPO; at base-case growth rates, this milestone is not reached before 2028–2029, suggesting the IPO window is approximately 2–4 years from May 2026. Medium SV011, SV024
CV040 Strategic acquirer scenarios exist: Microsoft (GitHub distribution), Alphabet (Google Cloud developer tools), Salesforce (Einstein developer productivity), and Databricks/Snowflake (data engineering automation) are all logical fits; however, the $10.2B entry price for Cognition makes a strategic acquisition at premium unlikely unless ARR scales to $400M+ first, as acquirers would need to pay a 20–30% acquisition premium above the last private valuation. Medium SV011, SV023
CV041 Cursor/Anysphere's November 2025 Series D at $29.3B (per Business Wire) represents an even more recent and higher precedent valuation for AI coding tools; if confirmed, this would shift the peer set valuation anchor upward and could partially justify Cognition's 140× ARR multiple relative to Cursor's even higher implied multiple. Low SV010, SV012
CV042 Cognition AI's Mercedes-Benz partnership announcement (April 2026) and Cognizant channel partnership (January 2026) diversify the named customer base beyond Nubank, reducing—but not eliminating—customer concentration risk; the absence of disclosed outcome metrics for these relationships limits the financial materiality assessment. Medium SV008, SV022
Sources
IDPublisherTitleQuote
SO001 Cognition AI Cognition AI — Official Homepage
SO002 Cognition AI Introducing Devin, the first AI software engineer Devin is a tireless, skilled teammate, equally ready to build alongside you or independently complete tasks for you to review.
SO003 Cognition AI Funding, growth, and the next frontier of AI coding agents Enterprise ARR grew over 30% in just seven weeks since we acquired Windsurf.
SO004 Cognition AI Cognition's acquisition of Windsurf
SO005 Wikipedia Cognition AI — Wikipedia
SO006 TechCrunch Cognition AI defies turbulence with a $400M raise at $10.2B valuation The round values Cognition at $10.2 billion post-money.
SO007 CNBC Cognition valued at $10.2 billion two months after Windsurf purchase
SO008 Sacra Cognition revenue, valuation & funding Cognition ARR: $1M (Sep 2024) → $73M (Jun 2025) → $155M (Jul 2025 post-Windsurf)
SO009 VentureBeat Cognition follows Windsurf acquisition with $400M fundraise, showing strong enterprise momentum
SO010 FavTutor Meet The Team Behind Devin AI, Its Founders & Investors
SO011 Analytics India Magazine Meet the Creator of Devin: A Child Prodigy Who is Making Coding Obsolete
SO012 Observer Cognition, Maker of Goldman's First 'A.I. Employee'
SO013 Economic Times Work-life imbalance: After buying remnants of Windsurf, Cognition lays off 30, tells rest to work long hours Cognition laid off 30 employees and offered buyouts to approximately 200 remaining Windsurf staff with a requirement to commit to 80-hour six-day workweeks.
SO014 The AI Insider Cognition AI Closes $400M in Funding to Reach $10.2B Valuation Amid Rapid Growth
SO015 AI Invest (ainvest.com) Cognition AI's Windsurf Acquisition: A Masterstroke in the AI Talent & IP War
SO016 WinBuzzer Cognition AI's Culture Reset: Offers Nine-Month Buyouts to Windsurf Staff, Demands 80-Hour Weeks
SO017 SFGate SF tech CEO offers buyouts to let workers flee 'extreme' work culture
SO018 Eesel AI A deep dive into Cognition AI reviews: Hype vs. Reality Independent testing shows real-world task success rates well below the 'fully autonomous' marketing framing.
SO019 Sean Kim (imseankim.com) Devin AI Review: From $500 to $20 — 6 Weeks With Cognition's AI Software Engineer
SO020 Tech Funding News Cognition AI scores $400M at $10.2B valuation as demand spikes for coding agents
SO021 Stepmark AI Company Spotlight: Cognition AI – Devin, your Autonomous Software Engineer
SO022 Tech Funding News Cognition raises $500M at nearly $10B valuation following Windsurf acquisition
SO023 AI Invest (ainvest.com) AI-Driven Software Development and Startup Valuation: The Cognition AI Case Study
SO024 ARR Club Cognition ARR, Revenue Growth & Milestones
SO025 Devin.ai Devin — AI Software Engineer
SO026 Devin.ai Devin Pricing
SM001 Mordor Intelligence AI Code Tools Market Size, Share & 2030 Trends Report
SM002 Grand View Research Generative AI Coding Assistants Market Size Report, 2030
SM003 BusinessWire / ResearchAndMarkets Generative AI Coding Assistants Strategic Research Report 2025 — Market to Reach $97.9 Billion by 2030
SM004 Gartner Gartner Identifies the Top Strategic Trends in Software Engineering for 2025 and Beyond
SM005 Futuretechmag Gartner Highlights Strategic Software Engineering Trends Beyond 2025
SM006 Evans Data Corporation Worldwide Developer Population Grows to 27 Million
SM007 Statista Global developer population 2024
SM008 WIPO (World Intellectual Property Organization) Global Software Spending Surges to Close to USD 700 Billion in 2024
SM009 Signisys AI Code Assistant: 90% by 2028 — Gartner forecast summary
SM010 GitHub (official) Gartner positions GitHub as a Leader in the 2025 Magic Quadrant for AI Code Assistants GitHub Copilot surpassed 20 million users and is used by over 50,000 organizations.
SM011 Kingy.ai AI Coding Agents in 2025: The Ultimate Battle for Developer Supremacy
SM012 OpenAI (official) The State of Enterprise AI — 2025 Report
SM013 Deloitte AI Trends 2025: Adoption Barriers and Updated Predictions
SM014 Forbes The Biggest Barriers Blocking Agentic AI Adoption
SM015 California Management Review (UC Berkeley) Adoption of AI and Agentic Systems: Value, Challenges, and Pathways
SM016 VC Cafe Which Barriers Still Block Agentic AI Adoption?
SM017 Second Talent GitHub Copilot Statistics and Adoption Trends 2025
SM018 AI Business VC GitHub Copilot Crosses $2B ARR — 46% of Code Is Now AI-Generated
SM019 Endroid GitHub Copilot Surpasses 20 Million Users, Eyes Enterprise AI Coding Dominance
SM020 JetBrains Research Global Developer Population 2024
SM021 Grand View Research Software Market Size, Share and Trends — Industry Report, 2030
SM022 ResearchAndMarkets AI Coding Assistant Tools Global Market Insights 2025
SM023 GeniusAI Tech AI Statistics 2025: Market Growth, Usage, Trends and Adoption
SM024 ISG (Information Services Group) State of Enterprise AI Adoption Report 2025
SM025 MarketsAndMarkets AI Assistant Market Report 2025–2030, by Application, Geo, Tech
SP001 BusinessWire Cursor Secures $2.3 Billion Series D Financing at $29.3 Billion Valuation
SP002 devclass.com GitHub Copilot tops $2B ARR, confirms enterprise dev tools push
SP003 CNBC Cursor AI startup funding round valuation November 2025
SP004 The Next Web Cursor Anysphere $2B ARR funding $50B valuation talks
SP005 BetterStack GitHub Copilot vs Cursor vs Windsurf: AI Coding Assistant Comparison
SP006 Digital Applied Cursor AI $29B Valuation Agent Revolution
SP007 Amazon Web Services Amazon Q Developer — AI Coding Assistant for AWS
SP008 Anthropic Claude Code — Agentic coding in your terminal
SP009 Lowcode.agency Claude Code vs Devin: Autonomous Coding Agent Comparison
SP010 ArXiv SWE-bench and AI software engineering benchmark analysis 2026
SP011 Benched.ai Top Coding Agents 2025 — Feature and performance guide
SP012 GitHub GitHub Copilot — Features and Plans
SP013 Amazon Web Services Amazon Q Developer Pricing
SP014 GitHub / OpenAI OpenAI Codex GitHub Repository
SP015 SWE-bench (Princeton NLP) SWE-bench: Evaluating LLM Agents on Real-World Software Engineering Tasks
SP016 Educative.io Cursor vs Windsurf vs Copilot: AI IDE Comparison 2025
SP017 WeCompareAI Best AI Coding Tools 2025: GitHub Copilot vs Cursor vs Windsurf
SP018 CreateAIAgent.net Amazon Q Developer vs GitHub Copilot Workspace Comparison
SP019 Kingy.ai AI Coding Agents in 2025: The Ultimate Battle for Developer Supremacy
SP020 aloa.co GitHub Copilot vs Cursor vs Windsurf: Complete Comparison
SP021 GitHub Docs Subscription plans for GitHub Copilot
SP022 Replit Replit Pricing — AI development platform
SP023 Cursor Cursor Pricing
SP024 Windsurf Windsurf IDE Editor — formerly Codeium
SP025 GitHub (Princeton NLP / SWE-bench) SWE-bench: Software Engineering Benchmark GitHub Repository
SP026 Sean Kim Tech Blog Cursor vs Windsurf vs GitHub Copilot: AI IDE Battle October 2025
SP027 Anthropic Anthropic Claude Pricing — Pro and Max plans
SI001 Cognition AI (Official) Funding, Growth, and the Next Frontier of AI Coding Agents
SI002 Axios Cognition raises $175M at $2B valuation for AI software engineer Devin
SI003 Sacra Research Cognition AI revenue and growth analysis
SI004 Devin.ai (Official) Devin AI Pricing — Plans and ACU details
SI005 ARR.club Cognition AI ARR and revenue tracking
SI006 VentureBeat Cognition follows Windsurf acquisition with $400M fundraise
SI007 Stepmark AI Company Spotlight: Cognition AI — Devin Autonomous Software Engineer
SI008 Axios Cognition AI raises $400 million at $10 billion valuation for Devin
SI009 TechCrunch Cognition AI defies turbulence with a $400M raise at $10.2B valuation
SI010 CNBC Cognition valued at $10.2 billion two months after Windsurf acquisition
SI011 AInvest AI-Driven Software Development Startup Valuation: Cognition AI Case Study
SI012 TechFundingNews Cognition AI scores $400M at $10.2B valuation as demand spikes for coding agents
SI013 TechFundingNews Cognition raises $500M at nearly $10B valuation following Windsurf acquisition
SI014 Observer Cognition AI startup valuation $10B
SI015 TheAIInsider Cognition AI closes $400M in funding to reach $10.2B valuation amid rapid growth
SI016 CBInsights Cognition Labs company profile and financials
SI017 GrowJo Cognition AI company revenue and funding estimates
SI018 Cognition AI (Official) Cognition AI Blog
SI019 Winbuzzer Cognition AI culture reset: offers nine-month buyouts to Windsurf staff
SI020 PitchBook Cognition AI Series A valuation and Devin funding profile
SI021 Growjo Cognition AI funding and employee count 2025
SI022 Devin.ai (Official) Devin AI — Official product and pricing page
SI023 Wikipedia Cognition AI — Wikipedia article
SI024 AIForDevelopers AI Startup Cognition Labs raises $175M valued at $2 billion
SI025 Towards AI (Medium) Cognition AI's Devin: Is it worth the hype?
SI026 SEC EDGAR Form D/A: Founders Fund IX, LP — Amended Notice of Exempt Offering of Securities (CIK 0001971631)
SI027 Cognition AI (Official) Devin heads east: Cognition opens its Singapore APAC headquarters
SI028 Cognition AI (Official) Engineering in the fast lane: Mercedes-Benz partners with Cognition
SE001 Devin (Official Documentation) Devin Documentation — Getting Started and Integrations
SE002 Cognition AI (Official) Introducing Devin, the first AI software engineer
SE003 Cognition AI (Official) SWE-bench Technical Report
SE004 Devin (Official) Devin — The AI Software Engineer
SE005 Cognition AI (GitHub) CognitionAI/devin-swebench-results — SWE-bench evaluation results and methodology
SE006 SWE-bench (Princeton / CMU) SWE-bench Leaderboard and Benchmark Documentation
SE007 Devin (Official Documentation) Devin Documentation — Integrations
SE008 imseankim.com (Developer Blog) Devin 2.0 AI Software Engineer Review: Cognition Pricing and Benchmark
SE009 Hacker News (Y Combinator) Hacker News: Introducing Devin (community discussion thread)
SE010 Benched.ai Top AI Coding Agents 2025 — Benchmarks and Capabilities Snapshot
SE011 arxiv.org / MSR 2026 Empirical Study Comparing AI Coding Agents: Temporal Trends in PR Acceptance
SE012 Artificial Analysis AI Coding Agent Comparison — Artificial Analysis
SE013 Codeium / Windsurf (now Cognition) Windsurf — The AI Code Editor
SE014 Devin (Official) Devin Customer Case Studies
SE015 Cognition AI (Official) Cognition AI acquires Windsurf — blog announcement
SE016 Devin (Official) — Customer Proof How Nubank refactors millions of lines of code with Devin
SE017 Hacker News (Y Combinator) Hacker News: Streamer discovers major vulnerability in Cognition's Devin live on air
SE018 Towards AI (Medium) What happened when Devin AI took on 2,294 GitHub bugs — the 13.86% that changed everything
SE019 TechCrunch Devin AI software engineer is now just $20 a month, down from $500
SE020 SWE-bench (GitHub) SWE-bench GitHub Repository — Benchmark and Leaderboard
SE021 Kingy.ai (Technical Blog) AI Coding Agents in 2025: The Ultimate Battle for Developer Supremacy
SE022 VentureBeat Cognition follows Windsurf acquisition with $400M fundraise showing startup resolve
SE023 UC Berkeley Haas / CMR Adoption of AI and Agentic Systems: Value, Challenges, and Pathways
SE024 Cognition AI (Official) Cognition AI Blog — Product Updates and Technical Posts
SE025 Artificial Analysis AI Agent Benchmark Performance — Coding Agents Evaluation
SU001 Devin (Official) Devin AI — Customer Case Studies
SU002 Devin (Official) — Customer Case Study How Nubank refactors millions of lines of code with Devin
SU003 Cognition AI (Official Blog) Engineering in the fast lane: Mercedes-Benz partners with Cognition
SU004 Cognition AI (Official Blog) Cognizant partners with Cognition; COBOL modernization; Government vertical
SU005 Cognition AI (Official) Devin heads east: Cognition opens its Singapore APAC headquarters
SU006 Cognition AI (Official Blog) Launching in Japan; Cognition for Government; London office
SU007 Growjo Cognition AI: Revenue, Competitors, Alternatives
SU008 imseankim.com (Developer Blog) Devin 2.0 AI Software Engineer Review: Cognition Pricing and Benchmark
SU009 TechCrunch Devin AI software engineer is now just $20 a month, down from $500
SU010 VentureBeat Cognition follows Windsurf acquisition with $400M fundraise
SU011 Hacker News (Y Combinator) Hacker News: Introducing Devin — Developer community discussion
SU012 Artificial Analysis AI Coding Agent Comparison — Artificial Analysis
SU013 Benched.ai Top AI Coding Agents 2025 — Benchmarks and Capabilities Snapshot
SU014 Cognition AI (Official Blog) Devin in Windsurf and Singapore APAC expansion
SU015 Kingy.ai (Technical Blog) AI Coding Agents in 2025: The Ultimate Battle for Developer Supremacy
SU016 UC Berkeley Haas / CMR Adoption of AI and Agentic Systems: Value, Challenges, and Pathways
SU017 Cognition AI (Official Blog) How Cognition Uses Devin to Build Devin
SU018 Hacker News (Y Combinator) Streamer discovers major vulnerability in Cognition's Devin live on air
SU019 arxiv.org / MSR 2026 Empirical Study: PR Acceptance Rates for AI Coding Agents
SU020 Growjo (Employee/Revenue Intelligence) Cognition AI headcount, revenue growth, funding data
SU021 Cognition AI (Official) Devin AI — Product Page
SU022 VentureBeat Cognition AI $175M Series A at $2B valuation to build AI software engineers
SU023 Towards AI (Medium) What happened when Devin AI took on 2,294 GitHub bugs
SU024 Cognition AI (Official) Devin Documentation — Integrations, Enterprise Deployment
SU025 Cognition AI (Official Blog) Introducing Devin 2.2 — Most important update since launch
SU026 Stack Overflow (Developer Survey 2024) 2024 Stack Overflow Developer Survey — AI Tools Adoption
SU027 GitHub Blog How GitHub Copilot is getting better at understanding your code
SU028 State of AI Report State of AI Report — Enterprise AI Adoption Trends
SU029 Cognition AI (Official Blog) Closing the Agent Loop: Devin Autofixes Review Comments
SU030 Stanford HAI Stanford AI Index 2025 — Enterprise AI Adoption and Coding Agents
SR001 Cognition AI (Official Docs) Security at Cognition — SOC 2 Type II, Data Privacy, Intellectual Property
SR002 Cognition AI (Trust Center) Cognition Trust Center — Security Documentation Portal (NDA-gated)
SR003 OWASP Foundation OWASP Top 10 for Large Language Model Applications
SR004 OWASP Foundation (GenAI Security Project) OWASP GenAI Security Project — LLM Top 10 Risks
SR005 NIST National Vulnerability Database CVE-2024-5185 — EmbedAI CSRF Data Poisoning Vulnerability
SR006 California Legislative Information SB 1047 — Safe and Secure Innovation for Frontier Artificial Intelligence Models
SR007 Federal Trade Commission (US) Generative AI Raises Competition Concerns
SR008 The White House Executive Order — Removing Barriers to American Leadership in Artificial Intelligence
SR009 European Parliament EU AI Act — First Regulation on Artificial Intelligence
SR010 UK Information Commissioner's Office (ICO) AI and Data Protection Risk Toolkit
SR011 CourtListener (RECAP) Doe v. GitHub Inc. — AI Copyright Infringement Docket
SR012 Wikipedia (Wikimedia Foundation) Prompt Injection — Cybersecurity Attack Vector
SR013 Hacker News (Y Combinator) HN: Streamer discovers major vulnerability in Devin live on air (Dec 2024)
SR014 Hacker News (Y Combinator) HN: Cognition AI — Devin Launch Skepticism and Scope Analysis (Mar 2024)
SR015 SWE-bench (Princeton NLP / CMU) SWE-bench — Live Benchmark Leaderboard for AI Software Engineering Agents
SR016 Cognition AI (Official Blog) Introducing Devin 2.0 — 83% More Tasks per ACU
SR017 TechCrunch Cognition raises $175M at $2B valuation to build AI software engineers
SR018 VentureBeat Cognition follows Windsurf acquisition with $400M fundraise
SR019 Sacra (Research) Devin ARR and Revenue Metrics Tracker — Cognition
SR020 imseankim.com (Independent Analysis) Devin ARR $1M to $73M in 9 Months — PR Merge Rate and Cost Analysis
SR021 Growjo (Business Analytics) Cognition AI — Headcount, Revenue, and Growth Analytics
SR022 arXiv (Cornell) AI-Assisted Pull Requests: Longitudinal Acceptance Rate Trends in Open-Source
SR023 Anthropic (Official) Claude Code — Agentic Coding Assistant Product Overview
SR024 PitchBook Data Cognition AI — Funding Rounds and Investor Profile
SR025 Cognition AI (Official Blog) Introducing Devin — The AI Software Engineer (March 2024)
SR026 Devin.ai (Official) Nubank Case Study — 12x Efficiency, 20x Cost Savings
SR027 benched.ai (Independent Benchmarking) Top Coding Agents 2025 — Devin vs. Competitors
SR028 TechCrunch Devin 2.0 — Price Drops 3× as Cognition Expands Agentic Capabilities
SR029 CNBC (Technology) Cognition AI Raises $175M Valuing AI Coding Startup at $2 Billion
SR030 Axios (Technology) Cognition AI — Windsurf Acquisition and $400M Raise
SV001 US Securities and Exchange Commission (EDGAR) Cognition Capital SPV I — Form D, CIK 0002072175 (June 2025)
SV002 US Securities and Exchange Commission (EDGAR) Founders Fund IX, LP — Form D Original Filing, CIK 0001971631 (April 2023)
SV003 US Securities and Exchange Commission (EDGAR) EDGAR Full-Text Search — Founders Fund IX Form D Filings
SV004 a16z (Andreessen Horowitz) 100 Gen AI Apps — 4th Edition: Top Consumer and Enterprise AI Applications
SV005 Dealroom.co Cognition AI — Funding History and Investor Profile
SV006 The New Stack Cognition Launches Devin AI Software Engineer for Enterprise
SV007 Founders Fund (Official) Founders Fund Portfolio — Official Website
SV008 Business Wire Cognition AI and Mercedes-Benz Partner to Accelerate Software Development
SV009 OpenAI (Official) OpenAI Codex — Agentic Coding Assistant Product Overview
SV010 Business Wire Cursor Secures $2.3 Billion Series D Financing at $29.3 Billion Valuation
SV011 TechCrunch Cognition AI defies turbulence with $400M raise at $10.2B valuation
SV012 Growjo (Business Analytics) Cognition AI — Revenue and Valuation Analytics
SV013 Sacra (Research) Cognition AI Revenue and Metrics Tracker
SV014 VentureBeat Cognition follows Windsurf acquisition with $400M fundraise
SV015 Cognition AI (Official Blog) Cognition Funding, Growth, and the Next Frontier of AI Coding Agents
SV016 imseankim.com (Independent Analysis) Cognition Devin Product-Led Growth and ARR Analysis
SV017 SWE-bench (Princeton NLP / CMU) SWE-bench Live Leaderboard — Autonomous Software Engineering
SV018 benched.ai (Independent Benchmarking) Top AI Coding Agents 2025 — Competitive Analysis
SV019 Hacker News (Y Combinator) Cognition AI Security Incident Live-Stream Discussion
SV020 Anthropic (Official) Claude Code — Agentic Coding Assistant Overview
SV021 Winbuzzer (Tech News) Cognition AI's Culture Reset Offers Nine-Month Buyouts to Windsurf Staff
SV022 Cognition AI (Official Blog) Cognition Expands to Europe — London Office Opening
SV023 Axios (Technology) Cognition AI raises $400M at $10.2B valuation
SV024 The AI Insider Cognition AI Closes $400M to Reach $10.2B Valuation Amid Rapid Growth
SV025 TechFundingNews Cognition AI Scores $400M at $10.2B Valuation as Demand Spikes for Coding Agents
SV026 Observer (Business) Cognition AI Startup Valuation $10B
SV027 Devin.ai (Official) Nubank Case Study — 12× Efficiency Gain, 20× Cost Savings
SV028 Cognition AI (Official Blog) Introducing Devin 2.0 — 83% More Tasks Per ACU
SV029 CNBC (Technology) Cognition valued at $10.2 billion two months after Windsurf acquisition
SV030 Ainvest.com (AI Investment Research) AI-Driven Software Development Startup Valuation: Cognition AI Case Study