Diligence report AI Software / Developer Productivity Late Private (post-Series A) 2026-05-09

Cognition AI

Cognition AI is a category-defining autonomous software engineering platform with exceptional ARR growth ($1M → $73M in 9 months), but the $10.2B valuation at ~140× ARR prices in near-flawless execution in a market facing rapid benchmark commoditization and critical undisclosed financial metrics.

Cover facts

Last Round Valuation 01

$10.2B (September 2025) [CV004]

Total Funding 02

~$1.575B (seed + Series A + $400M round) [CV010]

Estimated ARR 03

~$73M (April 2025 est.; third-party, unconfirmed) [CV030]

ARR Growth 04

$1M → $73M in 9 months (Dec 2024 – Apr 2025) [CV001]

Lead Investor 05

Founders Fund IX (CIK 0001971631) [CV007]

Key Product 06

Devin — autonomous AI software engineer (cloud + VPC deployment) [CO002]

Windsurf Acquisition 07

Acquired July 2025; 350+ enterprise accounts, 250K+ DAU [CV006]

Named Customers 08

Nubank (12× efficiency, 20× cost savings), Mercedes-Benz, Cognizant [CU003]

Security Incident 09

Prompt-injection vulnerability disclosed Dec 2024; patched [CR006]

Benchmark Gap 10

Devin 13.86% SWE-bench Full (2024); Claude Code 72.5% Verified (2025) [CR018]

Company profile

Cognition AI is a San Francisco-based AI startup founded in 2023 by Scott Wu, Steven Hao, and Walden Yan—competitive programming champions and former olympiad medalists. The company's flagship product, Devin, is a cloud-hosted autonomous AI software engineer that plans, writes, tests, and deploys code end-to-end without constant human input. Devin is accessed via API, Slack, Jira, or a web interface and priced on a usage-based model (ACUs). In July 2025, Cognition acquired Windsurf IDE, adding 250K+ daily active users and 350+ enterprise accounts. The company has raised approximately $1.575B, with the most recent $400M round in September 2025 valuing it at $10.2B.

Website: cognition.ai
Founded: 2023-01-01
Founders: Scott Wu, Steven Hao, Walden Yan
Founding location: San Francisco, CA
Headquarters: San Francisco, CA
Product: Devin is a cloud-hosted autonomous AI software engineer that accepts tasks via Slack, Jira, or web interface, then plans, writes code, tests, and deploys independently using Agent Compute Units (ACUs). Devin 2.0 (April 2025) added direct PR merging at a 3× lower effective price per task. Windsurf IDE (acquired July 2025) adds 250K+ daily active users and 350+ enterprise accounts to the platform.
Customers: Enterprise engineering teams (1,000+ developers), mid-market technology companies, and individual professional developers. Named enterprise customers: Nubank, Mercedes-Benz, Cognizant. US Government vertical launched February 2026.
Business model: Usage-based SaaS: $500/month Team plan plus $2.25/ACU overage; Enterprise plan requires direct negotiation. Windsurf IDE adds free-to-paid conversion funnel. Cognizant channel partnership adds reseller revenue layer.
Stage: Late Private (post-Series A)
Funding status: $21M seed (2023), $175M Series A at $2B (Founders Fund IX, Jan/Apr 2025), $400M round at $10.2B (Sep 2025). Total ~$1.575B disclosed.

[CO001, CO002, CO003, CI001, CI002]

Executive summary

Top strengths

Exceptional ARR growth: $1M to $73M in 9 months represents a generational growth sprint with no direct analog in enterprise SaaS history
Category pioneer: Devin established 'autonomous AI software engineer' as an enterprise procurement line item before most competitors recognized the category
Windsurf IDE integration: 250K+ DAU and 350+ enterprise accounts create a unique combined agent-plus-IDE distribution moat not available to single-product competitors
Lighthouse customer proof: Nubank (12× efficiency, 20× cost savings, published case study) is a high-quality reference for enterprise ROI in financial services
Founder pedigree: Scott Wu, Steven Hao, Walden Yan are competitive programming champions with deep AI/ML engineering intuition relevant to autonomous coding
Devin eats its own dog food: 659 Devin PRs/week merged internally, providing rapid model improvement feedback loop

Top risks

Benchmark commoditization: Claude Code Opus 4 at 72.5% SWE-bench Verified vs. Devin's 13.86% Full demonstrates a 5× competitive improvement in 15 months; technical differentiation may erode by late 2026
Security overhang: December 2024 prompt-injection vulnerability; no published post-Devin-2.0 penetration test or bug-bounty program; expanded autonomous PR-merge capability increases attack surface
Valuation multiple: $10.2B at ~140× estimated ARR is a 7× premium to Cursor/Anysphere; risk-adjusted return at this entry is negative in the base case and catastrophic in the bear case
LLM provider conflict-of-interest: Anthropic and OpenAI are both primary LLM API suppliers and direct competitors via Claude Code and OpenAI Codex; pricing leverage and terms are undisclosed
Customer concentration: Only one quantified enterprise case study (Nubank); $73M ARR from an undisclosed small number of accounts creates high single-customer churn sensitivity
EU AI Act GPAI compliance: August 2026 deadline; no disclosed GPAI registration, EU DPO appointment, or transparency documentation; EU expansion announced without compliance infrastructure

Open gaps

Net Revenue Retention (NRR) by cohort: undisclosed; the single most critical metric for validating ARR quality at $10.2B valuation; must be ≥110% to justify premium
Gross margin by product line: undisclosed; LLM inference cost structure unknown; path to profitability cannot be modeled without this metric
Top-5 customer ARR concentration: undisclosed; Nubank is the only quantified reference; concentration above 30% in a single customer is a thesis-break risk
Post-Devin-2.0 security audit: no public penetration test report or red-team disclosure post-April 2025 expanded agentic capabilities; SOC 2 Type II (March 2024) predates current attack surface
Windsurf integration customer retention: 350+ enterprise accounts acquired July 2025; 6-month retention rate (January 2026) unknown; critical proof point for IDE moat thesis
LLM API contract terms: volume pricing, agentic use-case restrictions, and change-of-control provisions with Anthropic/OpenAI are undisclosed; margin floor and supplier conflict-of-interest cannot be assessed

Chapter 01

01Company Overview

1.1 Identity and Business Model

Cognition AI is a San Francisco-based AI-native software company building autonomous agents that perform end-to-end software engineering tasks. Its flagship product, Devin, is marketed as the world's first AI software engineer: an agent that can interpret tickets, plan solutions, write and debug code, run tests, and deploy to production with minimal human intervention. The company operates a SaaS subscription model for individual developers and small teams (Core plan at $20/month), a Team plan at $500/month including 250 Agent Compute Units, and custom enterprise contracts with VPC deployment options. Following the July 2025 acquisition of Windsurf—an AI-native IDE—Cognition expanded its product portfolio to serve developers who prefer a more interactive, IDE-centered workflow rather than a fully autonomous agent. The dual-product strategy allows Cognition to address both the agentic automation segment and the traditional coding-assistant market that GitHub Copilot and Cursor dominate. Revenue is primarily usage-based: additional Agent Compute Units beyond plan inclusions are priced at roughly $2.25 per ACU, creating a metered model that scales with enterprise task volume. The company's website and blog are public-facing channels; it does not publish GAAP financials and reports only via investor communications and company-controlled press releases.[CO001, CO002, CO003, CO028, CO029]

1.2 Founders and Leadership

Cognition AI was co-founded by three elite competitive programmers, each an International Olympiad in Informatics gold medalist, giving the company an exceptionally deep technical foundation from inception. Scott Wu, CEO, is a three-time IOI gold medalist, a 2011 MATHCOUNTS national champion, and holds a Harvard economics degree. On the Codeforces competitive programming platform, Wu holds the top designation of Legendary Grandmaster, reflecting continued excellence in algorithmic problem-solving. Prior to Cognition, he co-founded Lunchclub, an AI-powered professional networking product. Steven Hao, CTO, is also an IOI gold medalist with prior experience at Scale AI, DeepMind, Waymo, Nuro, Modal, and Cursor—a background spanning both frontier AI research and production engineering systems. Walden Yan, CPO, brings product leadership and IOI-level algorithmic depth. The founding team of approximately 10 individuals collectively held 10 IOI gold medals at launch, an extraordinary concentration of competitive-programming talent rarely seen at an early-stage startup. This profile has attracted top-tier investors and talent but also creates a meaningful key-person dependency: the founders' technical credibility is inseparable from the company's market positioning. No board composition or independent director information has been publicly disclosed.[CO004, CO005, CO006, CO007, CO026, CO027]

Leadership and Founder Table
Name	Role	Educational/Competitive Background	Prior Experience	Key-Person Dependency
Scott Wu	CEO & Co-founder	3× IOI gold medal; 2011 MATHCOUNTS national champion; Harvard (economics); Codeforces Legendary Grandmaster	Lunchclub co-founder	Critical — brand and technical credibility inseparable from product
Steven Hao	CTO & Co-founder	IOI gold medal	Scale AI, DeepMind, Waymo, Nuro, Modal, Cursor	Critical — core model and system architecture
Walden Yan	CPO & Co-founder	IOI gold medal	Product leadership in AI	High — product vision and roadmap

Board composition, independent directors, and broader leadership team (VP-level and below) are not publicly disclosed. Only co-founders have been publicly identified.

[CO004, CO005, CO006, CO007, CO026, CO027]

1.3 Funding History and Investors

Cognition AI's capital formation has been among the fastest in recent venture history. An initial ~$21M seed/Series A round in March 2024 valued the company at approximately $350M before any meaningful revenue existed, reflecting investor conviction in the founding team's ability to execute. Just weeks later in April 2024, Cognition closed $175M at a $2B valuation, crossing into unicorn territory within six months of founding. Founders Fund—Peter Thiel's venture firm—led both rounds, providing not only capital but reputational signal to the broader market. By September 2025, following the Windsurf acquisition and rapid ARR growth, Cognition raised $400M at a post-money valuation of $10.2B, bringing total capital raised to approximately $696M. Co-investors in the September 2025 round included Lux Capital, 8VC, Elad Gil (prominent angel and operator), Bain Capital Ventures, D1 Capital, Definition Capital, and Swish Ventures. As of May 2026, the company is reportedly in discussions for a new financing round at a potential $25B+ valuation, implying continued strong investor appetite for agentic AI coding infrastructure. Notably, the company has disclosed a total net cash burn of under $20M from founding through Q3 2025, suggesting the rapid growth was achieved with remarkable capital efficiency prior to the acquisition.[CO008, CO009, CO010, CO020, CO030, CO031]

Cognition AI Snapshot KPI Table
Metric	Value / Status	As-Of Date	Confidence	Gap / Caveat
Valuation (last round)	$10.2B	Sep 2025	high	Post-money; pre-money not disclosed
Total capital raised	~$696M	Sep 2025	high	Sum of known disclosed rounds
ARR (Devin, pre-acquisition)	$73M	Jun 2025	high	Company-disclosed via blog
ARR (post-Windsurf combined)	~$155M	Jul 2025	medium	Partly company-claimed; Windsurf component is estimate
Initial funding (seed/Series A)	$21M at $350M	Mar 2024	high	First disclosed institutional round
Series A extension	$175M at $2B	Apr 2024	high	Company announcement + press
Net cash burn (inception to Q3 2025)	<$20M	Sep 2025	medium	Company-claimed; not audited
Headcount (est.)	~50-250	2025	low	Pre-acquisition ~49; Windsurf integration unclear
Enterprise customers (Windsurf)	350+	Jul 2025	medium	Windsurf-only figure at acquisition
Price to ARR multiple (Sep 2025)	~65-140x	Sep 2025	medium	Depends on which ARR base is used

Revenue and valuation figures are company-reported or analyst-estimated; no audited financials are publicly available. ARR figures represent annualized subscription and usage revenue. Multiple ranges reflect pre- vs. post-Windsurf ARR bases.

[CO008, CO009, CO010, CO014, CO015, CO016]

Stakeholder or investor map
Investor	Round(s)	Role	Control / Economic Importance	Diligence Ask
Founders Fund	Seed/Series A + $400M	Lead investor	Largest single institutional holder; likely board representation	Confirm board seats, anti-dilution, pro-rata rights
Lux Capital	$400M (Sep 2025)	Co-investor	Deep-tech focus; likely minority stake	Confirm ownership %
8VC	$400M (Sep 2025)	Co-investor	Tech-focused growth fund	Confirm ownership %
D1 Capital	$400M (Sep 2025)	Co-investor	Late-stage growth equity	Confirm ownership % and governance rights
Bain Capital Ventures	$400M (Sep 2025)	Co-investor	Enterprise SaaS experience; minority	Confirm ownership %
Elad Gil	Multiple rounds	Individual investor	Prominent angel; advisory relationship likely	Confirm advisory vs. formal board role
Definition Capital	$400M (Sep 2025)	Co-investor	Technology focused	Confirm size of check
Swish Ventures	$400M (Sep 2025)	Co-investor	Early-stage / growth technology	Confirm role

Ownership percentages are not publicly disclosed. Round participation derived from press releases and news reports. Board rights, anti-dilution terms, and liquidation preferences are unknown.

[CO008, CO009, CO010, CO020, CO036]

FO002: Cognition AI ARR and Valuation Snapshot KPIs

Key financial and operational metrics as of the most recent disclosed data points.

ARR is a combination of company-disclosed Devin ARR ($73M Jun 2025) and Windsurf ARR ($82M at acquisition Jul 2025). Valuation is post-money from the September 2025 financing. Lifetime burn is company-stated and unaudited.

[CO037, CO042, CO015, CO016]

1.4 Scale and Milestones

Cognition AI's revenue trajectory is extraordinary by any SaaS benchmark. The company exited 2024 at approximately $1M ARR (September 2024) and accelerated to $73M ARR by June 2025—a roughly 73x increase in nine months driven by enterprise adoption of Devin for code migration, bug-fixing, and feature automation tasks. The July 2025 acquisition of Windsurf, which contributed $82M ARR from 350+ enterprise accounts, pushed the combined business to approximately $155M ARR within weeks. Major enterprise clients including Goldman Sachs (reportedly piloting Devin with 12,000 developers), Citi, Dell, Cisco, Ramp, Palantir, Nubank, and Mercado Libre have been publicly cited. Nubank reportedly achieved a 12x efficiency improvement in code migration workflows using Devin. In April 2026, Cognition opened a Singapore APAC headquarters, signaling international expansion intent. The same month, a Mercedes-Benz partnership was announced. Headcount data is not publicly disclosed in detail; pre-acquisition estimates placed the Cognition team at roughly 49 employees, while the Windsurf acquisition added potentially 200 individuals before layoffs and buyout departures.[CO011, CO012, CO013, CO014, CO015, CO021]

Milestone table
Date	Event	Type	Amount / Valuation / Status	Participants	Implication
Nov 2023	Company founded by Scott Wu, Steven Hao, and Walden Yan	founding	N/A	Three IOI gold medalists	Launch of first purpose-built agentic AI coding company
Mar 2024	$21M seed/Series A at $350M valuation	financing	$21M / $350M	Founders Fund	Institutional validation before first product release
Mar 2024	Devin publicly announced; SWE-bench 13.86% score published	product	N/A	Cognition AI	World's first AI software engineer claim; generated global press
Apr 2024	$175M raise at $2B valuation	financing	$175M / $2B	Founders Fund + others	Unicorn status within ~5 months of founding
Sep 2024	$1M ARR milestone	scale	$1M ARR	N/A	Early commercial traction from self-serve enterprise
Jan 2025	Devin 2.0 launched with multi-agent parallelization	product	N/A	Cognition AI	83% efficiency gain claimed vs. v1.x; parallel task execution
Jun 2025	$73M ARR reached (Devin only)	scale	$73M ARR	N/A	73x ARR growth in 9 months; breakthrough enterprise adoption
Jul 2025	Windsurf acquired (~$250M est.)	product	~$250M / +$82M ARR / 350+ customers	Cognition AI / Windsurf	Added AI-native IDE product and enterprise customer base
Aug 2025	30 Windsurf employees laid off; buyouts offered to ~200	adverse	9-month salary buyout packages	Cognition AI CEO Scott Wu	Culture integration controversy; talent attrition risk
Sep 2025	$400M raise at $10.2B valuation	financing	$400M / $10.2B	Founders Fund, Lux, 8VC, D1, Bain, Elad Gil + others	Decacorn status; 5x valuation expansion in 18 months
Apr 2026	Singapore APAC headquarters opened	scale	N/A	Cognition AI	First international office; APAC enterprise penetration
Apr 2026	Mercedes-Benz partnership announced	partnership	N/A	Cognition AI, Mercedes-Benz	European automotive enterprise entry point

Dates and amounts derived from company announcements and press coverage. Windsurf acquisition price is an industry estimate; official terms not disclosed. Milestone table reflects publicly known events only; internal pivots and regulatory activity are not known.

[CO001, CO008, CO009, CO011, CO014, CO017]

FO001: Cognition AI Company Milestone Timeline

Key founding, financing, product, scale, and adverse events from November 2023 through April 2026.

[CO034, CO037, CO041, CO042, CO025, CO033]

FO003: Cognition AI Business System Flow

How Cognition AI's identity, products, customers, and capital relate to each other.

[CO003, CO011, CO021]

1.5 Governance and Key-Person Risk

Cognition AI is a privately held C-corporation with no public financial disclosures, no publicly identified board composition, and no independent directors known to investors or analysts outside the cap table. The company's governance structure remains opaque: investor rights, board seats granted to Founders Fund and other lead investors, and secondary market transaction terms have not been disclosed. The extreme concentration of product vision and technical credibility in the three co-founders—particularly Scott Wu, whose public persona and coding legend status are inseparable from Devin's brand—creates substantial key-person risk. The mandatory 80-hour, six-day workweek culture that prompted voluntary buyout offers to Windsurf employees in 2025 raises questions about talent retention, especially for acquired personnel who did not self-select for that environment. Post-acquisition attrition among Windsurf engineers could erode the IDE product capabilities that justified the ~$250M deal. Critics and independent reviewers have noted that Devin's real-world task success rates are substantially lower than company-curated benchmark demonstrations suggest, and that the 'fully autonomous software engineer' framing may oversell current capabilities, creating future investor and customer expectation risk.[CO034, CO035, CO041, CO022, CO023]

1.6 Exhibits

Chapter 02

02Market Analysis

2.1 Market Definition and Boundaries

The market for Cognition AI's Devin product sits at the intersection of two overlapping categories: AI-powered developer tools (copilots, code completion, review automation) and autonomous software engineering agents (end-to-end task execution requiring planning, multi-step reasoning, and self-correction). The former is a large established market anchored by GitHub Copilot with $2B+ ARR; the latter is an emerging sub-segment where Devin occupies a pioneer position. For sizing purposes, the relevant Total Addressable Market spans tools that professional software development teams might purchase to increase engineering velocity—from autocomplete plugins through fully autonomous agents that can execute tickets without human oversight. The included spend is: enterprise and SMB subscriptions to AI coding assistants, autonomous agent platforms sold to engineering teams, and API consumption for programmatic code generation. Excluded spend is: general large language model API access not sold for code use-cases, low-code/no-code application builders targeting business users (distinct buyer and workflow), infrastructure software (compute, storage, networking), and traditional outsourced software development services. Status-quo substitutes include offshore staff augmentation, traditional IDEs without AI, internal developer tooling built on base LLM APIs, and cloud-vendor bundled IDE extensions such as AWS CodeWhisperer and Azure GitHub Copilot bundles. The market boundary is contested across analyst firms. Grand View Research segments "generative AI coding assistants" narrowly at $92–98M by 2030 using a restrictive scope, while Mordor Intelligence defines "AI code tools" broadly at $24B by 2030. Cognition's Devin occupies the premium agentic sub-segment with $500/month Team plans and enterprise contracts in the $500K+ per year range—far above the $10–$19/seat copilot tier. The serviceable addressable market thus comprises large enterprises willing to pay a productivity premium for autonomous ticket resolution, estimated at 5–15% of the global developer population of 27–28.7 million professional engineers. [CM001, CM003, CM007, CM021, CM023]

Market definition table
Category	Included Spend	Excluded Spend	Buyer / Payer	Relevance to Cognition
AI Coding Assistants (copilot tier)	IDE plugin subscriptions, autocomplete APIs, code review bots	General LLM API access not sold for code	Developer / Engineering budget	Indirect: establishes category norm Devin upgrades from
Autonomous Software Engineering Agents (agent tier)	End-to-end ticket resolution platforms, agent-as-engineer subscriptions, enterprise agent deployments	RPA tools, test automation (non-LLM)	VP Eng / CTO / IT budget	Direct: Devin's primary TAM
Low-Code / No-Code Platforms	Visual app builders, citizen-developer tools, spreadsheet automation	Excluded entirely from Cognition's TAM	Business users (not engineers)	Not addressable by Devin
Traditional Outsourced Dev Services	Staff augmentation, consulting, offshore engineering teams	In-house headcount salaries	Procurement / Finance	Substitution target: Devin competes for this budget
Cloud IDE and Infrastructure Bundles	AWS CodeWhisperer, Azure Dev tools, GCP AI bundled with cloud services	Core cloud compute, storage, networking	Engineering / Cloud buyer (bundled)	Partially competitive with Cognition for enterprise engineering wallet

Market boundary is disputed across analyst firms; this table reflects Cognition's serviceable scope, not industry consensus definitions. Row 3 (low-code) is explicitly excluded from Cognition's TAM despite some analyst reports conflating it with AI code tools.

[CM001, CM021, CM023]

2.2 Market Sizing — TAM, SAM, and SOM

Multiple independent sizing lenses are applied to bound the opportunity and reconcile widely varying analyst estimates. The lens approach is preferred over a single top-down estimate given material methodological variation across sources. Lens 1 — AI code tools market (Mordor Intelligence, 2025): The broad AI code tools market is valued at $7.37B in 2025 and forecast to reach $23.97B by 2030 at a 26.6% CAGR. This covers all AI-assisted coding tools from IDE plugins through full agents. TAM candidate: $7.4B (2025), $24B (2030). Limitation: overly broad; includes low-code tools, documentation AI, and testing automation not directly comparable to Devin's buyer profile. Lens 2 — Developer seats × willingness-to-pay: 27–28.7 million professional developers globally (Evans Data / Statista 2024). GitHub Copilot at $10–$19/month captures approximately 1.3 million paid seats from this pool in 2025. Autonomous agent tools targeting enterprise buyers at $500–$5,000/month per team are addressable to perhaps 10% of enterprise engineering teams—roughly 500K–1M developer seats × $2,000/year average contract value = $1B–$2B SAM in 2026, expanding to $5B–$8B by 2030 as adoption broadens. Lens 3 — Enterprise software engineering spend share: Global software spending reached approximately $675– $700B in 2024 (WIPO). Even a 1% shift of software project labor costs to AI tools implies a $6.7B market; a 3% shift implies $20B. This is an upper-bound lens corroborating the Mordor $24B 2030 figure. Lens 4 — Gartner adoption curve: Gartner forecasts 90% of enterprise engineers using AI code assistants by 2028 versus 14% in early 2024. Total estimated enterprise engineering headcount globally approximately 15 million. At 90% penetration × $500/year average price = $6.75B enterprise sub-segment alone by 2028. These estimates conflict materially: the narrow Grand View Research figure of $92–98M by 2030 excludes coding-adjacent AI tools entirely, while the $97.9B ResearchAndMarkets figure appears to use an atypically broad scope that inflates the addressable market. Diligence should weight the $1–8B SAM range most heavily. [CM001, CM002, CM003, CM004, CM005, CM006]

TAM/SAM/SOM or sizing lens table
Publisher	Year	Geography	Value	CAGR	Methodology	Confidence	Limitation
Mordor Intelligence	2025→2030	Global	$7.37B→$23.97B	26.6%	Bottom-up vendor revenues + adoption modeling	medium	Broad scope includes low-code tools; overstates Devin-comparable market
Grand View Research	2024→2030	Global	$25.9M→$92.5M (narrow)	24.8%	Narrow 'generative AI coding assistants' only	medium	Likely understates by excluding broader developer AI tools
ResearchAndMarkets / BusinessWire	2025→2030	Global	→$97.9B	24.8%	Broad AI tools including dev infrastructure	low	TAM figure implausibly large; includes non-comparable segments
Developer seats lens (this analysis)	2025→2030	Global	$1–2B SAM→$5–8B	~40%	27M developers × enterprise penetration × ASP	medium	ASP assumption ($2K/yr avg) sensitive to enterprise/SMB mix
Gartner adoption lens (this analysis)	2028	Enterprise global	~$6.75B enterprise segment	N/A	90% enterprise penetration × 15M eng × $500/yr	medium	Assumes full Gartner adoption curve materializes by 2028
Agentic AI market (CMR / VC Cafe)	2025→2034	Global	$4.35B→$103B	>40%	Cross-sector autonomous agent spend	low	Includes non-software verticals; directionally supportive only

Wide TAM range reflects fundamentally different scope definitions. SAM lens ($1–2B today) is the most actionable figure for Cognition planning. Low/base/high scenario: $1B / $7B / $24B by 2030 for the broad AI code tools market.

[CM001, CM002, CM003, CM004, CM005, CM024]

FM001: Market sizing lens

TAM/SAM/SOM layers for the AI autonomous software engineering market as of 2025, showing addressable market funnel from broad AI tools to Cognition's target SOM

SAM and SOM are analyst estimates based on developer-seat and enterprise-spend lenses; actual market boundaries disputed across research firms.

[CM001, CM005, CM007, CM035]

FM002: Market estimate range

Low / base / high estimates for the AI code tools market by 2030, showing analyst disagreement across scope definitions

All values are 2030 projections in USD billions. Wide range reflects fundamentally different scope definitions across analyst sources. ResearchAndMarkets $97.9B figure excluded as outlier with implausibly broad scope.

[CM002, CM003, CM004, CM005, CM024]

2.3 Buyer, User, and Payer Segmentation

Cognition AI targets three principal buyer segments with distinct budget ownership and adoption triggers. The primary segment, enterprise software teams, sees VP Engineering or CTO act as buyer with software engineers as users and IT budget as payer. The adoption trigger is a measurable productivity gap—typically revealed by benchmarking AI tool ROI against headcount costs. Reference customers Goldman Sachs (12K developer pilot), Citi, Cisco, Dell, and Nubank illustrate this segment. Deal sizes range from $100K to $2M+ annually for enterprise contracts, with sales cycles of 3–9 months including legal and security review. The secondary segment, high-growth technology startups, sees CTOs or founding engineers as both buyer and user, with founders allocating seed capital as payer. The adoption trigger is velocity: Devin allows a 10-person team to execute like a 50-person team. These buyers typically start on the $20/month Core plan and upgrade to Team. Key reference customers include Ramp and Nubank, which reported a 12× efficiency gain. Deal value is lower at $500–$6,000/year but volume is high and conversion to enterprise contracts is likely as companies scale. The tertiary segment, individual developers and freelancers, presents lower strategic priority. Buyers and users are the same person; payers are self-employed developers billing hourly. The adoption trigger is competitive necessity as AI tooling becomes table-stakes for professional developers. The Core plan at $20/month targets this group, which has high churn sensitivity and primarily serves as a product feedback loop and lead-generation funnel rather than a primary revenue driver. Financial services enterprises represent a cross-cutting priority segment given Goldman Sachs' 12K-developer Devin pilot and Citi's reported deployment—these buyers require SOC 2 Type II, data residency, and regulatory compliance commitments before enterprise sign-off. [CM010, CM011, CM018, CM019, CM022, CM025]

Segment / buyer map
Segment	Buyer	User	Payer	Workflow	Budget Owner	Adoption Trigger
Enterprise F500 Engineering	VP Engineering / CTO	Software engineers, DevOps	Engineering / IT budget	Ticket resolution, code review, new feature development	CTO or VP Eng	Developer headcount cost pressure; AI ROI demonstrated by pilot
Mid-market SaaS Companies	CTO / Eng Manager	Backend + frontend developers	Engineering budget	Feature velocity, tech debt paydown	CTO	Speed-to-market competitive pressure; fundraise-to-launch cycle time
High-Growth Startups (Seed–Series B)	Founding engineer / CTO	Developers + CTO	Founder budget / seed capital	Full-stack feature development, MVP builds	Founders	Capital efficiency: replace 2–3 developer headcount cost
Financial Services Enterprises	CIO / Head of Platform Eng	Compliance-conscious engineers	IT / digital transformation budget	Internal tooling, regulatory reporting code	CIO	Regulatory tech modernization + developer cost reduction
Individual Developers / Freelancers	Self	Self	Self	Client project execution, side projects	Individual	Competitive necessity; peers using AI tools

Budget ownership varies: enterprise deals flow through formal procurement with security review lasting 3–9 months; startup deals are founder-approved in days. Financial services buyers require SOC 2 Type II and data residency commitments before enterprise sign-off.

[CM018, CM019, CM025, CM026]

FM003: Buyer / segment map

Buyer-user-payer relationships and value flow across Cognition AI's key market segments from budget owner through to enterprise deployment

[CM018, CM019, CM025, CM026, CM028]

2.4 Growth Drivers and Adoption Constraints

The market is subject to strong structural tailwinds as well as specific constraints governing adoption velocity and the pace at which Cognition can monetize the opportunity. Growth drivers include documented developer productivity gains—GitHub Copilot users complete tasks 51–55% faster with 46% of code AI-generated, and organizations using AI tools report 3.2× developer productivity improvements—which validate the ROI case and lower the procurement approval bar. The rapid shift to agentic AI is particularly significant: enterprise spend on agentic AI systems is projected to surge from under $1B in 2024 to $51B+ by 2028 at approximately 150% CAGR, normalizing autonomous agent budgets across enterprise technology organizations. Labor cost and talent scarcity create a compelling economic case. The median US software engineer salary exceeds $130K annually, meaning a $500/month tool that autonomously resolves even 10–20% of engineering tickets has a self-funding ROI measurable in weeks rather than quarters. Gartner's Magic Quadrant coverage of AI code assistants provides enterprise procurement legitimacy, and the Gartner forecast of 90% enterprise adoption by 2028 creates a top-down organizational mandate supporting category-level spending approval. The primary constraints are trust and governance: enterprises universally require role-based access controls, audit trails, IP ownership clarity, and security review before autonomous deployment, adding 3–9 months to enterprise sales cycles. Hallucination and code quality risk mean autonomous agents can introduce hard-to-catch bugs; buyers insist on review layers that limit full automation scope. Integration with legacy toolchains—on-premise Git servers, air-gapped CI/CD pipelines, proprietary IDEs—increases deployment cost and complexity. The EU AI Act and US executive orders on AI safety impose documentation and explainability requirements on enterprise AI systems in critical workflows. Competitive fragmentation, with GitHub Copilot (42% market share), Cursor ($9B valuation), Windsurf (now part of Cognition), and OpenAI Codex all competing, can produce buyer choice overload that stalls procurement decisions. [CM008, CM009, CM012, CM013, CM014, CM015]

Growth drivers and constraints table
Factor	Direction	Timing	Implication	Diligence Ask
Developer productivity ROI documented (51–55% faster with AI)	Tailwind	Present (2024–2026)	Validates economic case; accelerates procurement approval for AI tools	Verify Cognition-specific ROI data from customers, not just GitHub Copilot proxies
Gartner: 90% enterprise AI code adoption by 2028	Tailwind	Near-term (2026–2028)	Category legitimacy; creates top-down mandate in large enterprises	Confirm enterprise customers are accelerating commitments in 2026
Agentic AI spend: $1B→$51B by 2028 (150% CAGR)	Tailwind	Near-term (2026–2028)	Massive budget reallocation toward autonomous tools expands TAM rapidly	Monitor whether enterprise CFOs approve agentic budgets separately from copilot budgets
Software engineer talent scarcity and salary inflation	Tailwind	Ongoing	Makes $500/month Devin compelling vs. $130K+ engineer hire economically	Track developer salary surveys; assess if AI tools slow hiring demand in target segments
Hallucination / code quality trust deficit	Headwind	Near-term (2025–2027)	Slows autonomous deployment; requires human review layers limiting full TAM capture	Ask customers what % of Devin output is deployed without human review in production
Enterprise security and IP ownership concerns	Headwind	Present and ongoing	Extends sales cycles 3–9 months; restricts Fortune 500 deployments pending legal review	Review Cognition's IP terms, data handling, and SOC 2 / ISO 27001 certifications
EU AI Act and US AI governance regulation	Headwind	Medium-term (2026–2028)	Adds compliance documentation burden; may require explainability features Devin lacks	Review Cognition's regulatory compliance roadmap; assess EU market readiness
Competitive fragmentation (Copilot, Cursor, Windsurf, Codex)	Headwind	Present	Buyer choice overload stalls procurement; risk of commoditization at copilot tier	Track whether Devin's agentic differentiation sustains or competitors close the gap

Timing reflects estimated onset or peak relevance of each factor. All tailwinds are structural (secular trends) rather than cyclical, suggesting the favorable dynamics are durable over the 3–5 year investment horizon.

[CM008, CM009, CM012, CM013, CM014, CM020]

FM004: Adoption funnel or value-chain map

Adoption journey from developer awareness to full autonomous deployment of Cognition AI Devin in enterprise environments

Funnel stage percentages are estimates based on Gartner adoption data and reported customer counts; intermediate stages (evaluation, departmental) are analyst estimates not directly published by any research firm.

[CM008, CM010, CM015, CM017, CM019]

2.5 Exhibits

Chapter 03

03Competitors

3.1 Competitive Landscape Overview

The AI-assisted software engineering market in 2026 is stratified into three broad competitive tiers. The first tier—established workflow co-pilots—includes GitHub Copilot (Microsoft) and Amazon Q Developer, both backed by cloud hyperscaler distribution. GitHub Copilot commands an estimated 42% category market share with 20 million users and over $2 billion in annual recurring revenue as of early 2025, deeply embedded across enterprise developer toolchains. Amazon Q Developer provides similar completion and chat capabilities optimized for AWS workloads, targeting the large installed base of AWS enterprise customers with SOC 2 Type II compliance and native IAM integrations. The second tier—IDE-native AI coding assistants—is led by Cursor (Anysphere), which reached $2 billion ARR by February 2026 after launching its first paid tiers in mid-2024 and now operates at a $29.3 billion valuation. Cursor's VS Code fork model allows deep editor integration unavailable to plugin-based competitors, and its multi-agent parallel execution architecture (up to eight concurrent agents) increasingly overlaps with Devin's autonomous coding use case. Windsurf (formerly Codeium), now acquired by Cognition, previously competed in this tier before integration. The third tier—fully autonomous AI software engineers—is where Cognition's Devin directly positions. Claude Code (Anthropic), OpenAI Codex (re-launched as a web-based agent), and SWE-agent (open-source academic benchmark harness) all compete here to varying degrees. Claude Code's terminal-native approach and superior code quality on SWE-bench evaluations represent a quality threat, while OpenAI Codex's re-launch as an agentic execution environment targets Devin's core positioning. The differentiation frontier is shifting from benchmark scores toward measurable enterprise workflow integration and verifiable ROI metrics—terrain where Cognition's deployment evidence with Goldman Sachs and others provides a current advantage. [CP001, CP002, CP003, CP033]

Competitor profile table
Competitor	Company	Founded	Tier	ARR (est.)	Valuation	Key Investors
GitHub Copilot	Microsoft/GitHub	2021	Hyperscaler co-pilot	$2B+	N/A (MSFT)	Microsoft
Cursor	Anysphere	2022	IDE-native assistant	$2B	$29.3B (Nov 2025)	Thrive, a16z, Accel, Nvidia, Google
Claude Code	Anthropic	2023	Autonomous agent	N/A (bundled)	$60B+	Google, Amazon, Salesforce
Amazon Q Developer	Amazon Web Services	2023	Hyperscaler co-pilot	N/A (AWS bundle)	N/A (Amazon)	Amazon
OpenAI Codex	OpenAI	2025 (re-launch)	Autonomous agent	N/A (bundled)	$300B+	Microsoft, a16z, Thrive
Replit	Replit Inc.	2016	Cloud IDE platform	$80M est.	$1.2B	Andreessen Horowitz, Coatue
SWE-agent	Princeton NLP	2024	Open-source framework	Free/open-source	N/A	Academic
Windsurf (pre-acq.)	Codeium (acquired)	2021	IDE-native assistant	$82M (at acq.)	$250M est.	General Catalyst, Kleiner

ARR and valuation figures are estimates from secondary sources as of early 2026 where primary sources unavailable.

[CP001, CP002, CP003, CP005, CP006, CP016]

FP001: Competitive positioning map

[CP025, CP033, CP037]

3.2 Tier-One Hyperscaler-Backed Competitors

GitHub Copilot, owned by Microsoft since the GitHub acquisition, benefits from unmatched distribution embedded in the world's largest developer platform. With 90% of Fortune 100 companies using Copilot and over 20 million active developers, its scale creates high switching-cost inertia. Copilot's pricing ranges from $10 per month for individuals to $19 per month for Business tier and $39 per month for Enterprise, making it the default low-cost entry point for most developer teams. The platform has progressively expanded from inline code completions to chat, pull-request summarization, and workspace-level Copilot Workspace—a multi-step planning and execution agent that increasingly encroaches on Devin's autonomous territory. GitHub Copilot reached over $2 billion in annual recurring revenue by early 2025, making it the largest pure-play AI coding revenue line globally. Amazon Q Developer addresses the enterprise segment's compliance and security requirements most directly, offering a Free tier and a Pro tier at $19 per user per month. Its native AWS service integrations, 200,000-token context window, and Agents for Amazon CodeWhisperer capability set position it as the preferred choice for organizations standardized on AWS. Q Developer achieved SOC 2 Type II certification and supports VPC isolation—security postures unavailable from most startup competitors. The hyperscaler tier's primary limitation versus Cognition is the absence of full-cycle autonomous task completion: neither Copilot nor Q Developer autonomously plans, implements, tests, and deploys end-to-end without human-in-the-loop confirmation at each stage. This architectural constraint creates the addressable gap Devin is designed to exploit, though both hyperscalers are actively extending agent capabilities toward this frontier. [CP004, CP009, CP010, CP011, CP019, CP022]

Feature / capability matrix
Feature	Devin (Cognition)	GitHub Copilot	Cursor	Claude Code	Amazon Q Dev
Autonomous end-to-end execution	Yes (full cycle)	Partial (Workspace)	Partial (8 agents)	Partial (human-supervised)	Partial (Agents feature)
IDE integration	SaaS + Windsurf IDE	VS Code, JetBrains, Vim	VS Code fork	Terminal / CLI	VS Code, JetBrains, CLI
Context window	Not disclosed	Not disclosed	Not disclosed	200K tokens	200K tokens
Multi-agent parallel	Yes (Devin Teams)	No	Yes (up to 8)	No	No
Open-source model option	No	No	Model routing (multiple)	No	No
SOC 2 / Enterprise compliance	Partial	Enterprise tier	Enterprise tier	Enterprise via API	SOC 2 Type II
Free tier	No	Free (limited)	Free (hobby)	Yes (limited)	Free (50 req/mo)
VPC / on-prem deployment	Not documented	No	No	Via API gateway	Yes (VPC)

Capability ratings are based on publicly documented features as of May 2026; roadmap features excluded.

[CP008, CP010, CP012, CP022, CP025, CP030]

Pricing / packaging comparison
Tool	Free Tier	Entry Paid	Mid Tier	Enterprise
Devin (Cognition)	No	$20/mo (5 ACUs)	$500/mo (250 ACUs, Team)	Custom
GitHub Copilot	Yes (2k completions)	$10/mo (Individual)	$19/mo (Business)	$39/mo (Enterprise)
Cursor	Yes (hobby)	$20/mo (Pro)	$40/mo (Business)	Custom
Claude Code	Yes (limited)	$10/mo (Pro)	$100/mo (Max)	Enterprise API
Amazon Q Developer	Yes (50 req/mo)	$19/mo (Pro)	N/A	Custom / Marketplace
Replit	Yes	$25/mo (Core)	$40/mo (Teams)	Custom

Pricing as of May 2026 from official product pages; subject to frequent change.

[CP009, CP015, CP019, CP023]

FP002: Feature breadth / capability map

[CP008, CP010, CP022, CP029]

3.3 IDE-Native Competitors — Cursor and Windsurf

Cursor (Anysphere) is the most commercially formidable competitor to Cognition in the pure-ARR sense, having scaled from zero to $2 billion ARR within approximately 24 months of its public launch. Its Series D funding round in November 2025 raised $2.3 billion at a $29.3 billion valuation—the largest venture financing in AI coding history at the time. Investors include Thrive Capital, Andreessen Horowitz, Accel, Nvidia, and Google. Cursor's architecture relies on a VS Code fork with deep token-budget controls, model-agnostic routing (Sonnet, GPT-4o, Gemini), and a multi-agent background agents feature that runs up to eight parallel task threads—narrowing the functionality gap with Devin's full-autonomy proposition. Cursor's 1 million-plus paying customers and 50,000 enterprise teams demonstrate product-market fit across the individual developer and team segments. Cursor is growing faster than Cognition on ARR metrics and has more paying customers, providing proportionally more training signal and pricing power. Windsurf (formerly Codeium) built strong developer mindshare with a free-tier, VS Code compatible AI IDE and over 800,000 users before the Cognition acquisition in July 2025. Post-acquisition, Windsurf's enterprise book ($82M ARR contributed, 350-plus enterprise customers) was folded into Cognition's revenue base, converting a direct competitor into a distribution channel. The integration is not yet complete, and retention of Windsurf's enterprise customers under the Cognition brand is a key execution risk. The acquisition also demonstrated Cognition's willingness to pursue inorganic growth to defend against IDE-layer encroachment—a signal that the competitive pressure from Cursor in the IDE tier was viewed as existential to distribution strategy. [CP005, CP006, CP007, CP008, CP018, CP020]

3.4 Autonomous Agent Competitors — Claude Code, OpenAI Codex, and SWE-agent

Claude Code (Anthropic) is arguably Cognition's most direct quality-based competitor in the autonomous agent tier. Claude Code operates as a terminal-native agentic coding assistant layered on top of Anthropic's Claude 3.7 Sonnet and Claude 4 models, which have consistently ranked among the highest performers on SWE-bench Verified—an independent benchmark measuring an agent's ability to resolve real GitHub issues. Devin's initial SWE-bench score of 13.86% (March 2024) was a breakthrough at the time, but subsequent model improvements from Anthropic pushed Claude-based agents above 50%, raising questions about Devin's benchmark differentiation. Claude Code's pricing ($10 per month on Pro, $100 per month on Max) undercuts or matches Devin's Core tier ($20 per month for 5 ACUs) on entry price, though it offers different economics for agentic-scale autonomous use. The key architectural gap is that Claude Code still requires a human operator to supervise at the terminal, whereas Devin is designed to run fully unattended in enterprise cloud environments—a meaningful operational distinction for large-scale automation. OpenAI re-launched Codex in 2025 as a web-based agentic coding environment backed by the o3 model family, positioning it as a direct Devin competitor. Codex Workspace allows users to assign high-level feature tasks and receive completed pull requests, mirroring Devin's core value proposition. Given OpenAI's $300 billion-plus valuation, API distribution, and ChatGPT developer ecosystem, Codex's competitive entry represents a fundamental threat to Cognition's differentiation if OpenAI prioritizes execution quality and aggressive pricing. SWE-agent (Princeton NLP Group) is an open-source research framework rather than a commercial product; it is significant primarily as a benchmark comparator and as a recruitment signal that world-class ML researchers are actively working on autonomous code execution frameworks outside commercial settings, which could accelerate open-source commoditization of the core technology. [CP012, CP013, CP014, CP015, CP016, CP017]

Moat durability / competitive risk register
Moat Factor	Cognition Strength	Durability (1-5)	Primary Threat	Threat Source
Autonomous execution depth	High – full unattended task cycles	3	Rapid capability parity from labs	Anthropic, OpenAI
Enterprise integrations	Medium – Goldman, Citi, Nubank proofs	4	Hyperscaler bundling incentives	Microsoft, Amazon
Windsurf IDE distribution	Medium – 350+ enterprise clients	3	Cursor IDE market dominance	Cursor (Anysphere)
Founder talent + network	High – 3x IOI gold medalists, ex-Scale/DeepMind	4	Talent poaching by frontier labs	OpenAI, Google, Anthropic
Training data flywheel	Medium – speculative, unverified	2	Larger model labs have more training data	OpenAI, Google
Benchmark leadership	Low – lost within months of launch	1	All major competitors surpassed SWE-bench score	All peers
Pricing competitiveness	Low – most expensive per-task	2	Commoditization pressure from free/cheap tiers	Claude Code, Copilot

Durability scores are qualitative analyst assessments; scale 1 (weakest) to 5 (strongest).

[CP026, CP027, CP028, CP032, CP034]

3.5 Moat Assessment and Competitive Dynamics

Cognition's defensible competitive advantages break down into three categories: execution depth, enterprise data flywheel, and distribution via Windsurf. Execution depth—the ability to complete multi-hour, multi-file agentic coding sessions with planning, sandboxed execution, testing, and PR creation—remains ahead of most competitors in verified enterprise deployments (Goldman Sachs 12K developer pilot, Nubank 12x efficiency claim). The enterprise data flywheel from each completed task creates training signal for model fine-tuning, which potentially compounds over time—but this advantage is speculative until Cognition's model quality can be independently verified versus frontier labs. Windsurf integration adds IDE-layer distribution that Devin alone could not have quickly organically scaled. Counterarguments to moat durability are significant. Cursor is growing faster than Cognition on pure ARR metrics and has more paying customers. Frontier model labs (Anthropic, OpenAI) control the underlying reasoning capabilities and can deploy equivalent agentic pipelines without licensing dependencies—giving them structural cost advantages. Benchmark credibility questions persist: Cognition's March 2024 Devin demo was partially disputed, and independent evaluations of SWE-bench performance showed other tools surpassing Devin within months of its launch. The competitive window for pure agentic differentiation is narrowing, making Cognition's enterprise go-to-market execution and customer retention metrics the true leading indicators of sustainable competitive position. Cognition's compliance posture relative to enterprise-grade competitors like Amazon Q Developer (SOC 2 Type II, VPC isolation) is another documented gap that could slow enterprise contract cycles for security-sensitive verticals. [CP026, CP028, CP031, CP035]

FP003: Moat / readiness KPIs

[CP026, CP028, CP035]

Chapter 04

04Financials

4.1 Revenue Model and ARR Trajectory

Cognition AI operates a consumption-based SaaS revenue model anchored to Agent Compute Units (ACUs)—a proprietary token of compute capacity consumed per autonomous task. Individual developers subscribe at $20 per month (Core plan: 5 ACUs), teams at $500 per month (Team plan: 250 ACUs), and enterprises under custom contracts with volume discounts and additional ACU pools. The pricing ladder is designed to monetize proportionally to task complexity and volume, making it structurally different from seat-based tools like GitHub Copilot or Cursor. However, this model creates lumpy revenue: enterprise customers with variable sprint workloads may have uneven monthly consumption patterns, requiring careful cohort-level analysis to distinguish durable ARR from one-time expansion. Cognition's ARR trajectory has been one of the most rapid in enterprise AI history. The company exited the first six months of operations with negligible revenue, crossed $1M ARR in September 2024, reached $73M ARR by June 2025—a 73x increase in nine months—and then jumped to approximately $155M combined ARR following the July 2025 Windsurf acquisition ($82M ARR contributed). The Cognition AI blog post from September 2024 ("Funding, Growth, and the Next Frontier of AI Coding Agents") confirmed the $73M ARR figure and outlined the company's two-product strategy. Post-Windsurf, the combined entity has a bifurcated revenue base: Devin's enterprise automation revenue (high ACV, low seat count) and Windsurf's IDE subscription revenue (lower ACV, higher seat count). Separating and growing both lines while integrating the teams is a material execution risk. As of April 2026, reports suggest the company is in discussions for additional financing at a potential $25B+ valuation, implying investors believe ARR will continue growing materially above $155M. However, post-acquisition ARR disclosed figures should be interpreted cautiously: the $82M Windsurf ARR contribution was measured at time of acquisition and may not reflect post-migration retention rates.

Revenue streams table
Revenue Stream	Product	Model	ACV Range	Launched	Est. Contribution to ARR
Individual subscriptions	Devin Core	Usage-based, $20/mo (5 ACUs)	$240/yr	2024	~$5M est.
Team subscriptions	Devin Teams	Usage-based, $500/mo (250 ACUs)	$6,000/yr	2024	~$25M est.
Enterprise contracts	Devin Enterprise	Custom, VPC deployment	$50K–$1M+ /yr	2024	~$43M est.
IDE subscriptions (Windsurf)	Windsurf Free/Pro	Seat-based, $0–$15/mo	~$120/yr paid seat	Pre-acq. (2024)	~$30M est.
IDE enterprise (Windsurf)	Windsurf Enterprise	Custom enterprise contracts	$50K–$500K/yr	Pre-acq. (2024)	~$52M est.
Usage overages	Devin all plans	Per-ACU metered, ~$2.25/ACU	Varies	2024	Not disclosed

ARR contribution estimates are analyst approximations; Cognition does not disclose per-stream revenue breakdown.

[CI001, CI002, CI003, CI008]

FI001: Revenue model bridge

[CI003, CI004, CI007]

4.2 Funding History and Capital Structure

Cognition AI has raised approximately $696 million in venture capital across three primary rounds in under 18 months, making it one of the fastest-funded AI infrastructure companies on record. The seed/Series A in March 2024 raised approximately $21 million at a $350 million pre-money valuation from Founders Fund, signaling early conviction in the founding team before any product revenue. In April 2024, just weeks after launch, Cognition closed $175 million at a $2 billion post-money valuation—again led by Founders Fund—crossing unicorn status within six months of founding. The September 2025 Series B raised $400 million at a $10.2 billion post-money valuation, with co-investors including Lux Capital, 8VC, Elad Gil, Bain Capital Ventures, D1 Capital, Definition Capital, and Swish Ventures. VentureBeat and TechCrunch confirmed this round in September 2025. The company disclosed a remarkably low net cash burn of under $20 million from founding through Q3 2025—extraordinary given the fundraising scale—suggesting the bulk of capital was held in reserve rather than spent on headcount or infrastructure at a rate typical for similar-stage companies. The Windsurf acquisition in July 2025 was estimated at approximately $250 million, funded from existing capital reserves without a new primary raise—a sign of financial discipline given the scale of the transaction relative to employee count. Post-acquisition headcount reportedly reached approximately 249 before layoffs of 30 employees and the departure of Windsurf personnel who accepted the 9-month buyout offer.

Pricing / monetization table
Plan	Product	Price	ACU Allowance	Target Segment	Overage Rate
Core	Devin	$20/month	5 ACUs	Individual developers	~$2.25/ACU
Team	Devin	$500/month	250 ACUs	Engineering teams	~$2.25/ACU
Enterprise	Devin	Custom	Custom pool	Enterprise orgs	Negotiated
Free	Windsurf IDE	$0/month	Limited AI requests	Hobbyist/student	N/A
Pro	Windsurf IDE	~$15/month	Standard AI credits	Individual devs	Varies
Enterprise	Windsurf IDE	Custom	Custom	Enterprise orgs	Negotiated

Windsurf pricing as of acquisition July 2025; subject to change post-integration with Cognition pricing structure.

[CI001, CI002, CI005]

FI002: Unit economics bridge

[CI006, CI035, CI011]

4.3 Unit Economics and Margin Structure

Cognition's unit economics are difficult to model precisely because the company does not disclose gross margins, customer acquisition cost (CAC), or cohort-level retention data. However, structural inferences are possible. The ACU model implies a cost-of-revenue that is partially variable (inference compute per task) and partially fixed (model hosting, sandbox environments, CI/CD infrastructure). Enterprise AI inference at frontier model scale (likely GPT-4 or Claude API calls per task) can cost $5–$50 per hour of agent runtime at current cloud spot prices, while enterprise ACU packages price at roughly $2.25 per ACU with 250 ACUs per Team plan month—creating a gross margin that depends heavily on task length and model call efficiency. High-value enterprise contracts (Goldman Sachs, Citi, Nubank) likely have custom pricing with volume discounts that could improve margin at scale if infrastructure costs are amortized across large task volumes. The Nubank case study reporting a 12x efficiency improvement in code migration workflows suggests that customers perceive strong value relative to developer hours saved—a favorable indicator for pricing power. However, the inherent challenge for fully autonomous coding agents is that task complexity variance is high: simple bug fixes cost little to execute but complex multi-week feature builds can consume compute budget disproportionately. Unless Cognition gates task scope at the enterprise level, margin management at scale will require sophisticated compute forecasting. The company's capital efficiency pre-acquisition (sub-$20M burn on $350M+ raised before revenue scale) contrasts sharply with the likely post-acquisition cost structure. Adding 250 employees (Windsurf personnel) to a ~49-person team more than quadruples headcount, compressing per-employee revenue significantly in the near term unless post-integration synergies materialize quickly.

Unit economics table
Metric	Estimate	Basis	Confidence	Caveat
Gross margin (est.)	50–70%	Inferred from compute cost vs. ACU pricing	low	Not disclosed; highly variable by task complexity
Inference cost per ACU (est.)	$0.50–$1.50	Market rates for frontier model inference	low	Depends on model mix and task length
Customer ACV (enterprise)	$50K–$1M+	Industry comp, customer case studies	medium	Range is wide; no primary disclosure
Net revenue retention (est.)	100–130%	Inferred from ARR growth trajectory	low	No NRR data publicly disclosed
CAC (est.)	Not disclosed	N/A	unknown	No marketing spend or sales data published
ARR per employee (pre-Windsurf)	~$1.5M	~49 employees, $73M ARR as of June 2025	medium	Headcount estimate is approximate
ARR per employee (post-Windsurf)	~$623K	~249 employees, $155M ARR as of July 2025	medium	Includes all acquired Windsurf staff before departures

All unit economics are analyst estimates; no primary financial disclosure exists. Treat all values as directional only.

[CI006, CI009, CI010, CI011]

FI003: Financial estimate range

[CI012, CI013, CI014, CI016]

4.4 Public Financial Data Gaps and Disclosure Limitations

Cognition AI is a privately held Delaware C-corporation with no public filing obligations, no disclosed GAAP financial statements, and no investor day or earnings calls. All financial data in this chapter relies on: (1) company-authored blog posts; (2) third-party news reports citing unnamed sources; (3) investor communications and press release summaries; and (4) secondary market research estimates. This disclosure posture is typical for pre-IPO AI companies but creates material risk for investors and customers attempting to verify sustainability of the ARR trajectory. The most significant undisclosed financial metrics are: gross margin by product line (Devin vs. Windsurf), customer churn and net revenue retention (NRR), CAC and payback period, enterprise deal sizes and contract lengths (ACV), and post-Windsurf integration costs. Without NRR data, it is impossible to distinguish whether the $73M to $155M ARR jump reflects organic customer expansion or purely inorganic acquisition contribution. Enterprise AI tool contracts in other companies typically show NRR of 100–140% when adoption is strong; if Cognition's NRR is below 100%, the underlying Devin-only ARR may be stagnating despite the Windsurf injection. Sacra, CB Insights, and other secondary research platforms provide partial revenue and headcount estimates that must be treated as approximations. A secondary concern is the reported $25B+ valuation discussions in early 2026, which would imply a forward revenue multiple of approximately 160x on $155M ARR. While extreme multiples are historically common for hypergrowth AI companies in their early stages, they are sustainable only if ARR continues growing at 2x+ annually—a bar that requires substantial new enterprise customer acquisition and retention of the Windsurf base. Any deceleration in ARR growth would make the valuation multiple compress significantly.

Capital adequacy table
Round	Date	Amount	Valuation (post-money)	Lead Investors	Confirmed Source
Seed/Series A	Mar 2024	$21M	$350M	Founders Fund	Multiple news reports
Series A extension	Apr 2024	$175M	$2B	Founders Fund	Axios, CNBC, multiple
Series B	Sep 2025	$400M	$10.2B	Lux Capital, 8VC, Elad Gil, BCV, D1	TechCrunch, VentureBeat, CNBC
Total raised (through Series B)	Sep 2025	~$596M	$10.2B	Multiple	Summed from above
Windsurf acquisition	Jul 2025	~$250M (est.)	N/A	Cognition (acquirer)	Widely reported estimate
Potential new round (rumored)	2026	Not disclosed	$25B+ (reported)	Not disclosed	Press reports, not confirmed

Series B total excludes the Windsurf acquisition cost; combined capital deployed (raised + acquisition) is approximately $846M. Founders Fund IX Form D/A (SEC, Oct 2025) confirms $972M LP capital base.

[CI012, CI013, CI014, CI015, CI016, CI024]

FI004: Capital intensity / cash-flow map

[CI015, CI019, CI020]

4.5 Capital Adequacy and Financial Runway

With approximately $696M raised and sub-$20M cash burn disclosed through Q3 2025, Cognition had substantial financial runway heading into 2026 even before any new financing. The Windsurf acquisition consumed an estimated $250M of that reserve, leaving approximately $426M available minus ongoing operating expenses. At a conservative 200-employee burn rate of $400K per employee-year (blended fully-loaded cost), annual operating costs could be $80M or higher excluding compute. If the company is generating $155M ARR with moderate gross margins (estimated 50–70% given inference costs), gross profit could fund ongoing operations without immediate new capital. However, the $25B+ valuation discussions suggest the company is seeking capital to accelerate—not merely to sustain—operations. Expected uses of new capital include R&D for Devin next-generation models, enterprise sales force expansion, global data center capacity for inference at scale, and potential further acquisitions to consolidate the AI coding infrastructure market. The Founders Fund lead investor relationship provides additional runway optionality: Founders Fund has historically shown willingness to lead follow-on rounds for breakout portfolio companies, reducing financing risk. The presence of large institutional co-investors (D1 Capital, Bain Capital Ventures) signals access to crossover and late-stage capital. Overall capital adequacy is rated medium-high: the company has sufficient runway for at least 24 months at current burn estimates, but aggressive growth execution and integration costs from Windsurf could accelerate burn faster than current estimates suggest.

Public financial gaps table
Missing Metric	Why It Matters	Severity	Proxy Used
Gross margin by product line	Determines profitability and long-term unit economics	High	Industry comp: 50–70% est.
Net revenue retention (NRR)	Indicates whether ARR is organically durable	High	Not available; assume 100–130%
Customer churn rate	Validates stickiness of enterprise contracts	High	Not disclosed; inferred from case studies
CAC and payback period	Determines GTM efficiency at scale	Medium	Not available
GAAP revenue vs. ARR	Deferred revenue may differ from ARR	Medium	ARR used as proxy
Windsurf post-acquisition retention	Critical for combined ARR durability	High	Not yet disclosed

Gaps listed represent the most decision-relevant undisclosed financial data as of May 2026 research date.

[CI017, CI018]

Chapter 05

05Product & Technology

5.1 Product Definition and Customer Workflow

Cognition AI offers two core products: Devin, a cloud-hosted autonomous AI software engineer, and Windsurf, an agentic IDE for local and cloud-assisted development. Devin accepts natural-language task descriptions—submitted via the web dashboard, Slack, Linear, or Jira—and executes full end-to-end software engineering tasks autonomously: planning the approach, writing and modifying code, running tests, iterating on failures, and opening pull requests with reviewable diffs and confidence indicators. Unlike code-completion tools (GitHub Copilot, Cursor), Devin takes ownership of entire tasks rather than assisting a human in writing individual lines. In a typical enterprise workflow, a developer or engineering manager assigns a Jira ticket or Slack message to Devin. Devin reads the ticket, explores the repository context using its DeepWiki server, sets up the necessary environment, writes the implementation, runs the test suite, and reports back with a pull request. The PR includes Devin's internal trace of decisions for human review. The Devin 2.0 release added a confidence meter that quantifies the probability of task success before committing, allowing teams to triage tasks efficiently. Devin can handle tasks concurrently across multiple sessions, enabling teams to delegate entire sprint backlogs rather than individual tickets. Windsurf IDE complements Devin by providing a local development environment with AI Cascade agent integration. Developers can start a coding session locally in Windsurf and hand off complex, compute-intensive work to Devin in the cloud mid-session, with state preserved across the transition. The "Devin in Windsurf" feature announced in April 2026 formalizes this local-to-cloud handoff workflow. Together, the two products target the full developer lifecycle: Windsurf for daily coding assistance and Devin for delegated autonomous task execution.

Product/SKU map table
Product	Delivery Mode	Target User	Key Features	Pricing Tier	Launched
Devin Free	Cloud agent	Individual dev	Limited ACUs, web dashboard, basic integrations	Free	Apr 2026
Devin Pro	Cloud agent	Individual dev	More ACUs, API access, confidence meter	Paid ($20/mo base)	Apr 2025
Devin Max	Cloud agent	Power user	High ACU pool, priority queue	Paid (custom)	Apr 2026
Devin Teams	Cloud agent	Dev teams	Shared ACU pool, team management, Slack/Jira integration	Team pricing	2025
Devin Enterprise	Cloud or VPC	Enterprise org	VPC deploy, custom-trained Devin, dedicated support	Custom contract	2024
Windsurf IDE (Free/Pro)	Desktop IDE	Individual dev	Cascade AI agent, in-editor completions, Codemaps	Free/$15/mo	2024
Windsurf Enterprise	Desktop IDE + cloud	Enterprise org	SSO, centralized billing, enterprise support	Custom contract	2024
Devin API	REST API	DevOps/CI	Programmatic session management, CI/CD integration	Included in paid plans	2025

Plan structure as of May 2026 research date; April 2026 plan refresh replaced legacy Core/Team plans with Free/Pro/Max/Teams/Enterprise structure.

[CE001, CE008, CE009, CE025, CE026]

FE001: Devin task lifecycle flow

[CE003, CE010, CE017, CE021]

5.2 Product and Technology Map

Cognition AI's product portfolio as of May 2026 comprises four distinct product units: (1) Devin Core/Pro/Max/Enterprise—cloud agent plans for individual, team, and enterprise customers; (2) Windsurf IDE—the agentic desktop IDE with Cascade AI agent, acquired from Codeium in July 2025; (3) the Devin API—a RESTful API for programmatic session creation and CI/CD pipeline automation; and (4) Windsurf Enterprise—the enterprise tier of the IDE with SSO, centralized billing, and advanced admin controls. A fifth element, Windsurf Codemaps, is an AI-annotated structured codebase map powered by SWE-1.5 and Claude Sonnet 4.5, enabling rapid onboarding and debugging by grounding navigation to exact lines and visual node graphs. The April 2026 plan revision retired the original Core and Team plans, replacing them with a five-tier structure: Free (limited ACUs), Pro, Max, Teams, and Enterprise. This signals a land-and-expand strategy: the Free tier provides entry-level exposure for individual developers and bootcamp students, while Pro and Max serve power users, and Teams/Enterprise capture organizational deployment. SWE-Check, introduced in collaboration with Applied Compute in April 2026, is a specialized bug detection model trained via reinforcement learning that matches Opus 4.6 on internal evals while running approximately 10x faster—positioned as a cost-efficient quality gate within the Devin agent loop. The company's in-house model, SWE-1.5, powers both SWE-Check and Windsurf Codemaps, representing Cognition's first proprietary model release beyond the Devin runtime ensemble. All products are delivered cloud-first, with VPC deployment available for enterprise customers requiring on-premise or isolated cloud infrastructure. The Devin for Terminal feature, released in 2026, enables developers to start agent sessions from their local terminal and escalate to the cloud when the task outgrows local resources, bridging the CLI and cloud deployment modes.

Integration ecosystem table
Category	Platform	Integration Type	Capability	Status
Source control	GitHub	Native	PR creation, repo access, issue tracking	GA
Source control	GitLab	Native	Branch management, MR creation	GA
Source control	Bitbucket	Native	PR creation, code contribution	GA
Project management	Jira	Native	Ticket assignment, status sync	GA
Project management	Linear	Native	Issue tracking, sprint delegation	GA
Communication	Slack	Native	Session initiation, progress updates	GA
Communication	Microsoft Teams	Native	Session initiation, notifications	GA
Monitoring	Sentry / Datadog / PagerDuty	MCP	Alert-triggered sessions, log context	MCP marketplace
Databases	PostgreSQL / MySQL / MongoDB	MCP	Read/write access for data tasks	MCP marketplace
Documentation	Notion / Confluence	MCP	Context retrieval for tasks	MCP marketplace

MCP Marketplace integrations are third-party configured; availability depends on customer configuration. Native integrations are Cognition-maintained.

[CE005, CE006, CE007, CE018, CE030]

FE002: SWE-bench performance bar chart

[CE002, CE016, CE020, CE023]

5.3 Technical Architecture and Agent Runtime

Devin's core technical architecture centers on an agent runtime that manages long-horizon planning and multi-step execution within a sandboxed cloud environment. The agent receives a natural-language task specification and decomposes it into an ordered sequence of sub-steps: repository exploration, environment setup, implementation, test execution, and pull request creation. At each step, the agent calls specialized tools—a code editor, a Unix shell, and a headless browser— within an isolated compute container that prevents lateral access to other sessions or customer data. This sandboxed design separates Devin's execution context from the customer's production environment, with code changes surfaced only through pull requests that require human approval before merge. The DeepWiki server enables large-codebase comprehension by building vectorized project graphs that represent relationships between files, functions, and modules. This graph-based representation allows Devin to navigate million-line codebases more effectively than purely token-based context windows. BlockDiff snapshotting records incremental state checkpoints during task execution, enabling rapid rollback when a test fails or an approach is abandoned. These two features—DeepWiki and BlockDiff—are among Cognition's most differentiating technical capabilities, as they address the two primary failure modes of LLM-based coding agents: context loss in large repositories and unrecoverable error cascades. The proprietary SWE-1.5 model, released in 2025-2026, underlies SWE-Check and Windsurf Codemaps. It was trained via reinforcement learning specifically for software engineering tasks, contrasting with the general-purpose LLMs used as base layers in most competitor products. The agent runtime also supports Model Context Protocol (MCP), allowing Devin to connect to external tooling including Sentry, Datadog, PostgreSQL, MongoDB, Notion, and hundreds of other services through an extensible plugin marketplace. ACUs (Agent Compute Units) are the metered token of computation consumed per task, priced at approximately $2.25 per ACU with plan-included allocations varying by tier.

SWE-bench performance comparison table
Agent / Model	Benchmark Version	Score (% Resolved)	Evaluation Mode	Date	Source
Devin (Cognition AI)	Full (25% subset)	13.86%	Unassisted	Mar 2024	Cognition AI blog
Prior SOTA (best LLM)	Full	1.96%	Unassisted	Mar 2024	SWE-bench leaderboard
Best assisted LLM	Full	4.80%	Assisted (files given)	Mar 2024	SWE-bench leaderboard
SWE-agent (open source)	Lite	12.47%	Unassisted	Mar 2024	SWE-bench / Princeton
Claude Code (Opus 4)	Verified (500)	72.5%	Unassisted	2025	Anthropic
mini-SWE-agent	Verified (500)	65%	Unassisted	Jul 2025	SWE-bench update
Devin PR acceptance trend	AIDev dataset (7,156 PRs)	+0.77%/week	Real-world PRs	32 weeks	Arxiv 2026

Different benchmark versions (Full vs. Verified vs. Lite) are not directly comparable. Cognition's 13.86% used a 25% random subset of the Full benchmark. Claude Code's 72.5% uses the 500-instance Verified subset.

[CE002, CE020, CE023, CE024, CE035]

FE003: Product capability KPIs

[CE002, CE004, CE014, CE016, CE026]

5.4 Deployment, Integration, and Reliability

Devin supports two primary deployment modes: cloud-hosted (the default, operating in Cognition-managed infrastructure) and VPC deployment for enterprise customers requiring data residency, network isolation, or regulatory compliance. VPC mode allows Devin to operate within the customer's AWS, GCP, or Azure environment, eliminating the need to transmit source code to Cognition's servers. Custom-trained Devin instances are available at the enterprise tier, allowing customers to fine-tune the agent on their proprietary codebases and internal conventions. The platform integrates natively with GitHub, GitLab, and Bitbucket for source control (PR creation, code review automation, branch management); with Jira and Linear for ticket assignment and status tracking; and with Slack and Microsoft Teams for communication-channel session initiation and progress reporting. The REST API enables programmatic session management for CI/CD pipelines, allowing engineering organizations to trigger Devin sessions automatically on specific events (e.g., failing test suite, customer bug report). The MCP Marketplace extends integration to monitoring platforms (Sentry, Datadog, PagerDuty), databases (PostgreSQL, MySQL, MongoDB), and documentation tools (Notion, Confluence). The Windsurf IDE integrates Cascade AI agent locally and the "Devin in Windsurf" handoff for cloud-based tasks. Reliability is not formally disclosed through uptime SLAs or status page data in Cognition's public materials. Task execution time is bounded by a 45-minute maximum runtime per session, though tasks can be broken into sequential sub-sessions for longer projects. The confidence meter in Devin 2.0 provides a pre-execution estimate of task success probability, enabling teams to filter out low-probability tasks before consuming ACUs. These quality controls reduce waste but also signal that Devin's reliability on complex tasks remains probabilistic rather than guaranteed.

Technical architecture components table
Component	Function	Inputs	Outputs	Proprietary?
Agent runtime	Task decomposition, multi-step planning	Natural language task spec	Ordered sub-task plan	Yes
Sandboxed environment	Isolated compute container per session	Agent commands	Code edits, shell output, PR diff	Yes (cloud-hosted)
DeepWiki server	Vectorized codebase graph for context	Repository files	Graph-indexed code map	Yes
BlockDiff snapshots	Incremental state checkpoints for rollback	Session state at each step	Checkpoint index, rollback target	Yes
SWE-1.5 model	Bug detection (SWE-Check) + Codemaps	Code snippets, repository graphs	Bug likelihood score, annotated map	Yes
MCP layer	External tool connectivity	MCP-compliant tool specs	Live tool data in agent context	Open protocol
Devin API	Programmatic session management	HTTP requests with task spec	Session ID, status, PR URL	Yes (REST)

Proprietary components are Cognition-developed and not open-sourced; MCP is an open protocol developed by Anthropic and adopted by Cognition.

[CE003, CE013, CE021, CE032, CE033]

FE004: Technical architecture layers flow

[CE003, CE008, CE013, CE032, CE033]

5.5 Technology Differentiation and Competitive Moats

Cognition AI's primary technical differentiator is the end-to-end autonomous task ownership model: Devin does not suggest code for a human to accept or reject, but independently plans, executes, and delivers a complete task artifact. This positions Devin against code-completion tools (GitHub Copilot, Cursor, Claude Code) rather than as a marginal improvement over them. The SWE-bench Full score of 13.86% set a new state-of-the-art for unassisted autonomous agents in March 2024, exceeding the prior best of 1.96% by 7x and exceeding the prior assisted SOTA of 4.80% by 3x. Devin's PR acceptance rate showed a consistent positive trend of +0.77% per week over 32 weeks, the only agent in a comparative 7,156-PR study to exhibit sustained improvement (Arxiv, 2026). The in-house SWE-1.5 model and SWE-Check tool represent Cognition's nascent vertical integration in model training. Most competitor AI coding agents (GitHub Copilot, Cursor, Windsurf pre-acquisition) are built on top of third-party foundation models (GPT-4, Claude, Gemini), giving those providers structural leverage over tool vendors. By training SWE-1.5 for software engineering tasks specifically, Cognition is building a model layer that is not dependent on API pricing from OpenAI or Anthropic for its most specialized use cases. The Windsurf acquisition added significant IP: the Cascade agentic workflow engine, the Windsurf IDE user base, and 350+ enterprise accounts, accelerating the product suite without requiring multi-year IDE development. Data flywheel advantages are present but not publicly quantified. Each Devin task generates execution traces that can be used to fine-tune future model versions. As enterprise customers use custom-trained Devin instances on their proprietary codebases, Cognition accumulates task-specific performance data that competitors without the same enterprise relationship cannot easily replicate. The DeepWiki codebase graph representation and BlockDiff snapshotting are proprietary engineering innovations that provide functional advantages in large-codebase contexts where pure-attention LLMs lose coherence.

Trust, security, and compliance table
Control	Type	Status	Details	Disclosed?
Session sandbox isolation	Architecture	Confirmed	Each session runs in an ephemeral container	Yes (official docs)
PR-gated code changes	Process	Confirmed	All code changes require human approval before merge	Yes
VPC deployment	Architecture	Available (Enterprise)	Customer-controlled cloud perimeter	Yes
Execution trace logging	Audit	Confirmed	Full log of shell commands, file edits, API calls	Yes
SOC 2 Type II	Certification	Not disclosed	No public attestation as of May 2026	No
ISO 27001	Certification	Not disclosed	No public attestation	No
Prompt-injection defense (patched)	Security	Patched Dec 2024	Critical vuln found live-streamed; Cognition patched	Adverse
Data retention policy	Privacy	Not disclosed	Code submitted to cloud sessions; retention unclear	No

Absence of disclosed SOC 2 does not mean the certification does not exist; it may exist under NDA for enterprise customers. Cognition's trust.devin.ai page returned limited information at research date.

[CE015, CE028, CE029, CE031]

5.6 Trust, Safety, Security, and Compliance

Devin's primary security architecture relies on sandboxed compute isolation: each session runs in an ephemeral container with no access to other sessions or the customer's production environment unless explicitly granted through integration credentials. Code changes are surfaced as pull requests requiring human approval, and the Devin execution trace provides an auditable log of every shell command, file edit, and API call made during the session. VPC deployment adds an additional isolation layer by keeping source code within the customer's own cloud perimeter. A significant security incident emerged in late 2024 when a live-streamed demonstration exposed a major vulnerability in Devin's system prompt handling—effectively allowing prompt-injection attacks to manipulate Devin's behavior. Cognition acknowledged the issue and patched it rapidly, but the incident highlighted the inherent risks of deploying autonomous agents that can execute shell commands based on text inputs. Hacker News commentary characterized the failure as "amateurish given the severity," noting that prompt-injection defenses should have been a first-priority security control for an agent with shell execution capabilities. Cognition has not publicly disclosed SOC 2 Type II certification, ISO 27001, or equivalent enterprise security certifications as of May 2026. The trust.devin.ai subdomain exists but returned limited public information at time of research. Enterprise customers relying on VPC deployment and custom-trained models are likely subject to separate security agreements, but these are not publicly auditable. Data privacy and retention policies for code executed in Devin sessions are not fully disclosed in Cognition's public-facing documentation, representing a material compliance gap for financial-services and regulated-industry customers. The Mercedes-Benz and Goldman Sachs partnerships suggest compliance barriers have been addressed in custom agreements, but without public disclosure, independent assessment is not possible.

Chapter 06

06Customers

6.1 Customer Base Segmentation

Cognition AI targets professional engineering teams at technology-forward enterprises, mid-market companies, and developer-first organizations. The buyer profile is an engineering leader or CTO seeking to multiply developer throughput without proportional headcount growth; the user is typically an individual software engineer or engineering manager who delegates tasks directly via Slack, Jira, or the Devin web interface. The payer is the engineering budget owner, whether a department budget or a centralized IT procurement function at large enterprises. Verticals with confirmed production deployments include financial services and fintech (Nubank—LatAm's largest neobank), automotive/manufacturing (Mercedes-Benz), IT services and outsourcing (Cognizant), enterprise technology, and the US Federal Government. The COBOL modernization blog post confirmed Fortune 500 deployments in sectors that still run significant COBOL workloads: financial services, insurance, and public utilities. In 2026, Cognition launched a Government vertical to target legacy software modernization in US federal agencies. Japan and Singapore enterprise markets were opened in April 2026, suggesting Southeast Asia financial services, manufacturing, and technology firms are the next target segments. The geographic distribution is US-first (the majority of revenue and named customers as of early 2026), with Europe (London office opened January 2026) and APAC (Japan and Singapore offices opened April 2026) serving as growth centers. Channel distribution includes direct enterprise sales, the Windsurf IDE free-to-paid conversion funnel (250K+ daily active users at acquisition), and the Cognizant reseller partnership that deploys Devin and Windsurf across Cognizant's own engineering teams and its global client base. The Windsurf integration creates a bottom-up, developer-led adoption vector in addition to top-down enterprise procurement.

Customer segmentation table
Segment	Buyer Type	User Type	Use Case	Scale / Value	Evidence Level
LatAm Fintech Enterprise	CTO / Eng Leader	SWE / EM	Legacy ETL migration, code modernization	100M users, 6M+ LoC codebase	Named (Nubank), quantified
Global Automotive / Manufacturing	CTO / VP Eng	SWE teams	Legacy modernization, cloud-native dev, logistics	Global enterprise	Named (Mercedes-Benz), announced
IT Services / Outsourcing	IT Exec / Channel	SWE teams + client engineering	Deploy Devin to own and client engineering teams	300+ client organizations	Named (Cognizant), partner
Fortune 500 COBOL Legacy	VP Eng / CIO	Legacy SWE	COBOL modernization to modern stacks	Multi-decade COBOL systems	Unnamed, blog-disclosed
US Federal Government	Agency IT exec	Agency SWE	Critical infrastructure modernization	Federal agency scale	Vertical launched (Feb 2026), no named agencies
APAC Enterprise	CTO / Eng Leader	SWE teams	Software production across SE Asia, Japan	Large-enterprise	Market opened Apr 2026, no named customers
Individual Developer (SMB/indie)	Self	Self	Coding automation, personal projects	Low value, high volume	Self-serve Free/Pro tier

Evidence level reflects public disclosure depth. Mercedes-Benz, Cognizant, and COBOL Fortune 500 are at announcement stage without published outcome metrics.

[CU001, CU002, CU003, CU004, CU005]

FU001: ARR growth trajectory bar chart

[CU006, CU007, CU023]

6.2 Adoption Trajectory and Growth Metrics

Cognition AI's most striking adoption signal is the ARR growth trajectory: from approximately $1M ARR at general availability (December 2024) to approximately $73M ARR by April 2025—a roughly 73x increase in four months. This rate of growth is consistent with Sacra's estimate of ~$15M ARR in October 2024 and the Growjo platform's current estimate of $73M, though both figures are third-party estimates rather than audited or company-disclosed financials. The April 2025 pricing reset (Core plan from $500/month to $20/month base) accelerated individual developer adoption while maintaining enterprise revenue through volume. Internal usage signals are revealing. By February 2026, Cognition's own engineering team was merging 659 Devin PRs per week—four times the 154 per week achieved at their best week in 2025. This "dog-fooding" signal is significant: the team building Devin is also its most intensive user, providing a high-quality feedback loop that likely accelerates iteration velocity. The PR merge rate improved from 34% at Devin's launch to 67% by April 2025—a proxy for output quality improvement—though the metric is self-reported and denominator methodology is not specified. Geographic expansion (London, Tokyo, Singapore) in the first half of 2026 suggests management confidence in revenue sufficiency to support multi-office overhead. The Cognizant partnership is particularly significant as a channel amplifier: Cognizant's global client base spans 300+ organizations across financial services, manufacturing, and healthcare, meaning even partial penetration could materially accelerate Devin's enterprise account count without proportional Cognition sales headcount. However, no public data exists on the number of active paying accounts, the average contract value, or the customer growth rate beyond the ARR estimates.

Customer growth and adoption trajectory table
Metric	Value	Date	Source	Confidence	Implication
ARR at GA launch	~$1M	Dec 2024	Third-party estimate (Sacra/imseankim)	Low	Early adopter only; pricing was $500/month
ARR after pricing reset	~$73M	Apr 2025	Third-party estimate (Growjo / imseankim)	Low-medium	70x+ growth in ~4 months with price cut to $20/month
ARR per Growjo (2025)	$73M estimated	Sep 2025	Growjo revenue estimate	Low	Third-party model; not company-disclosed
PR merge rate at launch	34%	Mar 2024	Company-claimed (imseankim/VentureBeat)	Medium	2 in 3 initial PRs were rejected; low initial quality
PR merge rate Apr 2025	67%	Apr 2025	Company-claimed (imseankim)	Medium	Quality doubled in ~1 year; still 33% rejection
Cognition's own Devin PRs/week	659	Feb 2026	Company blog (official)	High	Internal dog-fooding at high intensity; 4x growth from best 2025 week
Windsurf DAU at acquisition	250,000+	Jul 2025	Company-claimed (Cognition blog)	High	Large Windsurf user base as acquisition conversion funnel
Employee count growth	~102% YoY	2025	Growjo estimate	Low	Fast headcount growth consistent with revenue acceleration

ARR figures are third-party estimates, not company-disclosed. PR merge rate metrics are company-claimed. Missing denominator: total account count and ACV breakdown unknown.

[CU006, CU007, CU008, CU009, CU010]

FU002: Key customer and adoption KPIs

[CU006, CU007, CU008, CU009, CU010, CU012]

6.3 Named Customer Proof and Reference Quality

The highest-quality named customer proof is the Nubank case study, the only publicly detailed and quantified production deployment. Nubank, Latin America's largest neobank (approximately 100 million customers), used Devin to migrate a 6-million-line ETL monolith to sub-modules—a task originally estimated to require 1,000+ engineers over 18 months. With Devin, the migration progressed in weeks for each business unit (Data, Collections, Risk), achieving 12x engineering efficiency improvement in hours saved and 20x cost savings versus the all-human baseline. Crucially, the case study documents fine-tuning on the customer's specific migration patterns, which led to a 4x speed improvement (40 minutes per task to 10 minutes) and a 2x accuracy improvement on the internal benchmark. Mercedes-Benz is named in a blog post (April 27, 2026) as deploying Devin and Windsurf "across its global engineering organization" for legacy modernization, cloud-native development, and logistics. No outcome metrics are publicly disclosed; the announcement is at the partnership/announcement stage rather than a post-hoc case study. Cognizant (announced January 28, 2026) is deploying Devin and Windsurf "across its engineering organization and global clients"—this is a channel partnership rather than a named end-customer, and no production outcome data is available. Additional unnamed customer evidence: Fortune 500 COBOL modernization deployments (blog post April 8, 2026, no named customers), US Government (Cognition for Government launched February 25, 2026, no named agencies), Japan enterprise (launched April 9, 2026, no specific customers named). The depth and freshness of proof varies enormously: Nubank is production with quantified outcomes; others are at announcement stage. This creates a concentration of evidence quality risk—if Nubank is not representative of typical deployments, the customer proof base is shallow.

Named customer proof table
Customer	Segment	Use Case	Production vs Pilot	Outcome	Limitation
Nubank (LatAm neobank)	Fintech	ETL monolith migration (6M LoC, 100K data classes)	Production	12x efficiency gain, 20x cost savings; weeks vs months	Fine-tuning required; Cognition-published case study
Mercedes-Benz	Automotive	Legacy modernization, cloud-native dev, logistics	Announced (Apr 2026)	Not disclosed	Announcement only; no outcome metrics
Cognizant	IT Services / Channel	Deploy Devin + Windsurf across own org + client base	Partnership (Jan 2026)	Not disclosed	Channel partnership; no end-customer or outcome data
Fortune 500 COBOL (unnamed)	Multiple legacy verticals	COBOL modernization to modern stacks	Production (implied)	Not quantified; blog describes ongoing deployments	Unnamed; blog-only disclosure
Cognition AI itself	AI SaaS / internal	Devin builds Devin; 659 PRs/week merged	Production (internal)	4x growth in weekly Devin PRs (2025→2026)	Self-referential; no independent verification
US Government (agency unnamed)	Federal government	Critical infrastructure software modernization	Vertical launched	Not disclosed	No named agencies; FedRAMP status unknown

Only Nubank has quantified production outcomes in a published case study. All other named relationships are at announcement stage or are channel partnerships without end-customer disclosure.

[CU003, CU004, CU005, CU011, CU012]

FU003: Customer acquisition and expansion flow

[CU017, CU018, CU019, CU021]

6.4 Retention, Satisfaction, and Durability

Publicly available retention and satisfaction metrics for Cognition AI are limited. The company has not disclosed Net Revenue Retention (NRR), Gross Revenue Retention (GRR), customer churn rate, or cohort analysis data. The most proximate public indicator of satisfaction is the PR merge rate (67% as of April 2025, up from 34% at launch), which measures the share of Devin-opened pull requests that are accepted by human reviewers—a reasonable proxy for output quality but not a standard customer satisfaction metric. A 67% PR acceptance rate implies that one-third of Devin's work products require rejection or significant rework, which may be acceptable for high-volume autonomous tasks but is a notable quality gap versus human engineers. Developer sentiment from independent sources is mixed. Hacker News communities have noted that early Devin demos appeared polished but real-world performance on complex tasks lagged expectations set by the original 13.86% SWE-bench announcement. The independent reviewer at imseankim.com—after waiting six weeks to observe the product in production—noted that "independent testers tell a more complicated story" relative to Cognition's official performance numbers, and flagged that ACU costs accumulate quickly in practice, with moderate usage easily exceeding $100-200 per month on the nominally $20/month Core plan. The security vulnerability disclosed in December 2024 temporarily damaged trust among security-conscious enterprise customers. On durability signals, the Nubank deployment describes continued and expanding use (data, collections, and risk business units progressively adopting Devin). Internal Cognition usage of Devin (659 PRs/week merged in Feb 2026) suggests strong product-led retention. The Cognizant and Mercedes-Benz partnerships represent structural switching costs once integrated into enterprise workflows. However, the contract length, renewal rates, and post-deployment satisfaction at named enterprise customers are not publicly known.

Retention, repeat usage, and satisfaction table
Metric	Value	Segment	Confidence	Diligence Ask
NRR (Net Revenue Retention)	Not disclosed	All segments	Unknown	Request from Cognition: NRR by cohort and ACV band
GRR (Gross Revenue Retention)	Not disclosed	All segments	Unknown	Request from Cognition: annual contract renewal rate
Churn rate	Not disclosed	All segments	Unknown	Request: monthly and annual subscriber churn by plan tier
PR merge rate	67% (Apr 2025)	All Devin users	Medium (company-claimed)	Verify with independent PR audit or customer interviews
Developer satisfaction (HN)	Mixed—some enthusiastic, some disappointed	Developer community	Low	Commission independent satisfaction survey of enterprise users
Nubank expansion depth	3+ business units using Devin	Fintech enterprise	High (case study)	Request: duration of Nubank engagement, current ACU spend
Cognition internal Devin usage	659 PRs/week (Feb 2026)	AI SaaS (internal)	High (blog)	Cross-check with Git stats; confirm PR type distribution
Average task success rate	~67% PR acceptance (proxy)	General	Low	Disaggregate by task type, language, and complexity band

Retention metrics are not publicly disclosed. PR merge rate is the only available public quality proxy, and it is company-reported without denominator methodology. All retention figures require direct diligence requests.

[CU013, CU014, CU015, CU016]

FU004: Customer segmentation pyramid

[CU001, CU002, CU004, CU005, CU022]

6.5 Expansion and Concentration Risk

Cognition AI's land-and-expand strategy is evident: the Nubank deployment started with one business unit's ETL migration, then expanded to the Data, Collections, and Risk units. The Cognizant partnership implies a similar structure—Cognizant begins with internal deployment and then upsells to its global client base. The Free and Pro plan tiers are designed for developer-led adoption: individual developers experiment at low cost, drive internal adoption, and create bottom-up enterprise procurement pressure. The Windsurf IDE's 250,000+ daily active users provide a large organic install base from which to convert to paid Devin usage. Concentration risks are significant and not publicly disclosed. With ARR of ~$73M and a small set of confirmed named accounts, there is a material risk that a handful of large enterprise contracts (Nubank-scale, Mercedes-Benz-scale) constitute a dominant share of revenue. If the Cognizant channel partnership is performing well, it may itself represent disproportionate revenue concentration through a single reseller rather than diversified direct customer relationships. The COBOL modernization vertical targets high-value projects (multi-year legacy systems) that are structurally one-time engagements unless renegotiated as long-term support contracts, creating potential revenue cliff risk after initial migrations complete. Procurement friction at regulated industries is high. Financial services, healthcare, and government customers require security certifications (SOC 2, FedRAMP) and data residency controls that Cognition has not publicly disclosed meeting. VPC deployment mitigates the data residency concern, but the absence of published compliance attestations limits the addressable market in the most regulated industry verticals. The Government vertical launch in February 2026 suggests Cognition is actively addressing FedRAMP-equivalent certification requirements, but timelines and current status are not public. This compliance gap represents the primary enterprise procurement barrier for Cognition's target large-enterprise segment.

Expansion and concentration risk table
Driver / Risk	Type	Description	Impact	Diligence Path
Land-and-expand (use case depth)	Expansion driver	Nubank started with 1 business unit, expanded to 3+; Cognizant starts internal then sells to clients	High	Map expansion timeline and ACU growth per account
Windsurf to Devin conversion	Expansion driver	250K+ daily Windsurf users as conversion funnel for cloud agent upsell	Medium	Track conversion rate from Windsurf Free to paid Devin sessions
COBOL modernization (project-based)	Concentration risk	High-value one-time migrations; no structural recurring need after completion	High	Assess whether COBOL accounts contract for ongoing support or leave post-migration
Top-customer concentration (unknown)	Concentration risk	With ~$73M ARR and few named accounts, one Nubank-scale account ≈ material share	High	Request top-10 customer concentration; ask for top-customer revenue share
Cognizant channel dependency	Channel risk	Single reseller could dominate new enterprise account acquisition	Medium	Understand revenue share terms; assess lock-in vs. direct sales competitive dynamics
Compliance gap (SOC 2, FedRAMP)	Procurement friction	Regulated industries cannot procure without certifications	High	Confirm SOC 2 Type II and FedRAMP In Process status; understand timeline
Developer fatigue / ACU cost sensitivity	Churn risk	Pro/Core plans costly in practice ($100-200+/month for moderate use); developers may downgrade	Medium	Request Pro-to-Free downgrade rate and ACU utilization distribution

Top-customer concentration is a critical diligence gap; no public data is available. COBOL project-based nature creates revenue cliff risk that requires probing.

[CU017, CU018, CU019, CU020]

Chapter 07

07Risks

7.1 Regulatory and Legal Risk Landscape

Cognition operates in a rapidly evolving regulatory environment. Three major frameworks create material near-term obligations. The EU AI Act (Regulation 2024/1689), effective August 2026 for GPAI model providers, imposes transparency documentation, copyright compliance policies, and EU AI Office registration. The UK GDPR—as administered by the ICO—applies to all data processing of UK/EU residents and requires Data Protection Impact Assessments for high-risk AI processing, which Devin's autonomous PR-merge and code-deployment capabilities likely trigger under Article 35. The January 2025 White House Executive Order revoked prior Biden-era AI safety obligations, creating a deregulatory US federal environment, but successor agency-level rules (particularly for government AI procurement) remain active. California SB 1047, which would have imposed frontier model accountability requirements, was vetoed in September 2024; successor legislation (AB 2013, SB 1235) is active through the 2025–2026 session. Separately, Doe v. GitHub, Microsoft, and OpenAI copyright litigation—consolidated in the Northern District of California—alleges AI code model training on open-source repositories constitutes copyright infringement; Cognition has not disclosed its model training data provenance, creating analogous latent IP exposure. Cognition's terms of service restrict output from being used to train competing models, but the upstream model training data copyright status is not independently verifiable. Compliance monitoring, cross-jurisdictional DPA coverage, and GPAI transparency documentation are each materially underfunded relative to Cognition's scale and enterprise customer base across the EU, Japan, and Singapore. Absent a dedicated regulatory affairs function, Cognition risks enforcement action, GPAI non-registration penalties, or restricted EU market access by Q3 2026. A proactive regulatory program should be treated as a business-critical investment at this stage of international expansion, not an optional legal overhead item.

Regulatory / legal risk register
Rule / Case	Jurisdiction	Status	Likelihood	Severity	Mitigation	Residual Exposure	Diligence Path
EU AI Act GPAI provisions (Reg. 2024/1689)	European Union	In force Aug 2026	Medium	High	Publish GPAI transparency doc; register with EU AI Office	Medium	Verify GPAI registration; obtain AI transparency attestation
GDPR / UK GDPR (AI data processing)	EU / UK	Active	Medium	High	DPA agreements; no training on Enterprise data (company-claimed)	Medium	Audit DPA coverage; verify ICO DPIA compliance for agentic processing
Doe v. GitHub / Microsoft / OpenAI (AI copyright)	US N.D. Cal.	Ongoing consolidated litigation	Medium	Medium	ToS restricts output for competing model training; training data provenance undisclosed	Medium	Obtain legal opinion on Devin model training data provenance and OSS license exposure
CA SB 1047 successor bills (AB 2013)	California, USA	Monitoring 2025–2026	Low	Medium	No current binding obligation; SB 1047 vetoed Sep 2024	Low	Monitor quarterly; reassess if AB 2013 passes committee
US Federal AI EO — Jan 2025 (deregulatory)	United States	Active	Low	Low	Benefits from deregulatory stance; watch agency AI procurement rules	Low	Track OSTP AI Action Plan (due Jul 2025); monitor FedRAMP requirements for government deployments

Likelihood and severity ratings are qualitative diligence assessments. Residual exposure after mitigation reflects analyst judgment, not Cognition disclosures.

[CR001, CR002, CR003, CR004, CR005]

FR001: Risk Heatmap — Impact vs. Likelihood

[CR006, CR007, CR011, CR018, CR019, CR024]

7.2 Security and Technical Risk

Cognition's core product risk is that Devin operates with elevated system privileges— writing, executing, testing, and deploying code autonomously—which dramatically expands the attack surface compared to passive coding assistants. In December 2024, a live-streamed demonstration publicly exposed a prompt-injection vulnerability: an adversary could embed malicious instructions in a repository's README file, causing Devin to exfiltrate secrets, make unauthorized API calls, or plant backdoors in customer codebases. Cognition acknowledged and patched the issue. However, the incident revealed that prompt-injection defenses—top of the OWASP Top 10 for LLM Applications (LLM01)—were inadequately hardened prior to GA launch. Cognition's security page confirms SOC 2 Type II certification (audited March 2024), data encrypted in transit and at rest, and MFA for all employees. The Trust Center requires NDA execution before disclosing penetration test reports, audit scope, and third-party assessment results—limiting enterprise procurement teams' independent assessment of control effectiveness. Key unresolved gaps include: no published bug-bounty program; no formal Agent Security Framework; no public penetration test scope or red-team disclosure; and no published SLA or historical uptime data. Devin 2.0 (April 2025) expanded capabilities to merge PRs directly and schedule agents on customer infrastructure, materially increasing the blast radius of any future prompt-injection exploit. The OWASP GenAI Security Project identifies agent autonomy as an emerging, distinct risk category beyond classical LLM vulnerabilities. As enterprise adoption grows to VPC deployments at hundreds of companies, an undetected Devin exploit could simultaneously contaminate multiple enterprise codebases—a systemic supply-chain tail risk that individual customer security teams cannot independently mitigate.

Operational and security risk register
Failure Mode	Likelihood	Severity	Mitigation Maturity	Residual Exposure	Unresolved Gap
Prompt-injection / agent manipulation (OWASP LLM01)	High	Critical	Partial — patched Dec 2024; no public bug-bounty or agentic security framework	High	No published bug-bounty; no independent red-team disclosure; no post-Devin-2.0 pen test summary
Supply-chain code contamination via autonomous PR merge	Medium	Critical	Partial — SOC 2 Type II; code review recommended but not enforced	High	No published penetration test scope; trust center details NDA-gated; blast radius expanded by Devin 2.0
SOC 2 Type II scope gap — no public report access	Medium	High	Partial — SOC 2 Type II obtained Mar 2024; public report requires NDA	Medium	Obtain NDA and review SOC 2 report scope, control descriptions, and exceptions
AI hallucination / insecure code generation in production	High	High	Partial — code review and branch protection recommended; not enforced by default	Medium	No independent study of Devin-generated code vulnerability rate versus human-written code
Platform outage / ACU quota exhaustion	Low	Medium	Partial — AWS-hosted; no published SLA or uptime history disclosed	Low	Request SLA and historical uptime data; obtain enterprise credit terms for outages

Failure modes ordered by severity. Mitigation maturity reflects publicly available evidence; undisclosed internal controls may be more advanced.

[CR006, CR007, CR008, CR009, CR010]

FR002: Risk Transmission Map

[CR006, CR007, CR011, CR018, CR019, CR020]

7.3 Competitive and Market Risk

Cognition's most acute competitive risk is the rapid commoditization of agentic coding benchmarks that historically defined its leadership. At Devin's March 2024 launch, its SWE-bench Full score of 13.86% was groundbreaking. By mid-2025, Claude Code Opus 4 achieved 72.5% on SWE-bench Verified—a five-fold improvement in roughly 15 months. OpenAI Codex and GPT-5 achieve comparable Verified scores. This rate of improvement threatens Devin's technical differentiation: its multi-step planning, execution, and testing loop may be replicated by foundation model providers within 12–18 months. GitHub Copilot (250M+ installs, Microsoft distribution), Cursor ($500M+ ARR, $9B valuation, $40/month pricing), and Claude Code (Anthropic) compete for the same enterprise developer productivity budget as Devin ($500/month Team plan). The FTC has raised concerns that concentrated control of foundational AI inputs—including the models Devin depends on—could allow those providers to distort competition in downstream AI application markets, a structural risk for Cognition. Cognition's acquisition of Windsurf (July 2025) added 350+ enterprise accounts and 250K+ DAU, but also created integration risk: two separate AI agent codebases, model pipelines, billing systems, and sales motions must be merged simultaneously while maintaining ARR growth. Developer community skepticism at Devin's launch—focusing on benchmark reliability, closed demos, and misleading superiority claims—created an adverse reputation dynamic that increased adoption friction among technically sophisticated buyers and remains a headwind as enterprise evaluation rigor grows. Pricing compression is evident: Devin 2.0 reduced effective prices by approximately 3×; future competitive pressure may require further price reductions that compress revenue per session.

Partner and dependency risk register
Dependency	Counterparty	Role	Concentration	Failure Scenario	Severity	Residual Exposure
Foundation LLM provider (primary)	Anthropic / OpenAI	Core model intelligence for all Devin sessions	Very High	Provider raises API pricing 3–5× or restricts agentic use-case terms	Critical	High — no disclosed multi-model fallback or proprietary model timeline
Cloud infrastructure	Amazon Web Services	VPC compute; enterprise deployment	High	AWS terms change for AI agents; pricing increase materially	High	Medium — no public multi-cloud or on-premises fallback path
Developer workflow integrations	GitHub (Microsoft)	PR review; code diff; CI integration; Copilot distribution	High	Microsoft restricts GitHub API access for AI competitors or modifies Copilot integration policy	High	Medium — API-level integration; no exclusive arrangement
AI IDE platform (Windsurf, acquired Jul 2025)	Internal (Cognition)	Enterprise IDE distribution; 350+ enterprise accounts; 250K+ DAU	High	Integration failure; talent attrition post-acquisition; product cannibalization of Devin sessions	Medium	Medium — controlled integration; leadership retained at acquisition

All counterparties listed have some degree of conflicting competitive interests. Dependency severity rated by revenue impact of failure or material price increase.

[CR011, CR012, CR013, CR014]

FR003: Critical Partner and Regulatory Dependency Map

[CR011, CR012, CR013, CR014, CR001, CR002]

7.4 Financial and Operational Risk

Cognition has raised approximately $1.575B in disclosed funding through mid-2026. With 222 employees (102% annual headcount growth, per Growjo) and a compute- intensive product, estimated monthly burn is $5–15M, implying 18–36 months of runway under conservative scenarios. Revenue growth from ~$1M ARR at GA (December 2024) to ~$73M ARR (April 2025 estimate) is exceptional but introduces customer concentration risk: the $73M ARR figure across a handful of named enterprise accounts suggests a small number of very large contracts that create material churn risk at renewal. Nubank accounts for the only disclosed quantified production case study. If Nubank or a comparably large customer churns—due to a security incident, budget reallocation, or competitive switch—the ARR decline could impair future fundraising at an assumed $10.2B valuation. ACU pricing models ($2.25 per ACU overage) create unpredictable usage-based revenue and potential bill-shock churn for customers who underestimate consumption. Margin compression from LLM inference costs—third-party API calls at commercial Anthropic/OpenAI rates—limits gross margin expansion until Cognition builds proprietary models or achieves volume pricing. No path to profitability, EBITDA, or gross margin targets has been publicly disclosed. Windsurf integration introduces additional operational complexity: merging billing systems, enterprise contracts, and product teams simultaneously increases execution risk and may slow the strategic roadmap. The January 2025 Series A at $2B valuation and post-Windsurf round imply a $10.2B Growjo valuation estimate; any material miss on ARR growth trajectory, a churn event, or a down round from a next capital raise could impair team motivation and trigger key employee departures. Financial risk is elevated by the company's refusal to disclose gross margins, NRR, or cohort retention metrics.

People and execution risk register
Role / Function	Dependency or Gap	Likelihood	Severity	Mitigation	Diligence Path
CEO / Co-founder (Scott Wu)	Single point of product vision; investor credibility; technical leadership; competitive programmer identity	Low	Critical	Co-founder team; board oversight; no public succession plan	Request governance structure; confirm board independence; review co-founder vesting cliff schedule
AI research / ML engineering talent	Extreme competition from Anthropic, OpenAI, Google DeepMind, and Microsoft for the same competitive programming and ML talent pool	Medium	High	Competitive compensation (est. $500K+ TC); strong Cognition brand in AI community	Request headcount breakdown by function; review senior ML engineer tenure and equity refresh policy
International legal / regulatory operations	Expanding to EU, Japan, Singapore, and US Government with no disclosed dedicated legal infrastructure; FedRAMP not obtained	High	Medium	Nascent; likely ad hoc outside counsel for each jurisdiction	Verify EU DPO appointment; request Japan PIPL and Singapore PDPA compliance roadmap; confirm FedRAMP timeline

Risk ratings based on publicly disclosed organizational structure. Internal succession plans, bench depth, and retention programs not publicly disclosed.

[CR015, CR016, CR017]

7.5 Execution, People, and Partner Risk

Cognition's three co-founders—Scott Wu (CEO), Steven Hao, and Walden Yan—are competitive programming champions with exceptional technical credentials but no prior track record building enterprise software companies beyond early-stage startups. Key-person risk is concentrated in Scott Wu, who drives technical vision, investor relationships, and product strategy. The company's AI talent pipeline is stressed by intense competition from Anthropic, OpenAI, Google DeepMind, and Microsoft—all operating frontier AI programs at compensation packages exceeding $500K total compensation, directly competing for the same competitive programming and ML engineering talent pool that Cognition relies on. Devin's platform depends critically on third-party partners: AWS (VPC compute), GitHub (PR workflow integrations), and Slack/Teams (enterprise communication). Any of these platforms modifying API access, pricing, or integration policy could disrupt Devin's enterprise value proposition. Anthropic and OpenAI present a dual risk as both supplier and competitor: they supply the underlying models Devin orchestrates, but compete via Claude Code and OpenAI Codex respectively, creating conflict-of-interest incentives. Geographic expansion to Japan, Singapore, and the EU—announced in Q1–Q2 2026—introduces jurisdiction-specific compliance obligations that require legal and operations hiring ahead of revenue materialization. US Government customers require FedRAMP or equivalent authorization, typically a 12–24 month process; the gap between announced government vertical and actual FedRAMP authorization creates a window of reputational and contractual risk. International legal infrastructure—EU DPO appointment, Japan PIPL compliance, Singapore PDPA— represents an emerging gap for a company that has prioritized product velocity over regulatory scaffolding.

Mitigation and kill criteria table
Risk	Monitorable Trigger	Kill-Criteria Threshold	Action Implication
Prompt-injection / supply-chain breach	Disclosed CVE; customer codebase contamination; second security incident	Any confirmed supply-chain contamination of production customer codebase OR second public security incident within 12 months	Immediate thesis review; evaluate churn impact; assess remediation credibility; consider position exit
Competitor commoditization of agentic coding	Monthly SWE-bench leaderboard; Cursor, GitHub Copilot, Claude Code pricing announcements	Competitor at ≥80% SWE-bench Verified at ≤50% Devin pricing OR Devin YoY ARR growth <50%	Reduce position; reassess moat thesis beyond benchmark scores; evaluate Windsurf-led differentiation
Customer concentration / Nubank churn	NRR disclosed at renewal; top-customer ARR share; pipeline diversification evidence	Single customer >30% ARR OR NRR <90% in any trailing quarter	Immediate ARR quality review; model churn scenarios; request pipeline diversification evidence
LLM provider pricing shock or API restriction	Anthropic / OpenAI pricing announcements; API terms updates for agentic use cases	LLM API cost increases >3× without corresponding Devin price increase OR API terms restrict autonomous PR merges	Model margin compression exposure; evaluate proprietary model timeline and cost pass-through feasibility

Kill criteria are thesis-break triggers. Thresholds reflect current consensus; reassess at each monitoring cycle.

[CR018, CR019, CR020, CR021]

Chapter 08

08Valuation

8.1 Investment Thesis and Anti-Thesis

The bull thesis for Cognition AI rests on three mutually reinforcing propositions. First, autonomous software engineering represents a genuine market category shift— not an incremental productivity improvement. The total addressable market for software development labor exceeds $650B globally; even a 10% displacement of developer time through autonomous agents implies a $65B+ revenue opportunity. Cognition's Devin product is the category pioneer, establishing the autonomous coding agent as a procurement line item at leading enterprises before most buyers understand the category. First-mover brand recognition, lighthouse customer proof (Nubank), and the Windsurf IDE network (250K+ DAU) create a platform moat that sustains pricing power beyond pure benchmark performance. Second, the ARR trajectory from $1M (GA, December 2024) to an estimated $73M (April 2025) in nine months implies a growth rate that is structurally disconnected from historical SaaS benchmarks. At this growth rate, the company could plausibly reach $500M ARR within 18 months of the April 2025 data point, which would place it in the cohort of fastest-growing enterprise SaaS companies in history and justify a $10B+ valuation on forward ARR multiple compression. Third, the co-founders' deep technical backgrounds and Founders Fund's conviction investment create a signal that the founding team can sustain the product velocity needed to maintain differentiation. The anti-thesis is equally forceful. Benchmark commoditization is the existential risk: Claude Code Opus 4's 72.5% SWE-bench Verified score in 2025 versus Devin's 13.86% Full score at launch demonstrates that the performance gap is closing at a rate that may eliminate Cognition's technical differentiation by late 2026. The $10.2B valuation at ~140× estimated trailing ARR prices in near-flawless execution on a trajectory that has no published corroboration beyond Growjo estimates. Customer concentration risk is unquantified: $73M ARR from an undisclosed number of enterprise accounts could mean three to five contracts, where a single defection triggers a material ARR decline. The December 2024 prompt-injection security incident—discovered during a live stream— damaged Cognition's credibility among technically sophisticated buyers at a critical early-adoption inflection point. LLM provider dependency on Anthropic and OpenAI, both of whom compete directly with Devin, introduces a structural conflict-of-interest that could manifest as sudden API pricing increases or feature restrictions that raise Devin's marginal cost of delivery. Without NRR, gross margin, or cohort retention data, the financial quality of the reported ARR cannot be independently assessed.

Recommendation summary table
Dimension	Assessment	Evidence Base	Confidence	Implication
Recommendation	Watchlist — conditional invest below $5B entry	ARR growth exceptional; valuation multiple extreme at 140× ARR; security/concentration risks unresolved	Medium	Monitor NRR, margin, and integration progress; entry discipline required
Risk rating	High	Security incident history; benchmark commoditization; LLM dependency; customer concentration unknown	High	High risk of capital impairment at $10.2B entry if execution falters
Valuation stance	Speculative premium — current $10.2B at ~140× ARR is 7× Cursor multiple	Cursor $9.9B / $500M ARR = 20× ARR; Cognition $10.2B / $73M ARR = 140× ARR	Medium	Fair value range $3–7B on base-case assumptions; entry above $7B requires bull case proof points
Bull case target	$18–25B exit (3-year hold, $700M+ ARR, 25–35× forward ARR)	Growth rate sustainability; Windsurf integration success; NRR ≥120%	Low-Medium	1.5–2.5× return from $10.2B entry; adequate for venture IRR only with near-flawless execution
Bear case target	$800M–$1.8B exit (second security incident OR major churn OR >3× competitor pricing undercut)	Security overhang; benchmark commoditization; LLM pricing risk	Medium	90%+ capital loss at $10.2B entry; unacceptable asymmetry without higher-confidence NRR data

Recommendation is for institutional investors evaluating a secondary or primary position in Cognition at current valuation. This is not investment advice.

[CV001, CV002, CV003]

Final diligence asks table
Diligence Request	Priority	Why Critical	Acceptable Threshold
Net Revenue Retention by cohort (quarterly, trailing 4Q)	Critical	ARR quality; determines if growth is expansion- or churn-masked	NRR ≥110% confirms health; <90% is thesis-break
Gross margin by product line (Devin ACU, Windsurf, Enterprise)	Critical	LLM cost structure; path to profitability; margin compression risk	Target 50%+ gross margin; <30% raises sustainability questions at scale
Top-5 customer ARR and share of total ARR	Critical	Concentration risk; single churn event impact	Top-5 <40% total ARR; any individual >15% ARR is high concentration
Post-Devin-2.0 penetration test report or red-team summary (post-December 2024 patch)	High	Security credibility; expanded agentic capabilities require re-audit	NDA access to SOC 2 Type II + at least 1 post-Devin-2.0 independent pen test
Windsurf integration customer retention rate (6 months post-acquisition)	High	Integration execution proof; IDE customer attrition risk	Windsurf enterprise account retention ≥85% at 6 months post-acquisition
LLM API contract terms (Anthropic, OpenAI): volume pricing, agentic use-case clauses, change-of-control	High	Supplier conflict-of-interest; margin floor determination	Locked-in pricing through 2027 OR multi-model diversification strategy in place
EU GPAI compliance plan and regulatory affairs staffing	Medium	August 2026 compliance deadline; EU revenue at risk if non-compliant	Named EU regulatory affairs hire; GPAI transparency doc timeline <6 months
FedRAMP authorization status and timeline for US Government vertical	Medium	Government ARR pipeline; credibility of government vertical launch (Feb 2026)	Clear FedRAMP authorization timeline; interim use-case confirmation from named agency

Priority levels: Critical = must-have before investment decision; High = must-have before term sheet; Medium = must-have before close.

[CV023, CV024, CV025, CV026, CV027, CV028]

FV001: Recommendation Decision Logic

[CV001, CV002, CV018, CV019]

8.2 Valuation Context and Financing History

Cognition has completed three disclosed financing rounds totaling approximately $1.575B. The $21M seed round established the founding team in 2023. The $175M Series A (January to April 2025, Founders Fund IX lead) set the initial institutional valuation at $2B post-money. SEC EDGAR records confirm Founders Fund IX (CIK 0001971631) as a large institutional vehicle that committed capital to Cognition; the Cognition Capital SPV I (CIK 0002072175, Form D filed 2025-06-11) indicates a follow-on structured vehicle was established concurrent with or shortly after the Windsurf acquisition close. The $400M round (September 2025) valued the company at $10.2B post-money, a 5.1× step-up from the Series A in approximately nine months—one of the steepest step-up trajectories in AI software history. The total financing of ~$1.575B implies approximately 15%+ dilution on a $10.2B post-money basis, though exact preference stack and liquidation waterfall are not disclosed. At the $10.2B valuation, the primary revenue multiple is approximately 140× the April 2025 ~$73M ARR estimate—a premium to all disclosed AI software peers. Cursor/Anysphere closed its Series D at $9.9B on approximately $500M ARR (September 2025), implying a 20× ARR multiple. Cognition's premium is partially justified by its superior growth rate and category-pioneer positioning but represents execution risk priced in for investors entering at $10B+. The Windsurf acquisition (July 2025) was completed via Cognition equity and cash for an undisclosed purchase price; the Cognition Capital SPV I suggests a structured investor-level vehicle, possibly used to fund the acquisition consideration or backstop the combined entity's operating plan. A "culture reset" reported by Winbuzzer in August 2025—Cognition offering nine-month buyouts to Windsurf staff and reportedly requiring 80-hour workweeks—introduces talent integration risk not reflected in the headline valuation. Any new institutional investor at $10.2B would face a meaningful preference overhang from the seed, Series A, and $400M round before reaching par on any return calculation below a $10.2B exit value.

Thesis / anti-thesis table
Thesis Pillar	Supporting Evidence	Countervailing Evidence	Conviction
Autonomous engineering as a category creation event	ARR growth from $1M to $73M in 9 months; Nubank 12× efficiency; Cognition uses Devin internally (659 PRs/week)	Benchmark commoditization accelerating; peers may replicate orchestration layer without needing Cognition's tooling	Medium
Network moat from Windsurf IDE + Devin cloud agent integration	Windsurf 250K+ DAU and 350+ enterprise accounts added at acquisition (Jul 2025)	Integration execution risk; culture reset reported Aug 2025; two codebases to merge simultaneously	Low-Medium
Founders Fund IX conviction as quality signal	Founders Fund IX (CIK 0001971631) as lead Series A investor; Peter Thiel-associated track record	Founders Fund portfolio concentration; past thesis misses have occurred; not a guarantee of outcome	Medium
Price reduction as volume expansion strategy	Devin 2.0 reduced effective price 3×; 83% more tasks per ACU; implies cost curve dropping	Gross margin unknown; 3× price cut suggests either cost structure improvement OR competitive response; either reduces total revenue headroom	Low

Conviction rating is analyst judgment based on publicly available evidence. Internal financial metrics (NRR, gross margin) are not available to modify these assessments.

[CV004, CV005, CV006, CV007]

FV002: Valuation Sensitivity to ARR and Multiple

[CV008, CV009, CV010, CV011]

8.3 Bull, Base, and Bear Scenario Analysis

The bull scenario assumes Devin's ARR continues at a sustained 50% quarterly growth rate through 2027, reaching approximately $700M–$800M ARR by end of 2027. At this scale, with improving gross margins from model cost optimization and volume contracts, Cognition would likely command an IPO or acquisition valuation of $15–25B—a 1.5–2.5× return from $10.2B entry. The bull case requires: no material security incidents post- December 2024; successful Windsurf integration adding a credible IDE channel; NRR disclosed at ≥120%; and competitor models failing to commoditize at Devin's enterprise workflow integration depth. The base scenario assumes moderate ARR growth (30–40% quarterly decay from the 2025 sprint) reaching $200–350M ARR by end of 2027, with gross margin improvement to 50–60%. At a normalized 25–35× forward ARR multiple (consistent with high-growth AI SaaS), this implies a $5–12B valuation—at or near flat to entry at $10.2B, suggesting negligible return for investors at that entry. The base case requires no major churn events, reasonable Windsurf integration, and sustained AI infrastructure investment by enterprises. The bear scenario assumes a material adverse event within 18 months: a second security incident, Nubank churn, or a Devin price war triggered by Claude Code or OpenAI Codex commoditization. Under the bear case, ARR plateaus at $80–120M, NRR is disclosed below 90%, and the valuation compresses to 10–15× ARR implying an $800M–$1.8B exit value—a 90%+ capital loss for investors entering at $10.2B. The bear trigger probability is estimated at 20–30% given the concentration of known risk factors, particularly the security overhang and the structural benchmark commoditization trend. Return expectations are asymmetric: the bull upside (1.5–2.5× in 3 years) does not adequately compensate for a 30%+ probability of severe capital loss at a $10.2B entry valuation. This is the core investment thesis challenge: the risk-adjusted return at current entry is unattractive unless the investor has high-conviction, differentiated information about NRR and gross margin that the market does not.

Bull / base / bear scenario table
Scenario	ARR (End 2027 est.)	ARR Multiple	Implied Valuation	Probability Signal	Key Assumption
Bull	$700–800M	25–35× forward ARR	$18–28B	20–25% (requires near-perfect execution)	No major security incidents; NRR ≥120%; Windsurf integration successful; LLM pricing stable
Base	$200–350M	20–25× forward ARR	$4–9B	45–55% (moderate-growth execution)	Moderate churn; Windsurf partial integration; 30–40% quarterly ARR growth rate decay
Bear	$80–120M	8–12× trailing ARR	$640M–$1.4B	25–30% (materially negative event)	Second security incident OR major customer churn OR competitor price undercut >3× OR regulatory enforcement

Scenario ranges are analyst estimates based on comparable company benchmarks and disclosed growth metrics. Probabilities are qualitative signals, not quantitative models.

[CV008, CV009, CV010]

FV003: Valuation and Return Range by Scenario

[CV008, CV009, CV010, CV003]

8.4 Comparable Company Valuation Analysis

Cognition's peer group for comparables purposes includes AI-native developer productivity tools (Cursor, Replit), foundation model providers with coding products (Anthropic/Claude Code, OpenAI/Codex), and broader enterprise AI SaaS companies at comparable ARR run-rates. Cursor/Anysphere is the closest comparator: both are AI coding tools targeting enterprise developers, both are VC-backed, and both reached their most recent valuations in the second half of 2025. Cursor closed its Series D at $9.9B (Bloomberg, November 2025) on approximately $500M ARR—an 18–20× ARR multiple. Cognition at $10.2B on ~$73M ARR implies a ~140× ARR multiple, a 7–8× premium to Cursor on ARR-basis. The Cognition premium is partially justified by higher growth rate (Devin grew faster from launch) but is structurally unsustainable as Cursor's ARR ($500M) already demonstrates greater revenue quality and breadth. Harvey AI (legal AI, $3B valuation, ~$100M ARR, ~30× ARR) represents a vertical AI SaaS comparator where high-quality NRR justifies premium multiples. Glean (enterprise AI search, $4.6B, ~$100M ARR, ~46× ARR) demonstrates that enterprise AI SaaS can sustain 40–50× ARR multiples at $100M ARR, but NRR at Glean is reported above 150%. Replit ($1.16B valuation, 2023, development platform with AI pivot) represents the category risk: a pioneer pivot story that lost market share to more focused AI-native competitors. The implied IPO entry multiple for AI SaaS companies at or near profitability in 2025 is approximately 15–25× forward ARR; Cognition at $10.2B and ~$73M ARR would need to reach $400–680M ARR before an IPO at standard multiples would preserve the $10.2B entry valuation. No public market comparator exists: GitHub Copilot is embedded in Microsoft Azure revenue, Amazon Q in AWS, and Claude Code in Anthropic's API revenue. Private market premium over public comps is historically 20–40% for growth-stage AI; this suggests a fair-value range for Cognition (on base-case assumptions) of $3–7B, below the $10.2B last round, implying the current round was priced at a 1.5–3× speculative premium to intrinsic value. Entry discipline—waiting for either ARR confirmation at $300M+ or valuation correction to $4–6B—is the appropriate investor response.

Comparable valuation table
Company	Valuation	ARR / Revenue	ARR Multiple	Stage	Comparability to Cognition
Cursor (Anysphere)	$9.9B (Nov 2025 Series D)	~$500M ARR (2025 est.)	~20× ARR	Late private, pre-IPO	Closest comp: AI coding tool, enterprise, developer-productivity focus; 7× lower ARR multiple
Glean	$4.6B (2024)	~$100M ARR (est.)	~46× ARR	Late private	Enterprise AI search; high NRR (≥150%) justifies premium; Cognition NRR unknown
Harvey AI	$3B (2025)	~$100M ARR (est.)	~30× ARR	Growth stage	Vertical AI SaaS; legal focus; strong enterprise NRR; narrower TAM than Cognition
Cohere	$5.5B (2024)	~$100M ARR (est.)	~55× ARR	Late private	Foundation model provider; infrastructure-layer; less exposed to benchmark commoditization
Replit	$1.16B (2023)	Sub-$50M ARR (est.)	~25× ARR	Growth stage	Category risk precedent: developer tool that lost share to AI-native entrants; watch for pattern
GitHub Copilot	N/A (embedded in Microsoft)	~$200M+ ARR (est.)	N/A	Public (Microsoft)	Strongest distribution moat (250M+ GitHub users); pricing $10–$39/user/month; commoditizes Devin's $500/month price point
Amazon Q Developer	N/A (embedded in AWS)	~$50M ARR (est.)	N/A	Public (Amazon)	AWS bundled pricing; relevant for Devin's enterprise cloud deployment competition

All non-public valuations are based on disclosed or reported funding rounds. Revenue multiples for private companies are analyst estimates; actual metrics not confirmed by companies.

[CV011, CV012, CV013, CV014, CV015, CV016]

FV004: Investment Quality KPIs — Cognition vs. Thresholds

[CV001, CV030, CV031, CV032, CV033]

8.5 Exit Readiness and Final Diligence Asks

Cognition is 2–4 years from an IPO readiness event given current ARR trajectory and the disclosure gap that public markets require. For IPO at $8B+ market capitalization, the company needs approximately $400–500M ARR, positive unit economics (40%+ gross margin), disclosed NRR above 120%, and a cleared SOC 2 Type II + GDPR compliance infrastructure. M&A exit paths exist: Microsoft (GitHub Copilot/Azure positioning), Alphabet (Google Cloud developer tools), Salesforce (developer productivity within Einstein platform), or Snowflake/Databricks (data engineering automation) are logical acquirers. The $10.2B valuation makes a strategic acquirer path challenging: Microsoft's prior GitHub acquisition at $7.5B (2018) was at scale, and a $10.2B bid for Cognition would be among the largest Microsoft AI acquisitions—requiring board-level conviction. The most likely exit path remains late-stage private equity / pre-IPO crossover participation, with an IPO window in 2028–2030 if ARR trajectory sustains. Final diligence priorities before investment: (1) NRR by cohort—disclosed above 110% is a green flag; below 90% is thesis-break; (2) gross margin at scale—target 50%+ gross; (3) penetration test report from post- Devin-2.0 audit; (4) Windsurf integration roadmap with customer retention data from the acquired base; (5) LLM provider contract terms including agentic use-case pricing floors; (6) GPAI compliance plan and EU regulatory timeline; (7) top-5 customer ARR concentration (>50% from top-5 is a material concentration risk); and (8) headcount by function to assess burn relative to ARR coverage ratio. Thesis-break conditions: a second disclosed security incident within 12 months, NRR disclosed below 90%, ARR growth rate deceleration to <50% YoY, or a confirmed major enterprise customer churn event.

Thesis-break and kill triggers table
Trigger Event	Threshold	Monitoring Indicator	Action Implication
Second security incident or supply-chain breach	Any confirmed second Devin security incident within 18 months of December 2024 patch	CVE disclosures; enterprise security advisories; HN/security community	Immediate thesis review; evaluate credibility of remediation; likely exit
Major customer churn	Top customer ARR decline >25% YoY OR any individual customer ARR >$10M lost	ARR disclosure at funding; press announcements; customer reference interviews	Reduce position; re-model ARR trajectory with churn scenario
Competitor price undercut >3× Devin equivalent at comparable capability	Competitor at SWE-bench ≥80% Verified at ≤$160/month (Devin equivalent)	SWE-bench leaderboard; competitor pricing pages monthly	Structural differentiation review; accelerate Windsurf moat assessment
NRR disclosed below 90%	NRR < 90% at any disclosed or estimated cohort measurement	Financial disclosure at next funding; data room	Re-model ARR growth; high churn at scale is thesis-break for $10B+ entry
EU regulatory enforcement action for GPAI non-compliance	EU AI Office initiates formal investigation or issues compliance order against Cognition	EU AI Office announcements; DPC press releases; Reuters/FT regulatory coverage	Assess EU revenue at risk; model enforcement cost; could trigger broader enterprise procurement pause

Kill criteria are not legal thresholds; they are investment risk management indicators for monitoring committees.

[CV018, CV019, CV020, CV021, CV022]

Disclaimer

This report is a public-evidence diligence snapshot, not investment advice. Important financial, legal, technical, and contractual facts remain non-public and should be verified directly with management and primary documents before any investment decision.

Evidence index

Claims
ID	Statement	Confidence	Sources
CO001	Cognition AI was founded in November 2023 in San Francisco, California.	High	SO002, SO005, SO010
CO002	Cognition AI is headquartered in San Francisco, California.	High	SO001, SO005
CO003	Cognition AI's flagship product, Devin, is an autonomous AI software engineer capable of interpreting tickets, planning, writing code, debugging, testing, and deploying software with minimal human oversight.	High	SO002, SO025
CO004	Scott Wu is the co-founder and CEO of Cognition AI.	High	SO001, SO005, SO010, SO011
CO005	Steven Hao is the co-founder and CTO of Cognition AI.	High	SO001, SO005, SO010
CO006	Walden Yan is the co-founder and CPO (Chief Product Officer) of Cognition AI.	High	SO001, SO005, SO010
CO007	Scott Wu is a three-time IOI (International Olympiad in Informatics) gold medalist.	Medium	SO011, SO010
CO008	Cognition AI raised $175M at a $2B valuation in April 2024, led by Founders Fund.	High	SO006, SO007, SO008
CO009	Cognition AI raised $400M at a post-money valuation of $10.2B in September 2025, led by Founders Fund.	High	SO006, SO007, SO008, SO009
CO010	Cognition AI's total capital raised is approximately $696M as of September 2025.	High	SO008, SO014
CO011	Cognition AI acquired Windsurf, an AI-native IDE company, in July 2025.	High	SO004, SO009, SO015
CO012	Windsurf had approximately $82M ARR at the time of its acquisition by Cognition AI.	High	SO003, SO008, SO009
CO013	Windsurf had more than 350 enterprise customers at the time of its acquisition by Cognition AI.	Medium	SO004, SO009
CO014	Cognition AI's ARR grew from $1M in September 2024 to $73M in June 2025, representing approximately 73x growth in nine months.	High	SO008, SO024
CO015	Following the Windsurf acquisition, Cognition AI's combined ARR reached approximately $155M in July 2025.	High	SO003, SO008
CO016	Cognition AI's net cash burn was under $20M from founding through Q3 2025, indicating high capital efficiency.	Medium	SO003, SO014
CO017	Devin was publicly announced in March 2024.	High	SO002, SO005
CO018	Cognition AI initially explored cryptocurrency before pivoting to building autonomous AI coding agents.	Medium	SO005, SO010
CO019	The founding team of approximately 10 people collectively held 10 IOI gold medals at company launch.	Medium	SO002, SO011
CO020	Founders Fund led both the initial seed/Series A and the $400M Series B round for Cognition AI.	High	SO006, SO007, SO008
CO021	Goldman Sachs is among Cognition AI's enterprise customers and is reportedly running a large-scale Devin pilot with approximately 12,000 developers.	Medium	SO012, SO003
CO022	Cognition AI laid off 30 former Windsurf employees following the July 2025 acquisition.	High	SO013, SO016
CO023	Cognition AI offered nine-month salary buyout packages to approximately 200 remaining Windsurf employees who did not want to commit to the company's 80-hour, six-day workweek culture.	High	SO013, SO016, SO017
CO024	Cognition AI's enterprise ARR grew more than 30% in seven weeks following the Windsurf acquisition.	Medium	SO003
CO025	Cognition AI opened its Singapore APAC headquarters in April 2026.	High	SO001, SO003
CO026	Scott Wu studied economics at Harvard University.	Medium	SO011, SO010
CO027	Scott Wu previously co-founded Lunchclub, an AI-powered professional networking platform.	Medium	SO011, SO010
CO028	Devin's Core plan is priced at $20/month for individual developers with pay-as-you-go compute at approximately $2.25 per ACU.	High	SO026, SO025
CO029	Devin's Team plan costs $500/month and includes 250 Agent Compute Units plus full API access.	High	SO025, SO026
CO030	Cognition AI's $175M Series A was conducted at a $2B post-money valuation in April 2024.	High	SO006, SO008
CO031	Cognition AI raised approximately $21M in March 2024 at a valuation of approximately $350M in its initial institutional round.	Medium	SO008, SO021
CO032	Scott Wu holds the 'Legendary Grandmaster' designation on the Codeforces competitive programming platform.	Medium	SO011, SO010
CO033	Mercedes-Benz announced a partnership with Cognition AI in April 2026.	High	SO001, SO025
CO034	Independent product reviewers have found that Devin's real-world task success rates require significant human oversight on complex tasks, suggesting performance falls below the 'fully autonomous' marketing framing.	Medium	SO018, SO019
CO035	CEO Scott Wu has stated publicly that Cognition AI's culture requires six-day workweeks of more than 80 hours, framing it as a company-wide commitment to the mission.	High	SO013, SO016, SO017
CO036	Investors in the September 2025 $400M round included Lux Capital, 8VC, Elad Gil, Definition Capital, Swish Ventures, Bain Capital Ventures, and D1 Capital.	High	SO006, SO009, SO014
CO037	As of April 2026, Cognition AI is reportedly in discussions for a new financing round that could value the company at over $25 billion.	Medium	SO008
CO038	Walden Yan is an IOI gold medalist who leads product management and the product roadmap at Cognition AI.	Medium	SO010, SO005
CO039	Steven Hao previously held roles at Scale AI, Cursor, Modal, DeepMind, Waymo, and Nuro before co-founding Cognition AI.	Medium	SO010, SO011
CO040	Cognition AI had approximately 49 employees before the Windsurf acquisition according to revenue-per-employee analyses.	Low	SO008
CO041	Windsurf had been in acquisition discussions with OpenAI for approximately $3B and was the subject of a Google talent licensing deal worth approximately $2.4B before Cognition acquired the remaining team and IP.	Medium	SO015, SO009
CO042	Cognition AI's price-to-ARR multiple at the September 2025 valuation was approximately 65x (using $155M combined ARR) to 140x (using $73M pre-acquisition Devin ARR).	Medium	SO023, SO008
CO043	Devin operates within a sandboxed environment that includes a shell, code editor, and browser, using long-context model inference to sequence planning, implementation, and iterative testing steps autonomously.	High	SO002, SO025
CM001	The broad AI code tools market was valued at $7.37 billion in 2025, according to Mordor Intelligence, and is forecast to reach $23.97 billion by 2030 at a 26.6% CAGR.	Medium	SM001
CM002	Grand View Research forecasts the narrow generative AI coding assistants market at $92.5 million by 2030 at a 24.8% CAGR—a significantly smaller scope than Mordor's $24B figure.	Medium	SM002
CM003	ResearchAndMarkets published a $97.9 billion by 2030 forecast for the generative AI coding assistants space, though this figure likely uses an atypically broad scope including non-comparable segments.	Low	SM003
CM004	No single analyst consensus exists for the AI coding tools TAM: estimates range from $92M (Grand View, narrow) to $97.9B (ResearchAndMarkets, broad) for 2030, with the most operationally relevant SAM estimate at $1–8B.	Medium	SM001, SM002, SM003
CM005	The estimated serviceable addressable market for premium agentic developer tools (autonomous tier) is $1–2B in 2025–2026, growing to $5–8B by 2030 based on developer-seat and enterprise-spend lenses.	Medium	SM006, SM007, SM008
CM006	Global software spending reached approximately $675–$700 billion in 2024, according to WIPO's Global Innovation Index and multiple market research sources.	High	SM008, SM007
CM007	The number of professional software developers worldwide in 2024 is estimated at 27–28.7 million by Evans Data, Statista, and JetBrains Research.	High	SM006, SM007, SM020
CM008	Gartner forecasts that 90% of enterprise software engineers will use AI code assistants by 2028, up from less than 14% in early 2024.	High	SM004, SM005, SM009
CM009	GitHub Copilot holds approximately 42% market share among paid AI coding tools in 2025, with $2 billion in ARR and over 20 million all-time users.	Medium	SM017, SM018, SM019
CM010	GitHub Copilot is used by over 50,000 organizations including 90% of Fortune 100 companies in 2025, with enterprise adoption up 75% quarter-over-quarter.	Medium	SM017, SM019
CM011	Developers using AI coding assistants complete code tasks 51–55% faster, with AI-generated code constituting 46% of all code written by Copilot users on GitHub.	Medium	SM017, SM018
CM012	Enterprise spend on agentic AI systems is projected to surge from less than $1 billion in 2024 to over $51 billion by 2028 at approximately 150% CAGR.	Medium	SM023, SM024
CM013	Enterprise buyers universally require sandbox environments, audit logs, and human-in-the-loop review gates before deploying autonomous coding agents in production workflows.	High	SM013, SM014, SM015
CM014	The biggest barriers to enterprise adoption of agentic AI include trust deficits from hallucination risk, legacy system integration complexity, and unclear ROI measurement frameworks.	High	SM013, SM014, SM015, SM016
CM015	65% of enterprises were regularly using generative AI as of 2025, up from 33% in 2023, indicating rapid mainstream adoption of AI tools in enterprise workflows.	Medium	SM012, SM023
CM016	Only a minority of AI use cases in enterprises have reached full production in 2025; primary barriers are legacy systems, data silos, compliance concerns, and talent shortages.	High	SM024, SM013
CM017	72% of enterprises plan to deploy AI copilot and agent technologies by 2026, according to aggregated industry surveys.	Medium	SM023
CM018	The primary enterprise buyer for autonomous software engineering agents is VP Engineering or CTO, with budget sourced from developer productivity or digital transformation allocations.	Medium	SM012, SM024
CM019	Goldman Sachs (12,000-developer pilot), Citi, Cisco, Dell, Nubank, and Ramp are among Cognition AI's enterprise reference customers, representing the financial services and high-growth tech segments.	Medium	SM011, SM023
CM020	The EU AI Act and evolving US AI governance regulations impose documentation and explainability requirements that will affect enterprise deployment of autonomous coding agents beginning 2026–2027.	Medium	SM013, SM015
CM021	Status-quo substitutes for autonomous AI coding agents include offshore staff augmentation, traditional IDEs, internal developer tooling built on base LLM APIs, and cloud-vendor bundled IDE extensions.	Medium	SM001, SM011
CM022	Switching costs for enterprise AI coding tools include integration migration effort, process overhaul, vendor lock-in risks, and cultural reorientation, making incumbent vendor stickiness a meaningful competitive moat.	Medium	SM015, SM016
CM023	North America leads the AI coding tools market with the largest share; cloud deployment dominates though enterprise demand for on-premises solutions is growing, particularly among regulated industries.	Medium	SM001, SM002
CM024	The autonomous agents market (cross-sector) is forecast to grow from $4.35B in 2025 to $103B by 2034 at over 40% CAGR, providing a directionally supportive upper-bound context for the AI software engineering agent TAM.	Low	SM015, SM025
CM025	Gartner's Magic Quadrant for AI Code Assistants provides enterprise procurement legitimacy for the category, with GitHub Copilot named a Leader in both 2024 and 2025 editions.	High	SM004, SM010
CM026	87% of developers are using AI tools daily as of 2025, indicating that AI-assisted development has reached mainstream adoption among professional software engineers.	Medium	SM011, SM023
CM027	Developers using AI tools report 3.2× productivity improvements on average, with 67% using AI tools at least five days per week according to 2025 surveys.	Medium	SM011, SM017
CM028	Financial services enterprises present a high-value segment for Cognition AI given Goldman Sachs' 12,000-developer Devin pilot; these buyers require SOC 2 Type II, data residency, and regulatory compliance before enterprise sign-off.	Medium	SM013, SM024
CM029	The global software development market is valued at $675–$730B in 2024; a 1% shift of project labor costs to AI tools implies a $6.7B+ market, corroborating the Mordor $24B broad TAM estimate for 2030.	Medium	SM008, SM021
CM030	AI-assisted pull request cycle times have been reduced by up to 75% (from 9.6 days to 2.4 days) for organizations using GitHub Copilot, per reported enterprise metrics.	Medium	SM017, SM018
CM031	Unpredictable agent behavior including hallucinations, lack of transparency, and security concerns are the primary trust barriers slowing enterprise deployment of autonomous AI coding agents.	High	SM014, SM016
CM032	77% of companies are currently using or actively testing AI tools, with the global AI market estimated at approximately $298 billion in 2025.	Medium	SM023
CM033	Agentic AI adoption in software engineering is constrained not just by technology but by organizational readiness: workforce resistance, lack of AI talent, and unclear business cases are the dominant enterprise blockers as of 2025.	High	SM013, SM015, SM024
CM034	Competitive fragmentation in AI coding tools—GitHub Copilot (42% market share), Cursor ($9B valuation), Windsurf (acquired by Cognition), and OpenAI Codex—can produce buyer choice overload that stalls enterprise procurement decisions.	Medium	SM011, SM017
CM035	Cognition AI's $155M ARR as of July 2025 represents approximately 2% penetration of the estimated $7.37B AI code tools TAM, suggesting material white space remains if the market definition is correct.	Low	SM001
CP001	GitHub Copilot holds approximately 42% of the AI coding assistant market share as of 2025.	Medium	SP005, SP017
CP002	GitHub Copilot had over 20 million active users as of early 2025.	Medium	SP005, SP012
CP003	GitHub Copilot exceeded $2 billion in annual recurring revenue by early 2025.	Medium	SP002, SP004
CP004	GitHub Copilot is used by 90% of Fortune 100 companies.	Medium	SP012, SP021
CP005	Cursor (Anysphere) reached $2 billion ARR by February 2026.	High	SP001, SP003, SP004
CP006	Cursor raised a $2.3 billion Series D in November 2025 at a $29.3 billion valuation.	High	SP001, SP003
CP007	Cursor has over 1 million paying customers and 50,000 enterprise teams.	Medium	SP003, SP006
CP008	Cursor's multi-agent background agents feature runs up to eight parallel task threads simultaneously.	Medium	SP005, SP016
CP009	Amazon Q Developer offers a Free tier and a Pro tier priced at $19 per user per month.	High	SP013, SP018
CP010	Amazon Q Developer achieved SOC 2 Type II certification with VPC isolation support.	Medium	SP007, SP013
CP011	Amazon Q Developer has a 200,000 token context window.	Medium	SP007, SP018
CP012	Claude Code operates as a terminal-native agentic coding assistant without requiring a custom IDE.	High	SP008, SP009
CP013	Devin's initial SWE-bench score in March 2024 was 13.86%, a breakthrough at the time of announcement.	High	SP010, SP015, SP025
CP014	Subsequent model improvements from Anthropic pushed Claude-based agents above 50% on SWE-bench Verified.	Medium	SP010, SP015
CP015	Claude Code is priced at $10 per month on Pro and $100 per month on Max tiers.	High	SP027, SP008
CP016	OpenAI relaunched Codex as a web-based agentic coding environment backed by the o3 model family in 2025.	Medium	SP014, SP011
CP017	SWE-agent is an open-source research framework developed at Princeton NLP Group, not a commercial product.	High	SP025, SP010
CP018	Cursor investors include Thrive Capital, Andreessen Horowitz, Accel, Nvidia, and Google.	High	SP001, SP003
CP019	GitHub Copilot individual pricing is $10 per month; Business tier is $19 per month; Enterprise tier is $39 per month.	High	SP021, SP012
CP020	Windsurf (formerly Codeium) was acquired by Cognition in July 2025, adding approximately $82M ARR and 350-plus enterprise customers.	Medium	SP024, SP006
CP021	Cognition's acquisition of Windsurf converted a direct IDE-tier competitor into a distribution channel for Devin.	Medium	SP024, SP019
CP022	GitHub Copilot expanded from inline code completions to Copilot Workspace, a multi-step planning and execution agent.	Medium	SP012, SP005
CP023	Cursor pricing is $20 per month for Pro tier and $40 per month for Business tier as of 2025.	Medium	SP023, SP005
CP024	Replit offers an AI-native development platform with cloud execution environments targeting student and hobbyist developers.	Medium	SP022
CP025	The core architectural gap between Devin and Claude Code is that Claude Code still requires a human operator at the terminal, while Devin runs fully unattended.	Medium	SP009, SP008
CP026	Cognition's defensible competitive advantages include execution depth, enterprise data flywheel from completed tasks, and Windsurf distribution.	Medium	SP019, SP011
CP027	Benchmark credibility concerns about Devin emerged after independent evaluations showed other tools surpassing it within months of the March 2024 announcement.	Medium	SP010, SP015, SP025
CP028	The competitive window for pure agentic differentiation is narrowing as frontier model labs deploy equivalent agentic pipelines.	Medium	SP011, SP019
CP029	Cursor is growing faster than Cognition on ARR metrics and has more paying customers, providing proportionally more training signal.	Medium	SP001, SP003, SP004
CP030	Cursor uses a VS Code fork architecture that provides deep editor integration unavailable to plugin-based competitors.	Medium	SP005, SP016, SP023
CP031	Amazon Q Developer's native AWS service integrations position it as preferred for organizations standardized on AWS infrastructure.	High	SP007, SP013
CP032	GitHub Copilot's enterprise distribution advantages create high switching-cost inertia across Fortune 100 organizations.	Medium	SP012, SP021
CP033	The AI coding tools market in 2026 is stratified into hyperscaler-backed co-pilots, IDE-native assistants, and fully autonomous agent tiers.	Medium	SP011, SP017, SP019
CP034	Frontier model labs such as Anthropic and OpenAI control the underlying reasoning capabilities and can deploy equivalent agentic pipelines without licensing dependencies.	High	SP008, SP014
CP035	Cognition's enterprise go-to-market execution and customer retention metrics are the true leading indicators of sustainable competitive position, more so than benchmark scores.	Medium	SP009, SP011
CP036	Cursor's growth from zero to $2B ARR in approximately 24 months represents the fastest known ARR scale-up in AI coding tools.	Medium	SP001, SP003
CP037	Both GitHub Copilot and Amazon Q Developer lack end-to-end autonomous task completion without human-in-the-loop confirmation at each stage.	Medium	SP007, SP012
CI001	Cognition AI's Core plan is priced at $20 per month for 5 ACUs; Team plan at $500 per month for 250 ACUs.	High	SI004, SI022
CI002	Enterprise ACU overages are priced at approximately $2.25 per ACU beyond the plan inclusion.	Medium	SI004, SI001
CI003	Cognition AI reached approximately $1M ARR in September 2024.	Medium	SI001, SI005
CI004	Cognition AI grew from $1M to $73M ARR between September 2024 and June 2025—a 73x increase in nine months.	High	SI001, SI003, SI005
CI005	Windsurf (formerly Codeium) was priced at $0 (Free) and approximately $15 per month (Pro) before the Cognition acquisition.	Medium	SI006, SI007
CI006	Cognition AI's gross margin is estimated at 50–70%, based on inference compute costs versus ACU pricing—no primary disclosure exists.	Low	SI011, SI005
CI007	The Windsurf acquisition added approximately $82M ARR and 350-plus enterprise customers to Cognition's combined revenue base.	Medium	SI006, SI010, SI012
CI008	Cognition AI's combined ARR reached approximately $155M post-Windsurf acquisition in July 2025.	Medium	SI009, SI010, SI013
CI009	ARR per employee at Cognition was approximately $1.5M prior to the Windsurf acquisition (49 employees, $73M ARR in June 2025).	Low	SI003, SI011
CI010	ARR per employee dropped to approximately $623K post-Windsurf, reflecting the quadrupling of headcount to ~249 employees.	Low	SI019, SI003, SI011
CI011	Net revenue retention rate for Cognition AI is not publicly disclosed; analyst estimates range from 100–130% based on ARR growth trajectory.	Low	SI005, SI011
CI012	Cognition AI's seed/Series A in March 2024 raised approximately $21M at a $350M valuation, led by Founders Fund.	High	SI002, SI023, SI020
CI013	Cognition AI's April 2024 funding round raised $175M at a $2B post-money valuation, led by Founders Fund.	High	SI002, SI014, SI020
CI014	Cognition AI's September 2025 Series B raised $400M at a $10.2B post-money valuation with Lux Capital, 8VC, Elad Gil, Bain Capital Ventures, and D1 Capital as co-investors.	High	SI008, SI009, SI010
CI015	Cognition AI disclosed a net cash burn of under $20M from founding through Q3 2025—extraordinary capital efficiency for a company at this ARR growth rate.	Medium	SI001, SI009
CI016	Reports as of April 2026 indicate Cognition AI is in discussions for new financing at a $25B+ valuation.	Low	SI011, SI012
CI017	Cognition AI does not disclose gross margin, customer churn, NRR, CAC, or GAAP financial statements.	High	SI005, SI007, SI016
CI018	All ARR figures for Cognition AI rely on company blog posts, third-party news citing unnamed sources, or secondary market research—no audited financial data is available.	High	SI003, SI005, SI016
CI019	The Windsurf acquisition was estimated at approximately $250M, funded from Cognition's existing capital reserves without a new primary fundraise.	Low	SI006, SI010
CI020	Founders Fund led both the March 2024 seed/Series A and the April 2024 Series A extension, providing deep capital commitment and reputational signal.	High	SI002, SI008, SI020
CI021	A $25B+ valuation on $155M ARR implies a forward revenue multiple of approximately 160x, elevated even by hypergrowth AI company standards.	Medium	SI011, SI012
CI022	Cognition AI's total capital raised through Series B is approximately $596M excluding the Windsurf acquisition; total capital deployed is approximately $846M when including the acquisition.	Medium	SI008, SI009, SI010
CI023	Post-Windsurf headcount reportedly reached approximately 249 employees before layoffs of 30 and additional voluntary departures from buyout offers.	Medium	SI019, SI023
CI024	SEC Form D/A for Founders Fund IX, LP (CIK 0001971631, filed October 2025) discloses total capital raised of approximately $972M across the LP and principals fund, with Peter Thiel as managing member.	Medium	SI026
CI025	Founders Fund IX Principals Fund (CIK 0002090410) is a co-issuer with Founders Fund IX, LP per the amended SEC Form D/A, indicating a typical GP co-investment vehicle structure.	Medium	SI026
CI026	Cognition AI opened its Singapore APAC headquarters in April 2026, hiring Richard Spence as VP and General Manager APAC, targeting Southeast Asia, Australia, India, and South Korea.	High	SI027, SI018
CI027	Mercedes-Benz announced deployment of Devin and Windsurf across its global engineering organization in April 2026, focusing on legacy modernization, cloud-native development, and logistics applications.	Medium	SI028, SI018
CI028	Cognition AI introduced a revised pricing structure in April 2026, replacing the Core and Team plans with Free, Pro, Max, Teams, and Enterprise tiers, signaling a land-and-expand monetization strategy.	High	SI018, SI004
CI029	Cognition AI's annual Team plan pricing of $6,000 per developer is approximately 15x more expensive than GitHub Copilot Enterprise ($390/yr) and 25x more than Cursor Pro ($240/yr), justified by task-completion vs. code-suggestion value.	Medium	SI004, SI011
CI030	Total capital raised ($596M through Series B) relative to ARR at time of last raise ($73M Devin ARR at Series B in Sep 2025) implies a capital-to-ARR ratio of approximately 8x—high but declining as ARR compounds.	Medium	SI008, SI009, SI003
CI031	The 9-month severance buyout offered to approximately 200 Windsurf employees represents an estimated one-time integration cost of $15–30M, calculated on average startup engineer salaries of $100–150K.	Low	SI019
CI032	Cognition AI's April 2026 APAC office opening and Mercedes-Benz partnership signal material increases to operating expenditure in H1 2026 through regional headcount and GTM investment.	Low	SI027, SI028
CI033	Enterprise AI SaaS tools with variable compute cost structures typically achieve gross margins of 55–75% at scale, per industry benchmarks; Cognition's agentic model adds inference cost variance absent from seat-based tools.	Low	SI011, SI015
CI034	Cognition AI has not filed Form D or equivalent exempt offering notice with the SEC as of May 2026; the company's capital raises are documented only through investor disclosures and third-party press reports.	High	SI016, SI023, SI026
CI035	If the Windsurf $82M ARR churns by 30% post-acquisition migration, combined Cognition ARR falls to approximately $128M—a scenario not publicly addressed by company management.	Low	SI006, SI003, SI011
CE001	Devin is a cloud-hosted autonomous AI software engineer that accepts natural-language task specifications and executes complete software engineering tasks—planning, coding, testing, and deploying—without human intervention during execution.	High	SE002, SE004
CE002	In the original SWE-bench evaluation (March 2024), Devin resolved 13.86% of issues unassisted—far exceeding the prior state-of-the-art of 1.96% for unassisted agents and 4.80% for assisted LLMs.	High	SE003, SE006, SE020
CE003	Devin's agent runtime executes tasks within a sandboxed cloud environment equipped with a code editor, Unix shell, and headless browser—isolated from other sessions and the customer's production environment.	High	SE002, SE001
CE004	Each Devin session is bounded by a maximum runtime of 45 minutes; longer tasks must be broken into sequential sub-sessions.	Medium	SE003, SE001
CE005	Devin integrates natively with GitHub, GitLab, and Bitbucket for source control, enabling automated PR creation, branch management, and code contributions.	High	SE007, SE001
CE006	Devin integrates natively with Jira and Linear for project management ticket assignment and sprint delegation.	High	SE007, SE001
CE007	Devin integrates natively with Slack and Microsoft Teams for session initiation via chat messages and progress reporting.	High	SE007, SE001
CE008	Agent Compute Units (ACUs) are the proprietary metered compute token consumed per Devin task, priced at approximately $2.25 per ACU with plan-included allocations varying by tier.	High	SE004, SE008
CE009	Devin Enterprise tier offers VPC deployment allowing the agent to operate within customer-controlled cloud infrastructure, and custom-trained Devin instances tuned to proprietary codebases.	High	SE001, SE004
CE010	Devin accepts task inputs from multiple channels: web dashboard, Slack messages, Jira tickets, and Linear issues—translating natural-language instructions into executable multi-step plans.	High	SE001, SE002
CE011	Nubank achieved 12x engineering efficiency improvement and 20x cost savings using Devin to migrate a 6-million-line ETL monolith; Data, Collections, and Risk business units completed migrations in weeks instead of months.	Medium	SE016, SE014
CE012	SWE-Check, released April 2026 in collaboration with Applied Compute, is a reinforcement-learning-trained bug detection model that matches Anthropic Opus 4.6 performance while running approximately 10x faster.	Medium	SE024, SE002
CE013	The DeepWiki server builds vectorized project graphs of entire codebases, enabling Devin to navigate and reason about multi-million-line repositories beyond standard LLM context window limits.	Medium	SE001, SE002
CE014	The Windsurf acquisition (July 2025) added 350+ enterprise customers, 250,000+ daily active users, and the Windsurf IDE product including its Cascade agentic workflow engine.	High	SE015, SE022
CE015	In late 2024, a live-streamed demonstration publicly exposed a major security vulnerability in Devin's system prompt handling, potentially enabling prompt-injection attacks. Cognition acknowledged and patched the issue.	Medium	SE017, SE009
CE016	Devin 2.0 (April 2025) achieved 83% more tasks completed per ACU compared to Devin v1.x, representing a significant efficiency improvement for existing subscribers.	Medium	SE008, SE019
CE017	Devin 2.0 introduced a confidence meter that quantifies the probability of task success before and during execution, allowing teams to filter low-probability tasks and prioritize high-value delegations.	Medium	SE008, SE001
CE018	Model Context Protocol (MCP) integration allows Devin to connect to hundreds of external tools including monitoring platforms, databases, and documentation systems through the MCP Marketplace.	Medium	SE007, SE001
CE019	The Devin REST API enables programmatic session creation, status retrieval, and CI/CD pipeline integration for DevOps automation workflows.	Medium	SE007, SE001
CE020	A 32-week empirical study of 7,156 real-world pull requests showed Devin's PR acceptance rate increased by +0.77% per week—the only agent in the study with a consistent positive trend.	Medium	SE011, SE010
CE021	BlockDiff snapshotting records incremental state checkpoints during Devin's execution, enabling rapid rollback to a known-good state when tests fail or an approach proves unworkable.	Medium	SE001, SE002
CE022	Devin supports parallel task execution across multiple concurrent sessions, allowing enterprise teams to delegate entire sprint backlogs simultaneously rather than one task at a time.	Medium	SE001, SE004
CE023	Claude Code (Opus 4) achieved 72.5% on SWE-bench Verified (500 instances) in 2025, far exceeding Devin's original 13.86% benchmark score—though the two benchmarks use different task sets.	Medium	SE010, SE012
CE024	mini-SWE-agent achieved 65% on SWE-bench Verified in July 2025, matching most commercial agents at a fraction of the computational cost—a signal that benchmark performance is rapidly commoditizing.	Medium	SE006, SE020
CE025	Devin for Terminal (2026) allows developers to start an agent session locally from the terminal and escalate to cloud execution when the task outgrows local compute—preserving session state across the transition.	Medium	SE024, SE001
CE026	The CognitionAI/devin-swebench-results GitHub repository has 124 stars and 20 forks as of research date, indicating modest developer interest in the benchmark methodology transparency.	Medium	SE005
CE027	Cognition AI's 2026 product roadmap includes Devin-in-Windsurf integration, SWE-1.5 model improvements, SWE-Check bug detection, APAC regional expansion (Singapore), and Mercedes-Benz enterprise deployment.	Medium	SE024, SE015
CE028	Cognition AI has not publicly disclosed SOC 2 Type II, ISO 27001, or equivalent compliance certifications as of May 2026, representing a gap for regulated-industry enterprise procurement.	High	SE001, SE004, SE021
CE029	All Devin sessions run in isolated ephemeral compute containers, and code changes are surfaced only as pull requests requiring human approval before merging—preventing unauthorized production deployments.	High	SE001, SE004
CE030	The MCP Marketplace integrates Devin with monitoring tools (Sentry, Datadog, PagerDuty), databases (PostgreSQL, MySQL, MongoDB), and documentation (Notion, Confluence), enabling context-rich agent execution.	Medium	SE007, SE001
CE031	Enterprise VPC deployment allows Devin to execute tasks within the customer's own AWS, GCP, or Azure environment, eliminating source-code transmission to Cognition's external servers.	Medium	SE001, SE004
CE032	Windsurf Codemaps, powered by SWE-1.5 and Claude Sonnet 4.5, provides AI-annotated structured maps of codebases with 'trace guide' expansions, enabling navigation to exact file and line references.	Medium	SE024, SE013
CE033	The Devin REST API uses standardized HTTP endpoints for session creation, status polling, result retrieval, and webhook-based notifications, enabling integration into CI/CD pipelines.	Medium	SE007, SE001
CE034	GitHub Copilot was ranked a Leader in Gartner's 2025 Magic Quadrant for AI Code Assistants, representing the current benchmark for enterprise adoption against which Devin competes.	Medium	SE010, SE012
CE035	A 2026 empirical study of AI coding PRs across 7,156 tasks found documentation tasks achieved 82.1% acceptance versus 66.1% for new feature development—a 16-percentage-point gap exceeding inter-agent variance.	Medium	SE011, SE018
CU001	Cognition AI targets engineering teams at technology-forward enterprises, midmarket companies, and developer-first organizations, with buyer profiles including CTOs and VPs of Engineering seeking developer throughput multiplication.	High	SU001, SU021
CU002	Confirmed enterprise customer verticals include fintech/financial services (Nubank), automotive/manufacturing (Mercedes-Benz), IT services/outsourcing (Cognizant), US Federal Government (Feb 2026), and COBOL-heavy Fortune 500 legacy sectors.	High	SU002, SU003, SU004, SU006
CU003	Mercedes-Benz was announced in April 2026 as a production customer deploying Devin and Windsurf across its global engineering organization for legacy modernization, cloud-native development, and logistics.	Medium	SU003
CU004	Cognizant partnered with Cognition in January 2026 to deploy Devin and Windsurf across its own engineering organization and its global client base—a channel partnership that could expose Cognition to Cognizant's 300+ clients.	Medium	SU004, SU014
CU005	Cognition launched a Government vertical (Cognition for Government) in February 2026 to target US federal agencies for critical infrastructure software modernization; no named agencies have been publicly disclosed.	Medium	SU006
CU006	Cognition AI's estimated ARR grew from approximately $1M at general availability (December 2024) to approximately $73M by April 2025—a roughly 70x increase in approximately four months, coinciding with the price cut from $500/month to $20/month.	Low	SU007, SU008
CU007	Devin's PR merge rate (fraction of Devin-opened PRs accepted by human reviewers) improved from 34% at launch (March 2024) to 67% by April 2025—a doubling of accepted output quality in approximately 12 months.	Medium	SU008, SU009
CU008	By February 2026, Cognition's own engineering team was merging 659 Devin-authored pull requests per week—a 4x increase from their best week of 154 PRs in 2025.	High	SU017, SU025
CU009	Independent reviewers note that real-world ACU consumption on the $20/month Core plan can easily result in monthly bills of $100-200 for moderate usage, contradicting the headline price reduction's accessibility promise.	Medium	SU008, SU009
CU010	Windsurf IDE had 250,000+ daily active users and 350+ enterprise customers at the time of acquisition (July 2025), providing Cognition with a large top-of-funnel developer install base for Devin conversion.	High	SU010, SU014
CU011	Nubank used a fine-tuned custom Devin instance for ETL migration; fine-tuning doubled task completion scores and reduced per-task time from 40 minutes to 10 minutes (4x speed improvement).	Medium	SU002
CU012	Cognition opened offices in London (January 2026), Tokyo (April 2026), and Singapore (April 2026) within a four-month period, signaling management's confidence in revenue sufficiency for global expansion.	High	SU005, SU006
CU013	Net Revenue Retention (NRR), Gross Revenue Retention (GRR), and customer churn rate have not been publicly disclosed by Cognition AI; retention durability cannot be independently assessed.	Medium	SU007, SU008
CU014	Hacker News developer community expressed skepticism about Devin's real-world performance versus demo quality; early reviews noted that complex or novel tasks frequently required significant rework, exceeding expectations set by the SWE-bench launch announcement.	Medium	SU011, SU023
CU015	The security vulnerability disclosed in December 2024—a live-streamed prompt-injection attack on Devin—temporarily damaged trust among security-conscious enterprise buyers, requiring Cognition to accelerate security remediation communication.	Medium	SU018, SU011
CU016	Nubank's use of Devin expanded from the initial Data unit pilot to Data, Collections, and Risk business units, confirming land-and-expand progression within a single named enterprise customer.	Medium	SU002
CU017	Cognition AI's land-and-expand strategy relies on starting with a discrete project (migration, modernization) within one business unit, demonstrating ROI, then expanding to adjacent teams—as demonstrated by the Nubank case.	Medium	SU002, SU004
CU018	Windsurf's 250,000+ daily active users serve as a top-of-funnel for Devin cloud agent adoption: Windsurf Free users experiencing AI-assisted local development are natural conversion targets for cloud-based autonomous task delegation.	Medium	SU010, SU014
CU019	Devin's COBOL modernization use case is inherently project-based—once a legacy migration is complete, there is no structural recurring need unless customers contract for ongoing support or continued modernization work.	Medium	SU004, SU016
CU020	Top-customer concentration is a material but undisclosed diligence gap: with ~$73M ARR and a small set of confirmed named accounts, a single Nubank-scale contract could represent more than 10% of ARR.	Medium	SU007, SU008
CU021	Cognizant's reseller partnership creates a single-channel concentration risk: if Cognizant accounts for a material share of Cognition's enterprise pipeline, the loss of that relationship would disproportionately impact new account growth.	Medium	SU004, SU022
CU022	The self-serve developer tier (Free and Pro plans) provides a low-ACV but high-volume customer base that can generate bottom-up enterprise procurement pressure, analogous to Slack's or Atlassian's PLG motion.	Medium	SU008, SU009, SU015
CU023	Growjo estimates Cognition AI's total funding at $1.4B and current valuation at $10.2B as of September 2025, consistent with reports of a $400M fundraise following the Windsurf acquisition.	Low	SU007, SU010
CU024	Cognition AI had approximately 222 employees as of 2025 and grew headcount by approximately 102% year-over-year—a pace consistent with rapid revenue scaling but also requiring significant talent investment.	Low	SU007, SU020
CU025	The absence of SOC 2 Type II and FedRAMP compliance attestations in public documentation limits Cognition's addressable market among regulated enterprises (financial services, healthcare, US government) that require these certifications for procurement.	High	SU016, SU021
CU026	Devin's PR merge rate of 67% implies a 33% rejection or rework rate on AI-generated pull requests, which may be acceptable for high-volume routine tasks but represents a significant quality gap versus a senior human engineer.	Medium	SU008, SU019
CU027	Fortune 500 COBOL modernization deployments are confirmed via Cognition's April 2026 blog post, but no named customers, outcome metrics, or contract structures are disclosed—limiting diligence quality.	Medium	SU004
CU028	Geographic expansion from US-only to Europe and APAC in Q1-Q2 2026 within four months demonstrates aggressive international GTM investment, creating both market opportunity and potential cash burn risk from multi-office overhead.	High	SU005, SU006
CU029	The imseankim.com review noted ARR growth from $1M to $73M in nine months while also identifying that 'independent testers tell a more complicated story' compared to official benchmarks—suggesting a gap between marketing and user-level satisfaction at the margin.	Medium	SU008
CU030	Devin supports concurrent multi-session task execution, allowing enterprise teams to delegate entire sprint backlogs simultaneously, which is cited as a key adoption driver for teams managing high-volume routine work.	Medium	SU021, SU024
CU031	The Devin PR acceptance rate empirical study (Arxiv 2026) showed a +0.77%/week positive trend—the only agent with sustained improvement—suggesting iterative model quality gains that support long-term customer retention.	Medium	SU019
CU032	Cognition AI's Government vertical launch (February 2026) positions Devin as a tool for US federal legacy software modernization, a multi-billion dollar addressable budget segment, but no government contract or agency name has been publicly disclosed.	Medium	SU006
CU033	Nubank's ETL migration case study is the only published, quantified production customer proof; Mercedes-Benz, Cognizant, and Fortune 500 COBOL deployments lack independent third-party verification or outcome data.	High	SU002, SU003, SU004
CU034	Devin Autofix Review Comments (January 2026) closes a key customer workflow loop—Devin automatically addresses pull request review feedback—substantially improving the PR review cycle productivity for enterprise teams.	Medium	SU029, SU017
CU035	The Cognizant channel partnership bypasses individual enterprise sales cycles by embedding Devin into Cognizant's managed services offering—a go-to-market shortcut that scales account coverage rapidly but creates single-reseller revenue concentration risk.	Medium	SU004, SU021
CR001	The EU AI Act (Regulation 2024/1689) imposes GPAI transparency and documentation obligations on providers of general-purpose AI models effective August 2026; Cognition has not publicly disclosed its GPAI registration status, model transparency documentation, or training data copyright compliance policy.	High	SR009, SR007
CR002	Under GDPR and UK GDPR, organizations must conduct Data Protection Impact Assessments for high-risk AI processing; Devin's autonomous code writing, PR merging, and production deployment capabilities likely qualify as high-risk processing under GDPR Article 35, requiring DPIAs from EU/UK enterprise customers.	High	SR010, SR009
CR003	California SB 1047, which would have imposed safety requirements on frontier AI developers, was vetoed by Governor Newsom in September 2024; however, successor legislation including AB 2013 remained active in the 2025–2026 California legislative session, creating ongoing monitoring obligation.	Medium	SR006, SR008
CR004	The Doe v. GitHub, Microsoft, and OpenAI class action alleges that training AI coding models on public GitHub repositories without license compliance constitutes copyright infringement; Cognition has not disclosed its model training data provenance, creating analogous latent IP exposure under the same copyright theory.	Medium	SR011, SR007
CR005	The January 2025 White House Executive Order revoked prior Biden-era AI safety mandates, creating a deregulatory US federal environment for AI developers; however, agency-level rules for government AI procurement—relevant to Cognition's announced Government vertical—remain under development.	High	SR008, SR006
CR006	In December 2024, Cognition publicly disclosed a prompt-injection vulnerability in Devin discovered during a live-streamed demonstration; the vulnerability could enable an adversary to embed malicious instructions in repository content, causing Devin to exfiltrate credentials or insert backdoors in customer codebases.	Medium	SR001, SR013
CR007	The OWASP Top 10 for LLM Applications lists Prompt Injection (LLM01) as the highest-severity risk for LLM-based applications; for autonomous agents with code execution, PR merge, and terminal access permissions, a successful prompt-injection exploit has a blast radius materially larger than for passive coding assistants.	High	SR003, SR004
CR008	Cognition obtained SOC 2 Type II certification in March 2024, prior to the December 2024 prompt-injection incident; the SOC 2 report is not publicly accessible without executing an NDA through the Cognition Trust Center, limiting independent assessment of control effectiveness by enterprise procurement teams.	High	SR001, SR002
CR009	Devin 2.0 (April 2025) introduced the ability to directly merge pull requests and schedule autonomous agent runs on customer infrastructure, materially expanding the attack surface and potential blast radius of any future prompt-injection or supply-chain compromise beyond what existed at the December 2024 incident.	Medium	SR016, SR003
CR010	Cognition's security documentation acknowledges that Devin code output may contain hallucinations, bugs, or insecure code, and recommends code reviews and branch protections as mitigations; this acknowledged limitation creates ongoing liability exposure for enterprise customers deploying Devin in production code pipelines without mandatory human review gates.	High	SR001, SR004
CR011	Cognition is structurally dependent on Anthropic and OpenAI as foundation model API providers; these same companies compete with Devin through Claude Code and OpenAI Codex, creating a supply chain where the primary suppliers have economic incentives to disadvantage a dependent customer.	High	SR023, SR015
CR012	AWS provides the cloud infrastructure for Devin's VPC deployment option; Devin's enterprise architecture is AWS-native with no disclosed multi-cloud or on-premises fallback, creating material infrastructure concentration risk in a single cloud vendor.	Medium	SR001, SR016
CR013	GitHub (Microsoft) provides the primary code repository integration for Devin's PR workflow; Microsoft also distributes GitHub Copilot as a direct Devin competitor, meaning Cognition depends on a competitor-owned platform for core product functionality.	High	SR015, SR007
CR014	Cognition's acquisition of Windsurf (July 2025) added 350+ enterprise accounts and 250K+ daily active users but introduced integration execution risk: two AI agent codebases, model pipelines, enterprise sales motions, and billing systems must be combined simultaneously while maintaining ARR growth momentum.	Medium	SR018, SR030
CR015	Cognition's three co-founders have exceptional technical credentials as competitive programmers but no prior track record building enterprise software companies beyond early-stage startups; execution risk is elevated as the company scales from startup to enterprise sales motion across multiple geographies simultaneously.	Medium	SR025, SR017
CR016	AI engineering talent competition is severe; Anthropic, OpenAI, Google DeepMind, and Microsoft actively recruit competitive programming and ML talent at packages estimated above $500K total compensation, directly competing with Cognition's talent pool and creating ongoing retention risk.	Medium	SR023, SR015
CR017	Cognition's international expansion to Japan, Singapore, EU, and US Government sectors requires jurisdiction-specific compliance infrastructure—EU DPO appointment, Japan PIPL compliance, FedRAMP authorization—with no publicly disclosed plans or timelines, creating a gap between announced customer segments and compliance readiness.	Medium	SR009, SR010
CR018	At Devin's March 2024 launch its SWE-bench Full score of 13.86% was the then-best result for autonomous coding agents; by mid-2025 Claude Code Opus 4 achieved 72.5% on SWE-bench Verified—a competitive improvement rate that threatens to eliminate Devin's technical differentiation within 12–18 months.	High	SR015, SR023
CR019	The FTC stated that concentrated control of foundational AI inputs—including foundation models and cloud compute—could allow providers to distort competition in downstream AI application markets; this is a direct structural risk for Cognition given its dependence on Anthropic/OpenAI (models) and AWS (compute), both of which are also competitive threats.	High	SR007, SR011
CR020	ARR growth from ~$1M at GA (December 2024) to ~$73M estimate (April 2025) across a small number of named enterprise customers implies high per-customer ARR concentration; individual churn events at top customers could cause material ARR declines capable of impairing the $10.2B valuation estimate.	Medium	SR019, SR020
CR021	Devin's ACU pricing model ($2.25 per ACU overage) creates unpredictable usage-based cost exposure for enterprise customers; 'bill shock' from unexpected ACU consumption is a cited churn risk factor in independent community analysis of early Devin adopters.	Medium	SR020, SR028
CR022	Cognition has raised approximately $1.575B in disclosed funding through mid-2026; with 222 employees, 102% annual headcount growth, and compute-intensive AI infrastructure, estimated monthly burn is $5–15M, implying 18–36 months of runway under conservative scenarios assuming $73M ARR and aggressive growth continuation.	Medium	SR024, SR021
CR023	LLM inference costs for Devin sessions—paid at commercial API rates to Anthropic and OpenAI—compress gross margins structurally; no path to proprietary model development or renegotiated volume pricing is publicly disclosed, leaving margin compression risk unaddressed.	Medium	SR023, SR016
CR024	GitHub Copilot's distribution through Microsoft enterprise sales channels—250M+ installs, Microsoft 365 bundling potential, Azure integration—represents a structural distribution advantage that benchmark performance alone cannot overcome for Cognition.	High	SR015, SR027
CR025	Cursor's approximately $500M ARR at $9B valuation with $40/month pricing represents a 12.5× price advantage over Devin's Team plan at $500/month; this price sensitivity gap creates pricing risk as competitors improve and enterprise buyer ROI scrutiny increases.	Medium	SR027, SR019
CR026	Nubank is the only named customer with published and quantified production deployment outcomes; $73M ARR from an undisclosed number of additional enterprise accounts implies high revenue concentration risk and limits independent verification of Cognition's enterprise product-market fit beyond one case study.	Medium	SR026, SR020
CR027	Developer community analysis of the December 2024 prompt-injection incident described Cognition's prior credibility as damaged, noting: Devin's claimed product superiority was inconsistent with observed PR quality; the security gap was described as 'amateurish' for a product in development for over a year.	Medium	SR013, SR014
CR028	The OWASP GenAI Security Project identifies agent autonomy as an emerging, distinct risk category beyond classical LLM vulnerabilities; agents capable of modifying code, merging PRs, and executing shell commands face compound risks where a single exploit can traverse multiple security boundaries simultaneously.	High	SR004, SR003
CR029	CVE-2024-5185 (EmbedAI CSRF data-poisoning) demonstrates that AI application platforms face real-world vulnerability classes—CSRF, data poisoning—beyond prompt injection; the NVD tracks similar classes across AI applications, indicating a broad and evolving vulnerability landscape for agentic coding platforms like Devin.	Medium	SR005, SR003
CR030	EU AI Act GPAI provisions require frontier model providers to publish summary information about training data, establish copyright compliance policies, and publish technical documentation; Cognition has not publicly disclosed model training data provenance, open-source license compliance scope, or copyright policy.	Medium	SR009, SR001
CR031	The ICO's AI and data protection risk toolkit requires organizations to conduct DPIAs for high-risk AI processing; Devin's autonomous PR merges and production deployments on customer infrastructure in EU member states likely qualify under GDPR Article 35, creating DPIA obligations for UK/EU customers.	Medium	SR010, SR009
CR032	Doe v. GitHub and OpenAI consolidated copyright litigation (In Re: OpenAI Copyright Infringement, S.D.N.Y. 2025) involves claims that AI model training on copyrighted code and text violates copyright law; an adverse ruling could require Cognition to disclose or remediate its training data, imposing significant operational and legal costs.	Medium	SR011, SR004
CR033	Cognition's January 2025 Series A was led by Founders Fund at a $2B post-money valuation; a security incident, major regulatory enforcement action, or sustained competitor improvement could trigger down-round financing pressure at the next capital raise given the high valuation multiple relative to current ARR.	Medium	SR017, SR024
CR034	Developer community skepticism at Devin's launch—documented on HN and via independent benchmarking—highlighted concerns about benchmark reliability, limited demo scope, and performance gaps for non-standard tasks; this adverse reputation dynamic increases adoption friction among technically sophisticated enterprise evaluators.	Medium	SR014, SR013
CR035	Cognition's headcount grew 102% year-over-year to approximately 222 employees (Growjo, April 2025); this rapid hiring pace, without disclosed revenue-per-employee targets or profitability milestones, increases burn rate risk if ARR growth decelerates below the ~70×/year pace observed in the early 2025 growth sprint.	Medium	SR021, SR020
CR036	arXiv research on AI-assisted PR acceptance rates over 32 weeks shows a measurable +0.77%/week improvement trend, validating that autonomous coding agent productivity is improving; however, the long-run acceptance ceiling and transition from supervised to fully autonomous PR merges at enterprise scale remains empirically unproven.	Medium	SR022, SR015
CR037	Under GDPR Article 35, automated decision-making with significant effects on individuals requires a DPIA; Devin's autonomous code deployments affecting production systems could qualify in EU deployments, creating DPIA obligations that enterprise customers must fulfill before deploying Devin at scale.	Medium	SR010, SR002
CR038	Cognition's pricing dropped by approximately 3× with Devin 2.0 (April 2025), from higher effective ACU pricing to lower tiers; while improving adoption, this pricing compression requires proportional volume growth to sustain ARR trajectory and reduces the pricing power available to offset future LLM API cost increases.	Medium	SR028, SR019
CR039	The White House AI EO of January 2025 creates a deregulatory federal environment that reduces US-federal compliance burden for Cognition in the near term; but agency rules for government AI procurement—relevant to Cognition's announced Government customer vertical—may introduce FedRAMP and NIST AI RMF requirements on a 12–24 month timeline.	Medium	SR008, SR006
CR040	Independent developer analysis consistently identifies Devin's ACU cost model opacity and unpredictable session-level billing as adoption barriers; enterprise customers in community analysis report difficulty estimating total cost of ownership, creating adoption friction beyond the pilot phase.	Medium	SR013, SR020
CR041	SWE-bench Verified (500 human-filtered instances) is considered more reliable than SWE-bench Full (2294 instances); Claude Code Opus 4's 72.5% Verified score versus Devin's 13.86% Full score reflects both a benchmark type difference and a fundamental performance gap that, unless Cognition closes it, will increasingly drive enterprise procurement decisions toward Anthropic's product.	High	SR015, SR027
CR042	Cognition's terms of service prohibit using Devin outputs to train competing models, but do not publicly disclose the provenance of Devin's own training data; this asymmetry creates potential legal exposure under the same copyright theories being litigated against GitHub and OpenAI in the Doe v. GitHub class action.	Medium	SR001, SR011
CV001	Cognition AI's $10.2B September 2025 post-money valuation divided by an estimated $73M trailing ARR (April 2025) implies approximately a 140× ARR multiple—a 7× premium to the closest comparable Cursor/Anysphere, which closed a $9.9B Series D at approximately $500M ARR, implying ~20× ARR.	Medium	SV011, SV012, SV010
CV002	The recommendation for Cognition AI at $10.2B valuation is watchlist / conditional invest: the bull thesis (70×/year ARR growth, Windsurf integration moat, enterprise lighthouse customers) is structurally compelling but insufficient to justify a risk-adjusted positive return at 140× ARR without NRR, gross margin, and top-customer concentration data.	Medium	SV012, SV013, SV017
CV003	Entry discipline recommendation: investors should target a Cognition AI entry valuation below $5B (implying ≤70× current estimated ARR), at which the bull case returns are 3.5–5× in 3 years and the bear case returns imply losses limited to 50–80% of investment, reflecting a more acceptable risk-reward asymmetry.	Low	SV012, SV013
CV004	Cognition AI raised $175M at a $2B valuation in its Series A round led by Founders Fund IX (CIK 0001971631, EDGAR); this represents a 5.1× step-up to the September 2025 $10.2B round in approximately nine months—one of the fastest large-round step-up trajectories in AI software startup history.	High	SV002, SV011
CV005	Cognition Capital SPV I (CIK 0002072175, Form D filed June 2025) was structured contemporaneously with the Windsurf acquisition; the SPV structure suggests one or more institutional investors required a structured co-investment vehicle, which is a common signal of concentration risk in a single institution funding the Windsurf consideration.	Medium	SV001, SV003
CV006	A 'culture reset' at Cognition post-Windsurf acquisition—reported as Cognition offering nine-month buyouts to Windsurf staff and requiring 80-hour workweeks—introduces talent attrition risk that is not reflected in the $10.2B headline valuation and could impair the 250K+ DAU Windsurf user base that underpins the IDE moat thesis.	Medium	SV021, SV014
CV007	Founders Fund IX's portfolio page confirms Cognition AI as a portfolio company; Founders Fund IX filed an amended Form D (D/A) in October 2025, indicating continued capital activity in the fund consistent with ongoing Cognition investment or follow-on management activity.	High	SV003, SV007
CV008	Under the bull scenario, Cognition AI reaches $700–800M ARR by end-2027 (implying ~45% quarterly growth sustained for 10 quarters from the April 2025 base), at which a 25–35× forward ARR exit multiple would imply an exit valuation of $18–28B, representing a 1.5–2.5× return from the $10.2B September 2025 round.	Low	SV012, SV013
CV009	Under the base scenario, ARR growth decelerates from 9-month 70×/year pace to 30–40% quarterly growth, reaching $200–350M ARR by end-2027; at 20–25× forward ARR multiple, the implied exit valuation is $4–9B—flat to slight loss relative to the $10.2B entry, implying negative risk-adjusted returns for investors entering at $10.2B.	Medium	SV012, SV013
CV010	Under the bear scenario, a material adverse event (second security incident, major customer churn, or competitor price undercut >3×) causes ARR to plateau at $80–120M with NRR below 90%; at 8–12× trailing ARR, the implied exit valuation is $640M–$1.4B, representing a 90%+ capital loss for investors at $10.2B entry.	Medium	SV017, SV019
CV011	Cursor/Anysphere's Series D at $9.9B on approximately $500M ARR implies a 20× ARR multiple; Cognition's $10.2B valuation at approximately $73M ARR implies a 140× multiple, representing a 7× premium to Cursor on a comparable product category—enterprise AI coding tools—that is only sustainable if Cognition grows 7× faster to ARR parity with Cursor within 12–18 months.	Medium	SV010, SV012
CV012	Harvey AI (legal AI) at approximately $3B valuation and ~$100M ARR implies a 30× ARR multiple; high NRR (reportedly above 150% for legal enterprise accounts) justifies this premium. Cognition at 140× ARR requires NRR disclosure at a comparable level to avoid multiple compression, but NRR is currently undisclosed.	Low	SV018, SV013
CV013	Glean (enterprise AI search) at $4.6B valuation and approximately $100M ARR implies a 46× ARR multiple; its NRR above 150% and lack of benchmark commoditization risk distinguish it from Cognition. Cognition's 140× ARR multiple exceeds even high-NRR AI SaaS peers, suggesting the market is pricing in either a unique category premium or future ARR disclosure that narrows the apparent gap.	Low	SV018, SV004
CV014	Replit (developer platform, $1.16B valuation, 2023) represents the category risk for Cognition: a developer productivity pioneer that lost relative market share to more focused AI-native competitors (Cursor, Devin) during the 2024–2025 agentic coding transition. The Replit trajectory is a base-case downside reference for Cognition's own category risk.	Medium	SV018, SV017
CV015	GitHub Copilot, embedded in Microsoft's GitHub Enterprise product at $19–$39 per seat per month, represents a structural pricing threat: Microsoft can subsidize Copilot through Azure and M365 bundling, enabling sustained price pressure at a price point approximately 15–25× below Devin's $500/month Team plan while offering comparable pass-level coding assistance.	Medium	SV009, SV017
CV016	OpenAI Codex positions directly as an autonomous coding agent—similar to Devin—in Q2 2025, representing a new competitive threat from a foundation model provider that both supplies Devin's underlying intelligence and competes for the same enterprise developer productivity budget.	High	SV009, SV020
CV017	Amazon Q Developer (AWS-bundled AI coding assistant) competes with Devin for enterprise developer productivity budget within AWS cloud deployments; its bundling within AWS Enterprise Support creates lock-in dynamics that disadvantage Devin for customers already committed to the AWS ecosystem.	Medium	SV009, SV018
CV018	A second security incident within 18 months of the December 2024 prompt-injection disclosure is the primary thesis-break trigger; the probability is moderate given Devin 2.0's expanded attack surface (autonomous PR merges, scheduled agents) and the absence of a published post-Devin-2.0 penetration test or bug-bounty program.	Medium	SV019, SV020
CV019	Customer concentration risk—undisclosed top-customer ARR share—is the primary financial thesis-break variable. If Nubank accounts for greater than 30% of the estimated $73M ARR, a Nubank renewal risk is a material single-event that could reduce ARR by $20M+ without any other change in business trajectory.	Medium	SV027, SV012
CV020	Benchmark commoditization risk is the primary competitive thesis-break: Claude Code Opus 4's 72.5% SWE-bench Verified score (2025) versus Devin's 13.86% Full score (2024 launch) demonstrates a 5× improvement rate in 15 months that, if sustained, eliminates Devin's technical differentiation by late 2026.	High	SV017, SV020
CV021	NRR below 90% would be a thesis-break for any investment at $10.2B; NRR below 90% implies net negative ARR cohort contribution, meaning the existing customer base is contracting, and sustaining headline ARR growth requires ever-larger new logo acquisition to compensate—a structurally unsustainable growth model at enterprise SaaS scale.	High	SV013, SV012
CV022	EU regulatory enforcement action under the EU AI Act GPAI provisions (effective August 2026) could restrict Cognition's EU market access; Cognition expanded to Europe (London office, January 2026) without disclosed GPAI compliance infrastructure, creating a potential Q3 2026 enforcement risk.	Medium	SV022, SV023
CV023	Net Revenue Retention (NRR) by cohort is the single most critical undisclosed financial metric for valuation purposes; it determines whether the $73M ARR estimate reflects durable enterprise contracts or early-adopter pilots with high churn, a distinction worth $2–5B in implied valuation.	High	SV012, SV013
CV024	Gross margin is the second critical undisclosed metric; Devin's per-session LLM API costs (Anthropic/OpenAI commercial rates) are substantial for compute-intensive multi-hour sessions, and gross margin below 30% would signal an unsustainable cost structure that requires either proprietary model development or volume pricing renegotiation before profitability.	Medium	SV020, SV028
CV025	Post-Devin-2.0 penetration testing and red-team disclosure is required for enterprise procurement security sign-offs; the December 2024 incident occurred before Devin 2.0's expanded agentic capabilities (PR merges, scheduling), meaning the existing SOC 2 Type II audit (March 2024) does not cover the current expanded attack surface.	High	SV019, SV028
CV026	Windsurf customer retention at 6 months post-acquisition (January 2026) is a critical proof point for the IDE moat thesis; if the 350 enterprise accounts acquired with Windsurf show greater than 15% churn, the Windsurf contribution to ARR is overstated and the platform differentiation thesis weakens.	Medium	SV021, SV014
CV027	LLM API contract terms with Anthropic and OpenAI—specifically volume pricing floors, agentic use-case restrictions, and change-of-control provisions—are undisclosed but material; a scenario where Anthropic raises API pricing 3× as Claude Code gains enterprise share would compress Devin's gross margin materially and impair the $10.2B valuation.	Medium	SV020, SV009
CV028	FedRAMP authorization status for the US Government vertical (launched February 2026) is a credibility question for government procurement; without FedRAMP In Process or Authorized status, government agency sales are limited to informal pilots, capping US Government revenue contribution at a level well below the implied market opportunity.	Medium	SV022, SV006
CV029	The Founders Fund IX Form D/A amendment (October 2025, EDGAR) confirms continued fund activity consistent with Cognition follow-on capital management or new investment in the post-$400M round period; combined with the Cognition Capital SPV I (June 2025), the structured capital base is complex and the preference stack is undisclosed.	High	SV003, SV001
CV030	The $73M ARR estimate at April 2025, growing from $1M at GA (December 2024) in nine months, represents a 73× top-line growth rate—exceptional even by AI-native startup standards—but is based on third-party estimates (Growjo, Sacra) that have not been confirmed by Cognition in any official disclosure.	Medium	SV012, SV013
CV031	Devin's pricing dropped approximately 3× with Devin 2.0 (April 2025), with the Team plan remaining at $500/month but providing 83% more tasks per ACU; this effective price reduction improves adoption but compresses the revenue-per-session metric, requiring proportionally greater volume to maintain ARR trajectory.	High	SV028, SV012
CV032	Total disclosed funding across all rounds is approximately $1.575B; at a $10.2B post-money valuation, early rounds are already at 60–140× capital returned on paper, creating exit pressure for early investors that may accelerate secondary activity and reduce inside-round price discovery for new institutional investors.	Medium	SV011, SV023
CV033	Mercedes-Benz announced a partnership with Cognition AI in April 2026, adding a second named enterprise customer alongside Nubank; while terms and ARR contribution are undisclosed, the automotive/manufacturing vertical validates enterprise cross-sector demand and reduces the market risk of over-dependence on financial services verticals.	High	SV008, SV022
CV034	a16z's 100 Gen AI Apps report identifies coding tools and developer productivity as the highest-revenue AI application category, with enterprise developer tools generating $5B+ in collective ARR across the cohort as of their most recent analysis; this market context supports the bull-case premise that Cognition is entering the most valuable AI application segment.	Medium	SV004, SV018
CV035	Replit's valuation of $1.16B in 2023 as a developer platform that failed to capture the AI coding wave represents the primary bear-case precedent for Cognition; developer tool platforms with strong user bases but insufficient enterprise NRR can see rapid valuation compression when more focused AI-native competitors commoditize their core value proposition.	Medium	SV018, SV017
CV036	Cognition AI's geographic expansion to Europe (London, January 2026), Japan, and Singapore (April 2026) adds TAM and demonstrates customer pull outside the US but introduces compliance infrastructure gaps—EU GPAI registration, UK GDPR DPO appointment, Japan PIPL—that create near-term regulatory and reputational risk before those markets generate material ARR.	Medium	SV022, SV025
CV037	The Cognition Capital SPV I Form D (CIK 0002072175, filed 2025-06-11, Claymont, DE) is consistent with a structured co-investment vehicle used to fund the Windsurf acquisition consideration or provide a liquidity backstop to Windsurf investors, adding a layer of financial structure complexity to the Cognition cap table beyond a standard venture equity round.	Medium	SV001, SV003
CV038	The Dealroom.co listing for Cognition AI reflects the company's early financing stages at a $8M valuation figure—consistent with seed-round entry—while TechCrunch and CNBC confirm the September 2025 round at $10.2B; this 1,275× valuation step-up across approximately 24 months is unprecedented in enterprise SaaS history and indicates market expectation of near-monopoly market capture.	Medium	SV005, SV029
CV039	For an IPO at $8B+ market capitalization on conventional public market multiples (15–25× forward ARR), Cognition would need to demonstrate approximately $320–535M ARR in the 12 months preceding the IPO; at base-case growth rates, this milestone is not reached before 2028–2029, suggesting the IPO window is approximately 2–4 years from May 2026.	Medium	SV011, SV024
CV040	Strategic acquirer scenarios exist: Microsoft (GitHub distribution), Alphabet (Google Cloud developer tools), Salesforce (Einstein developer productivity), and Databricks/Snowflake (data engineering automation) are all logical fits; however, the $10.2B entry price for Cognition makes a strategic acquisition at premium unlikely unless ARR scales to $400M+ first, as acquirers would need to pay a 20–30% acquisition premium above the last private valuation.	Medium	SV011, SV023
CV041	Cursor/Anysphere's November 2025 Series D at $29.3B (per Business Wire) represents an even more recent and higher precedent valuation for AI coding tools; if confirmed, this would shift the peer set valuation anchor upward and could partially justify Cognition's 140× ARR multiple relative to Cursor's even higher implied multiple.	Low	SV010, SV012
CV042	Cognition AI's Mercedes-Benz partnership announcement (April 2026) and Cognizant channel partnership (January 2026) diversify the named customer base beyond Nubank, reducing—but not eliminating—customer concentration risk; the absence of disclosed outcome metrics for these relationships limits the financial materiality assessment.	Medium	SV008, SV022

Sources
ID	Publisher	Title	Quote
SO001	Cognition AI	Cognition AI — Official Homepage
SO002	Cognition AI	Introducing Devin, the first AI software engineer	Devin is a tireless, skilled teammate, equally ready to build alongside you or independently complete tasks for you to review.
SO003	Cognition AI	Funding, growth, and the next frontier of AI coding agents	Enterprise ARR grew over 30% in just seven weeks since we acquired Windsurf.
SO004	Cognition AI	Cognition's acquisition of Windsurf
SO005	Wikipedia	Cognition AI — Wikipedia
SO006	TechCrunch	Cognition AI defies turbulence with a $400M raise at $10.2B valuation	The round values Cognition at $10.2 billion post-money.
SO007	CNBC	Cognition valued at $10.2 billion two months after Windsurf purchase
SO008	Sacra	Cognition revenue, valuation & funding	Cognition ARR: $1M (Sep 2024) → $73M (Jun 2025) → $155M (Jul 2025 post-Windsurf)
SO009	VentureBeat	Cognition follows Windsurf acquisition with $400M fundraise, showing strong enterprise momentum
SO010	FavTutor	Meet The Team Behind Devin AI, Its Founders & Investors
SO011	Analytics India Magazine	Meet the Creator of Devin: A Child Prodigy Who is Making Coding Obsolete
SO012	Observer	Cognition, Maker of Goldman's First 'A.I. Employee'
SO013	Economic Times	Work-life imbalance: After buying remnants of Windsurf, Cognition lays off 30, tells rest to work long hours	Cognition laid off 30 employees and offered buyouts to approximately 200 remaining Windsurf staff with a requirement to commit to 80-hour six-day workweeks.
SO014	The AI Insider	Cognition AI Closes $400M in Funding to Reach $10.2B Valuation Amid Rapid Growth
SO015	AI Invest (ainvest.com)	Cognition AI's Windsurf Acquisition: A Masterstroke in the AI Talent & IP War
SO016	WinBuzzer	Cognition AI's Culture Reset: Offers Nine-Month Buyouts to Windsurf Staff, Demands 80-Hour Weeks
SO017	SFGate	SF tech CEO offers buyouts to let workers flee 'extreme' work culture
SO018	Eesel AI	A deep dive into Cognition AI reviews: Hype vs. Reality	Independent testing shows real-world task success rates well below the 'fully autonomous' marketing framing.
SO019	Sean Kim (imseankim.com)	Devin AI Review: From $500 to $20 — 6 Weeks With Cognition's AI Software Engineer
SO020	Tech Funding News	Cognition AI scores $400M at $10.2B valuation as demand spikes for coding agents
SO021	Stepmark AI	Company Spotlight: Cognition AI – Devin, your Autonomous Software Engineer
SO022	Tech Funding News	Cognition raises $500M at nearly $10B valuation following Windsurf acquisition
SO023	AI Invest (ainvest.com)	AI-Driven Software Development and Startup Valuation: The Cognition AI Case Study
SO024	ARR Club	Cognition ARR, Revenue Growth & Milestones
SO025	Devin.ai	Devin — AI Software Engineer
SO026	Devin.ai	Devin Pricing
SM001	Mordor Intelligence	AI Code Tools Market Size, Share & 2030 Trends Report
SM002	Grand View Research	Generative AI Coding Assistants Market Size Report, 2030
SM003	BusinessWire / ResearchAndMarkets	Generative AI Coding Assistants Strategic Research Report 2025 — Market to Reach $97.9 Billion by 2030
SM004	Gartner	Gartner Identifies the Top Strategic Trends in Software Engineering for 2025 and Beyond
SM005	Futuretechmag	Gartner Highlights Strategic Software Engineering Trends Beyond 2025
SM006	Evans Data Corporation	Worldwide Developer Population Grows to 27 Million
SM007	Statista	Global developer population 2024
SM008	WIPO (World Intellectual Property Organization)	Global Software Spending Surges to Close to USD 700 Billion in 2024
SM009	Signisys	AI Code Assistant: 90% by 2028 — Gartner forecast summary
SM010	GitHub (official)	Gartner positions GitHub as a Leader in the 2025 Magic Quadrant for AI Code Assistants	GitHub Copilot surpassed 20 million users and is used by over 50,000 organizations.
SM011	Kingy.ai	AI Coding Agents in 2025: The Ultimate Battle for Developer Supremacy
SM012	OpenAI (official)	The State of Enterprise AI — 2025 Report
SM013	Deloitte	AI Trends 2025: Adoption Barriers and Updated Predictions
SM014	Forbes	The Biggest Barriers Blocking Agentic AI Adoption
SM015	California Management Review (UC Berkeley)	Adoption of AI and Agentic Systems: Value, Challenges, and Pathways
SM016	VC Cafe	Which Barriers Still Block Agentic AI Adoption?
SM017	Second Talent	GitHub Copilot Statistics and Adoption Trends 2025
SM018	AI Business VC	GitHub Copilot Crosses $2B ARR — 46% of Code Is Now AI-Generated
SM019	Endroid	GitHub Copilot Surpasses 20 Million Users, Eyes Enterprise AI Coding Dominance
SM020	JetBrains Research	Global Developer Population 2024
SM021	Grand View Research	Software Market Size, Share and Trends — Industry Report, 2030
SM022	ResearchAndMarkets	AI Coding Assistant Tools Global Market Insights 2025
SM023	GeniusAI Tech	AI Statistics 2025: Market Growth, Usage, Trends and Adoption
SM024	ISG (Information Services Group)	State of Enterprise AI Adoption Report 2025
SM025	MarketsAndMarkets	AI Assistant Market Report 2025–2030, by Application, Geo, Tech
SP001	BusinessWire	Cursor Secures $2.3 Billion Series D Financing at $29.3 Billion Valuation
SP002	devclass.com	GitHub Copilot tops $2B ARR, confirms enterprise dev tools push
SP003	CNBC	Cursor AI startup funding round valuation November 2025
SP004	The Next Web	Cursor Anysphere $2B ARR funding $50B valuation talks
SP005	BetterStack	GitHub Copilot vs Cursor vs Windsurf: AI Coding Assistant Comparison
SP006	Digital Applied	Cursor AI $29B Valuation Agent Revolution
SP007	Amazon Web Services	Amazon Q Developer — AI Coding Assistant for AWS
SP008	Anthropic	Claude Code — Agentic coding in your terminal
SP009	Lowcode.agency	Claude Code vs Devin: Autonomous Coding Agent Comparison
SP010	ArXiv	SWE-bench and AI software engineering benchmark analysis 2026
SP011	Benched.ai	Top Coding Agents 2025 — Feature and performance guide
SP012	GitHub	GitHub Copilot — Features and Plans
SP013	Amazon Web Services	Amazon Q Developer Pricing
SP014	GitHub / OpenAI	OpenAI Codex GitHub Repository
SP015	SWE-bench (Princeton NLP)	SWE-bench: Evaluating LLM Agents on Real-World Software Engineering Tasks
SP016	Educative.io	Cursor vs Windsurf vs Copilot: AI IDE Comparison 2025
SP017	WeCompareAI	Best AI Coding Tools 2025: GitHub Copilot vs Cursor vs Windsurf
SP018	CreateAIAgent.net	Amazon Q Developer vs GitHub Copilot Workspace Comparison
SP019	Kingy.ai	AI Coding Agents in 2025: The Ultimate Battle for Developer Supremacy
SP020	aloa.co	GitHub Copilot vs Cursor vs Windsurf: Complete Comparison
SP021	GitHub Docs	Subscription plans for GitHub Copilot
SP022	Replit	Replit Pricing — AI development platform
SP023	Cursor	Cursor Pricing
SP024	Windsurf	Windsurf IDE Editor — formerly Codeium
SP025	GitHub (Princeton NLP / SWE-bench)	SWE-bench: Software Engineering Benchmark GitHub Repository
SP026	Sean Kim Tech Blog	Cursor vs Windsurf vs GitHub Copilot: AI IDE Battle October 2025
SP027	Anthropic	Anthropic Claude Pricing — Pro and Max plans
SI001	Cognition AI (Official)	Funding, Growth, and the Next Frontier of AI Coding Agents
SI002	Axios	Cognition raises $175M at $2B valuation for AI software engineer Devin
SI003	Sacra Research	Cognition AI revenue and growth analysis
SI004	Devin.ai (Official)	Devin AI Pricing — Plans and ACU details
SI005	ARR.club	Cognition AI ARR and revenue tracking
SI006	VentureBeat	Cognition follows Windsurf acquisition with $400M fundraise
SI007	Stepmark AI	Company Spotlight: Cognition AI — Devin Autonomous Software Engineer
SI008	Axios	Cognition AI raises $400 million at $10 billion valuation for Devin
SI009	TechCrunch	Cognition AI defies turbulence with a $400M raise at $10.2B valuation
SI010	CNBC	Cognition valued at $10.2 billion two months after Windsurf acquisition
SI011	AInvest	AI-Driven Software Development Startup Valuation: Cognition AI Case Study
SI012	TechFundingNews	Cognition AI scores $400M at $10.2B valuation as demand spikes for coding agents
SI013	TechFundingNews	Cognition raises $500M at nearly $10B valuation following Windsurf acquisition
SI014	Observer	Cognition AI startup valuation $10B
SI015	TheAIInsider	Cognition AI closes $400M in funding to reach $10.2B valuation amid rapid growth
SI016	CBInsights	Cognition Labs company profile and financials
SI017	GrowJo	Cognition AI company revenue and funding estimates
SI018	Cognition AI (Official)	Cognition AI Blog
SI019	Winbuzzer	Cognition AI culture reset: offers nine-month buyouts to Windsurf staff
SI020	PitchBook	Cognition AI Series A valuation and Devin funding profile
SI021	Growjo	Cognition AI funding and employee count 2025
SI022	Devin.ai (Official)	Devin AI — Official product and pricing page
SI023	Wikipedia	Cognition AI — Wikipedia article
SI024	AIForDevelopers	AI Startup Cognition Labs raises $175M valued at $2 billion
SI025	Towards AI (Medium)	Cognition AI's Devin: Is it worth the hype?
SI026	SEC EDGAR	Form D/A: Founders Fund IX, LP — Amended Notice of Exempt Offering of Securities (CIK 0001971631)
SI027	Cognition AI (Official)	Devin heads east: Cognition opens its Singapore APAC headquarters
SI028	Cognition AI (Official)	Engineering in the fast lane: Mercedes-Benz partners with Cognition
SE001	Devin (Official Documentation)	Devin Documentation — Getting Started and Integrations
SE002	Cognition AI (Official)	Introducing Devin, the first AI software engineer
SE003	Cognition AI (Official)	SWE-bench Technical Report
SE004	Devin (Official)	Devin — The AI Software Engineer
SE005	Cognition AI (GitHub)	CognitionAI/devin-swebench-results — SWE-bench evaluation results and methodology
SE006	SWE-bench (Princeton / CMU)	SWE-bench Leaderboard and Benchmark Documentation
SE007	Devin (Official Documentation)	Devin Documentation — Integrations
SE008	imseankim.com (Developer Blog)	Devin 2.0 AI Software Engineer Review: Cognition Pricing and Benchmark
SE009	Hacker News (Y Combinator)	Hacker News: Introducing Devin (community discussion thread)
SE010	Benched.ai	Top AI Coding Agents 2025 — Benchmarks and Capabilities Snapshot
SE011	arxiv.org / MSR 2026	Empirical Study Comparing AI Coding Agents: Temporal Trends in PR Acceptance
SE012	Artificial Analysis	AI Coding Agent Comparison — Artificial Analysis
SE013	Codeium / Windsurf (now Cognition)	Windsurf — The AI Code Editor
SE014	Devin (Official)	Devin Customer Case Studies
SE015	Cognition AI (Official)	Cognition AI acquires Windsurf — blog announcement
SE016	Devin (Official) — Customer Proof	How Nubank refactors millions of lines of code with Devin
SE017	Hacker News (Y Combinator)	Hacker News: Streamer discovers major vulnerability in Cognition's Devin live on air
SE018	Towards AI (Medium)	What happened when Devin AI took on 2,294 GitHub bugs — the 13.86% that changed everything
SE019	TechCrunch	Devin AI software engineer is now just $20 a month, down from $500
SE020	SWE-bench (GitHub)	SWE-bench GitHub Repository — Benchmark and Leaderboard
SE021	Kingy.ai (Technical Blog)	AI Coding Agents in 2025: The Ultimate Battle for Developer Supremacy
SE022	VentureBeat	Cognition follows Windsurf acquisition with $400M fundraise showing startup resolve
SE023	UC Berkeley Haas / CMR	Adoption of AI and Agentic Systems: Value, Challenges, and Pathways
SE024	Cognition AI (Official)	Cognition AI Blog — Product Updates and Technical Posts
SE025	Artificial Analysis	AI Agent Benchmark Performance — Coding Agents Evaluation
SU001	Devin (Official)	Devin AI — Customer Case Studies
SU002	Devin (Official) — Customer Case Study	How Nubank refactors millions of lines of code with Devin
SU003	Cognition AI (Official Blog)	Engineering in the fast lane: Mercedes-Benz partners with Cognition
SU004	Cognition AI (Official Blog)	Cognizant partners with Cognition; COBOL modernization; Government vertical
SU005	Cognition AI (Official)	Devin heads east: Cognition opens its Singapore APAC headquarters
SU006	Cognition AI (Official Blog)	Launching in Japan; Cognition for Government; London office
SU007	Growjo	Cognition AI: Revenue, Competitors, Alternatives
SU008	imseankim.com (Developer Blog)	Devin 2.0 AI Software Engineer Review: Cognition Pricing and Benchmark
SU009	TechCrunch	Devin AI software engineer is now just $20 a month, down from $500
SU010	VentureBeat	Cognition follows Windsurf acquisition with $400M fundraise
SU011	Hacker News (Y Combinator)	Hacker News: Introducing Devin — Developer community discussion
SU012	Artificial Analysis	AI Coding Agent Comparison — Artificial Analysis
SU013	Benched.ai	Top AI Coding Agents 2025 — Benchmarks and Capabilities Snapshot
SU014	Cognition AI (Official Blog)	Devin in Windsurf and Singapore APAC expansion
SU015	Kingy.ai (Technical Blog)	AI Coding Agents in 2025: The Ultimate Battle for Developer Supremacy
SU016	UC Berkeley Haas / CMR	Adoption of AI and Agentic Systems: Value, Challenges, and Pathways
SU017	Cognition AI (Official Blog)	How Cognition Uses Devin to Build Devin
SU018	Hacker News (Y Combinator)	Streamer discovers major vulnerability in Cognition's Devin live on air
SU019	arxiv.org / MSR 2026	Empirical Study: PR Acceptance Rates for AI Coding Agents
SU020	Growjo (Employee/Revenue Intelligence)	Cognition AI headcount, revenue growth, funding data
SU021	Cognition AI (Official)	Devin AI — Product Page
SU022	VentureBeat	Cognition AI $175M Series A at $2B valuation to build AI software engineers
SU023	Towards AI (Medium)	What happened when Devin AI took on 2,294 GitHub bugs
SU024	Cognition AI (Official)	Devin Documentation — Integrations, Enterprise Deployment
SU025	Cognition AI (Official Blog)	Introducing Devin 2.2 — Most important update since launch
SU026	Stack Overflow (Developer Survey 2024)	2024 Stack Overflow Developer Survey — AI Tools Adoption
SU027	GitHub Blog	How GitHub Copilot is getting better at understanding your code
SU028	State of AI Report	State of AI Report — Enterprise AI Adoption Trends
SU029	Cognition AI (Official Blog)	Closing the Agent Loop: Devin Autofixes Review Comments
SU030	Stanford HAI	Stanford AI Index 2025 — Enterprise AI Adoption and Coding Agents
SR001	Cognition AI (Official Docs)	Security at Cognition — SOC 2 Type II, Data Privacy, Intellectual Property
SR002	Cognition AI (Trust Center)	Cognition Trust Center — Security Documentation Portal (NDA-gated)
SR003	OWASP Foundation	OWASP Top 10 for Large Language Model Applications
SR004	OWASP Foundation (GenAI Security Project)	OWASP GenAI Security Project — LLM Top 10 Risks
SR005	NIST National Vulnerability Database	CVE-2024-5185 — EmbedAI CSRF Data Poisoning Vulnerability
SR006	California Legislative Information	SB 1047 — Safe and Secure Innovation for Frontier Artificial Intelligence Models
SR007	Federal Trade Commission (US)	Generative AI Raises Competition Concerns
SR008	The White House	Executive Order — Removing Barriers to American Leadership in Artificial Intelligence
SR009	European Parliament	EU AI Act — First Regulation on Artificial Intelligence
SR010	UK Information Commissioner's Office (ICO)	AI and Data Protection Risk Toolkit
SR011	CourtListener (RECAP)	Doe v. GitHub Inc. — AI Copyright Infringement Docket
SR012	Wikipedia (Wikimedia Foundation)	Prompt Injection — Cybersecurity Attack Vector
SR013	Hacker News (Y Combinator)	HN: Streamer discovers major vulnerability in Devin live on air (Dec 2024)
SR014	Hacker News (Y Combinator)	HN: Cognition AI — Devin Launch Skepticism and Scope Analysis (Mar 2024)
SR015	SWE-bench (Princeton NLP / CMU)	SWE-bench — Live Benchmark Leaderboard for AI Software Engineering Agents
SR016	Cognition AI (Official Blog)	Introducing Devin 2.0 — 83% More Tasks per ACU
SR017	TechCrunch	Cognition raises $175M at $2B valuation to build AI software engineers
SR018	VentureBeat	Cognition follows Windsurf acquisition with $400M fundraise
SR019	Sacra (Research)	Devin ARR and Revenue Metrics Tracker — Cognition
SR020	imseankim.com (Independent Analysis)	Devin ARR $1M to $73M in 9 Months — PR Merge Rate and Cost Analysis
SR021	Growjo (Business Analytics)	Cognition AI — Headcount, Revenue, and Growth Analytics
SR022	arXiv (Cornell)	AI-Assisted Pull Requests: Longitudinal Acceptance Rate Trends in Open-Source
SR023	Anthropic (Official)	Claude Code — Agentic Coding Assistant Product Overview
SR024	PitchBook Data	Cognition AI — Funding Rounds and Investor Profile
SR025	Cognition AI (Official Blog)	Introducing Devin — The AI Software Engineer (March 2024)
SR026	Devin.ai (Official)	Nubank Case Study — 12x Efficiency, 20x Cost Savings
SR027	benched.ai (Independent Benchmarking)	Top Coding Agents 2025 — Devin vs. Competitors
SR028	TechCrunch	Devin 2.0 — Price Drops 3× as Cognition Expands Agentic Capabilities
SR029	CNBC (Technology)	Cognition AI Raises $175M Valuing AI Coding Startup at $2 Billion
SR030	Axios (Technology)	Cognition AI — Windsurf Acquisition and $400M Raise
SV001	US Securities and Exchange Commission (EDGAR)	Cognition Capital SPV I — Form D, CIK 0002072175 (June 2025)
SV002	US Securities and Exchange Commission (EDGAR)	Founders Fund IX, LP — Form D Original Filing, CIK 0001971631 (April 2023)
SV003	US Securities and Exchange Commission (EDGAR)	EDGAR Full-Text Search — Founders Fund IX Form D Filings
SV004	a16z (Andreessen Horowitz)	100 Gen AI Apps — 4th Edition: Top Consumer and Enterprise AI Applications
SV005	Dealroom.co	Cognition AI — Funding History and Investor Profile
SV006	The New Stack	Cognition Launches Devin AI Software Engineer for Enterprise
SV007	Founders Fund (Official)	Founders Fund Portfolio — Official Website
SV008	Business Wire	Cognition AI and Mercedes-Benz Partner to Accelerate Software Development
SV009	OpenAI (Official)	OpenAI Codex — Agentic Coding Assistant Product Overview
SV010	Business Wire	Cursor Secures $2.3 Billion Series D Financing at $29.3 Billion Valuation
SV011	TechCrunch	Cognition AI defies turbulence with $400M raise at $10.2B valuation
SV012	Growjo (Business Analytics)	Cognition AI — Revenue and Valuation Analytics
SV013	Sacra (Research)	Cognition AI Revenue and Metrics Tracker
SV014	VentureBeat	Cognition follows Windsurf acquisition with $400M fundraise
SV015	Cognition AI (Official Blog)	Cognition Funding, Growth, and the Next Frontier of AI Coding Agents
SV016	imseankim.com (Independent Analysis)	Cognition Devin Product-Led Growth and ARR Analysis
SV017	SWE-bench (Princeton NLP / CMU)	SWE-bench Live Leaderboard — Autonomous Software Engineering
SV018	benched.ai (Independent Benchmarking)	Top AI Coding Agents 2025 — Competitive Analysis
SV019	Hacker News (Y Combinator)	Cognition AI Security Incident Live-Stream Discussion
SV020	Anthropic (Official)	Claude Code — Agentic Coding Assistant Overview
SV021	Winbuzzer (Tech News)	Cognition AI's Culture Reset Offers Nine-Month Buyouts to Windsurf Staff
SV022	Cognition AI (Official Blog)	Cognition Expands to Europe — London Office Opening
SV023	Axios (Technology)	Cognition AI raises $400M at $10.2B valuation
SV024	The AI Insider	Cognition AI Closes $400M to Reach $10.2B Valuation Amid Rapid Growth
SV025	TechFundingNews	Cognition AI Scores $400M at $10.2B Valuation as Demand Spikes for Coding Agents
SV026	Observer (Business)	Cognition AI Startup Valuation $10B
SV027	Devin.ai (Official)	Nubank Case Study — 12× Efficiency Gain, 20× Cost Savings
SV028	Cognition AI (Official Blog)	Introducing Devin 2.0 — 83% More Tasks Per ACU
SV029	CNBC (Technology)	Cognition valued at $10.2 billion two months after Windsurf acquisition
SV030	Ainvest.com (AI Investment Research)	AI-Driven Software Development Startup Valuation: Cognition AI Case Study