Cognition AI
Cognition AI is a category-defining autonomous software engineering platform with exceptional ARR growth ($1M → $73M in 9 months), but the $10.2B valuation at ~140× ARR prices in near-flawless execution in a market facing rapid benchmark commoditization and critical undisclosed financial metrics.
Cover facts
Company profile
Cognition AI is a San Francisco-based AI startup founded in 2023 by Scott Wu, Steven Hao, and Walden Yan—competitive programming champions and former olympiad medalists. The company's flagship product, Devin, is a cloud-hosted autonomous AI software engineer that plans, writes, tests, and deploys code end-to-end without constant human input. Devin is accessed via API, Slack, Jira, or a web interface and priced on a usage-based model (ACUs). In July 2025, Cognition acquired Windsurf IDE, adding 250K+ daily active users and 350+ enterprise accounts. The company has raised approximately $1.575B, with the most recent $400M round in September 2025 valuing it at $10.2B.
- Website
- cognition.ai
- Founded
- 2023-01-01
- Founders
- Scott Wu, Steven Hao, Walden Yan
- Founding location
- San Francisco, CA
- Headquarters
- San Francisco, CA
- Product
- Devin is a cloud-hosted autonomous AI software engineer that accepts tasks via Slack, Jira, or web interface, then plans, writes code, tests, and deploys independently using Agent Compute Units (ACUs). Devin 2.0 (April 2025) added direct PR merging at a 3× lower effective price per task. Windsurf IDE (acquired July 2025) adds 250K+ daily active users and 350+ enterprise accounts to the platform.
- Customers
- Enterprise engineering teams (1,000+ developers), mid-market technology companies, and individual professional developers. Named enterprise customers: Nubank, Mercedes-Benz, Cognizant. US Government vertical launched February 2026.
- Business model
- Usage-based SaaS: $500/month Team plan plus $2.25/ACU overage; Enterprise plan requires direct negotiation. Windsurf IDE adds free-to-paid conversion funnel. Cognizant channel partnership adds reseller revenue layer.
- Stage
- Late Private (post-Series A)
- Funding status
- $21M seed (2023), $175M Series A at $2B (Founders Fund IX, Jan/Apr 2025), $400M round at $10.2B (Sep 2025). Total ~$1.575B disclosed.
Executive summary
Top strengths
- Exceptional ARR growth: $1M to $73M in 9 months represents a generational growth sprint with no direct analog in enterprise SaaS history
- Category pioneer: Devin established 'autonomous AI software engineer' as an enterprise procurement line item before most competitors recognized the category
- Windsurf IDE integration: 250K+ DAU and 350+ enterprise accounts create a unique combined agent-plus-IDE distribution moat not available to single-product competitors
- Lighthouse customer proof: Nubank (12× efficiency, 20× cost savings, published case study) is a high-quality reference for enterprise ROI in financial services
- Founder pedigree: Scott Wu, Steven Hao, Walden Yan are competitive programming champions with deep AI/ML engineering intuition relevant to autonomous coding
- Devin eats its own dog food: 659 Devin PRs/week merged internally, providing rapid model improvement feedback loop
Top risks
- Benchmark commoditization: Claude Code Opus 4 at 72.5% SWE-bench Verified vs. Devin's 13.86% Full demonstrates a 5× competitive improvement in 15 months; technical differentiation may erode by late 2026
- Security overhang: December 2024 prompt-injection vulnerability; no published post-Devin-2.0 penetration test or bug-bounty program; expanded autonomous PR-merge capability increases attack surface
- Valuation multiple: $10.2B at ~140× estimated ARR is a 7× premium to Cursor/Anysphere; risk-adjusted return at this entry is negative in the base case and catastrophic in the bear case
- LLM provider conflict-of-interest: Anthropic and OpenAI are both primary LLM API suppliers and direct competitors via Claude Code and OpenAI Codex; pricing leverage and terms are undisclosed
- Customer concentration: Only one quantified enterprise case study (Nubank); $73M ARR from an undisclosed small number of accounts creates high single-customer churn sensitivity
- EU AI Act GPAI compliance: August 2026 deadline; no disclosed GPAI registration, EU DPO appointment, or transparency documentation; EU expansion announced without compliance infrastructure
Open gaps
- Net Revenue Retention (NRR) by cohort: undisclosed; the single most critical metric for validating ARR quality at $10.2B valuation; must be ≥110% to justify premium
- Gross margin by product line: undisclosed; LLM inference cost structure unknown; path to profitability cannot be modeled without this metric
- Top-5 customer ARR concentration: undisclosed; Nubank is the only quantified reference; concentration above 30% in a single customer is a thesis-break risk
- Post-Devin-2.0 security audit: no public penetration test report or red-team disclosure post-April 2025 expanded agentic capabilities; SOC 2 Type II (March 2024) predates current attack surface
- Windsurf integration customer retention: 350+ enterprise accounts acquired July 2025; 6-month retention rate (January 2026) unknown; critical proof point for IDE moat thesis
- LLM API contract terms: volume pricing, agentic use-case restrictions, and change-of-control provisions with Anthropic/OpenAI are undisclosed; margin floor and supplier conflict-of-interest cannot be assessed
Contents
01Company Overview
1.1 Identity and Business Model
Cognition AI is a San Francisco-based AI-native software company building autonomous agents that perform end-to-end software engineering tasks. Its flagship product, Devin, is marketed as the world's first AI software engineer: an agent that can interpret tickets, plan solutions, write and debug code, run tests, and deploy to production with minimal human intervention. The company operates a SaaS subscription model for individual developers and small teams (Core plan at $20/month), a Team plan at $500/month including 250 Agent Compute Units, and custom enterprise contracts with VPC deployment options. Following the July 2025 acquisition of Windsurf—an AI-native IDE—Cognition expanded its product portfolio to serve developers who prefer a more interactive, IDE-centered workflow rather than a fully autonomous agent. The dual-product strategy allows Cognition to address both the agentic automation segment and the traditional coding-assistant market that GitHub Copilot and Cursor dominate. Revenue is primarily usage-based: additional Agent Compute Units beyond plan inclusions are priced at roughly $2.25 per ACU, creating a metered model that scales with enterprise task volume. The company's website and blog are public-facing channels; it does not publish GAAP financials and reports only via investor communications and company-controlled press releases.[CO001, CO002, CO003, CO028, CO029]
1.2 Founders and Leadership
Cognition AI was co-founded by three elite competitive programmers, each an International Olympiad in Informatics gold medalist, giving the company an exceptionally deep technical foundation from inception. Scott Wu, CEO, is a three-time IOI gold medalist, a 2011 MATHCOUNTS national champion, and holds a Harvard economics degree. On the Codeforces competitive programming platform, Wu holds the top designation of Legendary Grandmaster, reflecting continued excellence in algorithmic problem-solving. Prior to Cognition, he co-founded Lunchclub, an AI-powered professional networking product. Steven Hao, CTO, is also an IOI gold medalist with prior experience at Scale AI, DeepMind, Waymo, Nuro, Modal, and Cursor—a background spanning both frontier AI research and production engineering systems. Walden Yan, CPO, brings product leadership and IOI-level algorithmic depth. The founding team of approximately 10 individuals collectively held 10 IOI gold medals at launch, an extraordinary concentration of competitive-programming talent rarely seen at an early-stage startup. This profile has attracted top-tier investors and talent but also creates a meaningful key-person dependency: the founders' technical credibility is inseparable from the company's market positioning. No board composition or independent director information has been publicly disclosed.[CO004, CO005, CO006, CO007, CO026, CO027]
| Name | Role | Educational/Competitive Background | Prior Experience | Key-Person Dependency |
|---|---|---|---|---|
| Scott Wu | CEO & Co-founder | 3× IOI gold medal; 2011 MATHCOUNTS national champion; Harvard (economics); Codeforces Legendary Grandmaster | Lunchclub co-founder | Critical — brand and technical credibility inseparable from product |
| Steven Hao | CTO & Co-founder | IOI gold medal | Scale AI, DeepMind, Waymo, Nuro, Modal, Cursor | Critical — core model and system architecture |
| Walden Yan | CPO & Co-founder | IOI gold medal | Product leadership in AI | High — product vision and roadmap |
Board composition, independent directors, and broader leadership team (VP-level and below) are not publicly disclosed. Only co-founders have been publicly identified.
[CO004, CO005, CO006, CO007, CO026, CO027]1.3 Funding History and Investors
Cognition AI's capital formation has been among the fastest in recent venture history. An initial ~$21M seed/Series A round in March 2024 valued the company at approximately $350M before any meaningful revenue existed, reflecting investor conviction in the founding team's ability to execute. Just weeks later in April 2024, Cognition closed $175M at a $2B valuation, crossing into unicorn territory within six months of founding. Founders Fund—Peter Thiel's venture firm—led both rounds, providing not only capital but reputational signal to the broader market. By September 2025, following the Windsurf acquisition and rapid ARR growth, Cognition raised $400M at a post-money valuation of $10.2B, bringing total capital raised to approximately $696M. Co-investors in the September 2025 round included Lux Capital, 8VC, Elad Gil (prominent angel and operator), Bain Capital Ventures, D1 Capital, Definition Capital, and Swish Ventures. As of May 2026, the company is reportedly in discussions for a new financing round at a potential $25B+ valuation, implying continued strong investor appetite for agentic AI coding infrastructure. Notably, the company has disclosed a total net cash burn of under $20M from founding through Q3 2025, suggesting the rapid growth was achieved with remarkable capital efficiency prior to the acquisition.[CO008, CO009, CO010, CO020, CO030, CO031]
| Metric | Value / Status | As-Of Date | Confidence | Gap / Caveat |
|---|---|---|---|---|
| Valuation (last round) | $10.2B | Sep 2025 | high | Post-money; pre-money not disclosed |
| Total capital raised | ~$696M | Sep 2025 | high | Sum of known disclosed rounds |
| ARR (Devin, pre-acquisition) | $73M | Jun 2025 | high | Company-disclosed via blog |
| ARR (post-Windsurf combined) | ~$155M | Jul 2025 | medium | Partly company-claimed; Windsurf component is estimate |
| Initial funding (seed/Series A) | $21M at $350M | Mar 2024 | high | First disclosed institutional round |
| Series A extension | $175M at $2B | Apr 2024 | high | Company announcement + press |
| Net cash burn (inception to Q3 2025) | <$20M | Sep 2025 | medium | Company-claimed; not audited |
| Headcount (est.) | ~50-250 | 2025 | low | Pre-acquisition ~49; Windsurf integration unclear |
| Enterprise customers (Windsurf) | 350+ | Jul 2025 | medium | Windsurf-only figure at acquisition |
| Price to ARR multiple (Sep 2025) | ~65-140x | Sep 2025 | medium | Depends on which ARR base is used |
Revenue and valuation figures are company-reported or analyst-estimated; no audited financials are publicly available. ARR figures represent annualized subscription and usage revenue. Multiple ranges reflect pre- vs. post-Windsurf ARR bases.
[CO008, CO009, CO010, CO014, CO015, CO016]| Investor | Round(s) | Role | Control / Economic Importance | Diligence Ask |
|---|---|---|---|---|
| Founders Fund | Seed/Series A + $400M | Lead investor | Largest single institutional holder; likely board representation | Confirm board seats, anti-dilution, pro-rata rights |
| Lux Capital | $400M (Sep 2025) | Co-investor | Deep-tech focus; likely minority stake | Confirm ownership % |
| 8VC | $400M (Sep 2025) | Co-investor | Tech-focused growth fund | Confirm ownership % |
| D1 Capital | $400M (Sep 2025) | Co-investor | Late-stage growth equity | Confirm ownership % and governance rights |
| Bain Capital Ventures | $400M (Sep 2025) | Co-investor | Enterprise SaaS experience; minority | Confirm ownership % |
| Elad Gil | Multiple rounds | Individual investor | Prominent angel; advisory relationship likely | Confirm advisory vs. formal board role |
| Definition Capital | $400M (Sep 2025) | Co-investor | Technology focused | Confirm size of check |
| Swish Ventures | $400M (Sep 2025) | Co-investor | Early-stage / growth technology | Confirm role |
Ownership percentages are not publicly disclosed. Round participation derived from press releases and news reports. Board rights, anti-dilution terms, and liquidation preferences are unknown.
[CO008, CO009, CO010, CO020, CO036]Key financial and operational metrics as of the most recent disclosed data points.
ARR is a combination of company-disclosed Devin ARR ($73M Jun 2025) and Windsurf ARR ($82M at acquisition Jul 2025). Valuation is post-money from the September 2025 financing. Lifetime burn is company-stated and unaudited.
[CO037, CO042, CO015, CO016]1.4 Scale and Milestones
Cognition AI's revenue trajectory is extraordinary by any SaaS benchmark. The company exited 2024 at approximately $1M ARR (September 2024) and accelerated to $73M ARR by June 2025—a roughly 73x increase in nine months driven by enterprise adoption of Devin for code migration, bug-fixing, and feature automation tasks. The July 2025 acquisition of Windsurf, which contributed $82M ARR from 350+ enterprise accounts, pushed the combined business to approximately $155M ARR within weeks. Major enterprise clients including Goldman Sachs (reportedly piloting Devin with 12,000 developers), Citi, Dell, Cisco, Ramp, Palantir, Nubank, and Mercado Libre have been publicly cited. Nubank reportedly achieved a 12x efficiency improvement in code migration workflows using Devin. In April 2026, Cognition opened a Singapore APAC headquarters, signaling international expansion intent. The same month, a Mercedes-Benz partnership was announced. Headcount data is not publicly disclosed in detail; pre-acquisition estimates placed the Cognition team at roughly 49 employees, while the Windsurf acquisition added potentially 200 individuals before layoffs and buyout departures.[CO011, CO012, CO013, CO014, CO015, CO021]
| Date | Event | Type | Amount / Valuation / Status | Participants | Implication |
|---|---|---|---|---|---|
| Nov 2023 | Company founded by Scott Wu, Steven Hao, and Walden Yan | founding | N/A | Three IOI gold medalists | Launch of first purpose-built agentic AI coding company |
| Mar 2024 | $21M seed/Series A at $350M valuation | financing | $21M / $350M | Founders Fund | Institutional validation before first product release |
| Mar 2024 | Devin publicly announced; SWE-bench 13.86% score published | product | N/A | Cognition AI | World's first AI software engineer claim; generated global press |
| Apr 2024 | $175M raise at $2B valuation | financing | $175M / $2B | Founders Fund + others | Unicorn status within ~5 months of founding |
| Sep 2024 | $1M ARR milestone | scale | $1M ARR | N/A | Early commercial traction from self-serve enterprise |
| Jan 2025 | Devin 2.0 launched with multi-agent parallelization | product | N/A | Cognition AI | 83% efficiency gain claimed vs. v1.x; parallel task execution |
| Jun 2025 | $73M ARR reached (Devin only) | scale | $73M ARR | N/A | 73x ARR growth in 9 months; breakthrough enterprise adoption |
| Jul 2025 | Windsurf acquired (~$250M est.) | product | ~$250M / +$82M ARR / 350+ customers | Cognition AI / Windsurf | Added AI-native IDE product and enterprise customer base |
| Aug 2025 | 30 Windsurf employees laid off; buyouts offered to ~200 | adverse | 9-month salary buyout packages | Cognition AI CEO Scott Wu | Culture integration controversy; talent attrition risk |
| Sep 2025 | $400M raise at $10.2B valuation | financing | $400M / $10.2B | Founders Fund, Lux, 8VC, D1, Bain, Elad Gil + others | Decacorn status; 5x valuation expansion in 18 months |
| Apr 2026 | Singapore APAC headquarters opened | scale | N/A | Cognition AI | First international office; APAC enterprise penetration |
| Apr 2026 | Mercedes-Benz partnership announced | partnership | N/A | Cognition AI, Mercedes-Benz | European automotive enterprise entry point |
Dates and amounts derived from company announcements and press coverage. Windsurf acquisition price is an industry estimate; official terms not disclosed. Milestone table reflects publicly known events only; internal pivots and regulatory activity are not known.
[CO001, CO008, CO009, CO011, CO014, CO017]Key founding, financing, product, scale, and adverse events from November 2023 through April 2026.
[CO034, CO037, CO041, CO042, CO025, CO033]How Cognition AI's identity, products, customers, and capital relate to each other.
[CO003, CO011, CO021]1.5 Governance and Key-Person Risk
Cognition AI is a privately held C-corporation with no public financial disclosures, no publicly identified board composition, and no independent directors known to investors or analysts outside the cap table. The company's governance structure remains opaque: investor rights, board seats granted to Founders Fund and other lead investors, and secondary market transaction terms have not been disclosed. The extreme concentration of product vision and technical credibility in the three co-founders—particularly Scott Wu, whose public persona and coding legend status are inseparable from Devin's brand—creates substantial key-person risk. The mandatory 80-hour, six-day workweek culture that prompted voluntary buyout offers to Windsurf employees in 2025 raises questions about talent retention, especially for acquired personnel who did not self-select for that environment. Post-acquisition attrition among Windsurf engineers could erode the IDE product capabilities that justified the ~$250M deal. Critics and independent reviewers have noted that Devin's real-world task success rates are substantially lower than company-curated benchmark demonstrations suggest, and that the 'fully autonomous software engineer' framing may oversell current capabilities, creating future investor and customer expectation risk.[CO034, CO035, CO041, CO022, CO023]
1.6 Exhibits
02Market Analysis
2.1 Market Definition and Boundaries
The market for Cognition AI's Devin product sits at the intersection of two overlapping categories: AI-powered developer tools (copilots, code completion, review automation) and autonomous software engineering agents (end-to-end task execution requiring planning, multi-step reasoning, and self-correction). The former is a large established market anchored by GitHub Copilot with $2B+ ARR; the latter is an emerging sub-segment where Devin occupies a pioneer position. For sizing purposes, the relevant Total Addressable Market spans tools that professional software development teams might purchase to increase engineering velocity—from autocomplete plugins through fully autonomous agents that can execute tickets without human oversight. The included spend is: enterprise and SMB subscriptions to AI coding assistants, autonomous agent platforms sold to engineering teams, and API consumption for programmatic code generation. Excluded spend is: general large language model API access not sold for code use-cases, low-code/no-code application builders targeting business users (distinct buyer and workflow), infrastructure software (compute, storage, networking), and traditional outsourced software development services. Status-quo substitutes include offshore staff augmentation, traditional IDEs without AI, internal developer tooling built on base LLM APIs, and cloud-vendor bundled IDE extensions such as AWS CodeWhisperer and Azure GitHub Copilot bundles. The market boundary is contested across analyst firms. Grand View Research segments "generative AI coding assistants" narrowly at $92–98M by 2030 using a restrictive scope, while Mordor Intelligence defines "AI code tools" broadly at $24B by 2030. Cognition's Devin occupies the premium agentic sub-segment with $500/month Team plans and enterprise contracts in the $500K+ per year range—far above the $10–$19/seat copilot tier. The serviceable addressable market thus comprises large enterprises willing to pay a productivity premium for autonomous ticket resolution, estimated at 5–15% of the global developer population of 27–28.7 million professional engineers. [CM001, CM003, CM007, CM021, CM023]
| Category | Included Spend | Excluded Spend | Buyer / Payer | Relevance to Cognition |
|---|---|---|---|---|
| AI Coding Assistants (copilot tier) | IDE plugin subscriptions, autocomplete APIs, code review bots | General LLM API access not sold for code | Developer / Engineering budget | Indirect: establishes category norm Devin upgrades from |
| Autonomous Software Engineering Agents (agent tier) | End-to-end ticket resolution platforms, agent-as-engineer subscriptions, enterprise agent deployments | RPA tools, test automation (non-LLM) | VP Eng / CTO / IT budget | Direct: Devin's primary TAM |
| Low-Code / No-Code Platforms | Visual app builders, citizen-developer tools, spreadsheet automation | Excluded entirely from Cognition's TAM | Business users (not engineers) | Not addressable by Devin |
| Traditional Outsourced Dev Services | Staff augmentation, consulting, offshore engineering teams | In-house headcount salaries | Procurement / Finance | Substitution target: Devin competes for this budget |
| Cloud IDE and Infrastructure Bundles | AWS CodeWhisperer, Azure Dev tools, GCP AI bundled with cloud services | Core cloud compute, storage, networking | Engineering / Cloud buyer (bundled) | Partially competitive with Cognition for enterprise engineering wallet |
Market boundary is disputed across analyst firms; this table reflects Cognition's serviceable scope, not industry consensus definitions. Row 3 (low-code) is explicitly excluded from Cognition's TAM despite some analyst reports conflating it with AI code tools.
[CM001, CM021, CM023]2.2 Market Sizing — TAM, SAM, and SOM
Multiple independent sizing lenses are applied to bound the opportunity and reconcile widely varying analyst estimates. The lens approach is preferred over a single top-down estimate given material methodological variation across sources. Lens 1 — AI code tools market (Mordor Intelligence, 2025): The broad AI code tools market is valued at $7.37B in 2025 and forecast to reach $23.97B by 2030 at a 26.6% CAGR. This covers all AI-assisted coding tools from IDE plugins through full agents. TAM candidate: $7.4B (2025), $24B (2030). Limitation: overly broad; includes low-code tools, documentation AI, and testing automation not directly comparable to Devin's buyer profile. Lens 2 — Developer seats × willingness-to-pay: 27–28.7 million professional developers globally (Evans Data / Statista 2024). GitHub Copilot at $10–$19/month captures approximately 1.3 million paid seats from this pool in 2025. Autonomous agent tools targeting enterprise buyers at $500–$5,000/month per team are addressable to perhaps 10% of enterprise engineering teams—roughly 500K–1M developer seats × $2,000/year average contract value = $1B–$2B SAM in 2026, expanding to $5B–$8B by 2030 as adoption broadens. Lens 3 — Enterprise software engineering spend share: Global software spending reached approximately $675– $700B in 2024 (WIPO). Even a 1% shift of software project labor costs to AI tools implies a $6.7B market; a 3% shift implies $20B. This is an upper-bound lens corroborating the Mordor $24B 2030 figure. Lens 4 — Gartner adoption curve: Gartner forecasts 90% of enterprise engineers using AI code assistants by 2028 versus 14% in early 2024. Total estimated enterprise engineering headcount globally approximately 15 million. At 90% penetration × $500/year average price = $6.75B enterprise sub-segment alone by 2028. These estimates conflict materially: the narrow Grand View Research figure of $92–98M by 2030 excludes coding-adjacent AI tools entirely, while the $97.9B ResearchAndMarkets figure appears to use an atypically broad scope that inflates the addressable market. Diligence should weight the $1–8B SAM range most heavily. [CM001, CM002, CM003, CM004, CM005, CM006]
| Publisher | Year | Geography | Value | CAGR | Methodology | Confidence | Limitation |
|---|---|---|---|---|---|---|---|
| Mordor Intelligence | 2025→2030 | Global | $7.37B→$23.97B | 26.6% | Bottom-up vendor revenues + adoption modeling | medium | Broad scope includes low-code tools; overstates Devin-comparable market |
| Grand View Research | 2024→2030 | Global | $25.9M→$92.5M (narrow) | 24.8% | Narrow 'generative AI coding assistants' only | medium | Likely understates by excluding broader developer AI tools |
| ResearchAndMarkets / BusinessWire | 2025→2030 | Global | →$97.9B | 24.8% | Broad AI tools including dev infrastructure | low | TAM figure implausibly large; includes non-comparable segments |
| Developer seats lens (this analysis) | 2025→2030 | Global | $1–2B SAM→$5–8B | ~40% | 27M developers × enterprise penetration × ASP | medium | ASP assumption ($2K/yr avg) sensitive to enterprise/SMB mix |
| Gartner adoption lens (this analysis) | 2028 | Enterprise global | ~$6.75B enterprise segment | N/A | 90% enterprise penetration × 15M eng × $500/yr | medium | Assumes full Gartner adoption curve materializes by 2028 |
| Agentic AI market (CMR / VC Cafe) | 2025→2034 | Global | $4.35B→$103B | >40% | Cross-sector autonomous agent spend | low | Includes non-software verticals; directionally supportive only |
Wide TAM range reflects fundamentally different scope definitions. SAM lens ($1–2B today) is the most actionable figure for Cognition planning. Low/base/high scenario: $1B / $7B / $24B by 2030 for the broad AI code tools market.
[CM001, CM002, CM003, CM004, CM005, CM024]TAM/SAM/SOM layers for the AI autonomous software engineering market as of 2025, showing addressable market funnel from broad AI tools to Cognition's target SOM
SAM and SOM are analyst estimates based on developer-seat and enterprise-spend lenses; actual market boundaries disputed across research firms.
[CM001, CM005, CM007, CM035]Low / base / high estimates for the AI code tools market by 2030, showing analyst disagreement across scope definitions
All values are 2030 projections in USD billions. Wide range reflects fundamentally different scope definitions across analyst sources. ResearchAndMarkets $97.9B figure excluded as outlier with implausibly broad scope.
[CM002, CM003, CM004, CM005, CM024]2.3 Buyer, User, and Payer Segmentation
Cognition AI targets three principal buyer segments with distinct budget ownership and adoption triggers. The primary segment, enterprise software teams, sees VP Engineering or CTO act as buyer with software engineers as users and IT budget as payer. The adoption trigger is a measurable productivity gap—typically revealed by benchmarking AI tool ROI against headcount costs. Reference customers Goldman Sachs (12K developer pilot), Citi, Cisco, Dell, and Nubank illustrate this segment. Deal sizes range from $100K to $2M+ annually for enterprise contracts, with sales cycles of 3–9 months including legal and security review. The secondary segment, high-growth technology startups, sees CTOs or founding engineers as both buyer and user, with founders allocating seed capital as payer. The adoption trigger is velocity: Devin allows a 10-person team to execute like a 50-person team. These buyers typically start on the $20/month Core plan and upgrade to Team. Key reference customers include Ramp and Nubank, which reported a 12× efficiency gain. Deal value is lower at $500–$6,000/year but volume is high and conversion to enterprise contracts is likely as companies scale. The tertiary segment, individual developers and freelancers, presents lower strategic priority. Buyers and users are the same person; payers are self-employed developers billing hourly. The adoption trigger is competitive necessity as AI tooling becomes table-stakes for professional developers. The Core plan at $20/month targets this group, which has high churn sensitivity and primarily serves as a product feedback loop and lead-generation funnel rather than a primary revenue driver. Financial services enterprises represent a cross-cutting priority segment given Goldman Sachs' 12K-developer Devin pilot and Citi's reported deployment—these buyers require SOC 2 Type II, data residency, and regulatory compliance commitments before enterprise sign-off. [CM010, CM011, CM018, CM019, CM022, CM025]
| Segment | Buyer | User | Payer | Workflow | Budget Owner | Adoption Trigger |
|---|---|---|---|---|---|---|
| Enterprise F500 Engineering | VP Engineering / CTO | Software engineers, DevOps | Engineering / IT budget | Ticket resolution, code review, new feature development | CTO or VP Eng | Developer headcount cost pressure; AI ROI demonstrated by pilot |
| Mid-market SaaS Companies | CTO / Eng Manager | Backend + frontend developers | Engineering budget | Feature velocity, tech debt paydown | CTO | Speed-to-market competitive pressure; fundraise-to-launch cycle time |
| High-Growth Startups (Seed–Series B) | Founding engineer / CTO | Developers + CTO | Founder budget / seed capital | Full-stack feature development, MVP builds | Founders | Capital efficiency: replace 2–3 developer headcount cost |
| Financial Services Enterprises | CIO / Head of Platform Eng | Compliance-conscious engineers | IT / digital transformation budget | Internal tooling, regulatory reporting code | CIO | Regulatory tech modernization + developer cost reduction |
| Individual Developers / Freelancers | Self | Self | Self | Client project execution, side projects | Individual | Competitive necessity; peers using AI tools |
Budget ownership varies: enterprise deals flow through formal procurement with security review lasting 3–9 months; startup deals are founder-approved in days. Financial services buyers require SOC 2 Type II and data residency commitments before enterprise sign-off.
[CM018, CM019, CM025, CM026]Buyer-user-payer relationships and value flow across Cognition AI's key market segments from budget owner through to enterprise deployment
[CM018, CM019, CM025, CM026, CM028]2.4 Growth Drivers and Adoption Constraints
The market is subject to strong structural tailwinds as well as specific constraints governing adoption velocity and the pace at which Cognition can monetize the opportunity. Growth drivers include documented developer productivity gains—GitHub Copilot users complete tasks 51–55% faster with 46% of code AI-generated, and organizations using AI tools report 3.2× developer productivity improvements—which validate the ROI case and lower the procurement approval bar. The rapid shift to agentic AI is particularly significant: enterprise spend on agentic AI systems is projected to surge from under $1B in 2024 to $51B+ by 2028 at approximately 150% CAGR, normalizing autonomous agent budgets across enterprise technology organizations. Labor cost and talent scarcity create a compelling economic case. The median US software engineer salary exceeds $130K annually, meaning a $500/month tool that autonomously resolves even 10–20% of engineering tickets has a self-funding ROI measurable in weeks rather than quarters. Gartner's Magic Quadrant coverage of AI code assistants provides enterprise procurement legitimacy, and the Gartner forecast of 90% enterprise adoption by 2028 creates a top-down organizational mandate supporting category-level spending approval. The primary constraints are trust and governance: enterprises universally require role-based access controls, audit trails, IP ownership clarity, and security review before autonomous deployment, adding 3–9 months to enterprise sales cycles. Hallucination and code quality risk mean autonomous agents can introduce hard-to-catch bugs; buyers insist on review layers that limit full automation scope. Integration with legacy toolchains—on-premise Git servers, air-gapped CI/CD pipelines, proprietary IDEs—increases deployment cost and complexity. The EU AI Act and US executive orders on AI safety impose documentation and explainability requirements on enterprise AI systems in critical workflows. Competitive fragmentation, with GitHub Copilot (42% market share), Cursor ($9B valuation), Windsurf (now part of Cognition), and OpenAI Codex all competing, can produce buyer choice overload that stalls procurement decisions. [CM008, CM009, CM012, CM013, CM014, CM015]
| Factor | Direction | Timing | Implication | Diligence Ask |
|---|---|---|---|---|
| Developer productivity ROI documented (51–55% faster with AI) | Tailwind | Present (2024–2026) | Validates economic case; accelerates procurement approval for AI tools | Verify Cognition-specific ROI data from customers, not just GitHub Copilot proxies |
| Gartner: 90% enterprise AI code adoption by 2028 | Tailwind | Near-term (2026–2028) | Category legitimacy; creates top-down mandate in large enterprises | Confirm enterprise customers are accelerating commitments in 2026 |
| Agentic AI spend: $1B→$51B by 2028 (150% CAGR) | Tailwind | Near-term (2026–2028) | Massive budget reallocation toward autonomous tools expands TAM rapidly | Monitor whether enterprise CFOs approve agentic budgets separately from copilot budgets |
| Software engineer talent scarcity and salary inflation | Tailwind | Ongoing | Makes $500/month Devin compelling vs. $130K+ engineer hire economically | Track developer salary surveys; assess if AI tools slow hiring demand in target segments |
| Hallucination / code quality trust deficit | Headwind | Near-term (2025–2027) | Slows autonomous deployment; requires human review layers limiting full TAM capture | Ask customers what % of Devin output is deployed without human review in production |
| Enterprise security and IP ownership concerns | Headwind | Present and ongoing | Extends sales cycles 3–9 months; restricts Fortune 500 deployments pending legal review | Review Cognition's IP terms, data handling, and SOC 2 / ISO 27001 certifications |
| EU AI Act and US AI governance regulation | Headwind | Medium-term (2026–2028) | Adds compliance documentation burden; may require explainability features Devin lacks | Review Cognition's regulatory compliance roadmap; assess EU market readiness |
| Competitive fragmentation (Copilot, Cursor, Windsurf, Codex) | Headwind | Present | Buyer choice overload stalls procurement; risk of commoditization at copilot tier | Track whether Devin's agentic differentiation sustains or competitors close the gap |
Timing reflects estimated onset or peak relevance of each factor. All tailwinds are structural (secular trends) rather than cyclical, suggesting the favorable dynamics are durable over the 3–5 year investment horizon.
[CM008, CM009, CM012, CM013, CM014, CM020]Adoption journey from developer awareness to full autonomous deployment of Cognition AI Devin in enterprise environments
Funnel stage percentages are estimates based on Gartner adoption data and reported customer counts; intermediate stages (evaluation, departmental) are analyst estimates not directly published by any research firm.
[CM008, CM010, CM015, CM017, CM019]2.5 Exhibits
03Competitors
3.1 Competitive Landscape Overview
The AI-assisted software engineering market in 2026 is stratified into three broad competitive tiers. The first tier—established workflow co-pilots—includes GitHub Copilot (Microsoft) and Amazon Q Developer, both backed by cloud hyperscaler distribution. GitHub Copilot commands an estimated 42% category market share with 20 million users and over $2 billion in annual recurring revenue as of early 2025, deeply embedded across enterprise developer toolchains. Amazon Q Developer provides similar completion and chat capabilities optimized for AWS workloads, targeting the large installed base of AWS enterprise customers with SOC 2 Type II compliance and native IAM integrations. The second tier—IDE-native AI coding assistants—is led by Cursor (Anysphere), which reached $2 billion ARR by February 2026 after launching its first paid tiers in mid-2024 and now operates at a $29.3 billion valuation. Cursor's VS Code fork model allows deep editor integration unavailable to plugin-based competitors, and its multi-agent parallel execution architecture (up to eight concurrent agents) increasingly overlaps with Devin's autonomous coding use case. Windsurf (formerly Codeium), now acquired by Cognition, previously competed in this tier before integration. The third tier—fully autonomous AI software engineers—is where Cognition's Devin directly positions. Claude Code (Anthropic), OpenAI Codex (re-launched as a web-based agent), and SWE-agent (open-source academic benchmark harness) all compete here to varying degrees. Claude Code's terminal-native approach and superior code quality on SWE-bench evaluations represent a quality threat, while OpenAI Codex's re-launch as an agentic execution environment targets Devin's core positioning. The differentiation frontier is shifting from benchmark scores toward measurable enterprise workflow integration and verifiable ROI metrics—terrain where Cognition's deployment evidence with Goldman Sachs and others provides a current advantage. [CP001, CP002, CP003, CP033]
| Competitor | Company | Founded | Tier | ARR (est.) | Valuation | Key Investors |
|---|---|---|---|---|---|---|
| GitHub Copilot | Microsoft/GitHub | 2021 | Hyperscaler co-pilot | $2B+ | N/A (MSFT) | Microsoft |
| Cursor | Anysphere | 2022 | IDE-native assistant | $2B | $29.3B (Nov 2025) | Thrive, a16z, Accel, Nvidia, Google |
| Claude Code | Anthropic | 2023 | Autonomous agent | N/A (bundled) | $60B+ | Google, Amazon, Salesforce |
| Amazon Q Developer | Amazon Web Services | 2023 | Hyperscaler co-pilot | N/A (AWS bundle) | N/A (Amazon) | Amazon |
| OpenAI Codex | OpenAI | 2025 (re-launch) | Autonomous agent | N/A (bundled) | $300B+ | Microsoft, a16z, Thrive |
| Replit | Replit Inc. | 2016 | Cloud IDE platform | $80M est. | $1.2B | Andreessen Horowitz, Coatue |
| SWE-agent | Princeton NLP | 2024 | Open-source framework | Free/open-source | N/A | Academic |
| Windsurf (pre-acq.) | Codeium (acquired) | 2021 | IDE-native assistant | $82M (at acq.) | $250M est. | General Catalyst, Kleiner |
ARR and valuation figures are estimates from secondary sources as of early 2026 where primary sources unavailable.
[CP001, CP002, CP003, CP005, CP006, CP016]3.2 Tier-One Hyperscaler-Backed Competitors
GitHub Copilot, owned by Microsoft since the GitHub acquisition, benefits from unmatched distribution embedded in the world's largest developer platform. With 90% of Fortune 100 companies using Copilot and over 20 million active developers, its scale creates high switching-cost inertia. Copilot's pricing ranges from $10 per month for individuals to $19 per month for Business tier and $39 per month for Enterprise, making it the default low-cost entry point for most developer teams. The platform has progressively expanded from inline code completions to chat, pull-request summarization, and workspace-level Copilot Workspace—a multi-step planning and execution agent that increasingly encroaches on Devin's autonomous territory. GitHub Copilot reached over $2 billion in annual recurring revenue by early 2025, making it the largest pure-play AI coding revenue line globally. Amazon Q Developer addresses the enterprise segment's compliance and security requirements most directly, offering a Free tier and a Pro tier at $19 per user per month. Its native AWS service integrations, 200,000-token context window, and Agents for Amazon CodeWhisperer capability set position it as the preferred choice for organizations standardized on AWS. Q Developer achieved SOC 2 Type II certification and supports VPC isolation—security postures unavailable from most startup competitors. The hyperscaler tier's primary limitation versus Cognition is the absence of full-cycle autonomous task completion: neither Copilot nor Q Developer autonomously plans, implements, tests, and deploys end-to-end without human-in-the-loop confirmation at each stage. This architectural constraint creates the addressable gap Devin is designed to exploit, though both hyperscalers are actively extending agent capabilities toward this frontier. [CP004, CP009, CP010, CP011, CP019, CP022]
| Feature | Devin (Cognition) | GitHub Copilot | Cursor | Claude Code | Amazon Q Dev |
|---|---|---|---|---|---|
| Autonomous end-to-end execution | Yes (full cycle) | Partial (Workspace) | Partial (8 agents) | Partial (human-supervised) | Partial (Agents feature) |
| IDE integration | SaaS + Windsurf IDE | VS Code, JetBrains, Vim | VS Code fork | Terminal / CLI | VS Code, JetBrains, CLI |
| Context window | Not disclosed | Not disclosed | Not disclosed | 200K tokens | 200K tokens |
| Multi-agent parallel | Yes (Devin Teams) | No | Yes (up to 8) | No | No |
| Open-source model option | No | No | Model routing (multiple) | No | No |
| SOC 2 / Enterprise compliance | Partial | Enterprise tier | Enterprise tier | Enterprise via API | SOC 2 Type II |
| Free tier | No | Free (limited) | Free (hobby) | Yes (limited) | Free (50 req/mo) |
| VPC / on-prem deployment | Not documented | No | No | Via API gateway | Yes (VPC) |
Capability ratings are based on publicly documented features as of May 2026; roadmap features excluded.
[CP008, CP010, CP012, CP022, CP025, CP030]| Tool | Free Tier | Entry Paid | Mid Tier | Enterprise |
|---|---|---|---|---|
| Devin (Cognition) | No | $20/mo (5 ACUs) | $500/mo (250 ACUs, Team) | Custom |
| GitHub Copilot | Yes (2k completions) | $10/mo (Individual) | $19/mo (Business) | $39/mo (Enterprise) |
| Cursor | Yes (hobby) | $20/mo (Pro) | $40/mo (Business) | Custom |
| Claude Code | Yes (limited) | $10/mo (Pro) | $100/mo (Max) | Enterprise API |
| Amazon Q Developer | Yes (50 req/mo) | $19/mo (Pro) | N/A | Custom / Marketplace |
| Replit | Yes | $25/mo (Core) | $40/mo (Teams) | Custom |
Pricing as of May 2026 from official product pages; subject to frequent change.
[CP009, CP015, CP019, CP023]3.3 IDE-Native Competitors — Cursor and Windsurf
Cursor (Anysphere) is the most commercially formidable competitor to Cognition in the pure-ARR sense, having scaled from zero to $2 billion ARR within approximately 24 months of its public launch. Its Series D funding round in November 2025 raised $2.3 billion at a $29.3 billion valuation—the largest venture financing in AI coding history at the time. Investors include Thrive Capital, Andreessen Horowitz, Accel, Nvidia, and Google. Cursor's architecture relies on a VS Code fork with deep token-budget controls, model-agnostic routing (Sonnet, GPT-4o, Gemini), and a multi-agent background agents feature that runs up to eight parallel task threads—narrowing the functionality gap with Devin's full-autonomy proposition. Cursor's 1 million-plus paying customers and 50,000 enterprise teams demonstrate product-market fit across the individual developer and team segments. Cursor is growing faster than Cognition on ARR metrics and has more paying customers, providing proportionally more training signal and pricing power. Windsurf (formerly Codeium) built strong developer mindshare with a free-tier, VS Code compatible AI IDE and over 800,000 users before the Cognition acquisition in July 2025. Post-acquisition, Windsurf's enterprise book ($82M ARR contributed, 350-plus enterprise customers) was folded into Cognition's revenue base, converting a direct competitor into a distribution channel. The integration is not yet complete, and retention of Windsurf's enterprise customers under the Cognition brand is a key execution risk. The acquisition also demonstrated Cognition's willingness to pursue inorganic growth to defend against IDE-layer encroachment—a signal that the competitive pressure from Cursor in the IDE tier was viewed as existential to distribution strategy. [CP005, CP006, CP007, CP008, CP018, CP020]
3.4 Autonomous Agent Competitors — Claude Code, OpenAI Codex, and SWE-agent
Claude Code (Anthropic) is arguably Cognition's most direct quality-based competitor in the autonomous agent tier. Claude Code operates as a terminal-native agentic coding assistant layered on top of Anthropic's Claude 3.7 Sonnet and Claude 4 models, which have consistently ranked among the highest performers on SWE-bench Verified—an independent benchmark measuring an agent's ability to resolve real GitHub issues. Devin's initial SWE-bench score of 13.86% (March 2024) was a breakthrough at the time, but subsequent model improvements from Anthropic pushed Claude-based agents above 50%, raising questions about Devin's benchmark differentiation. Claude Code's pricing ($10 per month on Pro, $100 per month on Max) undercuts or matches Devin's Core tier ($20 per month for 5 ACUs) on entry price, though it offers different economics for agentic-scale autonomous use. The key architectural gap is that Claude Code still requires a human operator to supervise at the terminal, whereas Devin is designed to run fully unattended in enterprise cloud environments—a meaningful operational distinction for large-scale automation. OpenAI re-launched Codex in 2025 as a web-based agentic coding environment backed by the o3 model family, positioning it as a direct Devin competitor. Codex Workspace allows users to assign high-level feature tasks and receive completed pull requests, mirroring Devin's core value proposition. Given OpenAI's $300 billion-plus valuation, API distribution, and ChatGPT developer ecosystem, Codex's competitive entry represents a fundamental threat to Cognition's differentiation if OpenAI prioritizes execution quality and aggressive pricing. SWE-agent (Princeton NLP Group) is an open-source research framework rather than a commercial product; it is significant primarily as a benchmark comparator and as a recruitment signal that world-class ML researchers are actively working on autonomous code execution frameworks outside commercial settings, which could accelerate open-source commoditization of the core technology. [CP012, CP013, CP014, CP015, CP016, CP017]
| Moat Factor | Cognition Strength | Durability (1-5) | Primary Threat | Threat Source |
|---|---|---|---|---|
| Autonomous execution depth | High – full unattended task cycles | 3 | Rapid capability parity from labs | Anthropic, OpenAI |
| Enterprise integrations | Medium – Goldman, Citi, Nubank proofs | 4 | Hyperscaler bundling incentives | Microsoft, Amazon |
| Windsurf IDE distribution | Medium – 350+ enterprise clients | 3 | Cursor IDE market dominance | Cursor (Anysphere) |
| Founder talent + network | High – 3x IOI gold medalists, ex-Scale/DeepMind | 4 | Talent poaching by frontier labs | OpenAI, Google, Anthropic |
| Training data flywheel | Medium – speculative, unverified | 2 | Larger model labs have more training data | OpenAI, Google |
| Benchmark leadership | Low – lost within months of launch | 1 | All major competitors surpassed SWE-bench score | All peers |
| Pricing competitiveness | Low – most expensive per-task | 2 | Commoditization pressure from free/cheap tiers | Claude Code, Copilot |
Durability scores are qualitative analyst assessments; scale 1 (weakest) to 5 (strongest).
[CP026, CP027, CP028, CP032, CP034]3.5 Moat Assessment and Competitive Dynamics
Cognition's defensible competitive advantages break down into three categories: execution depth, enterprise data flywheel, and distribution via Windsurf. Execution depth—the ability to complete multi-hour, multi-file agentic coding sessions with planning, sandboxed execution, testing, and PR creation—remains ahead of most competitors in verified enterprise deployments (Goldman Sachs 12K developer pilot, Nubank 12x efficiency claim). The enterprise data flywheel from each completed task creates training signal for model fine-tuning, which potentially compounds over time—but this advantage is speculative until Cognition's model quality can be independently verified versus frontier labs. Windsurf integration adds IDE-layer distribution that Devin alone could not have quickly organically scaled. Counterarguments to moat durability are significant. Cursor is growing faster than Cognition on pure ARR metrics and has more paying customers. Frontier model labs (Anthropic, OpenAI) control the underlying reasoning capabilities and can deploy equivalent agentic pipelines without licensing dependencies—giving them structural cost advantages. Benchmark credibility questions persist: Cognition's March 2024 Devin demo was partially disputed, and independent evaluations of SWE-bench performance showed other tools surpassing Devin within months of its launch. The competitive window for pure agentic differentiation is narrowing, making Cognition's enterprise go-to-market execution and customer retention metrics the true leading indicators of sustainable competitive position. Cognition's compliance posture relative to enterprise-grade competitors like Amazon Q Developer (SOC 2 Type II, VPC isolation) is another documented gap that could slow enterprise contract cycles for security-sensitive verticals. [CP026, CP028, CP031, CP035]
04Financials
4.1 Revenue Model and ARR Trajectory
Cognition AI operates a consumption-based SaaS revenue model anchored to Agent Compute Units (ACUs)—a proprietary token of compute capacity consumed per autonomous task. Individual developers subscribe at $20 per month (Core plan: 5 ACUs), teams at $500 per month (Team plan: 250 ACUs), and enterprises under custom contracts with volume discounts and additional ACU pools. The pricing ladder is designed to monetize proportionally to task complexity and volume, making it structurally different from seat-based tools like GitHub Copilot or Cursor. However, this model creates lumpy revenue: enterprise customers with variable sprint workloads may have uneven monthly consumption patterns, requiring careful cohort-level analysis to distinguish durable ARR from one-time expansion. Cognition's ARR trajectory has been one of the most rapid in enterprise AI history. The company exited the first six months of operations with negligible revenue, crossed $1M ARR in September 2024, reached $73M ARR by June 2025—a 73x increase in nine months—and then jumped to approximately $155M combined ARR following the July 2025 Windsurf acquisition ($82M ARR contributed). The Cognition AI blog post from September 2024 ("Funding, Growth, and the Next Frontier of AI Coding Agents") confirmed the $73M ARR figure and outlined the company's two-product strategy. Post-Windsurf, the combined entity has a bifurcated revenue base: Devin's enterprise automation revenue (high ACV, low seat count) and Windsurf's IDE subscription revenue (lower ACV, higher seat count). Separating and growing both lines while integrating the teams is a material execution risk. As of April 2026, reports suggest the company is in discussions for additional financing at a potential $25B+ valuation, implying investors believe ARR will continue growing materially above $155M. However, post-acquisition ARR disclosed figures should be interpreted cautiously: the $82M Windsurf ARR contribution was measured at time of acquisition and may not reflect post-migration retention rates.
| Revenue Stream | Product | Model | ACV Range | Launched | Est. Contribution to ARR |
|---|---|---|---|---|---|
| Individual subscriptions | Devin Core | Usage-based, $20/mo (5 ACUs) | $240/yr | 2024 | ~$5M est. |
| Team subscriptions | Devin Teams | Usage-based, $500/mo (250 ACUs) | $6,000/yr | 2024 | ~$25M est. |
| Enterprise contracts | Devin Enterprise | Custom, VPC deployment | $50K–$1M+ /yr | 2024 | ~$43M est. |
| IDE subscriptions (Windsurf) | Windsurf Free/Pro | Seat-based, $0–$15/mo | ~$120/yr paid seat | Pre-acq. (2024) | ~$30M est. |
| IDE enterprise (Windsurf) | Windsurf Enterprise | Custom enterprise contracts | $50K–$500K/yr | Pre-acq. (2024) | ~$52M est. |
| Usage overages | Devin all plans | Per-ACU metered, ~$2.25/ACU | Varies | 2024 | Not disclosed |
ARR contribution estimates are analyst approximations; Cognition does not disclose per-stream revenue breakdown.
[CI001, CI002, CI003, CI008]4.2 Funding History and Capital Structure
Cognition AI has raised approximately $696 million in venture capital across three primary rounds in under 18 months, making it one of the fastest-funded AI infrastructure companies on record. The seed/Series A in March 2024 raised approximately $21 million at a $350 million pre-money valuation from Founders Fund, signaling early conviction in the founding team before any product revenue. In April 2024, just weeks after launch, Cognition closed $175 million at a $2 billion post-money valuation—again led by Founders Fund—crossing unicorn status within six months of founding. The September 2025 Series B raised $400 million at a $10.2 billion post-money valuation, with co-investors including Lux Capital, 8VC, Elad Gil, Bain Capital Ventures, D1 Capital, Definition Capital, and Swish Ventures. VentureBeat and TechCrunch confirmed this round in September 2025. The company disclosed a remarkably low net cash burn of under $20 million from founding through Q3 2025—extraordinary given the fundraising scale—suggesting the bulk of capital was held in reserve rather than spent on headcount or infrastructure at a rate typical for similar-stage companies. The Windsurf acquisition in July 2025 was estimated at approximately $250 million, funded from existing capital reserves without a new primary raise—a sign of financial discipline given the scale of the transaction relative to employee count. Post-acquisition headcount reportedly reached approximately 249 before layoffs of 30 employees and the departure of Windsurf personnel who accepted the 9-month buyout offer.
| Plan | Product | Price | ACU Allowance | Target Segment | Overage Rate |
|---|---|---|---|---|---|
| Core | Devin | $20/month | 5 ACUs | Individual developers | ~$2.25/ACU |
| Team | Devin | $500/month | 250 ACUs | Engineering teams | ~$2.25/ACU |
| Enterprise | Devin | Custom | Custom pool | Enterprise orgs | Negotiated |
| Free | Windsurf IDE | $0/month | Limited AI requests | Hobbyist/student | N/A |
| Pro | Windsurf IDE | ~$15/month | Standard AI credits | Individual devs | Varies |
| Enterprise | Windsurf IDE | Custom | Custom | Enterprise orgs | Negotiated |
Windsurf pricing as of acquisition July 2025; subject to change post-integration with Cognition pricing structure.
[CI001, CI002, CI005]4.3 Unit Economics and Margin Structure
Cognition's unit economics are difficult to model precisely because the company does not disclose gross margins, customer acquisition cost (CAC), or cohort-level retention data. However, structural inferences are possible. The ACU model implies a cost-of-revenue that is partially variable (inference compute per task) and partially fixed (model hosting, sandbox environments, CI/CD infrastructure). Enterprise AI inference at frontier model scale (likely GPT-4 or Claude API calls per task) can cost $5–$50 per hour of agent runtime at current cloud spot prices, while enterprise ACU packages price at roughly $2.25 per ACU with 250 ACUs per Team plan month—creating a gross margin that depends heavily on task length and model call efficiency. High-value enterprise contracts (Goldman Sachs, Citi, Nubank) likely have custom pricing with volume discounts that could improve margin at scale if infrastructure costs are amortized across large task volumes. The Nubank case study reporting a 12x efficiency improvement in code migration workflows suggests that customers perceive strong value relative to developer hours saved—a favorable indicator for pricing power. However, the inherent challenge for fully autonomous coding agents is that task complexity variance is high: simple bug fixes cost little to execute but complex multi-week feature builds can consume compute budget disproportionately. Unless Cognition gates task scope at the enterprise level, margin management at scale will require sophisticated compute forecasting. The company's capital efficiency pre-acquisition (sub-$20M burn on $350M+ raised before revenue scale) contrasts sharply with the likely post-acquisition cost structure. Adding 250 employees (Windsurf personnel) to a ~49-person team more than quadruples headcount, compressing per-employee revenue significantly in the near term unless post-integration synergies materialize quickly.
| Metric | Estimate | Basis | Confidence | Caveat |
|---|---|---|---|---|
| Gross margin (est.) | 50–70% | Inferred from compute cost vs. ACU pricing | low | Not disclosed; highly variable by task complexity |
| Inference cost per ACU (est.) | $0.50–$1.50 | Market rates for frontier model inference | low | Depends on model mix and task length |
| Customer ACV (enterprise) | $50K–$1M+ | Industry comp, customer case studies | medium | Range is wide; no primary disclosure |
| Net revenue retention (est.) | 100–130% | Inferred from ARR growth trajectory | low | No NRR data publicly disclosed |
| CAC (est.) | Not disclosed | N/A | unknown | No marketing spend or sales data published |
| ARR per employee (pre-Windsurf) | ~$1.5M | ~49 employees, $73M ARR as of June 2025 | medium | Headcount estimate is approximate |
| ARR per employee (post-Windsurf) | ~$623K | ~249 employees, $155M ARR as of July 2025 | medium | Includes all acquired Windsurf staff before departures |
All unit economics are analyst estimates; no primary financial disclosure exists. Treat all values as directional only.
[CI006, CI009, CI010, CI011]4.4 Public Financial Data Gaps and Disclosure Limitations
Cognition AI is a privately held Delaware C-corporation with no public filing obligations, no disclosed GAAP financial statements, and no investor day or earnings calls. All financial data in this chapter relies on: (1) company-authored blog posts; (2) third-party news reports citing unnamed sources; (3) investor communications and press release summaries; and (4) secondary market research estimates. This disclosure posture is typical for pre-IPO AI companies but creates material risk for investors and customers attempting to verify sustainability of the ARR trajectory. The most significant undisclosed financial metrics are: gross margin by product line (Devin vs. Windsurf), customer churn and net revenue retention (NRR), CAC and payback period, enterprise deal sizes and contract lengths (ACV), and post-Windsurf integration costs. Without NRR data, it is impossible to distinguish whether the $73M to $155M ARR jump reflects organic customer expansion or purely inorganic acquisition contribution. Enterprise AI tool contracts in other companies typically show NRR of 100–140% when adoption is strong; if Cognition's NRR is below 100%, the underlying Devin-only ARR may be stagnating despite the Windsurf injection. Sacra, CB Insights, and other secondary research platforms provide partial revenue and headcount estimates that must be treated as approximations. A secondary concern is the reported $25B+ valuation discussions in early 2026, which would imply a forward revenue multiple of approximately 160x on $155M ARR. While extreme multiples are historically common for hypergrowth AI companies in their early stages, they are sustainable only if ARR continues growing at 2x+ annually—a bar that requires substantial new enterprise customer acquisition and retention of the Windsurf base. Any deceleration in ARR growth would make the valuation multiple compress significantly.
| Round | Date | Amount | Valuation (post-money) | Lead Investors | Confirmed Source |
|---|---|---|---|---|---|
| Seed/Series A | Mar 2024 | $21M | $350M | Founders Fund | Multiple news reports |
| Series A extension | Apr 2024 | $175M | $2B | Founders Fund | Axios, CNBC, multiple |
| Series B | Sep 2025 | $400M | $10.2B | Lux Capital, 8VC, Elad Gil, BCV, D1 | TechCrunch, VentureBeat, CNBC |
| Total raised (through Series B) | Sep 2025 | ~$596M | $10.2B | Multiple | Summed from above |
| Windsurf acquisition | Jul 2025 | ~$250M (est.) | N/A | Cognition (acquirer) | Widely reported estimate |
| Potential new round (rumored) | 2026 | Not disclosed | $25B+ (reported) | Not disclosed | Press reports, not confirmed |
Series B total excludes the Windsurf acquisition cost; combined capital deployed (raised + acquisition) is approximately $846M. Founders Fund IX Form D/A (SEC, Oct 2025) confirms $972M LP capital base.
[CI012, CI013, CI014, CI015, CI016, CI024]4.5 Capital Adequacy and Financial Runway
With approximately $696M raised and sub-$20M cash burn disclosed through Q3 2025, Cognition had substantial financial runway heading into 2026 even before any new financing. The Windsurf acquisition consumed an estimated $250M of that reserve, leaving approximately $426M available minus ongoing operating expenses. At a conservative 200-employee burn rate of $400K per employee-year (blended fully-loaded cost), annual operating costs could be $80M or higher excluding compute. If the company is generating $155M ARR with moderate gross margins (estimated 50–70% given inference costs), gross profit could fund ongoing operations without immediate new capital. However, the $25B+ valuation discussions suggest the company is seeking capital to accelerate—not merely to sustain—operations. Expected uses of new capital include R&D for Devin next-generation models, enterprise sales force expansion, global data center capacity for inference at scale, and potential further acquisitions to consolidate the AI coding infrastructure market. The Founders Fund lead investor relationship provides additional runway optionality: Founders Fund has historically shown willingness to lead follow-on rounds for breakout portfolio companies, reducing financing risk. The presence of large institutional co-investors (D1 Capital, Bain Capital Ventures) signals access to crossover and late-stage capital. Overall capital adequacy is rated medium-high: the company has sufficient runway for at least 24 months at current burn estimates, but aggressive growth execution and integration costs from Windsurf could accelerate burn faster than current estimates suggest.
| Missing Metric | Why It Matters | Severity | Proxy Used |
|---|---|---|---|
| Gross margin by product line | Determines profitability and long-term unit economics | High | Industry comp: 50–70% est. |
| Net revenue retention (NRR) | Indicates whether ARR is organically durable | High | Not available; assume 100–130% |
| Customer churn rate | Validates stickiness of enterprise contracts | High | Not disclosed; inferred from case studies |
| CAC and payback period | Determines GTM efficiency at scale | Medium | Not available |
| GAAP revenue vs. ARR | Deferred revenue may differ from ARR | Medium | ARR used as proxy |
| Windsurf post-acquisition retention | Critical for combined ARR durability | High | Not yet disclosed |
Gaps listed represent the most decision-relevant undisclosed financial data as of May 2026 research date.
[CI017, CI018]05Product & Technology
5.1 Product Definition and Customer Workflow
Cognition AI offers two core products: Devin, a cloud-hosted autonomous AI software engineer, and Windsurf, an agentic IDE for local and cloud-assisted development. Devin accepts natural-language task descriptions—submitted via the web dashboard, Slack, Linear, or Jira—and executes full end-to-end software engineering tasks autonomously: planning the approach, writing and modifying code, running tests, iterating on failures, and opening pull requests with reviewable diffs and confidence indicators. Unlike code-completion tools (GitHub Copilot, Cursor), Devin takes ownership of entire tasks rather than assisting a human in writing individual lines. In a typical enterprise workflow, a developer or engineering manager assigns a Jira ticket or Slack message to Devin. Devin reads the ticket, explores the repository context using its DeepWiki server, sets up the necessary environment, writes the implementation, runs the test suite, and reports back with a pull request. The PR includes Devin's internal trace of decisions for human review. The Devin 2.0 release added a confidence meter that quantifies the probability of task success before committing, allowing teams to triage tasks efficiently. Devin can handle tasks concurrently across multiple sessions, enabling teams to delegate entire sprint backlogs rather than individual tickets. Windsurf IDE complements Devin by providing a local development environment with AI Cascade agent integration. Developers can start a coding session locally in Windsurf and hand off complex, compute-intensive work to Devin in the cloud mid-session, with state preserved across the transition. The "Devin in Windsurf" feature announced in April 2026 formalizes this local-to-cloud handoff workflow. Together, the two products target the full developer lifecycle: Windsurf for daily coding assistance and Devin for delegated autonomous task execution.
| Product | Delivery Mode | Target User | Key Features | Pricing Tier | Launched |
|---|---|---|---|---|---|
| Devin Free | Cloud agent | Individual dev | Limited ACUs, web dashboard, basic integrations | Free | Apr 2026 |
| Devin Pro | Cloud agent | Individual dev | More ACUs, API access, confidence meter | Paid ($20/mo base) | Apr 2025 |
| Devin Max | Cloud agent | Power user | High ACU pool, priority queue | Paid (custom) | Apr 2026 |
| Devin Teams | Cloud agent | Dev teams | Shared ACU pool, team management, Slack/Jira integration | Team pricing | 2025 |
| Devin Enterprise | Cloud or VPC | Enterprise org | VPC deploy, custom-trained Devin, dedicated support | Custom contract | 2024 |
| Windsurf IDE (Free/Pro) | Desktop IDE | Individual dev | Cascade AI agent, in-editor completions, Codemaps | Free/$15/mo | 2024 |
| Windsurf Enterprise | Desktop IDE + cloud | Enterprise org | SSO, centralized billing, enterprise support | Custom contract | 2024 |
| Devin API | REST API | DevOps/CI | Programmatic session management, CI/CD integration | Included in paid plans | 2025 |
Plan structure as of May 2026 research date; April 2026 plan refresh replaced legacy Core/Team plans with Free/Pro/Max/Teams/Enterprise structure.
[CE001, CE008, CE009, CE025, CE026]5.2 Product and Technology Map
Cognition AI's product portfolio as of May 2026 comprises four distinct product units: (1) Devin Core/Pro/Max/Enterprise—cloud agent plans for individual, team, and enterprise customers; (2) Windsurf IDE—the agentic desktop IDE with Cascade AI agent, acquired from Codeium in July 2025; (3) the Devin API—a RESTful API for programmatic session creation and CI/CD pipeline automation; and (4) Windsurf Enterprise—the enterprise tier of the IDE with SSO, centralized billing, and advanced admin controls. A fifth element, Windsurf Codemaps, is an AI-annotated structured codebase map powered by SWE-1.5 and Claude Sonnet 4.5, enabling rapid onboarding and debugging by grounding navigation to exact lines and visual node graphs. The April 2026 plan revision retired the original Core and Team plans, replacing them with a five-tier structure: Free (limited ACUs), Pro, Max, Teams, and Enterprise. This signals a land-and-expand strategy: the Free tier provides entry-level exposure for individual developers and bootcamp students, while Pro and Max serve power users, and Teams/Enterprise capture organizational deployment. SWE-Check, introduced in collaboration with Applied Compute in April 2026, is a specialized bug detection model trained via reinforcement learning that matches Opus 4.6 on internal evals while running approximately 10x faster—positioned as a cost-efficient quality gate within the Devin agent loop. The company's in-house model, SWE-1.5, powers both SWE-Check and Windsurf Codemaps, representing Cognition's first proprietary model release beyond the Devin runtime ensemble. All products are delivered cloud-first, with VPC deployment available for enterprise customers requiring on-premise or isolated cloud infrastructure. The Devin for Terminal feature, released in 2026, enables developers to start agent sessions from their local terminal and escalate to the cloud when the task outgrows local resources, bridging the CLI and cloud deployment modes.
| Category | Platform | Integration Type | Capability | Status |
|---|---|---|---|---|
| Source control | GitHub | Native | PR creation, repo access, issue tracking | GA |
| Source control | GitLab | Native | Branch management, MR creation | GA |
| Source control | Bitbucket | Native | PR creation, code contribution | GA |
| Project management | Jira | Native | Ticket assignment, status sync | GA |
| Project management | Linear | Native | Issue tracking, sprint delegation | GA |
| Communication | Slack | Native | Session initiation, progress updates | GA |
| Communication | Microsoft Teams | Native | Session initiation, notifications | GA |
| Monitoring | Sentry / Datadog / PagerDuty | MCP | Alert-triggered sessions, log context | MCP marketplace |
| Databases | PostgreSQL / MySQL / MongoDB | MCP | Read/write access for data tasks | MCP marketplace |
| Documentation | Notion / Confluence | MCP | Context retrieval for tasks | MCP marketplace |
MCP Marketplace integrations are third-party configured; availability depends on customer configuration. Native integrations are Cognition-maintained.
[CE005, CE006, CE007, CE018, CE030]5.3 Technical Architecture and Agent Runtime
Devin's core technical architecture centers on an agent runtime that manages long-horizon planning and multi-step execution within a sandboxed cloud environment. The agent receives a natural-language task specification and decomposes it into an ordered sequence of sub-steps: repository exploration, environment setup, implementation, test execution, and pull request creation. At each step, the agent calls specialized tools—a code editor, a Unix shell, and a headless browser— within an isolated compute container that prevents lateral access to other sessions or customer data. This sandboxed design separates Devin's execution context from the customer's production environment, with code changes surfaced only through pull requests that require human approval before merge. The DeepWiki server enables large-codebase comprehension by building vectorized project graphs that represent relationships between files, functions, and modules. This graph-based representation allows Devin to navigate million-line codebases more effectively than purely token-based context windows. BlockDiff snapshotting records incremental state checkpoints during task execution, enabling rapid rollback when a test fails or an approach is abandoned. These two features—DeepWiki and BlockDiff—are among Cognition's most differentiating technical capabilities, as they address the two primary failure modes of LLM-based coding agents: context loss in large repositories and unrecoverable error cascades. The proprietary SWE-1.5 model, released in 2025-2026, underlies SWE-Check and Windsurf Codemaps. It was trained via reinforcement learning specifically for software engineering tasks, contrasting with the general-purpose LLMs used as base layers in most competitor products. The agent runtime also supports Model Context Protocol (MCP), allowing Devin to connect to external tooling including Sentry, Datadog, PostgreSQL, MongoDB, Notion, and hundreds of other services through an extensible plugin marketplace. ACUs (Agent Compute Units) are the metered token of computation consumed per task, priced at approximately $2.25 per ACU with plan-included allocations varying by tier.
| Agent / Model | Benchmark Version | Score (% Resolved) | Evaluation Mode | Date | Source |
|---|---|---|---|---|---|
| Devin (Cognition AI) | Full (25% subset) | 13.86% | Unassisted | Mar 2024 | Cognition AI blog |
| Prior SOTA (best LLM) | Full | 1.96% | Unassisted | Mar 2024 | SWE-bench leaderboard |
| Best assisted LLM | Full | 4.80% | Assisted (files given) | Mar 2024 | SWE-bench leaderboard |
| SWE-agent (open source) | Lite | 12.47% | Unassisted | Mar 2024 | SWE-bench / Princeton |
| Claude Code (Opus 4) | Verified (500) | 72.5% | Unassisted | 2025 | Anthropic |
| mini-SWE-agent | Verified (500) | 65% | Unassisted | Jul 2025 | SWE-bench update |
| Devin PR acceptance trend | AIDev dataset (7,156 PRs) | +0.77%/week | Real-world PRs | 32 weeks | Arxiv 2026 |
Different benchmark versions (Full vs. Verified vs. Lite) are not directly comparable. Cognition's 13.86% used a 25% random subset of the Full benchmark. Claude Code's 72.5% uses the 500-instance Verified subset.
[CE002, CE020, CE023, CE024, CE035]5.4 Deployment, Integration, and Reliability
Devin supports two primary deployment modes: cloud-hosted (the default, operating in Cognition-managed infrastructure) and VPC deployment for enterprise customers requiring data residency, network isolation, or regulatory compliance. VPC mode allows Devin to operate within the customer's AWS, GCP, or Azure environment, eliminating the need to transmit source code to Cognition's servers. Custom-trained Devin instances are available at the enterprise tier, allowing customers to fine-tune the agent on their proprietary codebases and internal conventions. The platform integrates natively with GitHub, GitLab, and Bitbucket for source control (PR creation, code review automation, branch management); with Jira and Linear for ticket assignment and status tracking; and with Slack and Microsoft Teams for communication-channel session initiation and progress reporting. The REST API enables programmatic session management for CI/CD pipelines, allowing engineering organizations to trigger Devin sessions automatically on specific events (e.g., failing test suite, customer bug report). The MCP Marketplace extends integration to monitoring platforms (Sentry, Datadog, PagerDuty), databases (PostgreSQL, MySQL, MongoDB), and documentation tools (Notion, Confluence). The Windsurf IDE integrates Cascade AI agent locally and the "Devin in Windsurf" handoff for cloud-based tasks. Reliability is not formally disclosed through uptime SLAs or status page data in Cognition's public materials. Task execution time is bounded by a 45-minute maximum runtime per session, though tasks can be broken into sequential sub-sessions for longer projects. The confidence meter in Devin 2.0 provides a pre-execution estimate of task success probability, enabling teams to filter out low-probability tasks before consuming ACUs. These quality controls reduce waste but also signal that Devin's reliability on complex tasks remains probabilistic rather than guaranteed.
| Component | Function | Inputs | Outputs | Proprietary? |
|---|---|---|---|---|
| Agent runtime | Task decomposition, multi-step planning | Natural language task spec | Ordered sub-task plan | Yes |
| Sandboxed environment | Isolated compute container per session | Agent commands | Code edits, shell output, PR diff | Yes (cloud-hosted) |
| DeepWiki server | Vectorized codebase graph for context | Repository files | Graph-indexed code map | Yes |
| BlockDiff snapshots | Incremental state checkpoints for rollback | Session state at each step | Checkpoint index, rollback target | Yes |
| SWE-1.5 model | Bug detection (SWE-Check) + Codemaps | Code snippets, repository graphs | Bug likelihood score, annotated map | Yes |
| MCP layer | External tool connectivity | MCP-compliant tool specs | Live tool data in agent context | Open protocol |
| Devin API | Programmatic session management | HTTP requests with task spec | Session ID, status, PR URL | Yes (REST) |
Proprietary components are Cognition-developed and not open-sourced; MCP is an open protocol developed by Anthropic and adopted by Cognition.
[CE003, CE013, CE021, CE032, CE033]5.5 Technology Differentiation and Competitive Moats
Cognition AI's primary technical differentiator is the end-to-end autonomous task ownership model: Devin does not suggest code for a human to accept or reject, but independently plans, executes, and delivers a complete task artifact. This positions Devin against code-completion tools (GitHub Copilot, Cursor, Claude Code) rather than as a marginal improvement over them. The SWE-bench Full score of 13.86% set a new state-of-the-art for unassisted autonomous agents in March 2024, exceeding the prior best of 1.96% by 7x and exceeding the prior assisted SOTA of 4.80% by 3x. Devin's PR acceptance rate showed a consistent positive trend of +0.77% per week over 32 weeks, the only agent in a comparative 7,156-PR study to exhibit sustained improvement (Arxiv, 2026). The in-house SWE-1.5 model and SWE-Check tool represent Cognition's nascent vertical integration in model training. Most competitor AI coding agents (GitHub Copilot, Cursor, Windsurf pre-acquisition) are built on top of third-party foundation models (GPT-4, Claude, Gemini), giving those providers structural leverage over tool vendors. By training SWE-1.5 for software engineering tasks specifically, Cognition is building a model layer that is not dependent on API pricing from OpenAI or Anthropic for its most specialized use cases. The Windsurf acquisition added significant IP: the Cascade agentic workflow engine, the Windsurf IDE user base, and 350+ enterprise accounts, accelerating the product suite without requiring multi-year IDE development. Data flywheel advantages are present but not publicly quantified. Each Devin task generates execution traces that can be used to fine-tune future model versions. As enterprise customers use custom-trained Devin instances on their proprietary codebases, Cognition accumulates task-specific performance data that competitors without the same enterprise relationship cannot easily replicate. The DeepWiki codebase graph representation and BlockDiff snapshotting are proprietary engineering innovations that provide functional advantages in large-codebase contexts where pure-attention LLMs lose coherence.
| Control | Type | Status | Details | Disclosed? |
|---|---|---|---|---|
| Session sandbox isolation | Architecture | Confirmed | Each session runs in an ephemeral container | Yes (official docs) |
| PR-gated code changes | Process | Confirmed | All code changes require human approval before merge | Yes |
| VPC deployment | Architecture | Available (Enterprise) | Customer-controlled cloud perimeter | Yes |
| Execution trace logging | Audit | Confirmed | Full log of shell commands, file edits, API calls | Yes |
| SOC 2 Type II | Certification | Not disclosed | No public attestation as of May 2026 | No |
| ISO 27001 | Certification | Not disclosed | No public attestation | No |
| Prompt-injection defense (patched) | Security | Patched Dec 2024 | Critical vuln found live-streamed; Cognition patched | Adverse |
| Data retention policy | Privacy | Not disclosed | Code submitted to cloud sessions; retention unclear | No |
Absence of disclosed SOC 2 does not mean the certification does not exist; it may exist under NDA for enterprise customers. Cognition's trust.devin.ai page returned limited information at research date.
[CE015, CE028, CE029, CE031]5.6 Trust, Safety, Security, and Compliance
Devin's primary security architecture relies on sandboxed compute isolation: each session runs in an ephemeral container with no access to other sessions or the customer's production environment unless explicitly granted through integration credentials. Code changes are surfaced as pull requests requiring human approval, and the Devin execution trace provides an auditable log of every shell command, file edit, and API call made during the session. VPC deployment adds an additional isolation layer by keeping source code within the customer's own cloud perimeter. A significant security incident emerged in late 2024 when a live-streamed demonstration exposed a major vulnerability in Devin's system prompt handling—effectively allowing prompt-injection attacks to manipulate Devin's behavior. Cognition acknowledged the issue and patched it rapidly, but the incident highlighted the inherent risks of deploying autonomous agents that can execute shell commands based on text inputs. Hacker News commentary characterized the failure as "amateurish given the severity," noting that prompt-injection defenses should have been a first-priority security control for an agent with shell execution capabilities. Cognition has not publicly disclosed SOC 2 Type II certification, ISO 27001, or equivalent enterprise security certifications as of May 2026. The trust.devin.ai subdomain exists but returned limited public information at time of research. Enterprise customers relying on VPC deployment and custom-trained models are likely subject to separate security agreements, but these are not publicly auditable. Data privacy and retention policies for code executed in Devin sessions are not fully disclosed in Cognition's public-facing documentation, representing a material compliance gap for financial-services and regulated-industry customers. The Mercedes-Benz and Goldman Sachs partnerships suggest compliance barriers have been addressed in custom agreements, but without public disclosure, independent assessment is not possible.
06Customers
6.1 Customer Base Segmentation
Cognition AI targets professional engineering teams at technology-forward enterprises, mid-market companies, and developer-first organizations. The buyer profile is an engineering leader or CTO seeking to multiply developer throughput without proportional headcount growth; the user is typically an individual software engineer or engineering manager who delegates tasks directly via Slack, Jira, or the Devin web interface. The payer is the engineering budget owner, whether a department budget or a centralized IT procurement function at large enterprises. Verticals with confirmed production deployments include financial services and fintech (Nubank—LatAm's largest neobank), automotive/manufacturing (Mercedes-Benz), IT services and outsourcing (Cognizant), enterprise technology, and the US Federal Government. The COBOL modernization blog post confirmed Fortune 500 deployments in sectors that still run significant COBOL workloads: financial services, insurance, and public utilities. In 2026, Cognition launched a Government vertical to target legacy software modernization in US federal agencies. Japan and Singapore enterprise markets were opened in April 2026, suggesting Southeast Asia financial services, manufacturing, and technology firms are the next target segments. The geographic distribution is US-first (the majority of revenue and named customers as of early 2026), with Europe (London office opened January 2026) and APAC (Japan and Singapore offices opened April 2026) serving as growth centers. Channel distribution includes direct enterprise sales, the Windsurf IDE free-to-paid conversion funnel (250K+ daily active users at acquisition), and the Cognizant reseller partnership that deploys Devin and Windsurf across Cognizant's own engineering teams and its global client base. The Windsurf integration creates a bottom-up, developer-led adoption vector in addition to top-down enterprise procurement.
| Segment | Buyer Type | User Type | Use Case | Scale / Value | Evidence Level |
|---|---|---|---|---|---|
| LatAm Fintech Enterprise | CTO / Eng Leader | SWE / EM | Legacy ETL migration, code modernization | 100M users, 6M+ LoC codebase | Named (Nubank), quantified |
| Global Automotive / Manufacturing | CTO / VP Eng | SWE teams | Legacy modernization, cloud-native dev, logistics | Global enterprise | Named (Mercedes-Benz), announced |
| IT Services / Outsourcing | IT Exec / Channel | SWE teams + client engineering | Deploy Devin to own and client engineering teams | 300+ client organizations | Named (Cognizant), partner |
| Fortune 500 COBOL Legacy | VP Eng / CIO | Legacy SWE | COBOL modernization to modern stacks | Multi-decade COBOL systems | Unnamed, blog-disclosed |
| US Federal Government | Agency IT exec | Agency SWE | Critical infrastructure modernization | Federal agency scale | Vertical launched (Feb 2026), no named agencies |
| APAC Enterprise | CTO / Eng Leader | SWE teams | Software production across SE Asia, Japan | Large-enterprise | Market opened Apr 2026, no named customers |
| Individual Developer (SMB/indie) | Self | Self | Coding automation, personal projects | Low value, high volume | Self-serve Free/Pro tier |
Evidence level reflects public disclosure depth. Mercedes-Benz, Cognizant, and COBOL Fortune 500 are at announcement stage without published outcome metrics.
[CU001, CU002, CU003, CU004, CU005]6.2 Adoption Trajectory and Growth Metrics
Cognition AI's most striking adoption signal is the ARR growth trajectory: from approximately $1M ARR at general availability (December 2024) to approximately $73M ARR by April 2025—a roughly 73x increase in four months. This rate of growth is consistent with Sacra's estimate of ~$15M ARR in October 2024 and the Growjo platform's current estimate of $73M, though both figures are third-party estimates rather than audited or company-disclosed financials. The April 2025 pricing reset (Core plan from $500/month to $20/month base) accelerated individual developer adoption while maintaining enterprise revenue through volume. Internal usage signals are revealing. By February 2026, Cognition's own engineering team was merging 659 Devin PRs per week—four times the 154 per week achieved at their best week in 2025. This "dog-fooding" signal is significant: the team building Devin is also its most intensive user, providing a high-quality feedback loop that likely accelerates iteration velocity. The PR merge rate improved from 34% at Devin's launch to 67% by April 2025—a proxy for output quality improvement—though the metric is self-reported and denominator methodology is not specified. Geographic expansion (London, Tokyo, Singapore) in the first half of 2026 suggests management confidence in revenue sufficiency to support multi-office overhead. The Cognizant partnership is particularly significant as a channel amplifier: Cognizant's global client base spans 300+ organizations across financial services, manufacturing, and healthcare, meaning even partial penetration could materially accelerate Devin's enterprise account count without proportional Cognition sales headcount. However, no public data exists on the number of active paying accounts, the average contract value, or the customer growth rate beyond the ARR estimates.
| Metric | Value | Date | Source | Confidence | Implication |
|---|---|---|---|---|---|
| ARR at GA launch | ~$1M | Dec 2024 | Third-party estimate (Sacra/imseankim) | Low | Early adopter only; pricing was $500/month |
| ARR after pricing reset | ~$73M | Apr 2025 | Third-party estimate (Growjo / imseankim) | Low-medium | 70x+ growth in ~4 months with price cut to $20/month |
| ARR per Growjo (2025) | $73M estimated | Sep 2025 | Growjo revenue estimate | Low | Third-party model; not company-disclosed |
| PR merge rate at launch | 34% | Mar 2024 | Company-claimed (imseankim/VentureBeat) | Medium | 2 in 3 initial PRs were rejected; low initial quality |
| PR merge rate Apr 2025 | 67% | Apr 2025 | Company-claimed (imseankim) | Medium | Quality doubled in ~1 year; still 33% rejection |
| Cognition's own Devin PRs/week | 659 | Feb 2026 | Company blog (official) | High | Internal dog-fooding at high intensity; 4x growth from best 2025 week |
| Windsurf DAU at acquisition | 250,000+ | Jul 2025 | Company-claimed (Cognition blog) | High | Large Windsurf user base as acquisition conversion funnel |
| Employee count growth | ~102% YoY | 2025 | Growjo estimate | Low | Fast headcount growth consistent with revenue acceleration |
ARR figures are third-party estimates, not company-disclosed. PR merge rate metrics are company-claimed. Missing denominator: total account count and ACV breakdown unknown.
[CU006, CU007, CU008, CU009, CU010]6.3 Named Customer Proof and Reference Quality
The highest-quality named customer proof is the Nubank case study, the only publicly detailed and quantified production deployment. Nubank, Latin America's largest neobank (approximately 100 million customers), used Devin to migrate a 6-million-line ETL monolith to sub-modules—a task originally estimated to require 1,000+ engineers over 18 months. With Devin, the migration progressed in weeks for each business unit (Data, Collections, Risk), achieving 12x engineering efficiency improvement in hours saved and 20x cost savings versus the all-human baseline. Crucially, the case study documents fine-tuning on the customer's specific migration patterns, which led to a 4x speed improvement (40 minutes per task to 10 minutes) and a 2x accuracy improvement on the internal benchmark. Mercedes-Benz is named in a blog post (April 27, 2026) as deploying Devin and Windsurf "across its global engineering organization" for legacy modernization, cloud-native development, and logistics. No outcome metrics are publicly disclosed; the announcement is at the partnership/announcement stage rather than a post-hoc case study. Cognizant (announced January 28, 2026) is deploying Devin and Windsurf "across its engineering organization and global clients"—this is a channel partnership rather than a named end-customer, and no production outcome data is available. Additional unnamed customer evidence: Fortune 500 COBOL modernization deployments (blog post April 8, 2026, no named customers), US Government (Cognition for Government launched February 25, 2026, no named agencies), Japan enterprise (launched April 9, 2026, no specific customers named). The depth and freshness of proof varies enormously: Nubank is production with quantified outcomes; others are at announcement stage. This creates a concentration of evidence quality risk—if Nubank is not representative of typical deployments, the customer proof base is shallow.
| Customer | Segment | Use Case | Production vs Pilot | Outcome | Limitation |
|---|---|---|---|---|---|
| Nubank (LatAm neobank) | Fintech | ETL monolith migration (6M LoC, 100K data classes) | Production | 12x efficiency gain, 20x cost savings; weeks vs months | Fine-tuning required; Cognition-published case study |
| Mercedes-Benz | Automotive | Legacy modernization, cloud-native dev, logistics | Announced (Apr 2026) | Not disclosed | Announcement only; no outcome metrics |
| Cognizant | IT Services / Channel | Deploy Devin + Windsurf across own org + client base | Partnership (Jan 2026) | Not disclosed | Channel partnership; no end-customer or outcome data |
| Fortune 500 COBOL (unnamed) | Multiple legacy verticals | COBOL modernization to modern stacks | Production (implied) | Not quantified; blog describes ongoing deployments | Unnamed; blog-only disclosure |
| Cognition AI itself | AI SaaS / internal | Devin builds Devin; 659 PRs/week merged | Production (internal) | 4x growth in weekly Devin PRs (2025→2026) | Self-referential; no independent verification |
| US Government (agency unnamed) | Federal government | Critical infrastructure software modernization | Vertical launched | Not disclosed | No named agencies; FedRAMP status unknown |
Only Nubank has quantified production outcomes in a published case study. All other named relationships are at announcement stage or are channel partnerships without end-customer disclosure.
[CU003, CU004, CU005, CU011, CU012]6.4 Retention, Satisfaction, and Durability
Publicly available retention and satisfaction metrics for Cognition AI are limited. The company has not disclosed Net Revenue Retention (NRR), Gross Revenue Retention (GRR), customer churn rate, or cohort analysis data. The most proximate public indicator of satisfaction is the PR merge rate (67% as of April 2025, up from 34% at launch), which measures the share of Devin-opened pull requests that are accepted by human reviewers—a reasonable proxy for output quality but not a standard customer satisfaction metric. A 67% PR acceptance rate implies that one-third of Devin's work products require rejection or significant rework, which may be acceptable for high-volume autonomous tasks but is a notable quality gap versus human engineers. Developer sentiment from independent sources is mixed. Hacker News communities have noted that early Devin demos appeared polished but real-world performance on complex tasks lagged expectations set by the original 13.86% SWE-bench announcement. The independent reviewer at imseankim.com—after waiting six weeks to observe the product in production—noted that "independent testers tell a more complicated story" relative to Cognition's official performance numbers, and flagged that ACU costs accumulate quickly in practice, with moderate usage easily exceeding $100-200 per month on the nominally $20/month Core plan. The security vulnerability disclosed in December 2024 temporarily damaged trust among security-conscious enterprise customers. On durability signals, the Nubank deployment describes continued and expanding use (data, collections, and risk business units progressively adopting Devin). Internal Cognition usage of Devin (659 PRs/week merged in Feb 2026) suggests strong product-led retention. The Cognizant and Mercedes-Benz partnerships represent structural switching costs once integrated into enterprise workflows. However, the contract length, renewal rates, and post-deployment satisfaction at named enterprise customers are not publicly known.
| Metric | Value | Segment | Confidence | Diligence Ask |
|---|---|---|---|---|
| NRR (Net Revenue Retention) | Not disclosed | All segments | Unknown | Request from Cognition: NRR by cohort and ACV band |
| GRR (Gross Revenue Retention) | Not disclosed | All segments | Unknown | Request from Cognition: annual contract renewal rate |
| Churn rate | Not disclosed | All segments | Unknown | Request: monthly and annual subscriber churn by plan tier |
| PR merge rate | 67% (Apr 2025) | All Devin users | Medium (company-claimed) | Verify with independent PR audit or customer interviews |
| Developer satisfaction (HN) | Mixed—some enthusiastic, some disappointed | Developer community | Low | Commission independent satisfaction survey of enterprise users |
| Nubank expansion depth | 3+ business units using Devin | Fintech enterprise | High (case study) | Request: duration of Nubank engagement, current ACU spend |
| Cognition internal Devin usage | 659 PRs/week (Feb 2026) | AI SaaS (internal) | High (blog) | Cross-check with Git stats; confirm PR type distribution |
| Average task success rate | ~67% PR acceptance (proxy) | General | Low | Disaggregate by task type, language, and complexity band |
Retention metrics are not publicly disclosed. PR merge rate is the only available public quality proxy, and it is company-reported without denominator methodology. All retention figures require direct diligence requests.
[CU013, CU014, CU015, CU016]6.5 Expansion and Concentration Risk
Cognition AI's land-and-expand strategy is evident: the Nubank deployment started with one business unit's ETL migration, then expanded to the Data, Collections, and Risk units. The Cognizant partnership implies a similar structure—Cognizant begins with internal deployment and then upsells to its global client base. The Free and Pro plan tiers are designed for developer-led adoption: individual developers experiment at low cost, drive internal adoption, and create bottom-up enterprise procurement pressure. The Windsurf IDE's 250,000+ daily active users provide a large organic install base from which to convert to paid Devin usage. Concentration risks are significant and not publicly disclosed. With ARR of ~$73M and a small set of confirmed named accounts, there is a material risk that a handful of large enterprise contracts (Nubank-scale, Mercedes-Benz-scale) constitute a dominant share of revenue. If the Cognizant channel partnership is performing well, it may itself represent disproportionate revenue concentration through a single reseller rather than diversified direct customer relationships. The COBOL modernization vertical targets high-value projects (multi-year legacy systems) that are structurally one-time engagements unless renegotiated as long-term support contracts, creating potential revenue cliff risk after initial migrations complete. Procurement friction at regulated industries is high. Financial services, healthcare, and government customers require security certifications (SOC 2, FedRAMP) and data residency controls that Cognition has not publicly disclosed meeting. VPC deployment mitigates the data residency concern, but the absence of published compliance attestations limits the addressable market in the most regulated industry verticals. The Government vertical launch in February 2026 suggests Cognition is actively addressing FedRAMP-equivalent certification requirements, but timelines and current status are not public. This compliance gap represents the primary enterprise procurement barrier for Cognition's target large-enterprise segment.
| Driver / Risk | Type | Description | Impact | Diligence Path |
|---|---|---|---|---|
| Land-and-expand (use case depth) | Expansion driver | Nubank started with 1 business unit, expanded to 3+; Cognizant starts internal then sells to clients | High | Map expansion timeline and ACU growth per account |
| Windsurf to Devin conversion | Expansion driver | 250K+ daily Windsurf users as conversion funnel for cloud agent upsell | Medium | Track conversion rate from Windsurf Free to paid Devin sessions |
| COBOL modernization (project-based) | Concentration risk | High-value one-time migrations; no structural recurring need after completion | High | Assess whether COBOL accounts contract for ongoing support or leave post-migration |
| Top-customer concentration (unknown) | Concentration risk | With ~$73M ARR and few named accounts, one Nubank-scale account ≈ material share | High | Request top-10 customer concentration; ask for top-customer revenue share |
| Cognizant channel dependency | Channel risk | Single reseller could dominate new enterprise account acquisition | Medium | Understand revenue share terms; assess lock-in vs. direct sales competitive dynamics |
| Compliance gap (SOC 2, FedRAMP) | Procurement friction | Regulated industries cannot procure without certifications | High | Confirm SOC 2 Type II and FedRAMP In Process status; understand timeline |
| Developer fatigue / ACU cost sensitivity | Churn risk | Pro/Core plans costly in practice ($100-200+/month for moderate use); developers may downgrade | Medium | Request Pro-to-Free downgrade rate and ACU utilization distribution |
Top-customer concentration is a critical diligence gap; no public data is available. COBOL project-based nature creates revenue cliff risk that requires probing.
[CU017, CU018, CU019, CU020]07Risks
7.1 Regulatory and Legal Risk Landscape
Cognition operates in a rapidly evolving regulatory environment. Three major frameworks create material near-term obligations. The EU AI Act (Regulation 2024/1689), effective August 2026 for GPAI model providers, imposes transparency documentation, copyright compliance policies, and EU AI Office registration. The UK GDPR—as administered by the ICO—applies to all data processing of UK/EU residents and requires Data Protection Impact Assessments for high-risk AI processing, which Devin's autonomous PR-merge and code-deployment capabilities likely trigger under Article 35. The January 2025 White House Executive Order revoked prior Biden-era AI safety obligations, creating a deregulatory US federal environment, but successor agency-level rules (particularly for government AI procurement) remain active. California SB 1047, which would have imposed frontier model accountability requirements, was vetoed in September 2024; successor legislation (AB 2013, SB 1235) is active through the 2025–2026 session. Separately, Doe v. GitHub, Microsoft, and OpenAI copyright litigation—consolidated in the Northern District of California—alleges AI code model training on open-source repositories constitutes copyright infringement; Cognition has not disclosed its model training data provenance, creating analogous latent IP exposure. Cognition's terms of service restrict output from being used to train competing models, but the upstream model training data copyright status is not independently verifiable. Compliance monitoring, cross-jurisdictional DPA coverage, and GPAI transparency documentation are each materially underfunded relative to Cognition's scale and enterprise customer base across the EU, Japan, and Singapore. Absent a dedicated regulatory affairs function, Cognition risks enforcement action, GPAI non-registration penalties, or restricted EU market access by Q3 2026. A proactive regulatory program should be treated as a business-critical investment at this stage of international expansion, not an optional legal overhead item.
| Rule / Case | Jurisdiction | Status | Likelihood | Severity | Mitigation | Residual Exposure | Diligence Path |
|---|---|---|---|---|---|---|---|
| EU AI Act GPAI provisions (Reg. 2024/1689) | European Union | In force Aug 2026 | Medium | High | Publish GPAI transparency doc; register with EU AI Office | Medium | Verify GPAI registration; obtain AI transparency attestation |
| GDPR / UK GDPR (AI data processing) | EU / UK | Active | Medium | High | DPA agreements; no training on Enterprise data (company-claimed) | Medium | Audit DPA coverage; verify ICO DPIA compliance for agentic processing |
| Doe v. GitHub / Microsoft / OpenAI (AI copyright) | US N.D. Cal. | Ongoing consolidated litigation | Medium | Medium | ToS restricts output for competing model training; training data provenance undisclosed | Medium | Obtain legal opinion on Devin model training data provenance and OSS license exposure |
| CA SB 1047 successor bills (AB 2013) | California, USA | Monitoring 2025–2026 | Low | Medium | No current binding obligation; SB 1047 vetoed Sep 2024 | Low | Monitor quarterly; reassess if AB 2013 passes committee |
| US Federal AI EO — Jan 2025 (deregulatory) | United States | Active | Low | Low | Benefits from deregulatory stance; watch agency AI procurement rules | Low | Track OSTP AI Action Plan (due Jul 2025); monitor FedRAMP requirements for government deployments |
Likelihood and severity ratings are qualitative diligence assessments. Residual exposure after mitigation reflects analyst judgment, not Cognition disclosures.
[CR001, CR002, CR003, CR004, CR005]7.2 Security and Technical Risk
Cognition's core product risk is that Devin operates with elevated system privileges— writing, executing, testing, and deploying code autonomously—which dramatically expands the attack surface compared to passive coding assistants. In December 2024, a live-streamed demonstration publicly exposed a prompt-injection vulnerability: an adversary could embed malicious instructions in a repository's README file, causing Devin to exfiltrate secrets, make unauthorized API calls, or plant backdoors in customer codebases. Cognition acknowledged and patched the issue. However, the incident revealed that prompt-injection defenses—top of the OWASP Top 10 for LLM Applications (LLM01)—were inadequately hardened prior to GA launch. Cognition's security page confirms SOC 2 Type II certification (audited March 2024), data encrypted in transit and at rest, and MFA for all employees. The Trust Center requires NDA execution before disclosing penetration test reports, audit scope, and third-party assessment results—limiting enterprise procurement teams' independent assessment of control effectiveness. Key unresolved gaps include: no published bug-bounty program; no formal Agent Security Framework; no public penetration test scope or red-team disclosure; and no published SLA or historical uptime data. Devin 2.0 (April 2025) expanded capabilities to merge PRs directly and schedule agents on customer infrastructure, materially increasing the blast radius of any future prompt-injection exploit. The OWASP GenAI Security Project identifies agent autonomy as an emerging, distinct risk category beyond classical LLM vulnerabilities. As enterprise adoption grows to VPC deployments at hundreds of companies, an undetected Devin exploit could simultaneously contaminate multiple enterprise codebases—a systemic supply-chain tail risk that individual customer security teams cannot independently mitigate.
| Failure Mode | Likelihood | Severity | Mitigation Maturity | Residual Exposure | Unresolved Gap |
|---|---|---|---|---|---|
| Prompt-injection / agent manipulation (OWASP LLM01) | High | Critical | Partial — patched Dec 2024; no public bug-bounty or agentic security framework | High | No published bug-bounty; no independent red-team disclosure; no post-Devin-2.0 pen test summary |
| Supply-chain code contamination via autonomous PR merge | Medium | Critical | Partial — SOC 2 Type II; code review recommended but not enforced | High | No published penetration test scope; trust center details NDA-gated; blast radius expanded by Devin 2.0 |
| SOC 2 Type II scope gap — no public report access | Medium | High | Partial — SOC 2 Type II obtained Mar 2024; public report requires NDA | Medium | Obtain NDA and review SOC 2 report scope, control descriptions, and exceptions |
| AI hallucination / insecure code generation in production | High | High | Partial — code review and branch protection recommended; not enforced by default | Medium | No independent study of Devin-generated code vulnerability rate versus human-written code |
| Platform outage / ACU quota exhaustion | Low | Medium | Partial — AWS-hosted; no published SLA or uptime history disclosed | Low | Request SLA and historical uptime data; obtain enterprise credit terms for outages |
Failure modes ordered by severity. Mitigation maturity reflects publicly available evidence; undisclosed internal controls may be more advanced.
[CR006, CR007, CR008, CR009, CR010]7.3 Competitive and Market Risk
Cognition's most acute competitive risk is the rapid commoditization of agentic coding benchmarks that historically defined its leadership. At Devin's March 2024 launch, its SWE-bench Full score of 13.86% was groundbreaking. By mid-2025, Claude Code Opus 4 achieved 72.5% on SWE-bench Verified—a five-fold improvement in roughly 15 months. OpenAI Codex and GPT-5 achieve comparable Verified scores. This rate of improvement threatens Devin's technical differentiation: its multi-step planning, execution, and testing loop may be replicated by foundation model providers within 12–18 months. GitHub Copilot (250M+ installs, Microsoft distribution), Cursor ($500M+ ARR, $9B valuation, $40/month pricing), and Claude Code (Anthropic) compete for the same enterprise developer productivity budget as Devin ($500/month Team plan). The FTC has raised concerns that concentrated control of foundational AI inputs—including the models Devin depends on—could allow those providers to distort competition in downstream AI application markets, a structural risk for Cognition. Cognition's acquisition of Windsurf (July 2025) added 350+ enterprise accounts and 250K+ DAU, but also created integration risk: two separate AI agent codebases, model pipelines, billing systems, and sales motions must be merged simultaneously while maintaining ARR growth. Developer community skepticism at Devin's launch—focusing on benchmark reliability, closed demos, and misleading superiority claims—created an adverse reputation dynamic that increased adoption friction among technically sophisticated buyers and remains a headwind as enterprise evaluation rigor grows. Pricing compression is evident: Devin 2.0 reduced effective prices by approximately 3×; future competitive pressure may require further price reductions that compress revenue per session.
| Dependency | Counterparty | Role | Concentration | Failure Scenario | Severity | Residual Exposure |
|---|---|---|---|---|---|---|
| Foundation LLM provider (primary) | Anthropic / OpenAI | Core model intelligence for all Devin sessions | Very High | Provider raises API pricing 3–5× or restricts agentic use-case terms | Critical | High — no disclosed multi-model fallback or proprietary model timeline |
| Cloud infrastructure | Amazon Web Services | VPC compute; enterprise deployment | High | AWS terms change for AI agents; pricing increase materially | High | Medium — no public multi-cloud or on-premises fallback path |
| Developer workflow integrations | GitHub (Microsoft) | PR review; code diff; CI integration; Copilot distribution | High | Microsoft restricts GitHub API access for AI competitors or modifies Copilot integration policy | High | Medium — API-level integration; no exclusive arrangement |
| AI IDE platform (Windsurf, acquired Jul 2025) | Internal (Cognition) | Enterprise IDE distribution; 350+ enterprise accounts; 250K+ DAU | High | Integration failure; talent attrition post-acquisition; product cannibalization of Devin sessions | Medium | Medium — controlled integration; leadership retained at acquisition |
All counterparties listed have some degree of conflicting competitive interests. Dependency severity rated by revenue impact of failure or material price increase.
[CR011, CR012, CR013, CR014]7.4 Financial and Operational Risk
Cognition has raised approximately $1.575B in disclosed funding through mid-2026. With 222 employees (102% annual headcount growth, per Growjo) and a compute- intensive product, estimated monthly burn is $5–15M, implying 18–36 months of runway under conservative scenarios. Revenue growth from ~$1M ARR at GA (December 2024) to ~$73M ARR (April 2025 estimate) is exceptional but introduces customer concentration risk: the $73M ARR figure across a handful of named enterprise accounts suggests a small number of very large contracts that create material churn risk at renewal. Nubank accounts for the only disclosed quantified production case study. If Nubank or a comparably large customer churns—due to a security incident, budget reallocation, or competitive switch—the ARR decline could impair future fundraising at an assumed $10.2B valuation. ACU pricing models ($2.25 per ACU overage) create unpredictable usage-based revenue and potential bill-shock churn for customers who underestimate consumption. Margin compression from LLM inference costs—third-party API calls at commercial Anthropic/OpenAI rates—limits gross margin expansion until Cognition builds proprietary models or achieves volume pricing. No path to profitability, EBITDA, or gross margin targets has been publicly disclosed. Windsurf integration introduces additional operational complexity: merging billing systems, enterprise contracts, and product teams simultaneously increases execution risk and may slow the strategic roadmap. The January 2025 Series A at $2B valuation and post-Windsurf round imply a $10.2B Growjo valuation estimate; any material miss on ARR growth trajectory, a churn event, or a down round from a next capital raise could impair team motivation and trigger key employee departures. Financial risk is elevated by the company's refusal to disclose gross margins, NRR, or cohort retention metrics.
| Role / Function | Dependency or Gap | Likelihood | Severity | Mitigation | Diligence Path |
|---|---|---|---|---|---|
| CEO / Co-founder (Scott Wu) | Single point of product vision; investor credibility; technical leadership; competitive programmer identity | Low | Critical | Co-founder team; board oversight; no public succession plan | Request governance structure; confirm board independence; review co-founder vesting cliff schedule |
| AI research / ML engineering talent | Extreme competition from Anthropic, OpenAI, Google DeepMind, and Microsoft for the same competitive programming and ML talent pool | Medium | High | Competitive compensation (est. $500K+ TC); strong Cognition brand in AI community | Request headcount breakdown by function; review senior ML engineer tenure and equity refresh policy |
| International legal / regulatory operations | Expanding to EU, Japan, Singapore, and US Government with no disclosed dedicated legal infrastructure; FedRAMP not obtained | High | Medium | Nascent; likely ad hoc outside counsel for each jurisdiction | Verify EU DPO appointment; request Japan PIPL and Singapore PDPA compliance roadmap; confirm FedRAMP timeline |
Risk ratings based on publicly disclosed organizational structure. Internal succession plans, bench depth, and retention programs not publicly disclosed.
[CR015, CR016, CR017]7.5 Execution, People, and Partner Risk
Cognition's three co-founders—Scott Wu (CEO), Steven Hao, and Walden Yan—are competitive programming champions with exceptional technical credentials but no prior track record building enterprise software companies beyond early-stage startups. Key-person risk is concentrated in Scott Wu, who drives technical vision, investor relationships, and product strategy. The company's AI talent pipeline is stressed by intense competition from Anthropic, OpenAI, Google DeepMind, and Microsoft—all operating frontier AI programs at compensation packages exceeding $500K total compensation, directly competing for the same competitive programming and ML engineering talent pool that Cognition relies on. Devin's platform depends critically on third-party partners: AWS (VPC compute), GitHub (PR workflow integrations), and Slack/Teams (enterprise communication). Any of these platforms modifying API access, pricing, or integration policy could disrupt Devin's enterprise value proposition. Anthropic and OpenAI present a dual risk as both supplier and competitor: they supply the underlying models Devin orchestrates, but compete via Claude Code and OpenAI Codex respectively, creating conflict-of-interest incentives. Geographic expansion to Japan, Singapore, and the EU—announced in Q1–Q2 2026—introduces jurisdiction-specific compliance obligations that require legal and operations hiring ahead of revenue materialization. US Government customers require FedRAMP or equivalent authorization, typically a 12–24 month process; the gap between announced government vertical and actual FedRAMP authorization creates a window of reputational and contractual risk. International legal infrastructure—EU DPO appointment, Japan PIPL compliance, Singapore PDPA— represents an emerging gap for a company that has prioritized product velocity over regulatory scaffolding.
| Risk | Monitorable Trigger | Kill-Criteria Threshold | Action Implication |
|---|---|---|---|
| Prompt-injection / supply-chain breach | Disclosed CVE; customer codebase contamination; second security incident | Any confirmed supply-chain contamination of production customer codebase OR second public security incident within 12 months | Immediate thesis review; evaluate churn impact; assess remediation credibility; consider position exit |
| Competitor commoditization of agentic coding | Monthly SWE-bench leaderboard; Cursor, GitHub Copilot, Claude Code pricing announcements | Competitor at ≥80% SWE-bench Verified at ≤50% Devin pricing OR Devin YoY ARR growth <50% | Reduce position; reassess moat thesis beyond benchmark scores; evaluate Windsurf-led differentiation |
| Customer concentration / Nubank churn | NRR disclosed at renewal; top-customer ARR share; pipeline diversification evidence | Single customer >30% ARR OR NRR <90% in any trailing quarter | Immediate ARR quality review; model churn scenarios; request pipeline diversification evidence |
| LLM provider pricing shock or API restriction | Anthropic / OpenAI pricing announcements; API terms updates for agentic use cases | LLM API cost increases >3× without corresponding Devin price increase OR API terms restrict autonomous PR merges | Model margin compression exposure; evaluate proprietary model timeline and cost pass-through feasibility |
Kill criteria are thesis-break triggers. Thresholds reflect current consensus; reassess at each monitoring cycle.
[CR018, CR019, CR020, CR021]08Valuation
8.1 Investment Thesis and Anti-Thesis
The bull thesis for Cognition AI rests on three mutually reinforcing propositions. First, autonomous software engineering represents a genuine market category shift— not an incremental productivity improvement. The total addressable market for software development labor exceeds $650B globally; even a 10% displacement of developer time through autonomous agents implies a $65B+ revenue opportunity. Cognition's Devin product is the category pioneer, establishing the autonomous coding agent as a procurement line item at leading enterprises before most buyers understand the category. First-mover brand recognition, lighthouse customer proof (Nubank), and the Windsurf IDE network (250K+ DAU) create a platform moat that sustains pricing power beyond pure benchmark performance. Second, the ARR trajectory from $1M (GA, December 2024) to an estimated $73M (April 2025) in nine months implies a growth rate that is structurally disconnected from historical SaaS benchmarks. At this growth rate, the company could plausibly reach $500M ARR within 18 months of the April 2025 data point, which would place it in the cohort of fastest-growing enterprise SaaS companies in history and justify a $10B+ valuation on forward ARR multiple compression. Third, the co-founders' deep technical backgrounds and Founders Fund's conviction investment create a signal that the founding team can sustain the product velocity needed to maintain differentiation. The anti-thesis is equally forceful. Benchmark commoditization is the existential risk: Claude Code Opus 4's 72.5% SWE-bench Verified score in 2025 versus Devin's 13.86% Full score at launch demonstrates that the performance gap is closing at a rate that may eliminate Cognition's technical differentiation by late 2026. The $10.2B valuation at ~140× estimated trailing ARR prices in near-flawless execution on a trajectory that has no published corroboration beyond Growjo estimates. Customer concentration risk is unquantified: $73M ARR from an undisclosed number of enterprise accounts could mean three to five contracts, where a single defection triggers a material ARR decline. The December 2024 prompt-injection security incident—discovered during a live stream— damaged Cognition's credibility among technically sophisticated buyers at a critical early-adoption inflection point. LLM provider dependency on Anthropic and OpenAI, both of whom compete directly with Devin, introduces a structural conflict-of-interest that could manifest as sudden API pricing increases or feature restrictions that raise Devin's marginal cost of delivery. Without NRR, gross margin, or cohort retention data, the financial quality of the reported ARR cannot be independently assessed.
| Dimension | Assessment | Evidence Base | Confidence | Implication |
|---|---|---|---|---|
| Recommendation | Watchlist — conditional invest below $5B entry | ARR growth exceptional; valuation multiple extreme at 140× ARR; security/concentration risks unresolved | Medium | Monitor NRR, margin, and integration progress; entry discipline required |
| Risk rating | High | Security incident history; benchmark commoditization; LLM dependency; customer concentration unknown | High | High risk of capital impairment at $10.2B entry if execution falters |
| Valuation stance | Speculative premium — current $10.2B at ~140× ARR is 7× Cursor multiple | Cursor $9.9B / $500M ARR = 20× ARR; Cognition $10.2B / $73M ARR = 140× ARR | Medium | Fair value range $3–7B on base-case assumptions; entry above $7B requires bull case proof points |
| Bull case target | $18–25B exit (3-year hold, $700M+ ARR, 25–35× forward ARR) | Growth rate sustainability; Windsurf integration success; NRR ≥120% | Low-Medium | 1.5–2.5× return from $10.2B entry; adequate for venture IRR only with near-flawless execution |
| Bear case target | $800M–$1.8B exit (second security incident OR major churn OR >3× competitor pricing undercut) | Security overhang; benchmark commoditization; LLM pricing risk | Medium | 90%+ capital loss at $10.2B entry; unacceptable asymmetry without higher-confidence NRR data |
Recommendation is for institutional investors evaluating a secondary or primary position in Cognition at current valuation. This is not investment advice.
[CV001, CV002, CV003]| Diligence Request | Priority | Why Critical | Acceptable Threshold |
|---|---|---|---|
| Net Revenue Retention by cohort (quarterly, trailing 4Q) | Critical | ARR quality; determines if growth is expansion- or churn-masked | NRR ≥110% confirms health; <90% is thesis-break |
| Gross margin by product line (Devin ACU, Windsurf, Enterprise) | Critical | LLM cost structure; path to profitability; margin compression risk | Target 50%+ gross margin; <30% raises sustainability questions at scale |
| Top-5 customer ARR and share of total ARR | Critical | Concentration risk; single churn event impact | Top-5 <40% total ARR; any individual >15% ARR is high concentration |
| Post-Devin-2.0 penetration test report or red-team summary (post-December 2024 patch) | High | Security credibility; expanded agentic capabilities require re-audit | NDA access to SOC 2 Type II + at least 1 post-Devin-2.0 independent pen test |
| Windsurf integration customer retention rate (6 months post-acquisition) | High | Integration execution proof; IDE customer attrition risk | Windsurf enterprise account retention ≥85% at 6 months post-acquisition |
| LLM API contract terms (Anthropic, OpenAI): volume pricing, agentic use-case clauses, change-of-control | High | Supplier conflict-of-interest; margin floor determination | Locked-in pricing through 2027 OR multi-model diversification strategy in place |
| EU GPAI compliance plan and regulatory affairs staffing | Medium | August 2026 compliance deadline; EU revenue at risk if non-compliant | Named EU regulatory affairs hire; GPAI transparency doc timeline <6 months |
| FedRAMP authorization status and timeline for US Government vertical | Medium | Government ARR pipeline; credibility of government vertical launch (Feb 2026) | Clear FedRAMP authorization timeline; interim use-case confirmation from named agency |
Priority levels: Critical = must-have before investment decision; High = must-have before term sheet; Medium = must-have before close.
[CV023, CV024, CV025, CV026, CV027, CV028]8.2 Valuation Context and Financing History
Cognition has completed three disclosed financing rounds totaling approximately $1.575B. The $21M seed round established the founding team in 2023. The $175M Series A (January to April 2025, Founders Fund IX lead) set the initial institutional valuation at $2B post-money. SEC EDGAR records confirm Founders Fund IX (CIK 0001971631) as a large institutional vehicle that committed capital to Cognition; the Cognition Capital SPV I (CIK 0002072175, Form D filed 2025-06-11) indicates a follow-on structured vehicle was established concurrent with or shortly after the Windsurf acquisition close. The $400M round (September 2025) valued the company at $10.2B post-money, a 5.1× step-up from the Series A in approximately nine months—one of the steepest step-up trajectories in AI software history. The total financing of ~$1.575B implies approximately 15%+ dilution on a $10.2B post-money basis, though exact preference stack and liquidation waterfall are not disclosed. At the $10.2B valuation, the primary revenue multiple is approximately 140× the April 2025 ~$73M ARR estimate—a premium to all disclosed AI software peers. Cursor/Anysphere closed its Series D at $9.9B on approximately $500M ARR (September 2025), implying a 20× ARR multiple. Cognition's premium is partially justified by its superior growth rate and category-pioneer positioning but represents execution risk priced in for investors entering at $10B+. The Windsurf acquisition (July 2025) was completed via Cognition equity and cash for an undisclosed purchase price; the Cognition Capital SPV I suggests a structured investor-level vehicle, possibly used to fund the acquisition consideration or backstop the combined entity's operating plan. A "culture reset" reported by Winbuzzer in August 2025—Cognition offering nine-month buyouts to Windsurf staff and reportedly requiring 80-hour workweeks—introduces talent integration risk not reflected in the headline valuation. Any new institutional investor at $10.2B would face a meaningful preference overhang from the seed, Series A, and $400M round before reaching par on any return calculation below a $10.2B exit value.
| Thesis Pillar | Supporting Evidence | Countervailing Evidence | Conviction |
|---|---|---|---|
| Autonomous engineering as a category creation event | ARR growth from $1M to $73M in 9 months; Nubank 12× efficiency; Cognition uses Devin internally (659 PRs/week) | Benchmark commoditization accelerating; peers may replicate orchestration layer without needing Cognition's tooling | Medium |
| Network moat from Windsurf IDE + Devin cloud agent integration | Windsurf 250K+ DAU and 350+ enterprise accounts added at acquisition (Jul 2025) | Integration execution risk; culture reset reported Aug 2025; two codebases to merge simultaneously | Low-Medium |
| Founders Fund IX conviction as quality signal | Founders Fund IX (CIK 0001971631) as lead Series A investor; Peter Thiel-associated track record | Founders Fund portfolio concentration; past thesis misses have occurred; not a guarantee of outcome | Medium |
| Price reduction as volume expansion strategy | Devin 2.0 reduced effective price 3×; 83% more tasks per ACU; implies cost curve dropping | Gross margin unknown; 3× price cut suggests either cost structure improvement OR competitive response; either reduces total revenue headroom | Low |
Conviction rating is analyst judgment based on publicly available evidence. Internal financial metrics (NRR, gross margin) are not available to modify these assessments.
[CV004, CV005, CV006, CV007]8.3 Bull, Base, and Bear Scenario Analysis
The bull scenario assumes Devin's ARR continues at a sustained 50% quarterly growth rate through 2027, reaching approximately $700M–$800M ARR by end of 2027. At this scale, with improving gross margins from model cost optimization and volume contracts, Cognition would likely command an IPO or acquisition valuation of $15–25B—a 1.5–2.5× return from $10.2B entry. The bull case requires: no material security incidents post- December 2024; successful Windsurf integration adding a credible IDE channel; NRR disclosed at ≥120%; and competitor models failing to commoditize at Devin's enterprise workflow integration depth. The base scenario assumes moderate ARR growth (30–40% quarterly decay from the 2025 sprint) reaching $200–350M ARR by end of 2027, with gross margin improvement to 50–60%. At a normalized 25–35× forward ARR multiple (consistent with high-growth AI SaaS), this implies a $5–12B valuation—at or near flat to entry at $10.2B, suggesting negligible return for investors at that entry. The base case requires no major churn events, reasonable Windsurf integration, and sustained AI infrastructure investment by enterprises. The bear scenario assumes a material adverse event within 18 months: a second security incident, Nubank churn, or a Devin price war triggered by Claude Code or OpenAI Codex commoditization. Under the bear case, ARR plateaus at $80–120M, NRR is disclosed below 90%, and the valuation compresses to 10–15× ARR implying an $800M–$1.8B exit value—a 90%+ capital loss for investors entering at $10.2B. The bear trigger probability is estimated at 20–30% given the concentration of known risk factors, particularly the security overhang and the structural benchmark commoditization trend. Return expectations are asymmetric: the bull upside (1.5–2.5× in 3 years) does not adequately compensate for a 30%+ probability of severe capital loss at a $10.2B entry valuation. This is the core investment thesis challenge: the risk-adjusted return at current entry is unattractive unless the investor has high-conviction, differentiated information about NRR and gross margin that the market does not.
| Scenario | ARR (End 2027 est.) | ARR Multiple | Implied Valuation | Probability Signal | Key Assumption |
|---|---|---|---|---|---|
| Bull | $700–800M | 25–35× forward ARR | $18–28B | 20–25% (requires near-perfect execution) | No major security incidents; NRR ≥120%; Windsurf integration successful; LLM pricing stable |
| Base | $200–350M | 20–25× forward ARR | $4–9B | 45–55% (moderate-growth execution) | Moderate churn; Windsurf partial integration; 30–40% quarterly ARR growth rate decay |
| Bear | $80–120M | 8–12× trailing ARR | $640M–$1.4B | 25–30% (materially negative event) | Second security incident OR major customer churn OR competitor price undercut >3× OR regulatory enforcement |
Scenario ranges are analyst estimates based on comparable company benchmarks and disclosed growth metrics. Probabilities are qualitative signals, not quantitative models.
[CV008, CV009, CV010]8.4 Comparable Company Valuation Analysis
Cognition's peer group for comparables purposes includes AI-native developer productivity tools (Cursor, Replit), foundation model providers with coding products (Anthropic/Claude Code, OpenAI/Codex), and broader enterprise AI SaaS companies at comparable ARR run-rates. Cursor/Anysphere is the closest comparator: both are AI coding tools targeting enterprise developers, both are VC-backed, and both reached their most recent valuations in the second half of 2025. Cursor closed its Series D at $9.9B (Bloomberg, November 2025) on approximately $500M ARR—an 18–20× ARR multiple. Cognition at $10.2B on ~$73M ARR implies a ~140× ARR multiple, a 7–8× premium to Cursor on ARR-basis. The Cognition premium is partially justified by higher growth rate (Devin grew faster from launch) but is structurally unsustainable as Cursor's ARR ($500M) already demonstrates greater revenue quality and breadth. Harvey AI (legal AI, $3B valuation, ~$100M ARR, ~30× ARR) represents a vertical AI SaaS comparator where high-quality NRR justifies premium multiples. Glean (enterprise AI search, $4.6B, ~$100M ARR, ~46× ARR) demonstrates that enterprise AI SaaS can sustain 40–50× ARR multiples at $100M ARR, but NRR at Glean is reported above 150%. Replit ($1.16B valuation, 2023, development platform with AI pivot) represents the category risk: a pioneer pivot story that lost market share to more focused AI-native competitors. The implied IPO entry multiple for AI SaaS companies at or near profitability in 2025 is approximately 15–25× forward ARR; Cognition at $10.2B and ~$73M ARR would need to reach $400–680M ARR before an IPO at standard multiples would preserve the $10.2B entry valuation. No public market comparator exists: GitHub Copilot is embedded in Microsoft Azure revenue, Amazon Q in AWS, and Claude Code in Anthropic's API revenue. Private market premium over public comps is historically 20–40% for growth-stage AI; this suggests a fair-value range for Cognition (on base-case assumptions) of $3–7B, below the $10.2B last round, implying the current round was priced at a 1.5–3× speculative premium to intrinsic value. Entry discipline—waiting for either ARR confirmation at $300M+ or valuation correction to $4–6B—is the appropriate investor response.
| Company | Valuation | ARR / Revenue | ARR Multiple | Stage | Comparability to Cognition |
|---|---|---|---|---|---|
| Cursor (Anysphere) | $9.9B (Nov 2025 Series D) | ~$500M ARR (2025 est.) | ~20× ARR | Late private, pre-IPO | Closest comp: AI coding tool, enterprise, developer-productivity focus; 7× lower ARR multiple |
| Glean | $4.6B (2024) | ~$100M ARR (est.) | ~46× ARR | Late private | Enterprise AI search; high NRR (≥150%) justifies premium; Cognition NRR unknown |
| Harvey AI | $3B (2025) | ~$100M ARR (est.) | ~30× ARR | Growth stage | Vertical AI SaaS; legal focus; strong enterprise NRR; narrower TAM than Cognition |
| Cohere | $5.5B (2024) | ~$100M ARR (est.) | ~55× ARR | Late private | Foundation model provider; infrastructure-layer; less exposed to benchmark commoditization |
| Replit | $1.16B (2023) | Sub-$50M ARR (est.) | ~25× ARR | Growth stage | Category risk precedent: developer tool that lost share to AI-native entrants; watch for pattern |
| GitHub Copilot | N/A (embedded in Microsoft) | ~$200M+ ARR (est.) | N/A | Public (Microsoft) | Strongest distribution moat (250M+ GitHub users); pricing $10–$39/user/month; commoditizes Devin's $500/month price point |
| Amazon Q Developer | N/A (embedded in AWS) | ~$50M ARR (est.) | N/A | Public (Amazon) | AWS bundled pricing; relevant for Devin's enterprise cloud deployment competition |
All non-public valuations are based on disclosed or reported funding rounds. Revenue multiples for private companies are analyst estimates; actual metrics not confirmed by companies.
[CV011, CV012, CV013, CV014, CV015, CV016]8.5 Exit Readiness and Final Diligence Asks
Cognition is 2–4 years from an IPO readiness event given current ARR trajectory and the disclosure gap that public markets require. For IPO at $8B+ market capitalization, the company needs approximately $400–500M ARR, positive unit economics (40%+ gross margin), disclosed NRR above 120%, and a cleared SOC 2 Type II + GDPR compliance infrastructure. M&A exit paths exist: Microsoft (GitHub Copilot/Azure positioning), Alphabet (Google Cloud developer tools), Salesforce (developer productivity within Einstein platform), or Snowflake/Databricks (data engineering automation) are logical acquirers. The $10.2B valuation makes a strategic acquirer path challenging: Microsoft's prior GitHub acquisition at $7.5B (2018) was at scale, and a $10.2B bid for Cognition would be among the largest Microsoft AI acquisitions—requiring board-level conviction. The most likely exit path remains late-stage private equity / pre-IPO crossover participation, with an IPO window in 2028–2030 if ARR trajectory sustains. Final diligence priorities before investment: (1) NRR by cohort—disclosed above 110% is a green flag; below 90% is thesis-break; (2) gross margin at scale—target 50%+ gross; (3) penetration test report from post- Devin-2.0 audit; (4) Windsurf integration roadmap with customer retention data from the acquired base; (5) LLM provider contract terms including agentic use-case pricing floors; (6) GPAI compliance plan and EU regulatory timeline; (7) top-5 customer ARR concentration (>50% from top-5 is a material concentration risk); and (8) headcount by function to assess burn relative to ARR coverage ratio. Thesis-break conditions: a second disclosed security incident within 12 months, NRR disclosed below 90%, ARR growth rate deceleration to <50% YoY, or a confirmed major enterprise customer churn event.
| Trigger Event | Threshold | Monitoring Indicator | Action Implication |
|---|---|---|---|
| Second security incident or supply-chain breach | Any confirmed second Devin security incident within 18 months of December 2024 patch | CVE disclosures; enterprise security advisories; HN/security community | Immediate thesis review; evaluate credibility of remediation; likely exit |
| Major customer churn | Top customer ARR decline >25% YoY OR any individual customer ARR >$10M lost | ARR disclosure at funding; press announcements; customer reference interviews | Reduce position; re-model ARR trajectory with churn scenario |
| Competitor price undercut >3× Devin equivalent at comparable capability | Competitor at SWE-bench ≥80% Verified at ≤$160/month (Devin equivalent) | SWE-bench leaderboard; competitor pricing pages monthly | Structural differentiation review; accelerate Windsurf moat assessment |
| NRR disclosed below 90% | NRR < 90% at any disclosed or estimated cohort measurement | Financial disclosure at next funding; data room | Re-model ARR growth; high churn at scale is thesis-break for $10B+ entry |
| EU regulatory enforcement action for GPAI non-compliance | EU AI Office initiates formal investigation or issues compliance order against Cognition | EU AI Office announcements; DPC press releases; Reuters/FT regulatory coverage | Assess EU revenue at risk; model enforcement cost; could trigger broader enterprise procurement pause |
Kill criteria are not legal thresholds; they are investment risk management indicators for monitoring committees.
[CV018, CV019, CV020, CV021, CV022]Disclaimer
This report is a public-evidence diligence snapshot, not investment advice. Important financial, legal, technical, and contractual facts remain non-public and should be verified directly with management and primary documents before any investment decision.
Evidence index
| ID | Statement | Confidence | Sources |
|---|---|---|---|
| CO001 | Cognition AI was founded in November 2023 in San Francisco, California. | High | SO002, SO005, SO010 |
| CO002 | Cognition AI is headquartered in San Francisco, California. | High | SO001, SO005 |
| CO003 | Cognition AI's flagship product, Devin, is an autonomous AI software engineer capable of interpreting tickets, planning, writing code, debugging, testing, and deploying software with minimal human oversight. | High | SO002, SO025 |
| CO004 | Scott Wu is the co-founder and CEO of Cognition AI. | High | SO001, SO005, SO010, SO011 |
| CO005 | Steven Hao is the co-founder and CTO of Cognition AI. | High | SO001, SO005, SO010 |
| CO006 | Walden Yan is the co-founder and CPO (Chief Product Officer) of Cognition AI. | High | SO001, SO005, SO010 |
| CO007 | Scott Wu is a three-time IOI (International Olympiad in Informatics) gold medalist. | Medium | SO011, SO010 |
| CO008 | Cognition AI raised $175M at a $2B valuation in April 2024, led by Founders Fund. | High | SO006, SO007, SO008 |
| CO009 | Cognition AI raised $400M at a post-money valuation of $10.2B in September 2025, led by Founders Fund. | High | SO006, SO007, SO008, SO009 |
| CO010 | Cognition AI's total capital raised is approximately $696M as of September 2025. | High | SO008, SO014 |
| CO011 | Cognition AI acquired Windsurf, an AI-native IDE company, in July 2025. | High | SO004, SO009, SO015 |
| CO012 | Windsurf had approximately $82M ARR at the time of its acquisition by Cognition AI. | High | SO003, SO008, SO009 |
| CO013 | Windsurf had more than 350 enterprise customers at the time of its acquisition by Cognition AI. | Medium | SO004, SO009 |
| CO014 | Cognition AI's ARR grew from $1M in September 2024 to $73M in June 2025, representing approximately 73x growth in nine months. | High | SO008, SO024 |
| CO015 | Following the Windsurf acquisition, Cognition AI's combined ARR reached approximately $155M in July 2025. | High | SO003, SO008 |
| CO016 | Cognition AI's net cash burn was under $20M from founding through Q3 2025, indicating high capital efficiency. | Medium | SO003, SO014 |
| CO017 | Devin was publicly announced in March 2024. | High | SO002, SO005 |
| CO018 | Cognition AI initially explored cryptocurrency before pivoting to building autonomous AI coding agents. | Medium | SO005, SO010 |
| CO019 | The founding team of approximately 10 people collectively held 10 IOI gold medals at company launch. | Medium | SO002, SO011 |
| CO020 | Founders Fund led both the initial seed/Series A and the $400M Series B round for Cognition AI. | High | SO006, SO007, SO008 |
| CO021 | Goldman Sachs is among Cognition AI's enterprise customers and is reportedly running a large-scale Devin pilot with approximately 12,000 developers. | Medium | SO012, SO003 |
| CO022 | Cognition AI laid off 30 former Windsurf employees following the July 2025 acquisition. | High | SO013, SO016 |
| CO023 | Cognition AI offered nine-month salary buyout packages to approximately 200 remaining Windsurf employees who did not want to commit to the company's 80-hour, six-day workweek culture. | High | SO013, SO016, SO017 |
| CO024 | Cognition AI's enterprise ARR grew more than 30% in seven weeks following the Windsurf acquisition. | Medium | SO003 |
| CO025 | Cognition AI opened its Singapore APAC headquarters in April 2026. | High | SO001, SO003 |
| CO026 | Scott Wu studied economics at Harvard University. | Medium | SO011, SO010 |
| CO027 | Scott Wu previously co-founded Lunchclub, an AI-powered professional networking platform. | Medium | SO011, SO010 |
| CO028 | Devin's Core plan is priced at $20/month for individual developers with pay-as-you-go compute at approximately $2.25 per ACU. | High | SO026, SO025 |
| CO029 | Devin's Team plan costs $500/month and includes 250 Agent Compute Units plus full API access. | High | SO025, SO026 |
| CO030 | Cognition AI's $175M Series A was conducted at a $2B post-money valuation in April 2024. | High | SO006, SO008 |
| CO031 | Cognition AI raised approximately $21M in March 2024 at a valuation of approximately $350M in its initial institutional round. | Medium | SO008, SO021 |
| CO032 | Scott Wu holds the 'Legendary Grandmaster' designation on the Codeforces competitive programming platform. | Medium | SO011, SO010 |
| CO033 | Mercedes-Benz announced a partnership with Cognition AI in April 2026. | High | SO001, SO025 |
| CO034 | Independent product reviewers have found that Devin's real-world task success rates require significant human oversight on complex tasks, suggesting performance falls below the 'fully autonomous' marketing framing. | Medium | SO018, SO019 |
| CO035 | CEO Scott Wu has stated publicly that Cognition AI's culture requires six-day workweeks of more than 80 hours, framing it as a company-wide commitment to the mission. | High | SO013, SO016, SO017 |
| CO036 | Investors in the September 2025 $400M round included Lux Capital, 8VC, Elad Gil, Definition Capital, Swish Ventures, Bain Capital Ventures, and D1 Capital. | High | SO006, SO009, SO014 |
| CO037 | As of April 2026, Cognition AI is reportedly in discussions for a new financing round that could value the company at over $25 billion. | Medium | SO008 |
| CO038 | Walden Yan is an IOI gold medalist who leads product management and the product roadmap at Cognition AI. | Medium | SO010, SO005 |
| CO039 | Steven Hao previously held roles at Scale AI, Cursor, Modal, DeepMind, Waymo, and Nuro before co-founding Cognition AI. | Medium | SO010, SO011 |
| CO040 | Cognition AI had approximately 49 employees before the Windsurf acquisition according to revenue-per-employee analyses. | Low | SO008 |
| CO041 | Windsurf had been in acquisition discussions with OpenAI for approximately $3B and was the subject of a Google talent licensing deal worth approximately $2.4B before Cognition acquired the remaining team and IP. | Medium | SO015, SO009 |
| CO042 | Cognition AI's price-to-ARR multiple at the September 2025 valuation was approximately 65x (using $155M combined ARR) to 140x (using $73M pre-acquisition Devin ARR). | Medium | SO023, SO008 |
| CO043 | Devin operates within a sandboxed environment that includes a shell, code editor, and browser, using long-context model inference to sequence planning, implementation, and iterative testing steps autonomously. | High | SO002, SO025 |
| CM001 | The broad AI code tools market was valued at $7.37 billion in 2025, according to Mordor Intelligence, and is forecast to reach $23.97 billion by 2030 at a 26.6% CAGR. | Medium | SM001 |
| CM002 | Grand View Research forecasts the narrow generative AI coding assistants market at $92.5 million by 2030 at a 24.8% CAGR—a significantly smaller scope than Mordor's $24B figure. | Medium | SM002 |
| CM003 | ResearchAndMarkets published a $97.9 billion by 2030 forecast for the generative AI coding assistants space, though this figure likely uses an atypically broad scope including non-comparable segments. | Low | SM003 |
| CM004 | No single analyst consensus exists for the AI coding tools TAM: estimates range from $92M (Grand View, narrow) to $97.9B (ResearchAndMarkets, broad) for 2030, with the most operationally relevant SAM estimate at $1–8B. | Medium | SM001, SM002, SM003 |
| CM005 | The estimated serviceable addressable market for premium agentic developer tools (autonomous tier) is $1–2B in 2025–2026, growing to $5–8B by 2030 based on developer-seat and enterprise-spend lenses. | Medium | SM006, SM007, SM008 |
| CM006 | Global software spending reached approximately $675–$700 billion in 2024, according to WIPO's Global Innovation Index and multiple market research sources. | High | SM008, SM007 |
| CM007 | The number of professional software developers worldwide in 2024 is estimated at 27–28.7 million by Evans Data, Statista, and JetBrains Research. | High | SM006, SM007, SM020 |
| CM008 | Gartner forecasts that 90% of enterprise software engineers will use AI code assistants by 2028, up from less than 14% in early 2024. | High | SM004, SM005, SM009 |
| CM009 | GitHub Copilot holds approximately 42% market share among paid AI coding tools in 2025, with $2 billion in ARR and over 20 million all-time users. | Medium | SM017, SM018, SM019 |
| CM010 | GitHub Copilot is used by over 50,000 organizations including 90% of Fortune 100 companies in 2025, with enterprise adoption up 75% quarter-over-quarter. | Medium | SM017, SM019 |
| CM011 | Developers using AI coding assistants complete code tasks 51–55% faster, with AI-generated code constituting 46% of all code written by Copilot users on GitHub. | Medium | SM017, SM018 |
| CM012 | Enterprise spend on agentic AI systems is projected to surge from less than $1 billion in 2024 to over $51 billion by 2028 at approximately 150% CAGR. | Medium | SM023, SM024 |
| CM013 | Enterprise buyers universally require sandbox environments, audit logs, and human-in-the-loop review gates before deploying autonomous coding agents in production workflows. | High | SM013, SM014, SM015 |
| CM014 | The biggest barriers to enterprise adoption of agentic AI include trust deficits from hallucination risk, legacy system integration complexity, and unclear ROI measurement frameworks. | High | SM013, SM014, SM015, SM016 |
| CM015 | 65% of enterprises were regularly using generative AI as of 2025, up from 33% in 2023, indicating rapid mainstream adoption of AI tools in enterprise workflows. | Medium | SM012, SM023 |
| CM016 | Only a minority of AI use cases in enterprises have reached full production in 2025; primary barriers are legacy systems, data silos, compliance concerns, and talent shortages. | High | SM024, SM013 |
| CM017 | 72% of enterprises plan to deploy AI copilot and agent technologies by 2026, according to aggregated industry surveys. | Medium | SM023 |
| CM018 | The primary enterprise buyer for autonomous software engineering agents is VP Engineering or CTO, with budget sourced from developer productivity or digital transformation allocations. | Medium | SM012, SM024 |
| CM019 | Goldman Sachs (12,000-developer pilot), Citi, Cisco, Dell, Nubank, and Ramp are among Cognition AI's enterprise reference customers, representing the financial services and high-growth tech segments. | Medium | SM011, SM023 |
| CM020 | The EU AI Act and evolving US AI governance regulations impose documentation and explainability requirements that will affect enterprise deployment of autonomous coding agents beginning 2026–2027. | Medium | SM013, SM015 |
| CM021 | Status-quo substitutes for autonomous AI coding agents include offshore staff augmentation, traditional IDEs, internal developer tooling built on base LLM APIs, and cloud-vendor bundled IDE extensions. | Medium | SM001, SM011 |
| CM022 | Switching costs for enterprise AI coding tools include integration migration effort, process overhaul, vendor lock-in risks, and cultural reorientation, making incumbent vendor stickiness a meaningful competitive moat. | Medium | SM015, SM016 |
| CM023 | North America leads the AI coding tools market with the largest share; cloud deployment dominates though enterprise demand for on-premises solutions is growing, particularly among regulated industries. | Medium | SM001, SM002 |
| CM024 | The autonomous agents market (cross-sector) is forecast to grow from $4.35B in 2025 to $103B by 2034 at over 40% CAGR, providing a directionally supportive upper-bound context for the AI software engineering agent TAM. | Low | SM015, SM025 |
| CM025 | Gartner's Magic Quadrant for AI Code Assistants provides enterprise procurement legitimacy for the category, with GitHub Copilot named a Leader in both 2024 and 2025 editions. | High | SM004, SM010 |
| CM026 | 87% of developers are using AI tools daily as of 2025, indicating that AI-assisted development has reached mainstream adoption among professional software engineers. | Medium | SM011, SM023 |
| CM027 | Developers using AI tools report 3.2× productivity improvements on average, with 67% using AI tools at least five days per week according to 2025 surveys. | Medium | SM011, SM017 |
| CM028 | Financial services enterprises present a high-value segment for Cognition AI given Goldman Sachs' 12,000-developer Devin pilot; these buyers require SOC 2 Type II, data residency, and regulatory compliance before enterprise sign-off. | Medium | SM013, SM024 |
| CM029 | The global software development market is valued at $675–$730B in 2024; a 1% shift of project labor costs to AI tools implies a $6.7B+ market, corroborating the Mordor $24B broad TAM estimate for 2030. | Medium | SM008, SM021 |
| CM030 | AI-assisted pull request cycle times have been reduced by up to 75% (from 9.6 days to 2.4 days) for organizations using GitHub Copilot, per reported enterprise metrics. | Medium | SM017, SM018 |
| CM031 | Unpredictable agent behavior including hallucinations, lack of transparency, and security concerns are the primary trust barriers slowing enterprise deployment of autonomous AI coding agents. | High | SM014, SM016 |
| CM032 | 77% of companies are currently using or actively testing AI tools, with the global AI market estimated at approximately $298 billion in 2025. | Medium | SM023 |
| CM033 | Agentic AI adoption in software engineering is constrained not just by technology but by organizational readiness: workforce resistance, lack of AI talent, and unclear business cases are the dominant enterprise blockers as of 2025. | High | SM013, SM015, SM024 |
| CM034 | Competitive fragmentation in AI coding tools—GitHub Copilot (42% market share), Cursor ($9B valuation), Windsurf (acquired by Cognition), and OpenAI Codex—can produce buyer choice overload that stalls enterprise procurement decisions. | Medium | SM011, SM017 |
| CM035 | Cognition AI's $155M ARR as of July 2025 represents approximately 2% penetration of the estimated $7.37B AI code tools TAM, suggesting material white space remains if the market definition is correct. | Low | SM001 |
| CP001 | GitHub Copilot holds approximately 42% of the AI coding assistant market share as of 2025. | Medium | SP005, SP017 |
| CP002 | GitHub Copilot had over 20 million active users as of early 2025. | Medium | SP005, SP012 |
| CP003 | GitHub Copilot exceeded $2 billion in annual recurring revenue by early 2025. | Medium | SP002, SP004 |
| CP004 | GitHub Copilot is used by 90% of Fortune 100 companies. | Medium | SP012, SP021 |
| CP005 | Cursor (Anysphere) reached $2 billion ARR by February 2026. | High | SP001, SP003, SP004 |
| CP006 | Cursor raised a $2.3 billion Series D in November 2025 at a $29.3 billion valuation. | High | SP001, SP003 |
| CP007 | Cursor has over 1 million paying customers and 50,000 enterprise teams. | Medium | SP003, SP006 |
| CP008 | Cursor's multi-agent background agents feature runs up to eight parallel task threads simultaneously. | Medium | SP005, SP016 |
| CP009 | Amazon Q Developer offers a Free tier and a Pro tier priced at $19 per user per month. | High | SP013, SP018 |
| CP010 | Amazon Q Developer achieved SOC 2 Type II certification with VPC isolation support. | Medium | SP007, SP013 |
| CP011 | Amazon Q Developer has a 200,000 token context window. | Medium | SP007, SP018 |
| CP012 | Claude Code operates as a terminal-native agentic coding assistant without requiring a custom IDE. | High | SP008, SP009 |
| CP013 | Devin's initial SWE-bench score in March 2024 was 13.86%, a breakthrough at the time of announcement. | High | SP010, SP015, SP025 |
| CP014 | Subsequent model improvements from Anthropic pushed Claude-based agents above 50% on SWE-bench Verified. | Medium | SP010, SP015 |
| CP015 | Claude Code is priced at $10 per month on Pro and $100 per month on Max tiers. | High | SP027, SP008 |
| CP016 | OpenAI relaunched Codex as a web-based agentic coding environment backed by the o3 model family in 2025. | Medium | SP014, SP011 |
| CP017 | SWE-agent is an open-source research framework developed at Princeton NLP Group, not a commercial product. | High | SP025, SP010 |
| CP018 | Cursor investors include Thrive Capital, Andreessen Horowitz, Accel, Nvidia, and Google. | High | SP001, SP003 |
| CP019 | GitHub Copilot individual pricing is $10 per month; Business tier is $19 per month; Enterprise tier is $39 per month. | High | SP021, SP012 |
| CP020 | Windsurf (formerly Codeium) was acquired by Cognition in July 2025, adding approximately $82M ARR and 350-plus enterprise customers. | Medium | SP024, SP006 |
| CP021 | Cognition's acquisition of Windsurf converted a direct IDE-tier competitor into a distribution channel for Devin. | Medium | SP024, SP019 |
| CP022 | GitHub Copilot expanded from inline code completions to Copilot Workspace, a multi-step planning and execution agent. | Medium | SP012, SP005 |
| CP023 | Cursor pricing is $20 per month for Pro tier and $40 per month for Business tier as of 2025. | Medium | SP023, SP005 |
| CP024 | Replit offers an AI-native development platform with cloud execution environments targeting student and hobbyist developers. | Medium | SP022 |
| CP025 | The core architectural gap between Devin and Claude Code is that Claude Code still requires a human operator at the terminal, while Devin runs fully unattended. | Medium | SP009, SP008 |
| CP026 | Cognition's defensible competitive advantages include execution depth, enterprise data flywheel from completed tasks, and Windsurf distribution. | Medium | SP019, SP011 |
| CP027 | Benchmark credibility concerns about Devin emerged after independent evaluations showed other tools surpassing it within months of the March 2024 announcement. | Medium | SP010, SP015, SP025 |
| CP028 | The competitive window for pure agentic differentiation is narrowing as frontier model labs deploy equivalent agentic pipelines. | Medium | SP011, SP019 |
| CP029 | Cursor is growing faster than Cognition on ARR metrics and has more paying customers, providing proportionally more training signal. | Medium | SP001, SP003, SP004 |
| CP030 | Cursor uses a VS Code fork architecture that provides deep editor integration unavailable to plugin-based competitors. | Medium | SP005, SP016, SP023 |
| CP031 | Amazon Q Developer's native AWS service integrations position it as preferred for organizations standardized on AWS infrastructure. | High | SP007, SP013 |
| CP032 | GitHub Copilot's enterprise distribution advantages create high switching-cost inertia across Fortune 100 organizations. | Medium | SP012, SP021 |
| CP033 | The AI coding tools market in 2026 is stratified into hyperscaler-backed co-pilots, IDE-native assistants, and fully autonomous agent tiers. | Medium | SP011, SP017, SP019 |
| CP034 | Frontier model labs such as Anthropic and OpenAI control the underlying reasoning capabilities and can deploy equivalent agentic pipelines without licensing dependencies. | High | SP008, SP014 |
| CP035 | Cognition's enterprise go-to-market execution and customer retention metrics are the true leading indicators of sustainable competitive position, more so than benchmark scores. | Medium | SP009, SP011 |
| CP036 | Cursor's growth from zero to $2B ARR in approximately 24 months represents the fastest known ARR scale-up in AI coding tools. | Medium | SP001, SP003 |
| CP037 | Both GitHub Copilot and Amazon Q Developer lack end-to-end autonomous task completion without human-in-the-loop confirmation at each stage. | Medium | SP007, SP012 |
| CI001 | Cognition AI's Core plan is priced at $20 per month for 5 ACUs; Team plan at $500 per month for 250 ACUs. | High | SI004, SI022 |
| CI002 | Enterprise ACU overages are priced at approximately $2.25 per ACU beyond the plan inclusion. | Medium | SI004, SI001 |
| CI003 | Cognition AI reached approximately $1M ARR in September 2024. | Medium | SI001, SI005 |
| CI004 | Cognition AI grew from $1M to $73M ARR between September 2024 and June 2025—a 73x increase in nine months. | High | SI001, SI003, SI005 |
| CI005 | Windsurf (formerly Codeium) was priced at $0 (Free) and approximately $15 per month (Pro) before the Cognition acquisition. | Medium | SI006, SI007 |
| CI006 | Cognition AI's gross margin is estimated at 50–70%, based on inference compute costs versus ACU pricing—no primary disclosure exists. | Low | SI011, SI005 |
| CI007 | The Windsurf acquisition added approximately $82M ARR and 350-plus enterprise customers to Cognition's combined revenue base. | Medium | SI006, SI010, SI012 |
| CI008 | Cognition AI's combined ARR reached approximately $155M post-Windsurf acquisition in July 2025. | Medium | SI009, SI010, SI013 |
| CI009 | ARR per employee at Cognition was approximately $1.5M prior to the Windsurf acquisition (49 employees, $73M ARR in June 2025). | Low | SI003, SI011 |
| CI010 | ARR per employee dropped to approximately $623K post-Windsurf, reflecting the quadrupling of headcount to ~249 employees. | Low | SI019, SI003, SI011 |
| CI011 | Net revenue retention rate for Cognition AI is not publicly disclosed; analyst estimates range from 100–130% based on ARR growth trajectory. | Low | SI005, SI011 |
| CI012 | Cognition AI's seed/Series A in March 2024 raised approximately $21M at a $350M valuation, led by Founders Fund. | High | SI002, SI023, SI020 |
| CI013 | Cognition AI's April 2024 funding round raised $175M at a $2B post-money valuation, led by Founders Fund. | High | SI002, SI014, SI020 |
| CI014 | Cognition AI's September 2025 Series B raised $400M at a $10.2B post-money valuation with Lux Capital, 8VC, Elad Gil, Bain Capital Ventures, and D1 Capital as co-investors. | High | SI008, SI009, SI010 |
| CI015 | Cognition AI disclosed a net cash burn of under $20M from founding through Q3 2025—extraordinary capital efficiency for a company at this ARR growth rate. | Medium | SI001, SI009 |
| CI016 | Reports as of April 2026 indicate Cognition AI is in discussions for new financing at a $25B+ valuation. | Low | SI011, SI012 |
| CI017 | Cognition AI does not disclose gross margin, customer churn, NRR, CAC, or GAAP financial statements. | High | SI005, SI007, SI016 |
| CI018 | All ARR figures for Cognition AI rely on company blog posts, third-party news citing unnamed sources, or secondary market research—no audited financial data is available. | High | SI003, SI005, SI016 |
| CI019 | The Windsurf acquisition was estimated at approximately $250M, funded from Cognition's existing capital reserves without a new primary fundraise. | Low | SI006, SI010 |
| CI020 | Founders Fund led both the March 2024 seed/Series A and the April 2024 Series A extension, providing deep capital commitment and reputational signal. | High | SI002, SI008, SI020 |
| CI021 | A $25B+ valuation on $155M ARR implies a forward revenue multiple of approximately 160x, elevated even by hypergrowth AI company standards. | Medium | SI011, SI012 |
| CI022 | Cognition AI's total capital raised through Series B is approximately $596M excluding the Windsurf acquisition; total capital deployed is approximately $846M when including the acquisition. | Medium | SI008, SI009, SI010 |
| CI023 | Post-Windsurf headcount reportedly reached approximately 249 employees before layoffs of 30 and additional voluntary departures from buyout offers. | Medium | SI019, SI023 |
| CI024 | SEC Form D/A for Founders Fund IX, LP (CIK 0001971631, filed October 2025) discloses total capital raised of approximately $972M across the LP and principals fund, with Peter Thiel as managing member. | Medium | SI026 |
| CI025 | Founders Fund IX Principals Fund (CIK 0002090410) is a co-issuer with Founders Fund IX, LP per the amended SEC Form D/A, indicating a typical GP co-investment vehicle structure. | Medium | SI026 |
| CI026 | Cognition AI opened its Singapore APAC headquarters in April 2026, hiring Richard Spence as VP and General Manager APAC, targeting Southeast Asia, Australia, India, and South Korea. | High | SI027, SI018 |
| CI027 | Mercedes-Benz announced deployment of Devin and Windsurf across its global engineering organization in April 2026, focusing on legacy modernization, cloud-native development, and logistics applications. | Medium | SI028, SI018 |
| CI028 | Cognition AI introduced a revised pricing structure in April 2026, replacing the Core and Team plans with Free, Pro, Max, Teams, and Enterprise tiers, signaling a land-and-expand monetization strategy. | High | SI018, SI004 |
| CI029 | Cognition AI's annual Team plan pricing of $6,000 per developer is approximately 15x more expensive than GitHub Copilot Enterprise ($390/yr) and 25x more than Cursor Pro ($240/yr), justified by task-completion vs. code-suggestion value. | Medium | SI004, SI011 |
| CI030 | Total capital raised ($596M through Series B) relative to ARR at time of last raise ($73M Devin ARR at Series B in Sep 2025) implies a capital-to-ARR ratio of approximately 8x—high but declining as ARR compounds. | Medium | SI008, SI009, SI003 |
| CI031 | The 9-month severance buyout offered to approximately 200 Windsurf employees represents an estimated one-time integration cost of $15–30M, calculated on average startup engineer salaries of $100–150K. | Low | SI019 |
| CI032 | Cognition AI's April 2026 APAC office opening and Mercedes-Benz partnership signal material increases to operating expenditure in H1 2026 through regional headcount and GTM investment. | Low | SI027, SI028 |
| CI033 | Enterprise AI SaaS tools with variable compute cost structures typically achieve gross margins of 55–75% at scale, per industry benchmarks; Cognition's agentic model adds inference cost variance absent from seat-based tools. | Low | SI011, SI015 |
| CI034 | Cognition AI has not filed Form D or equivalent exempt offering notice with the SEC as of May 2026; the company's capital raises are documented only through investor disclosures and third-party press reports. | High | SI016, SI023, SI026 |
| CI035 | If the Windsurf $82M ARR churns by 30% post-acquisition migration, combined Cognition ARR falls to approximately $128M—a scenario not publicly addressed by company management. | Low | SI006, SI003, SI011 |
| CE001 | Devin is a cloud-hosted autonomous AI software engineer that accepts natural-language task specifications and executes complete software engineering tasks—planning, coding, testing, and deploying—without human intervention during execution. | High | SE002, SE004 |
| CE002 | In the original SWE-bench evaluation (March 2024), Devin resolved 13.86% of issues unassisted—far exceeding the prior state-of-the-art of 1.96% for unassisted agents and 4.80% for assisted LLMs. | High | SE003, SE006, SE020 |
| CE003 | Devin's agent runtime executes tasks within a sandboxed cloud environment equipped with a code editor, Unix shell, and headless browser—isolated from other sessions and the customer's production environment. | High | SE002, SE001 |
| CE004 | Each Devin session is bounded by a maximum runtime of 45 minutes; longer tasks must be broken into sequential sub-sessions. | Medium | SE003, SE001 |
| CE005 | Devin integrates natively with GitHub, GitLab, and Bitbucket for source control, enabling automated PR creation, branch management, and code contributions. | High | SE007, SE001 |
| CE006 | Devin integrates natively with Jira and Linear for project management ticket assignment and sprint delegation. | High | SE007, SE001 |
| CE007 | Devin integrates natively with Slack and Microsoft Teams for session initiation via chat messages and progress reporting. | High | SE007, SE001 |
| CE008 | Agent Compute Units (ACUs) are the proprietary metered compute token consumed per Devin task, priced at approximately $2.25 per ACU with plan-included allocations varying by tier. | High | SE004, SE008 |
| CE009 | Devin Enterprise tier offers VPC deployment allowing the agent to operate within customer-controlled cloud infrastructure, and custom-trained Devin instances tuned to proprietary codebases. | High | SE001, SE004 |
| CE010 | Devin accepts task inputs from multiple channels: web dashboard, Slack messages, Jira tickets, and Linear issues—translating natural-language instructions into executable multi-step plans. | High | SE001, SE002 |
| CE011 | Nubank achieved 12x engineering efficiency improvement and 20x cost savings using Devin to migrate a 6-million-line ETL monolith; Data, Collections, and Risk business units completed migrations in weeks instead of months. | Medium | SE016, SE014 |
| CE012 | SWE-Check, released April 2026 in collaboration with Applied Compute, is a reinforcement-learning-trained bug detection model that matches Anthropic Opus 4.6 performance while running approximately 10x faster. | Medium | SE024, SE002 |
| CE013 | The DeepWiki server builds vectorized project graphs of entire codebases, enabling Devin to navigate and reason about multi-million-line repositories beyond standard LLM context window limits. | Medium | SE001, SE002 |
| CE014 | The Windsurf acquisition (July 2025) added 350+ enterprise customers, 250,000+ daily active users, and the Windsurf IDE product including its Cascade agentic workflow engine. | High | SE015, SE022 |
| CE015 | In late 2024, a live-streamed demonstration publicly exposed a major security vulnerability in Devin's system prompt handling, potentially enabling prompt-injection attacks. Cognition acknowledged and patched the issue. | Medium | SE017, SE009 |
| CE016 | Devin 2.0 (April 2025) achieved 83% more tasks completed per ACU compared to Devin v1.x, representing a significant efficiency improvement for existing subscribers. | Medium | SE008, SE019 |
| CE017 | Devin 2.0 introduced a confidence meter that quantifies the probability of task success before and during execution, allowing teams to filter low-probability tasks and prioritize high-value delegations. | Medium | SE008, SE001 |
| CE018 | Model Context Protocol (MCP) integration allows Devin to connect to hundreds of external tools including monitoring platforms, databases, and documentation systems through the MCP Marketplace. | Medium | SE007, SE001 |
| CE019 | The Devin REST API enables programmatic session creation, status retrieval, and CI/CD pipeline integration for DevOps automation workflows. | Medium | SE007, SE001 |
| CE020 | A 32-week empirical study of 7,156 real-world pull requests showed Devin's PR acceptance rate increased by +0.77% per week—the only agent in the study with a consistent positive trend. | Medium | SE011, SE010 |
| CE021 | BlockDiff snapshotting records incremental state checkpoints during Devin's execution, enabling rapid rollback to a known-good state when tests fail or an approach proves unworkable. | Medium | SE001, SE002 |
| CE022 | Devin supports parallel task execution across multiple concurrent sessions, allowing enterprise teams to delegate entire sprint backlogs simultaneously rather than one task at a time. | Medium | SE001, SE004 |
| CE023 | Claude Code (Opus 4) achieved 72.5% on SWE-bench Verified (500 instances) in 2025, far exceeding Devin's original 13.86% benchmark score—though the two benchmarks use different task sets. | Medium | SE010, SE012 |
| CE024 | mini-SWE-agent achieved 65% on SWE-bench Verified in July 2025, matching most commercial agents at a fraction of the computational cost—a signal that benchmark performance is rapidly commoditizing. | Medium | SE006, SE020 |
| CE025 | Devin for Terminal (2026) allows developers to start an agent session locally from the terminal and escalate to cloud execution when the task outgrows local compute—preserving session state across the transition. | Medium | SE024, SE001 |
| CE026 | The CognitionAI/devin-swebench-results GitHub repository has 124 stars and 20 forks as of research date, indicating modest developer interest in the benchmark methodology transparency. | Medium | SE005 |
| CE027 | Cognition AI's 2026 product roadmap includes Devin-in-Windsurf integration, SWE-1.5 model improvements, SWE-Check bug detection, APAC regional expansion (Singapore), and Mercedes-Benz enterprise deployment. | Medium | SE024, SE015 |
| CE028 | Cognition AI has not publicly disclosed SOC 2 Type II, ISO 27001, or equivalent compliance certifications as of May 2026, representing a gap for regulated-industry enterprise procurement. | High | SE001, SE004, SE021 |
| CE029 | All Devin sessions run in isolated ephemeral compute containers, and code changes are surfaced only as pull requests requiring human approval before merging—preventing unauthorized production deployments. | High | SE001, SE004 |
| CE030 | The MCP Marketplace integrates Devin with monitoring tools (Sentry, Datadog, PagerDuty), databases (PostgreSQL, MySQL, MongoDB), and documentation (Notion, Confluence), enabling context-rich agent execution. | Medium | SE007, SE001 |
| CE031 | Enterprise VPC deployment allows Devin to execute tasks within the customer's own AWS, GCP, or Azure environment, eliminating source-code transmission to Cognition's external servers. | Medium | SE001, SE004 |
| CE032 | Windsurf Codemaps, powered by SWE-1.5 and Claude Sonnet 4.5, provides AI-annotated structured maps of codebases with 'trace guide' expansions, enabling navigation to exact file and line references. | Medium | SE024, SE013 |
| CE033 | The Devin REST API uses standardized HTTP endpoints for session creation, status polling, result retrieval, and webhook-based notifications, enabling integration into CI/CD pipelines. | Medium | SE007, SE001 |
| CE034 | GitHub Copilot was ranked a Leader in Gartner's 2025 Magic Quadrant for AI Code Assistants, representing the current benchmark for enterprise adoption against which Devin competes. | Medium | SE010, SE012 |
| CE035 | A 2026 empirical study of AI coding PRs across 7,156 tasks found documentation tasks achieved 82.1% acceptance versus 66.1% for new feature development—a 16-percentage-point gap exceeding inter-agent variance. | Medium | SE011, SE018 |
| CU001 | Cognition AI targets engineering teams at technology-forward enterprises, midmarket companies, and developer-first organizations, with buyer profiles including CTOs and VPs of Engineering seeking developer throughput multiplication. | High | SU001, SU021 |
| CU002 | Confirmed enterprise customer verticals include fintech/financial services (Nubank), automotive/manufacturing (Mercedes-Benz), IT services/outsourcing (Cognizant), US Federal Government (Feb 2026), and COBOL-heavy Fortune 500 legacy sectors. | High | SU002, SU003, SU004, SU006 |
| CU003 | Mercedes-Benz was announced in April 2026 as a production customer deploying Devin and Windsurf across its global engineering organization for legacy modernization, cloud-native development, and logistics. | Medium | SU003 |
| CU004 | Cognizant partnered with Cognition in January 2026 to deploy Devin and Windsurf across its own engineering organization and its global client base—a channel partnership that could expose Cognition to Cognizant's 300+ clients. | Medium | SU004, SU014 |
| CU005 | Cognition launched a Government vertical (Cognition for Government) in February 2026 to target US federal agencies for critical infrastructure software modernization; no named agencies have been publicly disclosed. | Medium | SU006 |
| CU006 | Cognition AI's estimated ARR grew from approximately $1M at general availability (December 2024) to approximately $73M by April 2025—a roughly 70x increase in approximately four months, coinciding with the price cut from $500/month to $20/month. | Low | SU007, SU008 |
| CU007 | Devin's PR merge rate (fraction of Devin-opened PRs accepted by human reviewers) improved from 34% at launch (March 2024) to 67% by April 2025—a doubling of accepted output quality in approximately 12 months. | Medium | SU008, SU009 |
| CU008 | By February 2026, Cognition's own engineering team was merging 659 Devin-authored pull requests per week—a 4x increase from their best week of 154 PRs in 2025. | High | SU017, SU025 |
| CU009 | Independent reviewers note that real-world ACU consumption on the $20/month Core plan can easily result in monthly bills of $100-200 for moderate usage, contradicting the headline price reduction's accessibility promise. | Medium | SU008, SU009 |
| CU010 | Windsurf IDE had 250,000+ daily active users and 350+ enterprise customers at the time of acquisition (July 2025), providing Cognition with a large top-of-funnel developer install base for Devin conversion. | High | SU010, SU014 |
| CU011 | Nubank used a fine-tuned custom Devin instance for ETL migration; fine-tuning doubled task completion scores and reduced per-task time from 40 minutes to 10 minutes (4x speed improvement). | Medium | SU002 |
| CU012 | Cognition opened offices in London (January 2026), Tokyo (April 2026), and Singapore (April 2026) within a four-month period, signaling management's confidence in revenue sufficiency for global expansion. | High | SU005, SU006 |
| CU013 | Net Revenue Retention (NRR), Gross Revenue Retention (GRR), and customer churn rate have not been publicly disclosed by Cognition AI; retention durability cannot be independently assessed. | Medium | SU007, SU008 |
| CU014 | Hacker News developer community expressed skepticism about Devin's real-world performance versus demo quality; early reviews noted that complex or novel tasks frequently required significant rework, exceeding expectations set by the SWE-bench launch announcement. | Medium | SU011, SU023 |
| CU015 | The security vulnerability disclosed in December 2024—a live-streamed prompt-injection attack on Devin—temporarily damaged trust among security-conscious enterprise buyers, requiring Cognition to accelerate security remediation communication. | Medium | SU018, SU011 |
| CU016 | Nubank's use of Devin expanded from the initial Data unit pilot to Data, Collections, and Risk business units, confirming land-and-expand progression within a single named enterprise customer. | Medium | SU002 |
| CU017 | Cognition AI's land-and-expand strategy relies on starting with a discrete project (migration, modernization) within one business unit, demonstrating ROI, then expanding to adjacent teams—as demonstrated by the Nubank case. | Medium | SU002, SU004 |
| CU018 | Windsurf's 250,000+ daily active users serve as a top-of-funnel for Devin cloud agent adoption: Windsurf Free users experiencing AI-assisted local development are natural conversion targets for cloud-based autonomous task delegation. | Medium | SU010, SU014 |
| CU019 | Devin's COBOL modernization use case is inherently project-based—once a legacy migration is complete, there is no structural recurring need unless customers contract for ongoing support or continued modernization work. | Medium | SU004, SU016 |
| CU020 | Top-customer concentration is a material but undisclosed diligence gap: with ~$73M ARR and a small set of confirmed named accounts, a single Nubank-scale contract could represent more than 10% of ARR. | Medium | SU007, SU008 |
| CU021 | Cognizant's reseller partnership creates a single-channel concentration risk: if Cognizant accounts for a material share of Cognition's enterprise pipeline, the loss of that relationship would disproportionately impact new account growth. | Medium | SU004, SU022 |
| CU022 | The self-serve developer tier (Free and Pro plans) provides a low-ACV but high-volume customer base that can generate bottom-up enterprise procurement pressure, analogous to Slack's or Atlassian's PLG motion. | Medium | SU008, SU009, SU015 |
| CU023 | Growjo estimates Cognition AI's total funding at $1.4B and current valuation at $10.2B as of September 2025, consistent with reports of a $400M fundraise following the Windsurf acquisition. | Low | SU007, SU010 |
| CU024 | Cognition AI had approximately 222 employees as of 2025 and grew headcount by approximately 102% year-over-year—a pace consistent with rapid revenue scaling but also requiring significant talent investment. | Low | SU007, SU020 |
| CU025 | The absence of SOC 2 Type II and FedRAMP compliance attestations in public documentation limits Cognition's addressable market among regulated enterprises (financial services, healthcare, US government) that require these certifications for procurement. | High | SU016, SU021 |
| CU026 | Devin's PR merge rate of 67% implies a 33% rejection or rework rate on AI-generated pull requests, which may be acceptable for high-volume routine tasks but represents a significant quality gap versus a senior human engineer. | Medium | SU008, SU019 |
| CU027 | Fortune 500 COBOL modernization deployments are confirmed via Cognition's April 2026 blog post, but no named customers, outcome metrics, or contract structures are disclosed—limiting diligence quality. | Medium | SU004 |
| CU028 | Geographic expansion from US-only to Europe and APAC in Q1-Q2 2026 within four months demonstrates aggressive international GTM investment, creating both market opportunity and potential cash burn risk from multi-office overhead. | High | SU005, SU006 |
| CU029 | The imseankim.com review noted ARR growth from $1M to $73M in nine months while also identifying that 'independent testers tell a more complicated story' compared to official benchmarks—suggesting a gap between marketing and user-level satisfaction at the margin. | Medium | SU008 |
| CU030 | Devin supports concurrent multi-session task execution, allowing enterprise teams to delegate entire sprint backlogs simultaneously, which is cited as a key adoption driver for teams managing high-volume routine work. | Medium | SU021, SU024 |
| CU031 | The Devin PR acceptance rate empirical study (Arxiv 2026) showed a +0.77%/week positive trend—the only agent with sustained improvement—suggesting iterative model quality gains that support long-term customer retention. | Medium | SU019 |
| CU032 | Cognition AI's Government vertical launch (February 2026) positions Devin as a tool for US federal legacy software modernization, a multi-billion dollar addressable budget segment, but no government contract or agency name has been publicly disclosed. | Medium | SU006 |
| CU033 | Nubank's ETL migration case study is the only published, quantified production customer proof; Mercedes-Benz, Cognizant, and Fortune 500 COBOL deployments lack independent third-party verification or outcome data. | High | SU002, SU003, SU004 |
| CU034 | Devin Autofix Review Comments (January 2026) closes a key customer workflow loop—Devin automatically addresses pull request review feedback—substantially improving the PR review cycle productivity for enterprise teams. | Medium | SU029, SU017 |
| CU035 | The Cognizant channel partnership bypasses individual enterprise sales cycles by embedding Devin into Cognizant's managed services offering—a go-to-market shortcut that scales account coverage rapidly but creates single-reseller revenue concentration risk. | Medium | SU004, SU021 |
| CR001 | The EU AI Act (Regulation 2024/1689) imposes GPAI transparency and documentation obligations on providers of general-purpose AI models effective August 2026; Cognition has not publicly disclosed its GPAI registration status, model transparency documentation, or training data copyright compliance policy. | High | SR009, SR007 |
| CR002 | Under GDPR and UK GDPR, organizations must conduct Data Protection Impact Assessments for high-risk AI processing; Devin's autonomous code writing, PR merging, and production deployment capabilities likely qualify as high-risk processing under GDPR Article 35, requiring DPIAs from EU/UK enterprise customers. | High | SR010, SR009 |
| CR003 | California SB 1047, which would have imposed safety requirements on frontier AI developers, was vetoed by Governor Newsom in September 2024; however, successor legislation including AB 2013 remained active in the 2025–2026 California legislative session, creating ongoing monitoring obligation. | Medium | SR006, SR008 |
| CR004 | The Doe v. GitHub, Microsoft, and OpenAI class action alleges that training AI coding models on public GitHub repositories without license compliance constitutes copyright infringement; Cognition has not disclosed its model training data provenance, creating analogous latent IP exposure under the same copyright theory. | Medium | SR011, SR007 |
| CR005 | The January 2025 White House Executive Order revoked prior Biden-era AI safety mandates, creating a deregulatory US federal environment for AI developers; however, agency-level rules for government AI procurement—relevant to Cognition's announced Government vertical—remain under development. | High | SR008, SR006 |
| CR006 | In December 2024, Cognition publicly disclosed a prompt-injection vulnerability in Devin discovered during a live-streamed demonstration; the vulnerability could enable an adversary to embed malicious instructions in repository content, causing Devin to exfiltrate credentials or insert backdoors in customer codebases. | Medium | SR001, SR013 |
| CR007 | The OWASP Top 10 for LLM Applications lists Prompt Injection (LLM01) as the highest-severity risk for LLM-based applications; for autonomous agents with code execution, PR merge, and terminal access permissions, a successful prompt-injection exploit has a blast radius materially larger than for passive coding assistants. | High | SR003, SR004 |
| CR008 | Cognition obtained SOC 2 Type II certification in March 2024, prior to the December 2024 prompt-injection incident; the SOC 2 report is not publicly accessible without executing an NDA through the Cognition Trust Center, limiting independent assessment of control effectiveness by enterprise procurement teams. | High | SR001, SR002 |
| CR009 | Devin 2.0 (April 2025) introduced the ability to directly merge pull requests and schedule autonomous agent runs on customer infrastructure, materially expanding the attack surface and potential blast radius of any future prompt-injection or supply-chain compromise beyond what existed at the December 2024 incident. | Medium | SR016, SR003 |
| CR010 | Cognition's security documentation acknowledges that Devin code output may contain hallucinations, bugs, or insecure code, and recommends code reviews and branch protections as mitigations; this acknowledged limitation creates ongoing liability exposure for enterprise customers deploying Devin in production code pipelines without mandatory human review gates. | High | SR001, SR004 |
| CR011 | Cognition is structurally dependent on Anthropic and OpenAI as foundation model API providers; these same companies compete with Devin through Claude Code and OpenAI Codex, creating a supply chain where the primary suppliers have economic incentives to disadvantage a dependent customer. | High | SR023, SR015 |
| CR012 | AWS provides the cloud infrastructure for Devin's VPC deployment option; Devin's enterprise architecture is AWS-native with no disclosed multi-cloud or on-premises fallback, creating material infrastructure concentration risk in a single cloud vendor. | Medium | SR001, SR016 |
| CR013 | GitHub (Microsoft) provides the primary code repository integration for Devin's PR workflow; Microsoft also distributes GitHub Copilot as a direct Devin competitor, meaning Cognition depends on a competitor-owned platform for core product functionality. | High | SR015, SR007 |
| CR014 | Cognition's acquisition of Windsurf (July 2025) added 350+ enterprise accounts and 250K+ daily active users but introduced integration execution risk: two AI agent codebases, model pipelines, enterprise sales motions, and billing systems must be combined simultaneously while maintaining ARR growth momentum. | Medium | SR018, SR030 |
| CR015 | Cognition's three co-founders have exceptional technical credentials as competitive programmers but no prior track record building enterprise software companies beyond early-stage startups; execution risk is elevated as the company scales from startup to enterprise sales motion across multiple geographies simultaneously. | Medium | SR025, SR017 |
| CR016 | AI engineering talent competition is severe; Anthropic, OpenAI, Google DeepMind, and Microsoft actively recruit competitive programming and ML talent at packages estimated above $500K total compensation, directly competing with Cognition's talent pool and creating ongoing retention risk. | Medium | SR023, SR015 |
| CR017 | Cognition's international expansion to Japan, Singapore, EU, and US Government sectors requires jurisdiction-specific compliance infrastructure—EU DPO appointment, Japan PIPL compliance, FedRAMP authorization—with no publicly disclosed plans or timelines, creating a gap between announced customer segments and compliance readiness. | Medium | SR009, SR010 |
| CR018 | At Devin's March 2024 launch its SWE-bench Full score of 13.86% was the then-best result for autonomous coding agents; by mid-2025 Claude Code Opus 4 achieved 72.5% on SWE-bench Verified—a competitive improvement rate that threatens to eliminate Devin's technical differentiation within 12–18 months. | High | SR015, SR023 |
| CR019 | The FTC stated that concentrated control of foundational AI inputs—including foundation models and cloud compute—could allow providers to distort competition in downstream AI application markets; this is a direct structural risk for Cognition given its dependence on Anthropic/OpenAI (models) and AWS (compute), both of which are also competitive threats. | High | SR007, SR011 |
| CR020 | ARR growth from ~$1M at GA (December 2024) to ~$73M estimate (April 2025) across a small number of named enterprise customers implies high per-customer ARR concentration; individual churn events at top customers could cause material ARR declines capable of impairing the $10.2B valuation estimate. | Medium | SR019, SR020 |
| CR021 | Devin's ACU pricing model ($2.25 per ACU overage) creates unpredictable usage-based cost exposure for enterprise customers; 'bill shock' from unexpected ACU consumption is a cited churn risk factor in independent community analysis of early Devin adopters. | Medium | SR020, SR028 |
| CR022 | Cognition has raised approximately $1.575B in disclosed funding through mid-2026; with 222 employees, 102% annual headcount growth, and compute-intensive AI infrastructure, estimated monthly burn is $5–15M, implying 18–36 months of runway under conservative scenarios assuming $73M ARR and aggressive growth continuation. | Medium | SR024, SR021 |
| CR023 | LLM inference costs for Devin sessions—paid at commercial API rates to Anthropic and OpenAI—compress gross margins structurally; no path to proprietary model development or renegotiated volume pricing is publicly disclosed, leaving margin compression risk unaddressed. | Medium | SR023, SR016 |
| CR024 | GitHub Copilot's distribution through Microsoft enterprise sales channels—250M+ installs, Microsoft 365 bundling potential, Azure integration—represents a structural distribution advantage that benchmark performance alone cannot overcome for Cognition. | High | SR015, SR027 |
| CR025 | Cursor's approximately $500M ARR at $9B valuation with $40/month pricing represents a 12.5× price advantage over Devin's Team plan at $500/month; this price sensitivity gap creates pricing risk as competitors improve and enterprise buyer ROI scrutiny increases. | Medium | SR027, SR019 |
| CR026 | Nubank is the only named customer with published and quantified production deployment outcomes; $73M ARR from an undisclosed number of additional enterprise accounts implies high revenue concentration risk and limits independent verification of Cognition's enterprise product-market fit beyond one case study. | Medium | SR026, SR020 |
| CR027 | Developer community analysis of the December 2024 prompt-injection incident described Cognition's prior credibility as damaged, noting: Devin's claimed product superiority was inconsistent with observed PR quality; the security gap was described as 'amateurish' for a product in development for over a year. | Medium | SR013, SR014 |
| CR028 | The OWASP GenAI Security Project identifies agent autonomy as an emerging, distinct risk category beyond classical LLM vulnerabilities; agents capable of modifying code, merging PRs, and executing shell commands face compound risks where a single exploit can traverse multiple security boundaries simultaneously. | High | SR004, SR003 |
| CR029 | CVE-2024-5185 (EmbedAI CSRF data-poisoning) demonstrates that AI application platforms face real-world vulnerability classes—CSRF, data poisoning—beyond prompt injection; the NVD tracks similar classes across AI applications, indicating a broad and evolving vulnerability landscape for agentic coding platforms like Devin. | Medium | SR005, SR003 |
| CR030 | EU AI Act GPAI provisions require frontier model providers to publish summary information about training data, establish copyright compliance policies, and publish technical documentation; Cognition has not publicly disclosed model training data provenance, open-source license compliance scope, or copyright policy. | Medium | SR009, SR001 |
| CR031 | The ICO's AI and data protection risk toolkit requires organizations to conduct DPIAs for high-risk AI processing; Devin's autonomous PR merges and production deployments on customer infrastructure in EU member states likely qualify under GDPR Article 35, creating DPIA obligations for UK/EU customers. | Medium | SR010, SR009 |
| CR032 | Doe v. GitHub and OpenAI consolidated copyright litigation (In Re: OpenAI Copyright Infringement, S.D.N.Y. 2025) involves claims that AI model training on copyrighted code and text violates copyright law; an adverse ruling could require Cognition to disclose or remediate its training data, imposing significant operational and legal costs. | Medium | SR011, SR004 |
| CR033 | Cognition's January 2025 Series A was led by Founders Fund at a $2B post-money valuation; a security incident, major regulatory enforcement action, or sustained competitor improvement could trigger down-round financing pressure at the next capital raise given the high valuation multiple relative to current ARR. | Medium | SR017, SR024 |
| CR034 | Developer community skepticism at Devin's launch—documented on HN and via independent benchmarking—highlighted concerns about benchmark reliability, limited demo scope, and performance gaps for non-standard tasks; this adverse reputation dynamic increases adoption friction among technically sophisticated enterprise evaluators. | Medium | SR014, SR013 |
| CR035 | Cognition's headcount grew 102% year-over-year to approximately 222 employees (Growjo, April 2025); this rapid hiring pace, without disclosed revenue-per-employee targets or profitability milestones, increases burn rate risk if ARR growth decelerates below the ~70×/year pace observed in the early 2025 growth sprint. | Medium | SR021, SR020 |
| CR036 | arXiv research on AI-assisted PR acceptance rates over 32 weeks shows a measurable +0.77%/week improvement trend, validating that autonomous coding agent productivity is improving; however, the long-run acceptance ceiling and transition from supervised to fully autonomous PR merges at enterprise scale remains empirically unproven. | Medium | SR022, SR015 |
| CR037 | Under GDPR Article 35, automated decision-making with significant effects on individuals requires a DPIA; Devin's autonomous code deployments affecting production systems could qualify in EU deployments, creating DPIA obligations that enterprise customers must fulfill before deploying Devin at scale. | Medium | SR010, SR002 |
| CR038 | Cognition's pricing dropped by approximately 3× with Devin 2.0 (April 2025), from higher effective ACU pricing to lower tiers; while improving adoption, this pricing compression requires proportional volume growth to sustain ARR trajectory and reduces the pricing power available to offset future LLM API cost increases. | Medium | SR028, SR019 |
| CR039 | The White House AI EO of January 2025 creates a deregulatory federal environment that reduces US-federal compliance burden for Cognition in the near term; but agency rules for government AI procurement—relevant to Cognition's announced Government customer vertical—may introduce FedRAMP and NIST AI RMF requirements on a 12–24 month timeline. | Medium | SR008, SR006 |
| CR040 | Independent developer analysis consistently identifies Devin's ACU cost model opacity and unpredictable session-level billing as adoption barriers; enterprise customers in community analysis report difficulty estimating total cost of ownership, creating adoption friction beyond the pilot phase. | Medium | SR013, SR020 |
| CR041 | SWE-bench Verified (500 human-filtered instances) is considered more reliable than SWE-bench Full (2294 instances); Claude Code Opus 4's 72.5% Verified score versus Devin's 13.86% Full score reflects both a benchmark type difference and a fundamental performance gap that, unless Cognition closes it, will increasingly drive enterprise procurement decisions toward Anthropic's product. | High | SR015, SR027 |
| CR042 | Cognition's terms of service prohibit using Devin outputs to train competing models, but do not publicly disclose the provenance of Devin's own training data; this asymmetry creates potential legal exposure under the same copyright theories being litigated against GitHub and OpenAI in the Doe v. GitHub class action. | Medium | SR001, SR011 |
| CV001 | Cognition AI's $10.2B September 2025 post-money valuation divided by an estimated $73M trailing ARR (April 2025) implies approximately a 140× ARR multiple—a 7× premium to the closest comparable Cursor/Anysphere, which closed a $9.9B Series D at approximately $500M ARR, implying ~20× ARR. | Medium | SV011, SV012, SV010 |
| CV002 | The recommendation for Cognition AI at $10.2B valuation is watchlist / conditional invest: the bull thesis (70×/year ARR growth, Windsurf integration moat, enterprise lighthouse customers) is structurally compelling but insufficient to justify a risk-adjusted positive return at 140× ARR without NRR, gross margin, and top-customer concentration data. | Medium | SV012, SV013, SV017 |
| CV003 | Entry discipline recommendation: investors should target a Cognition AI entry valuation below $5B (implying ≤70× current estimated ARR), at which the bull case returns are 3.5–5× in 3 years and the bear case returns imply losses limited to 50–80% of investment, reflecting a more acceptable risk-reward asymmetry. | Low | SV012, SV013 |
| CV004 | Cognition AI raised $175M at a $2B valuation in its Series A round led by Founders Fund IX (CIK 0001971631, EDGAR); this represents a 5.1× step-up to the September 2025 $10.2B round in approximately nine months—one of the fastest large-round step-up trajectories in AI software startup history. | High | SV002, SV011 |
| CV005 | Cognition Capital SPV I (CIK 0002072175, Form D filed June 2025) was structured contemporaneously with the Windsurf acquisition; the SPV structure suggests one or more institutional investors required a structured co-investment vehicle, which is a common signal of concentration risk in a single institution funding the Windsurf consideration. | Medium | SV001, SV003 |
| CV006 | A 'culture reset' at Cognition post-Windsurf acquisition—reported as Cognition offering nine-month buyouts to Windsurf staff and requiring 80-hour workweeks—introduces talent attrition risk that is not reflected in the $10.2B headline valuation and could impair the 250K+ DAU Windsurf user base that underpins the IDE moat thesis. | Medium | SV021, SV014 |
| CV007 | Founders Fund IX's portfolio page confirms Cognition AI as a portfolio company; Founders Fund IX filed an amended Form D (D/A) in October 2025, indicating continued capital activity in the fund consistent with ongoing Cognition investment or follow-on management activity. | High | SV003, SV007 |
| CV008 | Under the bull scenario, Cognition AI reaches $700–800M ARR by end-2027 (implying ~45% quarterly growth sustained for 10 quarters from the April 2025 base), at which a 25–35× forward ARR exit multiple would imply an exit valuation of $18–28B, representing a 1.5–2.5× return from the $10.2B September 2025 round. | Low | SV012, SV013 |
| CV009 | Under the base scenario, ARR growth decelerates from 9-month 70×/year pace to 30–40% quarterly growth, reaching $200–350M ARR by end-2027; at 20–25× forward ARR multiple, the implied exit valuation is $4–9B—flat to slight loss relative to the $10.2B entry, implying negative risk-adjusted returns for investors entering at $10.2B. | Medium | SV012, SV013 |
| CV010 | Under the bear scenario, a material adverse event (second security incident, major customer churn, or competitor price undercut >3×) causes ARR to plateau at $80–120M with NRR below 90%; at 8–12× trailing ARR, the implied exit valuation is $640M–$1.4B, representing a 90%+ capital loss for investors at $10.2B entry. | Medium | SV017, SV019 |
| CV011 | Cursor/Anysphere's Series D at $9.9B on approximately $500M ARR implies a 20× ARR multiple; Cognition's $10.2B valuation at approximately $73M ARR implies a 140× multiple, representing a 7× premium to Cursor on a comparable product category—enterprise AI coding tools—that is only sustainable if Cognition grows 7× faster to ARR parity with Cursor within 12–18 months. | Medium | SV010, SV012 |
| CV012 | Harvey AI (legal AI) at approximately $3B valuation and ~$100M ARR implies a 30× ARR multiple; high NRR (reportedly above 150% for legal enterprise accounts) justifies this premium. Cognition at 140× ARR requires NRR disclosure at a comparable level to avoid multiple compression, but NRR is currently undisclosed. | Low | SV018, SV013 |
| CV013 | Glean (enterprise AI search) at $4.6B valuation and approximately $100M ARR implies a 46× ARR multiple; its NRR above 150% and lack of benchmark commoditization risk distinguish it from Cognition. Cognition's 140× ARR multiple exceeds even high-NRR AI SaaS peers, suggesting the market is pricing in either a unique category premium or future ARR disclosure that narrows the apparent gap. | Low | SV018, SV004 |
| CV014 | Replit (developer platform, $1.16B valuation, 2023) represents the category risk for Cognition: a developer productivity pioneer that lost relative market share to more focused AI-native competitors (Cursor, Devin) during the 2024–2025 agentic coding transition. The Replit trajectory is a base-case downside reference for Cognition's own category risk. | Medium | SV018, SV017 |
| CV015 | GitHub Copilot, embedded in Microsoft's GitHub Enterprise product at $19–$39 per seat per month, represents a structural pricing threat: Microsoft can subsidize Copilot through Azure and M365 bundling, enabling sustained price pressure at a price point approximately 15–25× below Devin's $500/month Team plan while offering comparable pass-level coding assistance. | Medium | SV009, SV017 |
| CV016 | OpenAI Codex positions directly as an autonomous coding agent—similar to Devin—in Q2 2025, representing a new competitive threat from a foundation model provider that both supplies Devin's underlying intelligence and competes for the same enterprise developer productivity budget. | High | SV009, SV020 |
| CV017 | Amazon Q Developer (AWS-bundled AI coding assistant) competes with Devin for enterprise developer productivity budget within AWS cloud deployments; its bundling within AWS Enterprise Support creates lock-in dynamics that disadvantage Devin for customers already committed to the AWS ecosystem. | Medium | SV009, SV018 |
| CV018 | A second security incident within 18 months of the December 2024 prompt-injection disclosure is the primary thesis-break trigger; the probability is moderate given Devin 2.0's expanded attack surface (autonomous PR merges, scheduled agents) and the absence of a published post-Devin-2.0 penetration test or bug-bounty program. | Medium | SV019, SV020 |
| CV019 | Customer concentration risk—undisclosed top-customer ARR share—is the primary financial thesis-break variable. If Nubank accounts for greater than 30% of the estimated $73M ARR, a Nubank renewal risk is a material single-event that could reduce ARR by $20M+ without any other change in business trajectory. | Medium | SV027, SV012 |
| CV020 | Benchmark commoditization risk is the primary competitive thesis-break: Claude Code Opus 4's 72.5% SWE-bench Verified score (2025) versus Devin's 13.86% Full score (2024 launch) demonstrates a 5× improvement rate in 15 months that, if sustained, eliminates Devin's technical differentiation by late 2026. | High | SV017, SV020 |
| CV021 | NRR below 90% would be a thesis-break for any investment at $10.2B; NRR below 90% implies net negative ARR cohort contribution, meaning the existing customer base is contracting, and sustaining headline ARR growth requires ever-larger new logo acquisition to compensate—a structurally unsustainable growth model at enterprise SaaS scale. | High | SV013, SV012 |
| CV022 | EU regulatory enforcement action under the EU AI Act GPAI provisions (effective August 2026) could restrict Cognition's EU market access; Cognition expanded to Europe (London office, January 2026) without disclosed GPAI compliance infrastructure, creating a potential Q3 2026 enforcement risk. | Medium | SV022, SV023 |
| CV023 | Net Revenue Retention (NRR) by cohort is the single most critical undisclosed financial metric for valuation purposes; it determines whether the $73M ARR estimate reflects durable enterprise contracts or early-adopter pilots with high churn, a distinction worth $2–5B in implied valuation. | High | SV012, SV013 |
| CV024 | Gross margin is the second critical undisclosed metric; Devin's per-session LLM API costs (Anthropic/OpenAI commercial rates) are substantial for compute-intensive multi-hour sessions, and gross margin below 30% would signal an unsustainable cost structure that requires either proprietary model development or volume pricing renegotiation before profitability. | Medium | SV020, SV028 |
| CV025 | Post-Devin-2.0 penetration testing and red-team disclosure is required for enterprise procurement security sign-offs; the December 2024 incident occurred before Devin 2.0's expanded agentic capabilities (PR merges, scheduling), meaning the existing SOC 2 Type II audit (March 2024) does not cover the current expanded attack surface. | High | SV019, SV028 |
| CV026 | Windsurf customer retention at 6 months post-acquisition (January 2026) is a critical proof point for the IDE moat thesis; if the 350 enterprise accounts acquired with Windsurf show greater than 15% churn, the Windsurf contribution to ARR is overstated and the platform differentiation thesis weakens. | Medium | SV021, SV014 |
| CV027 | LLM API contract terms with Anthropic and OpenAI—specifically volume pricing floors, agentic use-case restrictions, and change-of-control provisions—are undisclosed but material; a scenario where Anthropic raises API pricing 3× as Claude Code gains enterprise share would compress Devin's gross margin materially and impair the $10.2B valuation. | Medium | SV020, SV009 |
| CV028 | FedRAMP authorization status for the US Government vertical (launched February 2026) is a credibility question for government procurement; without FedRAMP In Process or Authorized status, government agency sales are limited to informal pilots, capping US Government revenue contribution at a level well below the implied market opportunity. | Medium | SV022, SV006 |
| CV029 | The Founders Fund IX Form D/A amendment (October 2025, EDGAR) confirms continued fund activity consistent with Cognition follow-on capital management or new investment in the post-$400M round period; combined with the Cognition Capital SPV I (June 2025), the structured capital base is complex and the preference stack is undisclosed. | High | SV003, SV001 |
| CV030 | The $73M ARR estimate at April 2025, growing from $1M at GA (December 2024) in nine months, represents a 73× top-line growth rate—exceptional even by AI-native startup standards—but is based on third-party estimates (Growjo, Sacra) that have not been confirmed by Cognition in any official disclosure. | Medium | SV012, SV013 |
| CV031 | Devin's pricing dropped approximately 3× with Devin 2.0 (April 2025), with the Team plan remaining at $500/month but providing 83% more tasks per ACU; this effective price reduction improves adoption but compresses the revenue-per-session metric, requiring proportionally greater volume to maintain ARR trajectory. | High | SV028, SV012 |
| CV032 | Total disclosed funding across all rounds is approximately $1.575B; at a $10.2B post-money valuation, early rounds are already at 60–140× capital returned on paper, creating exit pressure for early investors that may accelerate secondary activity and reduce inside-round price discovery for new institutional investors. | Medium | SV011, SV023 |
| CV033 | Mercedes-Benz announced a partnership with Cognition AI in April 2026, adding a second named enterprise customer alongside Nubank; while terms and ARR contribution are undisclosed, the automotive/manufacturing vertical validates enterprise cross-sector demand and reduces the market risk of over-dependence on financial services verticals. | High | SV008, SV022 |
| CV034 | a16z's 100 Gen AI Apps report identifies coding tools and developer productivity as the highest-revenue AI application category, with enterprise developer tools generating $5B+ in collective ARR across the cohort as of their most recent analysis; this market context supports the bull-case premise that Cognition is entering the most valuable AI application segment. | Medium | SV004, SV018 |
| CV035 | Replit's valuation of $1.16B in 2023 as a developer platform that failed to capture the AI coding wave represents the primary bear-case precedent for Cognition; developer tool platforms with strong user bases but insufficient enterprise NRR can see rapid valuation compression when more focused AI-native competitors commoditize their core value proposition. | Medium | SV018, SV017 |
| CV036 | Cognition AI's geographic expansion to Europe (London, January 2026), Japan, and Singapore (April 2026) adds TAM and demonstrates customer pull outside the US but introduces compliance infrastructure gaps—EU GPAI registration, UK GDPR DPO appointment, Japan PIPL—that create near-term regulatory and reputational risk before those markets generate material ARR. | Medium | SV022, SV025 |
| CV037 | The Cognition Capital SPV I Form D (CIK 0002072175, filed 2025-06-11, Claymont, DE) is consistent with a structured co-investment vehicle used to fund the Windsurf acquisition consideration or provide a liquidity backstop to Windsurf investors, adding a layer of financial structure complexity to the Cognition cap table beyond a standard venture equity round. | Medium | SV001, SV003 |
| CV038 | The Dealroom.co listing for Cognition AI reflects the company's early financing stages at a $8M valuation figure—consistent with seed-round entry—while TechCrunch and CNBC confirm the September 2025 round at $10.2B; this 1,275× valuation step-up across approximately 24 months is unprecedented in enterprise SaaS history and indicates market expectation of near-monopoly market capture. | Medium | SV005, SV029 |
| CV039 | For an IPO at $8B+ market capitalization on conventional public market multiples (15–25× forward ARR), Cognition would need to demonstrate approximately $320–535M ARR in the 12 months preceding the IPO; at base-case growth rates, this milestone is not reached before 2028–2029, suggesting the IPO window is approximately 2–4 years from May 2026. | Medium | SV011, SV024 |
| CV040 | Strategic acquirer scenarios exist: Microsoft (GitHub distribution), Alphabet (Google Cloud developer tools), Salesforce (Einstein developer productivity), and Databricks/Snowflake (data engineering automation) are all logical fits; however, the $10.2B entry price for Cognition makes a strategic acquisition at premium unlikely unless ARR scales to $400M+ first, as acquirers would need to pay a 20–30% acquisition premium above the last private valuation. | Medium | SV011, SV023 |
| CV041 | Cursor/Anysphere's November 2025 Series D at $29.3B (per Business Wire) represents an even more recent and higher precedent valuation for AI coding tools; if confirmed, this would shift the peer set valuation anchor upward and could partially justify Cognition's 140× ARR multiple relative to Cursor's even higher implied multiple. | Low | SV010, SV012 |
| CV042 | Cognition AI's Mercedes-Benz partnership announcement (April 2026) and Cognizant channel partnership (January 2026) diversify the named customer base beyond Nubank, reducing—but not eliminating—customer concentration risk; the absence of disclosed outcome metrics for these relationships limits the financial materiality assessment. | Medium | SV008, SV022 |