insitro
Real partner proof and meaningful platform ambition, but price discipline still matters more than narrative enthusiasm
Research-more: insitro has real partner proof and a plausible route to premium techbio value, but public evidence does not support underwriting an aggressive private valuation without clean terms and stronger clinic-ready proof.
Cover facts
Company profile
insitro is a South San Francisco AI therapeutics company founded in 2018 by Daphne Koller that has evolved from an ML drug-discovery platform into a private pipeline builder spanning metabolism, neuroscience, and ophthalmology. Public evidence shows stronger external proof than many techbio peers through BMS, Lilly, and earlier Gilead collaboration economics, but the company is still best understood as a late-private, preclinical-to-IND-enabling platform-plus-pipeline biotech rather than a commercial software business.
- Website
- www.insitro.com
- Founded
- 2018-01-01
- Founders
- Daphne Koller
- Founding location
- South San Francisco, California, USA
- Headquarters
- South San Francisco, CA
- Product
- insitro combines multimodal human-cohort and cellular data, machine learning, and internal plus partnered discovery workflows. The visible portfolio spans metabolism, neuroscience, and ophthalmology, while CTRO-1013 remains the clearest internal asset approaching clinic entry.
- Customers
- Public customer proof is concentrated in a small number of large-pharma collaborators rather than a broad software customer base; BMS is the strongest public durability signal, with Lilly as a second major partner and Gilead as historical proof.
- Business model
- The model blends milestone-bearing pharma collaborations with longer-dated internal pipeline option value. Public pricing, renewal, and revenue-recognition details remain opaque, so the business is better framed as a partner-validated techbio platform than as a transparently priced product company.
- Stage
- private, late-preclinical
- Funding status
- Last clearly disclosed financing was the $400M Series C in 2021. Forge estimates that round at roughly $2.57B post-money, while low-confidence public trackers still cluster insitro in the low-$2B range. Management now cites roughly $800M of capital and about $150M of cumulative partnership revenue, but current financing terms and preference structure are not public.
Executive summary
Top strengths
- BMS has provided unusually strong public proof for a private techbio through upfront cash, milestone conversion, extension funding, and additional target nominations.
- Public materials cite roughly $800M of capital and about $150M of partnership revenue, indicating more financial and commercial validation than a typical preclinical platform story.
- The platform-plus-pipeline model gives insitro multiple ways to create value if CTRO-1013 and future internal assets reach the clinic while partnerships continue to expand.
- Governance and talent additions such as Amy Abernethy and Joe Hand suggest a company trying to mature beyond an early narrative-stage startup posture.
Top risks
- Current price, cap table, liquidation preferences, and latest financing terms are not public, so entry discipline cannot be judged directly from public evidence.
- Internal asset value is still preclinical: CTRO-1013 is in IND-enabling / first-in-human preparation rather than human proof.
- Revenue, current cash, and burn remain too opaque for a reliable revenue-multiple or runway-led valuation model, and public trackers conflict sharply.
- Public AI-techbio comparables sit mostly in a $0.5B-$2.5B band, which constrains how much premium can be justified without stronger private proof.
- AI validation, security, and regulatory-readiness materials remain private even though those artifacts will matter for both partner confidence and later-stage financing.
Open gaps
- Current fully diluted cap table and liquidation preference waterfall
- Current cash balance, monthly burn, and downside runway plan
- Any priced financing, secondary mark, or 409A update after the 2021 Series C
- BMS and Lilly contract terms, renewal mechanics, and revenue-recognition treatment
- IND critical path and AI validation package for CTRO-1013 and related workflows
- Security, quality, and audit materials expected by later-stage investors or strategic buyers
Contents
01Company Overview
1.1 Identity, mission, and current stage
insitro positions itself as an AI therapeutics company built on causal biology rather than as a pure software vendor or tools provider. The company was founded in 2018 by Daphne Koller, is headquartered at 279 East Grand Avenue in South San Francisco, and publicly frames its mission as decoding disease with multimodal human and cellular data plus machine learning. That framing matters because insitro is now selling neither a single therapeutic asset nor a horizontal ML platform; it is trying to build a repeatable engine that can move from target discovery into modality-specific drug design and then into internally controlled pipeline assets. The public website and purpose page show a company still anchored in drug discovery and development, not commercialization. It has no marketed product, but it does have a disclosed portfolio of programs and a growing set of data and design partnerships that move it beyond the pre-seed “AI for biotech” phase into a late-private, preclinical-to-IND-enabling operating stage.[CO001, CO002, CO003, CO004, CO005, CO029]
| Metric | Value or status | Date | Confidence | Gap or note |
|---|---|---|---|---|
| Founded | 2018 | 2018 | high | Supported by official purpose page and Forbes profile. |
| Headquarters | 279 East Grand Avenue, South San Francisco, CA | current | high | Official address is clear. |
| Operating stage | Private AI therapeutics company; pre-commercial, no marketed product | 2026 | medium | No public clinical-stage or commercial product disclosed. |
| Publicly disclosed venture funding | $243M by 2020; $400M Series C in 2021 | 2020-2021 | medium | Funding chronology is public, but current total depends on whether partnership cash is counted. |
| Company-reported capital narrative | More than $700M to approximately $800M including partnership cash | 2024-2026 | medium | Not directly comparable with pure equity funding. |
| Tracker valuation range | $2.2B to $2.5B | 2025-2026 | low | Open trackers conflict and no priced 2026 financing is public. |
| Headcount range | ~230 after layoffs; trackers show ~250 to 300 | 2025-2026 | low | Public scale metrics conflict across third-party trackers. |
| Named pharma collaborators | Gilead, Bristol Myers Squibb, Lilly | 2019-2026 | medium | Partner concentration is high in public evidence. |
| Named data or research partners | Genomics England and INSIGHT at Moorfields | 2022-2025 | medium | Important for dataset access and modality expansion. |
| Customer count / debt / secondaries | current | low | No reviewed open source disclosed these metrics. |
This table mixes official disclosures and low-confidence tracker estimates; nulls indicate metrics that remain unsupported in reviewed open sources.
[CO001, CO002, CO011, CO012, CO021, CO022]How insitro’s data, platform, partners, and therapeutic programs fit together as one value-creation loop.
This is a structural representation of the public operating model rather than a process chart disclosed by management.
[CO003, CO004, CO018, CO019, CO020, CO027]1.2 Leadership, governance, and organizational fit
Founder and CEO Daphne Koller remains the company’s central public face and the clearest source of founder-market fit. External coverage ties her credibility to deep machine-learning expertise and the prior creation of Coursera, while company materials emphasize her role in building a cross-disciplinary culture that blends data science, biology, and drug development. Since the 2021 financing cycle, governance has gained more drug-development and clinical evidence depth through board and executive additions. Paul McCracken joined the board through the Series C financing, Amy Abernethy joined in 2024 with FDA and real-world evidence experience, and Joe Hand became chief people officer in 2026 to professionalize talent strategy as the company scales. Those hires signal a company preparing for more mature portfolio management and organizational discipline. At the same time, public sources still do not reveal a full board-control map, succession planning, or preference structure, so governance remains directionally visible but economically opaque.[CO006, CO007, CO008, CO009, CO010, CO033]
| Person | Role | Why it matters | Public support | Key dependency or gap |
|---|---|---|---|---|
| Daphne Koller | Founder and CEO | Embeds machine-learning credibility and is the clearest founder-market-fit anchor for the company. | Forbes 2019/2020 and official site | High key-person dependence; no public succession plan found. |
| Amy Abernethy | Board director (since 2024) | Adds FDA, real-world evidence, and clinical-development depth to governance. | Official board announcement | Board economics and committee structure are not public. |
| Joe Hand | Chief People Officer (since 2026) | Signals transition from research-heavy startup to scaled organization with formal talent architecture. | Official appointment announcement | Appointment does not resolve exact current org size or attrition trend. |
| Paul McCracken | Board director via Series C | Represents CPP-backed governance influence and long-duration capital alignment. | Series C financing announcement | No public detail on voting rights or investor control. |
Coverage is partial and limited to named public leaders or directors that appeared in reviewed open sources.
[CO006, CO007, CO008, CO009, CO010, CO033]1.3 Capitalization, partner economics, and stakeholder map
Open sources show a capital stack built from both equity financings and collaboration cash, which is why public numbers disagree. Forbes reported that the 2020 Series B brought total venture funding to $243 million, and insitro’s 2021 press release documented a $400 million Series C with CPP Investments leading. But later company releases cite more than $700 million and then approximately $800 million of capital when partnership cash is included, while third-party trackers publish inconsistent funding and valuation figures. The cleaner conclusion is that insitro has attracted a top-tier private syndicate and meaningful pharma cash, but not that any single public number should be treated as canonical. The partner map is economically important: Gilead validated the early NASH thesis, Bristol Myers Squibb remains the most financially material neuroscience collaborator, and Lilly broadened the metabolism strategy with both 2024 and 2025 agreements. That concentration helps explain both the upside and the financing risk, because a small set of counterparties drive much of the public evidence for external validation, milestone inflows, and downstream optionality.[CO011, CO012, CO013, CO014, CO015, CO016]
| Stakeholder | Role | Economic importance | Why it matters | Diligence ask |
|---|---|---|---|---|
| Andreessen Horowitz | Early lead investor | Part of the core equity syndicate from early rounds | Signals top-tier tech investor support for the ML thesis. | Confirm current ownership and board rights. |
| CPP Investments | Series C lead and board seat sponsor | Led the 2021 $400M round and placed Paul McCracken on the board | Represents long-duration institutional capital. | Request current ownership percentage and any pro rata rights. |
| Bristol Myers Squibb | Neuroscience partner | Upfront, milestones, and potential multi-billion downstream economics | Most clearly disclosed milestone-bearing pharma relationship. | Request current milestone schedule and option structure by target. |
| Eli Lilly | Metabolic-disease and ADMET-data partner | Provides technology, data, and future milestone or royalty pathways | Broadens metabolism execution and chemistry capabilities. | Request economics by agreement and obligations around rights retention. |
| Gilead | Early NASH validation partner | Helped validate the original platform with data and milestone potential | Demonstrates early willingness of major pharma to pay for the platform. | Confirm whether the relationship is still active and economically relevant. |
| Genomics England | Data and research partner | Gives access to NHS-linked multimodal genomics and pathology data | Important for clinical-data scale and discovery workflow. | Clarify term length, exclusivity, and downstream IP rights. |
| INSIGHT at Moorfields | Data and research partner | Provides 35 million-image ophthalmic dataset for neurodegeneration work | Expands disease scope and foundation-model training data. | Clarify rights to derived models and target insights. |
| CombinAbleAI / AION ecosystem | Acquisition and biologics capability | Adds modality breadth rather than direct cash | Strengthens insitro’s cross-modality design stack. | Request post-close retention and integration plan. |
This is a public stakeholder map, not a cap table. Economic importance is directional because ownership percentages and contractual waterfalls are not public.
[CO013, CO014, CO015, CO018, CO019, CO020]High-level operating indicators that matter most for judging insitro’s current maturity and visibility.
Values mix company-reported ranges and third-party tracker estimates; where figures conflict, the item highlights the range instead of a false point estimate.
[CO021, CO022, CO023, CO024, CO029, CO032]1.4 Milestones and portfolio evolution
The most useful way to read insitro’s trajectory is as a sequence of milestones that converted a data-and-ML thesis into a broader therapeutic portfolio. Early validation came from the Gilead collaboration and the 2020 Bristol Myers Squibb ALS agreement, which showed major pharma willingness to pay for insitro’s discovery engine. The next phase added larger equity backing and research partnerships such as Genomics England, then moved into more explicit pipeline construction with Lilly in metabolic disease and milestone-bearing Bristol Myers Squibb target work in ALS. By 2025 and 2026 the operating model became visibly more industrialized: the company expanded its ophthalmology and neurodegeneration data network through Moorfields, disclosed obesity and MASH programs on the pipeline page, and launched TherML after acquiring CombinAbleAI. The current disclosed portfolio spans metabolism, neuroscience, and ophthalmology, with eight named programs but still no marketed product or public clinical-stage asset. That makes the milestone chronology more important than a conventional commercialization timeline.[CO005, CO014, CO015, CO016, CO017, CO018]
| Date | Event | Type | Amount or status | Participants | Implication |
|---|---|---|---|---|---|
| 2018 | insitro founded in South San Francisco by Daphne Koller | founding | company formation | Daphne Koller | Launches ML-first drug-discovery thesis. |
| 2019-04 | Gilead NASH collaboration publicly described in external coverage | partnership | $15M upfront; up to $1B potential per Forbes | insitro, Gilead | Earliest pharma validation of the platform. |
| 2020-05 | Series B financing | financing | $143M; total VC $243M at the time | insitro, a16z, T. Rowe, BlackRock, Casdin, CPP and others | Capitalized expansion beyond proof-of-concept stage. |
| 2020-10 | Bristol Myers Squibb neuroscience collaboration | partnership | $50M upfront; $20M near-term; >$2B potential milestones | insitro, Bristol Myers Squibb | Creates largest publicly disclosed partner economics. |
| 2021-03 | Series C financing | financing | $400M | insitro, CPP and syndicate | Adds crossover capital and board depth. |
| 2022-03 | Genomics England partnership | partnership | embedding search deployed to NHS-linked dataset | insitro, Genomics England | Expands multimodal clinical-data access. |
| 2024-04 | Amy Abernethy joins board | governance | board addition | insitro, Amy Abernethy | Adds clinical-data and FDA experience. |
| 2024-10 | Lilly metabolic-disease agreements | partnership | rights-retaining collaboration structure | insitro, Eli Lilly | Broadens metabolism pipeline and modality options. |
| 2024-12 | First BMS ALS target milestone payment | scale | $25M milestone | insitro, Bristol Myers Squibb | Demonstrates target-nomination progress. |
| 2025-03 | Moorfields INSIGHT collaboration | partnership | 35M eye-image resource | insitro, INSIGHT at Moorfields | Adds ophthalmic and neurodegenerative data scale. |
| 2025-05 | Workforce reduction | adverse | 22% layoffs; about 230 workers after cut | insitro | Shows capital-discipline pressure before clinic readiness. |
| 2025-10 | BMS ChemML extension | partnership | up to $20M new funding | insitro, Bristol Myers Squibb | Moves ALS work from biology to molecule design. |
| 2026-01 | CombinAbleAI acquisition and TherML launch | product | modality-agnostic therapeutic design platform | insitro, CombinAbleAI | Expands into biologics and adds Israel R&D center. |
| 2026-02 | Joe Hand appointed chief people officer | governance | executive hire | insitro, Joe Hand | Signals organization scaling around talent systems. |
| 2026-02 | BAT study and obesity target disclosure | product | 15% body-weight reduction in mice | insitro | Shows newer metabolic pipeline traction. |
| 2026-03 | BMS collaboration expansion to ALS-2 and ALS-3 | partnership | $10M milestone payment | insitro, Bristol Myers Squibb | Deepens neuroscience pipeline breadth. |
This chronology is limited to milestones that were explicitly visible in reviewed public sources and therefore excludes undisclosed internal or financing events.
[CO001, CO007, CO008, CO010, CO011, CO012]Timeline of public financings, partnerships, governance changes, and adverse events from founding through March 2026.
Amounts and dates reflect public disclosures only; private cap-table events may be missing.
[CO001, CO011, CO012, CO014, CO015, CO016]1.5 Adverse signals and open diligence gaps
The cleanest adverse signal in open sources is the 2025 workforce reduction. BioPharma Dive reported a 22 percent cut that left insitro with roughly 230 workers and explicitly tied the move to clinic readiness and the tougher funding climate. That fact matters because current tracker pages still publish materially different employee counts, funding totals, and valuation estimates, so the company snapshot is less precise than the surface narrative suggests. The same ambiguity applies to revenue quality: low-quality trackers publish revenue estimates, but no reviewed public source provided an audited current revenue mix or balance-sheet bridge. Taken together, the adverse picture is not that insitro lacks technical ambition or partner validation; it is that the public diligence record is thin where an investor would most want precision. Current valuation, customer concentration, debt or secondary history, and exact current operating scale all remain unresolved enough that later chapters should treat them as open underwriting questions rather than settled facts.[CO022, CO023, CO024, CO034, CO036, CO040]
02Market Analysis
2.1 Market boundary, adjacencies, and status-quo substitutes
The first analytical mistake with insitro is to call its market simply “AI in pharma” or simply “obesity drugs.” The company is not yet a commercial drug seller, but it is also no longer a pure tools vendor. Its own platform and pipeline pages show a business model that uses multimodal human and cellular data, machine learning, and design tooling to generate both internal programs and partner-facing discovery output. That means the near-term monetizable market is partnership and milestone economics with pharmaceutical R&D buyers, while the downstream economic upside sits in much larger therapeutic end markets that will matter only if internal assets reach later development. The right boundary therefore has three layers: AI-enabled drug-discovery partnerships, insitro-controlled therapeutic programs in metabolism, neuroscience, and ophthalmology, and data-intensive discovery collaborations that strengthen the platform. The status-quo alternatives are internal pharma discovery teams, conventional CRO and medicinal-chemistry workflows, and competing AI-biopharma platforms such as Recursion, Schrödinger, Relay, and Evotec—not consumer digital-health tools or hospital IT software.[CM001, CM002, CM003, CM004, CM040, CM041]
| Segment / category | Included spend or activity | Excluded spend | Buyer / payer | Relevance to insitro |
|---|---|---|---|---|
| AI-enabled drug-discovery partnerships | Upfronts, milestones, option fees, and research funding tied to target discovery, model building, and program design | Commercial drug sales, hospital software, broad consumer health apps | Large-pharma R&D and BD budgets | Immediate monetization path; this is where insitro signs deals today |
| Internal metabolic-disease therapeutics | Future revenue from obesity, MASLD, and related metabolic programs if assets advance | Primary-care chronic-disease management services or wellness subscriptions | Future prescribers, payers, and specialty pharma channels | Important long-term upside, but mostly pre-commercial today |
| Internal neuroscience therapeutics | ALS, FTD, and related neuro target economics through partnerships or internal assets | Neurology care delivery spend unrelated to drugs | Pharma partners today; clinicians and payers later | Current BMS collaboration makes neuroscience commercially relevant already |
| Ophthalmology and neurodegeneration data layer | Retinal imaging and multimodal discovery inputs that improve target selection and patient segmentation | General hospital IT, PACS software, and routine ophthalmology services | Research partners and future pharma users | Strategic moat and discovery input rather than clean standalone revenue category |
| Status-quo substitute stack | In-house pharma discovery, conventional CRO workflows, medicinal chemistry, and competitor AI platforms | Consumer digital health, billing software, or general hospital analytics | Existing pharma budgets already allocated to incumbent workflows | Defines what insitro must displace to win budgets |
Rows define market boundary logic, not additive TAM buckets. Included and excluded spend intentionally separates present buyer economics from future therapeutic upside.
[CM001, CM002, CM003, CM004, CM041, CM042]Illustrative market layers from broad biopharma budget pools to insitro’s narrower partnership and disease-specific demand lenses.
Layers are evidence lenses, not additive market buckets. Currency and unit differences are preserved instead of normalized because the public sources measure different things.
[CM001, CM002, CM013, CM018, CM022, CM025]2.2 Sizing lenses: AI market forecasts, R&D budgets, and disease-burden demand
Public sizing data exists, but it does not line up neatly enough to produce a single underwritten TAM. Analyst pages for AI in pharma and AI in drug discovery are directionally bullish, with forecasts ranging from low-single-digit billions today to mid-teens or even mid-twenties of billions by the early 2030s. Those estimates, however, mix different category boundaries: some include broad pharmaceutical AI workflows, some isolate drug discovery, and others include the wider precision-medicine software stack. A second lens is the budget pool behind the buyer: EFPIA estimates EUR 55 billion of pharmaceutical R&D spend in Europe alone in 2024, which shows the scale of the buyer base that can fund partnerships. A third lens is demand pull from disease burden. WHO reports 890 million adults living with obesity, insitro’s Lilly release says MASLD affects about 100 million people in the United States, WHO reports 2.2 billion people with vision impairment, and CDC estimates 9.6 million U.S. people with diabetic retinopathy. These figures confirm large demand domains around insitro’s programs, but they are not interchangeable with a software or partnership TAM. The clean conclusion is that insitro’s market must be sized through multiple evidence-constrained lenses, not through one generic headline number.[CM009, CM010, CM011, CM012, CM013, CM014]
| Publisher / lens | Year | Geography | Value | CAGR | Methodology or unit | Confidence | Key limitation |
|---|---|---|---|---|---|---|---|
| McKinsey | 2025/2030 | Global | $4B in 2025 to $25.7B by 2030 | n/a | AI market in pharma | medium | Broad category and future-year forecast, not insitro-specific |
| MarketsandMarkets | 2024/2029 | Global | $1.86B in 2024 to $6.89B by 2029 | 29.9% | AI in drug discovery market | medium | Category boundary narrower than broader AI-in-pharma estimates |
| Precedence Research | 2025/2034 | Global | $1.94B in 2025 to $16.49B by 2034 | 27% | AI in pharmaceutical market | low | Long horizon and broad workflow scope |
| Precedence Research | 2025/2035 | Global | $6.93B in 2025 to $17.81B by 2035 | 9.9% | AI in drug discovery market | low | Different base year and methodology from other analyst pages |
| Grand View Research | 2024/2030 | Global | $2.29B in 2024 to $14.53B by 2030 | 36.23% | AI in precision medicine market | medium | Precision medicine is broader than insitro’s present business model |
| EFPIA | 2024 | Europe | €55B R&D expenditure | n/a | Pharma R&D budget pool | high | Budget pool is not a software or partnership TAM |
| WHO / Lilly / liver sources | 2022-2025 | Global / U.S. | 890M obese adults globally; ~100M U.S. MASLD patients; 10-46% U.S. MASLD prevalence range | n/a | Disease-burden lens | high | Burden does not equal reachable revenue |
| WHO / CDC | 2022-2025 | Global / U.S. | 2.2B vision impairment globally; 9.603M U.S. diabetic retinopathy | n/a | Disease-burden lens | high | Burden confirms demand domain, not insitro capture rate |
This table intentionally mixes budget, software-market, and disease-burden lenses because no reviewed public source isolates a clean insitro SAM. Rows are informative, not additive.
[CM009, CM010, CM011, CM012, CM013, CM014]Range view of public AI-pharma and AI-drug-discovery market forecasts using one consistent unit: USD billions.
Forecast horizons vary from 2029 to 2035. The low and high values are uncertainty bands wrapped around point estimates to visualize spread; they should not be averaged or summed.
[CM009, CM010, CM011, CM012, CM013, CM014]2.3 Buyer, user, payer, and adoption path
insitro’s buyer map changes depending on where one sits in the value chain. In the current partnership-led model, the buyer is usually a pharmaceutical R&D or business-development organization deciding whether insitro’s discovery engine, disease models, or target work justify upfront capital and milestone exposure. The functional users are scientific teams on both sides, while the budget owner is a pharma R&D or external-innovation budget. The adoption path is therefore evidence-heavy: platform validation, disease-model credibility, target nomination, and then negotiated development rights with milestone structures. In a later internal-drug path, the buyer map shifts. Physicians, health systems, and payers become relevant only after clinical proof and regulatory clearance, and that path is still mostly hypothetical for insitro. The data collaborations with Genomics England and Moorfields sit alongside this as enabling inputs rather than direct clean-market outputs. This is why buyer analysis matters so much for valuation: the company’s immediate revenue path is controlled by sophisticated counterparties who demand proof long before patients or payers ever see the product.[CM005, CM006, CM007, CM008, CM035, CM036]
| Segment | Buyer | User | Payer | Workflow context | Budget owner | Adoption trigger |
|---|---|---|---|---|---|---|
| Large-pharma discovery partnerships | Head of external innovation, BD lead, or therapeutic-area R&D leader | Partner scientists, computational biologists, translational teams | Pharma R&D budget | Platform evaluation, target nomination, milestone-bearing partnership | CSO, BD committee, or therapeutic-area budget owner | Convincing data, target, or model evidence |
| Existing neuroscience alliances | Large-pharma partner already in a collaboration | Joint alliance teams and project leaders | Partner R&D and milestone budget | Advancing nominated targets through option or development rights | Alliance steering committee and partner finance | Target selection and milestone readiness |
| Future internal metabolic or neuro assets | Hospital systems and specialty prescribers only after approval | Physicians, care teams, and patients | Commercial insurers, Medicare, other health systems | Clinical adoption and reimbursement after trials | Payer formularies and provider budgets | Clinical efficacy, safety, and label approval |
| Data and research collaborators | Academic or public-data institutions | Joint data science and biology teams | Grant, research, or internal innovation budgets | Dataset access, search tooling, and model-building | Research program owners | Governance, privacy, and scientific utility |
| Competing AI-biopharma ecosystem | Peer buyers seeking partnerships or capital | Internal platform and pipeline teams | VC, public-market, or partner capital | Benchmark for what buyers see as credible category positioning | Board and financing stakeholders | Clinical depth, data scale, or integrated partnering model |
Buyer and payer roles differ sharply between the current partnership path and the future commercialization path; the table preserves that timing split explicitly.
[CM005, CM006, CM007, CM008, CM035, CM040]Matrix of the key segments in insitro’s current and future market paths and the evidence burden attached to each.
Values are qualitative descriptors, not survey measurements. The aim is to show the buyer relationship and proof burden by segment.
[CM005, CM006, CM035, CM040, CM041, CM042]Relative funnel showing how insitro’s current commercial path narrows from data and model value to downstream healthcare payment.
Values are ordinal funnel weights, not empirical conversion rates. They illustrate where the market narrows and where more proof is demanded.
[CM031, CM032, CM041, CM046, CM047]2.4 Growth drivers, adoption constraints, and preserved contradictions
The growth case for insitro’s market is straightforward. Disease burdens in obesity, MASLD, vision, and neurodegeneration are large; pharmaceutical R&D budgets are enormous; and AI is spreading deeper into the medicines lifecycle across discovery, safety, and operations. At the same time, the category’s adoption constraints are unusually important. WHO and EMA both stress governance, risk management, and lifecycle controls. McKinsey says pharma has not yet seen clear proof that AI alone materially shortens timelines or boosts success rates, and KPMG notes that tougher financing conditions pushed buyers toward lower-risk structures. Those facts mean enthusiasm does not automatically convert into premium economics. Investors therefore need to preserve two tensions instead of collapsing them. First, analyst market forecasts are bullish but non-comparable. Second, disease burden confirms demand, but it does not prove that insitro captures software revenue, partnership revenue, or future product revenue at attractive conversion rates. The chapter should therefore carry forward explicit gaps around exact SAM, buyer conversion, and category-level proof of AI productivity.[CM015, CM016, CM017, CM019, CM020, CM021]
| Driver / constraint | Direction | Timing | Implication for insitro | Diligence ask |
|---|---|---|---|---|
| Large global obesity burden and cost | Growth driver | Long-term | Supports metabolic-program relevance and pharma willingness to pursue large chronic-disease markets | What patient subsegments does insitro target first? |
| Large ophthalmic and vision burden | Growth driver | Long-term | Supports continued investment in retinal-data partnerships and neurodegeneration discovery | How does insitro convert eye-data advantage into target or biomarker differentiation? |
| Large pharma R&D budget pools | Growth driver | Near-term | Shows buyers have material budgets if insitro can clear proof thresholds | Which partner budgets are actually accessible to insitro today? |
| AI adoption across discovery, safety, and operations | Growth driver | Near-term | Expands the number of workflows where insitro can look relevant to a buyer | Where is insitro strongest relative to generic AI vendors? |
| Governance and regulatory controls | Constraint | Persistent | Raises validation and documentation burden before buyers will rely on AI outputs | What governance stack does insitro expose to partners today? |
| Lower-risk deal preference after market reset | Constraint | Near-term | Pushes buyers toward smaller upfronts or heavier milestone weighting | How resilient are insitro economics under back-loaded deals? |
| Lack of clear proof that AI alone improves clinical outcomes | Constraint | Persistent | Makes category claims harder to monetize at premium valuations | What evidence shows insitro improves conversion or milestone yield? |
| Long development and reimbursement timelines | Constraint | Persistent | Delays conversion from discovery promise into downstream drug revenue | What is the realistic timeline from platform signal to payer-relevant asset? |
Timing fields are qualitative. Constraints emphasize adoption friction and monetization timing rather than scientific impossibility.
[CM015, CM017, CM018, CM020, CM021, CM025]03Competitors
3.1 Landscape: direct peers, adjacents, substitutes, and likely entrants
insitro does not face a single obvious rival. The competitive set breaks into at least three buckets. First are direct AI-drug-discovery peers that also try to turn proprietary data, machine learning, and wet-lab workflows into therapeutic pipelines or partnerable programs. In reviewed public sources, the clearest names are Recursion/Exscientia, Insilico Medicine, Xaira, Isomorphic Labs, Generate Biomedicines, Valo Health, and BenevolentAI. Second are adjacent substitutes such as Schrödinger and Evotec that solve similar buyer problems through different packaging: physics-based computational discovery, integrated R&D services, or flexible partnering. Third is internal pharma build, which remains the default substitute whenever a large buyer chooses to keep discovery in-house rather than pay an external platform. This matters because it means insitro competes not only on raw science, but also on whether buyers believe its multimodal human-data approach is sufficiently differentiated from a proliferating set of AI narratives and substitute workflows.[CP001, CP002, CP003, CP004, CP011, CP014]
Quadrant view of insitro and major peers by breadth of platform ambition and amount of public proof or validation visible in reviewed sources.
Axes use qualitative 1-10 scores derived from reviewed public evidence. Higher x means broader platform ambition; higher y means more visible public proof, partnerships, or clinical progression.
[CP002, CP006, CP009, CP012, CP017, CP018]3.2 Competitor profiles: scale, funding, and public proof signals
Among direct peers, Recursion/Exscientia is the clearest scale benchmark in public markets: a large biological-data moat, >10 internal programs, >10 partnered programs, and hundreds of millions of dollars of realized partner cash. Insilico looks strongest on public proof cadence among generative-AI peers, with 40-plus total programs, multiple IND-cleared assets, and specific license-out economics disclosed. Xaira and Isomorphic Labs show the strongest public platform ambition among private peers, with Xaira pushing a virtual-cell thesis and Isomorphic extending AlphaFold-derived design into major pharma collaborations and a large funding round. Generate Biomedicines and Valo Health look more differentiated by modality or human-causal-biology angle than by clear public commercial proof. BenevolentAI is still a meaningful category reference because its knowledge-graph heritage made it an early AI drug-discovery leader, yet its strategic overhaul and proposed delisting show how quickly category narratives can compress when execution or market appetite softens. Taken together, the profile set suggests insitro is differentiated, but not the category’s scale or funding leader.[CP005, CP006, CP007, CP008, CP009, CP010]
| Competitor | Category | Public scale / funding signal | Product scope / target customer | Key differentiation | Key limitation vs insitro |
|---|---|---|---|---|---|
| insitro | Reference company | Private; public partner validation through BMS and Lilly; pipeline spans three disease buckets | AI therapeutics platform plus internal/partnered programs for pharma buyers | Multimodal human and cellular data tied to platform-to-pipeline continuity | Not the public capital-scale leader; limited public pricing transparency |
| Recursion / Exscientia | Direct AI platform peer | >50 petabytes; >10 internal and >10 partnered programs; ~$450M realized partner cash; ~800 employees | Broadest AI discovery platform for pharma partners and internal pipeline | Industrialized phenomics + chemistry + partner cash proof | Broad scope may dilute disease-specific focus; no marketed drug disclosed in reviewed source |
| Insilico Medicine | Direct generative-AI peer | 40+ programs; 13 IND-approved pipelines; 20 PCCs from 2021-2024; multiple disclosed deal values | Generative-AI drug discovery for internal pipeline and license-outs | Fast output cadence and unusually visible deal economics | Less obviously differentiated by multimodal disease-data moat than insitro |
| Xaira | Direct frontier-platform peer | Nearly $1B raised; largest publicly available genome-wide Perturb-seq dataset; virtual-cell push | AI-first drug discovery and development across full stack | Virtual-cell ambition and exceptional starting capital base | Still earlier on public clinical proof |
| Isomorphic Labs | Direct frontier-model peer | $600M external round; Novartis, Lilly, and J&J collaborations | AI-powered drug design built beyond AlphaFold for pharma collaboration and internal discovery | AlphaFold-adjacent brand, predictive and generative design engine | Public pipeline detail is thinner than platform ambition |
| Generate Biomedicines | Adjacent biologics generator | 42,000 proteins generated, built, and tested; 140k+ sq ft footprint | Generative biology and protein therapeutics | Strong protein-design and biologics angle | More modality-specific and less like-for-like with insitro’s current public story |
| Valo Health | Adjacent human-data peer | Human-causal-biology and closed-loop chemistry positioning; ecosystem-led model | AI-guided target and molecule discovery with partnerships | Human-data narrative resonates with insitro’s own thesis | Public proof points are thinner in reviewed sources |
| BenevolentAI | Adjacent knowledge-graph peer | Decade of ontology and knowledge-graph investment; strategic overhaul and proposed delisting | AI decision support for life-science R&D and target discovery | Long-standing AI pharma brand and knowledge graph heritage | Adverse restructuring evidence weakens credibility |
| Schrödinger | Computational substitute | Collaborative plus proprietary programs from physics-based platform | Computational discovery for pharma and internal pipeline | Physics-based simulation and established platform identity | Less obviously centered on multimodal human disease data |
| Evotec | Integrated-service substitute | Integrated R&D value chain and flexible partnering model | Partnered drug discovery and development services | Commercially legible service packaging and modality breadth | Different business model and less pure AI moat |
Scale signals are limited to what reviewed public sources exposed. Private company cash and valuation fields remain incomplete unless explicitly visible.
[CP004, CP005, CP006, CP007, CP009, CP011]3.3 Capability, packaging, GTM, and switching-cost comparisons
Capability comparison is less about who “uses AI” and more about what kind of proof each company can attach to that claim. insitro’s public angle is multimodal human and cellular data tied to disease understanding and internal pipeline formation. Recursion’s public edge is dataset and industrial scale. Insilico’s public edge is velocity and repeated program output. Xaira sells a virtual-cell ambition, while Isomorphic Labs sells AlphaFold-adjacent predictive design. Schrödinger is the clearest computational substitute, and Evotec is the clearest integrated-service substitute. Public pricing comparison is much weaker than capability comparison. In reviewed sources, economics are disclosed through collaboration structures, license-outs, milestone packages, and service or partnering posture—not through transparent seat pricing or menu-based packaging. That makes GTM and switching-cost analysis especially important. Buyers can likely multi-home across several platforms before deep integration, but switching costs rise once data, model workflows, or program rights are embedded inside a collaboration. Distribution power, meanwhile, still favors big pharma and later-stage commercial organizations over insitro and most private peers.[CP023, CP024, CP025, CP026, CP027, CP028]
| Capability | insitro | Recursion / Exscientia | Insilico | Xaira / Isomorphic | Adjacents (Generate / Schrödinger / Evotec) |
|---|---|---|---|---|---|
| Primary data moat | Multimodal human and cellular disease data | Large-scale phenomics plus chemistry and patient data | Generative models plus internal and partnered program data | Virtual-cell / AlphaFold-style predictive models | Protein generation, physics-based simulation, or integrated service data |
| Wet-lab integration | Yes, cell-model heavy | Yes, industrial wet lab and automation | Yes, program output plus chemistry engine | Implied full-stack ambition, public detail varies | Varies by player; strong for Evotec, lighter for Schrödinger |
| Internal pipeline ownership | Yes | Yes | Yes | Yes or developing | Varies; strongest for Generate, weaker for service models |
| Public partner proof | BMS and Lilly structures disclosed | Multiple partnered programs and realized partner cash disclosed | Dozens of collaborations and disclosed deal values | Major pharma collaborations disclosed for Isomorphic; Xaira still earlier | Partnership or service posture visible but economics often opaque |
| Modality breadth | Small molecules plus broader design ambition across disease areas | Broad chemistry and platform breadth | Generative chemistry with wide disease scope | Frontier-model breadth; biology-engine angle | Biologics, physics models, or integrated discovery services |
| Commercial packaging visibility | Milestone/rights structures, no seat pricing | Partnered programs and milestone economics | License-outs, co-development, milestones | Funding and collaboration oriented, pricing opaque | Service or software packaging visible conceptually, detailed pricing opaque |
Cells summarize public evidence only. Unknown or thin public proof should not be read as capability absence.
[CP004, CP011, CP014, CP016, CP019, CP020]| Competitor | Public package / economic model | Disclosed proof point | Pricing visibility | Likely buyer | Implication |
|---|---|---|---|---|---|
| insitro | Discovery collaboration with options, milestones, royalties, and retained rights | BMS and Lilly deal structures disclosed at high level | Low | Large-pharma R&D and BD | Value is sold as strategic program access, not software seats |
| Recursion / Exscientia | Partnered programs plus internal pipeline and milestone cash | Nasdaq merger release disclosed >10 partnered programs and ~$450M realized partner cash | Low-medium | Large pharma and public investors | Economic proof is stronger than most peers despite opaque per-program pricing |
| Insilico | License-outs, co-development, milestones, and internal pipeline progression | Up to $66M on ISM8969 co-development; up to $2.1B across three key license-outs | Medium | Pharma partners and investors | Most transparent public economics among private generative peers reviewed |
| Isomorphic Labs | Strategic pharma collaborations plus large financing rounds | Collaborations with Lilly, Novartis, and J&J; $600M external round | Low | Big pharma and Alphabet-backed strategic ecosystem | Packaging skews strategic rather than transparently transactional |
| Schrödinger | Platform plus collaborative and proprietary program model | Collaborative and proprietary programs visible on home page | Low | Pharma R&D and materials/life-science customers | Competes through software-plus-program framing, but detailed public pricing is thin |
| Evotec | Flexible partnering and integrated R&D services | Home page stresses flexible partnering models | Low | Biopharma R&D buyers | Service-style packaging can be easier for buyers to understand than AI-platform claims |
| Xaira / Generate / Valo / Benevolent | Funding- and platform-led narratives with limited disclosed transaction detail | Public proof centers on datasets, platform claims, or strategic changes | Very low | Partners, investors, and future collaborators | Public pricing comparisons remain highly incomplete |
This table compares economic packaging, not list price. Most private peers disclose strategy and collaborations, not apples-to-apples commercial price points.
[CP009, CP010, CP017, CP018, CP028, CP030]Compressed matrix showing which platforms appear strongest on breadth, trust posture, and commercial legibility.
Entries are qualitative and based only on reviewed public evidence. Unknown or weaker public evidence does not prove absence of capability.
[CP026, CP027, CP028, CP029, CP031, CP032]3.4 Moat durability, commoditization risk, and adverse competitive evidence
The strongest case for insitro is that it does not rely on a single algorithmic pitch. Its moat claim combines multimodal disease data, cell-based models, program ownership, and demonstrated partner validation. But that moat is not immune to commoditization. If buyers increasingly view AI-drug-discovery vendors as interchangeable pre-deal, pricing power compresses. If internal pharma build improves, external platform appetite weakens. If peers such as Recursion, Insilico, Xaira, or Isomorphic Labs continue to out-scale insitro on capital, dataset breadth, or public proof cadence, they may shape buyer expectations for the whole category. The adverse evidence matters here. BenevolentAI’s strategic overhaul and proposed delisting show that early AI-drug-discovery leadership does not guarantee durable economics, while Exscientia’s absorption into Recursion shows that consolidation is already underway. The competitive bottom line is therefore nuanced: insitro is differentiated, but its moat will only look durable if later evidence shows better target quality, stronger milestone conversion, or more defensible buyer lock-in than a widening field of AI-biopharma alternatives.[CP036, CP037, CP038, CP039, CP040, CP041]
| Moat or risk | Current direction | Public evidence | Implication for insitro | Diligence ask |
|---|---|---|---|---|
| Multimodal disease-data moat | Potential strength | insitro platform and pipeline framing | Could support differentiated target and patient-selection insight | How unique are datasets relative to peers and partners? |
| Partner validation | Strength with caveats | BMS and Lilly structures disclosed | Helps credibility with buyers, but not yet proof of category-leading outcomes | What milestone conversion has insitro actually achieved versus peers? |
| Capital-scale gap | Risk | Xaira and Isomorphic public funding signals; Recursion public scale | Larger peers may shape category expectations and absorb more failure | What is insitro’s effective runway versus frontier peers? |
| Virtual-cell and AlphaFold displacement | Risk | Xaira and Isomorphic ambitions | Could narrow the distinctiveness of insitro’s own model-centric claims | Where is insitro uniquely better than virtual-cell or structural-AI approaches? |
| Generative-chemistry velocity | Risk | Insilico output cadence and disclosed IND progress | Faster-output peers may win buyer attention if insitro looks slower | How does insitro compare on target-to-candidate speed? |
| Multi-homing and internal build | Risk | KPMG / McKinsey plus substitute set | Buyers may distribute spend across multiple platforms or keep work internal | Which buyers use insitro exclusively versus alongside other platforms? |
| Category consolidation / fragility | Risk | Exscientia absorbed into Recursion; BenevolentAI restructuring and delisting plan | Investor and buyer confidence can compress quickly | Does consolidation increase buyer trust in scaled leaders or open whitespace for specialists? |
| Distribution asymmetry | Persistent risk | Large pharma or service incumbents still control many end channels | insitro remains dependent on partners for downstream reach | What routes to distribution or commercialization remain realistic? |
The register separates genuine moat strengths from areas where public evidence is still too thin to underwrite durability.
[CP021, CP022, CP027, CP028, CP029, CP033]Compact indicators for how secure insitro’s competitive posture currently looks from public evidence.
Values are evidence-backed qualitative indicators, not financial KPIs. Deltas indicate directional pressure on moat durability.
[CP030, CP036, CP037, CP038, CP040, CP041]04Financials
4.1 Revenue model and pricing: bespoke partnership economics, not list pricing
Public evidence supports only one clean monetization conclusion: insitro is not being sold through product revenue or public seat-based software pricing. Instead, value is packaged through bespoke pharma collaborations and milestone-bearing extensions. The 2020 Bristol Myers Squibb deal disclosed $50 million upfront, $20 million in near-term operational milestones, more than $2 billion in downstream milestones, and royalties. By late 2024, insitro had announced a further $25 million BMS milestone; the 2025 ChemML extension could add up to $20 million more funding; and the 2026 target-expansion announcement added another $10 million milestone. Lilly’s 2024 structure shows a different economic design: insitro keeps full global rights while Lilly becomes eligible for milestones and royalties, meaning the partnership can accelerate assets without functioning as a clean annual-revenue stream. Official 2026 company messaging says insitro has generated about $150 million of partnership revenue across BMS, Lilly, and Gilead, but that figure is cumulative and unaudited. The result is a real but hard-to-normalize revenue base: meaningful partner cash, weak public pricing transparency, and no audited product-sales engine.[CI001, CI002, CI004, CI005, CI006, CI007]
| Revenue stream | Public economic signal | Current status | Revenue quality | Main uncertainty | Key source |
|---|---|---|---|---|---|
| BMS 2020 collaboration | $50M upfront, $20M near-term operational milestones, >$2B downstream milestones, royalties | Active and expanded over time | Medium-low today; improves if milestones keep converting | How much has been recognized as revenue versus deferred | BMS 2020 announcement |
| BMS milestone / expansion cash (2024-2026) | $25M milestone in 2024, up to $20M extension funding in 2025, $10M milestone in 2026 | Realized and near-term non-dilutive cash | Medium; real cash but timing and accounting are opaque | Whether extension funding is recognized ratably, milestone-by-milestone, or netted against costs | insitro 2024/2025/2026 BMS updates |
| Gilead historical collaboration | Forbes-reported $15M upfront and up to $1B potential | Historical early validation | Low-medium; credible but old and lightly updated publicly | Current status and any realized milestones are not public | Forbes 2020 |
| Lilly 2024 strategic agreements | insitro retains full global rights; Lilly eligible for milestones and royalties | Strategic enablement, not a simple disclosed annual-revenue stream | Low for current revenue, higher for long-term option value | Near-term cash economics are not transparently disclosed | Lilly 2024 announcement |
| Future own-drug / royalty revenue | Potential from internal metabolic or neuro assets and partner-linked royalties | Still contingent on clinic-readiness and downstream success | Low today; entirely forward-looking | Timeline, ownership splits, and commercialization posture remain private | insitro official materials |
This table distinguishes cash visibility from revenue quality. Real cash events do not by themselves prove recurring or high-quality annual revenue.
[CI001, CI002, CI004, CI005, CI006, CI011]| Package / model | Public price visibility | Disclosed economics | Buyer / counterparty | Recognition implication | Interpretation |
|---|---|---|---|---|---|
| BMS 2020 platform collaboration | Medium | Upfront + milestones + royalties | Large-pharma neuroscience buyer | Multi-element arrangement; cash and GAAP revenue timing may diverge | Classic enterprise pharma collaboration, not software pricing |
| BMS 2025-2026 expansion work | Medium | Up to $20M extension funding plus a $10M 2026 milestone | Existing marquee account | Expansion cash likely tied to specific phases and milestones | Supports land-and-expand motion |
| Lilly 2024 strategic agreements | Low | Rights-retention plus future milestones/royalties | Large-pharma metabolic partner | Economics may be more asset-option-like than subscription-like | Partnership can build option value without clear annual revenue |
| Gilead early collaboration | Low | Forbes-reported $15M upfront and long-tail milestone upside | Large-pharma liver-disease partner | Historical cash may not say much about current run-rate | Useful as early validation, weak as a current annual-revenue anchor |
| Internal program future monetization | Very low | Potential product revenue, royalties, or later licensing | Future payers, partners, or acquirers | No current basis for revenue recognition | Main upside case remains forward-looking |
insitro’s public monetization resembles strategic account economics, not transparent SKU pricing. Price visibility is therefore structurally low.
[CI006, CI007, CI008, CI009, CI010, CI037]Flow view of how insitro turns raised capital and platform investment into collaboration cash today and option-like downstream economics later.
The figure is structural, not a forecast. It maps the value path implied by public disclosures rather than projecting financial outcomes.
[CI001, CI002, CI003, CI006, CI007, CI008]4.2 GTM and sales-efficiency proxies: enterprise pharma accounts and land-and-expand motion
insitro’s go-to-market motion resembles strategic enterprise business development for large pharma rather than broad software distribution. Each disclosed agreement is customized, rights-heavy, and designed around therapeutic programs, not subscriptions. The BMS relationship is especially revealing: discovery collaboration in 2020, milestone conversion in 2024, ChemML extension in 2025, and additional targets in 2026. That sequence looks like land-and-expand selling, where trust is earned through scientific progress and then monetized through follow-on work. KPMG and McKinsey both describe pharma AI adoption as proof-sensitive and partnership oriented, which fits insitro’s public posture as strategic discovery capacity rather than generalized AI tooling. Because no public source discloses customer-acquisition cost, win rate, or payback period, sales efficiency can only be proxied by the ability to deepen a small number of marquee accounts over time. Public evidence says BMS has expanded; it does not yet prove that insitro has a broad, repeatable, multi-customer commercial engine with measurable funnel efficiency.[CI007, CI008, CI009, CI010, CI037]
4.3 Cost structure, gross-margin path, and public traction
Cost structure is easier to infer than to verify. insitro runs wet-lab biology, human-cell data generation, chemistry, and substantial compute, not a lightweight SaaS stack. The 2025 ChemML extension specifically highlighted a 192-H100 GPU cluster, while the company’s broader platform narrative depends on large-scale data generation and internal pipeline work. That means current delivery economics are unlikely to resemble pure software gross margins even if long-run milestone or royalty economics could eventually be attractive. Public traction is similarly uneven. Official materials now cite about $150 million of cumulative partnership revenue, which is meaningful for a private biotech, but third-party trackers disagree sharply on annualized revenue and headcount. GetLatka reports $69 million of 2024 revenue and 262 employees, Usearch lists $7.5 million of revenue and 267 employees, Awaira reports around 300 employees and only a wide estimated ARR range, and WorxForm compresses funding to a single $400 million figure. The safest reading is that partner cash exists and matters, but no clean public source reconciles annual revenue, gross margin, or operating scale.[CI011, CI012, CI013, CI014, CI015, CI016]
| Metric | Public signal | Anchor / comparator | Why it matters | Diligence ask |
|---|---|---|---|---|
| Revenue concentration | Named monetized partner set is concentrated in BMS, Lilly, and earlier Gilead | Not diversified across dozens of customers | A small number of counterparties can drive a large share of economics and strategic risk | Request partner-by-partner revenue and deferred-revenue split |
| Current workforce scale | BioPharma Dive said ~230 employees after 2025 cuts; trackers still show ~250-300 | Tracker disagreement remains material | Personnel is likely one of the largest cost pools, but even the denominator is fuzzy publicly | Request current org chart and fully loaded compensation base |
| Compute intensity | 2025 ChemML update highlighted a 192-H100 GPU cluster | Closer to frontier-model R&D than light enterprise software | Supports the view that compute and experimentation are non-trivial cost centers | Request annual cloud / compute and depreciation spend |
| Peer R&D expense benchmark (2023) | Recursion $241.2M; Schrödinger $181.8M; Relay $330.0M | Public AI-biopharma comps | Bounds what scaled platform-plus-pipeline operations can cost annually | Show insitro platform vs program R&D allocation against these peers |
| Peer operating cash use benchmark (2023) | Schrödinger ~$136.7M; Relay ~$300.3M | Public cash-burn anchors | Shows how quickly cash can be consumed even when some revenue exists | Request insitro monthly cash-burn trend for last 24 months |
| Gross-margin path | No public insitro figure; economics likely between software and wet-lab service models | Public comps still absorb large R&D despite revenue | Without margin disclosure, collaboration revenue quality remains hard to underwrite | Request gross margin by collaboration or program family |
The table intentionally mixes direct insitro signals with peer anchors because insitro-specific unit-economics disclosure is incomplete.
[CI016, CI017, CI018, CI024, CI026, CI028]Structural flow of the main cost buckets and proof points that likely determine insitro’s current unit economics.
Nodes identify cost and value drivers; they do not assign audited dollar amounts because insitro does not disclose them publicly.
[CI009, CI010, CI017, CI018, CI019, CI033]4.4 Capital adequacy and peer capital-intensity benchmarks
Capital adequacy has to be reconstructed from financings, collaboration cash, and cost resets. Public reporting described more than $100 million raised by 2019 and $243 million of venture funding by mid-2020; insitro then disclosed a further $400 million Series C in 2021. By February 2026, the company described itself as backed by roughly $800 million of capital, a larger figure that likely mixes equity with collaboration cash or other capital sources. The 2025 workforce reduction is therefore financially important. BioPharma Dive reported that insitro cut 22% of staff, leaving about 230 workers and targeting operation into 2027. Sector context makes that behavior unsurprising: EY’s 2025 biotech report says follow-on financings were the worst since 2016 and IPO windows remained muted, while layoffs stayed widespread into 2026. Public AI-biopharma comparables reinforce the burn risk. Recursion lost about $328 million on $44.6 million of revenue in 2023, Relay lost $342 million with no product revenue, and Schrödinger still used about $136.7 million of operating cash despite a much larger revenue base. insitro may well be more capital disciplined than those peers, but public data is not strong enough to prove it.[CI003, CI020, CI021, CI022, CI023, CI024]
| Indicator | Public read | Evidence | Implication | Next diligence step |
|---|---|---|---|---|
| Historical disclosed equity financing | >100M by 2019; $243M total VC by 2020; +$400M Series C in 2021 | Forbes 2019/2020 and Series C announcement | Large but still finite private-company capital base | Request full round-by-round financing ledger including any post-2021 capital |
| Current capital claim | ~$800M in capital cited by insitro in 2026 | Joe Hand announcement | Headline capital is larger than disclosed VC rounds alone | Request reconciliation of equity, partner cash, and any other capital sources |
| 2025 restructuring signal | 22% workforce reduction; operate into 2027 | BioPharma Dive | Suggests capital preservation ahead of next proof point | Request the internal runway model before and after restructuring |
| Sector financing backdrop | Follow-on financings worst since 2016; IPOs still muted; layoffs widespread | EY + Fierce | Makes additional financing more conditional on proof and partner support | Request financing options analysis under base / downside market scenarios |
| Public peer cash positions (2023) | Recursion $391.6M; Schrödinger $468.8M; Relay $750.1M | Public filings | Shows how much cash scaled peers still carry while consuming large amounts annually | Benchmark insitro cash needs versus peer burn and pipeline scope |
| Public peer equity values (May 2026) | Recursion $1.73B; Schrödinger $0.95B; Relay $2.46B | CompaniesMarketCap | Frames the valuation band public markets currently assign to comparable AI-biopharma names | Use as a sanity check against any private mark discussion |
Capital adequacy remains directional only because insitro has not published a current balance sheet or explicit burn-rate disclosure.
[CI003, CI020, CI021, CI022, CI023, CI024]Range view of selected public-comp financial metrics that bound how capital intensive AI-biopharma platforms can be.
Values are source-backed public-comp figures, not insitro forecasts. They are used as external bounds for category capital intensity.
[CI024, CI026, CI028, CI029, CI030, CI031]Matrix comparing insitro’s public financial transparency and monetization posture with public AI-biopharma comps.
Values are qualitative descriptors based on public evidence only. "Low" can reflect missing disclosure rather than weak underlying economics.
[CI019, CI024, CI026, CI028, CI029, CI032]4.5 Public financial gaps and unreconciled estimates
From an underwriting perspective, the biggest problem is not the absence of signals but the absence of reconciled signals. Public sources do not disclose current cash, monthly burn, deferred revenue, debt, lease commitments, or program-level spend. Trackers disagree on funding, revenue, valuation, and headcount, and the company’s own capital and partnership-revenue claims are cumulative rather than audit-style period disclosures. Even where public benchmarks exist, they cannot answer insitro-specific questions such as how much collaboration cash is recognized as revenue, what gross margin is earned on active partner work, or how much capital has already been committed to internal clinical programs. Public-comparable filings are useful for bounding category capital intensity, not for replacing company data. The result is that almost every real financial judgment—runway, burn net of partner cash, next-round size, or dilution risk—still depends on private disclosures rather than on a complete public record.[CI016, CI019, CI034, CI035, CI036, CI040]
| Missing metric | Why it is missing | Why it matters | Public proxy today | Exact diligence ask |
|---|---|---|---|---|
| Current cash and unrestricted cash | Private company; no balance sheet published | Runway and dilution risk cannot be independently modeled | Only a press quote about operating into 2027 | Request latest monthly cash report and board runway deck |
| Monthly burn by category | No audited or management P&L detail is public | Gross burn versus partner cash offset is the core financing question | Peer filings give only category-level external bounds | Request 24-month burn history split into R&D, G&A, compute, and facilities |
| Partner-by-partner recognized revenue and deferred revenue | Public sources disclose cash events but not accounting treatment | Revenue quality cannot be judged from milestone headlines alone | Recursion filing shows cash and GAAP timing can diverge | Request collaboration-accounting schedule by counterparty |
| Obligations: debt, leases, compute contracts, equipment financing | No contract schedules or obligation tables are public | Hidden obligations can materially shrink effective runway | Only peer filings show how material these commitments can become | Request full obligations schedule with maturities and covenants |
| Program-level spend and expected use of funds | Internal pipeline budgeting is private | Needed to determine whether platform spend is converting into assets efficiently | Company press releases imply platform and pipeline expansion but not cost allocation | Request by-program budget and next-24-month use-of-funds plan |
| Board-approved financing plan / next-round trigger | No investor letter or financing memo is public | Determines how close the company is to another raise and what milestones matter most | Public proxy is only the 2025 restructuring plus 2026 partner rhetoric | Request financing strategy memo and milestone-based capital plan |
Every row in this table is material to valuation, dilution analysis, or the credibility of any revenue-quality argument.
[CI016, CI034, CI035, CI040, CI041]4.6 Financial verdict: real partner validation, still heavy financing dependence
insitro is not a zero-validation science project. Major pharma has paid it real cash, extended at least one core relationship repeatedly, and given the company a plausible route from platform work toward milestone, royalty, or internally retained asset value. That is stronger than the economics of a purely preclinical biotech with no monetization. But the company is still financing-dependent by public evidence. There is no audited cash position, no clean annual revenue bridge, no disclosed obligation schedule, and no credible public way to underwrite current gross margin or burn. The 2025 restructuring suggests management is still optimizing for proof-point reach rather than for self-sustaining operating leverage. Relative to public AI-biopharma peers, insitro likely has a financially credible platform story, but not one that can yet be underwritten with traditional-model precision. The key diligence blockers remain current cash, true annual collaboration revenue, obligation load, and program-level spend by asset and stage.[CI031, CI032, CI033, CI038, CI040, CI041]
05Product & Technology
5.1 Product definition in workflow terms: discovery system first, product SKU second
Public materials consistently describe insitro less as a software vendor and more as an industrialized discovery system. The company starts with multimodal human-cohort and clinical data, adds internally generated cellular and perturbation data, and uses machine learning to derive causal hypotheses about disease biology. That output is then pushed into a second layer of therapeutic design rather than stopping at target ranking. This matters because it clarifies what the “product” actually is in customer terms: for pharma partners it is a workflow that can move from biology discovery to optimized intervention design; for data partners it can be a specific deployed tool such as embedding search or a co-developed foundation model; and for insitro internally it is the operating system that produces retained pipeline assets. The absence of public pricing, self-serve API surfaces, or packaged software documentation reinforces that the public product surface is workflow-centric and collaboration-specific, not a generalized enterprise SaaS platform.[CE001, CE002, CE003, CE019, CE020, CE029]
| User job | Current workflow problem | insitro solution | Publicly visible benefit | Key limitation |
|---|---|---|---|---|
| Discover ALS disease drivers | Known hypotheses have produced limited progress | Virtual Human + CellML/POSH with BMS | Multiple ALS targets nominated and advanced into modality work | No public human efficacy proof yet |
| Design small molecules for hard targets | ADMET and PK are slow and costly to optimize experimentally | ChemML / TherML active-learning loop | Partner expansions and Lilly ADMET collaboration show partner pull | No public candidate-quality benchmark or success-rate disclosure |
| Advance metabolic RNA and antibody programs | Modality choice can be constrained by internal tool bias | TherML plus Lilly delivery and antibody agreements | insitro retains rights while broadening modality options | Timing to clinic for named assets remains private |
| Explore multimodal NHS cases | Label-based search underuses high-dimensional histopathology and genomics | Embedding search engine for Genomics England | Semantic retrieval inside secure research environment | No public usage or outcome metrics |
| Build ocular biomarker models for neurodegeneration | OCT data are rich but hard to use at scale for target discovery | INSIGHT / Moorfields foundation model collaboration | Access to millions of linked OCT images in secure environment | Still collaboration-stage; no public deployment metrics |
| Generate obesity targets from scalable human phenotypes | BAT is difficult to measure at population scale | ClinML phenotype plus CellML screening and in vivo follow-up | Named preclinical asset BAT-01 with animal efficacy data | Preclinical proof only |
Benefits are limited to what has been publicly described by insitro or its named partners; commercial value capture is still mostly opaque.
[CE001, CE005, CE006, CE015, CE017, CE018]How insitro’s discovery platform moves from data access through target nomination to designed interventions and partner or internal programs.
The flow abstracts multiple partner variants into one canonical operating loop.
[CE001, CE005, CE006, CE013, CE015, CE017]5.2 Module map and operating architecture: from causal biology to modality-aware design
insitro’s architecture is publicly legible as a layered system. At the discovery end, Virtual Human is described as the causal-biology engine, while ClinML and CellML create the phenotypes and cellular evidence that make that causal map actionable. POSH sits inside the cellular perturbation layer as a high-content screening mechanism that preserves phenotypic depth at scale rather than collapsing everything to low-dimensional readouts. Downstream, TherML and ChemML convert that biological insight into intervention design, with active-learning loops tied directly to automated laboratories. The result is not just a collection of models but a pipeline-shaped operating model: discover disease structure, identify leverage points, choose modality, generate compound or oligo candidates, and iterate using experimental feedback. Public GitHub assets and cp-posh resources make slices of this architecture tangible, but the full stack remains partly opaque because only research-facing fragments are open while the most valuable datasets, internal models, and operating metrics remain private.[CE004, CE005, CE006, CE007, CE008, CE009]
| Module / asset | Primary user | Status / maturity | Differentiation | Main diligence gap |
|---|---|---|---|---|
| Virtual Human | insitro disease teams; pharma partners | Active core discovery layer | Causal-biology engine anchored in human and cellular data | No public benchmark for target hit-rate versus alternatives |
| ClinML | Human-data and translational teams | Active research module | Scalable phenotype generation from cohort and imaging data | Data-rights and phenotype-validation terms remain private |
| CellML / POSH | Target-discovery and validation teams | Active research module | High-content perturbation screening with phenotypic depth at scale | No public throughput or cost-per-screen metrics |
| ChemML / small-molecule design | Chemistry teams; BMS; Lilly-linked workflows | Active and partner-validated | QALs, ADMET models, active-learning design loop, large compute | No public lead-to-candidate conversion data |
| Oligonucleotide design stack | Metabolic and ALS program teams | Active / preclinical | AI-guided siRNA design plus delivery-tech integration | Public first-in-human timing and CMC readiness are not disclosed |
| TherML biologics module | Biologics / antibody design teams | Launched 2026 | CombinAbleAI physics-informed optimization for antibodies and other biologics | Post-acquisition integration maturity is not yet externally benchmarked |
| External deployment tools | Genomics England; INSIGHT/Moorfields researchers | Deployed / in joint development | Embedding search and OCT foundation-model workflows in secure environments | Commercial terms, usage metrics, and repeatability are not public |
insitro markets a discovery system rather than a SKU catalog, so modules are reconstructed from public program and platform disclosures.
[CE001, CE004, CE005, CE006, CE007, CE011]| Layer / component | Role | Key dependency | Primary risk |
|---|---|---|---|
| Human cohort and partner data | Supply multimodal clinical and phenotypic inputs | Biobank and partner data rights | Rights fragmentation or restricted reuse |
| Phenotype / representation learning | Build embeddings, foundation models, and scalable human phenotypes | Curated high-quality linked datasets | Drift, bias, or weak transfer outside source cohorts |
| Cell model generation | Create disease-relevant human cellular systems for discovery | Automated labs and differentiated protocols | Biological reproducibility risk |
| Perturbation screening | Measure the effect of genes or perturbations at phenotypic depth | POSH imaging, barcode, and sequencing stack | Throughput and cost are not externally disclosed |
| Therapeutic design engine | Choose modality and optimize interventions for potency plus developability | Compute, partner datasets, QALs, molecular simulation | Compute economics and post-acquisition integration risk |
| Program translation layer | Move outputs into partner or internal asset programs | BMS, Lilly, and internal pipeline execution | Clinical translation and CMC risk |
| Governance layer | Constrain high-stakes AI use with privacy and validation controls | Partner secure environments; regulatory frameworks | Control implementation is not publicly auditable |
Detailed system diagrams are not public; the table reconstructs the operating model from the company’s own module descriptions and partner deployments.
[CE003, CE004, CE005, CE006, CE007, CE011]High-level view of insitro’s product stack from data access through causal discovery to therapeutic design and program delivery.
Layers are reconstructed from public platform and partner materials; internal subservices are not disclosed.
[CE001, CE003, CE004, CE005, CE006, CE011]5.3 Deployment, maturity, and roadmap: real partner use, still mostly preclinical proof
The strongest maturity evidence is not broad customer rollout but repeated use inside specific collaborations and named programs. Bristol Myers Squibb first engaged insitro to build ALS and FTD disease models, then converted that relationship into nominated targets, a ChemML-based molecule-design phase, and finally a multimodality expansion that spans oligonucleotides and small molecules. Lilly likewise broadened from metabolic agreements around siRNA-delivery and antibody discovery into small-molecule ADMET model development. Outside pharma, Genomics England and INSIGHT/Moorfields describe partner-specific deployments of insitro technology inside secure environments, showing that at least some platform components are deliverable to external organizations. The roadmap from late 2025 into 2026 also matters: POSH moved from concept to publication plus public code release, TherML launched as a branded therapeutic-design layer, and BAT-01 emerged as a named preclinical asset from the end-to-end stack. Even so, the maturity ceiling is still clear. Public proof stops mostly at preclinical or collaboration-stage milestones rather than approved products or public human efficacy outcomes.[CE015, CE016, CE017, CE018, CE019, CE020]
| Date / stage | Feature / milestone | Status | Implication | Source |
|---|---|---|---|---|
| 2020-10 | BMS ALS discovery collaboration launched | Completed | External validation of disease-model and target-discovery workflow | BMS 2020 |
| 2024-10 | Lilly metabolic agreements (siRNA delivery + antibody discovery) | Active | Confirms modality expansion beyond target nomination | Lilly 2024 |
| 2024-12 | First BMS ALS target milestone | Completed | Shows target nomination converting into paid follow-on work | BMS milestone 2024 |
| 2025-09 | Lilly small-molecule ADMET collaboration | Active | Adds partner-trained ADMET layer and TuneLab exposure | Lilly small-molecule 2025 |
| 2025-10 | BMS ChemML extension | Active | Moves ALS work from biology discovery into molecule design | BMS ChemML extension |
| 2025-12 | POSH paper and public cp-posh assets | Completed | External technical proof plus open research artifacts | POSH 2025 + GitHub |
| 2026-01 | TherML launch and CombinAbleAI acquisition | Completed | Completes modality-agnostic design story including biologics | TherML / CombinAbleAI |
| 2026-02 | BAT-01 preclinical obesity data | Completed | Named asset demonstrates end-to-end target-to-animal workflow | BAT study 2026 |
| 2026-03 | BMS ALS target expansion and multimodality plan | Active | Shows platform now supports oligo and small-molecule branches against shared biology | BMS expansion 2026 |
Rows are ordered to show capability broadening from discovery partnerships to modality-aware design and named preclinical assets.
[CE015, CE016, CE017, CE018, CE021, CE030]5.4 Differentiation, data moat, and IP: strongest where biology, data density, and modality breadth intersect
insitro’s differentiation does not appear to rest on a single model architecture or a single therapeutic modality. The stronger public story is the combination of massive human-data assets, high-density cellular perturbation systems, and an integrated design loop that chooses and engineers interventions across modalities. POSH gives the company a concrete technical proof point because it shows how insitro tries to break the scale-versus-depth trade-off in phenotypic screening, and the cp-posh repository adds real developer signal that some of this work is reproducible. TherML extends that moat claim by linking target choice to modality choice, which is more ambitious than simply optimizing a small molecule against a target. The Justia patent record also suggests that insitro is building a real image-and-omics IP estate. Still, moat visibility remains incomplete. The most valuable parts of the stack are probably the private datasets, internal feedback loops, and contractual data rights that the public record does not expose cleanly.[CE007, CE008, CE009, CE010, CE011, CE012]
Qualitative assessment of the strength and visibility of insitro’s main modules across scientific proof, external proof, modality breadth, and audit visibility.
Scores are qualitative analyst judgments based only on public evidence and are not substitutes for internal KPI reviews.
[CE007, CE011, CE012, CE019, CE020, CE027]5.5 Trust, safety, privacy, compliance, and quality controls: principles are visible; audits are not
Public trust evidence is directionally positive but incomplete. Genomics England and INSIGHT/Moorfields both emphasize secure research environments and restricted data access, which is meaningful because insitro’s platform touches sensitive clinical and imaging datasets. External frameworks also give a clear view of what “good” should look like: FDA highlights risk-based credibility assessments for AI models used in drug decision-making, EMA emphasizes data governance and lifecycle management, and WHO frames trust and ethics as foundational because technical deployment is moving faster than law. But insitro’s own public disclosures remain thin on operational detail. The published privacy policy is website-focused, not a window into collaboration-data controls, and no sourced materials disclosed SOC 2, ISO 27001, GxP, HIPAA, or HITRUST certification posture. That leaves a familiar diligence conclusion: the company appears to understand the right governance language, but investors still need private evidence to verify whether those controls are implemented at industrial depth across the actual discovery workflow.[CE019, CE020, CE032, CE033, CE034, CE035]
| Control / framework | Status | Scope | Main gap |
|---|---|---|---|
| Secure research environments | Visible in partner materials | Genomics England and INSIGHT/Moorfields data access | Partner-specific controls do not prove enterprise-wide security posture |
| FDA AI credibility guidance | Applicable external framework | AI outputs used in drug regulatory decision-making | No public mapping of insitro controls to FDA framework |
| EMA good AI practice principles | Applicable external framework | Drug-development AI lifecycle, validation, governance | No public implementation detail from insitro |
| WHO ethics and governance guidance | High-level external framework | Trust, equity, and governance for AI in health | Not a company-specific control set |
| Website privacy policy | Publicly disclosed | Site cookies, device data, and user inquiries | Does not cover the governed collaboration datasets that power the platform |
| Public certifications and audits | Not found in reviewed materials | Security, privacy, and laboratory-quality posture | No SOC2 / ISO / GxP / HIPAA / HITRUST disclosure surfaced |
| Open research artifacts | Visible on GitHub | cp-posh datasets, scripts, and model weights | Only covers research subsets, not the full internal platform |
Statuses distinguish between visible partner-specific controls, external frameworks that apply, and true company-specific audit evidence that remains undisclosed.
[CE009, CE019, CE020, CE032, CE033, CE034]Major external and internal dependencies that the public record suggests are critical to insitro’s product and technology stack.
Dependencies are reconstructed from public partner, governance, and platform materials rather than from internal architecture documents.
[CE015, CE016, CE018, CE019, CE020, CE024]06Customers
6.1 Customer base segmentation: a few pharma payers, a few external users, and no broad install base
As of May 2026, insitro’s customer map is narrow. The clearest direct payers are Bristol Myers Squibb, Lilly, and historically Gilead: large-pharma counterparties that use insitro’s platform to discover targets, design modalities, and build ML-enabled chemistry or ADMET capabilities. Separate from those payers are research-environment partners such as Genomics England and INSIGHT/Moorfields, which matter because they show insitro tools can be delivered into real external data infrastructures with strict governance. A third surface is indirect ecosystem reach via Lilly TuneLab and Catalyze360, where insitro-built models may reach biotech users without those users necessarily becoming direct insitro customers. What is absent is just as important: no self-serve product, public pricing, marketplace motion, or long tail of named enterprise accounts surfaced in reviewed sources. Home, platform, purpose, and pipeline materials still describe a company balancing internal programs with partnered programs, so current customer economics depend on a few high-value relationships rather than a broad installed base.[CU001, CU002, CU003, CU004, CU005, CU006]
| Segment | Buyer / user / payer | Use case | Scale | Revenue / strategic value | Main gap |
|---|---|---|---|---|---|
| Large-pharma co-discovery counterparties | Buyer/payer: BMS, Lilly, Gilead; users: pharma R&D and translational teams | Target discovery, modality selection, chemistry, ADMET, disease-modeling | 3 named pharma accounts | Highest visible cash value; majority of disclosed partnership revenue likely concentrated here | No account-level revenue split, renewal terms, or customer-count denominator |
| Research-environment deployment partners | Buyer: partner leadership; users: approved Genomics England and INSIGHT/Moorfields researchers; payer: unclear | Embedding search and foundation-model workflows inside secure environments | 2 named U.K. institutions | Strong strategic proof that tools can run externally on sensitive data | Commercial terms, user counts, and repeat-usage metrics are undisclosed |
| Indirect Lilly TuneLab ecosystem users | Buyer/payer: Lilly and partner biotechs; users: medicinal chemistry and data-science teams | Access to insitro-built ADMET models inside federated Lilly infrastructure | Indirect; partner count not disclosed | Potential path to broader reach beyond direct bilateral deals | insitro may not be the direct commercial vendor to those users |
| Internal insitro pipeline programs | Buyer/payer: insitro itself; users: internal disease and platform teams | Use the platform to advance wholly owned assets and partnered programs | Multiple programs but not customer accounts | Strategically important, but does not diversify external revenue | Consumes capital without proving third-party willingness to pay |
| Broad self-serve or long-tail enterprise customers | Not evidenced publicly | No public product-led or marketplace workflow surfaced | 0 disclosed | None public | Existence of any long tail remains unverified |
The table distinguishes direct cash counterparties from external users and from internal platform consumption so customer quality is not overstated.
[CU001, CU002, CU003, CU004, CU005, CU006]How insitro’s named customer relationships typically move from unmet-science need to deployment, milestone proof, and expansion.
The journey map is reconstructed from public partnership and secure-environment materials rather than from a company-published GTM diagram.
[CU001, CU004, CU006, CU018, CU021, CU026]6.2 Named adoption proof: strongest where counterparties describe real deployments or paid milestone progression
Named customer proof is real, but it is collaboration-led rather than subscription-led. Gilead’s 2019 NASH deal disclosed a three-year term, upfront cash, milestone potential, and rights to advance up to five targets, which is much stronger than a passive logo reference. Bristol Myers Squibb is the best public adoption case: a five-year 2020 collaboration, a $25 million 2024 milestone and first nominated ALS target, a 2025 ChemML extension into small-molecule design, and a 2026 expansion with two additional targets plus another milestone. Lilly is the second major land-and-expand example: three 2024 metabolic agreements broadened in 2025 to ADMET models trained on Lilly data and exposed through Lilly TuneLab. Outside pharma, Genomics England says insitro’s embedding search will be available inside its secure Research Environment, and INSIGHT/Moorfields says a co-developed OCT foundation model is being built on millions of linked images. These are meaningful external deployments, but their public outcomes are still framed as milestones or technical capabilities rather than utilization or ROI.[CU008, CU009, CU010, CU011, CU012, CU013]
| Metric / milestone | Value | Date | Source | Confidence | Implication | Missing denominator |
|---|---|---|---|---|---|---|
| Gilead discovery collaboration launched | 3-year NASH collaboration; up to 5 targets; $15M upfront plus milestone stack | 2019-04-16 | Gilead official + Forbes | High | Earliest clear proof that a large pharma was willing to pay for insitro platform work | No public view of realized milestones or renewal |
| BMS collaboration launched | 5-year ALS/FTD discovery collaboration with upfront and downstream milestone potential | 2020-10-28 | insitro BMS 2020 | High | Establishes BMS as the earliest visible durable pharma account | No disclosed annual revenue contribution |
| Genomics England deployment announced | Embedding search to be made available to Genomics England research partners inside the secure Research Environment | 2022-03-09 | Genomics England official | High | Strong partner-side proof of an external tool deployment on sensitive data | No public user count or query volume |
| BMS first cash conversion | $25M milestone payment and first novel ALS target selected | 2024-12-18 | insitro BMS milestone 2024 | High | Moves BMS from announced partnership to realized economic progress | No view of total contract value recognized to date |
| Lilly relationship initiated | 3 strategic agreements spanning siRNA delivery and antibody discovery in metabolic disease | 2024-10-09 | insitro Lilly 2024 | High | Creates a second major pharma account beyond BMS | No disclosed near-term cash economics |
| Lilly relationship expanded | ADMET models trained on Lilly data; models available to Lilly TuneLab partners | 2025-09-09 | insitro Lilly 2025 | High | Shows early land-and-expand and indirect ecosystem reach | No public count of partner biotechs using the models |
| INSIGHT / Moorfields collaboration announced | Foundation model built on millions of OCT images inside secure environment | 2025-05-05 | INSIGHT collaboration page | High | Shows a second partner-controlled external deployment surface in the U.K. | No public deployment usage or commercialization data |
| BMS expanded again | 2 additional targets nominated and $10M milestone paid | 2026-03-23 | insitro BMS 2026 | High | Makes BMS the clearest public repeat-expansion account | No remaining milestone schedule or term disclosed |
| Aggregate partnership revenue disclosed | ~$150M from BMS, Lilly, and Gilead | 2026-02-26 | Joe Hand 2026 | High | Confirms customers matter economically | Revenue split by account is undisclosed |
The trajectory table focuses on directly observable external milestones and deployments. It keeps customer counts and retention metrics null when they are not disclosed.
[CU008, CU009, CU010, CU011, CU012, CU013]| Customer / partner | Segment | Deployment / use case | Production vs pilot | Outcome / proof | Main limitation |
|---|---|---|---|---|---|
| Bristol Myers Squibb | Large-pharma payer | ALS/FTD target discovery that expanded into ChemML small-molecule design and additional targets | Active strategic program | 2020 launch, 2024 $25M milestone, 2025 extension, 2026 extra targets + $10M milestone | No public revenue split, contract term remaining, or renewal mechanics |
| Eli Lilly | Large-pharma payer | Metabolic disease target and modality agreements plus ADMET / PK model development for small molecules | Active strategic program | 2024 three strategic agreements and 2025 expansion into TuneLab-linked ADMET models | Public duration is still short and economics remain undisclosed |
| Gilead Sciences | Large-pharma payer | NASH disease modeling and target discovery with rights to advance up to five targets | Historical production-like collaboration | Official Gilead terms disclosed upfront cash, milestones, royalties, and a defined three-year term | Current status after the initial term is not publicly visible |
| Genomics England | Research-data partner / external user surface | Embedding search made available inside the secure Research Environment for research partners | Deployed external tool inside partner environment | Partner-side page plus current RE docs confirm a real governed external environment | No public usage, satisfaction, or cash-value data |
| INSIGHT / Moorfields | Research-data partner / external user surface | Co-developed OCT foundation model for neurodegeneration-related discovery inside secure environment | Active joint development in partner-controlled environment | Partner-side materials describe millions of OCT images, secure access controls, and approved-researcher infrastructure | No public repeat-usage, monetization, or renewal metrics |
Rows are limited to externally named counterparties with concrete deployment or economic proof. The evidence is materially stronger than logos alone, but still incomplete on utilization and renewal.
[CU008, CU009, CU010, CU011, CU012, CU013]Generalized flow from opportunity identification to secure deployment and expansion for insitro’s named customer relationships.
The flow abstracts across pharma and research-environment relationships using only steps visible in public announcements and documentation.
[CU004, CU008, CU010, CU014, CU018, CU024]6.3 Retention and durability: BMS is the best signal, Lilly is early, Gilead is opaque, and usage metrics are absent
Retention and durability are where the public record thins materially. No NRR, GRR, churn, NPS, active-customer counts, or account-level revenue-retention metrics are disclosed. The strongest durability proxy is observed continuity: BMS has stayed public and expanded across four distinct proof points from 2020 through 2026, making it the best evidence that insitro can deepen a marquee relationship once scientific trust is earned. Lilly is encouraging but newer, with only a 2024-to-2025 public sequence so far. Gilead proves that large pharma was willing to pay early, but current status after the original term is not visible in reviewed sources. Research-environment partnerships add a different kind of stickiness. Genomics England and INSIGHT both describe secure environments where data stay inside controlled infrastructure, exports are gated, and tools or virtual machines are provisioned for approved researchers. That operating model implies onboarding friction and some switching cost, but it still does not reveal renewal economics, user satisfaction, or repeat-usage quality.[CU022, CU023, CU025, CU026, CU027, CU030]
| Metric | Value / null | Segment | Confidence | Diligence ask |
|---|---|---|---|---|
| Named direct payers | 3 publicly named pharma counterparties (BMS, Lilly, Gilead) | Pharma relationships | High | Request full active-account roster and any dormant or expired accounts |
| Public land-and-expand evidence | BMS 2020→2026 and Lilly 2024→2025 | Pharma relationships | High | Request account-by-account milestone timeline, renewal dates, and expansion pipeline |
| Public contract length visibility | BMS original term 5 years; Gilead original term 3 years; Lilly current term not disclosed | Pharma relationships | High | Obtain current term remaining, auto-renewal terms, and termination rights |
| NRR / GRR / churn / NPS | All external customers | Low | Request revenue-retention cohorts, lost-account history, and satisfaction survey data | |
| Deployment utilization | Genomics England, INSIGHT/Moorfields, Lilly TuneLab users | Low | Request active-user counts, query volume, model calls, and repeat-usage frequency | |
| Procurement / switching friction | Secure environments, Airlock controls, approved export, and provisioned VMs imply moderate stickiness | Research-environment partners | Medium | Quantify onboarding time, revalidation burden, and renewal effort for governed deployments |
Nulls are deliberate where the company has not disclosed retention or satisfaction metrics. Public continuity signals are strongest for BMS and weakest for Gilead’s current-state visibility.
[CU022, CU023, CU025, CU026, CU030, CU031]Qualitative comparison of proof strength across insitro’s named counterparties and external deployment partners.
Scores are qualitative analyst judgments based on public evidence only. Low scores often reflect missing disclosure rather than a known weak relationship.
[CU028, CU029, CU032, CU033, CU034, CU035]Illustrative continuity proxy scenarios for insitro-style relationship types, used only because the company discloses no retention metrics.
These percentages are analyst heuristics, not company-reported retention. They translate the public continuity seen in BMS, Lilly, Gilead, and partner deployments into a comparative durability frame for diligence only.
[CU032, CU033, CU034, CU035, CU043]6.4 Expansion upside exists, but concentration risk dominates the investable customer story
Expansion upside and concentration risk coexist. On the upside, insitro has shown that a customer relationship can move from target discovery into chemistry, modality expansion, or ecosystem exposure: BMS broadened across targets and modalities; Lilly broadened into ADMET and TuneLab; Genomics England can expose insitro search to a wider research network. On the downside, Joe Hand’s 2026 company statement lumps roughly $150 million of partnership revenue into just three names—BMS, Lilly, and Gilead—without revealing the split. That is enough to indicate concentration, but not enough to underwrite quality of revenue. BioPharma Dive’s 2025 layoff coverage, including a 22% workforce reduction and runway-to-2027 language, reinforces the view that insitro still operates like a capital-intensive biotech that depends on a handful of counterparties and internal proof points. KPMG and McKinsey contextualize this as normal for AI-biopharma deals: partnerships are bespoke, proof-sensitive, and integration-heavy. The result is a customer base that is strategically credible but still too opaque to treat as a diversified recurring-revenue engine.[CU036, CU037, CU038, CU039, CU040, CU042]
| Expansion driver | Concentration / execution risk | Impact | Diligence path |
|---|---|---|---|
| BMS land-and-expand across targets and modalities | Highest-quality public customer proof sits in one marquee account | Loss or slowdown at BMS would remove the strongest durability signal and a meaningful share of partnership value | Request BMS revenue share, remaining milestone schedule, and staffing support by program |
| Lilly expansion into ADMET and TuneLab ecosystem | Indirect ecosystem reach may not diversify direct payers | Can broaden platform exposure, but may still tie economics to one pharma sponsor | Request direct versus indirect economics, user counts, and roadmap for additional Lilly-linked programs |
| Gilead-style discovery economics | Historical proof may not reflect current revenue quality | Useful as early validation, but opacity around current status weakens durability analysis | Request realized milestones, current obligations, and any surviving royalty or co-development rights |
| Genomics England and INSIGHT secure-environment deployments | Strong strategic proof but unclear monetization | Validates external usability on sensitive data but may not translate into meaningful cash revenue | Request commercial terms, hosting obligations, and usage metrics for each deployment |
| Aggregate partnership revenue concentration | ~$150M disclosed across just three counterparties with no split | Any partner pause could materially affect cash planning and perceived momentum | Request top-customer contribution percentages and forward revenue concentration scenarios |
| 2025 restructuring and runway framing | Support capacity and account focus may tighten under capital discipline | Could increase prioritization risk across accounts and internal programs | Request post-restructuring account coverage plan, BD pipeline, and support SLA commitments |
This table intentionally pairs upside with concentration risk because insitro’s best customer story is also its clearest exposure.
[CU036, CU037, CU038, CU039, CU040, CU042]07Risks
7.1 Severity-ranked risk posture: translation and AI-governance risks dominate the stack
insitro’s risk stack is led by two tightly linked issues: getting AI-enabled discovery into the clinic, and doing so under a regulatory environment that is getting more explicit about context of use, data governance, lifecycle management, and credibility evidence. Public proof still sits at the preclinical and collaboration stage. The MASLD update frames CTRO-1013 as moving through IND-enabling work toward first-in-human, while layoffs and runway language show that timing matters financially as well as scientifically. This creates a compounding structure: if translation slows, milestone timing and financing flexibility can deteriorate together. The customer side does not offset this risk because partnership validation is concentrated in BMS, Lilly, and Gilead rather than spread across a broad portfolio. The chapter therefore treats clinic-readiness risk, AI-governance risk, and concentration-financing risk as the three most important categories, with data-rights, quality-certification opacity, and people-execution risk as amplifiers rather than independent distractions.[CR001, CR002, CR003, CR011, CR019, CR025]
Relative likelihood, impact, mitigation maturity, and residual severity across the main insitro risk buckets.
Scores are qualitative analyst judgments based on public evidence only. Missing disclosure often drives low mitigation-maturity scores.
[CR001, CR011, CR019, CR025, CR039, CR040]7.2 Regulatory, legal, and data-governance risk: visible principles, limited company-specific evidence
FDA and EMA now provide a reasonably clear picture of what “good” looks like for AI in drug development. FDA points to a significant increase in AI-related submissions, a risk-based regulatory framework, and the importance of context of use. EMA explicitly extends its reflection paper from drug discovery through post-authorisation and highlights bias, patient safety, privacy, and data governance. For US-facing programs, those expectations sit next to formal legal rules around IND submissions and electronic records. insitro’s public record, by contrast, does not show a mapped validation package against those expectations. The legal picture is similarly mixed. The website privacy policy is narrow and site-specific; it is not a substitute for partner-data governance documentation. Genomics England and INSIGHT show serious secure-environment controls, but those controls live in partner environments and may also restrict data portability and reuse. Patents indicate a real IP surface, yet freedom-to-operate, licensing encumbrances, litigation history, and indemnity terms remain private. The result is not a red flag of known noncompliance, but a material visibility gap in exactly the areas regulators and sophisticated partners care about most.[CR005, CR006, CR007, CR008, CR009, CR010]
| Risk / rule / legal issue | Jurisdiction | Current public status | Likelihood | Severity | Mitigation | Residual exposure | Diligence path |
|---|---|---|---|---|---|---|---|
| AI model credibility and regulatory evidence package | US / EU | FDA and EMA expectations are visible; insitro-specific mapped package is not public | High | High | General FDA/EMA principles and internal expertise additions | High | Request regulator-facing AI validation package and context-of-use mapping |
| IND / clinic-entry readiness for lead assets | US | Directional proof only: IND-enabling and clinic-readiness language, no public filing or dosing milestone | High | High | Program progress, partner validation, and board expertise | High | Request IND/CTA status, critical-path plan, and remaining blockers |
| Partner-data privacy and controlled-access constraints | UK / EU / US | Secure environments and export controls are visible | Medium-high | High | Governed environments at Genomics England and INSIGHT reduce raw-data leakage risk | Medium-high | Request data-rights, reuse, and export-governance terms for key collaborations |
| Quality / security certification opacity | Global | No public SOC 2 / ISO / GxP / HIPAA / HITRUST evidence surfaced | Medium-high | High | Reasonable precautions language and partner controls | High | Request audit artifacts, incident history, and quality-system documentation |
| IP / FTO / litigation opacity | Global | Patent surface visible; FTO, encumbrances, litigation history, and indemnities not public | Medium | Medium-high | Growing patent portfolio | Medium-high | Request FTO review, material-license schedule, and legal-risk summary |
Rows are ordered by residual severity rather than by how easy they are to describe publicly.
[CR005, CR007, CR008, CR009, CR010, CR012]How insitro’s major risks can cascade into milestones, customers, financing, and valuation.
The DAG highlights the most important public transmission paths rather than every conceivable operational interaction.
[CR011, CR015, CR019, CR025, CR035, CR040]7.3 Operational, partner, and financing dependency risk: the model is powerful, but tightly coupled
insitro’s operating model is not a lightweight software business. It combines multimodal data generation, wet-lab workflows, heavy compute, proprietary partner datasets, and an internal pipeline that competes for capital with external partnership work. Public sources make this visible in pieces: the BMS ChemML extension references QALs, ADMET models, and a 192-H100 cluster; Lilly small-molecule work depends on Lilly’s proprietary data; Genomics England and INSIGHT deployments run inside governed partner environments; and Joe Hand’s company statement still concentrates partnership revenue in three names. That means several dependencies can fail together. A partner delay can affect data access, milestone timing, and strategic validation at the same time. A slowdown in clinic readiness can worsen both fundraising needs and customer leverage. BioPharma Dive and Fierce add important context: layoffs are not unique to insitro, but the sector backdrop reinforces that capital and execution slack remain limited. Operational risk here is therefore less about one dramatic failure and more about a tightly coupled system with limited room for error.[CR004, CR014, CR015, CR019, CR020, CR021]
| Failure mode | Likelihood | Severity | Mitigation maturity | Residual exposure | Unresolved gap |
|---|---|---|---|---|---|
| Preclinical findings fail to convert into IND-ready or clinically useful programs | High | High | Medium | High | No public IND or human-dosing milestone yet |
| Compute, data-generation, and wet-lab costs outpace milestone timing | Medium-high | High | Medium | Medium-high | No public unit-economics, burn, or throughput metrics |
| AI model validation, documentation, or performance evidence proves insufficient for regulators or partners | High | High | Low-medium | High | No public mapped validation package or audit trail |
| Secure-environment deployment friction slows partner execution or data reuse | Medium | Medium-high | Medium | Medium-high | Export controls and partner-side governance can limit agility |
| Security and quality posture remains under-audited in the public record | Medium-high | Medium-high | Low | High | No public certification, incident, or quality-system disclosure |
Operational risk is not just about science; it is the combination of science, documentation, infrastructure, and timing.
[CR001, CR003, CR010, CR014, CR015, CR016]| Dependency | Counterparty | Role | Concentration | Failure scenario | Severity | Mitigation | Residual exposure |
|---|---|---|---|---|---|---|---|
| Strategic neuroscience partner | Bristol Myers Squibb | Validation, milestone cash, target expansion, modality branch | High | No further target progress or milestone conversion | High | Deep multi-year relationship and repeated progress | High |
| Metabolic and chemistry partner | Lilly | Data, modality support, TuneLab ecosystem reach | High | Lilly deprioritizes programs or limits ecosystem access | High | Multiple agreements and expanded scope | Medium-high |
| Historical large-pharma validator | Gilead | Early external proof of willingness to pay for platform work | Medium | Relationship has effectively ended or contributes little going forward | Medium-high | Historical validation only | Medium-high |
| Governed data and deployment surfaces | Genomics England / INSIGHT | Sensitive-data access, secure external deployments, partner proof | Medium | Data-rights limits or export controls impede reuse and scale | Medium-high | Secure environments reduce leakage risk | Medium-high |
| External financing and milestone funding | Private investors and partner cash | Funds clinical transition, compute, labs, and hiring | High | Capital becomes more expensive before clinic proof improves | High | Historic capital base and partnerships | High |
| Regulatory acceptance | FDA / EMA | Set expectations for AI credibility and evidence | High | Insufficient documentation slows or blocks filing progress | High | Published guidance and engagement pathways exist | High |
Dependencies are listed by the magnitude of the damage their failure could cause, not by contractual status alone.
[CR019, CR020, CR021, CR022, CR023, CR026]7.4 People, execution, and kill criteria: mitigations exist, but the burden of proof is still ahead
There are real mitigants in the public record. Amy Abernethy adds clinical-development and former FDA experience at the board level. Joe Hand’s appointment signals that the company is thinking explicitly about talent strategy as it moves into a more operationally demanding phase. Secure environments at partner institutions show that some high-stakes governance controls are already part of how insitro works externally. But none of those mitigants eliminate the core burden of proof. The company still needs to show that it can move programs into the clinic, produce validation documentation that sophisticated partners and regulators will accept, and do so without further erosion of execution capacity or partner concentration. For diligence, the most useful monitorable triggers are not vague impressions of progress but concrete events: filing status, first-in-human readiness, additional layoffs, missing validation packages, absence of audit evidence, and partner non-renewal or de-prioritization. Until those are clarified, the right stance is to treat leadership additions as meaningful but incomplete mitigation against a still-high residual risk profile.[CR007, CR008, CR026, CR027, CR028, CR029]
| Role / function | Dependency or gap | Likelihood | Severity | Mitigation | Diligence path |
|---|---|---|---|---|---|
| Clinical development and regulatory execution | Transition from discovery-heavy organization to clinic-ready operating model is not yet publicly proven | Medium-high | High | Amy Abernethy board addition and explicit clinic-readiness focus | Request org chart, external CRO/regulatory advisors, and IND workstreams |
| Founder-led strategy concentration | Daphne Koller remains central to scientific and strategic narrative | Medium | Medium-high | Board expansion and senior hires | Request delegated decision rights and succession planning |
| Cross-functional AI / biology / chemistry talent retention | Layoffs reduce slack in a talent-scarce operating model | High | Medium-high | CPO hire and public talent-strategy messaging | Request attrition data, critical vacancies, and post-layoff hiring plan |
| Governance and policy depth | Need sophisticated oversight on AI, clinical evidence, and health-policy interfaces | Medium | Medium | Abernethy adds former FDA and clinical research leadership | Request board committee structure and external advisory support |
| Organizational scaling discipline | Platform, internal assets, and partnerships all compete for attention and capital | Medium-high | Medium-high | Transparent culture messaging and leadership additions | Request portfolio-prioritization framework and resourcing model |
People risk is amplified because insitro’s differentiator depends on integrating functions that are usually siloed in biotech and software companies.
[CR027, CR028, CR029, CR030, CR031, CR039]| Risk | Monitorable trigger | Threshold / event | Action implication |
|---|---|---|---|
| Clinical translation delay | IND / first-in-human readiness | No filing or human-dosing milestone by end-2027 | Move to high-alert / avoid underwriting platform premium |
| AI credibility gap | Validation package availability | Management cannot provide mapped controls against FDA / EMA principles | Treat AI-governance risk as unresolved blocker |
| Quality / security opacity | Audit evidence | No meaningful security, quality, or incident documentation in diligence room | Assume enterprise readiness is unproven |
| Partner concentration | BMS / Lilly milestone and renewal cadence | No further milestone or expansion progress across top partners | Increase concentration discount and financing caution |
| Execution-capacity erosion | Workforce and hiring trend | Another material layoff round or visible inability to fill critical clinical roles | Downgrade confidence in clinic-readiness timeline |
| Data-rights constraints | Contractual data-use terms | Restrictions prevent meaningful reuse, model updating, or deployment scale | Reduce moat and scalability assumptions |
Triggers are chosen for observability: they can be checked in diligence, future disclosures, or partner updates rather than inferred only from narrative tone.
[CR034, CR035, CR036, CR037, CR038, CR040]Critical counterparties and infrastructure dependencies around insitro’s platform, programs, and clinic-readiness path.
Counterparties are grouped at the level that matters for diligence: named partners, regulators, and core enabling infrastructure classes.
[CR014, CR015, CR019, CR020, CR021, CR023]08Valuation
8.1 Recommendation and valuation framework: price discipline comes before enthusiasm
The first question is not whether insitro is impressive. It clearly is. The question is whether public evidence is sufficient to price it. Public sources show real positives: marquee partners, repeated BMS expansion, roughly $150 million of cumulative partnership revenue, and roughly $800 million of capital raised or cited. But they do not show the current entry price, preference stack, current cash balance, reliable annual revenue, or a human-data proof point for internal assets. That makes a hard buy or hard avoid call look forced. The honest public-evidence stance is research-more and price-sensitive. The right framework is therefore not a single-point target return but a set of reference ranges built from public comps, the last known financing anchor, and proof-state assumptions. If the company is being offered near the public-comp cluster plus a rational premium, diligence may be worthwhile. If it is already being priced like a de-risked frontier platform, the public record says slow down.[CV001, CV004, CV005, CV006, CV007, CV008]
| Dimension | Assessment | Confidence | Decision implication |
|---|---|---|---|
| Overall recommendation | research-more / track | Medium | Continue only if price and terms can be diligenced against the public-evidence range. |
| Risk rating | High | High | Translation, regulatory, concentration, and financing risk remain additive rather than offsetting. |
| Valuation stance | Price-sensitive; base reference range about $1.5B-$2.5B | Medium | Do not underwrite a premium valuation from narrative strength alone. |
| Entry discipline | Interesting at or below about $2.0B effective post-money if terms are clean | Medium | Below that range, deeper diligence can be justified; around $2.57B only with stronger private proof. |
| Confidence in public evidence | Moderate on partner proof, weak on economics and terms | Medium | Strong enough to set guardrails, not strong enough to price the actual round. |
All valuation references are equity-value heuristics from public evidence, not a substitute for a term sheet or full cap-table model.
[CV001, CV022, CV025, CV026, CV027, CV028]Flow chart linking insitro’s strongest public positives, the missing valuation inputs, and the resulting price-sensitive recommendation.
The flow is intentionally decision-oriented rather than exhaustive. It shows the dominant logic for an investor asked to price a private round from public evidence.
[CV001, CV002, CV003, CV004, CV025, CV026]8.2 Investment thesis and anti-thesis: real partner proof, still incomplete economic proof
The positive case for insitro is not hypothetical. BMS has supplied the strongest public validation sequence in the file: upfront economics, milestone conversion, platform-extension funding, and new target nominations. Lilly adds a second major relationship and keeps open a route to substantial downstream option value. The company also benefits from governance and talent signals, including Amy Abernethy’s board role and Joe Hand’s talent mandate. Those are not trivial assets. The anti-thesis is that the most important value drivers remain either preclinical or private. CTRO-1013 is still moving through IND-enabling work, not through human proof. Revenue visibility is poor enough that public trackers disagree wildly. Contract terms, data rights, preference overhang, and the current cash-and-burn picture are not public. In other words, insitro may be a good company, but the public record still prices promise more easily than it prices proven economics. That distinction should dominate the investment posture.[CV002, CV003, CV010, CV011, CV012, CV013]
| Dimension | Thesis | Anti-thesis | What changes the view |
|---|---|---|---|
| Partner proof | BMS and Lilly relationships show real external validation and paid strategic demand. | Only BMS shows repeated public expansion; Lilly economics are still less visible and Gilead is mostly historical. | More public partner conversion, renewal, or new counterparty proof. |
| Internal pipeline | CTRO-1013 gives insitro a route to proprietary asset upside beyond services-like partnerships. | CTRO-1013 remains preclinical / IND-enabling, so internal asset value is still highly risk-adjusted. | Clear IND timing, FIH status, or early human proof. |
| Economics visibility | Cumulative partnership revenue and large capital raised are real positives. | Current cash, burn, annual revenue, and contract accounting are too opaque for clean underwriting. | Current cash and burn model plus contract economics. |
| Comparable context | insitro may deserve a premium to weaker or less validated public techbio names. | Public comp cluster still sits well below frontier-AI narratives. | Proof that insitro deserves to sit above Relay-like public context. |
| Financing resilience | Large historical capital base and restructuring discipline may buy time to reach proof points. | If runway still ends before clinic proof, investors may face pricing pressure and preference-heavy financing. | Updated runway model and proof-point calendar. |
| Governance / leadership | Board and leadership additions strengthen execution and regulatory judgment. | Governance cannot substitute for price, term, or clinical proof. | Visible execution milestones plus clean financing terms. |
The table separates company quality from investability at a specific private price.
[CV002, CV003, CV010, CV011, CV012, CV013]8.3 Financing context and entry discipline: old anchor, noisy trackers, limited margin for error
The cleanest financing anchor remains the 2021 Series C. insitro officially disclosed the $400 million raise, and Forge estimates the post-money at roughly $2.57 billion. Beyond that, public valuation visibility becomes much softer. GetLatka and Awaira both place insitro in the low-$2 billion range, but they conflict on revenue, funding totals, and even basic round details. Usearch cuts revenue down to $7.5 million while GetLatka claims $69 million. That level of disagreement is not a small accounting issue; it means public trackers are useful only as rough signposts. The practical result is an entry-discipline rule rather than a target price. Below about $2.0 billion effective post-money with clean terms, the opportunity begins to look interesting enough for deeper work. Around the old $2.57 billion anchor, the public record supports only modest upside. Above roughly $3 billion, investors are paying for private proof the public file does not contain.[CV003, CV005, CV006, CV007, CV008, CV009]
8.4 Comparable context and scenario ranges: public techbio marks set the guardrails
The strongest public protection against overpaying is the comp set. As of May 2026, Eikon trades around $0.54 billion, Absci around $0.90 billion, Schrödinger around $0.95 billion, Recursion around $1.73 billion, and Relay around $2.46 billion. Those are imperfect peers, but each offers an instructive constraint. Eikon shows how fast public markets can reset an early proof story. Absci shows that AI-biology platforms can remain sub-$1 billion. Schrödinger shows that even a more monetized computational platform does not receive a software-style premium. Recursion shows that scale, capital, and partnerships do not automatically sustain a very high multiple. Relay shows that clearer clinical assets can push a company toward the top of the public range, but still not into unconstrained territory. Against that backdrop, insitro’s public-evidence base case belongs around $1.5-2.5 billion, its bear case around $0.8-1.3 billion, and its bull case around $3.0-4.5 billion only if clinic entry and partner conversion improve materially.[CV016, CV017, CV018, CV019, CV020, CV021]
| Scenario | Key assumptions | Valuation range ($B) | Probability signal | What would confirm / break it |
|---|---|---|---|---|
| Bull | CTRO-1013 enters clinic on schedule; BMS or Lilly convert further; no additional major workforce reset; terms are clean. | $3.0-$4.5 | 20-25% | Confirm with IND/FIH progress and partner expansion; break with delay or punitive terms. |
| Base | Partner economics hold; clinic timing remains directional but unproven; price lands around the low-$2Bs; preferences are manageable. | $1.5-$2.5 | 40-45% | Confirm with stable runway and no deterioration in partner proof; break with stalled timing or hidden preference burden. |
| Bear | No clinic proof by end-2027; partner momentum plateaus; financing terms reveal heavy dilution or strong preferences. | $0.8-$1.3 | 30-35% | Confirm with delay, layoffs, or harsh terms; break only if proof improves faster than expected. |
Probabilities are analyst signals, not statistical outputs. Ranges are meant to bound plausible public-evidence outcomes, not to imply market precision.
[CV022, CV023, CV024, CV025, CV026, CV027]| Comparable | Current public value | Status / proof state | Relevance to insitro | Limitation |
|---|---|---|---|---|
| Recursion | ~$1.73B | Broad AI drug-discovery platform with public filings, partnerships, and multiple programs | Useful upper-mid public platform anchor with more disclosure than insitro | Still not directly comparable on terms, modality mix, or current proof state |
| Schrödinger | ~$0.95B | Computational platform with meaningful software/services revenue plus pipeline optionality | Shows that even monetized platform models can trade at modest public valuations | More software revenue and a different business mix than insitro |
| Relay Therapeutics | ~$2.46B | Clinical oncology platform with clearer human-data and asset-specific valuation logic | Useful upper-end public anchor when asset proof is clearer | More clinical and oncology-specific than insitro’s current public record |
| Absci | ~$0.90B | AI biology platform with public-market history and still-modest valuation | Useful lower-bound techbio platform anchor | Different modality focus and commercialization path |
| Eikon Therapeutics | ~$0.54B | Recent IPO therapeutics platform that public markets repriced sharply downward | Useful cautionary comp for early-proof public appetite | More directly clinical and newly public, so not a clean platform analogue |
The set is intentionally public and priceable. It does not claim that insitro should match any one comp exactly.
[CV016, CV017, CV018, CV019, CV020, CV021]Bar chart comparing current public techbio market caps with insitro bear, base, and bull midpoints plus the last-known financing anchor.
Values are approximate equity values in $M. insitro bars are reference scenarios, not observed market marks.
[CV016, CV017, CV018, CV019, CV020, CV021]Range chart showing public comp cluster, last-known financing references, and insitro bear/base/bull equity-value ranges.
Values are in $M equity value. The financing-anchor row is noisy because tracker methodologies differ and current terms are not public.
[CV007, CV016, CV022, CV023, CV024, CV025]8.5 Exit readiness, thesis-break triggers, and final verdict
On public evidence, insitro is not IPO-ready. There is no public price history for the private shares, no audited public financial package, no clean current cash and burn view, and no internal asset with visible human data. The nearer-term value-realization path is therefore more likely to run through continued partner expansion, strategic interest, or a later private financing rather than a straightforward public-market underwriting story. That conclusion sharpens the final diligence agenda. Investors need the current cap table and preference stack, current burn and runway model, contract terms and revenue-recognition logic for the major collaborations, the IND critical path for CTRO-1013, and the AI validation and security materials that later-stage counterparties will expect. The key thesis-break triggers are similarly concrete: no IND or first-in-human progress by end-2027, another major workforce reset, visible partner non-renewal, or a term sheet that embeds heavy dilution above a premium valuation. Final verdict: research-more, with high risk and medium confidence, and only under disciplined pricing.[CV028, CV029, CV030, CV031, CV032, CV036]
| Trigger | Threshold / event | Transmission to thesis | Action implication |
|---|---|---|---|
| Clinic timing slips materially | No IND or first-in-human progress by end-2027 | Reduces proprietary-asset option value and increases financing pressure | Move toward bear case / pass unless price resets meaningfully |
| Another major workforce reset | A further material layoff or visible hiring freeze in critical functions | Signals weaker runway or execution capacity than assumed | Increase downside weighting and question proof-point reach |
| Partner momentum stalls | No new BMS/Lilly expansion, explicit partner de-prioritization, or non-renewal | Weakens the strongest external validation pillar | Raise concentration discount and lower base-case range |
| AI / security diligence fails | Management cannot provide validation, governance, or security materials | Keeps regulatory and enterprise-readiness risk unresolved | Treat as a hard blocker for premium pricing |
| Price or terms become punitive | Round priced above ~$3B or with heavy participating preference / anti-dilution protection | Eliminates margin of safety | Pass unless private proof is materially better than public evidence |
| Runway is shorter than believed | Current cash cannot comfortably reach the next proof point | Raises forced-financing and dilution risk | Re-cut valuation toward bear case immediately |
Triggers are chosen because they can be checked in diligence, management updates, or future disclosures rather than inferred only from narrative tone.
[CV030, CV035, CV036, CV043, CV044]| Topic | Missing evidence | Why it matters | Owner / diligence path |
|---|---|---|---|
| Cap table and preferences | Current fully diluted capitalization and preferred-stock rights | Determines whether nominal upside is actually available to a new investor | Request board deck, cap table, and waterfall analysis |
| Cash, burn, and runway | Current cash balance, monthly burn, and downside runway cases | Determines whether the next financing is forced or optional | Request finance model and proof-point budget |
| Latest priced round or secondary signal | Any priced transaction, secondary mark, or 409A update after 2021 | Anchors real entry price rather than stale round mythology | Request financing history and valuation memos |
| Partner contract economics | Renewal, exclusivity, data-rights, output-ownership, and revenue-recognition details | Separates durable economics from milestone optics | Review BMS, Lilly, and any current Gilead-related agreements |
| IND and AI validation path | Regulator interactions, IND timeline, and AI validation package | Primary determinant of upside from platform to therapeutics value | Request development and regulatory diligence pack |
| Security and quality posture | Audit evidence, certifications, incident history, and enterprise controls | Affects partner trust, investor diligence conversion, and exit readiness | Request security / quality diligence room materials |
These asks are intentionally chosen for decision leverage: each one can move price, risk rating, or recommendation materially.
[CV003, CV009, CV036, CV037, CV043, CV044]IC-style scorecard for insitro on market, partner proof, internal proof, economics visibility, risk, valuation support, and exit readiness.
Scores are on a 0-10 scale and reflect investability at an unknown private price, not absolute company quality.
[CV002, CV003, CV011, CV014, CV022, CV028]8.6 Exhibits
Disclaimer
This report was generated for diligence research purposes using publicly available information as of May 12, 2026. It does not constitute investment advice. Private-company valuation, financing, and contractual conclusions should be verified against primary diligence materials.
Evidence index
| ID | Statement | Confidence | Sources |
|---|---|---|---|
| CO001 | insitro was founded in 2018 by Daphne Koller to apply machine learning and multimodal data to drug discovery. | High | SO002, SO016 |
| CO002 | insitro’s public headquarters is 279 East Grand Avenue in South San Francisco, California. | High | SO001, SO004 |
| CO003 | insitro’s core operating model combines human-cohort data and cellular data with machine learning to identify causal targets and therapeutic hypotheses. | High | SO001, SO002 |
| CO004 | insitro describes itself as a pipeline-through-platform company that both advances internal programs and partners with large pharmaceutical companies. | Medium | SO001, SO003, SO009 |
| CO005 | The current disclosed pipeline spans metabolism, neuroscience, and ophthalmology rather than a single-disease portfolio. | Medium | SO003 |
| CO006 | Daphne Koller’s founder-market fit rests on a Stanford machine-learning background, the prior creation of Coursera, and a long-standing focus on applying AI to high-dimensional problems. | Medium | SO016, SO017 |
| CO007 | Amy Abernethy joined insitro’s board in 2024, adding FDA, real-world evidence, and clinical-development experience to governance. | Medium | SO006 |
| CO008 | Joe Hand joined insitro as Chief People Officer in February 2026 to lead global people strategy, organizational development, and culture. | Medium | SO005 |
| CO009 | Joe Hand previously held senior leadership roles at Celgene and Phathom Pharmaceuticals, giving him large-scale biotech HR and transaction experience. | Medium | SO005 |
| CO010 | CPP Investments executive Paul McCracken joined insitro’s board as part of the 2021 Series C financing. | Medium | SO007 |
| CO011 | Forbes reported that insitro raised a $143 million Series B in 2020, bringing total venture funding at that time to $243 million. | Medium | SO017 |
| CO012 | insitro’s 2021 Series C raised $400 million and was led by CPP Investments with participation from both existing investors and new crossover backers. | Medium | SO007 |
| CO013 | Public sources identify a16z, ARCH, GV, Third Rock, CPP, BlackRock, T. Rowe, Casdin, Temasek, SoftBank, and Foresite among insitro’s named equity backers by 2021. | Medium | SO007, SO017 |
| CO014 | By 2020, Forbes described insitro’s Gilead NASH collaboration as including a $15 million upfront payment and up to $1 billion in milestone potential. | Medium | SO016, SO017 |
| CO015 | The 2020 Bristol Myers Squibb collaboration added $50 million upfront, $20 million in near-term operational milestones, and more than $2 billion in downstream milestones plus royalties. | Medium | SO008 |
| CO016 | insitro disclosed a $25 million milestone payment from Bristol Myers Squibb in 2024 after selecting the first novel ALS target from the collaboration. | Medium | SO010 |
| CO017 | The 2025 Bristol Myers Squibb extension could provide up to $20 million in new funding for ChemML-enabled ALS small-molecule work. | Medium | SO013 |
| CO018 | The 2024 Lilly agreements targeted metabolic diseases including MASLD while letting insitro retain global rights and leaving Lilly eligible for milestones and royalties. | Medium | SO009 |
| CO019 | The Moorfields INSIGHT collaboration added access to a 35 million-image ophthalmic dataset to support neurodegeneration and ocular target discovery. | Medium | SO012 |
| CO020 | The 2022 Genomics England partnership gave insitro access to an NHS-linked resource of almost 150,000 whole genomes and associated multimodal phenotypic data. | Medium | SO011 |
| CO021 | insitro’s own 2024 to 2026 communications frame the company as having more than $700 million to approximately $800 million in capital when partnership cash is included. | Medium | SO005, SO006, SO014, SO015 |
| CO022 | BioPharma Dive reported that insitro cut 22 percent of its workforce in 2025, leaving about 230 employees and targeting runway into 2027. | Medium | SO018 |
| CO023 | Open tracker pages still show inconsistent headcount estimates of roughly 250, 262, 267, or 300 employees, so current scale remains only partially verified in public sources. | Low | SO022, SO023, SO024, SO025 |
| CO024 | Open tracker pages conflict on funding, valuation, and revenue, with GetLatka reporting $643 million raised and $69 million of 2024 revenue while Awaira reports $743 million raised and a $2.2 billion March 2026 valuation. | Low | SO022, SO023 |
| CO025 | Public company pages show insitro now references colleagues in Israel, Poland, and Malaysia in addition to its South San Francisco base. | Medium | SO005, SO014 |
| CO026 | The company still publicly anchors its office footprint at South San Francisco rather than disclosing a broad network of U.S. office sites. | Medium | SO001, SO004 |
| CO027 | The 2026 CombinAbleAI acquisition launched TherML and extended insitro’s design stack from small molecules and oligonucleotides into biologics and other complex modalities. | Medium | SO014 |
| CO028 | The 2026 BAT study said BAT-01 knockdown drove a 15 percent body-weight reduction in obese mice while preserving lean mass, signaling preclinical obesity momentum. | Medium | SO015 |
| CO029 | insitro’s public pipeline page discloses eight named programs ranging from target credentialing to IND-enabling, but no marketed product or public clinical-stage asset. | Medium | SO003 |
| CO030 | The metabolism pipeline currently includes CTRO-1013, CTRO-1029, CTRO-1035, OBS-1, and OBS-2. | Medium | SO003 |
| CO031 | The neuroscience pipeline currently includes ALS-1 small molecule, CTRO-2018 oligonucleotide, ALS-2, and ALS-3. | Medium | SO003, SO013 |
| CO032 | Publicly disclosed revenue-bearing or milestone-bearing pharma relationships are concentrated in Gilead, Bristol Myers Squibb, and Lilly. | Medium | SO008, SO009, SO010, SO016, SO017 |
| CO033 | Public governance clues are limited to named investors and board additions, with no open-source disclosure of control rights, preference stack, or special voting arrangements. | Medium | SO006, SO007, SO017 |
| CO034 | Reviewed open sources did not disclose customer count, debt facilities, secondaries, or a current audited revenue mix for insitro. | Medium | SO001, SO005, SO022, SO023 |
| CO035 | insitro’s GitHub organization exposes public research artifacts such as cp-posh, kindel, and other 2022 to 2025 repositories, indicating an externally visible technical output stream. | Medium | SO019 |
| CO036 | KPMG and McKinsey both describe AI-biopharma partnerships as a large and still-expanding market, but also note valuation pressure, regulatory scrutiny, and more conservative dealmaking after 2022. | Medium | SO020, SO021 |
| CO037 | insitro’s public narrative evolved by 2026 from a machine-learning-driven drug discovery company to an AI therapeutics or physical AI company built on causal biology. | Medium | SO007, SO014, SO015 |
| CO038 | The company’s current one-line product logic is to use Virtual Human and ClinML for target discovery and TherML or ChemML for modality-specific therapeutic design. | Medium | SO003, SO013, SO014, SO015 |
| CO039 | The current operating architecture combines internal data generation with external datasets, partner technologies, and targeted acquisitions to industrialize discovery. | Medium | SO011, SO012, SO014 |
| CO040 | The main overview-level adverse signals are a 2025 workforce reduction, opaque current valuation and scale metrics, and dependence on a small number of pharma partners for data, milestones, or funding. | Medium | SO018, SO022, SO023, SO024, SO025 |
| CM001 | insitro’s current commercial posture is still a platform-backed drug R&D company rather than a company already selling marketed therapeutics. | High | SM001, SM002 |
| CM002 | The relevant market boundary for insitro includes AI drug-discovery partnerships, internally controlled therapeutic programs, and data-driven discovery collaborations. | High | SM001, SM002, SM005, SM006 |
| CM003 | insitro says its platform is built to derisk and accelerate multiple steps across the research and development value chain. | Medium | SM001 |
| CM004 | insitro’s disclosed pipeline currently spans metabolism, neuroscience, and ophthalmology. | Medium | SM002 |
| CM005 | The 2024 Lilly agreements support metabolic-disease programs while leaving insitro with full global rights and Lilly eligible for milestones and royalties. | Medium | SM003 |
| CM006 | The 2020 Bristol Myers Squibb agreement lets BMS opt selected targets into downstream development and commercialization. | Medium | SM004 |
| CM007 | The Moorfields collaboration gives insitro access to a 35 million-image ophthalmic dataset for neurodegeneration research. | Medium | SM005 |
| CM008 | The Genomics England collaboration gives insitro multimodal search capability across NHS-linked genomics and pathology data. | Medium | SM006 |
| CM009 | MarketsandMarkets estimates the AI in drug discovery market will grow from USD 1.86 billion in 2024 to USD 6.89 billion by 2029 at a 29.9% CAGR. | Medium | SM021 |
| CM010 | Precedence Research estimates the AI in pharmaceutical market at USD 1.94 billion in 2025 and USD 16.49 billion by 2034. | Medium | SM022 |
| CM011 | Precedence Research separately estimates the AI in drug discovery market at USD 6.93 billion in 2025 and USD 17.81 billion by 2035. | Medium | SM023 |
| CM012 | Grand View Research estimates the AI in precision medicine market at USD 2.29 billion in 2024 and USD 14.53 billion by 2030. | Medium | SM029 |
| CM013 | McKinsey says the pharma AI market is projected to grow from more than USD 4 billion in 2025 to USD 25.7 billion by 2030. | Medium | SM008 |
| CM014 | The reviewed analyst forecasts are directionally bullish but not directly comparable because they measure different boundaries such as AI in pharma, AI in drug discovery, and AI in precision medicine across different forecast horizons. | Medium | SM008, SM021, SM022, SM023, SM029 |
| CM015 | KPMG reports that AI-focused M&A and partnership deals in biopharma grew at a 27.3% compound annual rate from 2013 to 2022. | Medium | SM009 |
| CM016 | KPMG says AI-related biopharma deals during the period concentrated in R&D-only agreements and development or commercialization licensing. | Medium | SM009 |
| CM017 | KPMG says the 2022 downturn pushed pharma companies toward lower-risk partnerships and asset acquisitions. | Medium | SM009 |
| CM018 | EFPIA estimates that research-based pharmaceutical companies invested about EUR 55 billion in Europe in R&D during 2024. | Medium | SM010 |
| CM019 | EFPIA says North America accounted for 54.8% of world pharmaceutical sales in 2024 versus 22.7% for Europe. | Medium | SM010 |
| CM020 | WHO says 890 million adults worldwide were living with obesity in 2022. | Medium | SM014 |
| CM021 | WHO says the global costs of overweight and obesity could reach about USD 3 trillion per year by 2030. | Medium | SM014 |
| CM022 | insitro’s Lilly announcement says MASLD affects about 100 million people in the United States and lacked approved interventional treatments in 2024. | Medium | SM003 |
| CM023 | The American Liver Foundation says MASLD affects up to 25% of people in the United States. | Medium | SM019 |
| CM024 | AASLD says MASLD prevalence estimates run from 10% to 46% in the United States and 6% to 35% worldwide. | Medium | SM020 |
| CM025 | WHO says at least 2.2 billion people globally have near or distance vision impairment. | Medium | SM013 |
| CM026 | WHO says at least 1 billion vision-impairment cases are preventable or unaddressed and the associated annual productivity loss is about USD 411 billion. | Medium | SM013 |
| CM027 | CDC estimates about 9.603 million people in the United States have any diabetic retinopathy. | Medium | SM016 |
| CM028 | CDC’s National ALS Registry publishes national prevalence estimates through 2018 and projections out to 2030. | Medium | SM017 |
| CM029 | CDC’s 50-state ALS prevalence paper reports state prevalence ranging from 2.5 to 7.8 per 100,000. | Medium | SM017 |
| CM030 | NINDS says there is currently no known treatment that stops or reverses ALS progression. | Medium | SM018 |
| CM031 | WHO says AI for health faces a pacing gap because technology is advancing faster than legal frameworks. | Medium | SM007 |
| CM032 | EMA’s 2026 AI principles require human-centric design, a risk-based approach, fit-for-use data, lifecycle monitoring, and clear context of use. | Medium | SM012 |
| CM033 | EFPIA’s 2025 AI lifecycle report says AI is spreading from discovery into clinical development, manufacturing, and post-approval safety under trust and transparency expectations. | Medium | SM011 |
| CM034 | IQVIA says pharma companies are extending AI into pharmacovigilance, medical information, and lifecycle management while relying on experienced technology partners. | Medium | SM030 |
| CM035 | The combined Recursion-Exscientia entity disclosed more than 10 internal programs, more than 10 partnered programs, and about USD 450 million of upfront and milestone cash realized from partners. | Medium | SM024 |
| CM036 | Recursion says its platform is built on more than 50 petabytes of proprietary data and millions of experiments per week. | Medium | SM025 |
| CM037 | Schrödinger says it deploys its computational platform across both collaborative and proprietary drug-discovery programs. | Medium | SM026 |
| CM038 | Relay Therapeutics says it uses integrated experimental and computational discovery against previously intractable biology in precision oncology and genetic disease. | Medium | SM027 |
| CM039 | Evotec says it offers a fully integrated R&D value chain and flexible partnering models across small molecules and biotherapeutics. | Medium | SM028 |
| CM040 | Taken together, public peers suggest the AI-drug-discovery market increasingly rewards platform-plus-pipeline or platform-plus-partnering capability rather than tools-only positioning. | Medium | SM024, SM025, SM026, SM027, SM028 |
| CM041 | insitro’s immediate buyers are large-pharma R&D and business-development teams, while physicians, payers, and patients matter mainly in a later downstream drug path. | Medium | SM003, SM004, SM009 |
| CM042 | The Genomics England and Moorfields relationships function primarily as strategic discovery inputs and moats, not as a clean standalone monetizable market category. | Medium | SM005, SM006, SM011 |
| CM043 | A defensible market view for insitro needs multiple lenses—AI market forecasts, pharma R&D budget pools, and disease-burden demand—rather than one generic TAM number. | High | SM008, SM010, SM013, SM014, SM020 |
| CM044 | No reviewed public source cleanly isolates insitro’s exact SAM or SOM across its metabolic, neuro, and ophthalmic strategy. | Medium | SM008, SM009, SM010, SM021, SM022, SM023, SM029 |
| CM045 | Growth drivers include high disease burden, large R&D budgets, multimodal data availability, and ongoing AI adoption across the medicines lifecycle. | High | SM010, SM013, SM014, SM030 |
| CM046 | Adoption constraints include governance demands, long drug timelines, demand for proof beyond hype, and buyer preference for lower-risk deal structures. | High | SM008, SM009, SM011, SM012 |
| CM047 | The partnership adoption path runs from data and model validation to target nomination and then to negotiated development rights and milestone-bearing execution. | High | SM003, SM004, SM005, SM006 |
| CM048 | The main status-quo substitutes for insitro are in-house pharma discovery teams, conventional CRO-style workflows, and competing AI-platform companies rather than consumer health or hospital IT vendors. | Medium | SM009, SM024, SM025, SM026, SM027, SM028 |
| CP001 | insitro’s competitive landscape spans direct AI-drug-discovery peers, adjacent integrated-service substitutes, and internal pharma build rather than one like-for-like rival. | High | SP001, SP002, SP016, SP018, SP020, SP021, SP022 |
| CP002 | The most visible direct AI-drug-discovery peers in reviewed public sources are Recursion/Exscientia, Insilico Medicine, Xaira, Isomorphic Labs, Generate Biomedicines, Valo Health, and BenevolentAI. | High | SP005, SP008, SP010, SP011, SP013, SP014, SP016 |
| CP003 | Adjacent substitutes include Schrödinger’s physics-based platform, Evotec’s integrated R&D services, and in-house pharma discovery teams. | Medium | SP018, SP020, SP021, SP022 |
| CP004 | insitro’s own differentiation claim rests on multimodal human and cellular data, machine learning, and a pipeline spanning metabolism, neuroscience, and ophthalmology. | Medium | SP001, SP002 |
| CP005 | Recursion says its platform is built on more than 50 petabytes of data and millions of cell experiments per week. | Medium | SP016 |
| CP006 | The Recursion-Exscientia combination disclosed more than 10 internal programs, more than 10 partnered programs, roughly USD 450 million of realized partner cash, and about 800 employees. | Medium | SP017 |
| CP007 | Insilico’s pipeline page lists more than 40 total programs, 30 preclinical candidates nominated since 2021, and 13 pipelines that received IND approval. | Medium | SP006 |
| CP008 | Insilico says it nominated 20 preclinical candidates from 2021 to 2024 with an average 12- to 18-month timeline and only 60 to 200 molecules synthesized and tested per program. | Medium | SP007 |
| CP009 | Insilico’s ISM8969 received FDA IND clearance in 2026 for Parkinson’s disease, with a co-development arrangement that includes 50/50 global rights and up to USD 66 million in upfront and milestones. | Medium | SP007 |
| CP010 | Insilico says its three key license-out deals add up to as much as USD 2.1 billion in total contract value. | Medium | SP007 |
| CP011 | Xaira says it is building predictive and agentic AI models across the complete spectrum of drug discovery and development. | Medium | SP008 |
| CP012 | Xaira’s news page says X-Cell launched in 2026 as a virtual-cell model trained on the largest-ever genome-wide perturbation dataset, X-Atlas/Pisces. | Medium | SP009 |
| CP013 | Xaira’s news page also says the company raised nearly USD 1 billion and released the largest publicly available genome-wide Perturb-seq dataset in 2025. | Medium | SP009 |
| CP014 | Generate Biomedicines positions itself as a generative-biology company that uses machine learning to create medicines on demand across therapeutic modalities. | Medium | SP010 |
| CP015 | Generate says it has generated, built, and tested 42,000 proteins and operates with more than 140,000 square feet of space, making it more biologics-oriented than insitro’s current public narrative. | Medium | SP010 |
| CP016 | Isomorphic Labs says it is building predictive and generative AI beyond AlphaFold to design novel medicines. | Medium | SP011 |
| CP017 | Isomorphic Labs’ news page lists a USD 600 million external investment round announced in 2025. | Medium | SP012 |
| CP018 | Isomorphic Labs’ news page also lists collaborations or collaboration expansions with Lilly, Novartis, and Johnson & Johnson between 2024 and 2026. | Medium | SP012 |
| CP019 | Valo Health emphasizes AI-enabled human causal biology and closed-loop chemistry through an ecosystem-led innovation model. | Medium | SP013 |
| CP020 | BenevolentAI emphasizes a decade of investment in a knowledge graph and proprietary ontologies supporting life-science decision making. | Medium | SP014 |
| CP021 | BenevolentAI’s news page shows both a strategic overhaul and a proposed delisting via merger, which is adverse evidence for category durability. | Medium | SP015 |
| CP022 | The Exscientia news URL resolving to Recursion and the Nasdaq merger announcement together show that Exscientia has effectively been absorbed into the Recursion platform. | Medium | SP017, SP026 |
| CP023 | Schrödinger says it deploys a physics-based computational platform across both collaborative and proprietary drug-discovery programs. | Medium | SP018 |
| CP024 | Relay Therapeutics says it uses integrated experimental and computational discovery against challenging targets in precision oncology and genetic disease. | Medium | SP019 |
| CP025 | Evotec competes through a fully integrated R&D value chain and flexible partnering models rather than a pure AI-only value proposition. | Medium | SP020 |
| CP026 | MarketsandMarkets says pharmaceutical and biotechnology companies are the largest and fastest-growing end users of AI in drug discovery. | Medium | SP027 |
| CP027 | WHO, EMA, and EFPIA all frame trust, transparency, risk-based governance, and lifecycle controls as critical to AI adoption in medicines, making trust posture a competitive variable rather than only a compliance issue. | High | SP023, SP024, SP025 |
| CP028 | KPMG says the post-2022 market pushed buyers toward lower-risk partnership structures. | Medium | SP021 |
| CP029 | McKinsey says pharma still has not seen clear evidence that AI alone materially shortens development timelines or improves success rates. | Medium | SP022 |
| CP030 | insitro’s own BMS and Lilly deals show that it packages value through collaboration rights, milestones, royalties, and option-style economics rather than through public seat-based software pricing. | High | SP003, SP004 |
| CP031 | Across the reviewed peer set, public pricing is seldom list-priced; economics appear through milestone deals, license-outs, co-development, or integrated service bundles. | Medium | SP007, SP017, SP020, SP021 |
| CP032 | Recursion and Insilico provide clearer public economic proof points than most private peers in the reviewed source set. | Medium | SP007, SP017 |
| CP033 | Pharma buyers can likely multi-home across several AI platforms because many pre-deal value propositions remain partially substitutable. | Medium | SP021, SP022, SP027 |
| CP034 | Switching costs rise only after shared data pipelines, model tuning, or program rights become embedded inside an active collaboration. | Medium | SP003, SP004, SP021 |
| CP035 | Distribution power still sits with large pharma or later-stage commercial organizations rather than insitro or most private AI-biopharma peers. | Medium | SP003, SP004, SP018, SP020 |
| CP036 | insitro’s moat is not raw capital scale but the combination of multimodal disease data, platform-to-pipeline continuity, and partner validation. | High | SP001, SP002, SP003, SP004 |
| CP037 | Recursion, Xaira, and Isomorphic appear stronger than insitro on capital or platform ambition in current public evidence. | Medium | SP009, SP012, SP017 |
| CP038 | Insilico and Recursion show stronger public industrialization or proof cadence than insitro in the reviewed sources. | Medium | SP006, SP007, SP017 |
| CP039 | Generate, Valo, and Benevolent are more adjacent or differently shaped substitutes than direct one-for-one peers for insitro’s current pipeline. | Medium | SP010, SP013, SP014 |
| CP040 | insitro’s moat could commoditize if platform claims become substitutable across many AI vendors and internal pharma build teams. | Medium | SP021, SP022, SP023, SP024, SP025 |
| CP041 | BenevolentAI’s restructuring and Exscientia’s absorption provide concrete adverse evidence that the category can consolidate or compress before durable pricing power appears. | Medium | SP015, SP017, SP026 |
| CP042 | The net picture is a crowded but still fragmented field in which insitro is differentiated, yet not clearly dominant on public scale, proof, or pricing power. | High | SP021, SP022, SP023, SP024, SP025 |
| CI001 | insitro’s public financial model is partnership-driven and still pre-product, with monetization disclosed through collaboration economics rather than product sales or list-priced software. | High | SI001, SI003, SI004, SI005, SI006, SI007 |
| CI002 | Official sources document a $50M BMS upfront, a $25M BMS milestone in 2024, up to $20M of new funding in the 2025 ChemML extension, and a $10M milestone in the 2026 target expansion; 2026 company messaging also cites about $150M of partnership revenue across BMS, Lilly, and Gilead. | High | SI001, SI003, SI004, SI005, SI006, SI008 |
| CI003 | Public funding history supports more than $100M raised by 2019, $243M total venture funding by mid-2020, and a further $400M Series C in 2021; later company messaging describes insitro as backed by roughly $800M in capital. | High | SI001, SI002, SI009, SI010 |
| CI004 | Publicly disclosed revenue-bearing or milestone-bearing pharma relationships are concentrated in Gilead, Bristol Myers Squibb, and Lilly. | High | SI001, SI003, SI007, SI010 |
| CI005 | Forbes reported that insitro’s Gilead collaboration included a $15M upfront payment and up to $1B in milestone potential. | Medium | SI010 |
| CI006 | The 2024 Lilly structure is not a simple up-front monetization event: insitro keeps full global rights to its programs while Lilly becomes eligible for milestones and royalties. | Medium | SI007 |
| CI007 | Public pricing is bespoke and milestone-heavy rather than list-priced, with economics disclosed as upfronts, milestones, royalties, retained rights, and option-style collaboration terms. | High | SI003, SI004, SI005, SI006, SI007, SI025, SI026 |
| CI008 | The Bristol Myers Squibb relationship shows a land-and-expand GTM pattern: discovery collaboration in 2020, milestone conversion in 2024, ChemML extension in 2025, and additional target expansion in 2026. | High | SI003, SI004, SI005, SI006, SI008 |
| CI009 | insitro’s apparent buyer is large-pharma R&D and business development, implying a low-customer-count, high-contract-value enterprise selling motion rather than broad-distribution software sales. | High | SI003, SI007, SI025, SI026 |
| CI010 | Public CAC or payback metrics are unavailable; the best public proxy for sales efficiency is whether insitro can deepen a small number of strategic accounts over multiple years. | Medium | SI003, SI006, SI025, SI026 |
| CI011 | In February 2026, insitro itself said it was backed by roughly $800M in capital and about $150M in partnership revenue from BMS, Lilly, and Gilead. | Medium | SI001 |
| CI012 | GetLatka reports $69M of 2024 revenue, $643M total funding, a $2.4B 2021 valuation, and 262 employees. | Low | SI014 |
| CI013 | Awaira reports $743M of disclosed funding, an approximately $2.2B valuation as of March 2026, around 300 employees, and only a very broad estimated ARR range. | Low | SI015 |
| CI014 | Usearch lists $7.5M of revenue and 267 employees for insitro. | Low | SI016 |
| CI015 | WorxForm summarizes insitro as having raised $400M with roughly 250 employees, reinforcing that public trackers disagree even on basic company scale. | Low | SI017 |
| CI016 | Public third-party trackers conflict materially on insitro’s revenue, funding, valuation, and headcount, so none should be treated as canonical without management backup. | High | SI001, SI014, SI015, SI016, SI017 |
| CI017 | insitro’s cost structure likely skews toward wet-lab biology, data generation, compute, chemistry, and senior scientific talent rather than software-only delivery costs. | Medium | SI001, SI005, SI006, SI007 |
| CI018 | The 2025 ChemML extension highlighted a 192-H100 GPU compute cluster, reinforcing that high-end compute is a real cost center inside the discovery model. | Medium | SI005 |
| CI019 | Public traction is visible mainly through collaboration cash events and cumulative partnership-revenue claims, not through audited annual revenue, bookings, or backlog disclosure. | High | SI001, SI004, SI005, SI006, SI007 |
| CI020 | BioPharma Dive reported that insitro cut 22% of its workforce in 2025, leaving about 230 workers and aiming to operate into 2027. | Medium | SI011 |
| CI021 | The 2025 restructuring suggests management prioritized cash preservation and clinic readiness over continued hiring velocity. | Medium | SI011, SI012 |
| CI022 | EY said 2024 follow-on financings were the worst since 2016, IPO activity remained muted, and biotechs were being pushed toward cost reductions, alternate funding, and pharma partnerships. | Medium | SI012 |
| CI023 | Fierce’s 2026 layoff tracker shows workforce reductions remained widespread across biotech after 2025, reinforcing that cost resets were a sector-wide feature rather than an insitro-only anomaly. | Medium | SI012, SI013 |
| CI024 | Recursion’s 2023 annual report shows $44.6M of total revenue, a $328.1M net loss, $241.2M of R&D expense, and $391.6M of year-end cash. | Medium | SI018 |
| CI025 | Recursion’s filing also says revenue recognition is not directly correlated to cash receipts and includes milestone-based variable consideration, illustrating why collaboration cash does not map cleanly to GAAP revenue. | Medium | SI018 |
| CI026 | Schrödinger’s 2023 annual report shows $216.7M of total revenue, $181.8M of R&D expense, $468.8M of cash, cash equivalents, restricted cash, and marketable securities, and about $136.7M of operating cash use. | Medium | SI019 |
| CI027 | Macrotrends shows Schrödinger revenue at roughly $208M in 2024 after $216.7M in 2023, suggesting the most revenue-visible public comp still does not scale like hypergrowth software. | Medium | SI019, SI024 |
| CI028 | Relay’s 2023 annual report shows no product revenue, a $342.0M net loss, $330.0M of R&D expense, $750.1M of cash/investments, and $300.3M of operating cash use. | Medium | SI020 |
| CI029 | Taken together, Recursion, Schrödinger, and Relay show that AI-biopharma platforms can remain highly cash consumptive even with collaboration or software revenue. | High | SI018, SI019, SI020 |
| CI030 | CompaniesMarketCap listed public 2026 market caps of about $1.73B for Recursion, $0.95B for Schrödinger, and $2.46B for Relay. | Medium | SI021, SI022, SI023 |
| CI031 | Tracker-based private valuation signals for insitro of roughly $2.2B to $2.4B sit inside the public-equity band of these comps despite far lower disclosure quality. | Medium | SI014, SI015, SI021, SI022, SI023 |
| CI032 | Because insitro has public evidence of realized collaboration cash, revenue quality is better than that of a zero-monetization preclinical biotech, but still materially weaker than a disclosed commercial or audited recurring-revenue model. | High | SI001, SI004, SI007, SI018, SI019, SI020 |
| CI033 | Gross margin path is hard to underwrite publicly; partnership economics carry milestone and royalty upside, but current delivery almost certainly absorbs significant lab, compute, and personnel expense. | Medium | SI005, SI018, SI019, SI020 |
| CI034 | Working-capital, capex, lease, and debt burdens are essentially opaque for insitro because no audited statements or contract schedules are public. | Medium | SI001, SI002, SI011 |
| CI035 | No public source reviewed disclosed insitro’s current cash balance, monthly burn, deferred revenue, lease commitments, or debt facilities. | High | SI001, SI002, SI011 |
| CI036 | The trackers themselves acknowledge that at least some of their figures are estimated or based on public data, limiting their weight for real underwriting. | Medium | SI014, SI015 |
| CI037 | KPMG and McKinsey both describe pharma AI adoption as proof-sensitive and partnership-oriented, consistent with insitro selling strategic program access rather than generalized software subscriptions. | Medium | SI025, SI026 |
| CI038 | The most plausible next value-inflection triggers are additional partnership expansion, milestone conversion, or successful clinic-readiness progression rather than near-term public-market liquidity. | Medium | SI005, SI006, SI011, SI012 |
| CI039 | The 2021 Series C proceeds were earmarked for platform expansion, enabling datasets, complementary technologies, and in-licensed assets, implying capital deployment is platform- and pipeline-building rather than near-term profitability. | Medium | SI002 |
| CI040 | Financial model quality is currently blocked by missing current cash, burn, revenue-recognition, and obligation data. | High | SI001, SI002, SI011, SI014 |
| CI041 | Overall financial verdict: insitro is partnership-validated but still opaque and financing-dependent until internal programs or milestone conversion create clearer, auditable proof of revenue quality and capital efficiency. | High | SI001, SI011, SI012, SI018, SI019, SI020 |
| CE001 | insitro’s public product is not a general-purpose software application but an end-to-end discovery workflow that turns multimodal human and cellular data into targets, biomarkers, and therapeutic designs. | High | SE001, SE002, SE004 |
| CE002 | insitro publicly describes the platform as powering both wholly owned and partnered programs across metabolism, neuroscience, and oncology. | High | SE002, SE004 |
| CE003 | The company’s platform narrative emphasizes a modular, reusable, automated stack that generates or acquires data and then converts it into decision support for target and drug discovery. | High | SE001, SE004 |
| CE004 | Virtual Human is publicly framed as a genetically anchored causal AI engine that explains disease mechanisms and helps decide what biology to target. | High | SE011, SE012, SE015 |
| CE005 | ClinML creates scalable human-derived phenotypes from cohort data, illustrated by a brown-adipose-tissue phenotype derived from 69,598 UK Biobank MRI scans. | Medium | SE012 |
| CE006 | CellML is used to screen genetically supported targets in relevant human cells using high-content imaging, transcriptomics, and functional assays. | High | SE012, SE015 |
| CE007 | POSH integrates pooled CRISPR screening, Cell Painting, and self-supervised deep learning to infer gene function at scale. | High | SE010, SE018 |
| CE008 | In the public POSH study, self-supervised models recovered 2.5 times more functional gene relationships than conventional expert-designed analysis and yielded validated target insights. | High | SE010, SE018 |
| CE009 | The cp-posh repository publicly exposes datasets, training and inference scripts, notebooks, and pretrained model weights, showing that part of insitro’s research tooling is reproducible outside the company. | Medium | SE017, SE018 |
| CE010 | insitro’s GitHub organization shows a maintained public engineering and research surface that includes redun and multiple public-research repositories. | Medium | SE016, SE017 |
| CE011 | TherML is publicly described as an integrated design layer spanning small molecules, oligonucleotides, and complex biologics within one platform. | High | SE011, SE014 |
| CE012 | The CombinAbleAI acquisition extends TherML into antibodies and other complex biologics using a physics-informed optimization engine pre-trained on more than 100,000 molecular-dynamics surrogates. | High | SE011, SE014 |
| CE013 | TherML operates as a closed-loop active-learning system connected to automated labs so that experimental results iteratively refine design predictions. | High | SE011, SE014 |
| CE014 | TherML and ChemML are positioned to optimize both target activity and developability rather than deferring manufacturability and ADMET concerns until late in the workflow. | High | SE011, SE014 |
| CE015 | The BMS ChemML extension and the Lilly small-molecule collaboration together show that insitro’s small-molecule stack includes QAL-driven data generation, active-learning medicinal chemistry, ADMET models, and substantial compute infrastructure including a 192-H100 GPU cluster. | High | SE008, SE009 |
| CE016 | The 2025 Lilly collaboration shows insitro building ADMET models on Lilly preclinical data and making those models available to Lilly TuneLab partners on a federated infrastructure. | Medium | SE009 |
| CE017 | The 2024 Lilly metabolic agreements show the platform pairing internal target discovery with both siRNA-delivery technology and antibody discovery while insitro retains global rights. | Medium | SE006, SE009 |
| CE018 | The 2026 BMS expansion shows insitro pursuing ALS-1 across both oligonucleotide and small-molecule modalities and using TherML to select the optimal intervention for each target. | High | SE013, SE015 |
| CE019 | Genomics England’s own partner page confirms that insitro’s embedding search engine is deployed into the secure Genomics England Research Environment for research partners. | Medium | SE019 |
| CE020 | INSIGHT’s own collaboration page says only INSIGHT researchers access the OCT data while the foundation model is built inside a secure research environment. | Medium | SE020 |
| CE021 | The BAT program demonstrates a public workflow in which a ClinML-derived human phenotype feeds a CellML screen and then in vivo validation, culminating in a named preclinical asset, BAT-01. | Medium | SE002, SE012 |
| CE022 | BAT-01 knockdown produced a 15 percent body-weight reduction and a 25 percent fat-mass reduction in obese mice without reducing caloric intake. | Medium | SE012 |
| CE023 | The 2020 BMS collaboration used insitro’s Human platform to build iPSC-derived ALS/FTD disease models and gave BMS opt-in rights for targets that insitro identified. | High | SE005, SE007 |
| CE024 | The 2024 BMS milestone update described more than 200 engineered and patient ALS cell lines, ML-enabled motor-neuron differentiation, high-content imaging, and POSH as core proprietary elements of the discovery engine. | High | SE007, SE015 |
| CE025 | The ALS playbook says POSH can measure thousands of cellular features across millions of perturbed cells, underscoring data density as a core design principle of Virtual Human. | Medium | SE015 |
| CE026 | insitro’s differentiation is publicly anchored in combining large-scale human cohort data with internally generated cellular perturbation data rather than in molecule-generation models alone. | High | SE001, SE011, SE012, SE015 |
| CE027 | Justia’s assignee page lists multiple 2025-2026 insitro patents on biological image transformation, autonomous cell imaging and modeling, and machine-learning-based spatial omics imputation. | Medium | SE025 |
| CE028 | Public patent activity suggests a real imaging and omics IP estate, but the scope, field-of-use limits, and licensing terms are not auditable from open sources alone. | Medium | SE025 |
| CE029 | External proof exists for discrete platform components, but there is no public evidence of a broad self-serve product surface with pricing, API documentation, or support SLAs for general customers. | Medium | SE001, SE002, SE016, SE017, SE018, SE019, SE020 |
| CE030 | The public roadmap from 2024-2026 shows insitro moving from discovery-engine branding toward modality-agnostic therapeutic design and nearer-clinic asset language. | High | SE006, SE008, SE011, SE013, SE014 |
| CE031 | Despite richer platform messaging, public proof still centers on preclinical, discovery, or partner-delivery milestones rather than approved products or public human efficacy readouts. | Medium | SE002, SE006, SE012, SE013 |
| CE032 | Trust posture in external data partnerships is framed around secure partner environments, restricted data access, and use-specific model development rather than consumer-style product controls. | Medium | SE019, SE020, SE021 |
| CE033 | FDA’s January 2025 draft guidance says AI used to support drug regulatory decisions should undergo risk-based credibility assessment for a specific context of use. | Medium | SE024 |
| CE034 | EMA’s January 2026 principles require human-centric design, data governance, validation, lifecycle management, and a clear context of use for AI in drug development. | Medium | SE022 |
| CE035 | WHO’s AI-for-health guidance emphasizes ethics, governance, and trust because legal frameworks lag technical deployment. | Medium | SE023 |
| CE036 | insitro’s public privacy policy applies to website and cookie or inquiry data and does not publicly describe enterprise-grade controls for collaboration datasets. | Medium | SE019, SE020, SE021 |
| CE037 | No public SOC 2, ISO 27001, GxP, HIPAA, or HITRUST certification disclosure was found across the sourced platform, partner, and privacy materials. | Medium | SE001, SE019, SE020, SE021 |
| CE038 | No public source reviewed discloses platform throughput metrics such as screens per quarter, targets advanced per year, average design-make-test cycle time, or support uptime. | Medium | SE001, SE002, SE010, SE011 |
| CE039 | The public IP and open-source surface is directionally supportive of moat, but externally auditable evidence still covers only slices of the platform and not the proprietary datasets or internal model weights that likely matter most. | Medium | SE016, SE017, SE018, SE025 |
| CE040 | Overall product-tech verdict: insitro appears to have a genuinely differentiated, modular, modality-agnostic discovery stack with real external deployments and partner pull, but public evidence still leaves quality-system, data-rights, throughput, and clinical-translation validation gaps that matter for underwriting. | High | SE011, SE014, SE019, SE020, SE021, SE024, SE025 |
| CU001 | insitro’s visible external customer base is best described as a small set of high-value pharma counterparties plus a small number of research-data partners, not a broad software install base. | High | SU001, SU002, SU005, SU007, SU011, SU012, SU013 |
| CU002 | insitro’s home and pipeline materials show that the platform supports both partnered and wholly owned programs, so some platform output is consumed internally rather than sold to external customers. | High | SU001, SU002, SU003, SU004 |
| CU003 | The public record does not show a self-serve product, public pricing page, marketplace channel, or broad long-tail customer roster. | Medium | SU001, SU002, SU003, SU004 |
| CU004 | The visible channel motion is direct, science-led business development around bespoke collaborations rather than transactional software sales. | High | SU005, SU007, SU011, SU023, SU024 |
| CU005 | Named external customer proof is geographically concentrated in U.S. big pharma and U.K. health-data institutions. | High | SU005, SU007, SU011, SU012, SU013 |
| CU006 | The visible buyer-user-payer map has three layers: direct pharma payers, partner-controlled research-environment users, and indirect biotech users reached through Lilly TuneLab. | High | SU007, SU009, SU012, SU013, SU015, SU020 |
| CU007 | There is no public evidence that insitro has diversified into a large number of mid-market biotech or self-serve customers. | Medium | SU001, SU002, SU003, SU004, SU023, SU024 |
| CU008 | Gilead’s 2019 agreement was a three-year NASH collaboration in which insitro’s platform was used to create disease models and identify targets that Gilead could advance. | High | SU011, SU027 |
| CU009 | The Gilead agreement disclosed enterprise-scale economics: $15 million upfront, up to $35 million in near-term operational milestones, up to $200 million per target plus royalties, and opt-in co-development rights. | High | SU011, SU025 |
| CU010 | Bristol Myers Squibb entered a five-year discovery collaboration with insitro in 2020 around ALS and FTD, including upfront cash and downstream milestone potential. | Medium | SU005 |
| CU011 | In December 2024 insitro disclosed a $25 million BMS milestone and the selection of the first novel ALS target, proving follow-on economics beyond the original launch announcement. | Medium | SU006 |
| CU012 | In October 2025 the BMS relationship expanded into ChemML-enabled small-molecule design for a novel ALS target. | Medium | SU008 |
| CU013 | In March 2026 the BMS relationship expanded again with two additional targets and a $10 million milestone payment, making BMS the clearest public land-and-expand account. | High | SU010, SU026 |
| CU014 | Lilly and insitro announced three strategic agreements in 2024 spanning metabolic disease target programs, siRNA delivery, and antibody discovery. | Medium | SU007 |
| CU015 | The 2025 Lilly small-molecule collaboration broadened the relationship into ADMET and pharmacokinetic model building on Lilly’s proprietary preclinical data. | Medium | SU009 |
| CU016 | The 2025 Lilly collaboration explicitly says insitro-built models will be available to Lilly, insitro, and Lilly TuneLab partners, giving indirect reach into a broader biotech partner ecosystem. | Medium | SU009 |
| CU017 | Lilly ExploR&D describes itself as supporting emerging biotech companies from early stage through clinical proof of concept and cites 50+ external programs over 15 years, which contextualizes insitro within an established long-duration external-innovation channel. | Medium | SU020 |
| CU018 | Genomics England’s official partner page says insitro will make its embedding search engine available to Genomics England’s network of research partners within the secure Research Environment. | Medium | SU012 |
| CU019 | Genomics England also says insitro became a broader research partner, implying the relationship extends beyond a one-off technical demo. | Medium | SU012 |
| CU020 | Genomics England’s current Research Environment page describes a multi-user platform for academia and industry built around one of the largest genomic datasets linked to clinical records. | Medium | SU014 |
| CU021 | Genomics England’s industry-researcher page shows three visible commercial engagement modes—self-service data access, bioinformatics consulting, and R&D collaboration—which clarifies how insitro’s tool sits inside a broader buyer and user workflow. | Medium | SU015 |
| CU022 | Genomics England’s documentation shows the Research Environment is an AWS-accessed virtual desktop in which data stay inside the environment and only results are exported. | Medium | SU016 |
| CU023 | Genomics England security guidance emphasizes Airlock-mediated import/export and explicitly forbids screenshots that bypass controls, highlighting the operational friction of partner-environment deployments. | Medium | SU017 |
| CU024 | INSIGHT and Moorfields say they are collaborating with insitro to build an OCT foundation model on millions of linked retinal images and clinical records for neurodegeneration-related discovery. | Medium | SU013 |
| CU025 | INSIGHT/Moorfields also states that only INSIGHT researchers access the underlying data while building the model in a secure research environment, so insitro benefits from the model without direct raw-data possession. | High | SU013, SU018 |
| CU026 | INSIGHT’s Secure Research Environment provisions approved researchers with anonymised data, requested software tools, and virtual machines, with export approval controlled by the data custodian. | Medium | SU018 |
| CU027 | INSIGHT’s Data Use Register shows multiple approved external research projects with completed contracts and secure access granted, evidencing that the surrounding infrastructure supports real external research use rather than a marketing pilot. | High | SU018, SU019 |
| CU028 | The strongest named customer proof combines either paid milestone progression or partner-side descriptions of concrete tool and model usage; it is materially better than a passive logo wall. | High | SU006, SU010, SU011, SU012, SU013, SU019 |
| CU029 | Public outcome evidence exists mainly as disclosed milestones, target nominations, and deployment descriptions, not as user counts, query volumes, ROI statistics, or partner satisfaction metrics. | Medium | SU006, SU009, SU010, SU012, SU013, SU014, SU015, SU019 |
| CU030 | No public source reviewed disclosed NRR, GRR, churn, NPS, active-customer counts, or account-level revenue retention metrics. | Medium | SU001, SU002, SU003, SU004, SU021, SU022 |
| CU031 | There is no public evidence of broad recurring software revenue; the disclosed economics are collaboration-, milestone-, royalty-, and rights-structured. | High | SU005, SU007, SU009, SU011, SU021, SU023, SU024 |
| CU032 | BMS is the strongest public durability signal because the relationship progressed from 2020 launch to 2024 milestone conversion, 2025 chemistry extension, and 2026 target expansion. | High | SU005, SU006, SU008, SU010 |
| CU033 | Lilly provides positive but still early durability evidence because the public relationship broadened from 2024 multi-agreement collaboration into 2025 small-molecule model work. | Medium | SU007, SU009, SU020 |
| CU034 | Gilead provides strong early customer proof but weak current durability visibility because reviewed public materials do not describe post-term renewal, later milestones, or the collaboration’s present status. | Medium | SU011, SU025 |
| CU035 | Research-environment collaborations validate external adoption and create some operational stickiness, but they do not prove large cash revenue because commercial terms and utilization metrics are undisclosed. | Medium | SU012, SU013, SU016, SU017, SU018, SU019 |
| CU036 | A 2026 company statement says insitro has generated about $150 million in partnership revenue from BMS, Lilly, and Gilead, implying meaningful concentration in just three names. | Medium | SU021 |
| CU037 | Because the company does not disclose the split of partnership revenue across BMS, Lilly, and Gilead, exact top-customer concentration cannot be quantified even though concentration is likely high. | Medium | SU021 |
| CU038 | BioPharma Dive’s 2025 layoff report and insitro’s runway-to-2027 framing suggest the company still depends on concentrated partnerships and internal pipeline proof points rather than diversified customer cash flows. | Medium | SU022 |
| CU039 | KPMG and McKinsey both describe AI-biopharma commercialization as partnership-heavy, proof-sensitive, and integration-intensive, which is consistent with insitro’s public customer motion. | High | SU023, SU024 |
| CU040 | Expansion upside exists through deeper BMS and Lilly scopes, broader use inside the Genomics England research network, and indirect reach to biotech users through Lilly TuneLab. | High | SU009, SU010, SU012, SU015, SU020 |
| CU041 | Customer satisfaction and procurement quality remain largely private because public materials surfaced no marketplace reviews, independent user testimonials, or disclosed renewal outcomes. | Medium | SU001, SU002, SU012, SU013, SU014, SU015, SU019 |
| CU042 | Overall, insitro’s customer base looks strategically credible and scientifically integrated, but concentrated and under-disclosed relative to what investors would want for a repeatable revenue engine. | High | SU021, SU022, SU023, SU024 |
| CU043 | Illustrative continuity proxies suggest BMS-style integrated partnerships should retain better than single-indication discovery deals, but these are analyst heuristics rather than company-reported retention figures. | Low | SU005, SU007, SU011, SU022, SU023, SU024 |
| CR001 | The highest residual risk in the current insitro case is clinical translation, because public proof remains preclinical or collaboration-led rather than human-outcome-based. | High | SR008, SR009, SR013, SR016, SR017, SR018, SR025 |
| CR002 | Public company messaging frames clinic readiness as an upcoming milestone rather than an achieved state. | Medium | SR009, SR025 |
| CR003 | The MASLD program update says CTRO-1013 is still in IND-enabling work and preparing for first-in-human trials, which confirms preclinical progress but not regulatory clearance. | Medium | SR008 |
| CR004 | BMS and Lilly milestones validate discovery and partner appetite, but they do not prove that insitro can convert AI-driven discovery outputs into successful human clinical programs. | High | SR013, SR014, SR015, SR016, SR017, SR018 |
| CR005 | FDA says it is seeing a significant increase in AI-related submissions across nonclinical, clinical, postmarketing, and manufacturing phases of the drug lifecycle. | Medium | SR001 |
| CR006 | FDA says its 2025 draft guidance was informed by over 500 submissions with AI components from 2016 to 2023 plus more than 800 external comments on the discussion paper. | Medium | SR001 |
| CR007 | FDA’s guiding-principles page and EMA’s reflection paper both emphasize human-centric design, risk-based assessment, data governance, performance evaluation, and lifecycle management for AI in medicines. | High | SR002, SR004, SR005 |
| CR008 | FDA’s discussion paper says careful assessments tied to the specific context of use and a risk-based approach are needed for AI/ML in drug development. | High | SR002, SR003 |
| CR009 | EMA’s reflection paper explicitly extends AI considerations from drug discovery through post-authorisation, expanding the compliance surface beyond a narrow clinical-model question. | High | SR004, SR005 |
| CR010 | Public insitro materials reviewed do not disclose a mapped control framework showing how the company satisfies FDA and EMA expectations on context of use, data governance, validation, or lifecycle management. | High | SR001, SR002, SR004, SR005, SR028, SR029, SR030 |
| CR011 | AI regulatory credibility risk is therefore material before any filing, pivotal partner expansion, or public re-rating tied to clinic entry. | High | SR001, SR002, SR003, SR004, SR005 |
| CR012 | insitro’s public privacy policy governs the website only and says no security measures can guarantee security. | Medium | SR010 |
| CR013 | The website privacy policy does not provide assurance about governance for the multimodal collaboration datasets that matter most to the business model. | High | SR010, SR020, SR023, SR024 |
| CR014 | Genomics England and INSIGHT partner materials show that sensitive data are kept inside partner-controlled secure environments with gated export processes. | High | SR020, SR022, SR023, SR024 |
| CR015 | Those controls mitigate direct leakage risk but also create friction, limit data portability, and may constrain how broadly insitro can compound partner data into reusable platform assets. | High | SR020, SR021, SR022, SR023, SR024 |
| CR016 | No public SOC 2, ISO 27001, GxP, HIPAA, or HITRUST disclosure surfaced in the reviewed materials. | High | SR010, SR022, SR024, SR028, SR030 |
| CR017 | The public patent record shows a growing IP surface across imaging, biomarker discovery, and platform workflows, but it does not expose freedom-to-operate analyses, inbound license encumbrances, or indemnity terms. | Medium | SR011 |
| CR018 | No public litigation or enforcement action surfaced in the reviewed materials, but that remains a visibility gap rather than a strong clean-bill-of-health signal because insitro is private and lightly disclosing. | Medium | SR010, SR011, SR025 |
| CR019 | Customer and partner concentration risk is high because the company groups about $150 million of partnership revenue into just three names: BMS, Lilly, and Gilead. | High | SR009, SR013, SR017, SR019 |
| CR020 | BMS is the strongest de-risking partner relationship and also the clearest single dependency because it has expanded across multiple milestones, targets, and modalities. | High | SR013, SR014, SR015, SR016 |
| CR021 | Lilly adds a second major partner but also creates dependency on Lilly-controlled data, TuneLab distribution, and Catalyze360 ecosystem choices. | High | SR017, SR018 |
| CR022 | Gilead proves large-pharma willingness to pay, but the current status of that relationship is opaque, reducing visibility into true partner durability. | Medium | SR019 |
| CR023 | Genomics England and INSIGHT relationships show partner-controlled data and deployment dependencies that could constrain reuse, scaling, and external portability. | High | SR020, SR021, SR022, SR023, SR024 |
| CR024 | The BMS ChemML extension publicly references QALs, ADMET models, and a 192-H100 GPU cluster, implying material compute dependence and fixed-cost exposure. | Medium | SR015 |
| CR025 | insitro’s operating model is capital intensive because it combines multimodal data generation, wet-lab experimentation, compute infrastructure, and multiple internal pipeline programs. | High | SR008, SR015, SR028, SR029, SR030 |
| CR026 | Sector layoff reporting and insitro’s own restructuring show that biotech funding and execution conditions remain difficult. | High | SR006, SR025, SR026, SR027 |
| CR027 | BioPharma Dive reports that insitro cut 22% of its workforce and sought to keep operating into 2027, which is a direct signal of both discipline and pressure. | Medium | SR025 |
| CR028 | The 2025 layoff raises execution-capacity risk even if it improves runway, because insitro still needs to move multiple programs and partnerships toward more operationally complex milestones. | High | SR006, SR008, SR025 |
| CR029 | Amy Abernethy’s board appointment is a real mitigation for clinical and regulatory-evidence risk because she brings former FDA leadership and clinical-development expertise. | Medium | SR007 |
| CR030 | Joe Hand’s appointment is a real mitigation for talent-strategy risk, but it is not proof that insitro can retain or cheaply hire scarce cross-functional talent after layoffs. | High | SR007, SR009, SR025 |
| CR031 | KPMG and McKinsey both describe AI-biopharma success as proof-sensitive, data-dependent, and multidisciplinary, reinforcing that execution risk is structural rather than incidental. | High | SR026, SR027 |
| CR032 | FDA and EMA’s joint work on AI principles suggests regulatory expectations are converging internationally, which raises the compliance burden for globally ambitious AI-drug platforms. | High | SR001, SR002, SR004, SR005 |
| CR033 | No public IND, CTA, or human-dosing milestone surfaced in the reviewed materials. | Medium | SR008, SR009, SR028, SR029 |
| CR034 | Public clinic-readiness evidence is directional rather than audited: approaching clinic, IND-enabling, and next-year clinic readiness are all visible, but none prove filing success. | High | SR008, SR009, SR025 |
| CR035 | If model credibility or regulatory documentation is insufficient, partner milestone timing and financing dependence are likely to worsen because discovery validation and clinic progression are intertwined. | High | SR001, SR009, SR013, SR014, SR015, SR016, SR017, SR018, SR025 |
| CR036 | A further material workforce reduction or failure to reach IND or first-in-human readiness by 2027 would be a strong thesis-break signal. | High | SR006, SR008, SR025 |
| CR037 | Inability to produce a private AI validation package or mapped controls against FDA and EMA principles would keep regulatory risk high. | High | SR001, SR002, SR003, SR004, SR005 |
| CR038 | Inability to produce security audits, quality documentation, or collaboration data-rights summaries would keep legal and operational risk high. | High | SR010, SR020, SR022, SR023, SR024 |
| CR039 | Overall residual exposure remains high even though board additions, leadership hires, and secure-environment controls are meaningful mitigants. | High | SR007, SR009, SR014, SR029, SR030 |
| CR040 | Risk transmission is reinforcing: AI-governance gaps can slow clinic progress; slower clinic progress can delay partner milestones; delayed milestones can tighten capital and talent flexibility. | High | SR001, SR002, SR003, SR004, SR005, SR009, SR025 |
| CR041 | Data-rights and privacy risk is partially mitigated, not eliminated, because the strongest public controls sit in partner environments rather than in an auditable insitro-wide security disclosure set. | High | SR010, SR020, SR021, SR022, SR023, SR024 |
| CR042 | IP and legal risk is medium: patents exist, but freedom-to-operate, contract terms, and litigation visibility remain thin. | Medium | SR011, SR019, SR025 |
| CR043 | People and execution risk is medium-high after the layoffs, partially offset by visible investment in leadership and governance. | High | SR007, SR009, SR025 |
| CR044 | The overall risk rating should remain high until clinic entry, validation documentation, and concentration transparency improve materially. | High | SR008, SR009, SR019, SR025, SR026, SR027 |
| CR045 | For US-facing programs, the documentation burden is also legal rather than merely best-practice: IND workflow expectations sit alongside formal rules such as 21 CFR Part 312 and 21 CFR Part 11. | High | SR012, SR031, SR032 |
| CV001 | Public evidence supports a research-more, price-sensitive recommendation rather than a clean buy or pass today. | High | SV001, SV002, SV011, SV017, SV019, SV020, SV024, SV025, SV026, SV027, SV029 |
| CV002 | insitro has unusually strong positive inputs for a private techbio: about $800 million of capital and roughly $150 million of cumulative partnership revenue are both cited publicly. | High | SV001, SV002, SV003, SV004, SV005, SV006, SV007, SV009 |
| CV003 | The public record still lacks the inputs needed for hard valuation underwriting: current share price, cap table, preference stack, audited current cash, and reliable annual revenue. | High | SV001, SV011, SV013, SV014, SV015, SV016, SV017 |
| CV004 | Because entry price and term quality are not public, scenario ranges and public comparables are more honest than a precise IRR or target-return claim. | High | SV017, SV019, SV020, SV024, SV025, SV026, SV027, SV029 |
| CV005 | The last clearly disclosed financing benchmark is insitro’s $400 million Series C in 2021. | Medium | SV002 |
| CV006 | Forge estimates the 2021 Series C post-money valuation at about $2.57 billion and lists a $18.29 price per share for that round. | Medium | SV017 |
| CV007 | Low-confidence public trackers cluster insitro around roughly $2.2 billion to $2.4 billion, but they disagree on revenue, funding, and timing details. | Medium | SV013, SV014, SV017 |
| CV008 | That low-$2 billions tracker cluster suggests public secondary references still anchor near the old round rather than proving a later step-up. | Medium | SV013, SV014, SV017 |
| CV009 | No reviewed public source disclosed a later clearly priced financing round, current preference stack, or current liquidation waterfall after the 2021 Series C. | High | SV002, SV013, SV014, SV017 |
| CV010 | Joe Hand’s 2026 announcement and BioPharma Dive’s 2025 layoff coverage show management emphasizing clinic readiness and runway into 2027 rather than near-term IPO readiness. | High | SV001, SV011 |
| CV011 | BMS is the strongest value-supporting proof asset because the relationship shows upfront cash, later milestones, extension funding, and further target nominations. | High | SV003, SV004, SV005, SV006 |
| CV012 | Lilly adds real upside optionality, but public near-term economics are less visible than the BMS relationship. | High | SV007, SV008 |
| CV013 | Gilead is useful as historical proof of willingness to pay, but it is not a strong current pillar for valuation support. | Medium | SV009 |
| CV014 | CTRO-1013 remains in IND-enabling and first-in-human preparation, so internal-asset value should still be heavily risk-adjusted. | Medium | SV010 |
| CV015 | FDA and EMA guidance plus insitro’s still-preclinical internal asset state argue against paying frontier-AI style multiples today. | High | SV010, SV034, SV035, SV036 |
| CV016 | The most relevant public comp cluster spans roughly $0.5 billion to $2.46 billion across Eikon, Absci, Schrödinger, Recursion, and Relay. | Medium | SV024, SV025, SV026, SV027, SV029, SV031, SV032, SV033 |
| CV017 | Eikon’s public reset to about $539 million and its 44.6% decline since its 2026 IPO show how quickly public markets can discount precommercial therapeutics stories. | Medium | SV029, SV030 |
| CV018 | Recursion at about $1.73 billion despite broader public disclosure, multiple programs, and a large capital base shows that scale and financing alone do not command a frontier premium. | High | SV021, SV024, SV031 |
| CV019 | Schrödinger at about $0.95 billion despite real software and services revenue shows that even the most monetized AI-drug-discovery comp is not valued like hypergrowth SaaS. | High | SV022, SV025, SV033 |
| CV020 | Relay at about $2.46 billion shows public markets can support a higher mark when clinical programs are clearer, but that upper end is still only low single-digit billions. | High | SV023, SV026, SV032 |
| CV021 | Absci at about $0.90 billion shows AI-biology platforms can remain sub-$1 billion in public markets even after multiple years of operating history. | Medium | SV027, SV028 |
| CV022 | A base public-evidence valuation range of about $1.5 billion to $2.5 billion is defensible if partner durability and clinic timing remain intact but no human data or cleaner term disclosure emerges. | High | SV006, SV007, SV011, SV012, SV016, SV018, SV019, SV020, SV021 |
| CV023 | A bear range of about $0.8 billion to $1.3 billion is appropriate if clinic timing slips, partner validation stalls, or financing terms prove investor-unfriendly. | High | SV009, SV010, SV014, SV015, SV017, SV021 |
| CV024 | A bull range of about $3.0 billion to $4.5 billion requires actual clinic entry, further BMS or Lilly conversion, and private-market willingness to look through current public comp marks. | High | SV011, SV012, SV014, SV017, SV018, SV020 |
| CV025 | At or below roughly $2.0 billion effective post-money with clean terms, the asymmetry becomes interesting enough to pursue diligence aggressively. | High | SV007, SV022, SV023, SV024 |
| CV026 | Around the last-known ~$2.57 billion reference, upside looks modest unless private diligence reveals evidence materially stronger than the public record. | High | SV006, SV022, SV024 |
| CV027 | Above roughly $3 billion, the current public record does not support underwriting the round. | High | SV017, SV018, SV019, SV020, SV024 |
| CV028 | The correct public-evidence stance is therefore research-more / track rather than buy or pass outright. | High | SV001, SV004, SV025, SV026, SV027 |
| CV029 | Recommendation confidence should be medium: strong enough to set price guardrails, not strong enough to bless an undisclosed premium valuation. | High | SV002, SV003, SV007, SV028 |
| CV030 | Overall risk rating should remain high, because valuation uncertainty compounds regulatory, translation, concentration, and financing risk rather than offsetting them. | High | SV010, SV011, SV014, SV015, SV034, SV035, SV036 |
| CV031 | insitro is not IPO-ready on public evidence because no current price history, audited public financials, or clinic-stage product proof are visible. | High | SV003, SV010, SV014, SV030 |
| CV032 | The most plausible nearer-term value-realization path is further partner expansion or strategic interest rather than a clean near-term public-market underwriting case. | High | SV011, SV012, SV031 |
| CV033 | The bull case requires CTRO-1013 to enter clinic on schedule, continued BMS or Lilly conversion, no further workforce reset, and clean terms. | High | SV010, SV011, SV012, SV018 |
| CV034 | The base case assumes partner economics hold, clinic timing remains directional but unproven, and price lands around the low-$2 billions without harsh preferences. | High | SV007, SV010, SV022 |
| CV035 | The bear case assumes no clinic proof by end-2027, partner plateau, or term-sheet features that reveal heavy dilution or preference overhang. | High | SV009, SV010, SV018, SV030 |
| CV036 | Thesis-break triggers include another material layoff, no IND or first-in-human progress by end-2027, inability to produce AI validation or security materials, and visible partner non-renewal. | High | SV010, SV018, SV034, SV035, SV036, SV037 |
| CV037 | The top diligence blockers are current cash and burn, current cap table and preferences, AI validation documentation, partner contract terms, and the IND path. | High | SV003, SV009, SV034, SV035, SV036, SV037 |
| CV038 | Tracker revenue estimates ranging from roughly $7.5 million to $69 million are too noisy to support a revenue-multiple valuation. | Medium | SV013, SV015 |
| CV039 | Because annual revenue is unreliable and internal asset value is preclinical, valuation should anchor to proof state and public comp cluster rather than SaaS-like multiples. | High | SV014, SV016, SV019, SV020, SV038 |
| CV040 | KPMG and McKinsey both describe AI-biopharma adoption as proof-sensitive, supporting only a disciplined premium until insitro converts promise into durable external or clinical outcomes. | High | SV019, SV020 |
| CV041 | Public peer filings show severe cash consumption even after public listing: Recursion lost $328 million, Schrödinger used about $137 million of operating cash, and Relay lost $342 million in 2023. | High | SV021, SV022, SV023 |
| CV042 | Public comp one-year moves are volatile: Recursion is down about 24.9%, Schrödinger about 51.0%, Eikon about 44.6%, while Relay and Absci moved sharply in the opposite direction. | Medium | SV028, SV030, SV031, SV032, SV033 |
| CV043 | If private diligence shows price below the public-comp-plus-premium range and terms are clean, the recommendation could upgrade to selective pursue. | High | SV025, SV029, SV037 |
| CV044 | If private diligence shows price above $3 billion or harsh preferences without stronger proof, the recommendation should downgrade to pass. | High | SV027, SV035, SV037 |
| CV045 | The central valuation stance is not that insitro lacks quality; it is that price discipline has to dominate because public proof still lags private narrative potential. | High | SV002, SV028, SV029, SV030 |
| CV046 | Amy Abernethy’s board addition is a meaningful governance mitigant, but governance strength alone cannot carry valuation without price, term, and clinic proof. | High | SV010, SV011, SV038 |